textVQG

Implementation of the paper Look, Read and Ask: Learning to Ask Questions by Reading Text in Images (ICDAR-2021) (Resolving some issues with respect to evaluation code. [Issue with NLG_eval and evaluate_textvqg.py code also, the model and entire code structure was previously written in python 2.7 environment and now it is being changed to python 3.8. Will update the complete code sooner.])

paper

Requirements

Use pytorch 1.7.0 CUDA 10.2
Other requirements from 'requirements.txt'

To setup environment

# create new env 
$ virtualenv -p python2.7 textvqg

# activate 
$ source textvqg/bin/activate

# install pytorch, torchvision
$ conda install pytorch==1.7.0 torchvision==0.8.0 cudatoolkit=10.2 -c pytorch

# install other dependencies
$ pip install -r requirements.txt

Model Training

# Create the vocabulary files required for textVQG.
python utils/vocab.py

# Create the hdf5 dataset.
python utils/store_dataset.py

# Train the model.
python train_textvqg.py

# Evaluate the model.
python evaluate_textvqg.py

Results

The following are some results of the proposed method:

Cite

If you find this code/paper useful for your research, please consider citing.

@InProceedings{10.1007/978-3-030-86549-8_22,
author="Jahagirdar, Soumya
and Gangisetty, Shankar
and Mishra, Anand",
editor="Llad{\'o}s, Josep
and Lopresti, Daniel
and Uchida, Seiichi",
title="Look, Read and Ask: Learning to Ask Questions by Reading Text in Images",
booktitle="Document Analysis and Recognition -- ICDAR 2021",
year="2021",
publisher="Springer International Publishing",
address="Cham",
pages="335--349"
}

Acknowledgements

This repo uses few utility function provided by https://github.com/ranjaykrishna/iq.

Contact

For any clarification, comment, or suggestion please create an issue or contact Soumya Shamarao Jahagirdar.

Name		Name	Last commit message	Last commit date
Latest commit History 39 Commits
models		models
utils		utils
README.md		README.md
data_creation.ipynb		data_creation.ipynb
evaluate_textvqg.py		evaluate_textvqg.py
requirements.txt		requirements.txt
train_textvqg.py		train_textvqg.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

textVQG

Requirements

Model Training

Results

Cite

Acknowledgements

Contact

About

Uh oh!

Releases

Packages

Languages

Uh oh!

Uh oh!

soumyasj/textVQG

Folders and files

Latest commit

History

Repository files navigation

textVQG

Requirements

Model Training

Results

Cite

Acknowledgements

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages