This project replicates the Visual Question Answering (VQA) experiments of the OFA (One-For-All) framework, evaluating the performance of the OFA Base model. The goal is to reproduce the results reported in the OFA paper on the VQAv2 dataset.
This repository contains:

- `main.ipynb`: The primary notebook, which loads the OFA model, preprocesses the VQAv2 dataset, and evaluates the VQA task.
- `requirements.txt`: The dependencies required to run the project.
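For orientation, below is a minimal sketch of the kind of question-answering inference `main.ipynb` performs. It assumes the Hugging Face `transformers` fork distributed by the OFA authors (which provides `OFATokenizer` and `OFAModel`) and a local OFA Base checkpoint; `ckpt_dir`, `path_to_image`, the 384 resolution, and the question text are placeholders and assumptions, not values taken from this project.

```python
from PIL import Image
from torchvision import transforms
from transformers import OFATokenizer, OFAModel  # provided by the OFA-Sys transformers fork

ckpt_dir = "<path-to-ofa-base-checkpoint>"  # placeholder
path_to_image = "<path-to-an-image>"        # placeholder

# Image preprocessing used in the OFA authors' examples: resize to a fixed
# resolution and normalize. 384 is an assumed resolution for OFA Base.
mean, std = [0.5, 0.5, 0.5], [0.5, 0.5, 0.5]
resolution = 384
patch_resize_transform = transforms.Compose([
    lambda image: image.convert("RGB"),
    transforms.Resize((resolution, resolution), interpolation=Image.BICUBIC),
    transforms.ToTensor(),
    transforms.Normalize(mean=mean, std=std),
])

tokenizer = OFATokenizer.from_pretrained(ckpt_dir)
model = OFAModel.from_pretrained(ckpt_dir, use_cache=False)

question = " what color is the car in the image?"  # OFA prompts start with a space
inputs = tokenizer([question], return_tensors="pt").input_ids
patch_img = patch_resize_transform(Image.open(path_to_image)).unsqueeze(0)

# Free-form answer generation with beam search.
gen = model.generate(inputs, patch_images=patch_img, num_beams=5, no_repeat_ngram_size=3)
print(tokenizer.batch_decode(gen, skip_special_tokens=True))
```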
The VQA task in this project is evaluated on the VQAv2 dataset, as used in the OFA paper. The dataset includes:
- Images
- Questions
- Answers
For more details, see the VQAv2 dataset page: https://visualqa.org/.
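Each VQAv2 question is paired with ten human answers, and the standard VQA accuracy metric gives partial credit when fewer than three annotators agree with a prediction. Below is a minimal sketch of that metric in its simplified form (the official implementation additionally averages over leave-one-out subsets of the ten annotators and normalizes answer strings):

```python
from typing import List

def vqa_accuracy(predicted: str, human_answers: List[str]) -> float:
    """Soft VQA accuracy: full credit if at least 3 of the 10 annotators
    gave the predicted answer, partial credit otherwise."""
    matches = sum(1 for ans in human_answers if ans == predicted)
    return min(matches / 3.0, 1.0)

# Example: 2 of 10 annotators answered "blue" -> accuracy of 2/3.
print(vqa_accuracy("blue", ["blue", "blue"] + ["green"] * 8))
```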
- Clone the Repository:

  ```bash
  git clone <repository-url>
  cd <repository-directory>
  ```
- Install Dependencies: Make sure you have Python 3.8 or later installed, then run:

  ```bash
  pip install -r requirements.txt
  ```
- Run the Evaluation: Open `main.ipynb` in Jupyter and run all cells to preprocess the dataset and evaluate the OFA Base model on the VQA task (the expected prediction format is sketched below):

  ```bash
  jupyter notebook main.ipynb
  ```
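The official VQA evaluation server expects predictions as a JSON list of `{"question_id", "answer"}` records. The sketch below shows writing such a file; `predict_answer` is a hypothetical stand-in for the generate-and-decode step in `main.ipynb`, and the output filename is arbitrary:

```python
import json
from typing import Iterable

def predict_answer(example: dict) -> str:
    # Hypothetical stand-in: run the OFA generate()/decode step from
    # main.ipynb on one preprocessed example and return the answer string.
    raise NotImplementedError

def write_vqa_results(examples: Iterable[dict], out_path: str = "vqa_base_results.json") -> None:
    """Dump predictions in the submission format used for VQAv2 evaluation."""
    results = [{"question_id": ex["question_id"], "answer": predict_answer(ex)}
               for ex in examples]
    with open(out_path, "w") as f:
        json.dump(results, f)
```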
## Notes

Ensure that the `transformers` and `datasets` libraries are properly installed. The preprocessing pipeline handles image resizing, tokenization, and loading ground-truth annotations.

## References

- OFA Paper: [OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework](https://arxiv.org/abs/2202.03052)
- VQAv2 Dataset: https://visualqa.org/