TFSGC

The repository for our paper: Transforming Visual Scene Graphs to Image Captions

The 61st Annual Meeting of the Association for Computational Linguistics (ACL2023) main conference

Installation

git clone https://github.com/GaryJiajia/TSG.git
cd TSG
python -m pip install -e .

More detailed environment settings can be found at: https://github.com/ruotianluo/ImageCaptioning.pytorch

Our Main Environment

Python 3.7
PyTorch 1.8.2
TorchVision 0.8.0
numpy
tqdm
gensim
matplotlib
yacs
lmdbdict

## Please check that the torch/torchvision versions in requirements.txt match your own setup
pip install -r requirements.txt
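A quick way to compare an installed setup against the versions listed above is sketched below. It is a minimal, hypothetical helper (the names `EXPECTED`, `major_minor`, and `check_versions` are not part of this repository), and it assumes that matching on major.minor is a sufficient check.

```python
import importlib

# Versions listed under "Our Main Environment"; adjust if your setup differs.
EXPECTED = {"torch": "1.8.2", "torchvision": "0.8.0"}


def major_minor(version):
    """Return (major, minor) as ints, ignoring local tags such as '+cu111'."""
    core = version.split("+")[0]
    parts = core.split(".")
    return int(parts[0]), int(parts[1])


def check_versions(expected=EXPECTED):
    """Map each package name to 'ok', a mismatch note, or 'not installed'."""
    report = {}
    for name, wanted in expected.items():
        try:
            module = importlib.import_module(name)
        except ImportError:
            report[name] = "not installed"
            continue
        found = getattr(module, "__version__", None)
        if found is None:
            report[name] = "version unknown"
        elif major_minor(found) == major_minor(wanted):
            report[name] = "ok"
        else:
            report[name] = "mismatch (found %s)" % found
    return report
```

Calling `check_versions()` returns a dictionary with one status string per package, which can be printed before starting a long training run.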

cider
coco-caption (remember to follow the initialization steps in coco-caption/README.md)

Data preparation

1. Download:

coco_pred_sg, and unzip it in data/.

2. Feature preparation:

The MSCOCO 2014 features were extracted by previous works: coco_bu_feats and coco_swin_feats. (You can also extract the MSCOCO 2014 image features yourself using a ResNet model or a Swin Transformer model.) Download them and unzip them in data/.

|-- TSG
|    |
|    |- cider
|    |- coco-caption
|    |- data
|        |
|        |- coco_pred_sg
|        |- coco_swin_feats
|        |- cocobu.json
|        |- cocobu_label.h5
|        |- ...
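Before training, it can help to confirm that the layout in the tree above is in place. The following is a minimal sketch: the helper name `missing_entries` is hypothetical, and only the entries that the tree names explicitly are checked (the `...` entry is left out).

```python
from pathlib import Path

# Entries named in the directory tree above, relative to the TSG root.
EXPECTED_ENTRIES = [
    "cider",
    "coco-caption",
    "data/coco_pred_sg",
    "data/coco_swin_feats",
    "data/cocobu.json",
    "data/cocobu_label.h5",
]


def missing_entries(repo_root, expected=EXPECTED_ENTRIES):
    """Return the expected paths that do not exist under repo_root."""
    root = Path(repo_root)
    return [rel for rel in expected if not (root / rel).exists()]
```

Running `missing_entries(".")` from the TSG directory returns an empty list when every listed file and folder is present.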

If you download all the swin feature files, you can use the following command to decompress them on a Linux system:

cat [compressed_file_name].* | tar -xzf -
## For example (don't forget the . after gz):
cat feats.tar.gz.* | tar -xzf -
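On systems without `cat` and `tar`, the same step can be approximated in Python. This is a sketch under assumptions, not part of the repository: `ConcatReader` and `extract_split_archive` are hypothetical names, and the part suffixes are assumed to sort lexicographically in the order in which they were split.

```python
import glob
import tarfile


class ConcatReader:
    """File-like object that reads several files back to back, like `cat`."""

    def __init__(self, paths):
        self._paths = list(paths)
        self._current = None

    def read(self, size=-1):
        chunks = []
        remaining = size
        while size < 0 or remaining > 0:
            if self._current is None:
                if not self._paths:
                    break  # every part has been consumed
                self._current = open(self._paths.pop(0), "rb")
            data = self._current.read(remaining if size >= 0 else -1)
            if not data:
                # Current part is exhausted; move on to the next one.
                self._current.close()
                self._current = None
                continue
            chunks.append(data)
            if size >= 0:
                remaining -= len(data)
        return b"".join(chunks)

    def close(self):
        if self._current is not None:
            self._current.close()
            self._current = None


def extract_split_archive(prefix, dest="."):
    """Extract the parts matching `prefix.*` (e.g. feats.tar.gz.*) into dest."""
    parts = sorted(glob.glob(glob.escape(prefix) + ".*"))
    if not parts:
        raise FileNotFoundError("no parts matching %s.*" % prefix)
    reader = ConcatReader(parts)
    try:
        # "r|gz" reads the gzip-compressed tar as a forward-only stream.
        with tarfile.open(fileobj=reader, mode="r|gz") as tar:
            tar.extractall(dest)
    finally:
        reader.close()
    return parts
```

For the example above, `extract_split_archive("feats.tar.gz", "data")` would read every `feats.tar.gz.*` part in order and unpack the result into `data/`.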

Training

The core code is in models/TSGMModel3.py. You can use the configs in tsg_configs/ to start training:

python train_tsg.py --cfg tsg_configs/tsgmt1.yml

Args:

checkpoint_path: The storage location for the checkpoints.
input_att_dir: The path to cocobu_att or coco_swin_feats. When you use coco_swin_feats for input_att_dir, you also need to add att_feat_size to the config.
batch_size: A batch size of 14 fits on a single RTX 3090 GPU (24GB); a batch size of 20 requires around 33GB. Adjust it to fit your device, and tune structure_after and max_epochs to achieve better training results.

Evaluation

Evaluate on Karpathy's test split

python eval_tsg.py --dump_images 0 --num_images 5000 --model tsgmt1/modeltsgmt10011.pth --infos_path tsgmt1/infos_tsgmt10011.pkl  --language_eval 1 --beam_size 5

Evaluate on COCO test set

python eval_tsg.py --input_json data/cocotest.json --input_fc_dir data/cocotest_bu_fc --input_att_dir data/cocotest_bu_att --input_label_h5 none --num_images -1 --model model.pth --infos_path infos.pkl --language_eval 0 --beam_size 5

You can download the preprocessed files cocotest.json, cocotest_bu_att, and cocotest_bu_fc from the link provided in ruotianluo/ImageCaptioning.pytorch.
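The two evaluation commands above share most of their flags and differ mainly in the input paths, `--num_images`, and `--language_eval`. The sketch below shows one way to assemble them from a script. The helpers `build_eval_command` and `format_command` are hypothetical, and only flags that appear in the commands above are used.

```python
import shlex


def build_eval_command(model, infos_path, num_images, language_eval,
                       beam_size=5, extra=None):
    """Assemble the eval_tsg.py argument list from the flags shown above."""
    cmd = [
        "python", "eval_tsg.py",
        "--num_images", str(num_images),
        "--model", model,
        "--infos_path", infos_path,
        "--language_eval", str(language_eval),
        "--beam_size", str(beam_size),
    ]
    # Extra flags (for example the COCO test set input paths) go at the end.
    for flag, value in (extra or {}).items():
        cmd += ["--" + flag, str(value)]
    return cmd


def format_command(cmd):
    """Render an argument list as a shell-safe string."""
    return " ".join(shlex.quote(arg) for arg in cmd)


# Karpathy test split: 5000 images, language metrics enabled.
karpathy = build_eval_command(
    "tsgmt1/modeltsgmt10011.pth", "tsgmt1/infos_tsgmt10011.pkl",
    num_images=5000, language_eval=1, extra={"dump_images": 0},
)

# COCO test set: all images, no language metrics, explicit input paths.
coco_test = build_eval_command(
    "model.pth", "infos.pkl", num_images=-1, language_eval=0,
    extra={
        "input_json": "data/cocotest.json",
        "input_fc_dir": "data/cocotest_bu_fc",
        "input_att_dir": "data/cocotest_bu_att",
        "input_label_h5": "none",
    },
)

print(format_command(karpathy))
print(format_command(coco_test))
```

Either list can be passed to `subprocess.run` from the TSG directory; printing the commands first makes it easy to confirm the paths before a long evaluation.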

Citing

If you find this repository useful, please consider citing:

@inproceedings{yang-etal-2023-transforming,
    title = "Transforming Visual Scene Graphs to Image Captions",
    author = "Yang, Xu and Peng, Jiawei and Wang, Zihua and Xu, Haiyang and Ye, Qinghao and Li, Chenliang and Huang, Songfang and Huang, Fei and Li, Zhangzikang and Zhang, Yu",
    booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    year = "2023",
    publisher = "Association for Computational Linguistics",
    doi = "10.18653/v1/2023.acl-long.694",
    pages = "12427--12440",
}
