TIGER

This is a PyTorch implementation of the NeurIPS 2023 paper:

Recommender Systems with Generative Retrieval

Shashank Rajput, Nikhil Mehta, Anima Singh, Raghunandan H. Keshavan, Trung Vu, Lukasz Heldt, Lichan Hong, Yi Tay, Vinh Q. Tran, Jonah Samost, Maciej Kula, Ed H. Chi, Maheswaran Sathiamoorthy.

Usage

Data

The experimental datasets should be preprocessed into JSON format. You may refer to this example data for guidance.

Training & Evaluation

1. Train the RQ-VAE Model

python run_gr_id.py

2. Train the T5 Model with Online Tokenization

Once the RQ-VAE model is trained, you can proceed to train the T5 model using online tokenization (i.e., tokenization is performed during training, rather than stored offline):

python run_gr_rec.py
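To make "online tokenization" concrete, the sketch below shows the kind of residual quantization lookup an RQ-VAE performs when an item embedding is turned into a code sequence on the fly. It is a minimal pure-Python illustration, not the code in this repository: the function names (`nearest`, `residual_quantize`), the toy codebooks, and the embedding values are all assumptions made for the example, and a real model would use learned codebooks and batched tensor operations.

```python
from typing import List, Sequence

Vector = Sequence[float]


def nearest(codebook: Sequence[Vector], vec: Vector) -> int:
    """Return the index of the codeword closest to vec (squared L2 distance)."""
    def sq_dist(a: Vector, b: Vector) -> float:
        return sum((x - y) ** 2 for x, y in zip(a, b))

    return min(range(len(codebook)), key=lambda i: sq_dist(codebook[i], vec))


def residual_quantize(embedding: Vector, codebooks: Sequence[Sequence[Vector]]) -> List[int]:
    """Greedy residual quantization: one code per level.

    At each level the nearest codeword to the current residual is chosen,
    then subtracted, so later levels encode what earlier levels missed.
    """
    residual = list(embedding)
    codes: List[int] = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        codes.append(idx)
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return codes


# Hypothetical two-level codebooks with two 2-d codewords each.
codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],
    [[0.0, 0.0], [0.5, -0.5]],
]

# Hypothetical item embedding; with online tokenization this lookup would
# run inside the training loop instead of reading a precomputed ID file.
item_embedding = [1.4, 0.6]
semantic_id = residual_quantize(item_embedding, codebooks)
print(semantic_id)
```

Computing the codes at training time means the recommender always sees IDs that match the current tokenizer, at the cost of repeating the lookup for every batch.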

Note

This project is based on the LETTER repository and is compatible with using LETTER as the tokenizer. Unlike LETTER, which removes duplicate IDs in a post-processing step, this implementation deduplicates during token generation by appending suffix tokens.
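One way suffix-based deduplication can work is sketched below: when several items receive the same code sequence, each gets an extra trailing token that counts collisions, so every final ID is unique. This is an illustration under stated assumptions, not the repository's implementation; the function name `add_suffix_tokens` and the example codes are hypothetical, and the actual suffix scheme used here may differ.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

Code = Tuple[int, ...]


def add_suffix_tokens(codes: List[Code]) -> List[Code]:
    """Append a collision counter to each code so all resulting IDs are unique.

    The first item with a given code gets suffix 0, the next gets 1, and so on.
    """
    seen: Dict[Code, int] = defaultdict(int)
    unique: List[Code] = []
    for code in codes:
        suffix = seen[code]      # how many earlier items shared this code
        seen[code] += 1
        unique.append(code + (suffix,))
    return unique


# Hypothetical quantizer output: three items collide on (3, 7, 1).
item_codes = [(3, 7, 1), (3, 7, 1), (2, 0, 5), (3, 7, 1)]
semantic_ids = add_suffix_tokens(item_codes)
print(semantic_ids)
# [(3, 7, 1, 0), (3, 7, 1, 1), (2, 0, 5, 0), (3, 7, 1, 2)]
```

Because the suffix is assigned as IDs are generated, no separate pass over the finished ID table is needed to resolve collisions.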
