This repository contains the implementation code for the paper:

**IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation**
```bash
# Take the book dataset as an example
# Download the dataset
wget https://datarepo.eng.ucsd.edu/mcauley_group/data/amazon_v2/categoryFiles/Books.json.gz
wget https://datarepo.eng.ucsd.edu/mcauley_group/data/amazon_v2/metaFiles2/meta_Books.json.gz
# Unzip
gunzip Books.json.gz
gunzip meta_Books.json.gz
```

Set up the environment:

```bash
conda create -n IGD python=3.10
conda activate IGD
pip install -r requirements.txt
```

Preprocess the data and extract item-frequency information:

```bash
bash compute_item_freq.sh
```
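The exact preprocessing is defined in `compute_item_freq.sh`. As a rough, hypothetical illustration of the item-frequency step, the sketch below counts how often each item (`asin`) appears in the downloaded review file; the output path `item_freq.json` is a placeholder, not necessarily the repository's actual artifact.

```python
import json
from collections import Counter

# Count interactions per item in the Amazon-v2 review dump (one JSON object per line).
freq = Counter()
with open("Books.json") as f:
    for line in f:
        freq[json.loads(line)["asin"]] += 1  # "asin" is the item ID field in the dump

with open("item_freq.json", "w") as out:  # placeholder output path
    json.dump(dict(freq), out)
```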
Run the tuning script:

```bash
bash ig_monitor.sh
```

- `beta` adjusts the weight of zero-IG tokens.
- To implement the baseline, set `beta=1.0`.
- For our method, `beta=0.1` works well in general. You can grid search over `[0.08, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6]` (a weighted-loss sketch follows this list).
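How `beta` enters the training objective is implemented inside the repository's scripts; the sketch below only illustrates the idea under stated assumptions: per-token cross-entropy is reweighted so that zero-IG tokens are scaled by `beta`. The `token_ig` tensor and the function name are illustrative, not the repo's API.

```python
import torch
import torch.nn.functional as F

def igd_weighted_loss(logits, labels, token_ig, beta=0.1, ignore_index=-100):
    """Cross-entropy in which tokens with zero information gain are scaled by beta."""
    per_token = F.cross_entropy(
        logits.view(-1, logits.size(-1)),
        labels.view(-1),
        ignore_index=ignore_index,
        reduction="none",
    )
    ig = token_ig.view(-1).float()  # assumed precomputed per-token IG values
    # beta = 1.0 recovers the unweighted baseline; beta < 1 de-emphasizes
    # non-decisive (zero-IG) tokens.
    weights = torch.where(ig > 0, torch.ones_like(ig), torch.full_like(ig, beta))
    mask = (labels.view(-1) != ignore_index).float()
    return (weights * per_token * mask).sum() / mask.sum().clamp(min=1.0)
```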
Run evaluation:

```bash
bash evaluate.sh
```

- Adjust the `alpha` parameter in the `evaluate.sh` script: `alpha=0.0` is the baseline.
- In the inference script, you can sweep `alpha` over `(0.0 0.1 0.2 0.3 0.4)`; `alpha=0.2` generally yields good results (a decoding sketch follows this list).
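The inference script applies `alpha` internally. Conceptually, an IG-aware adjustment at decoding time can be written as a Hugging Face `LogitsProcessor`, as in this sketch; the additive form and the precomputed `ig_table` are assumptions for illustration.

```python
import torch
from transformers import LogitsProcessor

class IGDLogitsProcessor(LogitsProcessor):
    """Shifts next-token scores by alpha * IG(token); alpha=0.0 leaves scores unchanged (baseline)."""

    def __init__(self, ig_table: torch.Tensor, alpha: float = 0.2):
        self.ig_table = ig_table  # assumed per-vocabulary-token IG, shape (vocab_size,)
        self.alpha = alpha

    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor):
        # Add the IG bonus to every candidate token's score at each decoding step.
        return scores + self.alpha * self.ig_table.to(scores.device)
```

Such a processor would be passed to generation via `model.generate(..., logits_processor=LogitsProcessorList([...]))`.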
Baseline decoding settings:

- For the D3 method, set `length_penalty` to `0.0`.
- For the BIGRec method, set `length_penalty` to `1.0` in the script (a `generate()` sketch follows this list).
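Both settings map onto the standard Hugging Face `length_penalty` argument, which only takes effect with beam search. A minimal sketch, where the checkpoint name and prompt are placeholders rather than the repository's actual configuration:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

name = "meta-llama/Llama-2-7b-hf"  # placeholder; use the checkpoint from the scripts
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForCausalLM.from_pretrained(name)

inputs = tok("The user has enjoyed the following books: ...", return_tensors="pt")
outputs = model.generate(
    **inputs,
    num_beams=10,         # length_penalty only takes effect with beam search
    length_penalty=0.0,   # D3 setting; use length_penalty=1.0 for BIGRec
    max_new_tokens=64,
)
print(tok.decode(outputs[0], skip_special_tokens=True))
```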
CFT baseline:

- Uses `cft_monitor.py`. According to the original paper, search over `beta = 0.09, 0.16, 0.29, 0.38, 0.5, 0.66, 0.9, 0.96` and `alpha = 0.01, 0.02, 0.025, 0.05, 0.1, 0.2, 0.3` (a grid-search sketch follows this list).
- For the variant that is part of the CFT method, set `alpha=0` and tune only `beta`.
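A simple driver for that grid search could look like the following; the `--beta`/`--alpha` flags of `cft_monitor.py` are assumptions here, so check the script's actual interface before running it.

```python
import itertools
import subprocess

# Search ranges quoted from the CFT paper above.
betas = [0.09, 0.16, 0.29, 0.38, 0.5, 0.66, 0.9, 0.96]
alphas = [0.01, 0.02, 0.025, 0.05, 0.1, 0.2, 0.3]

for beta, alpha in itertools.product(betas, alphas):
    # Assumed CLI; adapt the flags to cft_monitor.py's real argument names.
    subprocess.run(
        ["python", "cft_monitor.py", f"--beta={beta}", f"--alpha={alpha}"],
        check=True,
    )
```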
In our experiments, we trained our methods on an H100 96G GPU and tested on an A5000 GPU. Different hardware configurations may cause minor differences in results.