This is a demo provided for the paper "Backward Lens: Projecting Language Model Gradients into the Vocabulary Space" (EMNLP 2024 main conference, Best Paper Award).
Make sure you have Python 3.10 installed on your machine and that all files are in the same folder.
In your Python environment, install the required dependencies using pip and the requirements.txt file:

```
pip install -r requirements.txt
```

Then run backward_lens_demo.ipynb.
This script demonstrates how to obtain the backward lens projection from GPT-2 models.
Additionally, the final blocks of the script show how to extract token rankings from VJPs and how to observe how the VJPs' norms differ between layers.
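The following is a minimal sketch of these steps (hypothetical variable and layer choices, not the exact demo code): it captures the gradients ("VJPs") at each GPT-2 MLP output with backward hooks, prints their norms per layer, and projects one VJP into the vocabulary space through the unembedding matrix.

```python
import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tok = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

# Save the gradient w.r.t. each MLP block's output (one VJP per layer).
vjps = {}
def make_hook(idx):
    def hook(module, grad_input, grad_output):
        vjps[idx] = grad_output[0].detach()  # (batch, seq_len, hidden)
    return hook
for i, block in enumerate(model.transformer.h):
    block.mlp.register_full_backward_hook(make_hook(i))

# One forward/backward pass with the standard next-token LM loss.
inputs = tok("The capital of France is", return_tensors="pt")
model(**inputs, labels=inputs["input_ids"]).loss.backward()

# Compare VJP norms between layers (at the last token position).
for i in sorted(vjps):
    print(f"layer {i:2d}  ||VJP|| = {vjps[i][0, -1].norm():.4f}")

# Backward lens projection: read a VJP through the vocabulary. In GPT-2 the
# unembedding is lm_head.weight (tied to the input embeddings), so E @ vjp
# yields one score per token; the top-ranked tokens interpret the gradient.
vjp = vjps[6][0, -1]                  # layer 6, last position (arbitrary pick)
scores = model.lm_head.weight @ vjp   # (vocab_size,)
print(tok.convert_ids_to_tokens(torch.topk(scores, 10).indices.tolist()))
```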
Use the submodule memit_for_BackwardLens or directly use the repo here.
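For example (assuming the demo repository was cloned without its submodules), the submodule can be fetched with:

```
git submodule update --init memit_for_BackwardLens
```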
```
@article{katz2024backward,
  title={Backward Lens: Projecting Language Model Gradients into the Vocabulary Space},
  author={Katz, Shahar and Belinkov, Yonatan and Geva, Mor and Wolf, Lior},
  journal={arXiv preprint arXiv:2402.12865},
  year={2024}
}
```