This repository is a fork of the Tevatron project and includes several custom changes for our experiments.
- Flash Attention disabled
- MultiModalDenseModel disabled
- Added
lr_scheduler_type
argument
This project remains under the Apache License 2.0, identical to the original Tevatron. See the LICENSE file for details.