Thanks to visit codestin.com
Credit goes to github.com

Skip to content
forked from jy0205/LaVIT

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

License

sayankotor/LaVIT

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

56 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Fine-tuning Video-LaVIT

This is the fork of the official Video-LaVIT repository for the fine-tuning it on https://huggingface.co/datasets/lmms-lab/LLaVA-Video-178K dataset.

We updated the weights of the model images and video tokenizers (), sinse the rest of the model remain unchanged.

We use PyTorch Lightning framework for fine-tuning and processed conversations between human and assistant using the Chat template from VideoLLaMA2 with minor changes

News and Updates

  • 2025.08.01 The notebook was updated (bug about visual token in dataset, unfrozen weights)

About

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 97.7%
  • Python 2.3%