Fine-tuning Video-LaVIT

This is the fork of the official Video-LaVIT repository for the fine-tuning it on https://huggingface.co/datasets/lmms-lab/LLaVA-Video-178K dataset.

We updated the weights of the model images and video tokenizers (), sinse the rest of the model remain unchanged.

We use PyTorch Lightning framework for fine-tuning and processed conversations between human and assistant using the Chat template from VideoLLaMA2 with minor changes

News and Updates

2025.08.01 The notebook was updated (bug about visual token in dataset, unfrozen weights)

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
LaVIT		LaVIT
VideoLaVIT		VideoLaVIT
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
USE_POLICY.md		USE_POLICY.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

Fine-tuning Video-LaVIT

News and Updates

About

Uh oh!

Releases

Packages

Languages

Uh oh!

License

Uh oh!

sayankotor/LaVIT

Folders and files

Latest commit

History

Repository files navigation

Fine-tuning Video-LaVIT

News and Updates

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages