Self-Forcing-Plus focuses on step distillation and CFG distillation for bidirectional models. Building upon Self-Forcing, we support 4-step T2V-14B model training and higher-quality 4-step I2V-14B model training.
- (2025/09) Wan2.2-MoE distillation is now supported (`wan22`)!
| Model Type | Model Link |
|---|---|
| T2V-14B | Huggingface |
| I2V-14B-480P | Huggingface |
Create a conda environment and install dependencies:
```bash
conda create -n self_forcing python=3.10 -y
conda activate self_forcing
pip install -r requirements.txt
pip install flash-attn --no-build-isolation
python setup.py develop
```
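Optionally, verify the environment before training. This quick check assumes a CUDA-capable GPU is visible and that the flash-attn build succeeded:

```python
import torch
import flash_attn  # fails here if the flash-attn build did not succeed

print("torch", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("flash-attn", flash_attn.__version__)
```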
Download the Wan2.1 base checkpoints:

```bash
huggingface-cli download Wan-AI/Wan2.1-T2V-14B --local-dir wan_models/Wan2.1-T2V-14B
huggingface-cli download Wan-AI/Wan2.1-I2V-14B-480P --local-dir wan_models/Wan2.1-I2V-14B-480P
```
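Alternatively, the checkpoints can be fetched programmatically with the `huggingface_hub` library; the sketch below is equivalent to the two CLI commands above:

```python
from huggingface_hub import snapshot_download

# Download both Wan2.1 base checkpoints into local folders,
# mirroring the huggingface-cli commands above.
for repo_id, local_dir in [
    ("Wan-AI/Wan2.1-T2V-14B", "wan_models/Wan2.1-T2V-14B"),
    ("Wan-AI/Wan2.1-I2V-14B-480P", "wan_models/Wan2.1-I2V-14B-480P"),
]:
    snapshot_download(repo_id=repo_id, local_dir=local_dir)
```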
DMD training for bidirectional models does not require ODE initialization.
We build the dataset in the following way, where each `.txt` file contains a single prompt (a small builder sketch follows the tree):
```
data_folder
|__1.txt
|__2.txt
...
|__xxx.txt
```
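For reference, here is a minimal sketch that writes a prompt list into this layout (the example prompts and `data_folder` path are placeholders):

```python
from pathlib import Path

# Hypothetical example prompts; replace with your own.
prompts = [
    "A cat playing piano in a jazz bar",
    "Aerial view of a coastline at sunset",
]

data_folder = Path("data_folder")
data_folder.mkdir(parents=True, exist_ok=True)

# One prompt per numbered .txt file, matching the layout above.
for i, prompt in enumerate(prompts, start=1):
    (data_folder / f"{i}.txt").write_text(prompt + "\n")
```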
Then launch training (this assumes `MASTER_ADDR` and `MASTER_PORT` are set on every node):

```bash
torchrun --nnodes=8 --nproc_per_node=8 \
--rdzv_id=5235 \
--rdzv_backend=c10d \
--rdzv_endpoint=${MASTER_ADDR}:${MASTER_PORT} \
train.py \
--config_path configs/self_forcing_14b_dmd.yaml \
--logdir logs/self_forcing_14b_dmd \
--no_visualize \
--disable-wandb
```
Our training run uses 3000 iterations and completes in under 3 days using 64 H100 GPUs.
For I2V-14B training, we construct the dataset as follows:

- Generate a series of videos using the original Wan2.1 model.
- Generate the VAE latents:

```bash
python scripts/compute_vae_latent.py \
--input_video_folder {video_folder} \
--output_latent_folder {latent_folder} \
--model_name Wan2.1-T2V-14B \
--prompt_folder {prompt_folder}
```

- Separate the first frame of the videos and create an LMDB dataset (a sanity-check sketch follows the command):

```bash
python scripts/create_lmdb_14b_shards.py \
--data_path {latent_folder} \
--prompt_path {prompt_folder} \
--lmdb_path {lmdb_folder}
```
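Before launching training, it can help to verify that the LMDB was written. Below is a minimal sketch using the `lmdb` Python package; it only counts entries and prints a few keys, since the exact key/value schema is defined by `create_lmdb_14b_shards.py`:

```python
import lmdb

# Open the dataset read-only; lock=False avoids creating lock files
# on shared filesystems.
env = lmdb.open("{lmdb_folder}", readonly=True, lock=False)
with env.begin() as txn:
    print("entries:", txn.stat()["entries"])
    # Print the first few keys; values are left undecoded because their
    # format is whatever create_lmdb_14b_shards.py stored.
    for i, (key, _value) in enumerate(txn.cursor()):
        print("key:", key)
        if i >= 4:
            break
env.close()
```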
Then launch I2V training:

```bash
torchrun --nnodes=8 --nproc_per_node=8 \
--rdzv_id=5235 \
--rdzv_backend=c10d \
--rdzv_endpoint=${MASTER_ADDR}:${MASTER_PORT} \
train.py \
--config_path configs/self_forcing_14b_i2v_dmd.yaml \
--logdir logs/self_forcing_14b_i2v_dmd \
--no_visualize \
--disable-wandb
```
Our training run uses 1000 iterations and completes in under 12 hours using 64 H100 GPUs.
This codebase is built on top of the open-source implementations of CausVid and Self-Forcing, as well as the Wan2.1 repo.