🚀 T2ITrainer

⚠️ Development Notice: This project is under active development; stability is not guaranteed. Updates are frequent, so check the changelogs regularly.

T2ITrainer is a diffusers-based training script. It aims to provide a simple yet effective implementation for LoRA training.

  • ❗ Mandatory: Update diffusers to latest github version
pip install git+https://github.com/huggingface/diffusers.git -U
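
To confirm the source install took effect, you can check the version from Python (a git install typically reports a .dev suffix; the exact string will vary):

    import diffusers

    # Expect something like "0.x.y.dev0" after installing from GitHub
    print(diffusers.__version__)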

📅 Major Updates

  • 2025-12-20: Node-based frontend UI for configuration with visualization capabilities and flexible dataset configuration (still under development).
  • 2025-12-20: Support for LoRA training of LongCat Image and LongCat Edit (6B MMDiT models using the Flux VAE).

🛡️ Prerequisites

  • PyTorch: torch>=2.3.0+cu121 (CUDA 12.1 supported) PyPI
  • Node.js: node>=14.0.0 (required for the frontend UI) Node.js

💻 Supported Training Configurations

Model Type         | VRAM Requirements               | Status
LongCat Image/Edit | 24GB GPU                        | ✅ Supported
Qwen Edit          | 48GB GPU (bf16)                 | ✅ Supported
Qwen Image         | 24GB GPU (nf4), 48GB GPU (bf16) | ✅ Supported
Flux Fill, Kontext | 24GB GPU                        | ✅ Supported

βš™οΈ Installation Guide

0. System Requirements

❗ Mandatory: Install the Microsoft Visual C++ Redistributable if you encounter DLL errors.

0.1 Frontend Requirements

❗ Mandatory: Install Node.js (version 14 or higher) for the Node-Based Frontend UI.

After installing Node.js, verify the installation:

node --version
npm --version

1. Automated Setup

Recommended Method

  git clone https://github.com/lrzjason/T2ITrainer.git
  cd T2ITrainer
  setup.bat
  • Handles: Virtual Environment β€’ Dependency Installation β€’ Model Downloads β€’ Frontend Dependencies

The automated setup will:

  1. Create a Python virtual environment
  2. Install Python dependencies
  3. Install Node.js dependencies for the frontend
  4. Build the frontend UI
  5. Download required models

2. Manual Installation

Clone Repository 🌐

    git clone https://github.com/lrzjason/T2ITrainer.git
    cd T2ITrainer

Virtual Environment 🛠️

    python -m venv venv
    call venv\Scripts\activate
    pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121

Frontend Setup 🖥️

    cd frontend
    npm install
    npm run build
    cd ..

Backend Dependencies 📦

    pip install -r requirements.txt

Model Downloads 📥 ❗ Notice: Only download the models you want to train. Install huggingface-cli if you haven't (or update it if you have an old version). You can find the download scripts in download_xxx.txt.

    # NF4 Qwen Image
    hf download "lrzjason/qwen_image_nf4" --local-dir qwen_models/qwen_image_nf4/

    # NF4 Flux kontext
    hf download "lrzjason/flux-kontext-nf4" --local-dir flux_models/kontext/

    # NF4 Flux Fill for low gpu
    hf download "lrzjason/flux-fill-nf4" --local-dir flux_models/fill/

    # Kolors
    hf download Kwai-Kolors/Kolors --local-dir kolors_models/

    # SD3.5 Models
    hf download "stabilityai/stable-diffusion-3.5-large" --local-dir "sd3.5L/"

    # download original repo for lokr training
    hf download "Qwen/Qwen-Image" --local-dir qwen_models/qwen_image/
    hf download "Qwen/Qwen-Image-Edit" --local-dir qwen_models/qwen_image_edit/

🚀 Launch Options

Command Line Interface

Model              | Command                         | Special Notes
Qwen Edit          | python train_qwen_image_edit.py | 48GB VRAM recommended for the original model
Qwen Image         | python train_qwen_image.py      | 24GB VRAM recommended for nf4; 48GB VRAM recommended for the original model
Flux Kontext       | python ui_flux_fill.py          | 24GB VRAM recommended
Flux Fill          | python ui_flux_fill.py          | 24GB VRAM recommended
LongCat Image      | python train_longcat.py         | 24GB VRAM recommended
LongCat Image Edit | python train_longcat_edit.py    | 24GB VRAM recommended

Node-Based Frontend UI (Recommended)

For the new Node-Based Frontend UI with visualization capabilities:

Development Mode (Fastest for development):

# Terminal 1: Start backend
python backend_api.py

# Terminal 2: Start frontend (auto-reloads on changes)
cd frontend
npm run dev

Access at: http://localhost:3000

Production Mode (Optimized for performance):

# Build and serve the frontend with backend
python main.py

Access at: http://localhost:7860

Preview Mode (Pre-built optimized version):

# Terminal 1: Start backend
python backend_api.py

# Terminal 2: Serve pre-built frontend (faster than main.py)
cd frontend
npm run preview

Access at: http://localhost:7860

Performance Note: npm run dev provides the fastest experience with hot reloading, while npm run preview offers optimized performance similar to production. The python main.py approach uses npm run preview internally for better performance but still requires the backend to be running separately.

🔧 Parameter Configuration Guide

CivitAI Article


🌌 Qwen Model Management

Qwen Image configs:

Config                  | Usage
config_qwen_single.json | Train Qwen Image with a single image; leave the suffix empty to use all images without a suffix.
  • Usage: python train_qwen_image.py --config_path config_qwen_single.json

Qwen Edit configs:

Config                               | Usage
config_qwen_single.json              | Train Qwen Image/Edit with a single image; leave the suffix empty to use all images without a suffix.
config_qwen_edit_pairs.json          | Traditional Qwen Edit training using _T and _R suffixed images.
config_qwen_edit_pairs_multiple.json | Train with multiple reference images by setting suffixes like _T, _R, and _G.
  • Usage: python train_qwen_image_edit.py --config_path config_qwen_single.json

Qwen Model Installation

NF4 Model Setup

  hf download"lrzjason/qwen_image_nf4" --local-dir qwen_models/qwen_image_nf4/

For more details (example dataset):

βš™οΈ Qwen Recommended Parameters

Qwen Image NF4

Category Settings
Base Configuration Rank 32, AdamW, Learn Rate 1e-4
24GB GPU 512 resolution, Batch Size 1
Precision bf16

Qwen Image Model

Category           | Settings
Base Configuration | Rank 32~64, AdamW, learning rate 1e-4
48GB GPU           | 1024 resolution, batch size 1
Precision          | bf16

Qwen Edit Model

Category           | Settings
Base Configuration | Rank 32~64, AdamW, learning rate 1e-4
48GB GPU           | 512 resolution, batch size 1
Precision          | bf16

💻 VRAM Usage (nf4, bs1, blocks_to_swap=20)

VRAM Peak

💻 VRAM Usage (nf4, bs1, blocks_to_swap=0)

VRAM Peak

💻 VRAM Usage (Original, bf16, bs1, blocks_to_swap=0)

VRAM Peak
Around 43GB

🌌 Flux Model Management

Config                         | Usage
config_new_single.json         | Train Kontext with a single image; leave the suffix empty to use all images without a suffix.
config_new_pairs.json          | Traditional Kontext training using _T and _R suffixed images.
config_new_pairs_multiple.json | Train with multiple reference images by setting suffixes like _T, _R, and _G.
config_new_mixed.json          | Train Kontext using a mixed layout, e.g. combining traditional pair training with single-image training.
  • Usage: python train_flux_lora_ui_kontext_new.py --config_path config_new_single.json

Kontext Model Installation

Kontext Model Setup

  hf download"lrzjason/flux-kontext-nf4" --local-dir flux_models/kontext/

For more details (example dataset):

Fill Model Installation (skip if training Kontext)

Inpainting Model Setup

  hf download"lrzjason/flux-fill-nf4" --local-dir flux_models/fill/ 

For more details (example dataset):

Dev Model Download (skip if training Fill or Kontext)

Dev Model Installation

  hf download"black-forest-labs/FLUX.1-dev" --local-dir flux_models/dev/

βš™οΈ Flux Training Recommended Parameters

Category Settings
Base Configuration Rank 16, AdamW, Lr 1e-4
24GB GPU 512 resolution, Batch Size 1
VRAM Optimization Use nf4 based training
Precision bf16

🌌 LongCat Model Management

Config                   | Usage
config_longcat_dev.json  | Train LongCat Image with a single image; leave the suffix empty to use all images without a suffix.
config_longcat_edit.json | Train LongCat Image Edit with paired images using suffixes like _T, _R, etc.
  • Usage (LongCat Image): python train_longcat.py --config_path config_longcat_dev.json
  • Usage (LongCat Image Edit): python train_longcat_edit.py --config_path config_longcat_edit.json

LongCat Model Installation

LongCat Model Setup

  hf download "Meituan/LongCat-Image" --local-dir longcat_models/LongCat-Image/
  hf download "Meituan/LongCat-Image-Edit" --local-dir longcat_models/LongCat-Image-Edit/

βš™οΈ LongCat Training Recommended Parameters

Category Settings
Base Configuration Rank 32~64, AdamW, Learn Rate 1e-4
24GB GPU 1024 resolution, Batch Size 1
Precision bf16

💻 VRAM Usage (nf4)

VRAM Peak

💻 VRAM Usage (bf16, blocks_to_swap=10)

VRAM Peak
VRAM Low

🔧 Visualize Training Data

Register a WandB account before using it, then set up the WandB environment:

pip install wandb
wandb login
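
Once logged in, a training run reports metrics roughly like the minimal sketch below (the project and metric names here are illustrative, not the trainer's actual keys):

    import wandb

    # One run per training session; metrics show up live on wandb.ai
    wandb.init(project="t2itrainer", name="qwen-lora-example")
    for step in range(3):
        wandb.log({"train/loss": 1.0 / (step + 1)}, step=step)
    wandb.finish()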

Install TensorBoard first if you choose to use it. To visualize training data, run the following command in your terminal:

tensorboard --logdir=.\logs

🆘 Troubleshooting

  • Kolors Black Image Issue: Ensure you're using the FP16 Fixed VAE.
  • VRAM Limitations: Adjust the blocks_to_swap parameter; higher values reduce memory usage (see the sketch after this list).
  • Windows DLL Errors: Verify the VC++ Redistributable installation.
  • Frontend Not Loading: Ensure Node.js is installed and the frontend is built (cd frontend && npm install && npm run build).
  • Templates Not Found: In production builds, ensure the backend is running (python backend_api.py) before accessing the frontend.
  • Slow Frontend Performance: Use npm run dev for development or npm run preview for optimized local serving instead of python main.py.

Star History

Star History Chart

Old Change logs:

Recent Change Logs:

  • 2025-07-30:
  • Fix: Remove text attention mask in lora training.

Sponsor:

📬 Contact

Sponsor me for more open-source projects:

Buy me a coffee:

Buy Me a Coffee QR

WeChat:

WeChat QR
  • Thanks to 猫不爱吃香菜 for sponsoring the LoKr support.
  • Thanks to AIGate (https://waas.aigate.cc/) for providing compute power for development.

Acknowledgements:

  • Thanks to chenpipi0807 for Chinese translation and language switch support
  • Thanks for diffusers and Terminus Research Group
  • Thanks to minienglish1 and Freon in EveryDream Discord for the assistance.
  • Special thanks to kohya ss for references from the training codebase.
  • Thanks to Kblueleaf for coding reference on hunyuandit gradient checkpoint implementation.
  • Thanks to Kolors for the open-source checkpoint.
  • Thanks to comfyui for the wonderful codebase.
  • Thanks to emojiiii for the setup.bat script and other updates.
  • Thanks to Rohit Gandikota and related authors of Concept Sliders https://github.com/rohitgandikota/sliders
