Feat/multispectral dataloader 5ch #11517

snwtz · 2025-05-07T14:42:24Z

new modules: dataloader and dataloader test (local, not optimized for GPU)
multispectral data loading and preprocessing
TODO: explicit caching and order validation
adapted main training script for MS data loading

This commit adds support for 5-channel multispectral data in the VAE architecture while maintaining compatibility with Stable Diffusion 3's latent space requirements. Key changes: - Add AutoencoderKLMultispectral5Ch implementation with 5 input/output channels - Implement 8x downsampling to match SD3's latent space requirements - Add comprehensive test suite for multispectral VAE functionality - Add training script for 5-channel multispectral data - Update documentation with detailed implementation notes Technical details: - Uses 4 downsampling blocks for 8x downsampling - Maintains 4-channel latent space for SD3 compatibility - Implements group normalization (32 groups) for stable training - Preserves spectral information through careful normalization - Handles 16-bit multispectral data with proper scaling Files changed: - src/diffusers/models/autoencoders/autoencoder_kl_multispectral_5ch.py - tests/models/autoencoders/test_models_autoencoder_kl_multispectral_5ch.py - examples/multispectral/train_multispectral_vae_5ch.py

…nel data

Zina Gleaves added 3 commits May 1, 2025 15:46

feat: add multispectral dataloader and SD3 training script for 5-chan…

6997dff

…nel data

fixed caching in test script

fbb137b

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat/multispectral dataloader 5ch #11517

Feat/multispectral dataloader 5ch #11517

snwtz commented May 7, 2025

Feat/multispectral dataloader 5ch #11517

Are you sure you want to change the base?

Feat/multispectral dataloader 5ch #11517

Conversation

snwtz commented May 7, 2025