Thanks to visit codestin.com
Credit goes to github.com

Skip to content

WikiChao/FreSca

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FreSca: Scaling in Frequency Space Enhances Diffusion Models

University of Rochester, Hong Kong University of Science and Technology, The University of Texas at Dallas
If our project helps you, please give us a star ⭐ on GitHub to support us.
访客统计 GitHub stars Static Badge
💡 Where and why you should apply Frequency Scaling in diffusion models.

📋 Overview

FreSca is a novel frequency scaling approach that enhances diffusion models without additional training. Our key findings:

  • Frequency scaling improves visual quality with minimal computational overhead
  • Energy-based cutoff strategy provides stable and consistent improvements
  • Compatible with existing diffusion models as a plug-and-play enhancement (image generation, video generation, depth estimation, image editing)

📰 News

  • [2025-05] 🔥 Major Update! Our new version reveals the optimal locations for frequency scaling, showing improvements across image generation, video synthesis, depth estimation, and editing tasks. New energy-based cutoff technique delivers superior stability!
  • [2025-04] Exciting discovery! FreSca boosts video diffusion models' performance without training! Preliminary VideoCrafter2 results added.
  • [2025-04] Released example implementation for FreSca.

🎬 Gallery

Video Generation Results

VideoCrafter2: Original vs FreSca
Original
FreSca
"A motorcycle riding along a desert highway, sand dunes stretching beside"
"Lanterns drifting into the night sky, a calm lake mirroring their glow"
"A train winding through sunflower fields, bright yellow blooms on both sides"
"A campfire on a lakeshore, stars sparkling in the dark sky"
"A tiger walks in the forest, photorealistic, 4k, high definition"

Click on any GIF to view the full MP4 video.

Depth Estimation Results

Depth Estimation Comparison

FreSca enhances depth estimation quality by preserving fine details and improving edge accuracy.

🤖 Implementation

The core implementation is available in core/fresca.py and can be easily integrated into any diffusion model.

Key Components

  1. Frequency Space Transformation: Convert latent features to frequency domain using FFT
  2. Selective Scaling: Apply customizable scaling to low- or high-frequency components
  3. Energy-based Cutoff: Adaptive boundary determination for frequency scaling

Recommended Parameters

  • Cutoff ratio: 0.5~0.9 (energy-based approach)
  • High-frequency scaling factor: 1.0 < h < 1.5
  • Position: Apply the nosie predictions

Find more example implementations in the demo/ directory.

📑 Citation

If you use this code for your research, please cite our work:

@article{huang2025fresca,
        title={FreSca: Unveiling the Scaling Space in Diffusion Models},
        author={Huang, Chao and Liang, Susan and Tang, Yunlong and Ma, Li and Tian, Yapeng and Xu, Chenliang},
        journal={arXiv preprint arXiv:2504.02154},
        year={2025}
}

📧 Contact

For questions or feedback, please open an issue on GitHub or contact the authors directly.

About

[CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages