Releases: leejet/stable-diffusion.cpp
Releases · leejet/stable-diffusion.cpp
master-ddc4a18
fix: make tiled VAE reuse the compute buffer (#821)
master-dc46993
feat: increase work_ctx memory buffer size (#814)
master-c607fc3
feat: use Euler sampling by default for SD3 and Flux (#753) Thank you for your contribution.
master-b54bec3
fix: do not force VAE type to f32 on SDXL (#716) This seems to be a leftover from the initial SDXL support: it's not enough to avoid NaN issues, and it's not not needed for the fixed sdxl-vae-fp16-fix .
master-9e7befa
fix: harden for large files (#643)
master-5869987
fix: make weight override more robust against ggml changes (#760)
master-55c2e05
feat: optimize tensor loading time (#790) * opt tensor loading * fix build failure * revert the changes * allow the use of n_threads * fix lora loading * optimize lora loading * add mutex * use atomic * fix build * fix potential duplicate issue * avoid duplicate lookup of lora tensor * fix progeress bar * remove unused remove_duplicates --------- Co-authored-by: leejet <[email protected]>
master-52a97b3
feat: add vace support (#819) * add wan vace t2v support * add --vace-strength option * add vace i2v support * fix the processing of vace_context * add vace v2v support * update docs
master-48956ff
feat: reduce CLIP memory usage with no embeddings (#768)
master-2c9b1e2
feat: add VAE encoding tiling support and adaptive overlap (#484) * implement tiling vae encode support * Tiling (vae/upscale): adaptative overlap * Tiling: fix edge case * Tiling: fix crash when less than 2 tiles per dim * remove extra dot * Tiling: fix edge cases for adaptative overlap * tiling: fix edge case * set vae tile size via env var * vae tiling: refactor again, base on smaller buffer for alignment * Use bigger tiles for encode (to match compute buffer size) * Fix edge case when tile is bigger than latent * non-square VAE tiling (#3) * refactor tile number calculation * support non-square tiles * add env var to change tile overlap * add safeguards and better error messages for SD_TILE_OVERLAP * add safeguards and include overlapping factor for SD_TILE_SIZE * avoid rounding issues when specifying SD_TILE_SIZE as a factor * lower SD_TILE_OVERLAP limit * zero-init empty output buffer * Fix decode latent size * fix encode * tile size params instead of env * Tiled vae parameter validation (#6) * avoid crash with invalid tile sizes, use 0 for default * refactor default tile size, limit overlap factor * remove explicit parameter for relative tile size * limit encoding tile to latent size * unify code style and format code * update docs * fix get_tile_sizes in decode_first_stage --------- Co-authored-by: Wagner Bruna <[email protected]> Co-authored-by: leejet <[email protected]>