Thanks to visit codestin.com
Credit goes to github.com

Skip to content

Releases: leejet/stable-diffusion.cpp

master-ddc4a18

14 Sep 04:15
ddc4a18
Compare
Choose a tag to compare
fix: make tiled VAE reuse the compute buffer (#821)

master-dc46993

14 Sep 05:52
dc46993
Compare
Choose a tag to compare
feat: increase work_ctx memory buffer size (#814)

master-c607fc3

14 Sep 05:10
c607fc3
Compare
Choose a tag to compare
feat: use Euler sampling by default for SD3 and Flux (#753)

Thank you for your contribution.

master-b54bec3

14 Sep 04:54
b54bec3
Compare
Choose a tag to compare
fix: do not force VAE type to f32 on SDXL (#716)

This seems to be a leftover from the initial SDXL support: it's
not enough to avoid NaN issues, and it's not not needed for the
fixed sdxl-vae-fp16-fix .

master-9e7befa

14 Sep 05:21
9e7befa
Compare
Choose a tag to compare
fix: harden for large files (#643)

master-5869987

14 Sep 04:49
5869987
Compare
Choose a tag to compare
fix: make weight override more robust against ggml changes (#760)

master-55c2e05

14 Sep 15:20
55c2e05
Compare
Choose a tag to compare
feat: optimize tensor loading time (#790)

* opt tensor loading

* fix build failure

* revert the changes

* allow the use of n_threads

* fix lora loading

* optimize lora loading

* add mutex

* use atomic

* fix build

* fix potential duplicate issue

* avoid duplicate lookup of lora tensor

* fix progeress bar

* remove unused remove_duplicates

---------

Co-authored-by: leejet <[email protected]>

master-52a97b3

14 Sep 09:31
52a97b3
Compare
Choose a tag to compare
feat: add vace support (#819)

* add wan vace t2v support

* add --vace-strength option

* add vace i2v support

* fix the processing of vace_context

* add vace v2v support

* update docs

master-48956ff

14 Sep 04:53
48956ff
Compare
Choose a tag to compare
feat: reduce CLIP memory usage with no embeddings (#768)

master-2c9b1e2

14 Sep 08:33
2c9b1e2
Compare
Choose a tag to compare
feat: add VAE encoding tiling support and adaptive overlap  (#484)

* implement  tiling vae encode support

* Tiling (vae/upscale): adaptative overlap

* Tiling: fix edge case

* Tiling: fix crash when less than 2 tiles per dim

* remove extra dot

* Tiling: fix edge cases for adaptative overlap

* tiling: fix edge case

* set vae tile size via env var

* vae tiling: refactor again, base on smaller buffer for alignment

* Use bigger tiles for encode (to match compute buffer size)

* Fix edge case when tile is bigger than latent

* non-square VAE tiling (#3)

* refactor tile number calculation

* support non-square tiles

* add env var to change tile overlap

* add safeguards and better error messages for SD_TILE_OVERLAP

* add safeguards and include overlapping factor for SD_TILE_SIZE

* avoid rounding issues when specifying SD_TILE_SIZE as a factor

* lower SD_TILE_OVERLAP limit

* zero-init empty output buffer

* Fix decode latent size

* fix encode

* tile size params instead of env

* Tiled vae parameter validation (#6)

* avoid crash with invalid tile sizes, use 0 for default

* refactor default tile size, limit overlap factor

* remove explicit parameter for relative tile size

* limit encoding tile to latent size

* unify code style and format code

* update docs

* fix get_tile_sizes in decode_first_stage

---------

Co-authored-by: Wagner Bruna <[email protected]>
Co-authored-by: leejet <[email protected]>