Codestin Search App

danielzgtg · 2025-08-13T17:58:45Z

Users with 1GB RAM will be able (though ill-advised) to run any model by letting it swap. Users with 16GB RAM will be able to restart the Orpheus model quickly now that the page cache is not evicted every run.

The mmap code is from llama.cpp. I also deduplicated the model loading code.

This is currently non-enforcing and unapplied per file until said file receives a major edit.

This'll allow adding supporting files to each model.

Users with 1GB RAM will be able (though ill-advised) to run any model by letting it swap. Users with 16GB RAM will be able to restart the Orpheus model quickly now that the page cache is not evicted every run.

danielzgtg · 2025-08-16T11:34:07Z

Merging to see if this helps #103 and #108

aPaleBlueDot · 2025-08-18T21:04:38Z

I failed to leverage mmap to fit Parler mini into memory, as it still asks for 5.3GB somehow. I must be doing something wrong.

danielzgtg · 2025-08-18T21:45:59Z

@aPaleBlueDot Can you upload your cmake-build-release/CMakeCache.txt file and post the arguments you are invoking cmake-build-release/bin/tts-cli with?

aPaleBlueDot · 2025-08-19T04:56:04Z

@danielzgtg Here is the file
CMakeCache.txt
and here are the parameters (w/ added line breaks for readability):

`
generation_configuration config(

  "",      // voice (empty)

  30,      // top_k (reduced from 50)

  1.0f,    // temperature

  1.1f,    // repetition_penalty

  false,   // use_cross_attn (disabled to save memory)

  "",      // espeak_voice_id (empty)

  256,     // max_tokens (reduced from 512)

  0.95f,   // top_p

  true     // sample

);

int n_threads = 4; // 4 threads on iOS

bool cpu_only = true; // Force CPU-only mode
`

And it was the 5-bit quantized model.

mmwillet#105 (comment) Co-authored-by: aPaleBlueDot <[email protected]>

danielzgtg mentioned this pull request Aug 13, 2025

5GB memory allocated up front #103

Open

danielzgtg force-pushed the feat/mmap branch from d2c3e81 to 93708c8 Compare August 13, 2025 18:10

danielzgtg requested review from ecyht2 and mmwillet August 13, 2025 18:10

danielzgtg added 8 commits August 16, 2025 07:22

chore: Import llama.cpp .clang-format

d8f40b5

This is currently non-enforcing and unapplied per file until said file receives a major edit.

refactor: Extract quantize_impl.cpp

17ddc4f

refactor: Split models into directories

5aa97b8

This'll allow adding supporting files to each model.

refactor: Extract models/*/loader.cpp

5ed1f91

refactor: Use virtual methods for tts_runner

20c9876

refactor: Have loaders.cpp call assign_weight

5779de4

refactor: Add tts_model_loader registry

fe6a6b0

feat: mmap

1ec1409

Users with 1GB RAM will be able (though ill-advised) to run any model by letting it swap. Users with 16GB RAM will be able to restart the Orpheus model quickly now that the page cache is not evicted every run.

danielzgtg force-pushed the feat/mmap branch from 93708c8 to 1ec1409 Compare August 16, 2025 11:26

danielzgtg merged commit a23f65c into mmwillet:main Aug 16, 2025
2 checks passed

danielzgtg added a commit to danielzgtg/TTS.cpp that referenced this pull request Aug 19, 2025

tests: Add aPaleBlueDot's memory usage test

724d97f

mmwillet#105 (comment) Co-authored-by: aPaleBlueDot <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: mmap#105

feat: mmap#105
danielzgtg merged 8 commits intommwillet:mainfrom
danielzgtg:feat/mmap

danielzgtg commented Aug 13, 2025

Uh oh!

danielzgtg commented Aug 16, 2025

Uh oh!

Uh oh!

aPaleBlueDot commented Aug 18, 2025

Uh oh!

danielzgtg commented Aug 18, 2025

Uh oh!

aPaleBlueDot commented Aug 19, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

danielzgtg commented Aug 13, 2025

Uh oh!

danielzgtg commented Aug 16, 2025

Uh oh!

Uh oh!

aPaleBlueDot commented Aug 18, 2025

Uh oh!

danielzgtg commented Aug 18, 2025

Uh oh!

aPaleBlueDot commented Aug 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

aPaleBlueDot commented Aug 19, 2025 •

edited

Loading