Is there support for loading a sharded gguf file? #1341
@jharonfe I haven't tested it personally, but according to the linked discussion it's automatically detected by One caveat is that this probably doesn't work with One thing that would help is a link to a small model that's been split and uploaded, preferably <7b.
I tested this using the latest version, 0.2.61, and the model appears to load correctly. Thanks for the feedback on this.
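For reference, a minimal sketch of what that looks like; the model path below is a hypothetical first-shard filename, and the expectation (per the linked discussion) is that llama.cpp finds the remaining *-of-00006.gguf pieces sitting next to it automatically:

from llama_cpp import Llama

# Point Llama at the first split file; llama.cpp should pick up the sibling
# shards (-00002-of-00006.gguf, etc.) from the same directory on its own.
llm = Llama(
    model_path="models/Llama-3-70B-Instruct-DPO-v0.4.IQ4_XS-00001-of-00006.gguf",  # hypothetical path
    n_gpu_layers=-1,
)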
I just hit this issue today. I had tried using a wildcard to specify all of the files, but it then complained about seeing multiple files. An option like
Prototype: 0e67a83
PR opened: #1457
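If that PR lands, usage would presumably look something like the sketch below; the additional_files keyword is an assumption based on this thread, not a confirmed parameter name, so check the PR itself for the actual interface:

from llama_cpp import Llama

# Hypothetical sketch: pass the first shard as `filename` and the remaining
# shards through an extra option (parameter name assumed, not confirmed).
llm = Llama.from_pretrained(
    repo_id="MaziyarPanahi/Llama-3-70B-Instruct-DPO-v0.4-GGUF",
    filename="*IQ4_XS-00001-of-00006*",
    additional_files=[f"*IQ4_XS-{i:05d}-of-00006*" for i in range(2, 7)],
    n_gpu_layers=-1,
)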
I would appreciate this feature! Currently I do something very hacky like the below, and it works (once all the files are downloaded), but it's pretty clunky:

from llama_cpp import Llama
for i in range(1, 6 + 1):  # assuming the model is split into 6 shards
    try:
        llm = Llama.from_pretrained(
            repo_id="MaziyarPanahi/Llama-3-70B-Instruct-DPO-v0.4-GGUF",
            filename=f"*IQ4_XS-{i:05d}*",
            verbose=True,
            n_gpu_layers=-1,
        )
    except Exception:
        # The load fails until every shard is present, but each call still
        # downloads the matching file.
        pass
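A somewhat less clunky version of the same workaround, sketched under the assumption that the shards follow the usual -NNNNN-of-00006.gguf naming (the exact local filename below is a guess), is to pull every shard in one call with huggingface_hub and then load the first split file directly:

from huggingface_hub import snapshot_download
from llama_cpp import Llama

# Download all IQ4_XS shards from the repo in one go.
local_dir = snapshot_download(
    repo_id="MaziyarPanahi/Llama-3-70B-Instruct-DPO-v0.4-GGUF",
    allow_patterns=["*IQ4_XS*"],
)

# Load the first split file; llama.cpp should locate the remaining shards
# in the same directory.
llm = Llama(
    model_path=f"{local_dir}/Llama-3-70B-Instruct-DPO-v0.4.IQ4_XS-00001-of-00006.gguf",  # assumed shard name
    n_gpu_layers=-1,
    verbose=True,
)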
Well, my PR has been sitting there for a month now. You might try patching the code additions in yourself to get rid of the jank.
Is your feature request related to a problem? Please describe.
I'm inquiring whether this project supports loading a "sharded" gguf model file. The llama.cpp project appears to have added tooling for splitting gguf files into pieces (more here). I was curious whether this project supports loading gguf files in that format, since I didn't see any mention of it in the documentation or issues.
If it is supported, could you point me to the documentation on this or provide a code example? If not, perhaps this feature could be added?