Implementation of the paper: "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
eclouder/MoE-LLM