π―
Focusing
π¨ researching next-generation modeling paradigms; building scalable foundation model systems
Pinned Loading
-
NVIDIA/Megatron-LM
NVIDIA/Megatron-LM PublicOngoing research training transformer models at scale
-
XueFuzhao/OpenMoE
XueFuzhao/OpenMoE PublicA family of open-sourced Mixture-of-Experts (MoE) Large Language Models
-
EvolvingLMMs-Lab/lmms-engine
EvolvingLMMs-Lab/lmms-engine PublicA simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
-
dlms-are-super-data-learners
dlms-are-super-data-learners PublicThe official github repo for "Diffusion Language Models are Super Data Learners".
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.