xlm-research
Popular repositories Loading
Repositories
Showing 10 of 34 repositories
- ms-swift Public Forked from modelscope/ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...) (AAAI 2025).
xlm-research/ms-swift’s past year of commit activity - SpecForge Public Forked from sgl-project/SpecForge
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
xlm-research/SpecForge’s past year of commit activity - Megatron-LM Public Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
xlm-research/Megatron-LM’s past year of commit activity - Pai-Megatron-Patch Public Forked from alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
xlm-research/Pai-Megatron-Patch’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…