Starred repositories
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Official implementation of "Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs".
Official Codebase for the paper: A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.
Building a MoE large language model from scratch: a complete walkthrough from pretraining to DPO.
Tarsier -- a family of large-scale video-language models designed to generate high-quality video descriptions, with strong general video-understanding capability.
The official repo of Pai-Megatron-Patch, developed by Alibaba Cloud, for large-scale LLM & VLM training.
Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.
A Next-Generation Training Engine Built for Ultra-Large MoE Models
ScalarLM - a unified training and inference stack
PyTorch building blocks for the OLMo ecosystem
Repository containing code and data for the paper "ArgCMV: An Argument Summarization Benchmark for the LLM-era", accepted at EMNLP 2025 Main Conference.
Lumina-DiMOO - An Open-Source Multi-Modal Large Diffusion Language Model
Awesome LLM pre-training resources, including data, frameworks, and methods.
Analyze the inference of Large Language Models (LLMs): computation, storage, transmission, and the hardware roofline model, in a user-friendly interface.
Latency and Memory Analysis of Transformer Models for Training and Inference
LLM theoretical performance analysis tools, supporting analysis of parameters, FLOPs, memory, and latency.
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
A high-performance inference engine for LLMs, optimized for diverse AI accelerators.
Transformer-related optimization, including BERT and GPT
[ACL 2025 Main] EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
[TMLR 2024] Efficient Large Language Models: A Survey
Speed Always Wins: A Survey on Efficient Architectures for Large Language Models
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a fast serving framework for large language models and vision language models.