csgcmai
😜 Be the fire and wish for the wind

Starred repositories

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 704 53 Updated Aug 6, 2025

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Python 21,233 1,977 Updated Oct 29, 2025

Official implementation of "Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs".

Python 78 8 Updated Oct 26, 2025

Official Codebase for the paper: A Token is Worth over 1,000 Tokens: Efficient Knowledge Distillation through Low-Rank Clone.

Python 30 Updated Jun 16, 2025

Building an MoE large model on your own: a complete walkthrough from pre-training to DPO

Python 1,694 132 Updated Oct 21, 2025

MISP-Meeting Dataset & Code

Python 2 2 Updated Jul 23, 2025

Tarsier: a family of large-scale video-language models designed to generate high-quality video descriptions, with strong general video understanding capability.

Python 497 28 Updated Aug 14, 2025

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 1,404 200 Updated Oct 24, 2025

Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core.

Python 117 23 Updated Oct 16, 2025

A Next-Generation Training Engine Built for Ultra-Large MoE Models

Python 4,956 379 Updated Oct 28, 2025

ScalarLM - a unified training and inference stack

Python 91 10 Updated Oct 1, 2025

AllenAI's post-training codebase

Python 3,273 453 Updated Oct 29, 2025

PyTorch building blocks for the OLMo ecosystem

Python 312 57 Updated Oct 28, 2025

Repository containing code and data for the paper "ArgCMV: An Argument Summarization Benchmark for the LLM-era", accepted at EMNLP 2025 Main Conference.

1 Updated Aug 25, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 861 56 Updated Oct 24, 2025

Awesome LLM pre-training resources, including data, frameworks, and methods.

270 17 Updated Apr 29, 2025

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 567 69 Updated Sep 11, 2024

Latency and Memory Analysis of Transformer Models for Training and Inference

Python 461 54 Updated Apr 19, 2025

LLM theoretical performance analysis tools, supporting params, FLOPs, memory, and latency analysis.

Python 109 9 Updated Jul 11, 2025

📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥

1,792 76 Updated Oct 28, 2025

A high-performance inference engine for LLMs, optimized for diverse AI accelerators.

C++ 609 74 Updated Oct 28, 2025

Transformer-related optimization, including BERT and GPT

C++ 6,334 921 Updated Mar 27, 2024

[ACL 2025 Main] EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 308 22 Updated May 22, 2025

[TMLR 2024] Efficient Large Language Models: A Survey

1,226 97 Updated Jun 23, 2025

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

349 30 Updated Aug 29, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 61,276 10,849 Updated Oct 28, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,461 3,190 Updated Oct 29, 2025

kernels, of the mega variety

Python 592 26 Updated Sep 28, 2025