Stars
verl: Volcano Engine Reinforcement Learning for LLMs
Machine Learning Engineering Open Book
An Efficient "Factory" to Build Multiple LoRA Adapters
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
[Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
💡 LeetCode in C++23/Java/Python/MySQL/TypeScript (respect coding conventions)
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.
A fast inference library for running LLMs locally on modern consumer-class GPUs
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
LlamaIndex is the leading document agent and OCR platform
A quick guide (especially) for trending instruction finetuning datasets
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Run evaluation on LLMs using human-eval benchmark
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Open Academic Research on Improving LLaMA to SOTA LLM
bigcode-project / Megatron-LM
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
Home of StarCoder: fine-tuning & inference!
C++ Parallel Computing and Asynchronous Networking Framework
推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
程序员鱼皮的编程宝典 ⭐️ 2026年最全编程学习路线图!包含Java学习路线、前端学习路线、Python学习路线、C++学习路线、算法学习路线、计算机基础学习路线等。提供编程入门教程、技术知识分享、学习资源推荐、项目实战教程、热门面试题、求职经验、简历优化、编程自学指南等内容,适用于所有零基础学编程、转行程序员、计算机专业学生、求职找工作的同学 💎 编程学习,就来编程导航!