Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
SGLang is a fast serving framework for large language models and vision language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
Steering vectors for transformer language models in Pytorch / Huggingface
A library for mechanistic interpretability of GPT-style language models
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Model interpretability and understanding for PyTorch
Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, B…
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models (CRFM) at Stanford for holistic, reproducible and transparen…
Awesome-LLM: a curated list of Large Language Model
An autoregressive character-level language model for making more things
Robust Speech Recognition via Large-Scale Weak Supervision
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.
Self-contained, minimalistic implementation of diffusion models with Pytorch.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…