Stars
My blogs and code for machine learning. http://cnblogs.com/pinard
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, TensorFlow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Python Implementation of Reinforcement Learning: An Introduction
[Lumina Embodied AI] Embodied AI technical guide (Embodied-AI-Guide)
"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe
Litex is a simple formal language, learnable in 2 hours, not 1 year. It scales formal reasoning in the AI era.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
This repo curates ChatGPT prompts to help you use ChatGPT and other LLM tools better.
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
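Pieced-together RAG: hand-builds and merges GraphRAG, LightRAG, and Neo4j-llm-graph-builder for knowledge graph construction and search; integrates DeepSearch techniques for reasoning over private-domain RAG; includes a custom evaluation framework for GraphRAG | Integrate GraphRAG, LightRAG, and Neo4j-llm-graph-builder for knowledge graph construct…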
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Build a RAG (Retrieval Augmented Generation) pipeline from scratch and have it all run locally.
No fortress, purely open ground. OpenManus is Coming.
LLM Finetuning with peft
QLoRA: Efficient Finetuning of Quantized LLMs
Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
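"Dive into LLMs": a series of hands-on programming tutorials
Requires only basic Python: build a large language model from scratch; build GLM4, Llama3, and RWKV6 step by step from scratch to understand in depth how large models work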
An Open-Source Framework for Prompt-Learning.
An introductory LLM tutorial for developers; the Chinese edition of Andrew Ng's large language model course series
Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and open-source models.
From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Latest Advances on System-2 Reasoning