Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View stikhidyidtd's full-sized avatar

Block or report stikhidyidtd

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

复现大模型相关算法及一些学习记录

Python 2,400 331 Updated Oct 25, 2025

My blogs and code for machine learning. http://cnblogs.com/pinard

Jupyter Notebook 8,632 3,744 Updated Feb 16, 2024

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,661 6,163 Updated Jul 13, 2023

Python Implementation of Reinforcement Learning: An Introduction

Python 14,368 4,958 Updated Aug 9, 2024

[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide

8,499 565 Updated Sep 22, 2025

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

Jupyter Notebook 3,935 400 Updated Aug 30, 2025

Litex is a simple formal language Learnable in 2 hours, not 1 year. It scales formal reasoning in AI era.

Go 564 6 Updated Oct 29, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,106 7,389 Updated Oct 27, 2025

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

JavaScript 135,891 18,085 Updated Oct 14, 2025

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 65,627 6,818 Updated Oct 16, 2025

拼好RAG:手搓并融合了GraphRAG、LightRAG、Neo4j-llm-graph-builder进行知识图谱构建以及搜索;整合DeepSearch技术实现私域RAG的推理;自制针对GraphRAG的评估框架| Integrate GraphRAG, LightRAG, and Neo4j-llm-graph-builder for knowledge graph construct…

Python 1,378 188 Updated Oct 29, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 66,758 7,080 Updated Oct 29, 2025

Build a RAG (Retrieval Augmented Generation) pipeline from scratch and have it all run locally.

Jupyter Notebook 877 263 Updated May 25, 2024

No fortress, purely open ground. OpenManus is Coming.

Python 50,555 8,830 Updated Oct 29, 2025

LLM Finetuning with peft

Jupyter Notebook 2,687 694 Updated Aug 1, 2025

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,721 866 Updated Jun 10, 2024

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"

Jupyter Notebook 540 73 Updated Oct 5, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,271 806 Updated Oct 27, 2025

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 9,223 922 Updated Oct 10, 2025

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

Jupyter Notebook 3,665 507 Updated Aug 15, 2024

An Open-Source Framework for Prompt-Learning.

Python 4,749 479 Updated Jul 16, 2024

面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版

Jupyter Notebook 21,960 2,628 Updated Jun 12, 2025

大模型基础: 一文了解大模型基础知识

6,092 509 Updated Feb 24, 2025

Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"

Python 135 8 Updated Jun 28, 2025

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python 74,093 9,635 Updated Oct 19, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,402 199 Updated May 7, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,841 375 Updated Oct 17, 2025

Latest Advances on System-2 Reasoning

Python 1,260 72 Updated Jun 8, 2025
Next