stikhidyidtd

Amethyst stikhidyidtd

4 followers · 10 following

Stars

wyf3 / llm_related

复现大模型相关算法及一些学习记录

Python 2,400 331 Updated Oct 25, 2025

ljpzzz / machinelearning

My blogs and code for machine learning. http://cnblogs.com/pinard

Jupyter Notebook 8,632 3,744 Updated Feb 16, 2024

dennybritz / reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Jupyter Notebook 21,661 6,163 Updated Jul 13, 2023

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python 14,368 4,958 Updated Aug 9, 2024

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide

8,499 565 Updated Sep 22, 2025

datawhalechina / tiny-universe

《大模型白盒子构建指南》：一个全手搓的Tiny-Universe

Jupyter Notebook 3,935 400 Updated Aug 30, 2025

litexlang / golitex

Litex is a simple formal language Learnable in 2 hours, not 1 year. It scales formal reasoning in AI era.

Go 564 6 Updated Oct 29, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,106 7,389 Updated Oct 27, 2025

f / awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.

JavaScript 135,891 18,085 Updated Oct 14, 2025

dair-ai / Prompt-Engineering-Guide

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 65,627 6,818 Updated Oct 16, 2025

1517005260 / graph-rag-agent

拼好RAG：手搓并融合了GraphRAG、LightRAG、Neo4j-llm-graph-builder进行知识图谱构建以及搜索；整合DeepSearch技术实现私域RAG的推理；自制针对GraphRAG的评估框架| Integrate GraphRAG, LightRAG, and Neo4j-llm-graph-builder for knowledge graph construct…

Python 1,378 188 Updated Oct 29, 2025

weiwill88 / Local_Pdf_Chat_RAG

Python 736 142 Updated Oct 25, 2025

infiniflow / ragflow

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 66,758 7,080 Updated Oct 29, 2025

mrdbourke / simple-local-rag

Build a RAG (Retrieval Augmented Generation) pipeline from scratch and have it all run locally.

Jupyter Notebook 877 263 Updated May 25, 2024

FoundationAgents / OpenManus

No fortress, purely open ground. OpenManus is Coming.

Python 50,555 8,830 Updated Oct 29, 2025

ashishpatel26 / LLM-Finetuning

LLM Finetuning with peft

Jupyter Notebook 2,687 694 Updated Aug 1, 2025

artidoro / qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook 10,721 866 Updated Jun 10, 2024

dvgodoy / FineTuningLLMs

Official repository of my book "A Hands-On Guide to Fine-Tuning LLMs with PyTorch and Hugging Face"

Jupyter Notebook 540 73 Updated Oct 5, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,271 806 Updated Oct 27, 2025