Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View lucky1day's full-sized avatar

Block or report lucky1day

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 33,484 3,900 Updated Nov 10, 2025

从0实现一个简洁清晰的Deep Search Agent

Python 591 150 Updated Aug 19, 2025

a toolkit on knowledge distillation for large language models

Python 199 21 Updated Nov 3, 2025

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 36,550 6,061 Updated Nov 10, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 873 57 Updated Nov 4, 2025

🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.

Python 460 37 Updated Sep 28, 2025

项目描述:构建⼀个能够⾃动撰写多模态呈现、具备专业性和深度、数据融合与事实溯源、规范 有逻辑的各类⾦融研报的智能 Agent 系统。 主要负责:根据赛题思路,对目标公司生成金融研报,通过 LLM 获取目标公司的竞争对手,用 akshare 获取数 据源的的三大报表数据,通过 duckduckgo 获取公司信息、行业信息、股份信息等,通过设计数据分析师智能 体,包含三个动作代码生成和执行、收集…

Python 20 1 Updated Aug 1, 2025

AFAC2025挑战组-赛题三:金融领域中的长思维链压缩-冠军(第一名)解决方案

Python 36 3 Updated Sep 3, 2025
Python 172 15 Updated Apr 30, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 29,141 3,058 Updated Nov 13, 2025

Implementation of my agent used in 2025 AFAC TianChi competition

Jupyter Notebook 18 3 Updated Oct 6, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 4,986 456 Updated Oct 6, 2025

deepResearch

Python 76 8 Updated Apr 23, 2025

[COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning

Python 668 83 Updated Oct 12, 2025

Fully open reproduction of DeepSeek-R1

Python 25,636 2,398 Updated Sep 8, 2025

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 653 43 Updated Oct 15, 2025

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,817 602 Updated Nov 12, 2025

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,466 101 Updated Nov 13, 2025

算法岗笔试面试大全,励志做算法届的《五年高考,三年模拟》!

646 32 Updated Mar 24, 2025
Python 22 4 Updated Aug 8, 2023

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,059 2,253 Updated Nov 11, 2025

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 365 30 Updated Sep 6, 2024

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

2,325 159 Updated Dec 26, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,206 69 Updated Mar 9, 2025

MiniLLM is a minimal system for running modern LLMs on consumer-grade GPUs

Python 934 58 Updated May 15, 2023

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,190 111 Updated Aug 16, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,517 289 Updated Nov 13, 2025

A unified evaluation framework for large language models

Python 2,742 218 Updated Oct 13, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,068 702 Updated Nov 13, 2025
Next