lucky1day

lucky1day

0 followers · 2 following

Stars

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 33,484 3,900 Updated Nov 10, 2025

666ghj / DeepSearchAgent-Demo

从0实现一个简洁清晰的Deep Search Agent

Python 591 150 Updated Aug 19, 2025

modelscope / easydistill

a toolkit on knowledge distillation for large language models

Python 199 21 Updated Nov 3, 2025

chatchat-space / Langchain-Chatchat

Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 36,550 6,061 Updated Nov 10, 2025

Alpha-VLLM / Lumina-DiMOO

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 873 57 Updated Nov 4, 2025

camel-ai / loong

🐉 Loong: Synthesize Long CoTs at Scale through Verifiers.

Python 460 37 Updated Sep 28, 2025

yunduo0517mht / tianchi_AFAC_AGENT

项目描述：构建⼀个能够⾃动撰写多模态呈现、具备专业性和深度、数据融合与事实溯源、规范有逻辑的各类⾦融研报的智能 Agent 系统。主要负责：根据赛题思路，对目标公司生成金融研报，通过 LLM 获取目标公司的竞争对手，用 akshare 获取数据源的的三大报表数据，通过 duckduckgo 获取公司信息、行业信息、股份信息等，通过设计数据分析师智能体，包含三个动作代码生成和执行、收集…

Python 20 1 Updated Aug 1, 2025

liuliAI / AFAC2025-Challenge-Compression-of-Long-Thinking-Chains-in-the-Financial-Field-Gold-Medal-Solution

AFAC2025挑战组-赛题三：金融领域中的长思维链压缩-冠军（第一名）解决方案

Python 36 3 Updated Sep 3, 2025

SuperGPQA / SuperGPQA

Python 172 15 Updated Apr 30, 2025

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 29,141 3,058 Updated Nov 13, 2025

DXWEIE / AIDM_AFAC_Agent

Implementation of my agent used in 2025 AFAC TianChi competition

Jupyter Notebook 18 3 Updated Oct 6, 2025

jina-ai / node-DeepResearch

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 4,986 456 Updated Oct 6, 2025

zhoujx4 / python-node-deepresearch

deepResearch

Python 76 8 Updated Apr 23, 2025

pat-jj / DeepRetrieval

[COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning

Python 668 83 Updated Oct 12, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,636 2,398 Updated Sep 8, 2025

GAIR-NLP / DeepResearcher

Scaling Deep Research via Reinforcement Learning in Real-world Environments.

Python 653 43 Updated Oct 15, 2025

OpenPipe / ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!

Python 7,817 602 Updated Nov 12, 2025

OpenDCAI / DataFlow

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,466 101 Updated Nov 13, 2025

Icecream-blue-sky / Five-year-algorithm-interview-three-year-simulation

算法岗笔试面试大全，励志做算法届的《五年高考，三年模拟》！

646 32 Updated Mar 24, 2025

MANGA-UOFA / fdistill

Python 22 4 Updated Aug 8, 2023

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 18,059 2,253 Updated Nov 11, 2025

tianyi-lab / Reflection_Tuning

[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

Python 365 30 Updated Sep 6, 2024

km1994 / LLMs_interview_notes

该仓库主要记录大模型（LLMs）算法工程师相关的面试题

2,325 159 Updated Dec 26, 2024

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,206 69 Updated Mar 9, 2025

kuleshov / minillm

MiniLLM is a minimal system for running modern LLMs on consumer-grade GPUs

Python 934 58 Updated May 15, 2023

Alibaba-NLP / ZeroSearch

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Python 1,190 111 Updated Aug 16, 2025

datajuicer / data-juicer

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,517 289 Updated Nov 13, 2025

microsoft / promptbench

A unified evaluation framework for large language models

Python 2,742 218 Updated Oct 13, 2025

ntunlp / Critical-Review-of-LLM-Eval

Python 3 Updated Jan 6, 2025

Infrasys-AI / AIInfra

AIInfra（AI 基础设施）指AI系统从底层芯片等硬件，到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,068 702 Updated Nov 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly