Stars
Judging backend server for the DMOJ online judge.
(EMNLP 2025 Findings) Source Evaluation scripts for Humanity's Last Code Exam
🤖📐专为数学建模设计的 Agent ,自动完成数学建模,生成一份完整的可以直接提交的论文。 An Agent Designed for Mathematical Modeling ,Automatically complete mathmodel and generate a complete paper ready for submission.
Web-Bench is a benchmark designed to evaluate the performance of LLMs in actual Web development.
Repo-level benchmark for real-world Code Agents: from repo understanding → env setup → incremental dev/bug-fixing → task delivery, with cost-aware α metric.
An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to inference, as well as some practical experiences and conclusions.…
Source codes for paper "MACRec: A Multi-Agent Collaboration Framework for Recommendation" at SIGIR 2024
Chinese version of GPT2 training code, using BERT tokenizer.
A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.
A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
This is the NLP practice for beginners in the year of 2023.
Repo for BenTsao [original name: HuaTuo (华驼)], Instruction-tuning Large Language Models with Chinese Medical Knowledge. 本草(原名:华驼)模型仓库,基于中文医学知识的大语言模型指令微调