Jupyter Notebook 1 1 Updated Oct 6, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 14,863 2,368 Updated Oct 29, 2025

Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks

Python 248 11 Updated May 5, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,607 435 Updated Oct 29, 2025

GitHub profile

20 Updated Aug 26, 2024

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 294 32 Updated Oct 23, 2025
Python 2 Updated Nov 15, 2024

AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling

Python 29 4 Updated Jul 26, 2025

[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 166 12 Updated Oct 8, 2025

Beyond the Binary: Capturing Diverse Preferences With Reward Regularization

Python 5 Updated Apr 16, 2025

Collection of advice for prospective and current PhD students

1,894 140 Updated Jul 10, 2024

[ICLR 2024] Source codes for the paper "Building Cooperative Embodied Agents Modularly with Large Language Models"

Python 279 45 Updated Mar 30, 2025

AWM: Agent Workflow Memory

Python 335 30 Updated Jan 31, 2025

[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first LLM-based web agent and benchmark for generalist web agents

Jupyter Notebook 889 117 Updated Apr 3, 2025

List of language agents based on paper "Cognitive Architectures for Language Agents"

TeX 1,051 69 Updated Jan 16, 2025

Social-AI papers across computing communities, courses, and dissertations.

22 1 Updated Jun 10, 2025

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,640 573 Updated Jan 16, 2025

Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents

Python 475 28 Updated Jan 15, 2025

MuMA-ToM: Multi-modal Multi-Agent Theory of Mind

Python 32 2 Updated Jan 23, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 27,559 2,498 Updated Sep 30, 2025

AllenAI's post-training codebase

Python 3,274 453 Updated Oct 29, 2025

My research experience

7,827 448 Updated Aug 12, 2025

[ACL 2025] A Neural-Symbolic Self-Training Framework

C 116 4 Updated Jun 1, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 4,000 802 Updated Sep 4, 2025
JavaScript 3,654 1,553 Updated Jun 21, 2024