Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View DEM1TASSE's full-sized avatar
Focusing
Focusing

Highlights

  • Pro

Block or report DEM1TASSE

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 76,698 11,297 Updated Oct 22, 2025

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 434 91 Updated Oct 27, 2025

🌎💪 BrowserGym, a Gym environment for web task automation

Python 942 132 Updated Oct 27, 2025

SkillWeaver is a framework to enable web agent self-improvement through environment exploration and skill synthesis.

Python 98 8 Updated Apr 14, 2025

Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"

Python 31 5 Updated Apr 24, 2025
Python 2 Updated Oct 21, 2025

2026 AI/ML internship & new graduate job list updated daily

3,797 158 Updated Oct 27, 2025

AI-powered desktop companion to boost your efficiency

Python 2 1 Updated Jul 27, 2025

The absolute trainer to light up AI agents.

Python 2,947 220 Updated Oct 28, 2025

A repo for open research on building large reasoning models

Python 108 14 Updated Oct 27, 2025

[AI4MATH@ICML2025] Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs

Python 40 1 Updated May 20, 2025

[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs

Python 186 11 Updated Jun 28, 2025

Chrome extension for clipping arXiv articles to Notion.

JavaScript 130 18 Updated Oct 2, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 60,886 7,359 Updated Oct 27, 2025

Official Repository of "Learning to Reason under Off-Policy Guidance"

Python 355 41 Updated Oct 4, 2025

[NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling

Python 591 51 Updated Jun 16, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,317 1,518 Updated Apr 24, 2025

🔥 How to efficiently and effectively compress the CoTs or directly generate concise CoTs during inference while maintaining the reasoning performance is an important topic!

63 4 Updated May 22, 2025

Tongji Univ. Undergraduate Graduation Project 2021. | 🎉含: 同济er毕设答辩PPT模板

Python 386 11 Updated Jun 8, 2023

ICLR 2025 Agent-Related Papers

73 1 Updated Nov 14, 2024

🏡 GitHub Pages template for personal academic homepage

HTML 435 250 Updated Oct 23, 2025

Building a comprehensive and handy list of papers for GUI agents

Python 534 29 Updated Oct 27, 2025

Overseas Summer Research Guidance 海外暑研申请指南

317 5 Updated Aug 28, 2025
Python 18 3 Updated Nov 1, 2024

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python 283 12 Updated Jul 18, 2025

(已支持sqlsugar).NetCore、.Net6、Vue2、Vue3、Vite、TypeScript、Element plus+uniapp前后端分离,全自动生成代码;支持移动端(ios/android/h5/微信小程序。http://www.volcore.xyz/

C# 4,147 1,345 Updated Oct 24, 2025

Project 2 for CS161@UC Berkeley, Spring 2023

Go 2 Updated Jul 30, 2023

The model, data and code for the visual GUI Agent SeeClick

HTML 433 23 Updated Jul 13, 2025

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,270 315 Updated Oct 23, 2025

Fine-tune LLM agents with online reinforcement learning

Python 1,242 60 Updated Mar 19, 2024
Next