EvoCUA: Evolving Computer Use Agent

Python 86 3 Updated Jan 14, 2026

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,477 372 Updated Jan 9, 2026

R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Python 19 1 Updated Oct 21, 2025

Fully Open Framework for Democratized Multimodal Training

Python 690 56 Updated Dec 27, 2025

R-HORIZON: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Python 18 1 Updated Oct 21, 2025

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 2,067 171 Updated Dec 16, 2025

ScaleCUA is an open-source computer use agent that can operate across platforms (Windows, macOS, Ubuntu, Android).

Python 1,054 73 Updated Jan 7, 2026
Python 44 4 Updated Mar 19, 2024

This is the official code base of AgentNetTool in OpenCUA. Website: https://opencua.xlang.ai/

TypeScript 36 9 Updated Sep 3, 2025

OpenCUA: Open Foundations for Computer-Use Agents

Python 636 78 Updated Jan 9, 2026

[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 175 12 Updated Oct 8, 2025

Community maintained hardware plugin for vLLM on Ascend

Python 1,572 742 Updated Jan 17, 2026

A curated collection of resources, tools, and frameworks for developing GUI Agents.

283 11 Updated Jan 13, 2026

UI-Venus is a native UI agent designed to perform precise GUI element grounding and effective navigation using only screenshots as input.

Python 606 35 Updated Dec 30, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,806 377 Updated Oct 21, 2025

[AAAI 2026] GUI-G²: Gaussian Reward Modeling for GUI Grounding

Python 296 8 Updated Nov 9, 2025

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

Python 106 9 Updated Jul 27, 2025

Think Beyond Images

Python 556 35 Updated Sep 23, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,596 2,020 Updated Jan 13, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,136 247 Updated Dec 15, 2025
Python 16 Updated Jul 15, 2025

Unleashing the Power of Reinforcement Learning for Math and Code Reasoners

Python 738 44 Updated Jun 6, 2025

Scaling RL on advanced reasoning models

Python 658 40 Updated Oct 20, 2025

RM-R1: Unleashing the Reasoning Potential of Reward Models

Python 155 15 Updated Jun 26, 2025
Python 1,073 65 Updated Nov 20, 2025

Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent

Python 408 29 Updated Apr 22, 2025