Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View richard28039's full-sized avatar
  • National Chung Cheng University - CS

Block or report richard28039

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Declarative Continuous Deployment for Kubernetes

Go 21,056 6,494 Updated Oct 27, 2025

Curated list of project-based tutorials

247,970 32,419 Updated Aug 15, 2024

The open-source CapCut alternative

TypeScript 42,826 4,041 Updated Oct 24, 2025

Primary Git Repository for the Zephyr Project. Zephyr is a new generation, scalable, optimized, secure RTOS for multiple hardware architectures.

C 13,531 8,125 Updated Oct 27, 2025

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Python 14,287 1,834 Updated Jul 3, 2024

A hyperparameter optimization framework

Python 12,929 1,178 Updated Oct 27, 2025

BoT-SORT: Robust Associations Multi-Pedestrian Tracking

Jupyter Notebook 164 239 Updated May 12, 2024

Text-audio foundation model from Boson AI

Python 7,513 551 Updated Sep 15, 2025

For road++@ECCV2024 track1

Python 1 Updated Sep 6, 2024

Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Support 20+ translate services.

TypeScript 9,664 430 Updated Oct 6, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,398 289 Updated Oct 4, 2025

Solve Visual Understanding with Reinforced VLMs

Python 5,650 362 Updated Oct 21, 2025

A fork to add multimodal model training to open-r1

Python 1,412 70 Updated Feb 8, 2025

Fully open reproduction of DeepSeek-R1

Python 25,581 2,396 Updated Sep 8, 2025

Train transformer language models with reinforcement learning.

Python 16,027 2,254 Updated Oct 27, 2025

Your AI Operator for Web, Android, Automation & Testing.

TypeScript 10,530 715 Updated Oct 27, 2025

[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 166 12 Updated Oct 8, 2025
Python 114 8 Updated Apr 8, 2025

Stable Diffusion web UI

Python 157,584 29,248 Updated Oct 7, 2025

Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.

HTML 1,016 101 Updated Apr 27, 2024

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,810 189 Updated May 21, 2025

GUI Grounding for Professional High-Resolution Computer Use

Python 274 30 Updated Oct 27, 2025

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,236 98 Updated Jul 22, 2025

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 777 80 Updated Apr 30, 2025

Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)

Python 252 19 Updated Jul 16, 2024

🙌 OpenHands: Code Less, Make More

Python 64,480 7,829 Updated Oct 27, 2025

A lightweight, powerful framework for multi-agent workflows

Python 16,864 2,776 Updated Oct 27, 2025
Python 19 1 Updated May 23, 2025

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Python 791 102 Updated Feb 3, 2025

Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.

Python 99 8 Updated Jul 27, 2025
Next