Stars
Declarative Continuous Deployment for Kubernetes
Curated list of project-based tutorials
The open-source CapCut alternative
Primary Git Repository for the Zephyr Project. Zephyr is a new generation, scalable, optimized, secure RTOS for multiple hardware architectures.
An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
Text-audio foundation model from Boson AI
Translate PDF, EPub, webpage, metadata, annotations, notes to the target language. Supports 20+ translation services.
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Solve Visual Understanding with Reinforced VLMs
A fork to add multimodal model training to open-r1
Fully open reproduction of DeepSeek-R1
Train transformer language models with reinforcement learning.
Your AI Operator for Web, Android, Automation & Testing.
[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis
Stable Diffusion web UI
Tuning LLMs with no tears💦; Sample Design Engineering (SDE) for more efficient downstream-tuning.
Out-of-the-box (OOTB) GUI Agent for Windows and macOS
GUI Grounding for Professional High-Resolution Computer Use
Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
A lightweight, powerful framework for multi-agent workflows
[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
Enable AI to control your PC. This repo includes the WorldGUI Benchmark and GUI-Thinker Agent Framework.