Starred repositories
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
Demo of a UI testing agent using the OpenAI CUA model and the Responses API.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…
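微舆: a multi-agent public-opinion analysis assistant that anyone can use. It breaks through information cocoons, restores the true picture of public opinion, predicts future trends, and supports decision-making. Implemented from scratch, with no dependency on any framework.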
Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills with automatic conflict detection
An Illusion of Progress? Assessing the Current State of Web Agents
An open-source AI agent that brings the power of Gemini directly into your terminal.
Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai
AIPex: an agentic assistant in your browser that automates browsing using natural language. ChatGPT Atlas alternative, no migration needed
AgentCPM-GUI: An on-device GUI agent for operating Android apps, enhancing reasoning ability with reinforcement fine-tuning for efficient task execution.
A browser extension that generates Cypress, Playwright and Puppeteer test scripts from your interactions 🖱 ⌨
Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.
⚡ A CLI tool for structural code search, linting, and rewriting. Written in Rust
Generate a timeline of your day, automatically
An Application Framework for AI Engineering
💫 Toolkit to help you get started with Spec-Driven Development
Chrome DevTools for coding agents
A powerful TypeScript code indexing and search tool with Language Server Protocol support and MCP integration.
🔥 A list of tools, frameworks, and resources for building AI web agents
Open-Source Chrome extension for AI-powered web automation. Run multi-agent workflows using your own LLM API key. Alternative to OpenAI Operator.
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
The AI Browser Automation Framework
A lightweight, powerful framework for multi-agent workflows and voice agents
AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.
Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, c…
This is a submodule; see https://github.com/dtyq/magic for more.