Starred repositories
WebEyeTrack: Real-time Eye-Tracking in the Browser
Chrome DevTools for coding agents
Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, c…
Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent. #1 on OpenRouter. 750k+ Kilo Coders. 6.1 trillion tokens/month.
📖 MCP server for fetch deepwiki.com and get latest knowledge in Cursor and other Code Editors
Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme
A text to sql demo application using nextjs and mastra
Open-source infrastructure for Computer-Use Agents. Sandboxes, SDKs, and benchmarks to train and evaluate AI agents that can control full desktops (macOS, Linux, Windows).
WebGazer.js: Scalable Webcam EyeTracking Using User Interactions
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
Utilities intended for use with Llama models.
Roo Code gives you a whole dev team of AI agents in your code editor.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Model Context Protocol Servers
YomiTokuはAIを活用した日本語文書解析エンジンを提供するPythonパッケージです。 Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
A powerful framework for building realtime voice AI agents 🤖🎙️📹
This repository outlines the procedures and general information for the Speech Translator project.
This tool is used to install `pyenv` and friends.
Tesseract Open Source OCR Engine (main repository)
real time face swap and one-click video deepfake with only a single image
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data