Stars
The Ultimate Collection of 130+ Agentic Skills for Claude Code/Antigravity/Cursor. Battle-tested, high-performance skills for AI agents including official skills from Anthropic and Vercel.
VS Code extension for monitoring Google Antigravity AI quotas. Features Webview dashboard, QuickPick mode, and quota grouping.
只需轻松一点,即可安全高效的清理C盘残留和垃圾,并且对电脑毫无危险
DeepAudit:人人拥有的 AI 黑客战队,让漏洞挖掘触手可及。国内首个开源的代码漏洞挖掘多智能体系统。小白一键部署运行,自主协作审计 + 自动化沙箱 PoC 验证。支持 Ollama 私有部署 ,一键生成报告。支持中转站。让安全不再昂贵,让审计不再复杂。
World's First Full-Chinese Ray-Ban Meta AI Assistant - 全球首个全中文 Ray-Ban Meta 智能眼镜 AI 助手
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
一款功能强大的可视化网页自动化工具,通过拖拽模块的方式快速构建自动化工作流,无需编写代码即可实现网页数据采集、表单填写、自动化测试等任务。
OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.
Large Audio Language Model for Natural Voice Interactions - All-in-One Docker Image with 7 Processing Modes
Professional Antigravity Account Manager & Switcher. One-click seamless account switching for Antigravity Tools. Built with Tauri v2 + React (Rust).专业的 Antigravity 账号管理与切换工具。为 Antigravity 提供一键无缝账号切…
Multilingual Voice Understanding Model
Fun-ASR-Nano-2512官方发布的仓库内容有点多,部署起来坑也比较多,本项目提供一个简化的部署方案。
Virtual whiteboard for sketching hand-drawn like diagrams
X-Talk is an open-source full-duplex cascaded spoken dialogue system framework enabling low-latency, interruptible, and human-like speech interaction with a lightweight, pure-Python, production-rea…
Utilizes ONNX Runtime for audio denoising.
基于 FunASR SenseVoice 模型的实时语音识别服务,支持说话人识别、音频降噪、ASR 错误修正等高级功能。
Port of Funasr's Sense-voice model in C/C++
Utilizes ONNX Runtime to transcribe audio into text.
Pseudo Streaming SenseVoice with Hotwords
很多镜像都在国外。比如 gcr 。国内下载很慢,需要加速。致力于提供连接全世界的稳定可靠安全的容器镜像服务。
一个基于 Sherpa-ONNX 的高性能语音识别服务,支持实时VAD(语音活动检测)、多语言语音识别和声纹识别功能。
Fun-CosyVoice3-0.5B-2512 语音合成服务的简化部署方案,以及快速测试和部署提供应用调用
Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.
Android Automation Tool Based on Vision-Language Models