Starred repositories
AgentScope: Agent-Oriented Programming for Building LLM Applications
Enterprise-grade, commercial-friendly agentic workflow platform for building next-generation SuperAgents.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Build resilient language agents as graphs.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
How can we build a true AI agent? Like Claude Code.
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
基于深度学习的肿瘤辅助诊断系统,以图像分割为核心,利用人工智能完成肿瘤区域的识别勾画并提供肿瘤区域的特征来辅助医生进行诊断。有完整的模型构建、后端架设、工业级部署和前端访问功能。TensorRT、PyTorch 、OpenCV 、Flask、Vue
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Robust Speech Recognition via Large-Scale Weak Supervision
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
No fortress, purely open ground. OpenManus is Coming.
Driving all platforms UI automation with vision-based model
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
usefulness / webp-imageio
Forked from gotson/webp-imageioJava ImageIO WebP support (includes ARM chips support)
Flexible and powerful framework for managing multiple AI agents and handling complex conversations
TwelveMonkeys ImageIO: Additional plug-ins and extensions for Java's ImageIO
🤗 smolagents: a barebones library for agents that think in code.
A LLM-based Agent that predict its tasks proactively.