Stars
This repository contains the code for "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For an HD commercial model, please try out Sync Labs.
Command-line program to download videos from YouTube.com and other video sites
[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
[ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation
One minute of voice data can also be used to train a good TTS model! (few-shot voice cloning)
Real-time voice-interactive digital human, supporting both an end-to-end voice solution (GLM-4-Voice - THG) and a cascaded solution (ASR-LLM-TTS-THG). Appearance and voice are customizable with no training required, voice cloning is supported, and first-packet latency is as low as 3s.
[NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication
Real-time interactive streaming digital human
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
[AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
🚀 The best real-time interactive AI avatar (digital human) with on-premise deployment and <1.5 s latency.
🚀 Truly open-source AI avatar (digital human) toolkit for offline video generation and digital human cloning.
Open-source vector similarity search for Postgres
The Postgres development platform. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Next-generation ORM for Node.js & TypeScript | PostgreSQL, MySQL, MariaDB, SQL Server, SQLite, MongoDB and CockroachDB
🪐 Markdown with superpowers — from ideas to papers, presentations and books.
Convert PDF to markdown + JSON quickly with high accuracy
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Open-Source AI Presentation Generator and API (Gamma, Beautiful AI, Decktopus Alternative)
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
OCR, layout analysis, reading order, table recognition in 90+ languages
A lightweight LMM-based Document Parsing Model