Starred repositories
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
rCM: SOTA Diffusion Distillation & Few-Step Video Generation based on sCM/MeanFlow
UniFace: A Comprehensive Library for Face Detection, Recognition, Landmark Analysis, Face Parsing, Gaze Estimation, Age, and Gender Detection
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
✨ Agentic IM ChatBot Infrastructure — 聊天智能体基础设施 ✨ 多消息平台集成(QQ / Telegram / 企微 / 飞书 / 钉钉等),强大易用的插件系统,支持 OpenAI / Gemini / Anthropic / Dify / Coze / 阿里云百炼 / 知识库 / Agent 智能体
将视频瞬间转化为手绘故事 Turn Video Moments into Hand-Drawn Stories
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
Lets make video diffusion practical!
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
Enjoy the magic of Diffusion models!
本项目为量化开源课程,可以帮助人们快速掌握量化金融知识以及使用Python进行量化开发的能力。
Wan: Open and Advanced Large-Scale Video Generative Models
Tongyi Deep Research, the Leading Open-source Deep Research Agent
🥢像老乡鸡🐔那样做饭。主要部分于2024年完工,非老乡鸡官方仓库。文字来自《老乡鸡菜品溯源报告》,并做归纳、编辑与整理。CookLikeHOC.
An efficient video loader for deep learning with smart shuffling that's super easy to digest
📚 《从零开始构建智能体》——从零开始的智能体原理与实践教程
Official code of CVPR '23 paper "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
List of recent advances for human avatars, including generation, reconstruction, and editing, etc.
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"
This repository contains a PyTorch implementation of "AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis".
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer