Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View RuyangFan's full-sized avatar

Block or report RuyangFan

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

MotionAgent is your AI assistent to convert ideas into motion pictures.

Python 307 40 Updated Sep 2, 2024

LLM inference in C/C++

C++ 91,799 14,184 Updated Dec 22, 2025

在本项目中模拟健康档案私有知识库构建和检索全流程,通过一份代码实现了同时支持多种大模型(如OpenAI、阿里通义千问等)的RAG(检索增强生成)功能:(1)离线步骤:文档加载->文档切分->向量化->灌入向量数据库;在线步骤:获取用户问题->用户问题向量化->检索向量数据库->将检索结果和用户问题填入prompt模版->用最终的prompt调用LLM->由LLM生成回复

Python 215 31 Updated Sep 6, 2024

A pipeline parallel training script for diffusion models.

Python 1,772 238 Updated Dec 21, 2025

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 5,855 686 Updated Jun 4, 2025

The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment

Python 1,016 118 Updated Dec 13, 2025

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,813 2,525 Updated Mar 13, 2025

remove watermark. 去除图片中的水印

Python 660 148 Updated Nov 1, 2017

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 22,502 2,366 Updated Apr 29, 2025

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 1,060 95 Updated Dec 8, 2025

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Python 1,189 152 Updated Jan 26, 2025

Universal markup converter

Haskell 40,928 3,717 Updated Dec 21, 2025

[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into offical pipelines of diffusers.)

Python 274 14 Updated Dec 17, 2025

基于Cosyvoice2-0.5B模型实现的多角色语音克隆项目,使用flet开发,支持多音色管理、历史记录管理、一键克隆,仅需短短几秒的人声音频即可快速生成。

Python 50 9 Updated May 20, 2025

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,341 77 Updated Sep 12, 2025

SoftVC VITS Singing Voice Conversion

Python 27,870 5,079 Updated Nov 11, 2023

Text Normalization & Inverse Text Normalization

Python 706 91 Updated Dec 1, 2025

a machine learning image inpainting task that instinctively removes watermarks from image indistinguishable from the ground truth image

Python 4,248 492 Updated Sep 15, 2025

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Python 4,797 787 Updated Mar 7, 2025

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …

Python 1,314 117 Updated Sep 30, 2025

🪞 Instant AI Face Swap 一键 AI 换脸,发现更美的你

TypeScript 2,751 211 Updated Jul 10, 2025

官方推荐的 ChatTTS 资源汇总项目,整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project

1,829 110 Updated Jul 3, 2024

Automatically remove the mosaics in images and videos, or add mosaics to them.

Python 2,504 465 Updated Aug 30, 2024

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,852 1,331 Updated Aug 14, 2024

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,804 1,109 Updated Nov 1, 2024

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 7,140 816 Updated Mar 5, 2025

ECCV2020 paper "Whole-Body Human Pose Estimation in the Wild"

Python 835 74 Updated Apr 22, 2025

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,856 293 Updated Jan 16, 2024

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 52,420 5,612 Updated Dec 19, 2025

LLM Frontend for Power Users.

JavaScript 21,154 4,440 Updated Dec 21, 2025
Next