RuyangFan

RuyangFan RuyangFan

1 follower · 1 following

Stars

modelscope / motionagent

MotionAgent is your AI assistent to convert ideas into motion pictures.

Python 307 40 Updated Sep 2, 2024

ggml-org / llama.cpp

LLM inference in C/C++

C++ 91,799 14,184 Updated Dec 22, 2025

NanGePlus / RagLangChainTest

在本项目中模拟健康档案私有知识库构建和检索全流程，通过一份代码实现了同时支持多种大模型（如OpenAI、阿里通义千问等）的RAG（检索增强生成）功能:(1)离线步骤:文档加载->文档切分->向量化->灌入向量数据库；在线步骤:获取用户问题->用户问题向量化->检索向量数据库->将检索结果和用户问题填入prompt模版->用最终的prompt调用LLM->由LLM生成回复

Python 215 31 Updated Sep 6, 2024

tdrussell / diffusion-pipe

A pipeline parallel training script for diffusion models.

Python 1,772 238 Updated Dec 21, 2025

multimodal-art-projection / YuE

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 5,855 686 Updated Jun 4, 2025

tencent-ailab / SongGeneration

The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment

Python 1,016 118 Updated Dec 13, 2025

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 22,813 2,525 Updated Mar 13, 2025

SixQuant / nowatermark

remove watermark. 去除图片中的水印

Python 660 148 Updated Nov 1, 2017

Sanster / IOPaint

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 22,502 2,366 Updated Apr 29, 2025

OpenMOSS / MOSS-TTSD

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 1,060 95 Updated Dec 8, 2025

stylegan-human / StyleGAN-Human

StyleGAN-Human: A Data-Centric Odyssey of Human Generation

Python 1,189 152 Updated Jan 26, 2025

jgm / pandoc

Universal markup converter

Haskell 40,928 3,717 Updated Dec 21, 2025

lzyhha / VisualCloze

[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into offical pipelines of diffusers.)

Python 274 14 Updated Dec 17, 2025

HG-ha / Parrot

基于Cosyvoice2-0.5B模型实现的多角色语音克隆项目，使用flet开发，支持多音色管理、历史记录管理、一键克隆，仅需短短几秒的人声音频即可快速生成。

Python 50 9 Updated May 20, 2025

bytedance / UNO

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,341 77 Updated Sep 12, 2025

svc-develop-team / so-vits-svc

SoftVC VITS Singing Voice Conversion

Python 27,870 5,079 Updated Nov 11, 2023

wenet-e2e / WeTextProcessing

Text Normalization & Inverse Text Normalization

Python 706 91 Updated Dec 1, 2025

zuruoke / watermark-removal

a machine learning image inpainting task that instinctively removes watermarks from image indistinguishable from the ground truth image

Python 4,248 492 Updated Sep 15, 2025

yisol / IDM-VTON

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Python 4,797 787 Updated Mar 7, 2025

muzishen / IMAGDressing

[AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation with flexible garment, pose, and scene control, ensuring high …

Python 1,314 117 Updated Sep 30, 2025

idootop / MagicMirror

🪞 Instant AI Face Swap 一键 AI 换脸，发现更美的你

TypeScript 2,751 211 Updated Jul 10, 2025

libukai / Awesome-ChatTTS

官方推荐的 ChatTTS 资源汇总项目，整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project

1,829 110 Updated Jul 3, 2024

HypoX64 / DeepMosaics

Automatically remove the mosaics in images and videos, or add mosaics to them.

Python 2,504 465 Updated Aug 30, 2024

open-mmlab / mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Python 4,852 1,331 Updated Aug 14, 2024

open-mmlab / mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,804 1,109 Updated Nov 1, 2024

Zyphra / Zonos

Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …

Python 7,140 816 Updated Mar 5, 2025

jin-s13 / COCO-WholeBody

ECCV2020 paper "Whole-Body Human Pose Estimation in the Wild"

Python 835 74 Updated Apr 22, 2025

deepseek-ai / DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,856 293 Updated Jan 16, 2024

Mintplex-Labs / anything-llm

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 52,420 5,612 Updated Dec 19, 2025

SillyTavern / SillyTavern

LLM Frontend for Power Users.

JavaScript 21,154 4,440 Updated Dec 21, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly