SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

C# 3,282 313 Updated Oct 26, 2025

lamm-mit / PDF2Audio

Jupyter Notebook 1,340 173 Updated Apr 18, 2025

hao-ai-lab / FastVideo

A unified inference and post-training framework for accelerated video generation.

Python 2,482 190 Updated Oct 28, 2025

Tencent-Hunyuan / HunyuanWorld-1.0

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,345 194 Updated Oct 22, 2025

EvoAgentX / EvoAgentX

🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents

Python 2,103 149 Updated Oct 22, 2025

boson-ai / higgs-audio

Text-audio foundation model from Boson AI

Python 7,517 552 Updated Sep 15, 2025

OmniSVG / OmniSVG

[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…

Python 2,172 70 Updated Sep 18, 2025

1038lab / ComfyUI-RMBG

A ComfyUI custom node designed for advanced image background removal and object, face, clothes, and fashion segmentation, utilizing multiple models including RMBG-2.0, INSPYRENET, BEN, BEN2, BiRefN…

Python 1,439 66 Updated Oct 5, 2025

Yaofang-Liu / Pusa-VidGen

Pusa: Thousands Timesteps Video Diffusion Model

Python 659 47 Updated Sep 5, 2025

kijai / ComfyUI-WanVideoWrapper

Python 5,041 430 Updated Oct 28, 2025

OpenMOSS / MOSS-TTSD

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 995 87 Updated Sep 28, 2025

somanchiu / ReSwapper

ReSwapper aims to reproduce the implementation of inswapper. This repository provides code for training, inference, and includes pretrained weights.

Python 209 26 Updated Jun 14, 2025

bghira / SimpleTuner

A general fine-tuning kit geared toward diffusion models.

Python 2,581 250 Updated Oct 28, 2025

kyutai-labs / delayed-streams-modeling

Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.

Python 2,533 258 Updated Sep 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Harry C hoveychen

Achievements

Achievements

Highlights

Organizations

Block or report hoveychen

Stars

bytedance / FaceCLIP

karpathy / nanochat

kijai / ComfyUI-WanAnimatePreprocess

tauri-apps / tauri

Breakthrough / PySceneDetect

yfeng95 / DECA

tencent-ailab / SongBloom

ZHO-ZHO-ZHO / Nano-Bananary

fingerprintjs / fingerprintjs

MeiGen-AI / InfiniteTalk

TencentARC / ToonComposer

AIDC-AI / Ovis

facebookresearch / dinov3

Fantasy-AMAP / fantasy-portrait

shadcn-ui / ui

resemble-ai / chatterbox

mcmonkeyprojects / SwarmUI