YinHan-Zhang

YinHan-Zhang

Achievements

Stars

km1994 / AwesomeMultiModel

【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享大语言模型（LLMs），大模型高效微调（SFT）,检索增强生成（RAG），智能体（Agent），PPT自动生成, 角色扮演，文生图（Stable Diffusion），图像文字识别（OCR），语音识别（ASR），语音合成（TTS），人像分割（SA），多模态（VLM），Ai 换脸(Face Swapping), 文生视频(VD)，图生…

38 3 Updated Apr 26, 2025

NVlabs / rcm

rCM: SOTA Diffusion Distillation & Few-Step Video Generation

Python 244 12 Updated Oct 19, 2025

facebookresearch / co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,633 323 Updated Jan 21, 2025

hao-ai-lab / FastVideo

A unified inference and post-training framework for accelerated video generation.

Python 2,478 189 Updated Oct 27, 2025

tianweiy / DMD2

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,023 51 Updated Mar 5, 2025

ostris / ai-toolkit

The ultimate training toolkit for finetuning diffusion models

Python 6,645 792 Updated Oct 27, 2025

bytedance / UNO

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,319 77 Updated Sep 12, 2025

aigc-apps / VideoX-Fun

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,496 109 Updated Oct 27, 2025

TheDenk / wan2.1-dilated-controlnet

Controlnet module for Wan2.1

Python 24 1 Updated Aug 4, 2025

feizc / DiT-MoE

Scaling Diffusion Transformers with Mixture of Experts

Python 393 19 Updated Sep 9, 2024

FoundationVision / Infinity

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,477 79 Updated Oct 25, 2025

limuloo / MIGC

[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)

Python 609 29 Updated May 15, 2025

Haian-Jin / LVSM

[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"

Python 436 23 Updated Aug 4, 2025

muskie82 / MonoGS

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Python 1,854 179 Updated Aug 7, 2024

KwaiVGI / ReCamMaster

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,556 72 Updated Oct 23, 2025

hanyang-21 / VideoScene

[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Python 320 9 Updated Jul 4, 2025

datawhalechina / llm-cookbook

面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版

Jupyter Notebook 21,932 2,624 Updated Jun 12, 2025

YinHan-Zhang / MagicColor

The official implementation of "MagicColor: Multi-Instance Sketch Colorization"

Python 114 7 Updated Jun 30, 2025

KovenYu / WonderWorld

Code release for https://kovenyu.com/WonderWorld/

Python 661 33 Updated Apr 14, 2025

nv-tlabs / stmc

Implementation of "Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation" from CVPR Workshop on Human Motion Generation 2024.

Python 131 8 Updated Jun 18, 2024

milkymap / transformer-image-captioning

Implementation of the paper CPTR : FULL TRANSFORMER NETWORK FOR IMAGE CAPTIONING

Python 31 6 Updated Jun 1, 2022

zarzouram / image_captioning_with_transformers

Pytorch implementation of image captioning using transformer-based model.

Jupyter Notebook 68 9 Updated Apr 13, 2023

OpenMotionLab / MotionGPT

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Python 1,782 128 Updated Jul 1, 2025

EricGuo5513 / momask-codes

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Python 1,150 92 Updated Sep 13, 2024

HKUDS / LightRAG

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 22,152 3,317 Updated Oct 27, 2025

YaoFANGUK / video-subtitle-remover

基于AI的图片/视频硬字幕去除、文本水印去除，无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API，本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 8,275 1,031 Updated Jun 26, 2025

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 51,858 5,682 Updated Sep 10, 2025

Sanster / IOPaint

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 22,252 2,305 Updated Apr 29, 2025

antgroup / echomimic_v2

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 4,328 506 Updated Aug 11, 2025

kingnobro / AniClipart

(IJCV 2024) Code of "AniClipart: Clipart Animation with Text-to-Video Priors"

Python 45 7 Updated Feb 11, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

YinHan-Zhang

Achievements

Achievements

Block or report YinHan-Zhang

Stars

km1994 / AwesomeMultiModel

NVlabs / rcm

facebookresearch / co-tracker

hao-ai-lab / FastVideo

tianweiy / DMD2

ostris / ai-toolkit

bytedance / UNO

aigc-apps / VideoX-Fun

TheDenk / wan2.1-dilated-controlnet

feizc / DiT-MoE

FoundationVision / Infinity

limuloo / MIGC

Haian-Jin / LVSM

muskie82 / MonoGS

KwaiVGI / ReCamMaster

hanyang-21 / VideoScene

datawhalechina / llm-cookbook

YinHan-Zhang / MagicColor

KovenYu / WonderWorld

nv-tlabs / stmc

milkymap / transformer-image-captioning

zarzouram / image_captioning_with_transformers

OpenMotionLab / MotionGPT

EricGuo5513 / momask-codes

HKUDS / LightRAG

YaoFANGUK / video-subtitle-remover

RVC-Boss / GPT-SoVITS

Sanster / IOPaint

antgroup / echomimic_v2

kingnobro / AniClipart