Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View YinHan-Zhang's full-sized avatar

Block or report YinHan-Zhang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

【AIGC 实战入门笔记 —— AIGC 摩天大楼】分享 大语言模型(LLMs),大模型高效微调(SFT),检索增强生成(RAG),智能体(Agent),PPT自动生成, 角色扮演,文生图(Stable Diffusion) ,图像文字识别(OCR),语音识别(ASR),语音合成(TTS),人像分割(SA),多模态(VLM),Ai 换脸(Face Swapping), 文生视频(VD),图生…

38 3 Updated Apr 26, 2025

rCM: SOTA Diffusion Distillation & Few-Step Video Generation

Python 244 12 Updated Oct 19, 2025

CoTracker is a model for tracking any point (pixel) on a video.

Jupyter Notebook 4,633 323 Updated Jan 21, 2025

A unified inference and post-training framework for accelerated video generation.

Python 2,478 189 Updated Oct 27, 2025

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,023 51 Updated Mar 5, 2025

The ultimate training toolkit for finetuning diffusion models

Python 6,645 792 Updated Oct 27, 2025

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,319 77 Updated Sep 12, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,496 109 Updated Oct 27, 2025

Controlnet module for Wan2.1

Python 24 1 Updated Aug 4, 2025

Scaling Diffusion Transformers with Mixture of Experts

Python 393 19 Updated Sep 9, 2024

[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 1,477 79 Updated Oct 25, 2025

[CVPR 2024 Highlight] MIGC and [TPAMI 2024] MIGC++ (Official Implementation)

Python 609 29 Updated May 15, 2025

[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"

Python 436 23 Updated Aug 4, 2025

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Python 1,854 179 Updated Aug 7, 2024

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,556 72 Updated Oct 23, 2025

[CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Python 320 9 Updated Jul 4, 2025

面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版

Jupyter Notebook 21,932 2,624 Updated Jun 12, 2025

The official implementation of "MagicColor: Multi-Instance Sketch Colorization"

Python 114 7 Updated Jun 30, 2025

Code release for https://kovenyu.com/WonderWorld/

Python 661 33 Updated Apr 14, 2025

Implementation of "Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation" from CVPR Workshop on Human Motion Generation 2024.

Python 131 8 Updated Jun 18, 2024

Implementation of the paper CPTR : FULL TRANSFORMER NETWORK FOR IMAGE CAPTIONING

Python 31 6 Updated Jun 1, 2022

Pytorch implementation of image captioning using transformer-based model.

Jupyter Notebook 68 9 Updated Apr 13, 2023

[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs

Python 1,782 128 Updated Jul 1, 2025

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"

Python 1,150 92 Updated Sep 13, 2024

[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"

Python 22,152 3,317 Updated Oct 27, 2025

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 8,275 1,031 Updated Jun 26, 2025

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 51,858 5,682 Updated Sep 10, 2025

Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.

Python 22,252 2,305 Updated Apr 29, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 4,328 506 Updated Aug 11, 2025

(IJCV 2024) Code of "AniClipart: Clipart Animation with Text-to-Video Priors"

Python 45 7 Updated Feb 11, 2025
Next