Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View qinghew's full-sized avatar
:octocat:
:octocat:

Block or report qinghew

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).

Python 411 11 Updated Aug 26, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,492 235 Updated Nov 12, 2025

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,474 468 Updated May 21, 2025

NextFlow🚀: Unified Sequential Modeling Activates Multimodal Understanding and Generation

265 14 Updated Jan 9, 2026

UniVideo: Unified Understanding, Generation, and Editing for Videos

Python 322 15 Updated Jan 8, 2026

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 2,605 300 Updated Jan 15, 2026

Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"

Python 32 1 Updated Jan 6, 2026

DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer

Python 448 67 Updated Jan 13, 2026

Scalable and memory-optimized training of diffusion models

Python 1,322 142 Updated Jun 4, 2025

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 626 77 Updated Jan 15, 2026

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 3,216 216 Updated Jan 16, 2026

Official code for StoryMem: Multi-shot Long Video Storytelling with Memory

Python 615 59 Updated Dec 26, 2025
Python 304 33 Updated Jan 16, 2026

Official repo for UAE

Python 147 4 Updated Dec 29, 2025

Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…

Python 1,230 104 Updated Dec 23, 2025

Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"

150 2 Updated Oct 10, 2025

HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 968 74 Updated Jan 13, 2026

Orient Anything V2, NeurIPS 2025 Spotlight

Python 163 7 Updated Dec 16, 2025

Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]

Python 194 29 Updated Sep 21, 2022
Jupyter Notebook 18 Updated Jan 14, 2026

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Python 170 5 Updated Dec 29, 2025

基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks from videos or Pictures.

Python 9,244 1,156 Updated Dec 3, 2025

Code release for https://wonderzoom.github.io/

92 1 Updated Dec 11, 2025

The official repository of "Astra : General Interactive World Model with Autoregressive Denoising"

Python 187 4 Updated Jan 14, 2026

Mixture-of-Groups Attention for End-to-End Long Video Generation

89 Updated Oct 22, 2025

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 1,856 337 Updated Dec 15, 2025

LongLive: Real-time Interactive Long Video Generation

Python 970 72 Updated Jan 11, 2026

Official implementation of "MV-TAP: Tracking Any Point in Multi-View Videos"

Python 36 Updated Dec 7, 2025

视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.

Python 8,311 858 Updated Aug 21, 2025

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

Python 102 4 Updated Dec 16, 2025
Next