Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View lavinal712's full-sized avatar
🎾
🎾

Block or report lavinal712

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Unified Controllable Visual Generation Model

Python 650 34 Updated Jan 27, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 7,670 623 Updated Oct 27, 2025

Official Repo of From Masks to Worlds: A Hitchhiker’s Guide to World Models.

37 Updated Oct 26, 2025

Contexts Optical Compression

Python 18,381 1,212 Updated Oct 25, 2025

[CVPR 2024 Highlight] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics

Python 1,274 55 Updated Apr 7, 2025

Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Python 430 29 Updated Oct 17, 2025

TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in flow-based generation.

Python 802 45 Updated Oct 28, 2025

Pytorch DTensor native training library for LLMs/VLMs with OOTB Hugging Face support

Python 141 17 Updated Oct 29, 2025

A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its variants as the primary backbone with support for ImageNet train…

Python 109 6 Updated Oct 16, 2025

The best ChatGPT that $100 can buy.

Python 34,073 3,796 Updated Oct 28, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,401 34 Updated Oct 15, 2025
Python 1,059 92 Updated Oct 22, 2025

Open-source framework for the research and development of foundation models.

HTML 539 54 Updated Oct 29, 2025

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,329 98 Updated Oct 14, 2025

Lynx: Towards High-Fidelity Personalized Video Generation

Python 278 35 Updated Sep 26, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 861 56 Updated Oct 24, 2025

存放个人制作的Galgame AI翻译补丁

Python 48 5 Updated Oct 28, 2025

qq群相册下载

JavaScript 68 11 Updated Aug 16, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,769 155 Updated Oct 9, 2025

Fully Open Framework for Democratized Multimodal Training

Python 589 40 Updated Oct 21, 2025

This is a 3DGS(3D Gaussian Splatting) viewer built on Three.js, with features for marking, measurements, text watermarks, etc.

TypeScript 402 41 Updated Oct 28, 2025

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 16,496 1,250 Updated Oct 28, 2025

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,147 36 Updated Oct 26, 2025

Official repository for the UAE paper, unified-GRPO, and unified-Bench

Python 147 6 Updated Sep 12, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 14,298 1,586 Updated Oct 10, 2025

HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation​

Python 652 49 Updated Oct 14, 2025

Transition Models

Python 131 7 Updated Oct 7, 2025

TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models

Python 1,419 147 Updated Apr 18, 2025
Next