Stars
3D Reconstruction of Indoor Scenes with a Generative Framework
Code Release of MVInverse: Feedforward Multi-view Inverse Rendering in Seconds
Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentanglement.
Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing
Official code for StoryMem: Multi-shot Long Video Storytelling with Memory
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while achieving up to 6$\times$ acceleration in inference speed.
A Foundation Model for Generalist Gaming Agents
HexStrike AI MCP Agents is an advanced MCP server that lets AI agents (Claude, GPT, Copilot, etc.) autonomously run 150+ cybersecurity tools for automated pentesting, vulnerability discovery, bug b…
A Model Context Protocol (MCP) server implementation that provides database capabilities for Chroma
PersonaLive! : Expressive Portrait Image Animation for Live Streaming
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties
Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".
Offical Implementation of SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using autoregressive diffusion.
HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency
Native and Compact Structured Latents for 3D Generation
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
interview-coder-withoupaywall-opensource
Light-X: Generative 4D Video Rendering with Camera and Illumination Control