Lists (6)
Sort Name ascending (A-Z)
Stars
Native and Compact Structured Latents for 3D Generation
一款将 PDF 到 Word 转换工具,a PDF to Word conversion tool
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Enjoy the magic of Diffusion models!
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Stand-In is a lightweight, plug-and-play framework for identity-preserving video generation.
Repo for SeedVR2 & SeedVR (CVPR2025 Highlight)
Unlimited-length talking video generation that supports image-to-video and video-to-video generation
OmniGen2: Exploration to Advanced Multimodal Generation. https://arxiv.org/abs/2506.18871
FrankenDriver. Frankenstein Driver. Drivers for video cards RTX 30XXm, RTX 40XXm from aliexpress. Driver for RTX 40XXm, RTX 30XXm, RTX 20XX from aliexpress. Driver for graphics cards with a laptop …
Wan: Open and Advanced Large-Scale Video Generative Models
[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization
Text-audio foundation model from Boson AI
From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
Free and Open Source Machine Translation API. Self-hosted, offline capable and easy to setup.
[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from sim…
An open-source AI agent that brings the power of Gemini directly into your terminal.
MAGI-1: Autoregressive Video Generation at Scale
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序,使用 Gin 和 Solidjs。
SkyReels-V2: Infinite-length Film Generative model
A lightweight LMM-based Document Parsing Model