- Shanghai, China
- [email protected]
- https://civitai.com/user/Y_Man
-
ai-game-devtools Public
Here we will keep track of the latest AI Game Development Tools, including LLM, World Model, Agent, Code, Image, Texture, Shader, 3D Model, Animation, Video, Audio, Music, Singing Voice and Analyti…
-
ai-agent-toolkit Public
Explore the latest AI Agent Toolkit!
-
-
AI-Native-Game Public
Here we will track the latest AI-Native Game! 🎮
-
harmony Public
Forked from openai/harmonyRenderer for the harmony response format to be used with gpt-oss
Rust Apache License 2.0 UpdatedAug 7, 2025 -
ComfyUI-Manager Public
Forked from Comfy-Org/ComfyUI-Manager -
ComfyUI-Qwen-Image Public
ComfyUI-Qwen-Image is now available in ComfyUI, Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
-
Qwen-Image Public
Forked from QwenLM/Qwen-ImageQwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
-
ComfyUI-SkyworkUniPic Public
ComfyUI-SkyworkUniPic is now available in ComfyUI, Skywork-UniPic is a unified autoregressive multimodal model with 1.5 billion parameters that natively integrates image understanding, text-to-imag…
-
UniPic Public
Forked from SkyworkAI/UniPicUnified Autoregressive Modeling for Visual Understanding and Generation
Python MIT License UpdatedJul 30, 2025 -
ComfyUI-HiggsAudio Public
ComfyUI-HiggsAudio is now available in ComfyUI, Higgs Audio v2 is a text-audio foundation model from Boson AI.
-
ComfyUI-ThinkSound Public
ComfyUI-ThinkSound is now available in ComfyUI, ThinkSound is a unified Any2Audio generation framework with flow matching guided by Chain-of-Thought (CoT) reasoning.
-
audio-development-tools Public
Audio Development Tools (ADT) is a project for advancing sound, speech, and music technologies, featuring components for machine learning, sound synthesis, speech and music generation, signal proce…
-
ThinkSound Public
Forked from FunAudioLLM/ThinkSoundPyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
Python UpdatedJul 10, 2025 -
context-engineering Public
Context Engineering - The art of providing all the context for the task to be plausibly solvable by the LLM.
-
ai-audio-datasets Public
AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…
-
ComfyUI-Ovis-U1 Public
ComfyUI-Ovis-U1 is now available in ComfyUI, Ovis-U1 is a 3-billion-parameter unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a…
-
-
Ovis-U1 Public
Forked from AIDC-AI/Ovis-U1An unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.
-
ComfyUI-PosterCraft Public
ComfyUI-PosterCraft is now available in ComfyUI, PosterCraft is a unified framework for high-quality aesthetic poster generation that excels in precise text rendering, seamless integration of abstr…
-
ComfyUI-OmniGen2 Public
ComfyUI-OmniGen2 is now available in ComfyUI, OmniGen2 is a powerful and efficient unified multimodal model. Its architecture is composed of two key components: a 3B Vision-Language Model (VLM) and…
-
OmniGen2 Public
Forked from AlonzoLeeeooo/OmniGen2OmniGen2: Unified Image Understanding and Generation.
Jupyter Notebook Apache License 2.0 UpdatedJun 24, 2025 -
magenta-realtime Public
Forked from magenta/magenta-realtimePython Apache License 2.0 UpdatedJun 23, 2025 -
PosterCraft Public
Forked from Ephemeral182/PosterCraftRethinking High-Quality Aesthetic Poster Generation in a Unified Framework
Python Other UpdatedJun 19, 2025 -
vLLM-PyTorch Public
PyTorch implementation of vLLM.
-
ComfyUI-Hunyuan3D-2.1 Public
ComfyUI-Hunyuan3D-2.1 is now available in ComfyUI, Hunyuan3D-2.1 is a scalable 3D asset creation system that advances state-of-the-art 3D generation through two pivotal innovations: Fully Open-Sour…
-
Hunyuan3D-2.1 Public
Forked from Tencent-Hunyuan/Hunyuan3D-2.1From Images to High-Fidelity 3D Assets with Production-Ready PBR Material
Python Other UpdatedJun 15, 2025 -
-
ComfyUI-Vui Public
ComfyUI-Vui is now available in ComfyUI, Vui is a llama based transformer that predicts audio tokens.
-
ComfyUI-Direct3D-S2 Public
ComfyUI-Direct3D‑S2 is now available in ComfyUI, Direct3D‑S2 - Gigascale 3D Generation Made Easy with Spatial Sparse Attention. Direct3D‑S2 is a scalable 3D generation framework based on sparse vol…