Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View brainhome's full-sized avatar

Block or report brainhome

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

TTS model capable of streaming conversational audio in realtime.

Python 920 77 Updated Nov 29, 2025

Open-Source Dual-Arm Mobile Robot with Motorized Lift

Python 644 76 Updated Dec 18, 2025

PyTorch native quantization and sparsity for training and inference

Python 2,582 387 Updated Dec 21, 2025

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python 1,476 126 Updated Aug 5, 2025

Instant Skinned Gaussian Avatars for Web, Mobile and VR Applications

JavaScript 380 26 Updated Nov 25, 2025

The best ChatGPT that $100 can buy.

Python 38,959 4,927 Updated Dec 9, 2025

Software for amblyopia treatment done for Meta Quest 3

C# 16 1 Updated Dec 16, 2025

Help shape the future of Project G-Assist

Python 206 35 Updated Dec 18, 2025

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 34,627 2,341 Updated Dec 20, 2025
Python 24 5 Updated Apr 30, 2025

Go ahead and axolotl questions

Python 10,973 1,223 Updated Dec 19, 2025

MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics (NeurIPS 2025)

Python 67 3 Updated Nov 19, 2025
Python 241 36 Updated Sep 30, 2025

Baby Dragon Hatchling (BDH) – Architecture and Code

Python 3,320 169 Updated Oct 28, 2025

Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Python 488 34 Updated Apr 15, 2024

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,140 192 Updated Oct 9, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 16,873 2,029 Updated Dec 2, 2025

On the Theoretical Limitations of Embedding-Based Retrieval

Jupyter Notebook 614 47 Updated Sep 15, 2025

Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

Python 1,273 92 Updated Sep 22, 2025

This is the code repository for IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning models.

Python 91 6 Updated Oct 21, 2025

ComfyUI custom nodes and web utilities for real-time AI generation and interaction

Python 321 35 Updated Dec 16, 2025
Python 318 20 Updated Aug 28, 2025

Hierarchical Reasoning Model Official Release

Python 12,162 1,777 Updated Sep 9, 2025

BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to software applications. Here, you can find a comprehensive so…

C 234 28 Updated Aug 16, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,388 803 Updated Dec 20, 2025

MoDA: Multi-modal Diffusion Architecture for Talking Head Generation

Python 268 32 Updated Aug 22, 2025

FastAPI Implementation of Orpheus TTS streaming Chatbot

Python 23 4 Updated Jun 19, 2025
Next