- Home Intelligent System
- Poland
- https://gadzety360.pl
- @Gadzety360pl
- https://medium.com/@brainhome9
Stars
TTS model capable of streaming conversational audio in real time.
Open-Source Dual-Arm Mobile Robot with Motorized Lift
PyTorch native quantization and sparsity for training and inference
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…
Instant Skinned Gaussian Avatars for Web, Mobile and VR Applications
Software for amblyopia treatment done for Meta Quest 3
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚
MPMAvatar: Learning 3D Gaussian Avatars with Accurate and Robust Physics-Based Dynamics (NeurIPS 2025)
Baby Dragon Hatchling (BDH) – Architecture and Code
Official repository for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
On the Theoretical Limitations of Embedding-Based Retrieval
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
This is the code repository for IntPhys 2, a video benchmark designed to evaluate the intuitive physics understanding of deep learning models.
ComfyUI custom nodes and web utilities for real-time AI generation and interaction
Hierarchical Reasoning Model Official Release
BUDDIE is the first full-stack open-source AI voice interaction solution, providing a complete end-to-end system from hardware design to software applications. Here, you can find a comprehensive so…
Supercharge Your LLM with the Fastest KV Cache Layer
MoDA: Multi-modal Diffusion Architecture for Talking Head Generation
FastAPI Implementation of Orpheus TTS streaming Chatbot