Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View longcw's full-sized avatar

Block or report longcw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🎦 Micam 是一个专为小米摄像头设计的 RTSP 桥接服务(非官方),能够将小米摄像头的视频流本地转推到RTSP服务器,支持接入 HomeAssistant、Go2rtc、Frigate、Scrypted、Homekit 等多种NVR和智能家居系统。该项目采用 Docker Compose 快速部署方案,基于小米官方的Miloco,并集成Go2rtc实现RTSP流服务,无需GPU即可运行…

Python 551 24 Updated Dec 2, 2025

A Fully Self-Hosted Solution for Full-Duplex Voice Interaction

Python 461 34 Updated Sep 28, 2025

​​Unlimited-length talking video generation​​ that supports image-to-video and video-to-video generation

Python 4,343 729 Updated Dec 18, 2025
TypeScript 53 14 Updated Jan 11, 2026

bitHuman SDK examples

HTML 9 1 Updated Dec 31, 2025

🔊 让小爱音箱「听见你的声音」,解锁无限可能。

Rust 1,854 200 Updated Jan 8, 2026

[ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.

Python 450 60 Updated Nov 10, 2025

OpenAI Agents adapter for Livekit

Python 7 2 Updated Jun 25, 2025

A tool for Container Debloating that removes bloat and improves performance.

Go 637 16 Updated Aug 12, 2025

LiveKit Agent integrated with MCP server of Home Assistant

Python 18 6 Updated May 25, 2025

Turns any OpenAI voice agent into a lively visual agent with bitHuman SDK

Python 3 Updated Apr 21, 2025

Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching

Python 4,259 708 Updated Jan 4, 2026

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 5,176 708 Updated Sep 26, 2025

A debugging and profiling tool that can trace and visualize python code execution

Python 7,503 471 Updated Jan 11, 2026

coredumpy saves your crash site for post-mortem debugging

Python 751 20 Updated Jan 5, 2026

A lightweight, powerful framework for multi-agent workflows

Python 18,286 3,053 Updated Jan 12, 2026

Voice activity detector (VAD) for the browser with a simple API

TypeScript 1,771 246 Updated Jan 3, 2026

The complete stack for AI Engineers: framework, runtime and control plane.

Python 36,802 4,870 Updated Jan 12, 2026

[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…

Python 1,405 95 Updated Sep 21, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

Python 71,267 7,795 Updated Jan 12, 2026

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 39,819 2,686 Updated Jan 11, 2026

A powerful framework for building realtime voice AI agents 🤖🎙️📹

Python 9,005 2,343 Updated Jan 12, 2026

Human: AI-powered 3D Face Detection & Rotation Tracking, Face Description & Recognition, Body Pose Tracking, 3D Hand & Finger Tracking, Iris Analysis, Age & Gender & Emotion Prediction, Gaze Tracki…

HTML 2,931 404 Updated Dec 13, 2025

Playground Web UI using segment-anything-2 models from the Meta.

Python 55 6 Updated Dec 4, 2024

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 3,068 157 Updated Nov 20, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,916 2,048 Updated Dec 26, 2025

Instant voice cloning by MIT and MyShell. Audio foundation model.

Python 35,777 3,988 Updated Apr 19, 2025
Next