Starred repositories
React Flow |Ā Svelte Flow - Powerful open source libraries for building node-based UIs with React (https://reactflow.dev) or Svelte (https://svelteflow.dev). Ready out-of-the-box and infinitely custā¦
Import a 3D Model and automatically assign and export animations
A feature-rich command-line audio/video downloader
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
Examples of WebRTC applications that are large, or use 3rd party libraries
Tissue - Blender's add-on for computational design
A curated collection of fun and creative examples generated with Nano Bananaš, Gemini-2.5-flash-image based model. We also release Nano-consistent-150K openly to support the community's developmentā¦
Text-audio foundation model from Boson AI
A light-weight torrent media center at one place.
Self-hosted torrent video streaming service compatible with Chromecast, AppleTV & Kodi deployable in the cloud
Add object detection, tracking, and mobile notifications to any RTSP Camera or iPhone.
WiFi-3D-Fusion is an open-source research project that leverages WiFi CSI signals and deep learning to estimate 3D human pose, fusing wireless sensing with computer vision techniques for next-generā¦
Wan: Open and Advanced Large-Scale Video Generative Models
deepbeepmeep / Wan2GP
Forked from Wan-Video/Wan2.1A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, Dā¦
The AI coding agent built for the terminal.
browser plugin to send youtube, insta (all social videos) to local backend and process audio and video in all sorts of ways.
Generate audiobooks from pdf or epub using Next-gen AI Chatterbox-tts from Resemble-AI
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
Guttersnipe - convert from MIDI to ASCII tab (and abc, vex, or back again)
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
Streaming and Fine-tuning for Chatterbox TTS