Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View kiranbaby14's full-sized avatar
:electron:
Focusing
:electron:
Focusing

Block or report kiranbaby14

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Production-ready implementation of InvisPose - a revolutionary WiFi-based dense human pose estimation system that enables real-time full-body tracking through walls using commodity mesh routers

Python 3,790 304 Updated Jun 9, 2025

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 19,478 1,358 Updated Nov 27, 2025

A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using autoregressive diffusion.

Python 255 36 Updated Dec 15, 2025

tiny vision language model

Python 9,127 706 Updated Nov 14, 2025

Open Vision Agents by Stream. Build Vision Agents quickly with any model or video provider. Uses Stream's edge network for ultra-low latency.

Python 3,465 276 Updated Dec 24, 2025

Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Features low-latency audio streaming, dynamic visual feedback…

TypeScript 276 50 Updated Apr 14, 2025

Implementation of "Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length"

Python 1,208 110 Updated Dec 18, 2025

very good whiteboard infinite canvas SDK

TypeScript 44,354 2,900 Updated Dec 27, 2025

👨‍🎨 The ergonomic way to storyboard. Turns sketches and annotations into videos by drawing on a canvas.

TypeScript 107 9 Updated Dec 25, 2025

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 10,556 818 Updated Dec 4, 2024

MotionStream: Real-Time Video Generation with Interactive Motion Controls

450 16 Updated Nov 13, 2025

🎒 Token-Oriented Object Notation (TOON) – Compact, human-readable, schema-aware JSON for LLM prompts. Spec, benchmarks, TypeScript SDK.

TypeScript 21,170 930 Updated Dec 15, 2025

"ViMax: Agentic Video Generation (Director, Screenwriter, Producer, and Video Generator All-in-One)"

Python 1,639 284 Updated Dec 15, 2025

A tiny CPU simulator written in Python

Python 1,108 28 Updated Dec 15, 2025

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,165 395 Updated Jul 11, 2024

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 786 51 Updated Oct 15, 2025

The best ChatGPT that $100 can buy.

Python 39,342 4,997 Updated Dec 23, 2025

A comprehensive toolkit for reliably locking, packing and deploying environments for ComfyUI workflows.

Python 200 28 Updated Nov 10, 2025

A fast AI Video Generator for the GPU Poor. Supports Wan 2.1/2.2, Qwen Image, Hunyuan Video, LTX Video and Flux.

Python 3,577 498 Updated Dec 24, 2025

Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sharing.

Python 1,300 165 Updated Nov 3, 2025
Python 16 1 Updated Sep 29, 2025

for rileys podcast

TypeScript 175 152 Updated Jul 8, 2025

AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.

TypeScript 11,020 1,011 Updated Dec 26, 2025

Kortix – build, manage and train AI Agents.

TypeScript 18,893 3,259 Updated Dec 27, 2025

[NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ MoE ckpt released! Only 4GB VRAM is enough to run!

Python 2,054 114 Updated Dec 19, 2025

[CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"

Jupyter Notebook 845 66 Updated Dec 8, 2025

Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than processing entire files after upload (“batch mode”), Whisper-Flow a…

Python 368 46 Updated Feb 26, 2025

Lets make video diffusion practical!

Python 16,407 1,599 Updated Oct 16, 2025

đź’ˇ VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning

Python 291 29 Updated Oct 12, 2025

A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.

Python 242 23 Updated Apr 22, 2025
Next