arjun-kava

🎯

Focusing

Arjun Kava arjun-kava

Developer turned Founder @ videosdk.live

141 followers · 1.4k following

Achievements

Lists (8)

build-systems

DX

✨ Inspiration

QUIC Implementation

Research List

Scaling WebRTC

Training

TTS + STT

Stars

wsntxxn / UniFlow-Audio

Python 48 3 Updated Oct 17, 2025

dvlab-research / DreamOmni2

This project is the official implementation of 'DreamOmni2: Multimodal Instruction-based Editing and Generation'

Python 2,044 182 Updated Oct 20, 2025

EzioBy / Ditto

[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Python 334 29 Updated Oct 22, 2025

GalaxyGeneralRobotics / OpenWBT

Official implementation of OpenWBT.

Python 760 83 Updated Jul 30, 2025

OpenTeleVision / TeleVision

[CoRL 2024] Open-TeleVision: Teleoperation with Immersive Active Visual Feedback

Python 1,091 119 Updated Sep 27, 2024

MVIG-SJTU / AlphaPose

Real-Time and Accurate Full-Body Multi-Person Pose Estimation & Tracking System

Python 8,433 2,020 Updated May 13, 2024

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,596 571 Updated Oct 25, 2025

videosdk-live / NAMO-Turn-Detector-v1

High-performance, semantic turn detection for conversational AI

Python 10 1 Updated Oct 1, 2025

PRIME-RL / RL-Compositionality

FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones

Python 30 3 Updated Oct 17, 2025

SamsungSAILMontreal / TinyRecursiveModels

Python 5,156 686 Updated Oct 8, 2025

ghnmqdtg / VM-ASR

The official PyTorch implementation of VM-ASR, a model designed for high-fidelity audio super-resolution.

Python 15 Updated Sep 8, 2025

humanlayer / humanlayer

The best way to get AI coding agents to solve hard problems in complex codebases.

TypeScript 6,462 498 Updated Oct 25, 2025

rsxdalv / TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, …

TypeScript 2,676 282 Updated Oct 23, 2025

AudioLLMs / Awesome-Audio-LLM

Audio Large Language Models

Python 762 38 Updated Jul 5, 2025

Michael-A-Kuykendall / shimmy

⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.

Rust 3,080 218 Updated Oct 23, 2025

TencentCloudADP / youtu-agent

A simple yet powerful agent framework that delivers with open-source models

Python 3,643 353 Updated Oct 24, 2025

KempnerInstitute / nicewebrl

NiceWebRL is a Python library for quickly making human subject experiments that leverage machine reinforcement learning environments.

Python 70 7 Updated Oct 9, 2025

X-PLUG / MobileAgent

Mobile-Agent: The Powerful GUI Agent Family

Python 6,108 609 Updated Oct 17, 2025

BytedanceSpeech / seed-tts-eval

Python 1,442 131 Updated Jun 14, 2024

kyegomez / awesome-multi-agent-papers

A compilation of the best multi-agent papers

TeX 968 79 Updated Oct 20, 2025

videosdk-live / videosdk-rtc-ios-spm

A Swift framework for real-time audio and video communication for iOS applications.

Objective-C 2 2 Updated Aug 18, 2025

OHF-Voice / piper1-gpl

Fast and local neural text-to-speech engine

C++ 1,322 148 Updated Sep 10, 2025

aiola-lab / whisper-ner

Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"

Python 196 8 Updated Feb 25, 2025

JiarongQian / AgentMed

JavaScript 8 Updated Sep 3, 2025

msu-video-group / memfof

[ICCV'2025 Highlight] MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation

Python 68 2 Updated Sep 29, 2025

trymirai / uzu

A high-performance inference engine for AI models

Rust 1,345 34 Updated Oct 24, 2025

videosdk-community / ai-telephony-demo

Build an AI Telephony Agent for Inbound and Outbound Calls

Python 224 22 Updated Sep 22, 2025

halsay / ASR-TTS-paper-daily

Update ASR paper everyday

Python 346 18 Updated Oct 25, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,940 1,867 Updated Oct 23, 2025

Tencent / ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 22,192 4,339 Updated Oct 23, 2025