Lists (2)
Sort Name ascending (A-Z)
Stars
分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.
Robust Speech Recognition via Large-Scale Weak Supervision
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Interact with your documents using the power of GPT, 100% privately, no data leaks
Developer-first error tracking and performance monitoring
Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
🦔 PostHog is an all-in-one developer platform for building successful products. We offer product analytics, web analytics, session replay, error tracking, feature flags, experimentation, surveys, d…
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Automate browser based workflows with AI
Avatars for Zoom, Skype and other video-conferencing apps.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Official implementations for paper: Anydoor: zero-shot object-level image customization
VirtualWife是一个虚拟数字人项目,支持B站直播,支持openai、ollama
Towards Real-Time Diffusion-Based Streaming Video Super-Resolution — An efficient one-step diffusion framework for streaming VSR with locality-constrained sparse attention and a tiny conditional de…
Helper scripts to install pip, in a Python installation that doesn't have it.
The code releasing for https://image-dream.github.io/