Stars
High performance self-hosted photo and video management solution.
Collection of scripts and notebooks for OpenAI's latest GPT OSS models
A .NET library that simplifies working with OpenXML documents. Create, read, and manipulate Wordprocessing documents (Docx), Excel spreadsheets (Xlsx), and PowerPoint presentations (Pptx) effortles…
A unified hub for centralized management and dynamic organization of multiple MCP servers/APIs into streamable HTTP (SSE) endpoints, with support for flexible routing strategies
Learning English through the method of constructing sentences with conjunctions
🎉 基于Taro3、React的H5和微信小程序多端图表组件
一款JavaSDK用于快速接入AI大模型应用,整合多平台大模型,如OpenAi、智谱Zhipu(ChatGLM)、深度求索DeepSeek、月之暗面Moonshot(Kimi)、腾讯混元Hunyuan、零一万物(01)等等,提供统一的输入输出(对齐OpenAi)消除差异化,优化函数调用(Tool Call),优化RAG调用、支持向量数据库(Pinecone)、内置联网增强,并且支持JDK1.…
🤖 A visualization mcp contains 25+ visual charts using @antvis. Using for chart generation and data analysis.
Cornerstone is a set of JavaScript libraries that can be used to build web-based medical imaging applications. It provides a framework to build radiology applications such as the OHIF Viewer.
JavaScript event calendar. Modern alternative to fullcalendar and react-big-calendar.
A Vue.js full calendar, no dependency, no BS. 🤘
Have a natural, spoken conversation with AI!
chat log tool, easily use your own chat data. 聊天记录工具,轻松使用自己的聊天数据
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
Awesome curated collection of images and prompts generated by GPT-4o and gpt-image-1. Explore AI generated visuals created with ChatGPT and Sora, showcasing OpenAI’s advanced image generation capab…
Lets make video diffusion practical!
A generative speech model for daily dialogue.
Speech To Speech: an effort for an open-sourced and modular GPT4-o
OCR, layout analysis, reading order, table recognition in 90+ languages
A Conversational Speech Generation Model
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
Open-Sora: Democratizing Efficient Video Production for All
Use your locally running AI models to assist you in your web browsing
利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
Silero VAD: pre-trained enterprise-grade Voice Activity Detector