- All languages
- Batchfile
- C
- C#
- C++
- CMake
- CSS
- Dart
- Dockerfile
- Gherkin
- Go
- Groovy
- HTML
- Haskell
- Java
- JavaScript
- Jinja
- Jupyter Notebook
- Kotlin
- LLVM
- Lua
- MLIR
- Makefile
- Markdown
- MoonScript
- Mustache
- Objective-C
- PHP
- PLpgSQL
- Perl
- PowerShell
- Python
- Raku
- Rich Text Format
- Roff
- Ruby
- Rust
- SCSS
- SaltStack
- Scala
- Shell
- Smarty
- Solidity
- Swift
- Tcl
- TeX
- TypeScript
- Vim Script
- Vue
Starred repositories
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
Safety helmet wearing detect dataset, with pretrained model
Simple, online, and realtime tracking of multiple objects in a video sequence.
This is a Computer vision package that makes its easy to run Image processing and AI functions. At the core it uses OpenCV and Mediapipe libraries.
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎)
Label Studio is a multi-type data labeling and annotation tool with standardized output format
使用Nanodet+YoloV8-Pose实现指针仪表的实时检测、高精度读数识别(借助ncnn框架)
LLMs-from-scratch项目中文翻译
仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理
《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并通过 GitHub 进行开源共享。
A Model Context Protocol (MCP) Gateway & Registry. Serves as a central management point for tools, resources, and prompts that can be accessed by MCP-compatible LLM applications. Converts REST API …
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
A Comprehensive Toolkit for High-Quality PDF Content Extraction
[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation
A topic-centric list of HQ open datasets.
Specification and documentation for the Model Context Protocol
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.