neeker

码农三德子 neeker

17 followers · 7 following

长沙

Achievements

Starred repositories

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 18,203 1,193 Updated Oct 25, 2025

Unstructured-IO / unstructured.PaddleOCR

Forked from PaddlePaddle/PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 40 6 Updated Mar 17, 2025

castorini / rank_llm

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python 545 76 Updated Oct 27, 2025

njvisionpower / Safety-Helmet-Wearing-Dataset

Safety helmet wearing detect dataset, with pretrained model

Python 1,610 414 Updated Dec 17, 2019

abewley / sort

Simple, online, and realtime tracking of multiple objects in a video sequence.

Python 4,282 1,136 Updated Nov 28, 2023

cvzone / cvzone

This is a Computer vision package that makes its easy to run Image processing and AI functions. At the core it uses OpenCV and Mediapipe libraries.

Python 1,293 275 Updated May 10, 2024

espressif / esp-sr

Speech recognition

C 1,149 168 Updated Oct 21, 2025

xinnan-tech / xiaozhi-esp32-server

本项目为xiaozhi-esp32提供后端服务，帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.

Python 7,267 2,471 Updated Oct 28, 2025

78 / xiaozhi-esp32

An MCP-based chatbot | 一个基于MCP的聊天机器人

C++ 20,793 4,192 Updated Oct 28, 2025

jhy549 / credible_LLM_watermarking

Python 5 3 Updated Mar 20, 2025

PaddlePaddle / Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice （『飞桨』核心框架，深度学习&机器学习高性能单机、分布式训练和跨平台部署）

C++ 23,358 5,863 Updated Oct 28, 2025

PaddlePaddle / Paddle-Lite

PaddlePaddle High Performance Deep Learning Inference Engine for Mobile and Edge (飞桨高性能深度学习端侧推理引擎）

C++ 7,167 1,627 Updated May 22, 2025

ultralytics / ultralytics

Ultralytics YOLO 🚀

Python 47,974 9,254 Updated Oct 28, 2025

HumanSignal / label-studio

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 25,207 3,138 Updated Oct 28, 2025

zhahoi / Nanodet-YoloV8-Pose-MeterReader

使用Nanodet+YoloV8-Pose实现指针仪表的实时检测、高精度读数识别（借助ncnn框架）

C++ 174 16 Updated Oct 31, 2024

MLNLP-World / LLMs-from-scratch-CN

LLMs-from-scratch项目中文翻译

Jupyter Notebook 1,847 300 Updated Oct 15, 2025

datawhalechina / llms-from-scratch-cn

仅需Python基础，从0构建大语言模型；从0逐步构建GLM4\Llama3\RWKV6，深入理解大模型原理

Jupyter Notebook 3,651 506 Updated Aug 15, 2024

skindhu / Build-A-Large-Language-Model-CN

《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书，适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材，我决定将其翻译成中文，并通过 GitHub 进行开源共享。

HTML 2,503 445 Updated Sep 7, 2025

mindspore-lab / mindformers

Python 175 21 Updated Oct 27, 2025

IBM / mcp-context-forge

A Model Context Protocol (MCP) Gateway & Registry. Serves as a central management point for tools, resources, and prompts that can be accessed by MCP-compatible LLM applications. Converts REST API …

Python 2,745 362 Updated Oct 28, 2025

opendatalab / MinerU

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 47,616 3,926 Updated Oct 28, 2025

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 8,843 664 Updated Jan 3, 2025

opendatalab / OmniDocBench

[CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation

Python 1,093 93 Updated Oct 28, 2025

awesomedata / awesome-public-datasets

A topic-centric list of HQ open datasets.

69,911 10,854 Updated Oct 15, 2025

modelcontextprotocol / modelcontextprotocol

Specification and documentation for the Model Context Protocol

TypeScript 6,044 1,076 Updated Oct 27, 2025

Unstructured-IO / unstructured-api

Python 830 176 Updated Oct 20, 2025

modelscope / 3D-Speaker

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 2,506 225 Updated Aug 12, 2025

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,559 288 Updated Aug 14, 2025

xiangyuecn / Recorder

html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式，支持pc和Android、iOS部分浏览器、Hybrid App（提供Android iOS App源码）、微信，提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码

JavaScript 5,459 1,086 Updated Mar 31, 2025

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,244 1,341 Updated Oct 1, 2025

码农三德子 neeker

Starred repositories

Kubernetes

Go

Chrome