owenliang

Starred repositories

Active Noise Cancelling Algorithms implementation

Jupyter Notebook 103 18 Updated Jul 8, 2021

[arXiv 2023] Embodied Task Planning with Large Language Models

Python 192 13 Updated Aug 22, 2023

LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA

Python 231 24 Updated Aug 17, 2025

Stitches the vision head of SmolVLM2 onto the Qwen3-0.6B model and fine-tunes the combined model

Python 401 44 Updated Sep 8, 2025

🚀 "Large model": train a 26M-parameter vision multimodal VLM from scratch in just 1 hour! 🌏

Python 5,049 527 Updated Oct 23, 2025

A small project that combines VAD, a voiceprint lock, and SenseVoice in a simple implementation of near-real-time speech transcription

Python 38 4 Updated Sep 23, 2024

Pseudo Streaming SenseVoice with Hotwords

Python 366 42 Updated Mar 13, 2025

ASR system using transformer neural networks, built from scratch

PureBasic 3 Updated Apr 14, 2022

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,141 646 Updated Oct 23, 2025

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,180 1,337 Updated Oct 1, 2025

Play with neural networks!

TypeScript 12,610 2,662 Updated Sep 10, 2025

Simple MCP Client for remote MCP Servers 🌐

Python 24 8 Updated Jun 15, 2025

A working pattern for SSE-based MCP clients and servers

Python 298 51 Updated Mar 6, 2025

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 11,294 1,683 Updated Jul 2, 2025

A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.

Python 282 21 Updated Aug 15, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 47,341 3,901 Updated Oct 23, 2025

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 1,724 132 Updated Apr 14, 2025

A Unified Toolkit for Deep Learning Based Document Image Analysis

Python 5,547 513 Updated Aug 15, 2024

The definitive Web UI for local AI, with powerful features and easy setup.

Python 45,221 5,819 Updated Oct 23, 2025

resp-benchmark is a benchmark tool for testing databases that support the RESP protocol, such as Redis, Valkey, and Tair.

Rust 22 6 Updated Jul 18, 2025

Fully open reproduction of DeepSeek-R1

Python 25,567 2,397 Updated Sep 8, 2025

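🔥 🔥 🔥 Self-hosted Docker image acceleration service: one-click deployment, based on the official Docker Registry, of image acceleration and management services for Docker, K8s, Quay, Ghcr, Mcr, Nvcr, and others. Supports serverless deployment to ClawCloud, Render, or Koyeb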

JavaScript 3,722 547 Updated Oct 15, 2025

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

TypeScript 9,782 1,604 Updated Oct 23, 2025

How to do Real Time Trigger Word Detection with Keras | DLology

Jupyter Notebook 161 52 Updated Sep 6, 2019

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Python 637 128 Updated Sep 17, 2025

A hacker's AI voice assistant, built using Python and PyTorch.

Python 1,095 368 Updated Mar 31, 2024

A lightweight, simple-to-use, RNN wake word listener

Python 938 240 Updated Nov 25, 2023

A resource for learning about Machine learning & Deep Learning

Python 8,295 2,795 Updated Aug 17, 2024