Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View huycq1712's full-sized avatar
🚬
focus
🚬
focus
  • Hanoi University of Science and Technology
  • Ha Noi
  • Codestin Search App @huycq1712

Block or report huycq1712

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Contexts Optical Compression

Python 18,223 1,196 Updated Oct 25, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,982 3,158 Updated Oct 28, 2025

VietASR - Vietnamese Automatic Speech Recognition

Python 154 57 Updated Oct 29, 2024

[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Python 261 19 Updated Feb 12, 2023

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 711 115 Updated Oct 23, 2023

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,274 230 Updated Aug 7, 2025
Python 1,267 376 Updated Oct 5, 2025

A Configurable template for a FastAPI application, with Authentication, User integration, Admin pages and a snappy CLI to control it all!

Python 205 14 Updated Oct 27, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 8,569 950 Updated Oct 28, 2025

Speech-to-text server framework with next-gen Kaldi

C++ 803 131 Updated Oct 28, 2025

SoTA open-source TTS

Python 14,295 1,890 Updated Sep 25, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,460 3,190 Updated Oct 28, 2025

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

3,081 513 Updated Oct 19, 2023

Port of OpenAI's Whisper model in C/C++

C++ 44,087 4,877 Updated Oct 28, 2025

A mcp server to allow LLMS gain context about shadcn ui component structure,usage and installation,compaitable with react,svelte 5,and vue

TypeScript 2,407 266 Updated Oct 23, 2025

TTS Dia finetuning for Vietnamese

Python 108 32 Updated Aug 20, 2025

ViStreamASR - Real-Time Vietnamese Speech Recognition

Python 46 15 Updated Jul 12, 2025

The training program for libfacedetection for face detection and 5-landmark detection.

Python 820 214 Updated Jan 19, 2024

An open source library for face detection in images. The face detection speed can reach 1000FPS.

C++ 12,661 3,044 Updated Sep 14, 2025

Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!

Python 10,949 857 Updated Oct 13, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…

93,492 25,259 Updated Oct 19, 2025

Open Source Resources

1,044 Updated Oct 21, 2025

MCP server that interacts with Obsidian via the Obsidian rest API community plugin

Python 2,323 288 Updated Jun 28, 2025

[IJCV 2025] Smaller But Better: Unifying Layout Generation with Smaller Large Language Models

Python 147 1 Updated Aug 3, 2025

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

Python 1,788 133 Updated Aug 25, 2025

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 17,176 2,921 Updated Oct 21, 2025

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 23,799 4,390 Updated Oct 9, 2025

Declaratively deploy your Kubernetes manifests, Kustomize configs, and Charts as Helm releases. Generate all-in-one manifests for use with ArgoCD.

Go 4,787 311 Updated Oct 27, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 7,668 621 Updated Oct 27, 2025
Next