Huy Q Can huycq1712

🚬

focus

15 followers · 30 following

Hanoi University of Science and Technology
Ha Noi
@huycq1712

Starred repositories

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 18,223 1,196 Updated Oct 25, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 15,982 3,158 Updated Oct 28, 2025

dangvansam / viet-asr

VietASR - Vietnamese Automatic Speech Recognition

Python 154 57 Updated Oct 29, 2024

kssteven418 / Squeezeformer

[NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition

Python 261 19 Updated Feb 12, 2023

openspeech-team / openspeech

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python 711 115 Updated Oct 23, 2023

k2-fsa / k2

FSA/FST algorithms, differentiable, with PyTorch compatibility.

Cuda 1,274 230 Updated Aug 7, 2025

Tencent / POINTS-Reader

182 7 Updated Sep 16, 2025

k2-fsa / icefall

Python 1,267 376 Updated Oct 5, 2025

seapagan / fastapi-template

A configurable template for a FastAPI application, with authentication, user integration, admin pages, and a snappy CLI to control it all!

Python 205 14 Updated Oct 27, 2025

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 8,569 950 Updated Oct 28, 2025

k2-fsa / sherpa

Speech-to-text server framework with next-gen Kaldi

C++ 803 131 Updated Oct 28, 2025

resemble-ai / chatterbox

SoTA open-source TTS

Python 14,295 1,890 Updated Sep 25, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 19,460 3,190 Updated Oct 28, 2025

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

3,081 513 Updated Oct 19, 2023

ggml-org / whisper.cpp

Port of OpenAI's Whisper model in C/C++

C++ 44,087 4,877 Updated Oct 28, 2025

Jpisnice / shadcn-ui-mcp-server

An MCP server that lets LLMs gain context about shadcn ui component structure, usage, and installation; compatible with React, Svelte 5, and Vue

TypeScript 2,407 266 Updated Oct 23, 2025

TuananhCR / Dia-Finetuning-Vietnamese

TTS Dia finetuning for Vietnamese

Python 108 32 Updated Aug 20, 2025

nguyenvulebinh / ViStreamASR

ViStreamASR - Real-Time Vietnamese Speech Recognition

Python 46 15 Updated Jul 12, 2025

ShiqiYu / libfacedetection.train

The training program for libfacedetection for face detection and 5-landmark detection.

Python 820 214 Updated Jan 19, 2024

ShiqiYu / libfacedetection

An open source library for face detection in images. The face detection speed can reach 1000FPS.

C++ 12,661 3,044 Updated Sep 14, 2025

tadata-org / fastapi_mcp

Expose your FastAPI endpoints as Model Context Protocol (MCP) tools, with Auth!

Python 10,949 857 Updated Oct 13, 2025

x1xhlol / system-prompts-and-models-of-ai-tools

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus Agent Tools, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae…

93,492 25,259 Updated Oct 19, 2025

samayun / devbooks

Open Source Resources

1,044 Updated Oct 21, 2025

MarkusPfundstein / mcp-obsidian

MCP server that interacts with Obsidian via the Obsidian rest API community plugin

Python 2,323 288 Updated Jun 28, 2025

NiceRingNode / LGGPT

[IJCV 2025] Smaller But Better: Unifying Layout Generation with Smaller Large Language Models

Python 147 1 Updated Aug 3, 2025

NanoNets / docext

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

Python 1,788 133 Updated Aug 25, 2025

google-gemini / gemini-fullstack-langgraph-quickstart

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 17,176 2,921 Updated Oct 21, 2025

TauricResearch / TradingAgents

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 23,799 4,390 Updated Oct 9, 2025

helmfile / helmfile

Declaratively deploy your Kubernetes manifests, Kustomize configs, and Charts as Helm releases. Generate all-in-one manifests for use with ArgoCD.

Go 4,787 311 Updated Oct 27, 2025

bytedance / Dolphin

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 7,668 621 Updated Oct 27, 2025
