  • Shanghai Jiao Tong University


We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 167 11 Updated Dec 24, 2025

Spark-TTS Inference Code

Python 10,853 1,162 Updated Apr 9, 2025

Open-Source Frontier Voice AI

Python 19,013 2,100 Updated Dec 17, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 24,569 2,210 Updated Dec 23, 2025

DeepTalk: Towards Seamless and Smart Speech Interaction with Adaptive Modality-Specific MoE

Python 6 1 Updated Jul 23, 2025

✨✨[NeurIPS 2025] VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

Python 669 60 Updated May 24, 2025

Fast and memory-efficient exact attention

Python 21,284 2,245 Updated Dec 25, 2025

[SLT2024] DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition

Shell 8 3 Updated Feb 8, 2025

[ICASSP2024] One-Shot Sensitivity-Aware Mixed Sparsity Pruning for Large Language Models

Python 6 2 Updated Mar 6, 2025

[ICASSP2023] Joint Discriminator and Transfer Based Fast Domain Adaptation for End-to-End Speech Recognition

Python 3 Updated Feb 8, 2025

LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

Python 58 3 Updated Apr 14, 2025

[MICCAI 2024] Can LLMs' Tuning Methods Work in Medical Multimodal Domain?

Python 17 2 Updated Sep 18, 2024

WeChat chat history annual report

Vue 1,292 136 Updated Jan 5, 2022

Fantastic Data Engineering for Large Language Models

93 4 Updated Dec 29, 2024

✨✨Latest Advances on Multimodal Large Language Models

17,055 1,098 Updated Dec 23, 2025

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,093 527 Updated Jul 1, 2025

Stable Diffusion web UI

Python 159,264 29,619 Updated Dec 18, 2025

Ticket-grabbing script for Damai (damai.cn)

Python 5,321 920 Updated Mar 13, 2024

📚 Essential fundamentals for technical interviews: Leetcode, computer operating systems, computer networks, system design

183,231 51,251 Updated Aug 21, 2024

A smart butler system client written in Qt, with speech recognition, button sound effects, and camera capture.

C++ 14 2 Updated Aug 1, 2020