jpWang

Jiapeng Wang jpWang

@SCUT-DLVCLab

48 followers · 31 following

South China University of Technology
Guangzhou, China

Achievements

Organizations

Stars

Mountchicken / Resophy

🎯 Read research papers faster with AI. Resophy is an HTML-based AI paper reader with: 🤖 AI Translation & Analysis — instantly understand structure, contributions, and results 🚀 Daily arXiv Recommen…

Python 96 3 Updated Dec 18, 2025

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 1,163 201 Updated Dec 23, 2025

shi-yx / URaG

Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026 Oral).

31 Updated Nov 14, 2025

Alibaba-NLP / DeepResearch

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 17,697 1,355 Updated Dec 17, 2025

xhyumiracle / Awesome-AgenticLLM-RL-Papers

1,342 60 Updated Sep 5, 2025

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,227 71 Updated Mar 9, 2025

ByteDance-Seed / VeOmni

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

Python 1,443 122 Updated Dec 22, 2025

QwenLM / Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,154 193 Updated Oct 9, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 14,673 3,404 Updated Dec 23, 2025

ASLP-lab / Hum-Dial

ICASSP2026 HumDial Challenge

Python 28 3 Updated Dec 13, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 17,709 2,868 Updated Dec 23, 2025

XiaomiMiMo / MiMo-Audio

MiMo-Audio: Audio Language Models are Few-Shot Learners

Python 908 87 Updated Sep 20, 2025

xpzouying / xiaohongshu-mcp

MCP for xiaohongshu.com

Go 7,598 1,192 Updated Dec 21, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,343 3,243 Updated Dec 22, 2025

stepfun-ai / Step-Audio2

Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

Python 1,276 92 Updated Sep 22, 2025

ASLP-lab / OSUM

OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.

Python 461 30 Updated Nov 23, 2025

XiaoMi / dasheng

Official PyTorch code for Deep Audio-Signal Holistic Embeddings

Python 171 12 Updated Nov 7, 2025

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,449 332 Updated Dec 22, 2025

TEN-framework / ten-framework

Open-source framework for conversational voice AI agents

Python 9,370 1,098 Updated Dec 22, 2025

k2-fsa / ZipVoice

Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Python 744 104 Updated Dec 2, 2025

k2-fsa / sherpa-onnx

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 9,415 1,040 Updated Dec 22, 2025

HelloWorldU / vllm

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 2 Updated Nov 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Jiapeng Wang jpWang

Achievements

Achievements

Organizations

Block or report jpWang

Stars

Mountchicken / Resophy

NVIDIA-NeMo / RL

shi-yx / URaG

Alibaba-NLP / DeepResearch

xhyumiracle / Awesome-AgenticLLM-RL-Papers

Tebmer / Awesome-Knowledge-Distillation-of-LLMs

ByteDance-Seed / VeOmni

QwenLM / Qwen3-Omni

NVIDIA / Megatron-LM

ASLP-lab / Hum-Dial

volcengine / verl

XiaomiMiMo / MiMo-Audio

xpzouying / xiaohongshu-mcp

NVIDIA-NeMo / NeMo

stepfun-ai / Step-Audio2

ASLP-lab / OSUM

XiaoMi / dasheng

vllm-project / llm-compressor

TEN-framework / ten-framework

k2-fsa / ZipVoice

k2-fsa / sherpa-onnx

HelloWorldU / vllm

librosa / librosa

TakHemlata / SSL_Anti-spoofing

modelscope / FunASR

wenet-e2e / wespeaker

opendilab / CleanS2S

xiaomi-research / dasheng-lm

iver56 / audiomentations

wenet-e2e / wenet