Thanks to visit codestin.com
Credit goes to github.com

YuanGongND

Follow

Yuan Gong YuanGongND

Follow

Research Scientist, MIT CSAIL

434 followers · 2 following

MIT
Cambridge, MA
23:55 (UTC -05:00)
yuangongnd.github.io

Achievements

Achievements

Pinned Loading

ltu ltu Public

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Python 466 41
whisper-at whisper-at Public

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

Python 413 35
gopt gopt Public

Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

Python 196 37
cav-mae cav-mae Public

Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

Python 286 25
ssast ssast Public

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Python 411 66
ast ast Public

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1.4k 242