
YuanGongND

Pinned

  1. ltu (Public)

    Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

    Python, 466 stars, 41 forks

  2. whisper-at (Public)

    Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers".

    Python, 413 stars, 35 forks

  3. gopt (Public)

    Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".

    Python, 196 stars, 37 forks

  4. cav-mae (Public)

    Code and Pretrained Models for ICLR 2023 Paper "Contrastive Audio-Visual Masked Autoencoder".

    Python, 286 stars, 25 forks

  5. ssast (Public)

    Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

    Python, 411 stars, 66 forks

  6. ast (Public)

    Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

    Jupyter Notebook, 1.4k stars, 242 forks