Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View ms-dot-k's full-sized avatar

Block or report ms-dot-k

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. Lip-to-Speech-Synthesis-in-the-Wild Lip-to-Speech-Synthesis-in-the-Wild Public

    PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)

    Python 70 7

  2. Multi-head-Visual-Audio-Memory Multi-head-Visual-Audio-Memory Public

    PyTorch implementation of "Distinguishing Homophenes using Multi-Head Visual-Audio Memory" (AAAI2022)

    Python 27 5

  3. Visual-Context-Attentional-GAN Visual-Context-Attentional-GAN Public

    PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)

    Python 25 5

  4. Visual-Audio-Memory Visual-Audio-Memory Public

    PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)

    Python 20 4

  5. AVSR AVSR Public

    PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring" (CVPR2023) and "Visual Context-driven Audio Feature Enhan…

    Python 20 2

  6. TMT TMT Public

    TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages

    Jupyter Notebook 18