-
Marconi ML Lab | Makerere AI Lab
- Kampala, Uganda
- https://www.linkedin.com/in/kagumire-sulaiman-3b2a97135/
-
CrisperWhisper Public
Forked from nyrahealth/CrisperWhisperVerbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
Python Other UpdatedJun 3, 2025 -
whisper-lm Public
Forked from hitz-zentroa/whisper-lmAdd n-gram and large language model support to Whisper models.
Jupyter Notebook Apache License 2.0 UpdatedApr 9, 2025 -
whisper-timestamped Public
Forked from linto-ai/whisper-timestampedMultilingual Automatic Speech Recognition with word-level timestamps and confidence
Python GNU Affero General Public License v3.0 UpdatedMar 31, 2025 -
yolov12 Public
Forked from sunsmarterjie/yolov12YOLOv12: Attention-Centric Real-Time Object Detectors
Python GNU Affero General Public License v3.0 UpdatedMar 18, 2025 -
Spark-TTS Public
Forked from SparkAudio/Spark-TTSSpark-TTS Inference Code
Python Apache License 2.0 UpdatedMar 5, 2025 -
SimpleSpeech Public
Forked from yangdongchao/SimpleSpeechThe open source code for SimpleSpeech series
Python UpdatedOct 8, 2024 -
-
explainerdashboard Public
Forked from oegedijk/explainerdashboardQuickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
Python MIT License UpdatedAug 10, 2023 -
AudioGPT Public
Forked from AIGC-Audio/AudioGPTAudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Python Other UpdatedJul 26, 2023 -
masakhane-pos Public
Forked from masakhane-io/masakhane-posPOS for African languages
Jupyter Notebook MIT License UpdatedJul 24, 2023 -
NeMo Public
Forked from NVIDIA-NeMo/NeMoNeMo: a toolkit for conversational AI
Python Apache License 2.0 UpdatedMay 18, 2023 -
-
deep-speaker Public
Forked from philipperemy/deep-speakerDeep Speaker: an End-to-End Neural Speaker Embedding System.
Jupyter Notebook MIT License UpdatedMay 4, 2023 -
-
-
speechbrain Public
Forked from speechbrain/speechbrainA PyTorch-based Speech Toolkit
Python Apache License 2.0 UpdatedApr 27, 2023 -
StyleTTS Public
Forked from yl4579/StyleTTSOfficial Implementation of StyleTTS
Python MIT License UpdatedApr 20, 2023 -
naturalspeech Public
Forked from heatz123/naturalspeechA fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
Python UpdatedApr 17, 2023 -
-
denoiser Public
Forked from facebookresearch/denoiserReal Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
Python Other UpdatedMar 14, 2023 -
minREV Public
Forked from karttikeya/minREVA simple minimal implementation of Reversible Vision Transformers
Python UpdatedFeb 10, 2023 -
sunbird-speech Public
Forked from SunbirdAI/sunbird-speechSunbird Speech Recognition Toolkit
Jupyter Notebook UpdatedJan 30, 2023 -
CleanUNet Public
Forked from NVIDIA/CleanUNetOfficial PyTorch Implementation of CleanUNet (ICASSP 2022)
Python MIT License UpdatedNov 7, 2022 -
vits Public
Forked from jaywalnut310/vitsVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Python MIT License UpdatedOct 22, 2022 -
DeepLearningExamples Public
Forked from NVIDIA/DeepLearningExamplesDeep Learning Examples
Python UpdatedOct 18, 2022 -
-
FullSubNet-plus Public
Forked from RookieJunChen/FullSubNet-plusThe official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".
Python Apache License 2.0 UpdatedAug 18, 2022 -
mmdetection Public
Forked from open-mmlab/mmdetectionOpenMMLab Detection Toolbox and Benchmark
Python Apache License 2.0 UpdatedAug 8, 2022 -
-