zzajic

A toolkit for speaker diarization.

Jupyter Notebook 319 33 Updated Oct 8, 2025

Repo for AI Republic's AI Engineering Course - Winter 2024

Jupyter Notebook 47 25 Updated Nov 24, 2024

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 1,073 164 Updated Oct 13, 2025

A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems

Python 229 40 Updated Sep 9, 2025

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Jupyter Notebook 8,631 963 Updated Oct 23, 2025

Python package for combining diarization system outputs.

Python 90 12 Updated Oct 12, 2023

A PyTorch-based Speech Toolkit

Python 10,730 1,594 Updated Nov 2, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,043 3,177 Updated Nov 5, 2025

wav2vec2 audio classification for prosodic boundary detection and other tasks

Jupyter Notebook 42 6 Updated Aug 11, 2023

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,117 31,045 Updated Nov 5, 2025

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Python 593 164 Updated Jan 20, 2022

Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN

Python 96 22 Updated Sep 15, 2021

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 31,918 6,622 Updated Sep 30, 2025

End-to-End Neural Diarization

Python 409 63 Updated Aug 30, 2021

kaldi-asr/kaldi is the official location of the Kaldi project.

Shell 15,205 5,368 Updated Sep 22, 2025

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,470 516 Updated Jun 13, 2025

State-of-the-art 2D and 3D Face Analysis Project

Python 26,946 5,814 Updated Sep 27, 2025