stefantaubert

🐍

Stefan Taubert stefantaubert

🐍

PhD researcher in neural speech synthesis; FOSS maintainer (20+ PyPI packages).

39 followers · 32 following

Chemnitz University of Technology
Chemnitz, Germany
22:06 (UTC +01:00)
https://stefantaubert.com
https://orcid.org/0000-0002-4932-2874
in/stefan-taubert

Achievements

Highlights

Organizations

Lists (3)

Sort

🔮 Future ideas

✨ Inspiration

🚀 My stack

Stars

microsoft / table-transformer

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,773 303 Updated Jun 24, 2024

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,475 766 Updated May 27, 2025

CorentinJ / librispeech-alignments

Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset

Python 170 24 Updated Mar 25, 2019

stanford-cs336 / spring2025-lectures

Python 1,985 421 Updated Oct 28, 2025

resemble-ai / chatterbox

SoTA open-source TTS

Python 14,410 1,930 Updated Sep 25, 2025

liantze / AltaCV

Yet another alternative curriculum vitae/résumé class with LaTeX

TeX 1,470 374 Updated Jul 30, 2025

agstenf / wasm-fingerprinting

research into WebAssembly-based fingerprinting

JavaScript 4 Updated Jul 22, 2025

monero-ecosystem / monero-python

A comprehensive Python module for handling Monero cryptocurrency

Python 249 79 Updated Sep 5, 2023

fastapi / typer

Typer, build great CLIs. Easy to code. Based on Python type hints.

Python 18,210 798 Updated Nov 3, 2025

keonlee9420 / Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Python 190 44 Updated Nov 18, 2021

cofinley / free-roam

An attempt to recreate the major parts of Roam for offline use

JavaScript 109 16 Updated May 8, 2023

tbvdm / sigtop

Export messages from Signal Desktop

Go 488 28 Updated Oct 4, 2025

google-research / perch-hoplite

Python 73 24 Updated Oct 28, 2025

briansunter / logseq-plugin-gpt3-openai

A plugin for GPT-3 AI assisted note taking in Logseq

TypeScript 741 84 Updated Oct 9, 2024

google-research / perch

Python 282 60 Updated Oct 29, 2025

anuraghazra / github-readme-stats

⚡ Dynamically generated stats for your github readmes

JavaScript 76,790 26,720 Updated Nov 3, 2025

birdnet-team / birdnetR

This is a wrapper for the birdnet Python package for automated bird sound ID

R 21 1 Updated Jul 21, 2025

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,724 1,379 Updated Dec 6, 2023

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,401 3,113 Updated Nov 4, 2025

birdnet-team / BirdNET-Analyzer

BirdNET analyzer for scientific audio data processing.

Python 1,285 223 Updated Nov 3, 2025

olivierlacan / keep-a-changelog

If you build software, keep a changelog.

Haml 6,401 3,588 Updated Oct 10, 2025

TensorSpeech / TensorFlowTTS

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,986 810 Updated Jul 5, 2024

speechio / chinese_text_normalization

Chinese text normalization for speech processing

Python 711 149 Updated Mar 18, 2023

selwin / python-user-agents

A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

Python 1,497 196 Updated Feb 16, 2023