-
Chemnitz University of Technology
- Chemnitz, Germany
-
22:06
(UTC +01:00) - https://stefantaubert.com
- https://orcid.org/0000-0002-4932-2874
- in/stefan-taubert
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Stars
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset
Yet another alternative curriculum vitae/résumé class with LaTeX
research into WebAssembly-based fingerprinting
A comprehensive Python module for handling Monero cryptocurrency
Typer, build great CLIs. Easy to code. Based on Python type hints.
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
An attempt to recreate the major parts of Roam for offline use
A plugin for GPT-3 AI assisted note taking in Logseq
⚡ Dynamically generated stats for your github readmes
This is a wrapper for the birdnet Python package for automated bird sound ID
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
BirdNET analyzer for scientific audio data processing.
If you build software, keep a changelog.
😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Chinese text normalization for speech processing
A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.
This project focuses on developing a graphical interface in Django for tools for synthesizing speech developed by @stefantaubert.
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.
Universal Command Line Interface for Amazon Web Services
a MUSHRA compliant web audio API based experiment software
Official repository for Citation Style Language (CSL) citation styles.