Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View stefantaubert's full-sized avatar
🐍
🐍

Highlights

  • Pro

Organizations

@birdnet-team

Block or report stefantaubert

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,773 303 Updated Jun 24, 2024

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 9,475 766 Updated May 27, 2025

Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset

Python 170 24 Updated Mar 25, 2019

SoTA open-source TTS

Python 14,410 1,930 Updated Sep 25, 2025

Yet another alternative curriculum vitae/résumé class with LaTeX

TeX 1,470 374 Updated Jul 30, 2025

research into WebAssembly-based fingerprinting

JavaScript 4 Updated Jul 22, 2025

A comprehensive Python module for handling Monero cryptocurrency

Python 249 79 Updated Sep 5, 2023

Typer, build great CLIs. Easy to code. Based on Python type hints.

Python 18,210 798 Updated Nov 3, 2025

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Python 190 44 Updated Nov 18, 2021

An attempt to recreate the major parts of Roam for offline use

JavaScript 109 16 Updated May 8, 2023

Export messages from Signal Desktop

Go 488 28 Updated Oct 4, 2025

A plugin for GPT-3 AI assisted note taking in Logseq

TypeScript 741 84 Updated Oct 9, 2024
Python 282 60 Updated Oct 29, 2025

⚡ Dynamically generated stats for your github readmes

JavaScript 76,790 26,720 Updated Nov 3, 2025

This is a wrapper for the birdnet Python package for automated bird sound ID

R 21 1 Updated Jul 21, 2025

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,724 1,379 Updated Dec 6, 2023

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,401 3,113 Updated Nov 4, 2025

BirdNET analyzer for scientific audio data processing.

Python 1,285 223 Updated Nov 3, 2025

If you build software, keep a changelog.

Haml 6,401 3,588 Updated Oct 10, 2025

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Python 3,986 810 Updated Jul 5, 2024

Chinese text normalization for speech processing

Python 711 149 Updated Mar 18, 2023

A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) user agent strings.

Python 1,497 196 Updated Feb 16, 2023

This project focuses on developing a graphical interface in Django for tools for synthesizing speech developed by @stefantaubert.

JavaScript 1 Updated Feb 11, 2024

A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.

Python 264 68 Updated Nov 28, 2022

Universal Command Line Interface for Amazon Web Services

Python 16,484 4,390 Updated Nov 4, 2025

a MUSHRA compliant web audio API based experiment software

JavaScript 401 159 Updated Sep 26, 2025

Official repository for Citation Style Language (CSL) citation styles.

Ruby 3,610 3,988 Updated Nov 2, 2025

汉字转拼音(pypinyin)

Python 5,192 624 Updated Oct 6, 2025
Next