Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View diagomike's full-sized avatar

Block or report diagomike

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Free and Unlimited Google translate API for node.js

TypeScript 34 8 Updated Sep 14, 2025

Fetch transcript from a youtube video

TypeScript 484 101 Updated Jul 30, 2024

This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…

Python 6,366 665 Updated Oct 13, 2025

A simple api for google translate

Python 291 80 Updated Jan 2, 2021

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,506 1,982 Updated Oct 27, 2025

Build full-stack Next.js apps, incredibly fast

TypeScript 2,792 129 Updated Apr 11, 2025

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Python 124 45 Updated Jun 16, 2022

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Python 142 33 Updated Aug 22, 2022

Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions

Python 262 55 Updated Jan 13, 2025

Official Implementation of StyleTTS-VC

Python 191 27 Updated Jan 14, 2025

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Python 511 112 Updated Jan 13, 2025

Inference code for Llama models

Python 58,884 9,816 Updated Jan 26, 2025

SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs

16 1 Updated Jul 19, 2023

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 6,019 627 Updated Aug 10, 2024

Official Implementation of StyleTTS

Python 453 67 Updated Jan 13, 2025

Whisper combined with Silero VAD, for improved long-form transcriptions

Jupyter Notebook 53 6 Updated Dec 11, 2022

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,143 11,285 Updated Sep 8, 2025

RegExr is a HTML/JS based tool for creating, testing, and learning about Regular Expressions.

JavaScript 10,248 1,006 Updated Jul 17, 2025

Windows desktop front end for Spleeter - AI source separation

C# 2,590 270 Updated Oct 7, 2023

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 43,222 5,726 Updated Aug 16, 2024

Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration

JavaScript 806 114 Updated Mar 16, 2023

This repository contains all the code I use in my YouTube tutorials.

Python 434 221 Updated Mar 24, 2022