beckgom

😆

Young Han Lee beckgom

😆

Speech Research @ KETI

12 followers · 18 following

Achievements

Highlights

moltbot Public
Forked from openclaw/openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript MIT License Updated Jan 28, 2026
MSST-WebUI Public
Forked from SUC-DriverOld/MSST-WebUI

A WebUI app for Music-Source-Separation-Training and we packed UVR together!

Python GNU Affero General Public License v3.0 Updated Apr 2, 2025
Applio Public
Forked from IAHispano/Applio

A simple, high-quality voice conversion tool focused on ease of use and performance.

Python MIT License Updated Mar 30, 2025
DeepSeek-VL2 Public
Forked from deepseek-ai/DeepSeek-VL2

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python MIT License Updated Jan 29, 2025
DeepSeek-V3 Public
Forked from deepseek-ai/DeepSeek-V3

Python MIT License Updated Jan 26, 2025
TalkingGaussian Public
Forked from Fictionarry/TalkingGaussian

[ECCV'24] TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

Python Updated Nov 26, 2024
F5-TTS Public
Forked from SWivid/F5-TTS

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python MIT License Updated Oct 16, 2024
SLAM-LLM Public
Forked from X-LANCE/SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python MIT License Updated Oct 5, 2024
fish-speech Public
Forked from fishaudio/fish-speech

Brand new TTS solution

Python Other Updated Sep 16, 2024
3dgs-avatar-release Public
Forked from mikeqzy/3dgs-avatar-release

3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting

Python MIT License Updated Sep 10, 2024
webMUSHRA Public
Forked from audiolabs/webMUSHRA

a MUSHRA compliant web audio API based experiment software

JavaScript Other Updated Aug 9, 2024
VoiceCraft Public
Forked from jasonppy/VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook Other Updated Apr 3, 2024
HierSpeechpp Public
Forked from sh-lee-prml/HierSpeechpp

The official implementation of HierSpeech++

Python MIT License Updated Feb 20, 2024
vall-e Public
Forked from lifeiteng/vall-e

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python Apache License 2.0 Updated Feb 1, 2024
MiniGPT-4 Public
Forked from Vision-CAIR/MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2

Python BSD 3-Clause "New" or "Revised" License Updated Oct 13, 2023
sherpa Public
Forked from k2-fsa/sherpa

Speech-to-text server framework with next-gen Kaldi

C++ Apache License 2.0 Updated Aug 16, 2023
bark Public
Forked from suno-ai/bark

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook MIT License Updated May 4, 2023
audiolm-pytorch Public
Forked from lucidrains/audiolm-pytorch

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python MIT License Updated Mar 20, 2023
tuning_playbook Public
Forked from google-research/tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

Other Updated Feb 2, 2023
CLAP Public
Forked from LAION-AI/CLAP

Contrastive Language-Audio Pretraining

Python Creative Commons Zero v1.0 Universal Updated Jan 24, 2023
korean-romanizer Public
Forked from osori/korean-romanizer

A Python library for Korean romanization

Python Other Updated Jan 17, 2023
beckgom.github.com Public

HTML MIT License Updated Dec 5, 2022
UUVC Public
Forked from b04901014/UUVC

Python MIT License Updated Nov 26, 2022
vdm Public
Forked from google-research/vdm

Jupyter Notebook Apache License 2.0 Updated Sep 20, 2022
FACEGOOD-Audio2Face Public
Forked from FACEGOOD/FACEGOOD-Audio2Face

http://www.facegood.cc

Python MIT License Updated Jun 25, 2022
gpu-burn Public
Forked from wilicc/gpu-burn

Multi-GPU CUDA stress test

C++ BSD 2-Clause "Simplified" License Updated Jun 2, 2022
photometric_optimization Public
Forked from HavenFeng/photometric_optimization

Photometric optimization code for creating the FLAME texture space and other applications

Python MIT License Updated Mar 31, 2022
Speech-Backbones Public
Forked from huawei-noah/Speech-Backbones

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Python Updated Mar 6, 2022
wenet Public
Forked from wenet-e2e/wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

C++ Apache License 2.0 Updated Nov 24, 2021
YOLOX_AUDIO Public
Forked from intflow/YOLOX_AUDIO

Audio event detection model based on YOLOX

Python Apache License 2.0 Updated Nov 16, 2021

Young Han Lee beckgom

Achievements

Achievements

Highlights

moltbot Public

Uh oh!

MSST-WebUI Public

Uh oh!

Applio Public

Uh oh!

DeepSeek-VL2 Public

Uh oh!

DeepSeek-V3 Public

Uh oh!

TalkingGaussian Public

Uh oh!

F5-TTS Public

Uh oh!

SLAM-LLM Public

Uh oh!

fish-speech Public

Uh oh!

3dgs-avatar-release Public

Uh oh!

webMUSHRA Public

Uh oh!

VoiceCraft Public

Uh oh!

HierSpeechpp Public

Uh oh!

vall-e Public

Uh oh!

MiniGPT-4 Public

Uh oh!

sherpa Public

Uh oh!

bark Public

Uh oh!

audiolm-pytorch Public

Uh oh!

tuning_playbook Public

Uh oh!

CLAP Public

Uh oh!

korean-romanizer Public

Uh oh!

beckgom.github.com Public

Uh oh!

UUVC Public

Uh oh!

vdm Public

Uh oh!

FACEGOOD-Audio2Face Public

Uh oh!

gpu-burn Public

Uh oh!

photometric_optimization Public

Uh oh!

Speech-Backbones Public

Uh oh!

wenet Public

Uh oh!

YOLOX_AUDIO Public

Uh oh!