Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View thelinhbkhn2014's full-sized avatar

Block or report thelinhbkhn2014

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,436 176 Updated Mar 28, 2025

LLM101n: Let's build a Storyteller

35,418 1,927 Updated Aug 1, 2024

OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training

Python 542 46 Updated Jan 13, 2025

Pioneering in Vietnamese Multimodal Large Language Model

Python 53 5 Updated Jan 23, 2025

Macaw-LLM: Multi-Modal Language Modeling with Image, Video, Audio, and Text Integration

Python 1,586 132 Updated Jan 1, 2025

A Framework for Speech, Language, Audio, Music Processing with Large Language Model

Python 910 95 Updated Oct 24, 2025

Sentence aligner

C++ 118 39 Updated May 21, 2021

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,610 7,450 Updated Oct 30, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 66,946 7,116 Updated Nov 2, 2025

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,439 428 Updated Sep 13, 2024

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,741 257 Updated May 17, 2025

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,179 333 Updated Sep 10, 2025

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,958 790 Updated Feb 11, 2024

Crawl a site to generate knowledge files to create your own custom GPT from a URL

TypeScript 21,997 2,388 Updated Jul 7, 2025

PhoGPT: Generative Pre-training for Vietnamese (2023)

Python 795 73 Updated Nov 12, 2024

Explanation to key concepts in ML

8,108 662 Updated Jun 30, 2025

State-of-the-Art Text Embeddings

Python 17,810 2,705 Updated Oct 22, 2025

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)

Python 253 27 Updated Sep 30, 2025

Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)

Python 426 34 Updated Jul 23, 2024

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 342 40 Updated Jul 22, 2024

PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)

Python 150 19 Updated Dec 31, 2024

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,724 1,379 Updated Dec 6, 2023

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,989 412 Updated May 10, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,198 4,037 Updated Jul 17, 2024

A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)

22 2 Updated Jun 5, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,966 31,020 Updated Nov 2, 2025

Source Code for IJCAI 2020 paper "On the Importance of Word and Sentence Representation Learning in Implicit Discourse Relation Classification"

Python 20 7 Updated Jan 6, 2022

PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)

46 4 Updated Jun 3, 2025
Next