RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs.

TypeScript 66,946 7,116 Updated Nov 2, 2025

imoneoi / openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,439 428 Updated Sep 13, 2024

AnswerDotAI / RAGatouille

Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.

Python 3,741 257 Updated May 17, 2025

lifeiteng / vall-e

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Python 2,179 333 Updated Sep 10, 2025

Plachtaa / VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io/vallex/

Python 7,958 790 Updated Feb 11, 2024

BuilderIO / gpt-crawler

Crawl a site to generate knowledge files to create your own custom GPT from a URL

TypeScript 21,997 2,388 Updated Jul 7, 2025

VinAIResearch / PhoGPT

PhoGPT: Generative Pre-training for Vietnamese (2023)

Python 795 73 Updated Nov 12, 2024

dair-ai / ML-Papers-Explained

Explanations of key concepts in ML

8,108 662 Updated Jun 30, 2025

huggingface / sentence-transformers

State-of-the-Art Text Embeddings

Python 17,810 2,705 Updated Oct 22, 2025

VinAIResearch / Anti-DreamBooth

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)

Python 253 27 Updated Sep 30, 2025

VinAIResearch / WaveDiff

Official PyTorch implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)

Python 426 34 Updated Jul 23, 2024

VinAIResearch / XPhoneBERT

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 342 40 Updated Jul 22, 2024

VinAIResearch / PhoNLP

PhoNLP: A BERT-based multi-task learning model for part-of-speech tagging, named entity recognition and dependency parsing (NAACL 2021)

Python 150 19 Updated Dec 31, 2024

jaywalnut310 / vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Python 7,724 1,379 Updated Dec 6, 2023

thelinhbkhn2014 / Text2PhonemeSequence

Python 51 13 Updated Aug 28, 2024

enhuiz / vall-e

An unofficial PyTorch implementation of the audio LM VALL-E

Python 2,989 412 Updated May 10, 2023

tatsu-lab / stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,198 4,037 Updated Jul 17, 2024

VinAIResearch / PhoST

A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022)

22 2 Updated Jun 5, 2025

thelinhbkhn2014 / VnCoreNLP_Wrapper

Python 25 6 Updated Aug 28, 2024

huggingface / transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 151,966 31,020 Updated Nov 2, 2025

HKUST-KnowComp / BMGF-RoBERTa

Source Code for IJCAI 2020 paper "On the Importance of Word and Sentence Representation Learning in Implicit Discourse Relation Classification"

Python 20 7 Updated Jan 6, 2022

VinAIResearch / PhoMT

PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)

46 4 Updated Jun 3, 2025


Linh The Nguyen thelinhbkhn2014


Stars

VITA-MLLM / VITA

karpathy / LLM101n

PrimeIntellect-ai / OpenDiloco

baochi0212 / LaVy

lyuchenyang / Macaw-LLM

X-LANCE / SLAM-LLM

danielvarga / hunalign

hiyouga / LLaMA-Factory

infiniflow / ragflow