Stars
A collection of papers on discrete diffusion models
✨✨Latest Advances on Multimodal Large Language Models
verl: Volcano Engine Reinforcement Learning for LLMs
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM
Best practice for training LLaMA models in Megatron-LM
Reference implementation for DPO (Direct Preference Optimization)
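A curated list of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are inexpensive to train, covering base models, vertical-domain fine-tuning and applications, datasets, and tutorials.
Chinese LLaMA & Alpaca LLMs, with local CPU/GPU training and deployment
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)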
Making large AI models cheaper, faster and more accessible
Implementation of benchmark RL algorithms
Large Scale Chinese Corpus for NLP
Code for the ACL 2018 paper "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting" (Chen and Bansal)
Attention Guided Graph Convolutional Networks for Relation Extraction (authors' PyTorch implementation for the ACL19 paper)
This is the official clone for the implementation of the NIPS18 paper Multi-Layered Gradient Boosting Decision Trees (mGBDT).
Named Entity Recognition (LSTM + CRF) - Tensorflow
Implementation of model compression using knowledge distillation.
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to …
Code Samples from Neural Networks for NLP