Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View yxzero's full-sized avatar

Block or report yxzero

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of papers on discrete diffusion models

166 2 Updated Jun 30, 2025

✨✨Latest Advances on Multimodal Large Language Models

16,680 1,075 Updated Nov 12, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,474 2,498 Updated Nov 13, 2025

Examples for MS-AMP package.

Shell 30 11 Updated Jul 17, 2025

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,364 811 Updated Nov 9, 2025

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

744 42 Updated Nov 5, 2025

Best practice for training LLaMA models in Megatron-LM

Python 658 56 Updated Jan 2, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,778 230 Updated Aug 11, 2024

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,672 2,059 Updated May 19, 2025

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,943 1,877 Updated Jul 15, 2025

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 8,261 767 Updated Oct 16, 2024

Making large AI models cheaper, faster and more accessible

Python 41,236 4,539 Updated Nov 13, 2025

Inference code for Llama models

Python 58,917 9,816 Updated Jan 26, 2025

Implementation of benchmark RL algorithms

Python 471 82 Updated Jul 20, 2022
Python 883 110 Updated May 24, 2024

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,806 1,560 Updated Sep 8, 2025
Python 36 3 Updated Jun 12, 2023

Code for ACL 2018 paper: "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting. Chen and Bansal"

Python 623 185 Updated Jul 23, 2020

Attention Guided Graph Convolutional Networks for Relation Extraction (authors' PyTorch implementation for the ACL19 paper)

Python 435 87 Updated Mar 22, 2022

Pytorch + NLP, 一份友好的项目实践仓库

Python 466 96 Updated Jul 30, 2019

自然语言基础模型

Python 563 205 Updated Apr 29, 2019

This is the official clone for the implementation of the NIPS18 paper Multi-Layered Gradient Boosting Decision Trees (mGBDT) .

Python 104 25 Updated Nov 19, 2018

Named Entity Recognition (LSTM + CRF) - Tensorflow

Python 1,953 702 Updated Oct 16, 2020

Implementation of model compression with knowledge distilling method.

Python 342 101 Updated Jan 3, 2017

Knowledge Distillation using Tensorflow

Python 142 47 Updated Aug 12, 2019

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

Python 6,941 2,880 Updated Jan 15, 2025

Bi-directional Attention Flow (BiDAF) network is a multi-stage hierarchical process that represents context at different levels of granularity and uses a bi-directional attention flow mechanism to …

Python 1,539 677 Updated May 31, 2023

Code Samples from Neural Networks for NLP

Python 1,318 483 Updated Jan 27, 2020
C++ 94 69 Updated Jun 19, 2022
Next