Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View lisiG9's full-sized avatar

Block or report lisiG9

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,675 856 Updated Jan 12, 2026

一套代码指令微调大模型

Python 38 3 Updated Aug 1, 2023
Python 8 2 Updated Oct 23, 2023

The code and data for GrammarGPT.

Python 178 8 Updated Oct 10, 2023

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 65,449 7,952 Updated Jan 11, 2026

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

Jupyter Notebook 474 42 Updated Apr 21, 2024

[ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Generic Plug-In".

Python 60 5 Updated Jul 12, 2024

超长文本分类(大于1000字);文档级/篇章级文本分类;主要是解决长距离依赖问题

Python 131 31 Updated Oct 9, 2021

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1,589 100 Updated Jun 3, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

22,062 2,098 Updated May 19, 2025

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,350 4,776 Updated Jun 2, 2025

TBC

Python 28 1 Updated Nov 2, 2022

ChatGPT 中文调教指南。各种场景使用指南。学习怎么让它听你的话。

57,825 13,583 Updated Jan 1, 2025

Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 37,044 6,119 Updated Nov 10, 2025

PyTorch分类网络:Python训练_测试_模型转换 && Windows_LibTorch_C++部署

Python 19 4 Updated Sep 16, 2021

专注于中文领域大语言模型,落地到某个行业某个领域,成为一个行业大模型、公司级别或行业级别领域大模型。

Python 126 16 Updated Mar 6, 2025

精选 OpenAI 的 [ChatGPT](https://chat.openai.com) 资源清单, 跟随最新资源并添加中文相关Work

683 66 Updated Apr 22, 2023

使用Bert,ERNIE,进行中文文本分类

Python 4,380 929 Updated Jun 28, 2024

CCL 2023 电信网络诈骗案件分类评测baseline

Python 3 Updated May 3, 2023

中文图书语料MD5链接

Python 218 23 Updated Jan 31, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,971 1,870 Updated Jul 15, 2025

中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。

Python 4,559 794 Updated Nov 21, 2023

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,846 1,556 Updated Sep 8, 2025

记录本人整理的一些数据集

1,083 135 Updated Jun 16, 2022

中文对话数据清洗

Python 32 7 Updated Nov 8, 2022

异常文本处理,移除异常空格、换行,英文标点符号替换成中文标点,去除乱码,全角字符转半角等

Python 7 2 Updated May 26, 2022

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

4,084 285 Updated Jan 3, 2026

Punctuation Restoration using Transformer Models for High-and Low-Resource Languages

Python 227 65 Updated Jul 29, 2024

text correction papers

314 18 Updated Jan 23, 2024

📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)

Python 753 84 Updated Dec 21, 2024
Next