
gpt-2

Here are 1,044 public repositories matching this topic...

BlinkDL / RWKV-LM

RWKV (pronounced RwaKuv) is an RNN with great LLM performance that can also be trained directly like a GPT transformer (parallelizable). The current version is RWKV-7 "Goose". It combines the best of RNNs and transformers: great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.

deep-learning transformers pytorch transformer lstm rnn gpt language-model attention-mechanism gpt-2 gpt-3 linear-attention rwkv chatgpt

Updated May 8, 2026
Python

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

deep-learning pytorch lora language-model adaptation roberta low-rank gpt-2 gpt-3 deberta

Updated Dec 17, 2024
Python

NielsRogge / Transformers-Tutorials

This repository contains demos I made with the Transformers library by HuggingFace.

transformers pytorch bert gpt-2 layoutlm vision-transformer

Updated Apr 20, 2026
Jupyter Notebook

codota / TabNine

AI Code Completions

Updated Sep 4, 2025
Shell

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

transformers generative-model image-generation auto-regressive-model gpt neurips gpt-2 diffusion-models autoregressive-models vision-transformer large-language-models generative-ai

Updated Nov 10, 2025
Jupyter Notebook

EleutherAI / gpt-neo

An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.

transformers gpt language-model gpt-2 gpt-3

Updated Feb 25, 2022
Python

Morizeyao / GPT2-Chinese

Chinese version of GPT2 training code, using BERT tokenizer.

nlp text-generation transformer chinese gpt-2

Updated Apr 25, 2024
Python

lonePatient / awesome-pretrained-chinese-nlp-models

Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models

nlp dataset chinese gpt pretrained-models pangu bert multimodel roberta gpt-2 ernie xlnet nezha nlu-nlg simbert large-language-models llm

Updated May 15, 2026
Python

jaymody / picoGPT

An unnecessarily tiny implementation of GPT-2 in NumPy.

python nlp machine-learning deep-learning neural-network gpt gpt-2 large-language-models

Updated Apr 24, 2023
Python

dbiir / UER-py

Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo

Updated May 9, 2024
Python

guillaume-be / rust-bert

Rust native ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)

nlp rust machine-learning translation deep-learning sentiment-analysis transformer rust-lang question-answering bart gpt ner bert language-generation electra roberta gpt-2

Updated Jan 13, 2026
Rust

yangjianxin1 / GPT2-chitchat

GPT2 for Chinese chitchat (implements the MMI idea from DialoGPT)

nlp text-generation transformer gpt-2 gpt2 dialogpt chichat dialogue-model

Updated Oct 30, 2023
Python

stochasticai / xTuring

Build, personalize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6

adapter deep-learning llama lora quantization language-model mistral fine-tuning peft finetuning mixed-precision gpt-2 gpt-j llm generative-ai gen-ai

Updated Mar 4, 2026
Python

microsoft / DialoGPT

Large-scale pretraining for dialogue

machine-learning dialogue text-generation pytorch transformer data-processing text-data gpt-2 dialogpt

Updated Oct 17, 2022
Python

asyml / texar

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

python machine-learning natural-language-processing deep-learning tensorflow machine-translation text-generation data-processing bert text-data dialog-systems gpt-2 texar xlnet casl-project

Updated Aug 26, 2021
Python

BrikerMan / Kashgari

Kashgari is a production-level NLP transfer learning framework built on top of tf.keras for text labeling and text classification. It includes Word2Vec, BERT, and GPT2 language embeddings.

nlp machine-learning text-classification named-entity-recognition seq2seq transfer-learning ner bert sequence-labeling nlp-framework bert-model text-labeling gpt-2

Updated Sep 3, 2024
Python

lxe / simple-llm-finetuner

Simple UI for LLM Model Finetuning

ai pytorch llama peft gpt-2 huggingface huggingface-transformers gpt-3 llm

Updated Dec 21, 2023
Jupyter Notebook

guinmoon / LLMFarm

Run Llama and other large language models offline on iOS and macOS using the GGML library.

macos swift ios ai llama gpt-2 rwkv ggml gptneox starcoder

Updated Jan 30, 2026
C

thu-coai / CDial-GPT

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

dialogue text-generation pytorch gpt gpt-2 lccc

Updated Jun 12, 2023
Python

VHellendoorn / Code-LMs

Guide to using pre-trained large language models of source code

deep-learning source-code gpt-2

Updated Jul 7, 2024
Python
