Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Duguce's full-sized avatar
☘️
Focusing
☘️
Focusing

Block or report Duguce

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Train transformer language models with reinforcement learning.

Python 15,993 2,243 Updated Oct 23, 2025

Ongoing research training transformer models at scale

Python 13,930 3,179 Updated Oct 24, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph…

Python 10,612 918 Updated Oct 23, 2025

Paper list for Efficient Reasoning.

705 25 Updated Oct 20, 2025

dInfer: An Efficient Inference Framework for Diffusion Language Models

Python 245 20 Updated Oct 22, 2025

Structured Outputs

Python 12,737 643 Updated Oct 15, 2025

structured outputs for llms

Python 11,682 877 Updated Oct 23, 2025

The official GitHub repository of the paper "Recent advances in large langauge model benchmarks against data contamination: From static to dynamic evaluation"

45 2 Updated Sep 13, 2025

"what, how, where, and how well? a survey on test-time scaling in large language models" repository

HTML 73 2 Updated Oct 24, 2025
Python 2,542 306 Updated May 19, 2024

[ACMMM 2025] Official Code of DetectAnyLLM: Towards Generalizable and Robust Detection of Machine-Generated Text Across Domains and Models

Python 14 Updated Sep 23, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,494 2,046 Updated May 19, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 31,363 3,608 Updated Oct 23, 2025

A resource repository for machine unlearning in large language models

498 29 Updated Jul 20, 2025

Dream 7B, a large diffusion language model

Python 1,027 57 Updated Sep 26, 2025

Awesome LLM pre-training resources, including data, frameworks, and methods.

269 17 Updated Apr 29, 2025

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details.

Python 217 17 Updated Jul 25, 2025

Minimalistic large language model 3D-parallelism training

Python 2,269 251 Updated Sep 3, 2025

A Simple Framework of Small-scale LMMs for Video Understanding

Python 95 6 Updated Jun 11, 2025

A Framework of Small-scale Large Multimodal Models

Python 911 94 Updated Apr 26, 2025
Python 33 1 Updated Oct 4, 2025

MLNLP: Paper Picture Writing Code

TeX 1,195 120 Updated Nov 5, 2022

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 38,185 8,707 Updated Oct 19, 2025

A powerful toolkit for compressing large models including LLM, VLM, and video generation models.

Python 597 63 Updated Aug 22, 2025
Python 17 2 Updated Feb 25, 2025

A simple and effective LLM pruning approach.

Python 810 115 Updated Aug 9, 2024

Existing Literature about Machine Unlearning

920 112 Updated Aug 29, 2025

My personal blog

HTML 1 Updated Oct 21, 2025

buaa 北航 收集优秀的buaaer创造的实用的工具和脚本

332 32 Updated Mar 5, 2025

Official repository of Automated Privacy Information Annotation in Large Language Model Interactions

Python 2 Updated Aug 4, 2025
Next