Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View Eric-mingjie's full-sized avatar

Block or report Eric-mingjie

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code release for "Idiosyncrasies in Large Language Models"

Python 52 7 Updated Jul 21, 2025

Make huge neural nets fit in memory

Python 2,827 278 Updated Apr 26, 2020

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,750 272 Updated Jul 18, 2025

Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI

Python 293 24 Updated Jun 3, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 50,792 4,193 Updated Jan 16, 2026

Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]

Python 77 11 Updated Jan 23, 2025
Python 30 1 Updated Jul 22, 2024
113 1 Updated Mar 14, 2024

Code accompanying the paper "Massive Activations in Large Language Models"

Python 193 13 Updated Mar 4, 2024

Diffusion Classifier leverages pretrained diffusion models to perform zero-shot classification without additional training

Python 487 43 Updated Feb 28, 2024

[NeurIPS D&B '25] The one-stop repository for LLM unlearning

Python 464 122 Updated Dec 24, 2025

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,394 68 Updated Aug 4, 2025
Python 191 12 Updated Sep 26, 2024

PyTorch code and models for the DINOv2 self-supervised learning method.

Jupyter Notebook 12,250 1,155 Updated Dec 22, 2025

Solve puzzles. Learn CUDA.

Jupyter Notebook 11,891 917 Updated Sep 1, 2024

Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"

Python 78 8 Updated Jul 7, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 52,072 8,747 Updated Nov 12, 2025

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,508 258 Updated Aug 13, 2024

A simple and effective LLM pruning approach.

Python 841 122 Updated Aug 9, 2024

Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".

Python 165 13 Updated May 7, 2024
Python 1,630 148 Updated Apr 27, 2023

Benchmarking large language models' complex reasoning ability with chain-of-thought prompting

Jupyter Notebook 2,766 145 Updated Aug 4, 2024

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Python 5,792 596 Updated Jan 16, 2025

Test-Time Adaptation via Conjugate Pseudo-Labels

Python 42 3 Updated May 25, 2023

Tools for understanding how transformer predictions are built layer-by-layer

Python 561 61 Updated Aug 7, 2025

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,915 368 Updated Dec 7, 2024

A prize for finding tasks that cause large language models to show inverse scaling

620 27 Updated Oct 11, 2023

DeblurSR: Event-Based Motion Deblurring Under the Spiking Representation (AAAI 2024)

Python 29 1 Updated Nov 8, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 41,281 4,683 Updated Jan 17, 2026
Next