Thanks to visit codestin.com
Credit goes to Github.com

Skip to content
View hankyul2's full-sized avatar
🐛
Bugging since 2019/04/30
🐛
Bugging since 2019/04/30

Highlights

  • Pro

Organizations

@Algostu @SWCapstone2021

Block or report hankyul2

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official code repository for "Understanding the Performance Behaviors of End-to-End Protein Design Pipelines on GPUs [IEEE CAL 25]"

Python 4 Updated Jan 11, 2026

Code repo for the paper "SpinQuant LLM quantization with learned rotations"

Python 372 65 Updated Feb 14, 2025

BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.

Python 747 57 Updated Aug 6, 2025

An official implementation of "Scheduling Weight Transitions for Quantization-Aware Training" (ICCV 2025) in PyTorch.

Python 58 13 Updated Nov 17, 2025

[WACV'26] ForestSplats: Deformable transient field for Gaussian Splatting in the Wild

Python 11 1 Updated Dec 7, 2025

PyTorch implementation of quantization-aware matrix factorization (QMF) for data compression

Jupyter Notebook 15 5 Updated Jul 14, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 9,153 1,641 Updated Jan 23, 2026
Python 37 2 Updated Oct 17, 2025

C++ extensions in PyTorch

Python 1,179 250 Updated Jan 13, 2026

Activation-aware Singular Value Decomposition for Compressing Large Language Models

Python 84 16 Updated Oct 22, 2024

[ICML 2024] BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

Python 230 17 Updated Jan 11, 2025
Python 577 50 Updated Oct 29, 2024

Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"

Python 394 35 Updated Feb 24, 2024

PB-LLM: Partially Binarized Large Language Models

Python 157 8 Updated Nov 20, 2023

Official code repository for "Pimba: A Processing-in-Memory Acceleration for Post-Transformer Large Language Model Serving [MICRO'25]"

Python 22 2 Updated Oct 23, 2025

Implementation of LPLR algorithm for matrix compression

Python 31 1 Updated Nov 21, 2023

Less is Enough: Training-Free Video Diffusion Acceleration via Runtime-Adaptive Caching

Python 278 4 Updated Aug 29, 2025

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

Python 582 78 Updated Nov 12, 2025
Python 3 1 Updated Nov 25, 2025

Official code repository for "Déjà Vu: Efficient Video-Language Query Engine with Learning-based Inter-Frame Computation Reuse [VLDB 25]"

Python 10 1 Updated Jun 17, 2025

A python library for self-supervised learning on images.

Python 3,667 317 Updated Dec 19, 2025
Python 80 13 Updated May 23, 2025

[ACL 2025 Main] EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 327 25 Updated Nov 26, 2025

[ICLR 2025] Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models

Python 72 2 Updated Mar 29, 2025

Official PyTorch implementation of QA-LoRA

Python 145 12 Updated Mar 13, 2024
Python 234 23 Updated Jun 11, 2024

[LPCVC2025] Official PyTorch implementation of the 2025 IEEE Low-Power Computer Vision Challenge Track1 Winner at the CVPR 2025 Workshop.

Python 2 Updated Apr 28, 2025

[EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization

Python 38 2 Updated Sep 24, 2024

[ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retention

Python 67 4 Updated Apr 15, 2024

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python 51,099 4,220 Updated Jan 23, 2026
Next