Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View nbasyl's full-sized avatar
🌵
I am Groot
🌵
I am Groot

Highlights

  • Pro

Block or report nbasyl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. NVlabs/DoRA NVlabs/DoRA Public

    [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

    Python 878 60

  2. NVlabs/DLER NVlabs/DLER Public

    DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

    Python 8 1

  3. NVlabs/EoRA NVlabs/EoRA Public

    EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

    Python 26 2

  4. LLM-FP4 LLM-FP4 Public

    The official implementation of the EMNLP 2023 paper LLM-FP4

    Python 217 21

  5. OFQ OFQ Public

    The official implementation of the ICML 2023 paper OFQ-ViT

    Python 33 1

  6. ModelCloud/GPTQModel ModelCloud/GPTQModel Public

    LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

    Python 874 127