Stars
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML
Anti-spam bot for Telegram and general-purpose anti-spam library and server
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Outstanding Paper Honorable Mention
Full finetuning of large language models without large memory requirements
Fully open reproduction of DeepSeek-R1
Open source project for data preparation for GenAI applications
[NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models
Code for Adam-mini: Use Fewer Learning Rates To Gain More (https://arxiv.org/abs/2406.16793)
Together Mixture-of-Agents (MoA): 65.1% on AlpacaEval with open-source models
Collaborative Training of Large Language Models in an Efficient Way
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Composable building blocks to build LLM Apps
Distily: Language Model Distillation Toolkit and Library
🦁 Lion, a new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(W), in PyTorch
A scalable generative AI framework built for researchers and developers working on Large Language Models, multimodal models, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
PyTorch native quantization and sparsity for training and inference
g1: Using Llama 3.1 70B on Groq to create o1-like reasoning chains
EvolKit: a framework for automatically increasing the complexity of instructions used to fine-tune Large Language Models (LLMs)
Advanced quantization toolkit for LLMs and VLMs, supporting WOQ, MXFP4, NVFP4, GGUF, and adaptive schemes, with seamless integration with Transformers, vLLM, SGLang, and llm-compressor
Minimalistic large language model 3D-parallelism training
Efficient Triton Kernels for LLM Training