Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View shamio's full-sized avatar

Block or report shamio

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Lord of Large Language and Multi modal Systems Web User Interface

CSS 4,763 579 Updated Oct 21, 2025

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

C++ 8,792 573 Updated Oct 26, 2025

The definitive Web UI for local AI, with powerful features and easy setup.

Python 45,231 5,823 Updated Oct 23, 2025

Official implementation for 'Extending LLMs’ Context Window with 100 Samples'

Python 80 3 Updated Jan 18, 2024
Jupyter Notebook 66 8 Updated Jul 24, 2024

[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

Python 256 21 Updated Dec 16, 2024

Modeling, training, eval, and inference code for OLMo

Python 6,055 664 Updated Oct 24, 2025

An innovative library for efficient LLM inference via low-bit quantization

C++ 349 39 Updated Aug 30, 2024

⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡

Python 2,164 215 Updated Oct 8, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 7,094 391 Updated Jul 11, 2024

High-speed Large Language Model Serving for Local Deployment

C++ 8,369 448 Updated Aug 2, 2025

Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.

Python 18,924 1,964 Updated Oct 24, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,403 3,878 Updated Oct 26, 2025

This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and benchmark tasks that evaluate a model’s information retrieval cap…

Python 596 45 Updated Nov 17, 2023

Web UI for ExLlamaV2

JavaScript 511 47 Updated Feb 5, 2025

Tools for merging pretrained large language models.

Python 6,398 624 Updated Sep 17, 2025

Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

Python 277 23 Updated Nov 3, 2023

LLM inference in C/C++

C++ 88,312 13,432 Updated Oct 26, 2025

Explore large language models in 512MB of RAM

HTML 1,196 81 Updated Jul 25, 2025