slavaGanzin

Stars

llm

14 repositories

Interact with your documents using the power of GPT, 100% privately, no data leaks

Python · 56,915 stars · 7,591 forks · Updated Nov 13, 2024

An open source multi-tool for exploring and publishing data

Python · 10,605 stars · 797 forks · Updated Dec 17, 2025

HAAS = Hierarchical Autonomous Agent Swarm - "Resistance is futile!"

Python · 3,090 stars · 397 forks · Updated Feb 16, 2024

QLoRA: Efficient Finetuning of Quantized LLMs

Jupyter Notebook · 10,789 stars · 869 forks · Updated Jun 10, 2024

Chat language model that can use tools and interpret the results

Python · 1,588 stars · 118 forks · Updated Dec 3, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 65,780 stars · 12,069 forks · Updated Dec 19, 2025

Fabric is an open-source framework for augmenting humans using AI. It provides a modular system for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Go · 35,284 stars · 3,601 forks · Updated Dec 19, 2025

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Python · 3,087 stars · 423 forks · Updated Apr 2, 2025

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Python · 1,207 stars · 77 forks · Updated Oct 30, 2025

A neurosymbolic perspective on LLMs

Python · 1,645 stars · 83 forks · Updated Dec 19, 2025

[EMNLP'23, ACL'24] To speed up LLM inference and enhance LLMs' perception of key information, compress the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.

Python · 5,693 stars · 339 forks · Updated Oct 28, 2025

The official Meta Llama 3 GitHub site

Python · 29,141 stars · 3,502 forks · Updated Jan 26, 2025

Model API for GALACTICA

Jupyter Notebook · 2,742 stars · 269 forks · Updated Mar 5, 2023

Perplexica is an AI-powered answering engine. It is an open-source alternative to Perplexity AI.

TypeScript · 27,752 stars · 2,897 forks · Updated Dec 19, 2025