Stars
rezaakb / peft-vit
Forked from bwconrad/vit-finetuneParameter Efficient Fine-tuning of Self-supervised ViTs without Catastrophic Forgetting
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Official inference repo for FLUX.1 models
This is our own implementation of 'Layer Selective Rank Reduction'
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch
Suno AI's Bark model in C/C++ for fast text-to-speech generation
MARS5 speech model (TTS) from CAMB.AI
ImageBind One Embedding Space to Bind Them All
Create automatic playlists by using Deep Learning to *listen* to the music.
Letta is the platform for building stateful agents: AI with advanced memory that can learn and self-improve over time.
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
π¦π The platform for reliable agents.
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
Windows compile of bitsandbytes for use in text-generation-webui.
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
A high-throughput and memory-efficient inference and serving engine for LLMs
π Text-Prompted Generative Audio Model
Tools for merging pretrained large language models.
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
Simple next-token-prediction for RLHF