- Hugging Face
- Netherlands
- https://tomaarsen.com
- https://huggingface.co/tomaarsen
- in/tomaarsen
- @tomaarsen
- @tomaarsen.com
Starred repositories
Fast BM25 search engine with category theory abstractions
Multimodal reranker training and benchmarks
Nearly Inference Free Embeddings: make your RAG queries 500x faster
GenAI Agent Framework, the Pydantic way
Pruna is a model optimization framework built for developers, enabling you to deliver faster, more efficient models with minimal overhead.
Post-training with Tinker
Connect any LLM to your internal knowledge sources and chat with it in real time alongside your team. OSS alternative to NotebookLM, Perplexity, and Glean. Join our Discord: https://discord.gg/ejRN…
Swift Package to implement a transformers-like API in Swift
Parkiet is a 1.6B parameter Dutch text-to-speech model (TTS)
Vocabulary Trimming (VT) is a model compression technique, which reduces a multilingual LM vocabulary to a target language by deleting irrelevant tokens from its vocabulary. This repository contain…
Sparse Embedding Compression for Scalable Retrieval in Recommender Systems
A tool for generating embeddings of classes organized into an ontology
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Build, enrich, and transform datasets using AI models with no code
A massively multilingual modern encoder language model
RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
Knowledgeable Embedding: Injecting dynamically updatable entity knowledge into embeddings to enhance RAG
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Scalable, fast, and disk-friendly vector search in Postgres, the successor of pgvecto.rs.
Data transformation framework for AI. Ultra performant, with incremental processing. 🌟 Star if you like it!