Stars
Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
GenAI Agent Framework, the Pydantic way
A unified framework for leveraging LLMs
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, or on-prem).
Official inference library for Mistral models
Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end-to-end problems using Llama mode…
A web application which scrapes several German real estate listing websites and visualizes offers in a map.
A library for efficient similarity search and clustering of dense vectors.
Tips and tricks for working with Large Language Models like OpenAI's GPT-4.
flathunters / flathunter
Forked from mordax7/flathunter
A bot to help people with their rental real-estate search. 🏠🤖
Fine tune a T5 transformer model using PyTorch & Transformers🤗
A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.
NeuSpell: A Neural Spelling Correction Toolkit
Using 30,000 hand-graded Wikipedia articles and NLP to predict the quality of Wikipedia articles and create a knowledge graph that identifies both articles and topics in need of editorial and epist…
Arabic Dictionary for Morphological analysis
AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding mod…
A curated list of resources for Document Understanding (DU) topic
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Public repo for DeepLearning.AI MLEP Specialization
Repository for Data Engineering Course
Evidently is an open-source ML and LLM observability framework. Evaluate, test, and monitor any AI-powered system or data pipeline. From tabular data to Gen AI. 100+ metrics.
S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/