Stars
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
AgentLLM is a PoC for browser-native autonomous agents
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
Master programming by recreating your favorite technologies from scratch.
Get your documents ready for gen AI
UQLM: Uncertainty Quantification for Language Models, is a Python package for UQ-based LLM hallucination detection
Hide your scrapers IP behind the cloud. Provision proxy servers across different cloud providers to improve your scraping success.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
A fully featured Material UI V5 implementation of TanStack React Table V8, written from the ground up in TypeScript
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
This is a repo with links to everything you'd ever want to learn about data engineering
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Python Linter for performance anti patterns
Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)
DSPy: The framework for programming—not prompting—language models
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
Supercharge Your LLM Application Evaluations 🚀
Rich is a Python library for rich text and beautiful formatting in the terminal.
Convert Ingress resources to Gateway API resources
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.