Stars
Collection of OCR-related Python tools and wrappers from @OCR-D
Questions helping you determine your Cloud Sovereignty Score based on the EU's Cloud Sovereignty Framework (unofficial)
The 500 AI Agents Projects is a curated collection of AI agent use cases across various industries. It showcases practical applications and provides links to open-source projects for implementation…
Free Weather Forecast API for non-commercial use
Hands-on lab to try all Trident's features & architectures
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
The Prometheus monitoring system and time series database.
Open-source AI agents for penetration testing
AWS for Bioinformatics Researchers
GCP for Bioinformatics Researchers
Study resources for learning quantum computing
Deepnote is a drop-in replacement for Jupyter with an AI-first design, sleek UI, new blocks, and native data integrations. Use Python, R, and SQL locally in your favorite IDE, then scale to Deepnot…
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
All Algorithms implemented in Python
Portable file server with accelerated resumable uploads, dedup, WebDAV, FTP, TFTP, zeroconf, media indexer, thumbnails++ all in one file, no deps
Detect compliance and security violations across Infrastructure as Code to mitigate risk before provisioning cloud native infrastructure.
This GitHub repository contains comprehensive code samples and automation scripts for FSx for NetApp ONTAP operations, promoting the use of Infrastructure as Code (IaC) tools and encouraging develo…
Terragrunt is a flexible orchestration tool that allows Infrastructure as Code written in OpenTofu/Terraform to scale.
A high-throughput and memory-efficient inference and serving engine for LLMs
AG-UI: the Agent-User Interaction Protocol. Bring Agents into Frontend Applications.
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
This is a repo with links to everything you'd ever want to learn about data engineering
Jupyter Notebooks for Mastering LLM with Advanced RAG Course
This repository contains various advanced techniques for Retrieval-Augmented Generation (RAG) systems.
Official inference framework for 1-bit LLMs
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics