Stars
A Data Streaming Library for Efficient Neural Network Training
12 Lessons to Get Started Building AI Agents
Simple, unified interface to multiple Generative AI providers
PySpark test helper methods with beautiful error messages
Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.
Efficient Triton Kernels for LLM Training
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
All Algorithms implemented in Python
This is a guide to PySpark code style presenting common situations and the associated best practices based on the most frequent recurring topics across the PySpark repos we've encountered.
Feature toggle system API client SDK that follows the OpenFeature specification.
⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-powered business intelligence in seconds.
A set of IaC artifacts to automatically configure the infrastructure resources needed by a Flyte deployment
📺 Instill Console for 🔮 Instill Core: https://github.com/instill-ai/instill-core
🔮 Instill Core is a full-stack AI infrastructure tool for data, model and pipeline orchestration, designed to streamline every aspect of building versatile AI-first applications
An extremely fast Python package and project manager, written in Rust.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
A light-weight, flexible, and expressive statistical data testing library
An extremely fast Python linter and code formatter, written in Rust.
🤖 The Semantic Engine for Model Context Protocol (MCP) Clients and AI Agents 🔥
Fine-tuning LLMs on Flyte and Union Cloud
Data Catalog is a service for indexing parameterized, strongly-typed data artifacts across revisions. It also powers Flyte's memoization system.
Control Plane for Flyte. Flyteadmin is a gRPC + REST service written in Go that uses an RDBMS to store meta information and management information for Flyte Tasks and Workflows.
An Apache Commons-style library in Go, used by the Flyte project. Contains utilities for metrics, pflags, config management, storage abstraction, caching, etc.
Flyte Backend Plugins contributed by the Flyte community.