Stars
Data Lineage Tracking And Visualization Solution
Alluxio, data orchestration for analytics and machine learning in the cloud
Apache Druid: a high performance real-time analytics database.
Minecraft server startup flags for GraalVM
GraalVM compiles applications into native executables that start instantly, scale fast, and use fewer compute resources 🚀
Example code from Learning Spark book
Apache Spark - A unified analytics engine for large-scale data processing
Shuttle: Highly Available, High-Performance Remote Shuffle Service
🐬DeepChat - A smart assistant that connects powerful AI to your personal world
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
This is a repository for the LinkedIn Learning course Advanced SQL for Query Tuning and Performance Optimization
Qubole Sparklens tool for performance tuning Apache Spark
https://umbertogriffo.gitbook.io/apache-spark-best-practices-and-tuning/
Notes talking about the design and implementation of Apache Spark
This project provides Apache Spark SQL, RDD, DataFrame, and Dataset examples in the Scala language
Examples To Help You Learn Apache Spark
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
The Auron accelerator for distributed computing frameworks (e.g., Spark) leverages native vectorized execution to accelerate query processing