Stars
Asgarde allows simplifying error handling with Apache Beam Java, with less code, more concise and expressive code.
BigTesty is a framework that allows to create Integration Tests with BigQuery on a real and short lived Infrastructure.
SoftClient4ES is a modular and version-resilient interface built on top of Elasticsearch clients, providing a unified and stable API that simplifies migration across Elasticsearch versions, acceler…
Surfalytics projces on Data Engineering and Analytics
Declarative text based tool for data analysts and engineers to extract, load, transform and orchestrate their data pipelines.
The best place to learn data engineering. Built and maintained by the data engineering community.
This is a public repository to go over all the LLM-driven data engineering concepts.
Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data types and format characters) using Java.
Create a Chatbot app on your own data with GCP tools
hive_compared_bq compares/validates 2 (SQL like) tables, and graphically shows the rows/columns that are different.
Create BigQuery views that unify sets of table with the same prefix and different versions.
Introduction à la science des données et à l’intelligence artificielle
A curated list of resources for learning about Google Cloud Platform certifications and how to prepare for it.
Google BigQuery support for Spark, Structured Streaming, SQL, and DataFrames with easy Databricks integration.
by ex-googlers, for ex-googlers - a lookup table of similar tech & services
A list of useful Apache NiFi resources, processor bundles and tools
Fuctional rest api in Scala with http4s, doobie and circe
Transfers inotify events from NFS server to client (for example, for MPD library auto-update)
A Chef cookbook that manages Scala's CI infrastructure.