FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 91,235 8,128 Updated Oct 27, 2025

Python API for Deequ

Jupyter Notebook 801 147 Updated Apr 1, 2025

Automated data quality suggestions and analysis with Deequ on AWS Glue

Scala 88 24 Updated Dec 29, 2022

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 42,949 15,851 Updated Oct 28, 2025

A list of online resources for quantitative modeling, trading, portfolio management

3,579 636 Updated Jun 15, 2024
TeX 375 326 Updated Sep 9, 2024

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

Python 4,675 975 Updated Oct 1, 2025

Dockerfile to run digdag-server on Amazon ECS

Dockerfile 18 1 Updated Apr 3, 2019

Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark

Java 1,366 852 Updated Aug 22, 2023

⚡ Serverless Framework – Effortlessly build apps that auto-scale, incur zero costs when idle, and require minimal maintenance using AWS Lambda and other managed cloud services.

JavaScript 46,888 5,746 Updated Oct 22, 2025

Python Serverless Microframework for AWS

Python 10,953 1,008 Updated May 29, 2025

A library that allows you to easily mock out tests based on AWS infrastructure.

Python 8,077 2,143 Updated Oct 27, 2025

pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (…

Python 4,070 721 Updated Oct 24, 2025

A curated list of awesome big data frameworks, resources and other awesomeness.

13,963 2,582 Updated Feb 14, 2025

😎 Awesome lists about all kinds of interesting topics

409,940 32,023 Updated Oct 27, 2025

Treasure Boxes - pre-built pieces of code for developing, optimizing, and analyzing your data.

Python 112 74 Updated Oct 27, 2025

A Python MapReduce and HDFS API for Hadoop

Python 241 62 Updated Feb 4, 2025
Java 1,679 288 Updated Oct 17, 2025

Distributed Big Data Orchestration Service

Java 1,756 373 Updated Aug 6, 2025

A testing framework for Presto

Java 62 31 Updated May 2, 2025

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Scala 3,531 572 Updated Aug 27, 2025

a high-performance, POSIX-ish Amazon S3 file system written in Go

Go 5,448 535 Updated Jul 18, 2024

Tools for writing awesome Fabric files

Python 1,250 205 Updated Dec 13, 2019

A guide for Mozilla's developers and data scientists to analyze and interpret the data gathered by our data collection systems.

Shell 93 157 Updated Oct 22, 2025

Bigquery ETL

Python 325 124 Updated Oct 27, 2025

BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.

Java 409 218 Updated Oct 22, 2025

Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017

Jupyter Notebook 1,398 733 Updated Oct 24, 2025
Python 4,883 1,459 Updated Dec 17, 2023

🚀 Awesome list of open source applications for macOS. https://t.me/s/opensourcemacosapps

46,029 2,426 Updated Sep 14, 2025

TonY is a framework to natively run deep learning frameworks on Apache Hadoop.

Java 708 163 Updated Oct 14, 2023