Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows
-
Updated
Nov 16, 2021 - Python
Git repo to accompany the AWS DevOps Blog: Using AWS DevOps Tools to model and provision AWS Glue workflows
AWS Glue type annotations (stub files).
Batch data ingestion into Amazon OpenSearch Service using AWS Glue
Serverless Data Lake on AWS. Slideshare: https://www.slideshare.net/SmartBizVN/serverless-data-lake-on-aws
This workshop is to build a serverless data lake architecture using Amazon Kinesis Firehose for streaming data ingestion, AWS Glue for Data Integration (ETL, Catalogue Management), Amazon S3 for data lake storage, Amazon Athena for SQL big data analytics.
This project demonstrates a fully automated ETL pipeline built on AWS Cloud to extract playlist data from the Spotify API, transform it using AWS Glue (Apache Spark), and load it into Snowflake for analytics and visualization via Power BI.
IMDB Movie Data ETL Pipeline using S3, Glue, Redshift, EventBridge, SNS
End-to-end project using S3, Glue, Athena, and QuickSight to build a secure, automated data processing and visualisation workflow
Data Engineer End-to-End ETL Pipeline - Ticketmaster API
Spark and Data Lakes Project: STEDI Human Balance Analytics (Udacity Data Engineering with AWS Nanodegree)
Fully Automation end-to-end ETL airlines data ingestion
Spotify listening trends analyzed and visualized using AWS cloud services
Power BI dashboard for Memphis public safety analytics, built on AWS Athena, Glue, and S3.
Terraform implementation of a commonly-used AWS architecture pattern of performing streaming ETL on a Kinesis data stream using a Glue job.
Add a description, image, and links to the glue-etl topic page so that developers can more easily learn about it.
To associate your repository with the glue-etl topic, visit your repo's landing page and select "manage topics."