ADMW - AWS Data Migration Workflow 📦

A data migration pipeline built on several AWS services.
From flat file to relational database.
Data is processed in parallel using AWS Glue and Spark to decrease the time needed to extract, load, and transform data.

Two-Stage Migration Process ⏩

Preliminary Stage

Reads data from a flat file and perform any needed data transformations on the data.
Loads the transformed data into a preliminary table (contains additional columns for status tracking and data integrity checks).
Generates a report of the data migration process.

Final Stage

Moves data from the preliminary table to the final table (finalised schema with the addition of a source file name column).
Generates a report of the data migration process.
Compresses and stores source file.

Global Features

A tracking table in RDS instance is continually updated to aid in monitoring of a data migration process.
Report summaries are sent via email using SNS.
Users are notified of any errors occurring during a migration process via email.
A Jupyter notebook is provided to allow developers to create and test both transformation and reconcilation logic easily.

AWS Services Used ⭯

AWS S3 - storage service
AWS Lambda - serverless computing service
AWS Glue - data integration service
AWS RDS - relational database service
AWS SNS - messaging service
AWS Step Functions - orchestration service
AWS CloudWatch - monitoring service
AWS IAM - access control service

Video Demonstration 🎥

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
config		config
db_generator		db_generator
dmlib		dmlib
envs		envs
extractinator		extractinator
glue		glue
iam		iam
lambda		lambda
screenshots_diagrams		screenshots_diagrams
setup_details		setup_details
sfn		sfn
.gitignore		.gitignore
README.rst		README.rst
other_resources.txt		other_resources.txt
requirements_dev.txt		requirements_dev.txt
requirements_layer.txt		requirements_layer.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

ADMW - AWS Data Migration Workflow 📦

Two-Stage Migration Process ⏩

AWS Services Used ⭯

Video Demonstration 🎥

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Uh oh!

Uh oh!

maximus-lee-678/ADMW

Folders and files

Latest commit

History

Repository files navigation

ADMW - AWS Data Migration Workflow 📦

Two-Stage Migration Process ⏩

AWS Services Used ⭯

Video Demonstration 🎥

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages