Thanks to visit codestin.com
Credit goes to github.com

Skip to content
View zakapior's full-sized avatar

Block or report zakapior

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
zakapior/README.md

Hi! πŸ‘‹ My name is Jakub and I am a ▢️ Data Engineer ◀️

πŸ‡΅πŸ‡± Gliwice, Poland πŸ’¬ Polish/English πŸ§” he/him ⚑ LinkedIn 🎡 Last.fm 🚲 Strava ⛷️

:bowtie: an IT professional with 12+ years of commercial experience

Data Engineer πŸ‘” presales πŸ–₯️ software development management βš™οΈ Linux/Solaris/Windows Server administration πŸ€·β€β™‚οΈ team leader πŸ§‘β€πŸ€β€πŸ§‘ project management πŸ€” product management πŸ‘“ frontend team management πŸ“² strong mobile messaging expertise πŸ•΄οΈ strong business awareness

πŸŽ“ graduated the 800+ hours Data Engineering course on Turing College

Python (Polars, Pandas) 🎯 SQL (PostgreSQL, MySQL, SQL Server, ClickHouse) 🎯 Apache Spark (PySpark) 🎯 Hadoop (HDFS, Hive, Hortonworks Sandbox) 🎯 Apache Airflow (managing Python, Bash and Docker containerized pipelines) 🎯 dbt 🎯 Docker 🎯 Kubernetes 🎯 Data Warehousing (Kimball) 🎯 Data Modeling 🎯 Data Mesh

😎 18+ years of Linux & open source enthusiasm

I πŸ’˜ GNU\Linux and non-GNU\Linux and use it since 2006 (Mandriva 2007.0 Spring).

Pinned Loading

  1. spark-airflow-docker-smart_meters spark-airflow-docker-smart_meters Public

    A portfolio repository, that showcase using Spark for data transformations and loading in a Data Lake environment, using Airflow to orchestrate PySpark jobs that are encapsulated in Docker containers.

    Python

  2. airflow-pipelines-model-training-precious-metal-prices airflow-pipelines-model-training-precious-metal-prices Public

    A portfolio repository, that showcase using Airflow to orchestrate ETL pipelines that would prepare the precious metal prices data to be used with machine learning model and then train the model.

    Python

  3. jobs-data-pipelines-with-python-and-airflow jobs-data-pipelines-with-python-and-airflow Public

    A portfolio repository, that showcase using Airflow to create a data pipeline in Python, that would present job offers from several job boards.

    Python

  4. loan-eligibility-with-docker-and-airflow loan-eligibility-with-docker-and-airflow Public

    A portfolio repository, that showcase using Airflow to manage Docker containers to prepare the environment and drive ETL process of loan eligibility data.

    Jupyter Notebook

  5. superstore-dimensional-modeling-postgresql superstore-dimensional-modeling-postgresql Public

    A portfolio repository, that showcase using dbt for Kimball-style dimensional modeling on a Superstore Sales dataset.

    Dockerfile

  6. weather-data-system-with-python-and-sql weather-data-system-with-python-and-sql Public

    A portfolio repository, that showcase creating a data pipeline in Python with data from OpenWeatherAPI.

    Python