This repository contains a collection of Python scripts and notebooks demonstrating various automation tasks. It serves as a practical introduction to automating common data-related workflows with Python.
- Provide a re-introduction to Python for practical applications.
- Demonstrate the use of Python notebooks for automation tasks.
- Showcase the ability to work with and switch between different infrastructure pieces.

The notebooks in this repository cover a range of automation tasks, including:
- Data Extraction:
  - Reading data from websites.
  - Extracting tables from PDF files.
  - Reading data from CSV files.
- Web Scraping:
  - Using Selenium to scrape data from websites.
- Data Analysis and Visualization:
  - Creating pivot tables from Excel data.
  - Generating and saving charts in Excel files.
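
As a rough illustration of two of the tasks above (reading CSV data and building a pivot table), here is a minimal sketch using `pandas`. It is not taken from the notebooks: the sample data, the column names (`region`, `product`, `units`), and the helper names `load_sales` and `units_by_region_and_product` are all hypothetical, and the CSV is read from an in-memory string so the example runs without any external file.

```python
import io

import pandas as pd

# Hypothetical sample data standing in for a real CSV file on disk.
CSV_TEXT = """region,product,units
North,Widget,10
North,Gadget,4
South,Widget,7
South,Gadget,3
North,Widget,5
"""


def load_sales(source) -> pd.DataFrame:
    """Read CSV data from a file path or file-like object into a DataFrame."""
    return pd.read_csv(source)


def units_by_region_and_product(df: pd.DataFrame) -> pd.DataFrame:
    """Sum units with one row per region and one column per product."""
    return df.pivot_table(
        index="region",
        columns="product",
        values="units",
        aggfunc="sum",
        fill_value=0,
    )


sales = load_sales(io.StringIO(CSV_TEXT))
pivot = units_by_region_and_product(sales)
print(pivot)
```

Passing a file path such as `"sales.csv"` (a placeholder name) to `load_sales` instead of the `StringIO` object would read from disk in the same way.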

The lessons are split across three notebooks:

- AutomateLesson1.ipynb: Covers reading data from various sources, such as Wikipedia and CSV files, using the `pandas` and `camelot` libraries.
- AutomateLesson2.ipynb: Demonstrates web scraping with `selenium` to get the latest news from The Sun's website and save it to a CSV file.
- AutomateLesson3.ipynb: Focuses on data manipulation with `pandas` to create pivot tables and `openpyxl` to create and save charts in Excel.
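
The Excel chart step mentioned for the third notebook could look roughly like the sketch below, which writes a small table to a worksheet with `openpyxl` and attaches a bar chart to it. This is an assumption about the general approach, not the notebook's actual code: the data, the sheet name `Summary`, the file name `units_chart.xlsx`, and the helper `write_chart_workbook` are invented for illustration.

```python
import os
import tempfile

from openpyxl import Workbook
from openpyxl.chart import BarChart, Reference

# Hypothetical summary data; in practice this might come from a pivot table.
ROWS = [("North", 15), ("South", 7), ("East", 9)]


def write_chart_workbook(path, rows):
    """Write rows to a new workbook and add a bar chart of the values."""
    wb = Workbook()
    ws = wb.active
    ws.title = "Summary"

    # Header row first, then one row per (region, units) pair.
    ws.append(["Region", "Units"])
    for row in rows:
        ws.append(row)

    chart = BarChart()
    chart.title = "Units by region"

    # The data range includes the header cell so it can serve as the series title.
    data = Reference(ws, min_col=2, min_row=1, max_row=ws.max_row)
    categories = Reference(ws, min_col=1, min_row=2, max_row=ws.max_row)
    chart.add_data(data, titles_from_data=True)
    chart.set_categories(categories)

    # Anchor the chart's top-left corner at cell D2 and save the file.
    ws.add_chart(chart, "D2")
    wb.save(path)


# Save into a temporary directory so the example leaves the working tree alone.
output_path = os.path.join(tempfile.mkdtemp(), "units_chart.xlsx")
write_chart_workbook(output_path, ROWS)
print(f"Saved workbook to {output_path}")
```

Opening the saved file in Excel or a compatible spreadsheet application should show the table in columns A and B with the chart placed beside it.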