Codestin Search App

Branches Tags

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
Readme.txt		Readme.txt
pagescrape.py		pagescrape.py
scrape_to_csv.py		scrape_to_csv.py
urlscrape.py		urlscrape.py

Repository files navigation

To use prodscrape change the urls in the files to the pages you want to Scrape
Run urlscrape.py to grab all sub links on a page
Run pagescrape.py to go through sublinks saved by urlscrape.py and save them as html
Run scrape_to_csv.py to go through html saved by urlscrape.py and create a csv with relevant html elements as columns