jeanhx/ProdScrape
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|
Repository files navigation
To use prodscrape change the urls in the files to the pages you want to Scrape Run urlscrape.py to grab all sub links on a page Run pagescrape.py to go through sublinks saved by urlscrape.py and save them as html Run scrape_to_csv.py to go through html saved by urlscrape.py and create a csv with relevant html elements as columns