Amazon-Bestseller-Analysis-R-shiny-project

Result - Shiny web created for this project

Project Description:

This project is to analyze Bestseller Books of 2010~2020 which listed on Amazon.com Bestseller list. I crawled Bestseller data from Amazon page using crawler code that I wrote. Through this project, I wanted to see if variables such as book price, ratings, number of reviews, year, etc...have any affect or relationship with books being bestseller. Link to: Amazon Best Sellers of 2010-2020

About Data Preparation - Crawling using BeautifulSoup:

Please refer to the crawler code 2020 file. I wrote the crawler code on Jupyter notebook (python 3) and this can be used for scraping each year (2010-2020) by changing webpage link in 'requests.get' line.

About Dataset:

Our dataset, 'amazon_bs', consists of 1094 obs of 7 variables (Year, Rank, Book.Title, Author, Rating, Num_Customers_Rated, Price). This data is crawled from Amazon Best Sellers list of 2010~2020. It was originally 1100 obs, but through data preprocessing, I dropped 6 obs with NA and empty values

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Amazon Book Bestsellers Crawler code (2020).ipynb		Amazon Book Bestsellers Crawler code (2020).ipynb
Amazon_Bestseller_Shinyapp.R		Amazon_Bestseller_Shinyapp.R
LICENSE		LICENSE
README.md		README.md
project - Amazon Book Bestsellers Crawling - 2020 -new.py		project - Amazon Book Bestsellers Crawling - 2020 -new.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Amazon-Bestseller-Analysis-R-shiny-project

Preview

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Amazon-Bestseller-Analysis-R-shiny-project

Preview

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages