Thanks to visit codestin.com
Credit goes to github.com

Skip to content

jiyoungkimcr/Amazon-Bestseller-Analysis-R-shiny-project

Repository files navigation

Amazon-Bestseller-Analysis-R-shiny-project

Result - Shiny web created for this project

Project Description:

This project is to analyze Bestseller Books of 2010~2020 which listed on Amazon.com Bestseller list. I crawled Bestseller data from Amazon page using crawler code that I wrote. Through this project, I wanted to see if variables such as book price, ratings, number of reviews, year, etc...have any affect or relationship with books being bestseller. Link to: Amazon Best Sellers of 2010-2020

About Data Preparation - Crawling using BeautifulSoup:

Please refer to the crawler code 2020 file. I wrote the crawler code on Jupyter notebook (python 3) and this can be used for scraping each year (2010-2020) by changing webpage link in 'requests.get' line.

About Dataset:

Our dataset, 'amazon_bs', consists of 1094 obs of 7 variables (Year, Rank, Book.Title, Author, Rating, Num_Customers_Rated, Price). This data is crawled from Amazon Best Sellers list of 2010~2020. It was originally 1100 obs, but through data preprocessing, I dropped 6 obs with NA and empty values

Preview

image

image

image

About

As a part of R final project, this project is to analyze Bestseller Books of 2010~2020 which listed on Amazon.com Bestseller list. I crawled Bestseller data from Amazon page using crawler code that I wrote. Through this project, I wanted to see if variables such as book price, ratings, number of reviews, year, etc...have any affect or relationsh…

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors