0% found this document useful (0 votes)

15 views19 pages

Final Task

The document outlines methods for cleaning and analyzing music release data, focusing on the 'Release_date' field and handling missing values in various fields. It includes statistical analysis of popularity and artist followers by genre, along with hypothesis testing to compare the popularity of house and dance/electronic genres. Additionally, it emphasizes the importance of data visualization and predictive analysis for understanding trends and improving decision-making in the music industry.

Uploaded by

samiya akhtar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views19 pages

Final Task

Uploaded by

samiya akhtar

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 19

Question # 01

(a) (i)
To clean the "Release_date" field, we can use the following method:

 Examine all of the various dates in the "Release_date" field, focusing solely on dates, years,
and any data that is missing.
 Subsequent to recognizing conflicting information, we will eliminate the missing information
in the "Release_date" field.
 If the data only include the year, we will assume that the release date and month are the first
day of that year's first month.
 Any dates that cannot be changed will be removed. Six rows from the
Spotify_database_Ireland file have been removed because they contain 1957-09 data in this
format. Therefore, we are unable to comprehend that 09 is a month or a date. This is why we
removed these six rows.
 After that, add the cleaned dates to the database's "Release_date" field and verify that some of
the cleaned dates are correct.
We will achieve the consistent and accurate "Release_date" field by implementing this strategy.

(ii)
We can employ the following strategy to deal with the problem of missing values in the fields
"anger," "anticipation," "disgust," "fear," "joy," "sadness," "surprise," "trust," "n words," and
"LDA Topic":

• As can be seen, the order of the missing values in each of the aforementioned fields is the same.
So we will erase entire lines to settle this issue.
• However, unlike other fields, the sequence of missing values in the "Released after 2017" field
is unique. Consider putting in the missing values for the "Released after 2017" field that is
blank because of missing data.
• This can be accomplished in a number of ways, such as by estimating the missing values using
a predictive model or by using the average or median of the available data.
• Thus, the missing values are straight out (i.e., not mathematical), we can consider utilizing a
mode-based ascription strategy. This includes supplanting the missing values with the most
widely recognized esteem in the accessible information.
We will be able to resolve the issue of missing values in the "anger," "anticipation," "disgust,"
"fear," "joy," "sadness," "surprise," "trust," "n words," "LDA Topic," and "Released after 2017"
fields by implementing this strategy.
(b)
Table 1: Average, Standard deviation, min, max, std error of Popularity

GENRE MAX OF MIN OF AVERAGE OF STD DEV OF STD ERROR OF

POPULARITY POPULARITY POPULARITY POPULARITY POPULARITY
BOY BAND 67830.35 2.4 2421.112185 8059.897219 738.8495667
COUNTRY 3359.9 3.2 568.2384615 941.7547279 261.1957662
DANCE/ELE- 81863.8 1.6 5168.688589 12666.91456 815.9476947
CTRONIC
ELSE 64696.8 1.6 2812.230882 7650.813274 1710.773856
FUNK 10184.9 4 699.2525 2246.565392 502.3472933
HIP HOP 87573.85 0.8 3767.849398 10033.16676 449.596953
HOUSE 60212.15 2.4 6372.150855 10946.22717 1011.97906
INDIE 51539.6 0.8 4716.740878 9399.304208 772.6180466
K-POP 135264.2 8 13466.55588 25668.16525 3594.26206
LATIN 8764.6 4.8 2189.391667 3502.113001 1429.731646
METAL 1740.95 1.6 165.5873016 311.269691 39.21629491
POP 146629.4 0.8 6830.350535 14415.4964 361.6326662
R&B/SOUL 66495.2 2.4 5762.954717 11469.53011 909.5929038
RAP 81093.8 1.6 6937.373746 14598.22138 844.2369407
REGGAE 352 10.4 190.4 171.541715 99.03965536
ROCK 62891.85 1.6 1634.642138 5604.730633 314.297687
TRAP 19773.7 5.6 3666.375926 6508.351545 1252.532839

Table 2: Average, Standard deviation, min, max, std error of Artist_followers

GENRE AVERAGE OF MAX OF ARTIST_ MIN OF ARTIST_ STD DEV OF STD ERROR OF
ARTIST_ FOLLOWERS FOLLOWERS ARTIST_ ARTIST_
FOLLOWERS FOLLOWERS FOLLOWERS
BOY BAND 7754882 18426467 69457 7166799 656979.4
COUNTRY 1374046 4289740 18580 1386892 384654.7
DANCE/ELE- 4149964 27670203 3729 6734846 433829.6
CTRONIC
ELSE 1084989 8310347 6 1829418 140310
FUNK 3191166 9903432 43767 3237713 723974.6
HIP HOP 14722451 50593376 16745 18096348 810916.8
HOUSE 6868570 24925809 5580 8375604 774324.8
INDIE 4977709 27701635 86735 9647260 792999.9
K-POP 5919622 24755789 756869 7886430 1104321
LATIN 11116259 26265604 2221744 10600946 4327818
METAL 9305144 18354552 315426 6814627 858562.3
POP 12941298 71783101 1897 16609199 416664.7
R&B/SOUL 8094196 25562055 31441 9515997 754667.7
RAP 11802712 29173640 14818 9150355 529178.7
REGGAE 3532420 8781395 246452 4593621 2652129
ROCK 6936113 30723081 7644 7961139 446438.5
TRAP 1824558 2441830 76126 1027723 197785.3

Cardinality of the Popularity and Artists_followers fields split by genre is many-to-many

relationship.

Null Hypothesis: There will be no significant difference in popularity between the house and
dance/electronic genres for the new album produced by the independent music recording company.

Alternative Hypothesis: The popularity of the house genre for the new album produced by the
independent music recording company will be significantly greater than the popularity of the
dance/electronic genre.

(ii) State what type of hypothesis test you plan to use and why.

A one-tailed, two-sample t-test is the most appropriate method for testing the hypothesis that house
is more popular than dance or electronic.

The new album's genre, which can be house or dance/electronic, is the independent variable. The
new album's popularity is the dependent variable.

We plan to use a one-tailed independent samples t-test to compare the mean popularity of the two
genres. We choose the one-tailed test because we are interested in testing if the popularity of house
genre is greater than the popularity of dance/electronic genre.
(iii) Carry out the test and comment its results.

Levene's Test for

Equality of
Variances
Significance
One- Two-
F Sig. t df Sided p Sided p
Popularity Equal 0.024 0.877 -0.880 356 0.190 0.379
variances
assumed
Equal -0.926 262.274 0.178 0.355
variances not
assumed

As we see that p-value > 0.05 level of significance, so we accept the null hypothesis i.e., There
will be no significant difference in popularity between the house and dance/electronic genres for
the new album produced by the independent music recording company. Hence, we conclude that
the popularity of dance/electronic and house genre is same.

Question # 02

(a)
Data visualization is the process of visualizing data and information in formats like graphs, charts,
and maps. This makes it easier for people to understand complex data and find patterns and insights
that may not be immediately apparent.

Dashboards that work well and are easy to use are important for a number of reasons. First, they
give a clear and concise overview of the data. This makes it easy for users to quickly find trends,
patterns, and outliers and make decisions based on the data that are well-informed. Second,
dashboards make it easier for non-technical stakeholders to comprehend the data and gain insight
by providing a simple and visual way to convey complex information. Thirdly, dashboards enable
users to interact with the data in real time, allowing them to examine the data in greater depth and
drill down into specific areas of interest. Finally, user-friendly dashboards have the potential to
enhance the user experience as a whole, increase data and insight adoption and engagement, and
ultimately improve decision-making.

In conclusion, effective dashboards and data visualization are essential tools for understanding
complex data, recognizing patterns and insights, and communicating key findings to stakeholders.
Dashboards can assist users in making informed decisions based on the data by presenting it in a
clear and understandable manner, resulting in improved performance and outcomes.
(b)

In this line chart:

 Certain genres are showing consistently higher or lower popularity over time.
 Some genres are showing an unexpected rise or fall in popularity over time.
In this line chart:

 Non-explicit songs have a different trend in popularity compared to explicit songs.

 The popularity of explicit songs shows a sudden rise or fall over time due to external
factors.
In this line chart:

 Certain LDA topics have consistently become more popular over time.
 Certain LDA topics have consistently higher or lower popularity over time.
(e)

This stacked bar chart reveals the overall trend in the number of songs in the Top10 position, year-
wise changes, and genre distribution. This can help music industry professionals understand
market trends and inform their strategies.
This stacked bar chart provides insights into the trends and genre distribution of songs that have
reached the Top50 position over time. This information can be used to inform marketing and
investment strategies in the music industry.
(f)

If a genre has a smaller box plot with a lower median value, it could indicate that songs in that
genre tend to have lower maximum positions. Alternatively, if a genre has a larger box plot with a
higher median value, it could indicate that songs in that genre tend to have higher maximum
positions. Similarly, differences in the box plots of different topics within a genre could indicate
variations in the popularity and success of songs within that genre.
Question #03

(a)

The process of extracting insights from graphical representations of data, such as maps and charts,
typically involves a series of steps. Firstly, it is necessary to carefully examine the visualization
and identify the key data points being represented. This involves analyzing labels, scales, and
legends, as well as colors and shapes used to represent the data.

After identifying the key data points, the next step is to ask questions to gain insights into the data.
Questions can fluctuate contingent upon the sort of perception and the information being
addressed. For example, questions might include: What trends or patterns can be observed in the
data? Are there any outliers or anomalies that need to be investigated further? What relationships
or correlations exist between different data points? What are the possible causes or drivers of
observed trends or patterns?

To extract insights effectively, it is essential to understand the context in which the data was
collected and processed. Therefore, it is important to document the data mining process, including
the data sources, any pre-processing steps, and any assumptions or limitations that were made.

Furthermore, it is critical to document the methods used to analyze the data and extract insights,
including any statistical techniques, algorithms, or machine learning models used. This
documentation should include visualizations that were created, as well as any interpretations or
conclusions drawn from the data.

Finally, it is necessary to communicate the insights effectively to stakeholders, using visualizations

and clear explanations to help them understand the findings. In summary, extracting insights from
graphical representations of data involves careful analysis, asking the right questions, documenting
the data mining process and methods used, and effective communication of the findings to
stakeholders.

(b)
There are several types of models that can be used for predictive analysis, including:

 Regression Models: These models are used to predict continuous numerical values, such as
stock prices or temperatures. Regression models use statistical techniques to identify
relationships between independent and dependent variables, and then create a formula that can
be used to make predictions. Simple linear regression (SLR), Multiple linear regression
(MLR), Polynomial regression and Logistic regression are some regression models.
 Classification Models: These models are used to predict categorical outcomes, such as whether
a customer will buy a product or not. Classification models use algorithms to identify patterns
in the data and then assign labels to new data based on these patterns. K- nearest neighbors,
Naïve Bayes (NB), SVM and random forest (RF) are some classification models.
 Time Series Models: These models are used to analyze data over time, such as stock prices or
website traffic. Time series models use statistical techniques to identify trends, patterns, and
seasonal variations in the data and then make predictions based on these patterns.
Autoregressive, moving average, ARIMA model, SARIMA model, VAR model and BSTS
model are some time series models.
 Clustering Models: These models are used to identify groups of similar data points. Clustering
models use algorithms to group data points together based on their similarity, and then assign
labels to these groups. Fuzzy clustering, subspace clustering, hierarchical clustering and k-
means clustering are some clustering models.
Benefits of carrying out predictive data analysis include:

 Personalized Recommendations: By using predictive analysis to examine user habits and

patterns, Spotify is able to create customized recommendations for users depending on their
playing interests. This may increase user retention and engagement.
 Music Discovery: Predictive analysis can also help Spotify discover new music based on user
preferences and behavior. This can help the platform expand its library and offer users more
diverse content.
 Targeted Marketing: Predictive analysis can help Spotify identify user segments based on their
behavior, preferences, and demographics. This can enable the platform to target specific groups
with personalized marketing campaigns, improving conversion rates and ROI.
 Predictive Maintenance: Predictive analysis can also be used to identify potential issues with
the platform's infrastructure, allowing Spotify to perform proactive maintenance and prevent
downtime.
 Improved Business Performance: By using predictive analysis to identify trends and patterns
in user behavior, Spotify can make data-driven decisions that improve its business
performance. For example, it can use predictive analysis to identify popular artists, genres, or
songs, and invest in acquiring more content in those areas.
 Competitive Advantage: By using predictive analysis to understand user behavior and
preferences, Spotify can gain a competitive advantage over other music streaming platforms.
This can enable the platform to differentiate itself by offering more personalized
recommendations, a better user experience, or a more diverse library of content.

(c)
There are many mathematical functions that can be used to capture trends in data, but some of the
most popular ones are:

 Linear Regression: This function is used to model the association between a response variable
and more than one regressors. It assumes that the relationship is linear, and it finds the best-fit
line that represents the data.
 Polynomial Regression: This function is similar to linear regression, but it allows for more
complex relationships between a response variable and regressors. It models the data with a
polynomial equation of a specified degree, which can capture non-linear trends in the data.
 Exponential Functions: These functions are used to model data that shows exponential growth
or decay over time. They are often used in finance and economics to model interest rates,
population growth, or the spread of diseases.
 Logarithmic Functions: These functions are used to model data that shows a diminishing rate
of change over time. They are often used in engineering, physics, and finance to model
phenomena such as resistance, signal attenuation, or stock prices.
 Sigmoid Functions: These functions are used to model data that shows a saturation effect or an
S-shaped curve. They are often used in biology and neuroscience to model the response of
neurons or the growth of organisms.
 Fourier Transform: This function is used to decompose a signal into its component frequencies.
It can be used to analyze periodic data such as sound waves, electrical signals, or climate data.
 Wavelet Transform: This function is similar to Fourier Transform, but it can analyze non-
periodic signals and capture transient features in the data. It is often used in image processing,
signal processing, and geophysics.
Overall, these mathematical functions are widely used to capture trends in the data and model
complex relationships between variables. The choice of function depends on the type of data and
the research question at hand.
In my opinion, the 2nd degree polynomial trend line best captures the historical trend in the
variation of the songs' energy levels over time. This is because the polynomial trend line shows a
consistent increase in energy levels over time, which is a reasonable trend to expect given the
increasing popularity of electronic dance music over the years. Additionally, the R-squared value
for the polynomial trend line is higher than the other trend lines, indicating that it explains more
of the variation in the data. To prove this, I would compare the R-squared values of each of the
four trend lines and choose the one with the highest R-squared value as the best fit for the data.
The R-squared values of four trend lines are following as:

Trend lines R-squared values

polynomial 0.6247

exponential 0.5425

logarithmic 0.5266

linear 0.5241
Q3: (d)
(b)

(c)
(d)
The stacked bar charts show the number of positive and negative songs across different genres for
different values of the "Factor" parameter.

When the factor is set to 1, the impact of speech on the sentiment of a song is minimal, meaning
that the sentiment of the song remains mostly positive, even when the speech level is high. As a
result, for a factor of 1, the stacked bar chart shows that the majority of songs in each genre are
positive, while there are very few negative songs. However, as the factor increases, the impact of
speech on the sentiment of a song becomes more pronounced, and the sentiment of the song
becomes increasingly negative as the speech level increases. This is evident in the stacked bar
chart, where the number of negative songs increases as the factor increases. For example, when
the factor is set to 2.5, there are more negative songs than for a factor of 1, but the majority of
songs in each genre are still positive. However, when the factor is set to 5.0, the majority of songs
in each genre become negative, indicating that the impact of speech on the sentiment of a song is
now significant enough to cause the sentiment of the song to become negative, even at relatively
low speech levels.

TYT DM UVF10 Programming Guide v1.0 PDF
No ratings yet
TYT DM UVF10 Programming Guide v1.0 PDF
48 pages
Hip Hop Drum Patterns User Guide
100% (5)
Hip Hop Drum Patterns User Guide
58 pages
Data Preparation For Analytics Using SAS
100% (1)
Data Preparation For Analytics Using SAS
440 pages
In The Style of Jaco Pastorius
No ratings yet
In The Style of Jaco Pastorius
2 pages
Southeast Asian Music
86% (22)
Southeast Asian Music
5 pages
Power of The Cross Hymn
No ratings yet
Power of The Cross Hymn
2 pages
Task
No ratings yet
Task
5 pages
Spotify Analysis
No ratings yet
Spotify Analysis
3 pages
EDA - Unit 1
No ratings yet
EDA - Unit 1
82 pages
Stats Presentation
No ratings yet
Stats Presentation
58 pages
Descriptive Analytics Basics
No ratings yet
Descriptive Analytics Basics
29 pages
T Sivaprakash MBA BA03 040 Capstone Project
No ratings yet
T Sivaprakash MBA BA03 040 Capstone Project
16 pages
Rock-Paper-Scissors & Ice Cream Data Analysis
No ratings yet
Rock-Paper-Scissors & Ice Cream Data Analysis
16 pages
Questionnaire & Data Analysis Guide
No ratings yet
Questionnaire & Data Analysis Guide
49 pages
R Final
No ratings yet
R Final
19 pages
Introduction To Data Science Module 1
No ratings yet
Introduction To Data Science Module 1
32 pages
Aneesha Big Data Project
No ratings yet
Aneesha Big Data Project
4 pages
Seminar 08
No ratings yet
Seminar 08
24 pages
25 Essential Data Analysis Terms Every Analyst Should Know
No ratings yet
25 Essential Data Analysis Terms Every Analyst Should Know
11 pages
Module 4
No ratings yet
Module 4
69 pages
Chapt 2 Data Organizatiion and Presentaion
No ratings yet
Chapt 2 Data Organizatiion and Presentaion
67 pages
Lectura The Art of Data Science
No ratings yet
Lectura The Art of Data Science
22 pages
Exp 4-10 Merged
No ratings yet
Exp 4-10 Merged
89 pages
Calculating For Descriptive Statistics Jazmine Ibarra
No ratings yet
Calculating For Descriptive Statistics Jazmine Ibarra
4 pages
CH 2 Notes Filled
No ratings yet
CH 2 Notes Filled
22 pages
Amit Khilare Used Device Data PM Project
No ratings yet
Amit Khilare Used Device Data PM Project
25 pages
Ad3301 Apr May 2024 Answer Key
No ratings yet
Ad3301 Apr May 2024 Answer Key
31 pages
Social Media Data Analysis Guide
No ratings yet
Social Media Data Analysis Guide
12 pages
Visual Presentation of Data
No ratings yet
Visual Presentation of Data
26 pages
STAT2024 Assignment 3-1
No ratings yet
STAT2024 Assignment 3-1
4 pages
Camm BA 5e PPT CH02 03-09-23 PC - Final
No ratings yet
Camm BA 5e PPT CH02 03-09-23 PC - Final
52 pages
Crash Course Data Science
No ratings yet
Crash Course Data Science
7 pages
UNIT 6 + UNIT 16 - Collecting Data and Interpreting Results
No ratings yet
UNIT 6 + UNIT 16 - Collecting Data and Interpreting Results
8 pages
Unit 5 PDF
No ratings yet
Unit 5 PDF
106 pages
The Art of Data Science Roger D. Peng - Get Instant Access To The Full Ebook Content
100% (1)
The Art of Data Science Roger D. Peng - Get Instant Access To The Full Ebook Content
86 pages
BA1 Introduction 2025
No ratings yet
BA1 Introduction 2025
55 pages
Ch2 - Descriptive Statistics - Tabular and Graphical Presentations
100% (1)
Ch2 - Descriptive Statistics - Tabular and Graphical Presentations
47 pages
Module 5
No ratings yet
Module 5
20 pages
MIT 212 Collecting and Organizing Data - Tutorial 08
No ratings yet
MIT 212 Collecting and Organizing Data - Tutorial 08
5 pages
Ba Lecture 2
No ratings yet
Ba Lecture 2
54 pages
Project Report
No ratings yet
Project Report
39 pages
EDA - Module 4
No ratings yet
EDA - Module 4
35 pages
Lesson 2 Notes
No ratings yet
Lesson 2 Notes
11 pages
DataUnderstandingAndPreparation DOM304
No ratings yet
DataUnderstandingAndPreparation DOM304
19 pages
R Assignment
No ratings yet
R Assignment
32 pages
STK110 - Chapter 2
No ratings yet
STK110 - Chapter 2
29 pages
Data Analysis and Report Writing BRM
No ratings yet
Data Analysis and Report Writing BRM
49 pages
Spotify Analysis - 1
No ratings yet
Spotify Analysis - 1
2 pages
12-Exploratory Data Analysis, Anomaly Detection-28!03!2023
No ratings yet
12-Exploratory Data Analysis, Anomaly Detection-28!03!2023
79 pages
Data Analysis Techniques Guide
No ratings yet
Data Analysis Techniques Guide
9 pages
Gracy File Report
No ratings yet
Gracy File Report
18 pages
DAV Lab Sample
No ratings yet
DAV Lab Sample
21 pages
Spotify Analysis
No ratings yet
Spotify Analysis
1 page
ADDB - Week 1
No ratings yet
ADDB - Week 1
44 pages
CHAPTER 4 Data Management
No ratings yet
CHAPTER 4 Data Management
16 pages
Intro to Descriptive Statistics
No ratings yet
Intro to Descriptive Statistics
92 pages
Chapter 1
No ratings yet
Chapter 1
62 pages
Chapter 2 DESCRIPTIVE ANALYTICS
No ratings yet
Chapter 2 DESCRIPTIVE ANALYTICS
86 pages
Lecture 1-1 Methods of Data Collection
No ratings yet
Lecture 1-1 Methods of Data Collection
30 pages
Chapter2 091117004812 Phpapp01
100% (1)
Chapter2 091117004812 Phpapp01
55 pages
2 Mark Dev
No ratings yet
2 Mark Dev
6 pages
Chapter 2 - Describing The Data
No ratings yet
Chapter 2 - Describing The Data
9 pages
Quantitative Methods 3
No ratings yet
Quantitative Methods 3
174 pages
QM 1
No ratings yet
QM 1
58 pages
Obladi Oblada Lyrics Summary
No ratings yet
Obladi Oblada Lyrics Summary
2 pages
Statistics The Art and Science of Learning From Data 3rd Edition Alan Agresti Instant Download
No ratings yet
Statistics The Art and Science of Learning From Data 3rd Edition Alan Agresti Instant Download
41 pages
We Needto Talk TG
No ratings yet
We Needto Talk TG
3 pages
Nunca - Song Samba
No ratings yet
Nunca - Song Samba
2 pages
Battle Belongs - SongSelect Chart in B-1
No ratings yet
Battle Belongs - SongSelect Chart in B-1
2 pages
A Galaxy Called 'Mikrokosmos' - A Composer's View (Andre Hadju)
100% (1)
A Galaxy Called 'Mikrokosmos' - A Composer's View (Andre Hadju)
21 pages
Keyboard Lab1 PDF
No ratings yet
Keyboard Lab1 PDF
72 pages
Companion
No ratings yet
Companion
47 pages
Audio 1979 10
No ratings yet
Audio 1979 10
188 pages
Bài tập 1
No ratings yet
Bài tập 1
13 pages
Activities Halo
No ratings yet
Activities Halo
2 pages
Country Music An Illustrated History Dayton Duncan PDF Download
No ratings yet
Country Music An Illustrated History Dayton Duncan PDF Download
56 pages
4exp Activity 2 Fifth Grade
No ratings yet
4exp Activity 2 Fifth Grade
2 pages
Final Exam B06
No ratings yet
Final Exam B06
3 pages
Charlie Christian Lick #1 (Patreon Bonus)
No ratings yet
Charlie Christian Lick #1 (Patreon Bonus)
1 page
Popular Culture: The Following Selection Is About The Invention of The Compact Disc, and Explains How It Works
No ratings yet
Popular Culture: The Following Selection Is About The Invention of The Compact Disc, and Explains How It Works
2 pages
Answer:: Buses To The City Centre Leave This Bus Stop Every 20 Minutes
No ratings yet
Answer:: Buses To The City Centre Leave This Bus Stop Every 20 Minutes
16 pages
Ragtime & Joplin: Grade 11 Music
0% (1)
Ragtime & Joplin: Grade 11 Music
3 pages
Royally Fudged Steamy Sweet Instalove Holiday Romance 1st Edition Fern Fraser Download
No ratings yet
Royally Fudged Steamy Sweet Instalove Holiday Romance 1st Edition Fern Fraser Download
49 pages
Musical Illusions and Phantom Words - How Music and Speech Unlock Mysteries of The Brain
100% (3)
Musical Illusions and Phantom Words - How Music and Speech Unlock Mysteries of The Brain
264 pages
Cabzeus MONO - Quickstart Guide
No ratings yet
Cabzeus MONO - Quickstart Guide
2 pages
Baritone Saxophone Sheet Music
No ratings yet
Baritone Saxophone Sheet Music
2 pages
NJT1946A
No ratings yet
NJT1946A
3 pages
Complete Wedding With DJ Worksheets.4pgs
No ratings yet
Complete Wedding With DJ Worksheets.4pgs
4 pages
A Modern Method To Learn The Morse Code
100% (1)
A Modern Method To Learn The Morse Code
18 pages

Final Task

Uploaded by

Final Task

Uploaded by

Question # 01

GENRE MAX OF MIN OF AVERAGE OF STD DEV OF STD ERROR OF

Table 2: Average, Standard deviation, min, max, std error of Artist_followers

Cardinality of the Popularity and Artists_followers fields split by genre is many-to-many

Levene's Test for

In this line chart:

 Non-explicit songs have a different trend in popularity compared to explicit songs.

Finally, it is necessary to communicate the insights effectively to stakeholders, using visualizations

 Personalized Recommendations: By using predictive analysis to examine user habits and

Trend lines R-squared values

You might also like