Spotify Data Analysis Report
Project Title:
Spotify Data Analysis
Team Members:
Mohammed El Hamlili
Yousra Chlih
Hamza Khiate
Adam Assaid
Anas Neffar
Saad Eddine Belkhoukh
Professor and Module Name:
Pr. Mehdia AJANA
Compétences Numériques
Submission Date:
22 / 12 / 2024
1. Introduction
Spotify is one of the leading music streaming platforms globally. This dataset
contains detailed information about the most-streamed songs on Spotify, including:
Total Streams: The cumulative number of streams each song has achieved.
Daily Streams: The average number of streams per day.
Artist and Title: The artist's name and the song's title.
The goal of this project was to:
1. Explore trends in the dataset.
2. Identify the most popular songs and artists.
3. Gain insights into user listening patterns through data visualization.
This analysis provides a snapshot of Spotify's music trends, highlighting which songs
dominate both overall and daily streams.
2. Methods
This section describes the steps taken to analyze the dataset:
a. Data Cleaning
Columns such as Streams and Daily were originally in string format with
commas. These were cleaned and converted into numeric types for analysis.
Missing values in the Daily column were identified and replaced with the
column's mean to ensure continuity in analysis.
b. Data Inspection
Basic dataset information was explored, including column types, first and last
rows, and the presence of missing values.
A summary of the dataset was generated to understand key metrics such as
mean, minimum, and maximum values.
c. Visualization
Three visualizations were created to identify trends and patterns:
1. Top 10 Songs by Total Streams: Bar chart displaying the top-performing
songs.
2. Top 10 Songs by Daily Streams: Bar chart highlighting songs with high daily
activity.
3. Scatter Plot: Comparing total streams and daily streams to identify
relationships.
3. Results
a. Key Statistics
The average total streams per song are approximately 757 million.
The highest total streams recorded are over 3.78 billion.
The average daily streams are around 446,000, with a maximum of 9.2
million per day.
b. Graphs
Graph 1: Top 10 Songs by Total Streams
This bar chart shows the dominance of "Blinding Lights" by The Weeknd,
followed by hits like "Shape of You" and "Dance Monkey." These songs
have maintained their popularity over several years.
Graph 2: Top 10 Songs by Daily Streams
The second visualization highlights "Seven" by Jung Kook as the most-streamed
song on a daily basis. This suggests its recent popularity and consistent streaming
performance.
Graph 3: Scatter Plot of Total vs. Daily Streams
The scatter plot reveals that most songs have a high cumulative stream count but
relatively low daily activity. A few songs, however, maintain consistent high daily
streams, showcasing their recent relevance.
4. Challenges Faced
During this project, the following challenges were encountered:
1. Data Cleaning:
o Columns like Streams and Daily required careful conversion from strings to
numeric types due to commas.
2. Handling Missing Data:
o Missing values in the Daily column were addressed by replacing them with the
mean, ensuring no bias in the analysis.
3. Visualization:
o Ensuring the visualizations were clear and informative required adjusting axis
labels, colors, and layout.
These challenges were resolved using Python's pandas and matplotlib libraries.
4. Team Work Experience:
Our journey of collaboration on this Python project was a dynamic
and enriching experience.
As a team of six, we leveraged tools like Trello and Discord to
streamline our workflow and maintain clear communication.
Trello allowed us to organize tasks effectively, ensuring everyone had
a clear understanding of their responsibilities, while Discord
facilitated real-time discussions and troubleshooting.
From brainstorming ideas to tackling coding challenges, each member
brought unique strengths to the table, fostering a sense of mutual
respect and shared purpose.
Despite the occasional hurdles, such as aligning schedules and
resolving conflicting ideas, we prioritized open communication and
adaptability.
This project not only showcased the power of teamwork but also
reinforced the importance of cooperation, organization, and
perseverance in achieving a common goal.
5. Conclusion:
This analysis highlights the massive impact of specific songs and artists on Spotify's
platform.
Key takeaways include:
Certain songs like "Blinding Lights" dominate total streams due to their long-
standing popularity.
Recent songs such as "Seven" exhibit impressive daily activity, showcasing
their current relevance.
Lessons Learned
The importance of data cleaning for effective analysis cannot be
overstated.
Visualization is a powerful tool to uncover insights and
communicate results.
Future work could involve analyzing listener demographics or comparing
Spotify data with other streaming platforms for a broader perspective.