DMML Lab Report 02

This lab report details the process of data visualization using Python libraries in a Data Mining and Machine Learning course. It includes code snippets for connecting to Google Drive, importing libraries, loading datasets, and creating various visualizations such as scatter plots, bar plots, pie charts, histograms, box plots, and heatmaps. The report is submitted by Fardus Alam and reviewed by Sadman Sadik Khan at Daffodil International University.

Uploaded by

Atick Arman

Lab report

Course code: CSE326

Course Title: Data Mining and Machine Learning Lab
Lab report: 02
Topic: Data Visualization

Submitted To:
Name: Sadman Sadik Khan
Designation: Lecturer
Department: CSE
Daffodil International University

Submitted By:
Name: Fardus Alam
ID: 222-15-6167
Section: 62-G
Department: CSE
Daffodil International University

Submission Date: 15-03-2025

Code:
from google.colab import drive
drive.mount('/content/drive')

Explanation:
Mounts Google Drive in the Google Colab session so that files stored on Drive are accessible under /content/drive.

Code:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns  # alias used by the plotting cells below

Explanation:
Imports the libraries needed for DataFrame handling (pandas, NumPy) and visualization (Matplotlib, Seaborn).

Code:
df = pd.read_csv('/content/drive/MyDrive/lab dataset data mining/healthcare-dataset-stroke-data2.csv')
df.head()

Output:
Explanation:
Loads the CSV file from Google Drive into the DataFrame df and shows its first 5 rows.
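A minimal sketch of the same loading step on a small in-memory CSV, so that it can run without Google Drive. The column names mirror those used later in this report, but the three rows are made up for illustration and are not taken from the real dataset.

```python
import io
import pandas as pd

# Hypothetical sample rows; the real file lives on Google Drive.
csv_text = """gender,age,bmi,avg_glucose_level,work_type,stroke
Male,67,36.6,228.69,Private,1
Female,61,,202.21,Self-employed,1
Female,49,34.4,171.23,Private,0
"""

# read_csv accepts a file path or any file-like object.
sample_df = pd.read_csv(io.StringIO(csv_text))

# head() returns the first 5 rows by default; head(2) returns the first 2.
first_rows = sample_df.head(2)
print(first_rows)
```

The empty bmi field in the second row is read as a missing value (NaN), which is why the non-null counts of a loaded file are worth checking.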

Code:

df.info()

Output:

Explanation:
Shows basic information about df, such as the non-null count and data type of each column.

Code:
df.describe()

Output:

Explanation:
Shows summary statistics for each numerical column: count, mean, standard deviation (std), min, max, and quartiles.
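The rows of the describe() summary correspond to ordinary Series methods, and its count row skips missing values. A small sketch on made-up numbers (not the lab dataset) illustrates both points:

```python
import numpy as np
import pandas as pd

# Hypothetical values for illustration only.
sample_df = pd.DataFrame({
    "age": [20, 40, 60, 80],
    "bmi": [22.0, np.nan, 30.0, 26.0],
})

summary = sample_df.describe()

# count ignores missing values, so bmi has 3 entries, not 4.
print(summary.loc["count"])

# Each row of the summary matches the corresponding Series method.
print(summary.loc["mean", "age"], sample_df["age"].mean())
print(summary.loc["50%", "age"], sample_df["age"].median())

# Missing values per column, the complement of the non-null counts.
missing = sample_df.isna().sum()
print(missing)
```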

Code: Scatter Plot

x = df['age']
y = df['bmi']

sns.scatterplot(data=df, x=x, y=y, hue='gender')
plt.show()

Output:
Explanation:
Seaborn's scatterplot() function creates scatter plots, which visualize the relationship between
two numerical variables (here, age and bmi). The color, size, and style of the points can vary with
additional variables. Here, hue is set to the gender column, so each gender is drawn in a different color.
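What hue does can be sketched with plain Matplotlib: split the rows by the categorical column and draw each group with its own color and legend entry. The data below is made up, and this only approximates the idea; it is not Seaborn's internal implementation.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical rows for illustration only.
sample_df = pd.DataFrame({
    "age": [25, 40, 58, 33, 71, 46],
    "bmi": [21.5, 27.3, 30.1, 24.8, 28.9, 26.0],
    "gender": ["Male", "Female", "Female", "Male", "Female", "Male"],
})

fig, ax = plt.subplots()

# One scatter call per category gives each group its own color and label.
for gender, group in sample_df.groupby("gender"):
    ax.scatter(group["age"], group["bmi"], label=gender)

ax.set_xlabel("age")
ax.set_ylabel("bmi")
ax.legend(title="gender")

legend_labels = [text.get_text() for text in ax.get_legend().get_texts()]
print(legend_labels)
```

In a Colab notebook the matplotlib.use("Agg") line can be dropped so that the figure is displayed inline.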

Code: scatter plot using matplotlib

plt.scatter(df['age'], df['bmi'], c='g', label='age & bmi')
plt.scatter(df['age'], df['avg_glucose_level'], c='b', label='age & avg_glucose_level')

plt.title('Scatter plot using matplotlib with different color and label')
plt.xlabel('age')
plt.ylabel('bmi & avg_glucose_level')

plt.legend()
plt.show()

Output:
Explanation:
This code creates a scatter plot using Matplotlib to visualize two relationships, age vs. bmi and
age vs. avg_glucose_level, in different colors for distinction. It helps compare how age relates to
both bmi and avg_glucose_level.

Code: Barplot
plt.title('Barplot between work_type & stroke')
sns.barplot(data=df, x='work_type', y='stroke', hue='gender', errorbar=None)
plt.show()

Output:

Explanation:
The code above creates a bar plot using Seaborn that shows the mean of the stroke column for each work type,
with separate bars for each gender. Setting errorbar=None hides the error bars.
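By default, sns.barplot() draws the mean of the y column for each group, so the bar heights can be reproduced with a pandas groupby. A minimal sketch, assuming stroke is coded as 0/1 (the report does not state this) and using made-up rows and category names:

```python
import pandas as pd

# Hypothetical rows; the category names are illustrative assumptions.
sample_df = pd.DataFrame({
    "work_type": ["Private", "Private", "Private", "Govt_job", "Govt_job", "Private"],
    "gender": ["Male", "Male", "Female", "Female", "Female", "Female"],
    "stroke": [1, 0, 0, 1, 0, 0],
})

# The mean of a 0/1 column is the fraction of rows equal to 1,
# so each value below is the stroke rate within one (work_type, gender) group.
bar_heights = sample_df.groupby(["work_type", "gender"])["stroke"].mean()
print(bar_heights)
```

Read this way, a bar of height 0.5 means that half of the rows in that group have stroke equal to 1.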
Code: Pie chart
gender = df['gender'].value_counts()
colors = ['r', 'g', 'b']

plt.pie(gender, labels=gender.index, autopct='%1.2f%%',
        colors=colors, startangle=0, explode=(0.1, 0, 0),
        wedgeprops={'edgecolor': 'black'})

plt.title("Pie Chart")
plt.show()

Output:

Explanation:
Creates a pie chart of the gender column using Matplotlib.
Key terms:
• df['gender'].value_counts(): Counts the rows for each gender value.
• colors = ['r', 'g', 'b']: Assigns red, green, and blue to the slices.
• autopct='%1.2f%%': Displays percentages with two decimal places.
• explode=(0.1, 0, 0): Slightly separates the first slice for emphasis.
• wedgeprops={'edgecolor': 'black'}: Adds black borders for clarity.
• startangle=0: Starts the chart from 0 degrees.
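A hard-coded explode=(0.1, 0, 0) only works when the column has exactly three categories, because Matplotlib raises a ValueError when explode and the data differ in length. The sketch below, on made-up values, builds explode from the number of categories and also computes the percentages that autopct prints:

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical gender values for illustration only.
genders = pd.Series(["Female", "Male", "Female", "Female", "Male", "Other", "Female", "Male"])
counts = genders.value_counts()

# Percentages corresponding to the labels drawn by autopct.
percentages = (counts / counts.sum() * 100).round(2)
print(percentages)

# Offset only the first slice, whatever the number of categories is.
explode = [0.1] + [0.0] * (len(counts) - 1)

fig, ax = plt.subplots()
ax.pie(counts, labels=counts.index, autopct="%1.2f%%",
       explode=explode, wedgeprops={"edgecolor": "black"})
ax.set_title("Pie Chart")
```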

Code: Histogram
sns.histplot(data=df, x='work_type', color='g')
plt.title('Histogram')
plt.show()

Output:
Explanation:
This code creates a histogram using Seaborn to visualize the distribution of the work_type variable in the
DataFrame df.

Key terms:

• sns.histplot(): Plots the histogram for the specified variable.
• x='work_type': Uses the work_type column as the data to plot.
• color='g': Sets the color of the bars to green.
• plt.title(): Adds the title "Histogram".
• plt.show(): Displays the plot.
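The bar heights in a histogram are counts. For a categorical column such as work_type, that means one bar per category, while a numeric column is first grouped into bins. A rough sketch of both cases on made-up values, using pandas and NumPy rather than Seaborn itself:

```python
import numpy as np
import pandas as pd

# Hypothetical rows; the category names and bin edges are illustrative assumptions.
sample_df = pd.DataFrame({
    "work_type": ["Private", "Govt_job", "Private", "Self-employed", "Private"],
    "bmi": [21.0, 24.5, 27.0, 31.5, 33.0],
})

# Categorical column: one bar per category, height = number of rows.
category_counts = sample_df["work_type"].value_counts()
print(category_counts)

# Numeric column: values are grouped into bins, height = rows per bin.
bin_counts, bin_edges = np.histogram(sample_df["bmi"], bins=[20, 25, 30, 35])
print(bin_counts, bin_edges)
```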

Code: Box plot

sns.boxplot(data=df, x='bmi', hue='gender')
plt.title('Box Plot')
plt.show()

Output:
Explanation:
The code above creates a box plot using Seaborn to compare the distribution of bmi across the genders
in the DataFrame df.

Key terms:

• sns.boxplot(): Plots the box plot.
• x='bmi': Plots the bmi values on the x-axis.
• hue='gender': Differentiates the data by gender using different colors.
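The numbers a box plot draws can be computed directly: the box spans the first to third quartile, the line inside it is the median, and with the default whisker setting of 1.5 times the interquartile range (IQR), points beyond the whisker limits are drawn as outliers. A minimal sketch on made-up bmi values, where box_stats is a hypothetical helper and not part of Seaborn:

```python
import pandas as pd

# Hypothetical values for illustration only.
sample_df = pd.DataFrame({
    "gender": ["Male"] * 5 + ["Female"] * 5,
    "bmi": [20.0, 22.0, 24.0, 26.0, 40.0, 21.0, 23.0, 25.0, 27.0, 29.0],
})

def box_stats(values: pd.Series) -> pd.Series:
    """Quartiles, IQR, and outlier count for one group (hypothetical helper)."""
    q1, median, q3 = values.quantile([0.25, 0.5, 0.75])
    iqr = q3 - q1
    lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    outliers = values[(values < lower) | (values > upper)]
    return pd.Series({"q1": q1, "median": median, "q3": q3,
                      "iqr": iqr, "n_outliers": len(outliers)})

# One row of statistics per gender, mirroring one box per hue level.
stats = pd.DataFrame(
    {gender: box_stats(values) for gender, values in sample_df.groupby("gender")["bmi"]}
).T
print(stats)
```

In this sample the Male value of 40.0 lies above the upper limit of 32.0, so it would be drawn as a separate outlier point.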

Code: Heatmap
new_df = df[['age', 'bmi', 'avg_glucose_level', 'stroke']]

plt.figure(figsize=(10, 5))
sns.heatmap(new_df.corr(), annot=True, linewidths=0.2)
plt.title('Heatmap')
plt.show()

Output:
Explanation:
This code draws a heatmap to visualize the correlation matrix of the selected columns in the new_df DataFrame.

Key Features:

• new_df.corr(): Calculates the correlation coefficients between the columns (age, bmi, avg_glucose_level, stroke).
• sns.heatmap(): Plots the heatmap with annotations showing the correlation values.
• annot=True: Displays the correlation values inside the heatmap cells.
• linewidths=0.2: Adds thin lines between cells for better separation.
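Each cell of the heatmap is a Pearson correlation coefficient, which is the default method of DataFrame.corr(). The sketch below, on made-up values, computes one coefficient by hand and compares it with the matching cell of the matrix:

```python
import numpy as np
import pandas as pd

# Hypothetical values for illustration only.
sample_df = pd.DataFrame({
    "age": [30, 45, 52, 61, 70],
    "bmi": [22.1, 27.4, 25.0, 30.2, 28.5],
    "avg_glucose_level": [85.0, 110.5, 95.2, 140.8, 180.3],
})

corr_matrix = sample_df.corr()

# Pearson r by hand: summed products of deviations from the mean,
# divided by the square root of the product of summed squared deviations.
x, y = sample_df["age"], sample_df["bmi"]
r_manual = ((x - x.mean()) * (y - y.mean())).sum() / np.sqrt(
    ((x - x.mean()) ** 2).sum() * ((y - y.mean()) ** 2).sum()
)

print(corr_matrix.round(3))
print(round(r_manual, 3))
```

The matrix is symmetric with ones on the diagonal, since every column correlates perfectly with itself.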
