0% found this document useful (0 votes)

63 views3 pages

Data Visualization Assignment Guide

The document outlines instructions for a data visualization assignment, emphasizing the need for complete submissions including code and documentation. It covers various tasks such as creating univariate and bivariate plots, understanding bin parameters, and analyzing skewness and distribution of data. Additionally, it provides guidelines for evaluating work post-submission and hints at following the CRISP-ML(Q) methodology.

Uploaded by

gauthamarvndhh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views3 pages

Data Visualization Assignment Guide

Uploaded by

gauthamarvndhh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 3

2b.

Graphical Representation
Instructions:
Please share your answers filled in-line in the word document. Submit code
separately wherever applicable.

Please ensure you update all the details:

Name: _____________ Batch ID: ___________
Topic: Data Visualization

Guidelines:
1. An assignment submission is considered complete only when the correct and executable code(s) is
submitted along with the documentation explaining the method and results. Failing to submit either
of those will be considered an invalid submission and will not be considered a correct submission.
2. Ensure that you submit your assignments correctly. Resubmission is not allowed.
3. Post the submission you can evaluate your work by referring to the keys provided. (will be available
only post the submission).

Hints: Follow CRISP-ML(Q) methodology steps, where were appropriate.

1. Data Understanding: work on each feature of the dataset to create a data
dictionary as displayed in the image below:

Make a table as shown above and provide information about the features such as its data
type and its relevance to the model building. And if not relevant, provide reasons and a
description of the feature.
Problem Statements:

1. Univariate plots for UNIV data (Plot must have Title, X & Y label)
A) Plot numerical column with 3 different plots ?
B) What are bin parameters? What are the methods to define the number of bins
and bin sizes ?
○ Ans:)
● Number of bins, Bin width, Bin edges, Starting point.
● Number of bins - Square root method , Sturges' formula , Freedman-Diaconis' rule
● Bin width = (max value - min value) / number of bins

C) Why do density plots exceed the range values of the column ?

○ Ans:)
● The density plot can extend beyond the actual values because it is an estimate of the
probability density function of the underlying distribution that the data comes from.

D) Plot categorical columns by taking unique values ?

© 360DigiTMG. All Rights Reserved.

2. Bivariate graphs for UNIV data (Plot must be readable [use rotation], have all labels)
A) Plot 2 numerical columns with scatter plot [use grid] ?
B) 2 Different plots for plotting a numerical column with a categorical column (bar,
line) ?
C) How are bar plots different from histogram?
Ans:)

Bar Plots Histograms

Used to compare the values of Used to show the distribution of a

different categories or groups. single variable.

Can be vertical or horizontal. Can only be vertical.

Bars are separated and do not touch Bars are touching each other and form
each other. a continuous distribution.

Suitable for categorical or discrete Suitable for continuous or numerical

data. data.

X-axis represents categories or groups. X-axis represents the range of the

variable being measured.

Y-axis represents a numerical value. Y-axis represents the frequency or

count of the data in each bin.

3. Plot multivariate graphs (correlation heatmap, pairplot)

A) Plot for only numerical data ?

B) Plot multivariate graphs for both numerical and categorical columns ?
C) What does it mean when a correlation value says 1? When it is negative? When it is
zero?
Ans:)
Correlation value of 1: It means that there is a perfect positive correlation between the two
variables. This means that when one variable increases, the other variable also increases
proportionally, and vice versa.
Correlation value of -1: It means that there is a perfect negative correlation between the two
variables. This means that when one variable increases, the other variable decreases
proportionally, and vice versa.
Correlation value of 0: It means that there is no correlation between the two variables. This
means that there is no relationship between the variables. However, it's important to note that
a correlation of 0 does not necessarily imply independence between the variables, as there may
be other types of relationships that are not captured by correlation.

4. Plot Skewness & Probability distribution for each column of marks data. (Hist, box,
density)

A) What is normally distributed and What will be the relationship between mean,
median & mode ?
Ans:)
● Math column is normally distributed. Generally, Mean = Median = Mode when data is
normally distributed.

B) Which data variables are positively skewed and What will be the relationship
between mean, median & mode ?
Ans:
● Science column is positively distributed. Generally, Mean > Median > Mode when data is
positively distributed.

C) What are negatively skewed/distributed and What will be the relationship

between mean, median & mode
Ans:)
● Social studies column is positively distributed Generally Mean < Median < Mode when
data is negatively distributed.

D) What are the distinctive differences between skewness and distribution?

Ans:)
Distinctive differences between skewness and distribution:
● Distribution refers to the shape or spread of the data points in a dataset, while
skewness refers to the degree and direction of the asymmetry of the
distribution.
● Skewness measures how far the data is spread out on one side of the mean
compared to the other side, while distribution refers to the values of the data
points themselves.
● A distribution can be symmetrical, positively skewed, or negatively skewed, while
skewness can be positive, negative, or zero.
● Skewness can be affected by extreme values, while distribution is not.

STAB22 Lecture's Notes
No ratings yet
STAB22 Lecture's Notes
64 pages
Different Types of Graphs
No ratings yet
Different Types of Graphs
11 pages
Aphical Representation
No ratings yet
Aphical Representation
8 pages
Aphical Representation
No ratings yet
Aphical Representation
12 pages
Basic Functions of Excel
No ratings yet
Basic Functions of Excel
76 pages
QBSTS C1
No ratings yet
QBSTS C1
27 pages
L1 QM02 High Yield Notes
No ratings yet
L1 QM02 High Yield Notes
10 pages
Data Visualization UNIT II
No ratings yet
Data Visualization UNIT II
26 pages
Mvda - Question Bank
No ratings yet
Mvda - Question Bank
14 pages
STATISTICS Reviewer
No ratings yet
STATISTICS Reviewer
4 pages
Introduction To Statistics 2024-2025
No ratings yet
Introduction To Statistics 2024-2025
40 pages
Algebra 1 Unit 6 Describing Data Notes
No ratings yet
Algebra 1 Unit 6 Describing Data Notes
13 pages
Computatm Solution
No ratings yet
Computatm Solution
6 pages
Different Types of Graphs
No ratings yet
Different Types of Graphs
11 pages
U1 Exploring One-Variable Data
No ratings yet
U1 Exploring One-Variable Data
22 pages
DEV UNIT 3,4 MCQs
No ratings yet
DEV UNIT 3,4 MCQs
6 pages
Applied - Data - Science MODULE 3 SEM 8
No ratings yet
Applied - Data - Science MODULE 3 SEM 8
41 pages
IT326 - Ch2
No ratings yet
IT326 - Ch2
44 pages
Concepts and Techniques: - Chapter 2
No ratings yet
Concepts and Techniques: - Chapter 2
29 pages
GAC - Math Definition - Statistics
100% (1)
GAC - Math Definition - Statistics
3 pages
Bam 212
No ratings yet
Bam 212
7 pages
Word File For Prob and Stats
No ratings yet
Word File For Prob and Stats
25 pages
Assignment 3
No ratings yet
Assignment 3
4 pages
Mathematics in The Modern World
No ratings yet
Mathematics in The Modern World
2 pages
Introduction to Statistics Basics
No ratings yet
Introduction to Statistics Basics
23 pages
DS - Unit 3
No ratings yet
DS - Unit 3
37 pages
Describing The Data
No ratings yet
Describing The Data
42 pages
Unit 01 Statistics
No ratings yet
Unit 01 Statistics
10 pages
Ids 3,4,5. Exclusive
No ratings yet
Ids 3,4,5. Exclusive
43 pages
STK11O - Chapter 1-7 Notes
No ratings yet
STK11O - Chapter 1-7 Notes
22 pages
CHAPTER 2 Descriptive Statistics
No ratings yet
CHAPTER 2 Descriptive Statistics
5 pages
Business Club: Basic Statistics
No ratings yet
Business Club: Basic Statistics
26 pages
12-Exploratory Data Analysis, Anomaly Detection-28!03!2023
No ratings yet
12-Exploratory Data Analysis, Anomaly Detection-28!03!2023
79 pages
Test 1 Notes
No ratings yet
Test 1 Notes
6 pages
DWDM Unit-2
No ratings yet
DWDM Unit-2
19 pages
DMV
No ratings yet
DMV
56 pages
Data Analytics Summary
No ratings yet
Data Analytics Summary
80 pages
Displaying Descriptive Statistics: Chapter 2 Map
No ratings yet
Displaying Descriptive Statistics: Chapter 2 Map
58 pages
Eda U3
No ratings yet
Eda U3
54 pages
OCR MEI S1 Summary Sheets
No ratings yet
OCR MEI S1 Summary Sheets
9 pages
Dev Answer Key
100% (1)
Dev Answer Key
17 pages
ADDB - Week 1
No ratings yet
ADDB - Week 1
44 pages
2/ Organizing and Visualizing Variables: Dcova
No ratings yet
2/ Organizing and Visualizing Variables: Dcova
4 pages
Different Types of Graphs
No ratings yet
Different Types of Graphs
11 pages
Lecture2 SummarizingData
No ratings yet
Lecture2 SummarizingData
33 pages
Graphical Representation of Data
No ratings yet
Graphical Representation of Data
4 pages
Solution Manual For Business Statistics Communicating With Numbers 1st Edition by Jaggia Kelly ISBN 0077501373 9780077501372 Download
100% (2)
Solution Manual For Business Statistics Communicating With Numbers 1st Edition by Jaggia Kelly ISBN 0077501373 9780077501372 Download
161 pages
MATH210 - Stats Custom Text
No ratings yet
MATH210 - Stats Custom Text
145 pages
IOT-Domain Analyst
No ratings yet
IOT-Domain Analyst
68 pages
DATA202-02 - Descriptive Statistics (Part 2)
No ratings yet
DATA202-02 - Descriptive Statistics (Part 2)
18 pages
SLIDES Statistics-Chapter 2
No ratings yet
SLIDES Statistics-Chapter 2
31 pages
BUSS1020
No ratings yet
BUSS1020
6 pages
Descriptive Statistics Lecture 7-9 Measures of Shape 2025
No ratings yet
Descriptive Statistics Lecture 7-9 Measures of Shape 2025
40 pages
Data Visualization Essentials
No ratings yet
Data Visualization Essentials
87 pages
Guiang Mamow Paper 1 Statistical Terms
No ratings yet
Guiang Mamow Paper 1 Statistical Terms
5 pages
AP Statistics Chapter 1-3 Outlines
No ratings yet
AP Statistics Chapter 1-3 Outlines
9 pages
Biostatistics in A Nutshell
No ratings yet
Biostatistics in A Nutshell
45 pages
Asgn-6 Soln
No ratings yet
Asgn-6 Soln
16 pages
Wet-Gas Metering for Beginners
No ratings yet
Wet-Gas Metering for Beginners
28 pages
Comment On Decagonal and Quasi Crystalline Tilling in Medieval Islamic Architecture
No ratings yet
Comment On Decagonal and Quasi Crystalline Tilling in Medieval Islamic Architecture
3 pages
Risks: Machine Learning in P&C Insurance: A Review For Pricing and Reserving
No ratings yet
Risks: Machine Learning in P&C Insurance: A Review For Pricing and Reserving
26 pages
Assignment Maths
No ratings yet
Assignment Maths
2 pages
MA 1140: Elementary Linear Algebra: Dipankar Ghosh (IIT Hyderabad)
No ratings yet
MA 1140: Elementary Linear Algebra: Dipankar Ghosh (IIT Hyderabad)
16 pages
Automation Chapter 4
No ratings yet
Automation Chapter 4
44 pages
MATLAB for Earthquake Data Analysis
No ratings yet
MATLAB for Earthquake Data Analysis
14 pages
Friction Losses in Pipes Consisting of Bends and Elbows
86% (28)
Friction Losses in Pipes Consisting of Bends and Elbows
11 pages
ADC SNR Jitter
No ratings yet
ADC SNR Jitter
6 pages
History of Business Statistics
No ratings yet
History of Business Statistics
3 pages
Optimization of The SWAT Model To Adequately Predict Different Segments of A Managed Streamflow Hydrograph
No ratings yet
Optimization of The SWAT Model To Adequately Predict Different Segments of A Managed Streamflow Hydrograph
21 pages
LSS Curriculm
No ratings yet
LSS Curriculm
3 pages
JMST Template v2
No ratings yet
JMST Template v2
2 pages
An Introduction To Synthetic CDO and Its Structure
100% (2)
An Introduction To Synthetic CDO and Its Structure
39 pages
The Secant Method
No ratings yet
The Secant Method
7 pages
Me6301 Engineering Thermodynamics Nov Dec 2011
No ratings yet
Me6301 Engineering Thermodynamics Nov Dec 2011
3 pages
DPS FINAL MATHS PAPER 2023 (1) (Practice)
No ratings yet
DPS FINAL MATHS PAPER 2023 (1) (Practice)
4 pages
Algebra Notes From The Underground 1st Edition Paolo Aluffi Instant Download
No ratings yet
Algebra Notes From The Underground 1st Edition Paolo Aluffi Instant Download
82 pages
PID Controller
No ratings yet
PID Controller
5 pages
Mtap G4S1 Student
No ratings yet
Mtap G4S1 Student
2 pages
GATE 2021 Mechanical Engineering Syllabus
No ratings yet
GATE 2021 Mechanical Engineering Syllabus
11 pages
Stas 2634 1980 en
No ratings yet
Stas 2634 1980 en
25 pages
QUBE-Servo 2 - Second Order Systems Workbook (Student)
No ratings yet
QUBE-Servo 2 - Second Order Systems Workbook (Student)
6 pages
Engineering Deflection Analysis
No ratings yet
Engineering Deflection Analysis
7 pages
CHANDRA DZDA STAT6174037 ProbabilityTheoryandAppliedStatistics
No ratings yet
CHANDRA DZDA STAT6174037 ProbabilityTheoryandAppliedStatistics
17 pages
HCFLCM Questions
No ratings yet
HCFLCM Questions
4 pages
Lss. Uace Math 2
No ratings yet
Lss. Uace Math 2
6 pages
List of Syllabus of All Subjects in IISc
100% (1)
List of Syllabus of All Subjects in IISc
225 pages
Mathematics Chapter Tracker - JEE Main 2026 - MathonGo
No ratings yet
Mathematics Chapter Tracker - JEE Main 2026 - MathonGo
35 pages

Data Visualization Assignment Guide

Uploaded by

Data Visualization Assignment Guide

Uploaded by

2b.

Please ensure you update all the details:

Hints: Follow CRISP-ML(Q) methodology steps, where were appropriate.

C) Why do density plots exceed the range values of the column ?

D) Plot categorical columns by taking unique values ?

© 360DigiTMG. All Rights Reserved.

Bar Plots Histograms

Used to compare the values of Used to show the distribution of a

Can be vertical or horizontal. Can only be vertical.

Suitable for categorical or discrete Suitable for continuous or numerical

X-axis represents categories or groups. X-axis represents the range of the

Y-axis represents a numerical value. Y-axis represents the frequency or

3. Plot multivariate graphs (correlation heatmap, pairplot)

A) Plot for only numerical data ?

© 360DigiTMG. All Rights Reserved.

C) What are negatively skewed/distributed and What will be the relationship

D) What are the distinctive differences between skewness and distribution?

© 360DigiTMG. All Rights Reserved.

You might also like