0% found this document useful (0 votes)

53 views6 pages

Statistics: Types, Data, and Measures

I - Statistics is the study of methods for collecting and analyzing quantitative data affected by many causes. Descriptive statistics describes data through summaries and visualizations, while inferential statistics makes predictions about populations from samples. II - Raw data are original facts and numbers without interpretation. Data types include nominal (labels without order), ordinal (ordered labels), discrete (can be counted), continuous interval (ordered with equal differences but no true zero), and continuous ratio (ordered with equal differences and a true zero). Common visualizations depend on the data type. III - Measures of central tendency summarize data locations. The mean is the average, the median is the middle value, and the mode is the most frequent value. Me

Uploaded by

Dhansu Tunnu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

53 views6 pages

Statistics: Types, Data, and Measures

Uploaded by

Dhansu Tunnu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Things to Remember

I - What is statistics

○ By Statistics, we mean methods specially adapted to the elucidation of quantitative data

affected to a marked extent by a multiplicity of causes”.
Yule and Kendal

● Difference between descriptive and inferential statistics

Basis of comparison Descriptive Statistics Inferential Statistics

Meaning Descriptive statistics seeks to Inferential statistics deals with

describe the data, but do not making inferences about a
attempt to make inferences population from a sample
from the sample to the whole
population

What it does ? Summarize, organize and Conclusion and prediction of

present the data in a data
meaningful way

II - Data

Data Vs Information - When analysts are bewildered by plethora of data, which do not make
any sense on the surface of it, they are looking for methods to classify data that would convey meaning.
The idea here is to help them draw the right conclusion. Data needs to be arranged into information.

Raw Data - Raw Data represent numbers and facts in the original format in which the data have been
collected. We need to convert the raw data into information for decision making.

Types of Data:
It is very important to have a good understanding of the different data types, also called measurement
scales, is a crucial prerequisite for doing Exploratory Data Analysis (EDA)

Types of Data

Nominal Data Ordinal Data

Categorical Data
( represents This data represents discrete units Ordinal data represent discrete
characteristics, also and use to label variables that have and ordered units.Order is
called as qualitative no quantitative value . Nominal important in case of this data
data ) data has no order

Example What is your gender Winners in Hackathon

● Male ● First
● Female ● Second
What programming languages you ● Third
know : Proficiency in programming
● Python ● High
● R ● Medium
● SAS ● Low

Visualization Bar chart and Pie chart

methods

Numerical Data Discrete Data Continuous Data

Data which can’t be measured but Data which can be measured but
can be counted. Data can take on can’t be counted
only certain values
Two types of Continuous data
● Interval Data - Ordered
units have the same
difference. But it has no
true zero points.
● Ratio Data -same as
interval values, with the
difference that they do
have an absolute zero

Example Team members in a cricket team- It Interval Data -Temperature of a

can be 11 but not 11.5 particular place
● -10
● -5
● 0
● 5
● 10
( here 0 has no true meaning )
Ratio Data - Equal difference
● 0
● 5
● 10
● 15

Visualization Boxplots and Histogram

Technique

III - Measures of Central Tendency

Measures of Central Tendency

Mean Median Mode

Meaning The mean is simply the The median is the The mode is the value
average and “middle” value or or category that occurs
considered the most midpoint in your data most often within the
reliable measure of data
central tendency.

The mean is computed

by the sum of all
values, divided by the
number of values.

Example Uber Rating - After With 10,000 people, the Which is the most
every ride, you give a mean salary might be popular video on
rating for your $45,000, but the range is youtube? How will you
experience and final $20,000 to $3,000,000 find out? - Ans - The
rating which comes for with a mean of one which has the
the driver is calculated $100,000. Mean is maximum likes
using mean affected by extreme
values. In order to get a
real figure in cases
where we have outliers
in data median is
calculated

III - Measures of Dispersion

Meaning - refers to the idea of variability within your data. It answers unambiguously the question
"What is the magnitude of departure from the average value for different groups having identical
averages?".

Different types of measures of dispersion

1) Range is the simplest of all measures of dispersion. It is calculated as the difference between the
maximum and minimum value in the data set.

Range =Largest Value − Lowest Value

The range is also the most affected by outliers as it uses only the extreme values.It is advisable to use
range only for very small distributions with no outliers

2) Interquartile Range is the distance between the lower and upper quartiles of a data.

IQR = Q3 - Q1

IQR is considered a good measure of variation in skewed datasets as it is resistant to outliers.

3) Standard deviation is a measure of how much data values deviate away from the mean.
Larger the standard deviation, the greater the amount of variation.

SD = √ Σ( Data value - arithmetic mean )2 / Total number of values in the dataset
Standard deviation is a good measure of variability for normal distributions or distributions that aren’t
extremely skewed

4) Coefficient of variation is equal to the standard deviation divided by the mean. It is a
useful measure for comparing the variability between two different datasets. For eg. if
we need to compare the sales of Apple mobile phones between India and the US, the
coefficient of variation would be used as it's a relative measure free of units of
measurement.

Standard deviation will not be useful as sales in India would be given in INR and for US in
dollars and won’t give any meaningful result,therefore coefficient of variation is used
and is also called as relative standard deviation

IV - Boxplot
Boxplot is five numbers that help describe the centre, spread and shape of data are:

● Xsmallest
● First Quartile (Q1)
● Median (Q2)
● Third Quartile (Q3)
● Xlargest

v) Skewness - It refers to a lack of symmetry. Skewness results in inequality in the values of mean,
median and mode and lower and upper quartiles are not situated at equal distance from median.

● Skewness may be positive or negative

● In case of positive skewness for a distribution
○ Mean > Median > Mode
○ ( Q3 - Median) > ( Median - Q1 )

● In case of negative skewness for a distribution

○ Mean < Median < Mode
○ ( Q3 - median) < ( median - Q1 )

Relationships among the five-number summary and distribution shape

Left-Skewed Symmetric Right-Skewed

Median – Xsmallest
Median – Xsmallest
Median – Xsmallest

> ≈ <

Xlargest – Median
Xlargest – Median
Xlargest – Median
Q Q Q
1 – Xsmallest 1 – Xsmallest 1 – Xsmallest

> ≈ <

X X X
largest – Q3 largest – Q3 largest – Q3

Median – Q1 Median – Q1 Median – Q1

> ≈ <

Q3 – Median Q3 – Median Q3 – Median

Statistics For Data Science PDF - Statistics-for-Data-Science PDF
No ratings yet
Statistics For Data Science PDF - Statistics-for-Data-Science PDF
14 pages
Descriptive Analytics Notes
No ratings yet
Descriptive Analytics Notes
6 pages
Chapter 2 BSC TY Statistical Data Analysis
No ratings yet
Chapter 2 BSC TY Statistical Data Analysis
124 pages
Chapter Test Topic: Normal Distribution: I. MULTIPLE CHOICE. Choose The Letter of The Best Answer. (1 Point Each)
100% (1)
Chapter Test Topic: Normal Distribution: I. MULTIPLE CHOICE. Choose The Letter of The Best Answer. (1 Point Each)
3 pages
3 Data Visualization
No ratings yet
3 Data Visualization
75 pages
2 - Introduction To Statistics
No ratings yet
2 - Introduction To Statistics
97 pages
2 - Statistics
No ratings yet
2 - Statistics
50 pages
Psychology Project
No ratings yet
Psychology Project
14 pages
Clock, Variation, Progression and Miscellaneous Problems
No ratings yet
Clock, Variation, Progression and Miscellaneous Problems
10 pages
IML U2
No ratings yet
IML U2
15 pages
Statistical Analysis - Descriptive Stat
No ratings yet
Statistical Analysis - Descriptive Stat
6 pages
Unit 1 - Business Statistics & Analytics
No ratings yet
Unit 1 - Business Statistics & Analytics
25 pages
Basic Statistics
No ratings yet
Basic Statistics
7 pages
Interpreting Test Score: Online Workshop 8602 Aiou
100% (1)
Interpreting Test Score: Online Workshop 8602 Aiou
39 pages
Share MBBS - Lecture 4 (1) - 1
No ratings yet
Share MBBS - Lecture 4 (1) - 1
68 pages
Math
No ratings yet
Math
50 pages
Lecture Notes 2 - Descriptive Statistics-1720598791715
No ratings yet
Lecture Notes 2 - Descriptive Statistics-1720598791715
21 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
63 pages
Statistics
No ratings yet
Statistics
63 pages
Assignment No 3
No ratings yet
Assignment No 3
16 pages
Central Limit Theorem
No ratings yet
Central Limit Theorem
21 pages
Statistics Basics for Beginners
No ratings yet
Statistics Basics for Beginners
2 pages
Day 3 Educational Statistics
No ratings yet
Day 3 Educational Statistics
37 pages
Stats
No ratings yet
Stats
109 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
73 pages
Biostatistics (Descriptive Statistics)
No ratings yet
Biostatistics (Descriptive Statistics)
30 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
14 pages
MS Excel in Data Analytics
No ratings yet
MS Excel in Data Analytics
56 pages
Analytics Compendium (Incl Stats)
No ratings yet
Analytics Compendium (Incl Stats)
31 pages
Basic of Statistics #5 (!!!)
No ratings yet
Basic of Statistics #5 (!!!)
49 pages
N M Shah Numericals
0% (2)
N M Shah Numericals
2 pages
Intro To Statistics - Descriptive Statistics and NPC - 20250225 - 171911 - 0000
No ratings yet
Intro To Statistics - Descriptive Statistics and NPC - 20250225 - 171911 - 0000
23 pages
Descriptive Statistics Basics
No ratings yet
Descriptive Statistics Basics
72 pages
Lesson 02 Probability and Statistics
No ratings yet
Lesson 02 Probability and Statistics
127 pages
ASA Notes
No ratings yet
ASA Notes
28 pages
Statistics
No ratings yet
Statistics
21 pages
Day 01-Basic Statistics
No ratings yet
Day 01-Basic Statistics
36 pages
PDS Unit4
No ratings yet
PDS Unit4
18 pages
Introduction To Statistics Lecture 7
No ratings yet
Introduction To Statistics Lecture 7
32 pages
Statistics
No ratings yet
Statistics
10 pages
2nd Unit - Statistics
No ratings yet
2nd Unit - Statistics
15 pages
Business Statistics - KMBN104
No ratings yet
Business Statistics - KMBN104
25 pages
Session 1 ISM May 2024
No ratings yet
Session 1 ISM May 2024
59 pages
CH 2 Lecture Notes
No ratings yet
CH 2 Lecture Notes
12 pages
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
No ratings yet
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
44 pages
2 Research - 2ND QT - Week 1 - 10 14 2024
No ratings yet
2 Research - 2ND QT - Week 1 - 10 14 2024
13 pages
Further Bound Reference
No ratings yet
Further Bound Reference
42 pages
Presentation 4
No ratings yet
Presentation 4
29 pages
It B.tech II Year II Sem DV (R18a0555)
No ratings yet
It B.tech II Year II Sem DV (R18a0555)
73 pages
1 Basics of Stat (Statistics IEM 2-2)
No ratings yet
1 Basics of Stat (Statistics IEM 2-2)
29 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
31 pages
Cba101 MT
No ratings yet
Cba101 MT
4 pages
Article Review 1 Eng
No ratings yet
Article Review 1 Eng
30 pages
Foundations or Research Analysis
No ratings yet
Foundations or Research Analysis
31 pages
Chapter1 Statistics
No ratings yet
Chapter1 Statistics
17 pages
Statistics Notes
No ratings yet
Statistics Notes
16 pages
LabModule - Exploratory Data Analysis - 2023ic
No ratings yet
LabModule - Exploratory Data Analysis - 2023ic
24 pages
Unit 4
No ratings yet
Unit 4
152 pages
f592b059 1643454320549
No ratings yet
f592b059 1643454320549
39 pages
Unit 3 - Descriptive Statistics
No ratings yet
Unit 3 - Descriptive Statistics
44 pages
Descriptive Stat
No ratings yet
Descriptive Stat
13 pages
Statistics for Analysts
100% (3)
Statistics for Analysts
27 pages
Descriptive Statistic
No ratings yet
Descriptive Statistic
37 pages
Quality Control in Education
No ratings yet
Quality Control in Education
2 pages
Unit-3 DS Students
No ratings yet
Unit-3 DS Students
35 pages
Chapter 1 - 2
No ratings yet
Chapter 1 - 2
36 pages
Statistics and Math Formulas
No ratings yet
Statistics and Math Formulas
8 pages
Statistics Frequency Distribution Table
No ratings yet
Statistics Frequency Distribution Table
7 pages
Data Analysis for Students
No ratings yet
Data Analysis for Students
7 pages
E300 PDF
No ratings yet
E300 PDF
24 pages
What Is Standard Deviation?
No ratings yet
What Is Standard Deviation?
4 pages
Intro to Normal Distribution
No ratings yet
Intro to Normal Distribution
10 pages
18bge14a U3
No ratings yet
18bge14a U3
18 pages
Business Stats for Students
No ratings yet
Business Stats for Students
101 pages
Summer 578 Assignment 2 Solutions
100% (1)
Summer 578 Assignment 2 Solutions
13 pages
Data Preprocessing Techniques in ML
No ratings yet
Data Preprocessing Techniques in ML
12 pages
Enabling Assessment in Probability Distribution
No ratings yet
Enabling Assessment in Probability Distribution
2 pages
Stat1 (Grade 11)
No ratings yet
Stat1 (Grade 11)
3 pages
Probability and Stochastic Processes
No ratings yet
Probability and Stochastic Processes
24 pages
C-529 Set-A Junior Auditor Post Code 672
No ratings yet
C-529 Set-A Junior Auditor Post Code 672
32 pages
Business Statistics and RM: Hamendra Dangi 9968316938
No ratings yet
Business Statistics and RM: Hamendra Dangi 9968316938
21 pages
Statistics: Central Tendency & Data Analysis
No ratings yet
Statistics: Central Tendency & Data Analysis
22 pages
Digital Remote Sensing Imagery Guide
No ratings yet
Digital Remote Sensing Imagery Guide
8 pages
Business Statistics Problems
No ratings yet
Business Statistics Problems
20 pages
Applied Statistics: Discrete & Continuous Distributions
No ratings yet
Applied Statistics: Discrete & Continuous Distributions
23 pages
Chapter 3: Review On Statisti Cs and Databases: Descriptive Statistics
No ratings yet
Chapter 3: Review On Statisti Cs and Databases: Descriptive Statistics
17 pages
TI-Nspire Sampling Guide
No ratings yet
TI-Nspire Sampling Guide
8 pages
MATH 11 MODULE 10 Measures of Central Tendency Grouped Data
No ratings yet
MATH 11 MODULE 10 Measures of Central Tendency Grouped Data
7 pages
Biostatistics 100A: Laboratory Two Spring 2021 Computer Exercise and Competency Assessment
No ratings yet
Biostatistics 100A: Laboratory Two Spring 2021 Computer Exercise and Competency Assessment
4 pages

Statistics: Types, Data, and Measures

Uploaded by

Statistics: Types, Data, and Measures

Uploaded by

Things to Remember

○ By Statistics, we mean methods specially adapted to the elucidation of quantitative data

● ​Difference between descriptive and inferential statistics

Basis of comparison Descriptive Statistics Inferential Statistics

Meaning Descriptive statistics seeks to Inferential statistics deals with

What it does ? Summarize, organize and Conclusion and prediction of

Nominal Data Ordinal Data

Example What is your gender Winners in Hackathon

Visualization Bar chart and Pie chart

Numerical Data Discrete Data Continuous Data

Example Team members in a cricket team- It Interval Data -Temperature of a

Visualization Boxplots and Histogram

III - Measures of Central Tendency

Measures of Central Tendency

Mean Median Mode

The mean is computed

III - Measures of Dispersion

Different types of measures of dispersion

Range =Largest Value − Lowest Value

IQR is considered a good measure of variation in skewed datasets as it is resistant to outliers.

● Skewness may be positive or negative

● In case of negative skewness for a distribution

Relationships among the five-number summary and distribution shape

Left-Skewed Symmetric Right-Skewed

Median – Q​1 Median – Q​1 Median – Q1​

Q3​ ​– Median Q​3 –​ Median Q​3 –​ Median

You might also like

● Difference between descriptive and inferential statistics

Median – Q1 Median – Q1 Median – Q1

Q3 – Median Q3 – Median Q3 – Median