0% found this document useful (0 votes)

6 views41 pages

Basics and Descriptive Statistics

The document provides an overview of basic concepts in statistics, including definitions of statistics, data, variables, populations, and samples. It explains the differences between descriptive and inferential statistics, as well as various scales of measurement and methods for summarizing data. Additionally, it covers measures of central tendency and dispersion, highlighting their importance in statistical analysis.

Uploaded by

elahavainfodesk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views41 pages

Basics and Descriptive Statistics

Uploaded by

elahavainfodesk

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 41

BASIC CONCEPTS IN

STATISTICS
Dr. Omosivie Maduka
Consultant Public Health Physician
Senior Lecturer,
Department of Preventive and Social Medicine
University of Port Harcourt
BASIC CONCEPTS
What is Statistics?
• Statistics is the scientific method of collecting,
organizing, summarizing, analyzing, interpreting
and presenting data.
• Research is therefore the process of arriving at
dependable solutions to problems through
statistics”.
• The branch of statistics that deals primarily with
the biological sciences and medical/health-
related disciplines is the Biostatistics or Medical
Statistics or Biometrics.
DATA AND VARIABLE
Data is a Latin word and plural form of datum, but since statistics is
about groups of person or objects, the word data predominates.

Data is information with unit of analysis, variable and value.

Unit of Analysis is the subject of interest

Variable is the characteristic of the subject of interest, and has

the tendency to vary or change or fluctuate

Value is the reading, recording or measurement obtained

from a data set.

Most times data and variable are used interchangeably, but

variable is however a component of data that has the ability
to take on different values
Classification of Data/Variable
1. Quantitative (Numerical)
• Continuous (Infinite)
• Discrete (Finite)
2. Qualitative (Categorical)
• Dichotomous (Binary)
• Nominal
• Ordinal
POPULATIONS AND SAMPLES
Population
• Population is a set of persons (or objects) having a common
observable characteristic (popular population).
• Population can also be referred to as the observable
characteristics of persons or things (statistical population).
• In statistics two types of populations can be distinguished
based of the size populations (infinite and finite).
• Infinite populations can be thought of as large populations
while finite populations are those that are smaller.
• The distinction is arbitrary, although some researchers
regards populations that are 10,000 or more as large
populations, while those that are less than 10,000 are
referred to as small populations.
POPULATIONS AND SAMPLES
Sample A sample is a subset of a population.

The distinction between population and sample

is crucial to understanding of research.

This is because more often than not, the researcher is

not able to carry out observation on all the units
constituting a population for cost and logistic reasons.

He/she can still conduct the research by observing a

subset of the population by taking a representative
sample after which an extrapolation is made from the
results gotten from the sample to the population.
PARAMETER AND STATISTIC
Parameter
• A parameter is defined as any summarization of
the elements of a population
Statistic
• Statistic is any summarization of the elements of
a sample
• The distinction between parameters and statistics
is so fundamental to statistical thinking that two
different conventions are commonly employed
for their representation.
• The most popularly use alphabets as shown
below.
Symbols for representing parameter and
statistic

Summary of characteristic Parameter Statistic

Mean μ X
Standard deviation σ S
Variance σ2 S2
Proportion ᴧ p
SCALES OF MEASUREMENT IN
STATISTICS
• The scales of measurement were first described
by Stanley S. Stevens in his book entitled “On the
theory of scales of measurement” in 1946.
• According to Stevens the measurement process
can be conceived of as existing on four different
levels which he referred to as the nominal,
ordinal, interval, and ratio scales.
(Mnemonics: “NOIR”)
SCALES OF MEASUREMENT IN STATISTICS
Nominal Scale
• This assigns variables into categories without ranking
• Examples of variables measured: sex (male/female), treatment type
(surgery/chemotherapy), blood group (A, B, AB, O).
Ordinal Scale
• This assigns variables into categories with ranking, but no attribute of how much more
and how much less
• Examples of variables measured: severity of disease (mild/moderate/severe), BMI
(underweight, normal weight, overweight, obese).
SCALES OF MEASUREMENT IN STATISTICS
Interval Scale
• This assigns variables into categories with ranking and with the attribute of how much
more and how much less
• Examples of variables measured: temperature measured in Centigrade or Fahrenheit
Ratio Scale
• This assigns variables into categories with ranking and with the attribute of how much
more and how much less, and with true zero origin
• Examples of variables measured: weight, height blood sugar, temperature measured in
thermodynamic or Kelvin scale.
Descriptive and Inferential
Statistics
Dr. Omosivie Maduka
Consultant Public Health Physician
Senior Lecturer,
Department of Preventive and Social Medicine
University of Port Harcourt
Descriptive and Inferential
Statistics
• Descriptive statistics is made up of various techniques used
to summarize the information contained in a set of data.
• Thus, descriptive statistics, as the name implies deals with
description of data.
• Inferential statistics is made up of various techniques used
to provide information about parameter values based on
observations made on the values of statistics.
Population

(Parameter)

μ=?

Inferential Fig 1: The relationship between population and sample,

parameter and statistic, and inferential and descriptive
statistics
statistics

= 73kg

(Statistic)

Sample Descriptive statistics

The relationship between descriptive and inferential statistics

Descriptive
Probability Inferential
statistics
statistics
Reminder: There are two
types of data
Each type of data behaves differently with implications for
distribution and summarization

• Qualitative data

• Quantitative data
The Behavioural Characteristics of Data
Sets that encourages summarization

• Values follow some form of distribution and this

can be presented numerically, in tabular form and
in graphical form.

• Values tend to cluster around a central point

• Values also exhibit variability from each other and

from the central point
Understanding the concept of
distribution

A distribution is the pattern observed in a

collection of values for a variable.
• Frequency
• Relative frequency
• Cumulative frequency
• Cumulative relative frequency
Parity Frequency Relative Cumulative Cumulative
Frequency (%) Frequency Relative
Frequency (%)
0 15 30 15 30
1 11 22 26 52
2 8 16 34 68
3 6 12 40 80
4 5 10 45 90
≥5 5 10 50 100
Total 50 100
Distribution of parity among women with ovarian cancer
Displaying frequency
distributions

1. Graphical form

2. Tabular form
Comparing Qualitative and Quantitative
Data for Distributions
Frequency Distribution of Qualitative Frequency Distribution of Quantitative
Data Data
Graphically Graphically
- Frequency polygon
- Bar chart
- Histogram
- Pie chart Tabular
Tabular - Frequency distribution
- Frequency distribution - Relative Frequency distribution
- Cumulative frequency distribution
- Relative Frequency distribution
- Cumulative relative frequency
distribution
Bar chart
Pie chart

Distrubution of males and females

100, 40%
Male
Female
150, 60%
Histogram
Summarizing quantitative data

• Measures of central tendency

• Measures of dispersion (variability)

• Measures of relative position: quantiles

• Measures of distribution shape: skewness & kurtosis

Measures of
Central Tendency
Summarizing qualitative data
Numerator related to
denominator

Is numerator
included in
denominator?

No Yes

Is time included
in denominator?

No Yes

Measure: Ratio Proportion Rate

Example
: Maternal mortality ratio Prevalence Incidence rate
Summarizing Quantitative Data
• Mean: Many Entries, Average Number

• Median – number in the middle

• Mode – most frequently occurring

Mean
Advantages of mean Disadvantages of mean

• Simple to calculate • Takes longer time to be calculated

• Used for further statistical • Not easily understood by non-
calculations statisticians
• All the values in the data are taken • Does not always represent actual
into consideration, therefore more scores belonging to some members
representative than mode and of the population
median
• Relatively reliable because it does not • When used with discrete variables,
vary much when repeated samples it often yields unrealistic values
are taken from the same population. • Affected by extreme values
i.e. smaller sample error (outliers)
Median
Advantages median Disadvantage of median

• It is easy to calculate • Not representative because not

• It often represents actual score all data are considered in the
belonging to some members of calculation
the population
• Not affected by extreme values
(outliers)
• Easily understood by many
people
Mode
Advantages of mode Disadvantages of mode
• It is easy to calculate • It represents a misleading
• It often represents actual score picture of a distribution that
belonging to some members of does not have a regular shape
the population • Not representative because not
• Not affected by extreme values all data are considered in the
(outliers) calculation
• For qualitative data, only mode • It may not exist
can be meaningfully employed • It may not be unique e.g.
• multimodal distribution
Measures of
Dispersion
Measures of
dispersion
(variability)
• Range
• Deviation
• Variance
• Standard deviation
• Inter-quartile range
• Coefficient of variation
Range
• This is the simplest
measure of variation.

• It is the difference between

the largest and smallest
observations in a sample
hence unreliable.

• It is not sensitive to other

characteristics of data
variability.

• It tends to be larger with

sample size.
Interquartile
range

• This is the difference

between the upper and
lower quartiles.
• It is not sensitive to
extreme outlying
observations.
• It increases as variability
increases.
Deviation
• The deviation of the ith observation xi from the sample mean is the
difference between them.
• For any sample, the sum of all deviations about the mean =,
equals 0.
• For this reason, summary measures of variation use either absolute
values or squares of the deviations.
• Variance is the average of
the squared deviations of
the data from the mean.
The variance of n
Variance observations is:
• The units of measurement
are the squares of those of
the original data. Variance
is difficult to interpret.
• Note that in calculation
variance, n – 1 rather than
n is used. The n – 1 is
called the degree of
freedom.
Standard • This is the positive
Deviation square root of the
variance. Sample
standard deviation is
denoted by s.
Coefficient of variation

This expresses s as a percentage of the sample mean.

Independent of the units of observation

(CV) = s/ (100)
How to compute mean, variance and standard
deviation
2
x (x – ) (x – )
2
y (y – y) (y – y)
3 -2 4 4 -4 16
4 -1 1 6 -2 4
5 0 0 8 0 0
6 1 1 10 2 4
7 2 4 12 4 16
2
Σx = 25 Σ(x – ) = 0
2
Σ(x – ) = 10 Σy = 40 Σ(y – y) = 0 Σ(y – y) = 40
= 25/5 = 5, sx2 = 10/4 = 2.5, sx = √2.5 = 1.6 = 40/5 = 8, sy2 = 40/4 = 10, sy = √10 = 3.2

(Ebook PDF) Business Statistics: A First Course, Global Edition 8Th Edition
No ratings yet
(Ebook PDF) Business Statistics: A First Course, Global Edition 8Th Edition
49 pages
Oscar Winners' Age and Birth Analysis
No ratings yet
Oscar Winners' Age and Birth Analysis
6 pages
Data Analysis & Sampling Guide
100% (1)
Data Analysis & Sampling Guide
20 pages
Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
100% (1)
Basics of Statistics: Definition: Science of Collection, Presentation, Analysis, and Reasonable
33 pages
Chapter One Definition of Statistics
No ratings yet
Chapter One Definition of Statistics
17 pages
Mean, Mode, Median, and Standard Deviation
No ratings yet
Mean, Mode, Median, and Standard Deviation
17 pages
18BCE10291 - Outliers Assignment
No ratings yet
18BCE10291 - Outliers Assignment
10 pages
Quantiles for Data Analysis
No ratings yet
Quantiles for Data Analysis
8 pages
10 Oran Oranti
No ratings yet
10 Oran Oranti
10 pages
Introduction To Statistics
100% (1)
Introduction To Statistics
60 pages
Lecture 1
No ratings yet
Lecture 1
38 pages
Basic Concepts in Statistics
No ratings yet
Basic Concepts in Statistics
42 pages
Biostatistics Final Project
No ratings yet
Biostatistics Final Project
9 pages
Introduction To Statistics and SPSS
100% (1)
Introduction To Statistics and SPSS
110 pages
Basic Statistics (3685) PPT - Lecture On 20-01-2019
100% (1)
Basic Statistics (3685) PPT - Lecture On 20-01-2019
64 pages
Week One: Introduction To Quantitative Methods MBA 2013
No ratings yet
Week One: Introduction To Quantitative Methods MBA 2013
49 pages
1.3 Measure of Variability and Position
No ratings yet
1.3 Measure of Variability and Position
47 pages
Unit II: Basic Data Analytic Methods
No ratings yet
Unit II: Basic Data Analytic Methods
38 pages
Introduction To Biostatistics: Dr. M. H. Rahbar
No ratings yet
Introduction To Biostatistics: Dr. M. H. Rahbar
35 pages
Ashish Maths 10 B Statistics
No ratings yet
Ashish Maths 10 B Statistics
14 pages
Central Tendency Exercises (Mean)
No ratings yet
Central Tendency Exercises (Mean)
2 pages
Understanding Skewness in Statistics
No ratings yet
Understanding Skewness in Statistics
13 pages
Activity No. 10
No ratings yet
Activity No. 10
6 pages
Chapter 1 Introduction To Statistics
No ratings yet
Chapter 1 Introduction To Statistics
28 pages
Mean, Variance and Standard Deviations
No ratings yet
Mean, Variance and Standard Deviations
23 pages
WK 1b Biostat
No ratings yet
WK 1b Biostat
38 pages
Levine Bsfc7ge Ch12 1
No ratings yet
Levine Bsfc7ge Ch12 1
93 pages
Powerpoint Presentation On: "Frequency
100% (2)
Powerpoint Presentation On: "Frequency
36 pages
Chapter1 Statistics
No ratings yet
Chapter1 Statistics
17 pages
Basic Concepts in Biostatistics-2
No ratings yet
Basic Concepts in Biostatistics-2
35 pages
DR - Nesrin H. Darwesh University of Duhok-College of Dentistry
No ratings yet
DR - Nesrin H. Darwesh University of Duhok-College of Dentistry
15 pages
Class 1 - Descripritive Statistics
No ratings yet
Class 1 - Descripritive Statistics
46 pages
W1 Lesson 1 - Basic Statistical Concepts - Module PDF
No ratings yet
W1 Lesson 1 - Basic Statistical Concepts - Module PDF
11 pages
1 Biostatistics LECTURE 1
100% (1)
1 Biostatistics LECTURE 1
64 pages
Hand-Out in Statistics Statistics
No ratings yet
Hand-Out in Statistics Statistics
4 pages
Bba2002 2022jan
No ratings yet
Bba2002 2022jan
4 pages
Statistics Intro
No ratings yet
Statistics Intro
17 pages
Regression Analysis Summary
No ratings yet
Regression Analysis Summary
6 pages
احصاء حيوي
No ratings yet
احصاء حيوي
37 pages
Introduction to Statistics and Biostatistics
100% (1)
Introduction to Statistics and Biostatistics
87 pages
1.5 - S1 Chapter 2
No ratings yet
1.5 - S1 Chapter 2
25 pages
1-Introduction To Statistics
100% (1)
1-Introduction To Statistics
19 pages
Statistics
No ratings yet
Statistics
47 pages
Emdad Rahman
No ratings yet
Emdad Rahman
85 pages
Stats Guide for Engineering Students
No ratings yet
Stats Guide for Engineering Students
63 pages
Biostatistics for Students
No ratings yet
Biostatistics for Students
101 pages
Basic Statistical Concepts Module
No ratings yet
Basic Statistical Concepts Module
12 pages
1.9 Partition Values
No ratings yet
1.9 Partition Values
20 pages
Edexcel Gcse Statistics Coursework Example
100% (2)
Edexcel Gcse Statistics Coursework Example
6 pages
Basic Concepts in Biostatistics-1
No ratings yet
Basic Concepts in Biostatistics-1
40 pages
Lecture No 01 Statistics 13-2-24
No ratings yet
Lecture No 01 Statistics 13-2-24
34 pages
CHP1 Mat161
No ratings yet
CHP1 Mat161
4 pages
Mixed Models Day 3 - 2023
No ratings yet
Mixed Models Day 3 - 2023
49 pages
Q3 - WK 2.1 Introduction To Statistics
No ratings yet
Q3 - WK 2.1 Introduction To Statistics
31 pages
Introduction to Biostatistics Basics
No ratings yet
Introduction to Biostatistics Basics
29 pages
6.descriptve PPHD
No ratings yet
6.descriptve PPHD
70 pages
Understandingstatisticsinresearch 151026064600 Lva1 App6892
No ratings yet
Understandingstatisticsinresearch 151026064600 Lva1 App6892
37 pages
Statistics
No ratings yet
Statistics
11 pages
Ns Statistics 2022
No ratings yet
Ns Statistics 2022
70 pages
ML - Lab-3.ipynb - Colab
No ratings yet
ML - Lab-3.ipynb - Colab
2 pages
Free Pine Script Indicators
No ratings yet
Free Pine Script Indicators
15 pages
Formula Sheet For Midterm - F24
No ratings yet
Formula Sheet For Midterm - F24
3 pages
Biostatistics Notes-Numbered
No ratings yet
Biostatistics Notes-Numbered
21 pages
Lesson 1 Intro To Statistics
No ratings yet
Lesson 1 Intro To Statistics
3 pages
1 - 2 Biostatistics
No ratings yet
1 - 2 Biostatistics
24 pages
Unit 1 - Examining Distributions
No ratings yet
Unit 1 - Examining Distributions
80 pages
VAMPIRE 5E NPC Stat Breakdown
No ratings yet
VAMPIRE 5E NPC Stat Breakdown
4 pages
Lecture 1 - Online - INTRODUCTION TO BIOSTATISTICS (Compatibility Mode)
100% (1)
Lecture 1 - Online - INTRODUCTION TO BIOSTATISTICS (Compatibility Mode)
28 pages
Eco 7
No ratings yet
Eco 7
7 pages
Intoduction To Biostatistics
No ratings yet
Intoduction To Biostatistics
87 pages
Basics of Statistics
No ratings yet
Basics of Statistics
40 pages
Introduction To Biostistics
No ratings yet
Introduction To Biostistics
5 pages
SPROB Polished
No ratings yet
SPROB Polished
8 pages
Concrete Strength - Ipynb - Colab
No ratings yet
Concrete Strength - Ipynb - Colab
7 pages
Educational-Statistics Basic-Terms Sampling Data-Gathering
No ratings yet
Educational-Statistics Basic-Terms Sampling Data-Gathering
21 pages
Chap 7 - Two Sample Test
No ratings yet
Chap 7 - Two Sample Test
59 pages
Statistics Theory
No ratings yet
Statistics Theory
3 pages
ST Topic 1
No ratings yet
ST Topic 1
164 pages
Educational Statistics Notes
No ratings yet
Educational Statistics Notes
32 pages
Electronic Statistics and Probabilities
No ratings yet
Electronic Statistics and Probabilities
241 pages
Study Notes MPT
No ratings yet
Study Notes MPT
32 pages
Introduction To Biostatistics
No ratings yet
Introduction To Biostatistics
59 pages
PHS202 Biostatistics
No ratings yet
PHS202 Biostatistics
26 pages
Introductory Statistics For The Behavioral Sciences, 7th Edition, 7th Edition ISBN 0470907762, 9780470907764 High-Quality Download
No ratings yet
Introductory Statistics For The Behavioral Sciences, 7th Edition, 7th Edition ISBN 0470907762, 9780470907764 High-Quality Download
14 pages
AYT3
No ratings yet
AYT3
2 pages
Introduction To Biostatistics 1.Zp256050
No ratings yet
Introduction To Biostatistics 1.Zp256050
68 pages
NURS 201 - Week 3 Epidemiology and Research Terms Part II
No ratings yet
NURS 201 - Week 3 Epidemiology and Research Terms Part II
25 pages
Biostatistics 1
No ratings yet
Biostatistics 1
45 pages

Basics and Descriptive Statistics

Uploaded by

Basics and Descriptive Statistics

Uploaded by

BASIC CONCEPTS IN

Data is information with unit of analysis, variable and value.

Unit of Analysis is the subject of interest

Variable is the characteristic of the subject of interest, and has

Value is the reading, recording or measurement obtained

Most times data and variable are used interchangeably, but

The distinction between population and sample

This is because more often than not, the researcher is

He/she can still conduct the research by observing a

Summary of characteristic Parameter Statistic

Inferential Fig 1: The relationship between population and sample,

Sample Descriptive statistics

• Values follow some form of distribution and this

• Values tend to cluster around a central point

• Values also exhibit variability from each other and

A distribution is the pattern observed in a

Distrubution of males and females

• Measures of central tendency

• Measures of dispersion (variability)

• Measures of relative position: quantiles

• Measures of distribution shape: skewness & kurtosis

Measure: Ratio Proportion Rate

• Median – number in the middle

• Mode – most frequently occurring

• Simple to calculate • Takes longer time to be calculated

• It is easy to calculate • Not representative because not

• It is the difference between

• It is not sensitive to other

• It tends to be larger with

• This is the difference

This expresses s as a percentage of the sample mean.

Independent of the units of observation

You might also like