0% found this document useful (0 votes)

3 views99 pages

Introduction To Statistics

The document provides an introduction to statistics, covering essential concepts such as types of statistics, levels of measurement, and data classification. It emphasizes the importance of statistics in decision-making across various fields, including finance, marketing, and operations management. Additionally, it outlines methods for data collection, sampling techniques, and data presentation through various graphical representations.

Uploaded by

kumar3727

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views99 pages

Introduction To Statistics

Uploaded by

kumar3727

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 99

Introduction To Statistics

Pravin kumar
Conditions for successful course
completion
Lab Work Evaluation
Recommended literature:

Anderson, D.R. - Sweeney, D.J. -

Williams, T.A.: Statistics for Business
and Economics. South-Western Pub.,
2005, 320 p., ISBN 978-032-422-486-3
Why study statistics?

1. Data are everywhere

2. Statistical techniques are used to make
many decisions that affect our lives
Applications of statistical
concepts in the business world
Finance – correlation and regression, index
numbers, time series analysis
Marketing – hypothesis testing, chi-square tests,
nonparametric statistics
Personel – hypothesis testing, chi-square tests,
nonparametric tests
Operating management – hypothesis testing,
estimation, analysis of variance, time series
analysis
Statistics
The science of collectiong, organizing, presenting,
analyzing, and interpreting data to assist in making
more effective decisions
Statistical analysis – used to manipulate
summarize, and investigate data, so that useful
decision-making information results.
Types of statistics
Descriptive statistics – Methods of organizing,
summarizing, and presenting data in an informative
way
Inferential statistics – The methods used to
determine something about a population on the basis
of a sample
Population –The entire set of individuals or
objects of interest or the measurements obtained
from all individuals or objects of interest
Sample – A portion, or part, of the population of
interest
Data and Statistics
Data consists of information coming from
observations, counts, measurements, or responses.

Statistics is the science of collecting,

organizing, analyzing, and interpreting data
in order to make decisions.
A population is the collection of all
outcomes, responses, measurement, or
counts that are of interest.
A sample is a subset of a population.
Populations & Samples
Example:
In a recent survey, 250 college students at
Union College were asked if they smoked
cigarettes regularly. 35 of the students
said yes. Identify the population and the
sample.
Responses of all students
at Union College
(population)

Responses of
students in survey
(sample)
Parameters & Statistics
A parameter is a numerical description of a
population characteristic.

A statistic is a numerical description of a

sample characteristic.

Parameter Population

Statistic Sample
Parameters & Statistics
Example:
Decide whether the numerical value describes a population parameter or a sample statistic.

a.) A recent survey of a sample of 450

college students reported that the
average weekly income for students is
$325.
Because the average of $325 is
based on a sample, this is a sample
statistic.
b.) The average weekly income for all
students is $405.
Because the average of $405 is based
on a population, this is a population
parameter.
Branches of Statistics
The study of statistics has two major
branches: descriptive statistics and
inferential statistics.
Statistics

Descriptive Inferential
statistics statistics
Involves the Involves using a
organization, sample to draw
summarization, conclusions
and display of about a
data. population.
Descriptive and Inferential
Example:
Statistics
In a recent study, volunteers who had less than 6 hours of sleep were four times more likely to answer incorrectly on a science test than were participants who
had at least 8 hours of sleep. Decide which part is the descriptive statistic and what conclusion might be drawn using inferential statistics.

The statement “four times more

likely to answer incorrectly” is a
descriptive statistic. An inference
drawn from the sample is that all
individuals sleeping less than 6
hours are more likely to answer
science question incorrectly than
individuals who sleep at least 8
Data
Classification
Types of Data
Data sets can consist of two types of data:
qualitative data and quantitative data.
Data

Qualitative Quantitative
Data Data
Consists of Consists of
attributes, numerical
labels, or non- measurements or
numerical counts.
entries.
Levels of Measurement
The level of measurement determines which
statistical calculations are meaningful. The
four levels of measurement are: nominal,
ordinal, interval, and ratio.
Nominal
Lowest to
Levels of
Measurement
Ordinal highest

Interval
Ratio
Nominal Level of
Measurement
Data at the nominal level of measurement
are qualitative only.
Nominal
Calculated using names, labels, or
Levels qualities. No mathematical
of computations can be made at this
Measurement level.

Colors in Names of students Textbooks you are

the US in your class using this
flag semester
Ordinal Level of
Measurement
Data at the ordinal level of measurement
are qualitative or quantitative.

Levels Ordinal
of Arranged in order, but differences
Measurement between data entries are not
meaningful.

Class standings: Numbers on the Top 50 songs

freshman, back of each played on the radio
sophomore, junior, player’s shirt
senior
Interval Level of
Measurement
Data at the interval level of measurement
are quantitative. A zero entry simply
represents a position on a scale; the entry is
not an inherent zero.
Levels Interval
of Arranged in order, the differences between
data entries can be calculated.
Measurement
Temperatures Years on a timeline Atlanta Braves
World Series
victories
Ratio Level of Measurement
Data at the ratio level of measurement are
similar to the interval level, but a zero entry is
meaningful.
A ratio of two data values can be formed so
Levels one data value can be expressed as a ratio.
of
Measurement
Ratio

Ages Grade point Weights

averages
Summary of Levels of
Measurement
Arran Determine
Put
Level of ge Subtract if one data
data in
measurem data data value is a
categor
ent in values multiple of
ies
order another
Nominal Yes No No No
Ordinal Yes Yes No No
Interval Yes Yes Yes No
Ratio Yes Yes Yes Yes
Population and Sample
Inferential Statistics

Estimation
e.g., Estimate the
population mean
weight using the
sample mean weight
Hypothesis testing
e.g., Test the claim
that the population
Inference
mean is is
weight the
70process
kg of drawing
conclusions or making decisions
about a population based on sample
Sampling
a sample should have the same characteristics
as the population it is representing.
Sampling can be:
with replacement: a member of the population
may be chosen more than once (picking the candy
from the bowl)
 without replacement: a member of the
population may be chosen only once (lottery
ticket)
Sampling methods
Sampling methods can be:
random (each member of the population has an equal
chance of being selected)
nonrandom

The actual process of sampling causes sampling

errors. For example, the sample may not be large
enough or representative of the population. Factors not
related to the sampling process cause nonsampling
errors. A defective counting device can cause a
nonsampling error.
Random sampling methods
simple random sample (each sample of the same
size has an equal chance of being selected)
stratified sample (divide the population into groups
called strata and then take a sample from each
stratum)
cluster sample (divide the population into strata and
then randomly select some of the strata. All the
members from these strata are in the cluster sample.)
systematic sample (randomly select a starting point
and take every n-th piece of data from a listing of the
population)
Descriptive Statistics

Collect data
e.g., Survey

Present data
e.g., Tables and graphs

Summarize data

e.g., Sample mean = X i
n
Statistical data
The collection of data that are relevant to the
problem being studied is commonly the most
difficult, expensive, and time-consuming part of
the entire research project.
Statistical data are usually obtained by counting
or measuring items.
Primary data are collected specifically for the
analysis desired
Secondary data have already been compiled
and are available for statistical analysis
A variable is an item of interest that can take on
many different numerical values.
A constant has a fixed numerical value.
Data
Statistical data are usually obtained by counting or
measuring items. Most data can be put into the
following categories:
Qualitative - data are measurements that each fail
into one of several categories. (hair color, ethnic
groups and other attributes of the population)
quantitative - data are observations that are
measured on a numerical scale (distance traveled to
college, number of children in a family, etc.)
Qualitative data
Qualitative data are generally described by words or
letters. They are not as widely used as quantitative data
because many numerical techniques do not apply to the
qualitative data. For example, it does not make sense to
find an average hair color or blood type.
Qualitative data can be separated into two subgroups:
dichotomic (if it takes the form of a word with two
options (gender - male or female)
polynomic (if it takes the form of a word with more
than two options (education - primary school,
secondary school and university).
Quantitative data
Quantitative data are always numbers and are the
result of counting or measuring attributes of a
population.
Quantitative data can be separated into two
subgroups:
discrete (if it is the result of counting (the number
of students of a given ethnic group in a class, the
number of books on a shelf, ...)
continuous (if it is the result of measuring
(distance traveled, weight of luggage, …)
Types of variables
Variables

Qualitative Quantitative

Dichotomic Polynomic Discrete Continuous

Children in Amount of
Gender, marital Brand of Pc, family, income tax
status hair color Strokes on a paid, weight
golf hole of a student
measurement:
Nominal – consist of categories in each of which the
number of respective observations is recorded. The
categories are in no logical order and have no
particular relationship. The categories are said to be
mutually exclusive since an individual, object, or
measurement can be included in only one of them.
Ordinal – contain more information. Consists of
distinct categories in which order is implied. Values in
one category are larger or smaller than values in other
categories (e.g. rating-excelent, good, fair, poor)
Interval – is a set of numerical measurements in
which the distance between numbers is of a known,
constant size.
Ratio – consists of numerical measurements where
the distance between numbers is of a known, constant
size, in addition, there is a nonarbitrary zero point.
Data presentation
Numerical presentation of
qualitative data
pivot table (qualitative dichotomic statistical
attributes)
contingency table (qualitative statistical attributes
from which at least one of them is polynomic)

You should know how to convert absolute

values to relative ones (%).
Frequency distributions –
numerical presentation of
quantitative data
Frequency distribution – shows the frequency, or
number of occurences, in each of several
categories. Frequency distributions are used to
summarize large volumes of data values.
When the raw data are measured on a qunatitative
scale, either interval or ratio, categories or classes
must be designed for the data values before a
frequency distribution can be formulated.
Steps for constructing a
frequency distribution
1. Determine the number of classes m n
 max  min 
2. Determine the size of each class h
m
3. Determine the starting point for the first class
4. Tally the number of values that occur in each
class
5. Prepare a table of the distribution using actual
counts and/ or percentages (relative frequencies)
Frequency table
absolute frequency “ni” (Data TabData
AnalysisHistogram)
relative frequency “fi”
Cumulative frequency distribution shows the total
number of occurrences that lie above or below
certain key values.
cumulative frequency “Ni”
cumulative relative frequency “Fi”
Charts and graphs
Frequency distributions are good ways to present
the essential aspects of data collections in concise
and understable terms
Pictures are always more effective in displaying
large data collections
Histogram
Frequently used to graphically present interval and
ratio data
Is often used for interval and ratio data
The adjacent bars indicate that a numerical range is
being summarized by indicating the frequencies in
arbitrarily chosen classes
Histogram
Frequency polygon
Another common method for graphically
presenting interval and ratio data
To construct a frequency polygon mark the
frequencies on the vertical axis and the
values of the variable being measured on
the horizontal axis, as with the histogram.
If the purpose of presenting is comparation
with other distributions, the frequency
polygon provides a good summary of the
data
Frequency Polygon
Ogive
A graph of a cumulative frequency distribution
Ogive is used when one wants to determine how
many observations lie above or below a certain
value in a distribution.
First cumulative frequency distribution is
constructed
Cumulative frequencies are plotted at the upper
class limit of each category
Ogive can also be constructed for a relative
frequency distribution.
Ogive
Pie Chart
The pie chart is an effective way of displaying the
percentage breakdown of data by category.
Useful if the relative sizes of the data components
are to be emphasized
Pie charts also provide an effective way of
presenting ratio- or interval-scaled data after they
have been organized into categories
Pie Chart
Bar chart
Another common method for graphically
presenting nominal and ordinal scaled data
One bar is used to represent the frequency for each
category
The bars are usually positioned vertically with
their bases located on the horizontal axis of the
graph
The bars are separated, and this is why such a
graph is frequently used for nominal and ordinal
data – the separation emphasize the plotting of
frequencies for distinct categories
Bar Chart
Time Series Graph
The time series graph is a
graph of data that have been
measured over time.
The horizontal axis of this
graph represents time periods
and the vertical axis shows the
numerical values corresponding
to these time periods
Time Series Graph
Mean,
Median,
Mode,
Skewness

STAT-702 Unit # 1
No ratings yet
STAT-702 Unit # 1
84 pages
Decision Making Notes (Complete)
No ratings yet
Decision Making Notes (Complete)
36 pages
Statistics Course Overview
100% (3)
Statistics Course Overview
43 pages
Fundamentals of Engineering Economics by Pravin Kumar
75% (4)
Fundamentals of Engineering Economics by Pravin Kumar
1 page
Intro to Statistics for Beginners
100% (1)
Intro to Statistics for Beginners
23 pages
Chapter 01
No ratings yet
Chapter 01
13 pages
Nature of Statistics
100% (1)
Nature of Statistics
7 pages
Probability and Statistics: Rusdianto Roestam PHD
No ratings yet
Probability and Statistics: Rusdianto Roestam PHD
28 pages
Math in A Modern World
No ratings yet
Math in A Modern World
4 pages
Chapter 1
No ratings yet
Chapter 1
27 pages
Definition of Statistics
No ratings yet
Definition of Statistics
4 pages
STAT. Lec.1
No ratings yet
STAT. Lec.1
30 pages
Nature of Statistics
No ratings yet
Nature of Statistics
7 pages
Chapter 1 Introduction To Statistics
No ratings yet
Chapter 1 Introduction To Statistics
28 pages
Stat195 Handout (Rev)
50% (2)
Stat195 Handout (Rev)
101 pages
Basic Concepts 2
No ratings yet
Basic Concepts 2
18 pages
Lecture1 2 3
No ratings yet
Lecture1 2 3
86 pages
STAT110 Biostatistics
No ratings yet
STAT110 Biostatistics
21 pages
Short Answer Questions Research Methods: Psychology
No ratings yet
Short Answer Questions Research Methods: Psychology
21 pages
1data Management Mamw 100
100% (1)
1data Management Mamw 100
84 pages
Chapter 02
50% (2)
Chapter 02
7 pages
Fundamentals of Engineering Economics3
No ratings yet
Fundamentals of Engineering Economics3
22 pages
Fundamentals of Engineering Economics3
No ratings yet
Fundamentals of Engineering Economics3
22 pages
Chapter 1 An Overview of Statistics
No ratings yet
Chapter 1 An Overview of Statistics
4 pages
MTPDF1 - Introduction To Statistics
No ratings yet
MTPDF1 - Introduction To Statistics
106 pages
2 Weighing The Right Way en
No ratings yet
2 Weighing The Right Way en
3 pages
MMW GE 4 Week 10 PPT 23 24
No ratings yet
MMW GE 4 Week 10 PPT 23 24
23 pages
Statistics Analysis With Software Application
No ratings yet
Statistics Analysis With Software Application
22 pages
Inferential Statistics Course
No ratings yet
Inferential Statistics Course
46 pages
Statistics for Decision Making
No ratings yet
Statistics for Decision Making
4 pages
1 - Basic Concepts
No ratings yet
1 - Basic Concepts
71 pages
Topic 1 ELEMENTARY STATISTICS
No ratings yet
Topic 1 ELEMENTARY STATISTICS
29 pages
Research Methodology Course Material
No ratings yet
Research Methodology Course Material
30 pages
'Inductive and Deductive Reasning' PDF
No ratings yet
'Inductive and Deductive Reasning' PDF
9 pages
Lesson 1 Basic Concepts of Statistics
No ratings yet
Lesson 1 Basic Concepts of Statistics
9 pages
Lecture No 01 Statistics 13-2-24
No ratings yet
Lecture No 01 Statistics 13-2-24
34 pages
Stats Bio Supp. 1
No ratings yet
Stats Bio Supp. 1
11 pages
Skripsi Revisi Full Khodim
No ratings yet
Skripsi Revisi Full Khodim
70 pages
Lecture 1 Data Overview and Introduction To SPSS VJU
No ratings yet
Lecture 1 Data Overview and Introduction To SPSS VJU
49 pages
Statistik 1
No ratings yet
Statistik 1
17 pages
Statistics and Probability - Midterm Reviewer
No ratings yet
Statistics and Probability - Midterm Reviewer
12 pages
Unit 5: Test of Significance/Hypothesis Testing (Topics 20, 22, 23)
No ratings yet
Unit 5: Test of Significance/Hypothesis Testing (Topics 20, 22, 23)
24 pages
Faculty Development & Productivity
No ratings yet
Faculty Development & Productivity
52 pages
Through The Looking Glass John Cage and Avant-Garde Film (Richard H. Brown JR) (Z-Library)
No ratings yet
Through The Looking Glass John Cage and Avant-Garde Film (Richard H. Brown JR) (Z-Library)
257 pages
STA132 Complete Note
No ratings yet
STA132 Complete Note
110 pages
Introduction Book 1
No ratings yet
Introduction Book 1
41 pages
Intro To Biostatistics Lecture BSMLS 3-A&B
No ratings yet
Intro To Biostatistics Lecture BSMLS 3-A&B
74 pages
Statistics Course
No ratings yet
Statistics Course
75 pages
Chapter 1 The Nature of Probability and Statistics Updated Spring 2023-2024
No ratings yet
Chapter 1 The Nature of Probability and Statistics Updated Spring 2023-2024
38 pages
1 Pengantar Statistika
No ratings yet
1 Pengantar Statistika
22 pages
Intro 123243 Ewqs 1
No ratings yet
Intro 123243 Ewqs 1
37 pages
Deadlock Detection & Avoidance Algorithms
No ratings yet
Deadlock Detection & Avoidance Algorithms
20 pages
SocPsy Textbook Notes
No ratings yet
SocPsy Textbook Notes
6 pages
Intro To Statistics LECTURE 1
No ratings yet
Intro To Statistics LECTURE 1
28 pages
Statistics - Unit1 PDF
No ratings yet
Statistics - Unit1 PDF
94 pages
Business Statistics Overview
No ratings yet
Business Statistics Overview
94 pages
Module 3 Problem Solving and Reasoning
No ratings yet
Module 3 Problem Solving and Reasoning
16 pages
Business Statistics: A Decision-Making Approach: The Where, Why, and How of Data Collection
No ratings yet
Business Statistics: A Decision-Making Approach: The Where, Why, and How of Data Collection
129 pages
Math As A Tool Data Management Introduction and Central Tendency
No ratings yet
Math As A Tool Data Management Introduction and Central Tendency
12 pages
Note For Int To Statistics
No ratings yet
Note For Int To Statistics
24 pages
Dissertation Writing Course Syllabus
100% (2)
Dissertation Writing Course Syllabus
9 pages
Module 1.1 - Definition of Statistical Concepts and Principles
No ratings yet
Module 1.1 - Definition of Statistical Concepts and Principles
17 pages
Lecture 01 Introduction To Statistics PPT 06022025 095924am
No ratings yet
Lecture 01 Introduction To Statistics PPT 06022025 095924am
40 pages
Lecture-1 Introduction To Statistical Theory
No ratings yet
Lecture-1 Introduction To Statistical Theory
83 pages
Ghon Stat Chapter1
No ratings yet
Ghon Stat Chapter1
39 pages
Refugee Entrepreneurship Development
No ratings yet
Refugee Entrepreneurship Development
25 pages
Statistics Lesson 1
No ratings yet
Statistics Lesson 1
111 pages
Collaborative Foresight Pre Print
No ratings yet
Collaborative Foresight Pre Print
20 pages
A Proposal On Understanding Brand Loyalty of Customers On Smartphones
No ratings yet
A Proposal On Understanding Brand Loyalty of Customers On Smartphones
4 pages
Lesson 1
No ratings yet
Lesson 1
32 pages
MATH2203 Statistics I - Week 1
No ratings yet
MATH2203 Statistics I - Week 1
27 pages
Research As A High School Student
No ratings yet
Research As A High School Student
1 page
Sociology Notes Final
No ratings yet
Sociology Notes Final
63 pages
My Research
No ratings yet
My Research
21 pages
ISC Class 11 Sociology Syllabus 2023 24
No ratings yet
ISC Class 11 Sociology Syllabus 2023 24
5 pages
Statistical Analysis (Lecture 1)
No ratings yet
Statistical Analysis (Lecture 1)
40 pages
List
No ratings yet
List
9 pages
Munication Skill-Sm 2 & 3 - 1729163232000
No ratings yet
Munication Skill-Sm 2 & 3 - 1729163232000
121 pages
Inbound 6102089472106320430
No ratings yet
Inbound 6102089472106320430
41 pages
Chapter 5 Research Gap Research Question and Research Design - Edited With Watermark
No ratings yet
Chapter 5 Research Gap Research Question and Research Design - Edited With Watermark
15 pages
DATA MANAGEMENT STATISTICSMeasures of Central Tendency Measures
No ratings yet
DATA MANAGEMENT STATISTICSMeasures of Central Tendency Measures
60 pages
Week 1 Chapter 1 - Introduction To Statistics and Sata Collection
No ratings yet
Week 1 Chapter 1 - Introduction To Statistics and Sata Collection
28 pages
Inbound 4581472665919544689
No ratings yet
Inbound 4581472665919544689
38 pages
Session 01
No ratings yet
Session 01
16 pages
Session 9 Supply Chain Integration
No ratings yet
Session 9 Supply Chain Integration
27 pages
Session 9 Supply Chain Integration
No ratings yet
Session 9 Supply Chain Integration
27 pages
Session 10 Sourcing Decisions in A Supply Chain
No ratings yet
Session 10 Sourcing Decisions in A Supply Chain
62 pages
Session 12 E-Commerce
No ratings yet
Session 12 E-Commerce
27 pages
Poisson and Exponential Distribution
No ratings yet
Poisson and Exponential Distribution
55 pages
Chapter 10 PPC
No ratings yet
Chapter 10 PPC
14 pages
Session 11 Information Risks Management in Supply Chains
No ratings yet
Session 11 Information Risks Management in Supply Chains
39 pages
Chapter 11 Work Study
No ratings yet
Chapter 11 Work Study
86 pages
Chapter 12 Reliability and Maintenance Engg
No ratings yet
Chapter 12 Reliability and Maintenance Engg
43 pages
STS Notes
No ratings yet
STS Notes
12 pages
The Insider's Edge To Real Estate Investing 1st Edition James Nelson Instant Download
No ratings yet
The Insider's Edge To Real Estate Investing 1st Edition James Nelson Instant Download
103 pages
Construction Project Scheduling and Control - Ebook PDF Version Download
No ratings yet
Construction Project Scheduling and Control - Ebook PDF Version Download
154 pages
(Ebook PDF) International Relations 2nd Edition by Eric B. Shiraev Download
100% (1)
(Ebook PDF) International Relations 2nd Edition by Eric B. Shiraev Download
98 pages
Spinoza's Ethics Michael Lebuffe Instant Download
No ratings yet
Spinoza's Ethics Michael Lebuffe Instant Download
82 pages
Session 5 Sport Obermeyer
No ratings yet
Session 5 Sport Obermeyer
40 pages

Introduction To Statistics

Uploaded by

Introduction To Statistics

Uploaded by

Introduction To Statistics

Anderson, D.R. - Sweeney, D.J. -

1. Data are everywhere

Statistics is the science of collecting,

A statistic is a numerical description of a

a.) A recent survey of a sample of 450

The statement “four times more

Colors in Names of students Textbooks you are

Class standings: Numbers on the Top 50 songs

Ages Grade point Weights

The actual process of sampling causes sampling

Dichotomic Polynomic Discrete Continuous

You should know how to convert absolute

You might also like