An Introduction and Overview
Statistics are numerical representations of our data. They can be:
Descriptive statistics summarize data.
Inferential statistics are tools that indicate how
much confidence we can have when we generalize
from a sample to a population.
Statistics depend on our sampling methods:
Probability or Non-probability? (i.e. Random or
not?)
Even with probability samples, there is a
possibility that the statistics we obtain do not
accurately reflect the population.
Sampling Error
Inadequate sampling frame, low response rate,
coverage (some people in population not given a
chance of selection)
Non-Sampling Error
Problems with transcribing and coding data;
observer/instrument error; misrepresentation.
Levels of Measurement – the relationship
among the values that are assigned to a
variable and the attributes of that variable.
Nominal- naming
Ordinal- rank order (high to low but no
indication of how much higher or lower one
subject is to another)
Interval- equal intervals between values
Ratio- equal intervals AND an absolute zero
(e.g. a ruler)
Examples:
Age: under 30, 30-39, 40-49, 50-59 (ordinal)
Gender: Male, Female (nominal)
Level of Agreement: Strongly Agree, Agree,
Neutral, Disagree, Strongly Disagree (ordinal)
Percentage of the library budget spent on staff
salaries (ratio)
Descriptive objectives/questions: descriptive statistics
Comparative research objectives/hypotheses: inferential statistics
Can be applied to any measurements
(quantitative or qualitative)
Offers a summary/ overview/ description of
data. Does not explain or interpret.
Number
Frequency count
Percentage
Deciles and quartiles
Measures of central tendency (mean, median, mode)
Averages
Variability
Variance and standard deviation
Graphs
Normal curve
Mode: most frequently occurring value in a
distribution (any scale, most unstable)
Median: midpoint in the distribution below which
half of the cases reside (ordinal and above)
Mean: arithmetic average- the sum of all values in a
distribution divided by the number of cases (interval
or ratio)
Example (11 test scores)
61, 61, 72, 77, 80, 81, 82, 85, 89, 90, 92
The median is 81 (half of the scores fall above 81,
and half below)
Example (6 scores)
3, 3, 7, 10, 12, 15
With an even number of scores, the median is
half-way between the two middle scores
Sum the middle scores (7+10=17) and divide by 2
17/2= 8.5
The median is insensitive to extremes: adding an
outlier (3, 3, 7, 10, 12, 15, 200) moves the
median only from 8.5 to 10
Mean is the sum of a set of values divided by the
number of values:
Scores: 5, 6, 7, 10, 12, 15
Sum: 55
Number of scores: 6
Computation of Mean: 55/6= 9.17
Mode is the most frequently occurring value in
a set.
Best used for nominal data.
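The three averages above can be sketched with Python's standard library, using the test-score examples from this section:

```python
# Mode, median, and mean via Python's statistics module.
from statistics import mean, median, mode

scores = [61, 61, 72, 77, 80, 81, 82, 85, 89, 90, 92]  # 11 test scores
print(mode(scores))    # 61 occurs twice, more than any other score
print(median(scores))  # 81, the 6th of the 11 sorted scores
print(round(mean(scores), 2))

# Even number of scores: the median is halfway between the two middle scores.
even = [3, 3, 7, 10, 12, 15]
print(median(even))    # (7 + 10) / 2 = 8.5
```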
Skewed to the right (positive) or left (negative)
An extremely hard test that results in a lot of
low grades will be skewed to the right:
the mode is smaller than the median, which is
smaller than the mean. This relationship exists
because the mode is the point on the x-axis
under the highest point of the curve, that is,
the score with the greatest frequency. The
median is the point on the x-axis that cuts the
distribution in half, such that 50% of the area
falls on each side.
An extremely easy test will result in a lot of
high grades, and will skew to the left (negative)
The order of the measures of central tendency
would be the opposite of the positively skewed
distribution, with the mean being smaller than
the median, which is smaller than the mode.
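The ordering of the three measures in a skewed distribution can be checked numerically; the grades below are hypothetical, chosen to be positively skewed:

```python
# For a right-skewed (positively skewed) set of scores,
# mode < median < mean.
from statistics import mean, median, mode

hard_test = [1, 1, 1, 2, 2, 3, 10]  # many low scores, one high outlier
print(mode(hard_test))    # 1
print(median(hard_test))  # 2
print(mean(hard_test))    # 20 / 7, pulled upward by the outlier
assert mode(hard_test) < median(hard_test) < mean(hard_test)
```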
Variability is the differences among scores-
it shows how subjects vary:
Dispersion: extent of scatter around the “average”
Range: the difference between the highest and lowest scores in a distribution
Variance and standard deviation: spread of scores in
a distribution. The greater the scatter, the larger the
variance
Interval or ratio level data
Standard deviation: how much subjects differ
from the mean of their group
The more spread out the subjects are around
the mean, the larger the standard deviation
Sensitive to extremes or “outliers”
Allows for comparisons across variables
e.g. is there a relation between one's occupation and
one's reason for using the public library?
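The standard-deviation points above can be sketched with Python's statistics module; the two score sets are hypothetical:

```python
# Sample variance and standard deviation; a more scattered
# group yields a larger standard deviation.
from statistics import stdev, variance

scores = [5, 6, 7, 10, 12, 15]
print(variance(scores))          # sum of squared deviations / (n - 1)
print(round(stdev(scores), 2))   # square root of the variance

spread_out = [1, 2, 7, 10, 15, 20]  # same mean, more scatter
assert stdev(spread_out) > stdev(scores)
```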
Hypothesis Testing
The level of significance is the predetermined
level at which a null hypothesis is not
supported. The most common level is p < .05
p = probability
< = less than (> = greater than)
Type I error: rejecting the null hypothesis when it is really true
Type II error: failing to reject the null hypothesis when it is really false
By using inferential statistics to make decisions,
we can report the probability that we have
made a Type I error (indicated by the p value
we report)
By reporting the p value, we alert readers to
the odds that we were incorrect when we
decided to reject the null hypothesis
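The logic of comparing a p value to the .05 level can be illustrated with a simple two-tailed one-sample z-test (a sketch only; the sample scores and the population mean and standard deviation below are hypothetical, and a z-test assumes those population values are known):

```python
# A minimal significance test using only the standard library.
from math import sqrt
from statistics import NormalDist, mean

sample = [82, 85, 88, 90, 91, 84, 87, 89]  # hypothetical test scores
pop_mean, pop_sd = 80, 10                  # assumed population values

n = len(sample)
z = (mean(sample) - pop_mean) / (pop_sd / sqrt(n))
# Two-tailed p value: probability of a z at least this extreme by chance.
p = 2 * (1 - NormalDist().cdf(abs(z)))

print(round(z, 2), round(p, 4))
if p < .05:
    print("Reject the null hypothesis; p is the risk of a Type I error")
```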
Chi-square test of independence: two variables
(nominal and nominal, nominal and ordinal, or
ordinal and ordinal)
Affected by number of cells, number of cases
2-tailed test = non-directional hypothesis
1-tailed test = directional hypothesis
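A chi-square statistic for two nominal variables can be computed directly from its definition; the 2x2 table of counts below is hypothetical (e.g. occupation category vs. reason for library use):

```python
# Chi-square test of independence for a 2x2 table of observed counts.
rows = [[30, 10],
        [20, 40]]

row_totals = [sum(r) for r in rows]
col_totals = [sum(c) for c in zip(*rows)]
total = sum(row_totals)

# Expected count for each cell = row total * column total / grand total.
chi2 = sum((rows[i][j] - row_totals[i] * col_totals[j] / total) ** 2
           / (row_totals[i] * col_totals[j] / total)
           for i in range(2) for j in range(2))
print(round(chi2, 2))
# For a 2x2 table (1 degree of freedom) the .05 critical value is 3.84;
# a larger chi-square suggests the variables are not independent.
```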
Correlation—the extent to which two variables
are related across a group of subjects
Pearson r
It can range from -1.00 to 1.00
-1.00 is a perfect inverse relationship—the strongest possible
inverse relationship
0.00 indicates the complete absence of a relationship
1.00 is a perfect positive relationship—the strongest possible
direct relationship
The closer a value is to 0.00, the weaker the relationship
The closer a value is to -1.00 or +1.00, the stronger it is
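Pearson's r can be computed from its definition; the paired values below are hypothetical (e.g. hours of instruction vs. quiz score):

```python
# Pearson's r: covariance divided by the product of the
# standard deviations (here via sums of squared deviations).
from math import sqrt

x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]

n = len(x)
mx, my = sum(x) / n, sum(y) / n
cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
r = cov / sqrt(sum((a - mx) ** 2 for a in x)
               * sum((b - my) ** 2 for b in y))
print(round(r, 3))  # always between -1.00 and 1.00
```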
Spearman rho
t-test
Tests the difference between two sample means
for significance
pretest to posttest
Relates to research design
Perhaps used for information literacy instruction
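The pretest/posttest case can be sketched as a paired t statistic; the quiz scores below are hypothetical information literacy results:

```python
# Paired (pretest to posttest) t statistic: mean difference
# divided by the standard error of the differences.
from math import sqrt
from statistics import mean, stdev

pretest  = [55, 60, 62, 70, 58, 65]
posttest = [62, 66, 61, 78, 64, 70]

diffs = [post - pre for pre, post in zip(pretest, posttest)]
n = len(diffs)
t = mean(diffs) / (stdev(diffs) / sqrt(n))
print(round(t, 2))
# Compare |t| to a critical value for n - 1 degrees of freedom
# (or get a p value from a t distribution, e.g. via scipy.stats).
```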
Analysis of variance
Regression analysis (including step-wise
regression)
Analysis of variance (ANOVA) tests the
difference(s) among two or more means
It can be used to test the difference between
two means
So use t-test or ANOVA?
KEY: ANOVA can also be used to test the
difference among more than two means in a
single test, which cannot be done with a t-test
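The one-way ANOVA F statistic can be computed from its definition for three groups at once; the groups below are hypothetical:

```python
# One-way ANOVA: F = between-group variance / within-group variance.
from statistics import mean

groups = [[4, 5, 6], [7, 8, 9], [10, 11, 12]]  # three hypothetical groups
grand = mean(x for g in groups for x in g)
k = len(groups)                   # number of groups
n = sum(len(g) for g in groups)   # total number of cases

ss_between = sum(len(g) * (mean(g) - grand) ** 2 for g in groups)
ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
f = (ss_between / (k - 1)) / (ss_within / (n - k))
print(round(f, 2))  # compare to an F critical value for (k-1, n-k) df
```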
Parametric statistical tests generally require
interval or ratio level data and assume that the
scores were drawn from a normally distributed
population or that both sets of scores were
drawn from populations with the same
variance or spread of scores
Nonparametric methods do not make
assumptions about the shape of the population
distribution. These are typically less powerful
and often need large samples