Important Statistics Formulas

This web page presents statistics formulas described in the Stat Trek tutorials. Each formula links to
a web page that explains how to use the formula.

Parameters

Population mean = μ = ( Σ X_i ) / N

Population standard deviation = σ = sqrt [ Σ ( X_i - μ )² / N ]

Population variance = σ² = Σ ( X_i - μ )² / N

Variance of population proportion = σ_P² = PQ / n

Standardized score = Z = (X - μ) / σ

Population correlation coefficient = ρ = [ 1 / N ] * Σ { [ (X_i - μ_X) / σ_x ] * [ (Y_i - μ_Y) / σ_y ] }
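
As a rough illustration of the population formulas above, the following Python sketch computes the mean, variance, standard deviation, and a standardized score. The function names and the data are made up for the example; they are not part of the original page.

```python
import math

def population_summary(values):
    """Return (mean, variance, standard deviation), dividing by N (full population)."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((x - mean) ** 2 for x in values) / n
    return mean, variance, math.sqrt(variance)

def z_score(x, mean, sd):
    """Standardized score: distance from the mean in standard-deviation units."""
    return (x - mean) / sd

scores = [2, 4, 4, 4, 5, 5, 7, 9]  # hypothetical population
mu, var, sigma = population_summary(scores)
print(mu, var, sigma)           # 5.0 4.0 2.0
print(z_score(9, mu, sigma))    # 2.0
```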

Statistics

Unless otherwise noted, these formulas assume simple random sampling.

Sample mean = x̄ = ( Σ x_i ) / n

Sample standard deviation = s = sqrt [ Σ ( x_i - x̄ )² / ( n - 1 ) ]

Sample variance = s² = Σ ( x_i - x̄ )² / ( n - 1 )

Variance of sample proportion = s_p² = pq / (n - 1)

Pooled sample proportion = p = (p1 * n1 + p2 * n2) / (n1 + n2)

Pooled sample standard deviation = s_p = sqrt { [ (n1 - 1) * s1² + (n2 - 1) * s2² ] / (n1 + n2 - 2) }

Sample correlation coefficient = r = [ 1 / (n - 1) ] * Σ { [ (x_i - x̄) / s_x ] * [ (y_i - ȳ) / s_y ] }
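
The sample formulas differ from the population ones mainly in the n - 1 divisor. A minimal Python sketch, assuming small made-up samples and hypothetical function names, might look like this:

```python
import math
import statistics

def sample_variance(xs):
    """Sample variance with the n - 1 divisor."""
    mean = sum(xs) / len(xs)
    return sum((x - mean) ** 2 for x in xs) / (len(xs) - 1)

def pooled_sd(xs, ys):
    """Pooled sample standard deviation of two samples."""
    n1, n2 = len(xs), len(ys)
    numerator = (n1 - 1) * sample_variance(xs) + (n2 - 1) * sample_variance(ys)
    return math.sqrt(numerator / (n1 + n2 - 2))

def sample_correlation(xs, ys):
    """Average product of standardized scores, using the n - 1 divisor."""
    n = len(xs)
    mx, my = statistics.mean(xs), statistics.mean(ys)
    sx, sy = statistics.stdev(xs), statistics.stdev(ys)
    return sum(((x - mx) / sx) * ((y - my) / sy) for x, y in zip(xs, ys)) / (n - 1)

group_a = [1, 2, 3, 4, 5]     # hypothetical sample
group_b = [2, 4, 6, 8, 10]    # hypothetical sample (exactly 2 * group_a)

print(sample_variance(group_a))   # 2.5
print(pooled_sd(group_a, group_b))
print(sample_correlation(group_a, group_b))
```

Because group_b is an exact multiple of group_a, the correlation should come out at (or within rounding error of) 1.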

Correlation

Pearson product-moment correlation = r = Σ (xy) / sqrt [ ( Σ x² ) * ( Σ y² ) ], where x and y are deviation scores (x = x_i - x̄ and y = y_i - ȳ)

Linear correlation (sample data) = r = [ 1 / (n - 1) ] * Σ { [ (x_i - x̄) / s_x ] * [ (y_i - ȳ) / s_y ] }

Linear correlation (population data) = ρ = [ 1 / N ] * Σ { [ (X_i - μ_X) / σ_x ] * [ (Y_i - μ_Y) / σ_y ] }

Simple Linear Regression

Simple linear regression line: ŷ = b0 + b1x

Regression coefficient = b1 = Σ [ (x_i - x̄) (y_i - ȳ) ] / Σ [ (x_i - x̄)² ]

Regression intercept = b0 = ȳ - b1 * x̄

Regression coefficient = b1 = r * (s_y / s_x)

Standard error of regression slope = s_b1 = sqrt [ Σ (y_i - ŷ_i)² / (n - 2) ] / sqrt [ Σ (x_i - x̄)² ]
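
The regression formulas can be sketched directly from their definitions. The helper below is illustrative only (the name fit_line and the data are assumptions, not from the original page); it returns the intercept, the slope, and the standard error of the slope.

```python
import math

def fit_line(xs, ys):
    """Least-squares intercept b0, slope b1, and standard error of the slope."""
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    # Sum of squared residuals around the fitted line
    residual_ss = sum((y - (b0 + b1 * x)) ** 2 for x, y in zip(xs, ys))
    se_b1 = math.sqrt(residual_ss / (n - 2)) / math.sqrt(sxx)
    return b0, b1, se_b1

xs = [1, 2, 3, 4, 5]   # hypothetical predictor values
ys = [2, 4, 5, 4, 5]   # hypothetical responses
b0, b1, se_b1 = fit_line(xs, ys)
print(round(b0, 3), round(b1, 3), round(se_b1, 3))  # 2.2 0.6 0.283
```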

Counting

n factorial: n! = n * (n-1) * (n - 2) * . . . * 3 * 2 * 1. By convention, 0! = 1.

Permutations of n things, taken r at a time: nPr = n! / (n - r)!

Combinations of n things, taken r at a time: nCr = n! / [ r! * (n - r)! ] = nPr / r!
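
Python's standard library already covers these counting rules. The short check below assumes Python 3.8 or later (where math.perm and math.comb are available) and uses arbitrary example values for n and r.

```python
import math

n, r = 5, 2  # arbitrary example values

print(math.factorial(n))  # 120
print(math.perm(n, r))    # 20
print(math.comb(n, r))    # 10

# The identities from the formulas above
assert math.perm(n, r) == math.factorial(n) // math.factorial(n - r)
assert math.comb(n, r) == math.perm(n, r) // math.factorial(r)
assert math.factorial(0) == 1
```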

Probability

Rule of addition: P(A ∪ B) = P(A) + P(B) - P(A ∩ B)

Rule of multiplication: P(A ∩ B) = P(A) P(B|A)

Rule of subtraction: P(A') = 1 - P(A)
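
A quick way to check the three rules is to enumerate a small sample space. The sketch below uses one roll of a fair die, with A = "even" and B = "greater than 3"; the events and helper name are chosen only for illustration.

```python
from fractions import Fraction

outcomes = set(range(1, 7))               # one roll of a fair die
A = {x for x in outcomes if x % 2 == 0}   # even: {2, 4, 6}
B = {x for x in outcomes if x > 3}        # greater than 3: {4, 5, 6}

def prob(event):
    """Probability of an event under equally likely outcomes."""
    return Fraction(len(event), len(outcomes))

# Rule of addition
p_union = prob(A) + prob(B) - prob(A & B)
# Rule of multiplication, with P(B|A) counted directly within A
p_b_given_a = Fraction(len(A & B), len(A))
p_joint = prob(A) * p_b_given_a
# Rule of subtraction
p_not_a = 1 - prob(A)

print(p_union, p_joint, p_not_a)  # 2/3 1/3 1/2
```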

Random Variables

In the following formulas, X and Y are random variables, and a and b are constants.

Expected value of X = E(X) = μ_x = Σ [ x_i * P(x_i) ]

Variance of X = Var(X) = σ² = Σ [ x_i - E(x) ]² * P(x_i) = Σ [ x_i - μ_x ]² * P(x_i)

Normal random variable = z-score = z = (X - μ) / σ

Chi-square statistic = χ² = [ ( n - 1 ) * s² ] / σ²

f statistic = f = [ s1² / σ1² ] / [ s2² / σ2² ]

Expected value of sum of random variables = E(X + Y) = E(X) + E(Y)

Expected value of difference between random variables = E(X - Y) = E(X) - E(Y)

Variance of the sum of independent random variables = Var(X + Y) = Var(X) + Var(Y)

Variance of the difference between independent random variables = Var(X - Y) = Var(X) + Var(Y)
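
The expected-value and variance rules can be checked on a tiny discrete distribution. In this sketch a distribution is represented as a dict mapping each value to its probability; that representation, the function names, and the coin-toss example are assumptions made for illustration.

```python
def expected_value(dist):
    """E(X) for a discrete distribution given as {value: probability}."""
    return sum(x * p for x, p in dist.items())

def variance(dist):
    """Var(X): probability-weighted squared deviation from E(X)."""
    mu = expected_value(dist)
    return sum((x - mu) ** 2 * p for x, p in dist.items())

# X = number of heads in one fair coin toss
coin = {0: 0.5, 1: 0.5}
# X + Y = number of heads in two independent fair tosses
two_tosses = {0: 0.25, 1: 0.5, 2: 0.25}

print(expected_value(coin), variance(coin))              # 0.5 0.25
print(expected_value(two_tosses), variance(two_tosses))  # 1.0 0.5
```

The second line of output is twice the first, which is what E(X + Y) = E(X) + E(Y) and Var(X + Y) = Var(X) + Var(Y) predict for independent tosses.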

Sampling Distributions

Mean of sampling distribution of the mean = μ_x̄ = μ

Mean of sampling distribution of the proportion = μ_p = P

Standard deviation of proportion = σ_p = sqrt[ P * (1 - P) / n ] = sqrt( PQ / n )

Standard deviation of the mean = σ_x̄ = σ / sqrt(n)

Standard deviation of difference of sample means = σ_d = sqrt[ (σ1² / n1) + (σ2² / n2) ]

Standard deviation of difference of sample proportions = σ_d = sqrt{ [P1(1 - P1) / n1] + [P2(1 - P2) / n2] }

Standard Error

Standard error of proportion = SE_p = s_p = sqrt[ p * (1 - p) / n ] = sqrt( pq / n )

Standard error of difference for proportions = SE_p = s_p = sqrt{ p * ( 1 - p ) * [ (1/n1) + (1/n2) ] }, where p is the pooled sample proportion

Standard error of the mean = SE_x̄ = s_x̄ = s / sqrt(n)

Standard error of difference of sample means = SE_d = s_d = sqrt[ (s1² / n1) + (s2² / n2) ]

Standard error of difference of paired sample means = SE_d = s_d = { sqrt [ Σ (d_i - d̄)² / (n - 1) ] } / sqrt(n)

Pooled sample standard error = s_pooled = sqrt { [ (n1 - 1) * s1² + (n2 - 1) * s2² ] / (n1 + n2 - 2) }

Standard error of difference of sample proportions = s_d = sqrt{ [p1(1 - p1) / n1] + [p2(1 - p2) / n2] }
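
Several of the standard-error formulas above translate into one-line functions. The following sketch uses hypothetical function names and round example numbers so the results are easy to verify by hand.

```python
import math

def se_mean(s, n):
    """Standard error of the mean: s / sqrt(n)."""
    return s / math.sqrt(n)

def se_proportion(p, n):
    """Standard error of a sample proportion."""
    return math.sqrt(p * (1 - p) / n)

def se_diff_means(s1, n1, s2, n2):
    """Standard error of the difference between two independent sample means."""
    return math.sqrt(s1 ** 2 / n1 + s2 ** 2 / n2)

def se_diff_proportions_pooled(p1, n1, p2, n2):
    """Standard error of the difference for proportions, using the pooled proportion."""
    p = (p1 * n1 + p2 * n2) / (n1 + n2)
    return math.sqrt(p * (1 - p) * (1 / n1 + 1 / n2))

print(se_mean(10, 25))               # 2.0
print(se_proportion(0.5, 100))       # 0.05
print(se_diff_means(3, 9, 4, 16))    # sqrt(1 + 1), about 1.414
print(se_diff_proportions_pooled(0.5, 100, 0.5, 100))
```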

Discrete Probability Distributions

Binomial formula: P(X = x) = b(x; n, P) = nCx * P^x * (1 - P)^(n - x) = nCx * P^x * Q^(n - x)

Mean of binomial distribution = μ_x = n * P

Variance of binomial distribution = σ_x² = n * P * ( 1 - P )

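Negative Binomial formula: P(X = x) = b*(x; r, P) = (x-1)C(r-1) * P^r * (1 - P)^(x - r)

Mean of negative binomial distribution = μ_x = r / P, when X counts the trials needed to produce r successes (the mean number of failures before the r-th success is rQ / P)

Variance of negative binomial distribution = σ_x² = r * Q / P²

Geometric formula: P(X = x) = g(x; P) = P * Q^(x - 1)

Mean of geometric distribution = μ_x = 1 / P, when X counts the trials up to and including the first success (the mean number of failures before the first success is Q / P)

Variance of geometric distribution = σ_x² = Q / P²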

Hypergeometric formula: P(X = x) = h(x; N, n, k) = [ kCx ] [ (N-k)C(n-x) ] / [ NCn ]

Mean of hypergeometric distribution = μ_x = n * k / N

Variance of hypergeometric distribution = σ_x² = n * k * ( N - k ) * ( N - n ) / [ N² * ( N - 1 ) ]

Poisson formula: P(x; μ) = (e^(-μ)) (μ^x) / x!

Mean of Poisson distribution = μ_x = μ

Variance of Poisson distribution = σ_x² = μ

Multinomial formula: P = [ n! / ( n1! * n2! * ... * nk! ) ] * ( p1^n1 * p2^n2 * . . . * pk^nk )
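
The probability formulas for the discrete distributions can be written almost verbatim in Python. This is a minimal sketch with hypothetical function names; it assumes Python 3.8+ for math.comb, and the negative binomial version treats x as the trial on which the r-th success occurs.

```python
import math

def binomial_pmf(x, n, p):
    """P(X = x): x successes in n independent trials."""
    return math.comb(n, x) * p ** x * (1 - p) ** (n - x)

def negative_binomial_pmf(x, r, p):
    """P(X = x): the r-th success occurs on trial x."""
    return math.comb(x - 1, r - 1) * p ** r * (1 - p) ** (x - r)

def geometric_pmf(x, p):
    """P(X = x): the first success occurs on trial x."""
    return p * (1 - p) ** (x - 1)

def hypergeometric_pmf(x, N, n, k):
    """P(X = x): x successes in n draws without replacement, k successes among N items."""
    return math.comb(k, x) * math.comb(N - k, n - x) / math.comb(N, n)

def poisson_pmf(x, mu):
    """P(X = x) for a Poisson variable with mean mu."""
    return math.exp(-mu) * mu ** x / math.factorial(x)

print(binomial_pmf(2, 5, 0.5))            # 0.3125
print(negative_binomial_pmf(3, 2, 0.5))   # 0.25
print(hypergeometric_pmf(2, 10, 3, 4))    # 0.3
print(poisson_pmf(0, 2))                  # e^-2, about 0.135
```

The geometric distribution is the negative binomial with r = 1, so geometric_pmf(x, p) and negative_binomial_pmf(x, 1, p) should agree.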

Linear Transformations

For the following formulas, assume that Y is a linear transformation of the random variable X,
defined by the equation: Y = aX + b.

Mean of a linear transformation = E(Y) = Ȳ = aX̄ + b.

Variance of a linear transformation = Var(Y) = a² * Var(X).

Standardized score = z = (x - μ_x) / σ_x.

t statistic = t = (x̄ - μ_x) / [ s / sqrt(n) ].
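
A familiar linear transformation is the Celsius-to-Fahrenheit conversion, Y = 1.8X + 32. The sketch below, using made-up temperatures, checks that the mean and variance of the transformed values match the two rules above.

```python
import math
import statistics

a, b = 1.8, 32                 # Y = aX + b (Celsius to Fahrenheit)
celsius = [10, 20, 30]         # hypothetical readings
fahrenheit = [a * c + b for c in celsius]

mean_x = statistics.mean(celsius)
var_x = statistics.pvariance(celsius)

# Mean of the transformed values equals a * mean(X) + b
assert math.isclose(statistics.mean(fahrenheit), a * mean_x + b)
# Variance of the transformed values equals a^2 * Var(X); b drops out
assert math.isclose(statistics.pvariance(fahrenheit), a ** 2 * var_x)

print(statistics.mean(fahrenheit), statistics.pvariance(fahrenheit))
```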

Estimation

Confidence interval: Sample statistic ± Critical value * Standard error of statistic

Margin of error = (Critical value) * (Standard deviation of statistic)

Margin of error = (Critical value) * (Standard error of statistic)
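
As a worked illustration, the sketch below builds a confidence interval for a mean. It assumes a large sample, so the critical value is taken from the standard normal distribution via statistics.NormalDist (Python 3.8+); for small samples a t critical value would be the usual choice instead. The function name and the numbers are hypothetical.

```python
import math
from statistics import NormalDist

def z_confidence_interval(sample_mean, s, n, confidence=0.95):
    """Sample statistic +/- critical value * standard error (normal critical value)."""
    critical_value = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    standard_error = s / math.sqrt(n)
    margin_of_error = critical_value * standard_error
    return sample_mean - margin_of_error, sample_mean + margin_of_error

# Hypothetical sample: mean 100, standard deviation 15, n = 36
low, high = z_confidence_interval(100, 15, 36)
print(round(low, 2), round(high, 2))  # 95.1 104.9
```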

Hypothesis Testing

Standardized test statistic = (Statistic - Parameter) / (Standard deviation of statistic)

One-sample z-test for proportions: z-score = z = (p - P0) / sqrt( P0 * Q0 / n ), where Q0 = 1 - P0

Two-sample z-test for proportions: z-score = z = [ (p1 - p2) - d ] / SE

One-sample t-test for means: t statistic = t = (x̄ - μ) / SE

Two-sample t-test for means: t statistic = t = [ (x̄1 - x̄2) - d ] / SE

Matched-sample t-test for means: t statistic = t = [ (x̄1 - x̄2) - D ] / SE = (d̄ - D) / SE

Chi-square test statistic = χ² = Σ [ (Observed - Expected)² / Expected ]
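
Two of these test statistics are sketched below: the one-sample t statistic and the chi-square statistic. The function names and the data are illustrative assumptions; the code computes only the statistic, not a P-value.

```python
import math
import statistics

def one_sample_t(xs, mu0):
    """t = (sample mean - hypothesized mean) / standard error of the mean."""
    n = len(xs)
    standard_error = statistics.stdev(xs) / math.sqrt(n)
    return (statistics.mean(xs) - mu0) / standard_error

def chi_square_statistic(observed, expected):
    """Sum over categories of (Observed - Expected)^2 / Expected."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))

sample = [1, 2, 3, 4, 5]                     # hypothetical data
t = one_sample_t(sample, mu0=2)
chi2 = chi_square_statistic([18, 22, 20], [20, 20, 20])

print(round(t, 4))     # 1.4142
print(round(chi2, 4))  # 0.4
```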

Degrees of Freedom

The correct formula for degrees of freedom (DF) depends on the situation (the nature of the test
statistic, the number of samples, underlying assumptions, etc.).

One-sample t-test: DF = n - 1

Two-sample t-test: DF = (s1² / n1 + s2² / n2)² / { [ (s1² / n1)² / (n1 - 1) ] + [ (s2² / n2)² / (n2 - 1) ] }

Two-sample t-test, pooled standard error: DF = n1 + n2 - 2

Simple linear regression, test slope: DF = n - 2

Chi-square goodness of fit test: DF = k - 1

Chi-square test for homogeneity: DF = (r - 1) * (c - 1)

Chi-square test for independence: DF = (r - 1) * (c - 1)
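
The two-sample t-test formula for DF is the easiest one to get wrong by hand. A small sketch (hypothetical function name and inputs) shows the computation; when both samples have the same size and the same standard deviation, it should reduce to the pooled value n1 + n2 - 2.

```python
import math

def two_sample_df(s1, n1, s2, n2):
    """Degrees of freedom for a two-sample t-test without pooling."""
    v1 = s1 ** 2 / n1
    v2 = s2 ** 2 / n2
    return (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))

equal_case = two_sample_df(2, 10, 2, 10)
unequal_case = two_sample_df(2, 10, 6, 10)

print(round(equal_case, 2))    # 18.0, same as n1 + n2 - 2
print(round(unequal_case, 2))  # smaller than 18 when the spreads differ
```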

Sample Size

Below, the first two formulas find the smallest sample sizes required to achieve a fixed margin of
error (ME), using simple random sampling. The third formula allocates the sample to strata, based on a
proportionate design. The fourth formula, Neyman allocation, uses stratified sampling to minimize
variance, given a fixed sample size. And the last formula, optimum allocation, uses stratified
sampling to minimize variance, given a fixed budget.

Mean (simple random sampling): n = { z² * σ² * [ N / (N - 1) ] } / { ME² + [ z² * σ² / (N - 1) ] }

Proportion (simple random sampling): n = [ ( z² * p * q ) + ME² ] / [ ME² + z² * p * q / N ]

Proportionate stratified sampling: n_h = ( N_h / N ) * n

Neyman allocation (stratified sampling): n_h = n * ( N_h * σ_h ) / [ Σ ( N_i * σ_i ) ]

Optimum allocation (stratified sampling): n_h = n * [ ( N_h * σ_h ) / sqrt( c_h ) ] / [ Σ ( N_i * σ_i ) / sqrt( c_i ) ]
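
The sample-size and allocation formulas can be sketched as follows. All function names, stratum sizes, standard deviations, and per-unit costs here are hypothetical, chosen so the allocations are easy to check by hand; in practice the results would be rounded to whole units.

```python
import math

def sample_size_mean(z, sigma, me, N):
    """Smallest n for a mean under simple random sampling (before rounding up)."""
    return (z ** 2 * sigma ** 2 * (N / (N - 1))) / (me ** 2 + z ** 2 * sigma ** 2 / (N - 1))

def proportionate_allocation(n, sizes):
    """n_h = (N_h / N) * n for each stratum."""
    total = sum(sizes)
    return [n * size / total for size in sizes]

def neyman_allocation(n, sizes, sds):
    """n_h proportional to N_h * sigma_h."""
    weights = [size * sd for size, sd in zip(sizes, sds)]
    return [n * w / sum(weights) for w in weights]

def optimum_allocation(n, sizes, sds, costs):
    """n_h proportional to N_h * sigma_h / sqrt(c_h)."""
    weights = [size * sd / math.sqrt(c) for size, sd, c in zip(sizes, sds, costs)]
    return [n * w / sum(weights) for w in weights]

required_n = math.ceil(sample_size_mean(z=1.96, sigma=10, me=2, N=1000))
print(required_n)                                              # 88

sizes, sds, costs = [1000, 2000], [2, 1], [1, 4]
print(proportionate_allocation(100, sizes))                    # about [33.3, 66.7]
print(neyman_allocation(100, sizes, sds))                      # [50.0, 50.0]
print(optimum_allocation(100, sizes, sds, costs))              # about [66.7, 33.3]
```

In this example the smaller stratum is more variable, so Neyman allocation shifts sample toward it relative to the proportionate design, and optimum allocation shifts further because that stratum is also cheaper to sample.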
