Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
67 views3 pages

Key Statistics Formulas Guide

This document defines and describes various statistical formulas used for descriptive statistics, hypothesis testing, regression analysis, and confidence/prediction intervals. It includes formulas for mean, variance, standard deviation, standard error, z-scores, regression lines, residuals, sums of squares, coefficient of determination, correlation coefficient, slope, intercept, and confidence/prediction intervals. The descriptions are technical and aimed at statisticians or students of statistics.

Uploaded by

Kiran Poudel
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
67 views3 pages

Key Statistics Formulas Guide

This document defines and describes various statistical formulas used for descriptive statistics, hypothesis testing, regression analysis, and confidence/prediction intervals. It includes formulas for mean, variance, standard deviation, standard error, z-scores, regression lines, residuals, sums of squares, coefficient of determination, correlation coefficient, slope, intercept, and confidence/prediction intervals. The descriptions are technical and aimed at statisticians or students of statistics.

Uploaded by

Kiran Poudel
Copyright
© Attribution Non-Commercial (BY-NC)
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

STATISTICS FORMULAS DESCRIPTIVE STATISTICS: MEAN: VARIANCE: STANDARD DEVIATION: STANDARD ERROR: Z-SCORE: REGRESSION LINES: For a data

set , where ( ) are the centroids (means) of the data set, and is the correlation coefficient: LEAST-SQUARES REGRESSION LINE: RESIDUALS: SSM SSE
2

HYPOTHESIS TESTING MEANS: STANDARD ERROR: MARGIN OF ERROR: m = CONFIDENCE INTERVAL: C.I. = SAMPLE SIZE FOR A GIVEN m: ONE SAMPLE Z-TEST: TWO SAMPLE Z-TEST: T-TEST: or m =

TWO SAMPLE T-TEST:

SST = SSM+SSE

PROPORTION: STANDARD ERROR:

, where X= number of successes

COEFFICIENT OF DETERMINATION: r = CORRELATION COEFFICIENT: r = SLOPE: INTERCEPT:

MARGIN OF ERROR: m = Z-TEST, ONE-SAMPLE PROPORTION:

STD ERR, 2-SAMP PROP: VARIANCE: STANDARD ERROR b1: SEb1 = PLUS FOUR PROPORTIONS: STANDARD ERROR bo: SEb0 = CONFIDENCE LEVEL FOR THE INTERCEPT CONFIDENCE LEVEL FOR THE SLOPE: PREDICTION INTERVAL: : : t*SEb0 t*SEb1 POOLED PROPORTION: POOLED STD ERR: TWO SAMPLE Z-SCORE: EST DIFF BTWN PROPS: STD DEV: ST DEV: MARGIN OF ERR, 2-SAMP PROP: m =

Reference: Moore DS, McCabe GP & Craig BA. Introduction to the Basic Practice of Statistics. New York: W.H. Freeman & Co, 5 edition.

th

DESCRIPTIONS OF STATISTICS FORMULAS MEAN: The mean, symbolized by x-bar, equals one divided by the number of samples multiplied by the sum of all data points, symbolized by x-sub-i. VARIANCE: Variance, symbolized by s squared, equals 1 divided by the number of samples minus one, multiplied by the sum of each data point subtracted by the mean then squared. STANDARD DEVIATION: Standard deviation, symbolized by s, equals the square root of the variance s-squared. STANDARD ERROR: The standard error of the mean equals the standard deviation divided by the square root of the number of samples. Z-SCORE: Z equals the test data minus the population mean, then divided by the population standard deviation. REGRESSION LINES: LEAST-SQUARES REGRESSION LINE: The predicted value, symbolized by y-hat, equals the intercept, symbolized by b-sub-o, plus the slope, symbolized by b-sub-1, times the data point x. RESIDUALS: The residual, symbolized by e-sub-I, equals the data point y, symbolized by y-sub-I, minus the predicted value from the least-squares regression line, symbolized y-hat. SSM, SSE, SST: Sum of square means equals the sum of the centriod, symbolized by y-bar, minus the predicted value of each x data point, symbolized by y-hat sub I. Sum of square errors equal the sum of each y data point, symbolized by y-sub-I, minus the predicted value of each data point, symbolized by y-hat-sub-I, then squared. The Sum of Square Total = Sum of Square Means plus Sum of Square Errors. COEFFICIENT OF DETERMINATION: The coefficient of determination, symbolized r-squared, equals the sum of square means divided by the sum of squares total. CORRELATION COEFFICIENT: The correlation coefficient r equals the square root of the coefficient of determination, symbolized by r-squared. SLOPE: Slope, symbolized b-sub-one, equals the correlation coefficient r multiplied by the ratio of the standard deviation of the x data points to the standard deviation of the y data points. INTERCEPT: Intercept, symbolized by b-sub-zero, equals the mean of the y data points, symbolized by y-bar, minus the slope, symbolized by b-sub-one multiplied by the mean of the x data points, symbolized by x-bar. VARIANCE: Mean of Square Errors, symbolized s-squared or MSE, is equal to the sum of the residuals, symbolized by e-sub-I, squared then divided by the number of data points subtracted by two. STANDARD DEVIATION, symbolized by s, equals the square root of variance. STANDARD ERROR: The standard error of the slope, symbolized by SE-sub-b1, equals the standard deviation, symbolized by s, divided by the square root of the sum of each data point, symbolized by x-sub-I, subtracted from the mean of all x data points, symbolized by s-bar, then squared. The STANDARD ERROR of the intercept, symbolized by SE-sub-bo, equals the standard deviation, symbolized by s, multiplied by the square root of one divided by the number of data points plus the mean of all xs squared, symbolized by x -bar squared, divided by the sum of all x data points, symbolized by x-sub-I minus the mean of all x data points, symbolized by x-bar, squared.

Reference: Moore DS, McCabe GP & Craig BA. Introduction to the Basic Practice of Statistics. Ne w York: W.H. Freeman & Co, 5th edition.

CONFIDENCE LEVEL FOR THE INTERCEPT: The confidence level for the intercept, symbolized beta-sub-zero, equals the sample intercept, symbolized by b-sub-zero, plus or minus the t-score for the interval, symbolized by t, multiplied by the standard error of the intercept. CONFIDENCE LEVEL FOR THE SLOPE: The confidence level for the slope, symbolized by beta-sub-one, equals the sample slope, symbolized by b-sub-one, plus or minus the t-score for the interval, symbolized by t, multiplied by the standard error of the slope. PREDICTION INTERVAL: The prediction interval equals the predicted value of y, symbolized by y-hat, plus or minus the t-score for the interval, symbolized by t, multiplied by the standard error.

Reference: Moore DS, McCabe GP & Craig BA. Introduction to the Basic Practice of Statistics. Ne w York: W.H. Freeman & Co, 5th edition.

You might also like