Unit 2 DSRP

Uploaded by

foredu48

UNIT-2

• Descriptive Statistics
• Basic Statistical Analysis
Descriptive Statistics
• Measures of central tendency
• Measures of location and dispersion
• Practice and analysis with R
Measures of Central Tendency & Dispersion
• Measures that indicate the approximate
center of a distribution are called measures of
central tendency
• Measures that describe the spread of the
data are measures of dispersion
• These measures include the mean, median,
mode, range, upper and lower quartiles,
variance, and standard deviation
Process of Descriptive Analysis
Measure of central tendency
• A measure of central tendency summarizes the
whole data set with a single value that locates
the center of the distribution.
There are three main measures of central
tendency:
• Mean
• Mode
• Median
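As a minimal sketch of the three measures, the snippet below computes them on a small made-up vector. The `ages` values and the variable names are assumptions for illustration, not data from the slides. Base R has no built-in function for the statistical mode, so it is derived here from a frequency table.

```r
# Hypothetical sample of ages (illustrative only, not from the original data set)
ages <- c(21, 23, 23, 25, 30, 34, 23, 30)

mean_age <- mean(ages)      # arithmetic mean
median_age <- median(ages)  # middle value of the sorted data

# Base R has no statistical mode function: tabulate the values
# and keep the most frequent one(s).
counts <- table(ages)
mode_age <- as.numeric(names(counts)[counts == max(counts)])

print(mean_age)
print(median_age)
print(mode_age)
```

If several values tie for the highest frequency, `mode_age` holds all of them.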
Measure of variability
OR
Measure of Dispersion
A measure of variability describes the spread of
the data, that is, how widely the values are
distributed around the center. The most
common variability measures are:
• Range
• Variance
• Standard deviation
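The following sketch shows the three dispersion measures on a small made-up vector (the `ages` values and variable names are assumptions for illustration). It also recomputes the variance by hand to show that `var()` returns the sample variance, which divides by n - 1.

```r
# Hypothetical sample of ages (illustrative only)
ages <- c(21, 23, 23, 25, 30, 34, 23, 30)

age_range <- max(ages) - min(ages)  # range as a single number
age_var <- var(ages)                # sample variance (divides by n - 1)
age_sd <- sd(ages)                  # standard deviation = sqrt(variance)

# Manual check of the sample variance formula
manual_var <- sum((ages - mean(ages))^2) / (length(ages) - 1)

print(age_range)
print(age_var)
print(age_sd)
print(all.equal(age_var, manual_var))
```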
Practice and analysis with R
• getwd()   # show the current working directory
• setwd("C:/Users/USHARAM/Desktop/R-Practice")   # adjust to your own folder
• mydata <- read.csv("CGF.csv")
• print(head(mydata))
• age_mean <- mean(mydata$Age)
• print(age_mean)
• age_median <- median(mydata$Age)
• print(age_median)
• install.packages("modeest")   # needed only once
• library(modeest)
• age_mode <- mfv(mydata$Age)   # most frequent value(s)
• print(age_mode)
• age_max <- max(mydata$Age)
• age_min <- min(mydata$Age)
• age_range <- age_max - age_min
• cat("Range is:\n")
• print(age_range)
• age_min_max <- range(mydata$Age)   # returns c(min, max)
• print(age_min_max)
• age_var <- var(mydata$Age)
• print(age_var)
• age_sd <- sd(mydata$Age)
• print(age_sd)
• age_quartiles <- quantile(mydata$Age)
• print(age_quartiles)
• age_iqr <- IQR(mydata$Age)
• print(age_iqr)
• age_summary <- summary(mydata$Age)
• print(age_summary)
• q()   # quit the R session
Basic Statistical Analysis
• Statistical hypothesis generation and testing
• Chi-Square test
• t-Test
• Analysis of variance
• Correlation analysis
• Maximum likelihood test
• Practice and analysis with R
Hypothesis Testing in R Programming
• A hypothesis is a statement that researchers
make about the data collected for an
experiment or data set.
• It is an assumption, so it is not necessarily
true.
• In other words, a hypothesis is a claim that the
researchers make about the population based
on the data collected from it.
• Hypothesis testing in R is the process of
testing, or validating, the hypothesis made by
the researcher.
• To perform hypothesis testing, a random
sample is drawn from the population and a
test is run on it. Based on the result, the
hypothesis is either rejected or not rejected
(often loosely called "accepted"). Drawing
conclusions about a population from a sample
in this way is known as statistical inference.
Topics covered:
• The four-step process of hypothesis testing
• One-sample t-test
• Two-sample t-test
• Directional hypotheses
• One-sample U-test
• Two-sample U-test
• Correlation test in R
Two-Sample t-Test with Unequal
Variance
• The general way to use the t.test() command
is to compare two vectors of numeric values.
By default the command does not assume
equal variances and runs the Welch version of
the test.
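A minimal sketch of the default two-sample call follows. The vectors `group_a` and `group_b` are made-up values used only for illustration.

```r
# Hypothetical measurements for two independent groups (illustrative only)
group_a <- c(5.1, 4.9, 6.2, 5.8, 6.0, 5.5, 5.3)
group_b <- c(6.8, 7.1, 6.5, 7.4, 6.9, 7.8, 6.3)

# Default call: Welch two-sample t-test (variances not assumed equal)
welch <- t.test(group_a, group_b)
print(welch)

# Individual pieces of the result can be extracted by name
print(welch$statistic)  # t-value
print(welch$parameter)  # (fractional) degrees of freedom
print(welch$p.value)
```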
Two-Sample t-Test with Equal
Variance
• You can override the default and use the
classic t-test by adding the var.equal = TRUE
instruction, which forces the command to
assume that the variance of the two samples
is equal.
• The calculation of the t-value uses pooled
variance and the degrees of freedom are
unmodified; as a result, the p-value is slightly
different from the Welch version.
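The sketch below runs both versions on the same made-up vectors (`group_a` and `group_b` are assumptions for illustration) so the degrees of freedom and p-values can be compared side by side.

```r
# Hypothetical measurements for two independent groups (illustrative only)
group_a <- c(5.1, 4.9, 6.2, 5.8, 6.0, 5.5, 5.3)
group_b <- c(6.8, 7.1, 6.5, 7.4, 6.9, 7.8, 6.3)

welch <- t.test(group_a, group_b)                     # default (Welch)
pooled <- t.test(group_a, group_b, var.equal = TRUE)  # classic t-test

# Classic test: df = n1 + n2 - 2; Welch: adjusted, usually fractional df
print(pooled$parameter)
print(welch$parameter)
print(c(pooled = pooled$p.value, welch = welch$p.value))
```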
One-Sample t-Testing
• You can also carry out a one-sample t-test. In
this version you supply the name of a single
vector and, through the mu argument, the
mean to compare it to (this defaults to 0).
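A short sketch of the one-sample form, using a made-up vector `sample_x` and an arbitrary comparison mean of 5 (both are assumptions for illustration):

```r
# Hypothetical sample (illustrative only)
sample_x <- c(5.1, 4.9, 6.2, 5.8, 6.0, 5.5, 5.3)

# Default: test whether the population mean differs from 0
one_default <- t.test(sample_x)

# Test against a specific mean instead
one_mu5 <- t.test(sample_x, mu = 5)

print(one_default)
print(one_mu5)
```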
Using Directional Hypotheses
• You can also specify a “direction” to your
hypothesis. In many cases you are simply testing
to see if the means of two samples are different,
but you may want to know if a sample mean is
lower than another sample mean (or greater).
• You can use the alternative = instruction to switch
the emphasis from a two-sided test (the default)
to a one-sided test. The choices you have are
"two.sided", "less", or "greater", and
your choice can be abbreviated.
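The following sketch illustrates one-sided t-tests on made-up vectors (`group_a` and `group_b` are assumptions for illustration). With `alternative = "less"` the alternative hypothesis is that the mean of the first vector is lower than the mean of the second.

```r
# Hypothetical measurements for two independent groups (illustrative only)
group_a <- c(5.1, 4.9, 6.2, 5.8, 6.0, 5.5, 5.3)
group_b <- c(6.8, 7.1, 6.5, 7.4, 6.9, 7.8, 6.3)

# One-sided: is the mean of group_a lower than the mean of group_b?
less_test <- t.test(group_a, group_b, alternative = "less")

# The value can be abbreviated: "g" is matched to "greater"
greater_test <- t.test(group_a, group_b, alternative = "g")

print(less_test$p.value)
print(greater_test$p.value)
```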
U-test
• The U-test is used for comparing the median
values of two samples. You use it when the
data are not normally distributed, so it is
described as a non-parametric test.
• The U-test is often called the Mann-Whitney
U-test but is generally attributed to Wilcoxon
(Wilcoxon Rank Sum test), hence in R the
command is wilcox.test().
• When you have two independent samples to
compare and the data do not meet the
assumptions of a parametric test, you can use
the U-test.
• This goes by various names and may be known
as the Mann-Whitney U-test or the Wilcoxon
rank sum test (the Wilcoxon signed rank test is
the paired or one-sample variant). You use the
wilcox.test() command to carry out the analysis.
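A minimal sketch of the two-sample U-test on made-up vectors (`group_a` and `group_b` are assumptions for illustration, chosen without tied values so that no tie warning is raised):

```r
# Hypothetical measurements for two independent groups (illustrative only)
group_a <- c(5.1, 4.9, 6.2, 5.8, 6.0, 5.5, 5.3)
group_b <- c(6.8, 7.1, 6.5, 7.4, 6.9, 7.8, 6.3)

# Two-sample U-test (Mann-Whitney / Wilcoxon rank sum)
u_test <- wilcox.test(group_a, group_b)
print(u_test)

print(u_test$statistic)  # W statistic
print(u_test$p.value)
```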
Using Directional Hypotheses
• Both one- and two-sample tests use an alternative
hypothesis that the location shift is not equal to 0 as
their default. This is essentially a two-sided
hypothesis.
• You can change this by using the alternative =
instruction, where you can select "two.sided", "less",
or "greater" as your alternative hypothesis (an
abbreviation is acceptable but you still need quotes,
single or double).
• You can also specify mu, the location shift. By default
mu = 0, but the hypothesized value can be set to
something other than 0.
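As a sketch of the `alternative` and `mu` arguments with `wilcox.test()`, the snippet below uses made-up vectors `x` and `y` and arbitrary shift values (10 and 2), all of which are assumptions for illustration. The values were picked to avoid ties.

```r
# Hypothetical samples (illustrative only)
x <- c(12, 15, 11, 18, 20, 14, 17)
y <- c(8, 11, 7, 14, 17, 5, 19)

# One-sample test: is the location of x greater than 10?
one_sided <- wilcox.test(x, mu = 10, alternative = "greater")
print(one_sided)

# Two-sample test: is the shift between x and y different from 2?
shifted <- wilcox.test(x, y, mu = 2)
print(shifted)
```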
Paired tests
• The t-test and the U-test can both be used when
your data are in matched pairs. Sometimes this
kind of test is also called a repeated measures test
(depending on circumstance). You can run the test
by adding paired = TRUE to the appropriate
command.
• As an example, consider data on the
effectiveness of greenhouse sticky traps in
catching whitefly. Each trap has a white side and a
yellow side. To compare white and yellow we can
use a matched-pairs test.
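The original counts for the sticky-trap example are not reproduced here, so the sketch below uses made-up numbers. The `white` and `yellow` vectors are assumptions for illustration, where position i in each vector refers to the same trap.

```r
# Hypothetical whitefly counts per trap (illustrative only);
# element i of each vector comes from the same trap.
white <- c(4, 7, 5, 9, 6, 3, 8, 5)
yellow <- c(10, 15, 8, 18, 11, 10, 19, 9)

# Paired t-test
paired_t <- t.test(white, yellow, paired = TRUE)
print(paired_t)

# Paired U-test (Wilcoxon signed rank test)
paired_u <- wilcox.test(white, yellow, paired = TRUE)
print(paired_u)
```

Both vectors must have the same length and the same ordering, because the test works on the per-trap differences.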
CORRELATION AND COVARIANCE
• When you have two continuous variables you can look for a
link between them; this link is called a correlation.
• You can go about finding this several ways using R. The cor()
command determines correlations between two vectors, all
the columns of a data frame (or matrix), or two data frames
(or matrix objects). The cov() command examines
covariance.
• By default the Pearson product moment (that is regular
parametric correlation) is used but Spearman (rho) and
Kendall (tau) methods (both non-parametric correlation)
can be specified instead. The cor.test() command carries
out a test of significance of the correlation.
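The commands described above can be sketched as follows. The vectors `x` and `y` are made-up values for illustration only.

```r
# Hypothetical paired observations (illustrative only)
x <- c(1, 2, 3, 4, 5, 6, 7, 8)
y <- c(2.1, 3.9, 6.2, 7.8, 10.1, 12.2, 13.8, 16.3)

pearson_r <- cor(x, y)                         # default: Pearson
spearman_rho <- cor(x, y, method = "spearman") # rank-based
kendall_tau <- cor(x, y, method = "kendall")   # rank-based
covariance <- cov(x, y)

# All pairwise correlations between the columns of a data frame
cor_matrix <- cor(data.frame(x = x, y = y))

# Significance test of the correlation
r_test <- cor.test(x, y)

print(pearson_r)
print(spearman_rho)
print(kendall_tau)
print(covariance)
print(cor_matrix)
print(r_test)
```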
Simple Correlation
• Simple correlations are between two
continuous variables and you can use the cor()
command to obtain a correlation coefficient
by passing it the two vectors.
• If your vectors are contained within a data
frame or some other object, you need to
extract them first, for example with the $
operator. Look at the women data frame. This
comes as example data with your distribution
of R.
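A short sketch using the built-in women data frame, which has the columns height and weight. The variable names `r_hw` and `r_all` are chosen here for illustration.

```r
# The women data frame ships with R (columns: height, weight)
print(head(women))

# Extract the two columns with $ and correlate them
r_hw <- cor(women$height, women$weight)
print(r_hw)

# Passing the whole data frame gives the full correlation matrix
r_all <- cor(women)
print(r_all)
```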
1. https://www.youtube.com/watch?v=ZcaKgqXsEbA
2. https://www.youtube.com/watch?v=ua-CiDNNj30
3. https://www.youtube.com/watch?v=xiEC5oFsq2s
