Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
12 views69 pages

RM LAB File

The document outlines a research methodology practical file for BBA students at the Institute of Information Technology and Management, detailing various statistical exercises and data analysis tasks. It includes objectives, outcomes, and a series of practical assignments aimed at familiarizing students with statistical tools and techniques using software like MS Excel and SPSS. The assignments cover topics such as student satisfaction surveys, data collection, hypothesis testing, and analysis of variance.

Uploaded by

angelrai1001
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
12 views69 pages

RM LAB File

The document outlines a research methodology practical file for BBA students at the Institute of Information Technology and Management, detailing various statistical exercises and data analysis tasks. It includes objectives, outcomes, and a series of practical assignments aimed at familiarizing students with statistical tools and techniques using software like MS Excel and SPSS. The assignments cover topics such as student satisfaction surveys, data collection, hypothesis testing, and analysis of variance.

Uploaded by

angelrai1001
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 69

RESEARCH METHODOLOGY

(PRACTICAL FILE)
(BBA 208)

INSTITUTE OF INFORMATION TECHNOLOGY AND MANAGEMENT

NEW DELHI 110058

BATCH (2019-2022)

SUBMITTED TO SUBMITTED BY

1. MR. RAGHAV JAIN NAME: R KARTHIK


ASSISTANT PROFESSOR ENROLLMENT NO: 06621101719
2. Dr. VIRENDER DAHIYA BATCH: BBA 2019 - 2022
ASSOCIATE PROFESSOR SEMESTER (IV E2)
RM LAB FILE December 30, 1899

INDEX

S.No. Practical Date Page Signature


No.
1. Prepare a Questionnaire to study the Student satisfaction. 04/03/2021 8-11
Identify the factors behind student’s satisfaction. Prepare a Research
Model for the same.( Construct and Attribute, Dependent and
Independent Variable)

2. Collect the data from 30 students by preparing Google form and 11/03/2021 12-14
analyze it by calculating Mean, Median and Mode.

3. Prepare a dataset of 50 points having 4 types of Cars (Company) and 11/03/2021 15-16
their respective mileages.
a) Calculate Average Mileage of 50 cars.
b) Calculate average mileages of every brand of car.
4. Prepare a data set of 30 points having 2 variables: Height and Age. 18/03/2021 17-18
Calculate Median and Mode value for both the variables.
5. Create a dataset of 100 employees with their names, status 18/03/2021 18-22
(Contract, Full Time, Hourly, Half time), Service Years (1 year to
10 years) and Salary (Rs/Month)
1. Calculate no of employees in each category
2. Calculate total salaries given to each category
3. Calculate average salary for each category
4. Calculate anything with “COUNTIFS” function
a. Calculate no of employees status wise more than
5 years of experiences
b. Calculate no of employees’ category wise, more
than 50,000 INR Salary P.M.

6. Prepare a sales record of 40 data points (Names*Region) with the 25/03/2021 23-24
following details:
Name (Only 10 Names), Product (Soap, Detergent, Shampoo,
Conditioner), Region (East, West, North, South), Units (e.g.
100,120,300 etc…) and Sales (Amount: 20,000, 45000 etc…)
Prepare a PIVOT Table with the following requirements
a. Overall Units sold by sales person (Region
wise)
b. Overall Sales of Product (Region wise)

7. What do you meant by Correlation? Explain the interpretation of 25/03/2021 25

2
RM LAB FILE December 30, 1899

coefficient of correlation.
8. Prepare a data set of NSE: Nifty, FMCG Sector, Infrastructure 01/04/2021 25-32
Sector, Banking Index and Steel Index for 1 Year (1 st April to 31st
March 2020.
9. Prepare Line Graph for every Index on the same Graph. 01/04/2021 33-34
Calculate Correlation between Banking and Steel Index
Calculate Correlation between Nifty and FMCG Sector Index
Calculate Correlation between Nifty and Infrastructure Index
10. There was two types of Drugs (1 and 2) that were tried on some 08/04/2021 34-35
patients for reducing weight. There were 8 adults who were
subjected to Drug 1 and 7 adults who were administered Drug 2.
The decrease in weight (in KGs) is given below:-

Drug 1 10 8 12 14 7 15 13 11

Drug 2 12 10 7 6 12 11 12

Do the Drugs differ significantly in their effect on decreasing


weights? Use 5% level of significance to check the variance.

11. A company selects 8 salesmen random and their sales figures for the 08/04/2021 36-37
previous month are recorded. Then they undergo a training course
devised by a business consultant, and their sales figures for the
following month are compared as shown in the table. Has the
training program caused in improvement in the salesmen’s ability?
Use 5% level of significance.

Previous
Month 75 90 94 95 100 90 70 64

Following
Month 77 101 93 92 105 88 76 68

12. DJB wants to examine the water consumption between 3 different 15/04/2021 37-39
sizes of apartments. It collects a sample of 5 for each of the
apartments. Using the hypothetical data provided below, test
whether the mean water consumption is equal for each type of
apartment. Use significance value at 5%.
Apartments LIH MIG HIG
Water Consumption 643 469 484
655 427 456
702 525 402

3
RM LAB FILE December 30, 1899

620 550 490


710 539 438
13. The following are the number of words per minute which a secretary 15/04/2021 39-41
typed on several occasions on three different typewriters.
Test whether the difference among the mean of the three samples
can be attributed to chance. Use 5% level of significance.

Typewriter 1 71 78 70 69 77 72 65 69

Typewriter 2 74 76 72 70 69 68 72 73

Typewriter 3 70 72 66 64 63 67 69 70

14. Compare the cholesterol contents of the four competing diet foods 15/04/2021 42-44
on the basis of the following data (in milligrams per package) which
were obtained for three randomly taken 6-ounce packages of each of
the diet foods:

Diet Food A 3.6 4.1 4.0

Diet Food B 3.1 3.2 3.9

Diet Food C 3.2 3.5 3.5

Diet Food D 3.5 3.8 3.8

Level of Significance is 5%

15. The following data refers to the Salary Packages (in Lakhs) offered 22/04/2021 45-47
to MBA graduates with different specializations and having studied
at Four different business schools.

Business Schools
Specialization One Two Three Four
6 4 8 6
Marketing 5 5 6 4
7 6 6 9
Finance 6 7 7 8
8 5 10 9
Operations 7 5 9 10

Test the hypothesis:


1. Whether the difference between the pay packages offered by
different business schools can be attributed to chance.

4
RM LAB FILE December 30, 1899

2. Average pay packages by all specializations are equal.

16. The following table presents the number of the defective pieces 22/04/2021 47-49
produced by three workmen operating in turn on three different
machines:
Machine 1 Machine 2 Machine 3
Workman 1 27 34 23
Workman 2 29 32 25
Workman 3 22 30 22

Conduct a Two Way ANOVA to test at 5% level of significance,


whether:-
1. The difference among the means obtained for the three
workmen can be attributed to chance.
2. The difference among the means obtained for the three
Machines can be attributed to chance.

17. Compare the cholesterol contents of the four competing diet foods 29/04/2021 50-51
on the basis of the following data (in milligrams per package) which
were obtained for three randomly taken 6-ounce packages of each of
the diet foods in three different laboratories.
Laboratory
Diet Food One Two Three
Diet Food A 3.6 4.1 4
Diet Food B 3.1 3.2 3.9
Diet Food C 3.2 3.5 3.5
Diet Food D 3.5 3.8 3.8
Perform a two way ANOVA using Level of Significance at 5%.

18. A random sample of 395 people is surveyed and each person was 29/04/2021 51-53
asked to report the highest education level they obtained. The data
resulted from the survey is summarized in the following table:-
High School Bachelors Masters Ph.D. Total
Female 60 54 46 41 201
Male 40 44 53 57 194
Total 100 98 99 98 395

Test whether Gender is independent of education level at 5% level

5
RM LAB FILE December 30, 1899

of significance.

19. The following table gives the number of good and defective parts 20/05/2021 53-55
produced by each of the three shifts in a factory.
Shifts Good Defective Total
Day 900 130 1030
Evening 700 170 870
Night 400 200 600
Total 2000 500 2500

Is there any association between the shifts and the quality of parts
produced? Use 5% level of significance.

The following table gives the number of good and defective parts
produced by each of the three shifts in a factory.
Shifts Good Defective Total
Day 900 130 1030
Evening 700 170 870
Night 400 200 600
Total 2000 500 2500

Is there any association between the shifts and the quality of parts
produced? Use 5% level of significance.

20. A cigarette company interested in the relation between sex of a 20/05/2021


person and the type of cigarettes smoked has collected the following
data from a random sample of 150 persons:-
Gender

Cigarette Male Female Total

A 25 30 55

B 40 15 55

C 30 10 40

Total 95 55 150

Test whether the type of cigarette smoked and the sex are

6
RM LAB FILE December 30, 1899

independent. Use 5% level of significance to test the hypothesis.

7
RM LAB FILE December 30, 1899

Institute of Information Technology and Management, New Delhi-58.

Exercise No-1 Date: 11/3/2021

Programme: BBA Semester: 4 Paper Code: BBA- 208 Academic Year: 2020-21

Course Objective:
1. To familiarize students with statistical tools and techniques to analyze data. (Compre-
hension based)
2. To familiarize students with the Advanced Excel/SPSS. (Comprehension based)
3. To enable students to apply different statistical techniques through MS Excel/ SPSS. (Ap-
plication based)
4. To enable students to make relevant interpretations from output sheets of software. (Ana-
lysis based)
5. To enable the students to prepare their projects in future through the application of statist-
ical technique using advanced softwares. (Synthesis based)
.

Course Outcomes:
Upon completion of this course, students will be able to:

CO1: Students will have better understanding and knowledge of importance of advanced
softwares like MS Excel/ SPSS. (Achieve course objective 1 and 2)
CO2: Students will have ability to test hypothesis by applying various statistical
techniques using softwares like MS Excel/SPSS. (Achieve course objective 1 and 3)
CO3: Students will have ability to make relevant and objective interpretations from
output sheets of softwares. (Achieve course objective 4)

Relevant
S.No. Questions Course
Outcomes

Prepare a Questionnaire to study the Student satisfaction.


1. Identify the factors behind student’s satisfaction. Prepare a Research Model for the same. CO1
( Construct and Attribute, Dependent and Independent Variable)
Collect the data from 30 students by preparing Google form and analyze it by calculating
2. CO1, CO3
Mean, Median and Mode.

8
RM LAB FILE December 30, 1899

Exploratory Study/ Research


1. Academic
a. Behavior of faculty
b. Knowledge and skill of faculty members
c. Teaching method
d. Use of ICT
2. Infrastructure
a. Canteen
b. Parking facility
c. Library
d. Computer labs
e. Auditorium
3. Activities
a. Availability of clubs
b. Sports facility
c. Seminar/Webinar/Guest Lectures
d. Party: Fresher’s/Farewell
4. Placement
a. No. of opportunities
b. On campus/Off campus
c. Packages offered by company
d. Job profiles offered by companies
5. Environment
a. Crowd
b. Cleanliness
c. Location of campus
d. Safe

9
RM LAB FILE December 30, 1899

Behavior of faculty

Knowledge & skill of


Faculty members
Academic
Teaching method

Use of ICT

Canteen

Parking Facility

Library Infrastructure
Computer Labs

Auditorium

Availability of Clubs
Student
Sports Facility

Activities
Satisfaction
Seminar/Webinar/Guest
Lecture

Party: Fresher/Farewell

No. of opportunities

On campus/off campus
Placement
Packages offered by
company

Job profile offered by


company

Crowd

Cleanliness Environment
Locality of campus
Independent Variable Dependable Variable
Safety
Proposed Research Model

10
RM LAB FILE December 30, 1899

Responses from student (scaling)

Name:

Class & Course:

Gender:

Contact No:

Email ID:

College Name:

1 2 3 4 5
Highly dissatisfied Dissatisfied Neutral Satisfied Very satisfied

S.NO Statements 1 2 3 4 5
1 How much you are satisfied with the behavior of faculty
2 How will you rate the faculty on the basis of their
knowledge and communication skills?
3 How much are satisfied with the teaching method of the
faculty
4 How much are you satisfied with the use of ICT by faculty
5 How much are you satisfied with the college canteen
6 How will you rate the parking facility of your college
7 How much are you satisfied with your college library
8 How will you rate the computer labs of your college
9 How much are you satisfied with your College auditorium
10 How much are you satisfied with the availability of clubs in
your college
11 How will you rate the sports facility of your college
12 How much are you satisfied with Seminar/Webinar/Guest
lecturer in your college
13 How will you rate the fresher’s/farewell party of your
college
14 How much are you satisfied on campus placement facility
in your college
15 How will you rate the package offered by the company
during placement in your college
16 How much are you satisfied with the crowd of your college
17 How will you rate the cleanliness of your college
18 How much are you satisfied with safety measures taken by
college in any situations

11
RM LAB FILE December 30, 1899

Institute of Information Technology and Management, New Delhi-58.

Exercise No-2 Date: 18/3/2021

Programme: BBA Semester: 4 Paper Code: BBA- 208 Academic Year: 2020-21

Course Objective:
6. To familiarize students with statistical tools and techniques to analyze data. (Compre-
hension based)
7. To familiarize students with the Advanced Excel/SPSS. (Comprehension based)
8. To enable students to apply different statistical techniques through MS Excel/ SPSS. (Ap-
plication based)
9. To enable students to make relevant interpretations from output sheets of software. (Ana-
lysis based).

Course Outcomes:
Upon completion of this course, students will be able to:

CO1: Students will have better understanding and knowledge of importance of advanced
softwares like MS Excel/ SPSS. (Achieve course objective 1 and 2)
CO2: Students will have ability to test hypothesis by applying various statistical
techniques using softwares like MS Excel/SPSS. (Achieve course objective 1 and 3)
CO3: Students will have ability to make relevant and objective interpretations from
output sheets of softwares. (Achieve course objective 4)

Relevant
S.No. Questions Course
Outcomes

Prepare a dataset of 50 points having 4 types of Cars (Company) and their respective
1. mileages. CO1
1. Calculate Average Mileage of 50 cars.
2. Calculate average mileages of every brand of car.
Prepare a data set of 30 points having 2 variables: Height and Age.
2. Calculate Median and Mode value for both the variables. CO1

12
RM LAB FILE December 30, 1899

Step 1: Create a table containing 50 data points (4 types of cars) –

1. Calculate Average Mileage of 50 cars.

Step 2: Create a separate table and apply ‘Average If’ formula on the parent table –

Firstly, select the ‘range’ (Range will specify the table from where the data will be collected)

Secondly, select the ‘criteria’ (Criteria will specify the respective value on the basis of which
the formula will work)

13
RM LAB FILE December 30, 1899

And Thirdly, select the ‘average range’ (Average range will specify the respective sub-range
which is directly related to the query, in this case ‘Mileage’)

2. Calculate average mileages of every brand of car.

Step 3: Apply the ‘Average If’ formula on the table –

Select the whole table (containing all the data points) and apply the formula

14
RM LAB FILE December 30, 1899

Q2) Prepare a data set of 30 points having 2 variables: Height and Age. Calculate
Median and Mode value for both the variables.
Answer:

Step 1: Create a table with respective variables (Height & Age) and data points –

Step 2: Calculate median for both the values –

i) Median Calculation
a) Median of Age

15
RM LAB FILE December 30, 1899

Use the formula “=MEDIAN” to calculate the median of the given table. Select the respective
cells, for which the median is to be calculated, i.e., all the cells of ‘Age’ will be included

Median of Age = 14.5

b) Median of Height

Use the formula “=MEDIAN” to calculate the median of the given table. Select the respective
cells, for which the median is to be calculated, i.e., all the cells of ‘Height’ will be included

Median of Age = 150

ii) Mode Calculation


a) Mode of Age

Use the formula “=MODE” to calculate the mode of the given table. Select the respective cells,
for which the mode is to be calculated, i.e., all the cells of ‘Age’ will be included

16
RM LAB FILE December 30, 1899

Mode of Age = 13 [This means the value ‘13’ is the most repetitive in the dataset of Age]

b) Mode of Height

Use the formula “=MODE” to calculate the mode of the given table. Select the respective cells,
for which the mode is to be calculated, i.e., all the cells of ‘Height’ will be included

Mode of Age = 150 [This means the value ‘150’ is the most repetitive in the dataset of Age]

17
RM LAB FILE December 30, 1899

Institute of Information Technology and Management, New Delhi-58.

Exercise No-3 Date: 18/3/2021

Programme: BBA Semester: 4 Paper Code: BBA- 208 Academic Year: 2020-21

Course Objective:
10. To familiarize students with statistical tools and techniques to analyze data. (Compre-
hension based)
11. To familiarize students with the Advanced Excel/SPSS. (Comprehension based)
12. To enable students to apply different statistical techniques through MS Excel/ SPSS. (Ap-
plication based)
13. To enable students to make relevant interpretations from output sheets of software. (Ana-
lysis based)
.
Course Outcomes:
Upon completion of this course, students will be able to:

CO1: Students will have better understanding and knowledge of importance of advanced
softwares like MS Excel/ SPSS. (Achieve course objective 1 and 2)
CO2: Students will have ability to test hypothesis by applying various statistical
techniques using softwares like MS Excel/SPSS. (Achieve course objective 1 and 3)
CO3: Students will have ability to make relevant and objective interpretations from
output sheets of softwares. (Achieve course objective 4)

Relevant
S.No. Questions Course
Outcomes

Create a dataset of 100 employees with their names, status ( Contract, Full Time, Hourly,
Half time), Service Years(1 year to 10 years) and Salary (Rs/Month)
5. Calculate no of employees in each category
1. 6. Calculate total salaries given to each category CO1
7. Calculate average salary for each category
8. Calculate anything with “COUNTIFS” function
a. Calculate no of employees status wise more than 5 years of experiences
b. Calculate no of employees category wise, more than 50,000 INR Salary P.M.

18
RM LAB FILE December 30, 1899

Answer:

Step 1: Create a dataset of 100 employees with the respective attributes –

Step 2: Calculate the following –

1. Calculate no of employees in each category

In order to calculate the number of employees in each category, we have to use the formula of
‘Count If’.
First, Apply the formula of ‘Count If’ (=COUNTIF).
Second, select the respective ‘Range’; here it is ‘Employee Status’.
Third, select the ‘Criteria (the value to be found from the range selected).

i) Employees working on ‘Contract’ basis –

19
RM LAB FILE December 30, 1899

Contract Employees = 23

ii) Employees working on ‘Full Time’ basis –

Full Time Employees = 30

iii) Employees working on ‘Half Time’ basis –

20
RM LAB FILE December 30, 1899

iv) Employees working on ‘Hourly’ basis –

2. Calculate total salaries given to each category

In order to calculate the total number of salary given to each category, we have to use the
formula of ‘Sum If’.
First, Apply the formula of ‘Sum If’ (=SUMIF).
Second, select the respective ‘Range’; here it is ‘Employee Status’ as we are calculating the
amount of salary given to each ‘category’
Third, select the ‘Criteria (the value to be found from the range selected).
Fourth, select the ‘Sum range’ (Range which will include the second variable, i.e. ‘Salary’)

i) Total salary given to ‘Contact Employees’ –

21
RM LAB FILE December 30, 1899

ii) Total amount of salary given to ‘Full Time Employees’-

iii) Total amount to salary given to ‘Half Time Employees’ –

22
RM LAB FILE December 30, 1899

iv) Total amount of salary given to ‘Hourly Employees’ –

3. Calculate average salary for each category

In order to calculate the average salary for each category, we have to use the formula of
‘Average If’.

First, Apply the formula of ‘Average If’ (=AVERAGEIF).


Second, select the respective ‘Range’; here it is ‘Employee Status’ as we are calculating the
average amount of salary given to each ‘category’
Third, select the ‘Criteria (the value to be found from the range selected).
Fourth, select the ‘Average range’ (Range which will include the second variable, i.e. ‘Salary’)

i) Average salary given to ‘Contact Employees’ –

23
RM LAB FILE December 30, 1899

ii) Average salary given to ‘Full Time Employees’ –

iii) Average salary given to ‘Half Time Employees’ –

24
RM LAB FILE December 30, 1899

iv) Average salary given to ‘Hourly Employees’ –

4. Calculate anything with “COUNTIFS” function

First, select a ‘Criteria Range’ which will include the main variables with respect to the query,
here it is ‘Employee Status’.
Second, select the ‘Criteria (the value to be found from the range selected).
Third, select the Second Range’ which will include the secondary variable with respect to the
query; here it is ‘Years of Experience’.
Fourth, specify the respective criteria, in this case, ‘more than 5 years’ (>5).

i) Number of ‘Contract employees’ having more than 5 years of experience –

25
RM LAB FILE December 30, 1899

ii) Number of ‘Full Time employees’ having more the 5 years of experience –

iii) Number of ‘Half Time employees’ having more than 5 years of experience –

iv) Number of ‘Hourly employees’ having more than 5 years of experience –

26
RM LAB FILE December 30, 1899

Institute of Information Technology and Management, New Delhi-58.

Exercise No-4 Date: 30/3/2021

Programme: BBA Semester: 4 Paper Code: BBA- 208 Academic Year: 2019-20

Course Objective:
14. To familiarize students with statistical tools and techniques to analyze data. (Compre-
hension based)
15. To familiarize students with the Advanced Excel/SPSS. (Comprehension based)
16. To enable students to apply different statistical techniques through MS Excel/ SPSS. (Ap-
plication based)
17. To enable students to make relevant interpretations from output sheets of software. (Ana-
lysis based)
18. To enable the students to prepare their projects in future through the application of statist-
ical technique using advanced softwares. (Synthesis based)
.

Course Outcomes:
Upon completion of this course, students will be able to:

CO1: Students will have better understanding and knowledge of importance of advanced
softwares like MS Excel/ SPSS. (Achieve course objective 1 and 2)
CO2: Students will have ability to test hypothesis by applying various statistical
techniques using softwares like MS Excel/SPSS. (Achieve course objective 1 and 3)
CO3: Students will have ability to make relevant and objective interpretations from
output sheets of softwares. (Achieve course objective 4)

Relevant
S.No. Questions Course
Outcomes

Prepare a sales record of 40 data points (Names*Region) with the following details:
Name (Only 10 Names), Product (Soap, Detergent, Shampoo, Conditioner), Region (East, CO1
West, North, South), Units (eg. 100,120,300 etc…) and Sales (Amount: 20,000, 45000
1. etc…) &
Prepare a PIVOT Table with the following requirements CO4
a. Overall Units sold by sales person (Region wise)
b. Overall Sales of Product (Region wise)

27
RM LAB FILE December 30, 1899

Answer:
Step 1: Create a table with 40 data points with respective attributes –

Step 2: Create ‘Pivot Table’

First, Go to ‘Insert’ and select ‘Pivot Table’-

28
RM LAB FILE December 30, 1899

Second, Click on the table to specify the worksheet and click on ‘Existing Worksheet’ –

Third, Select the appropriate variables and assign them to row and columns –

i) Create a Pivot table and show the overall Units sold by sales person (Region
wise) –

Assign the row values and column values to the field list:-

29
RM LAB FILE December 30, 1899

30
RM LAB FILE December 30, 1899

ii) Overall Sales of Product (Region wise)

Assign the row values and column values to the field list:-

31
RM LAB FILE December 30, 1899

Institute of Information Technology and Management, New Delhi-58.

Exercise No-5 Date: 1/4/2021

Programme: BBA Semester: 4 Paper Code: BBA- 208 Academic Year: 2020-21

Course Objective:
1. To familiarize students with statistical tools and techniques to analyze data. (Compre-
hension based)
2. To familiarize students with the Advanced Excel/SPSS. (Comprehension based)
3. To enable students to apply different statistical techniques through MS Excel/ SPSS. (Ap-
plication based)
4. To enable students to make relevant interpretations from output sheets of software. (Ana-
lysis based)
5. To enable the students to prepare their projects in future through the application of statist-
ical technique using advanced softwares. (Synthesis based)
.

Course Outcomes:
Upon completion of this course, students will be able to:

CO1: Students will have better understanding and knowledge of importance of advanced
softwares like MS Excel/ SPSS. (Achieve course objective 1 and 2)
CO2: Students will have ability to test hypothesis by applying various statistical
techniques using softwares like MS Excel/SPSS. (Achieve course objective 1 and 3)
CO3: Students will have ability to make relevant and objective interpretations from
output sheets of softwares. (Achieve course objective 4)
CO4: Students will have ability to design and execute better research projects with the
help of softwares. (Achieve course objective 5)

Questions Relevant
S.No
Course

32
RM LAB FILE December 30, 1899

. Outcomes

What do you meant by Correlation? Explain the interpretation of CO1

1. coefficient of correlation. &

CO2, CO4

Prepare a data set of NSE: Nifty, FMCG Sector, Infrastructure Sector, CO1

2. Banking Index and Steel Index for 1 Year (1st April to 31st March 2020. &

CO2, CO4

Prepare Line Graph for every Index on the same Graph. CO1

3. Calculate Correlation between Banking and Steel Index &


Calculate Correlation between Nifty and FMCG Sector Index
CO2, CO4
Calculate Correlation between Nifty and Infrastructure Index

33
RM LAB FILE December 30, 1899

Q1)
Answer:

Correlation, in the finance and investment industries, is a statistic that measures the degree to
which two securities move in relation to each other. Correlations are used in advanced portfolio
management, computed as the correlation coefficient, which has a value that must fall between -
1.0 and +1.0.

Interpretation of coefficient of correlation:-

Pearson’s correlation coefficient is represented by the Greek letter rho (ρ) for the population
parameter and r for a sample statistic. This correlation coefficient is a single number that
measures both the strength and direction of the linear relationship between two continuous
variables. Values can range from -1 to +1.

 Exactly –1. A perfect downhill (negative) linear relationship

 –0.70. A strong downhill (negative) linear relationship

 –0.50. A moderate downhill (negative) relationship

 –0.30. A weak downhill (negative) linear relationship

 +0.30. A weak uphill (positive) linear relationship

 +0.50. A moderate uphill (positive) relationship

 +0.70. A strong uphill (positive) linear relationship

 Exactly +1. A perfect uphill (positive) linear relationship

Q2) Prepare a data set of NSE: Nifty, FMCG Sector, Infrastructure Sector, Banking Index and
Steel Index for 1 Year (1st April to 31st March 2020)

Answer:

Step 1: Download the data set from ‘nseindia.com’ –

34
RM LAB FILE December 30, 1899

Step 2: Sort and Arrange the data and create a table –

i) NIFTY Banking

ii) NIFTY Metal

35
RM LAB FILE December 30, 1899

iii) NIFTY

iv) FMCG Sector

36
RM LAB FILE December 30, 1899

v) Infrastructure Sector

Q3)

Answer:

a. Correlation between Infrastructure and Nifty –

To find the correlation between two variables, use the formula of ‘CORREL’ (correlation)

Step 1: Apply the formula of correlation (=CORREL) –

37
RM LAB FILE December 30, 1899

Step 2: Select the respective arrays (closing of both the indices)

Line Graph with two variable (Infrastructure and Nifty) –

b. Correlation between Banking and Steel Index –

Step 1: Apply the formula of correlation (=CORREL) –

38
RM LAB FILE December 30, 1899

Step 2: Select the respective arrays (closing of both the indices) –

Line Graph with two variable (Banking and Steel) –

c. Calculate Correlation between Nifty and FMCG Sector Index

Step 1: Apply the formula of correlation (=CORREL) –

39
RM LAB FILE December 30, 1899

Step 2: Select the respective arrays (closing of both the indices)

Line Graph with two variable (Nifty and FMCG) –

40
RM LAB FILE December 30, 1899

Institute of Information Technology and Management, New Delhi-58.

Exercise No- 6 Date:

Programme: BBA Semester: 4 Paper Code: BBA- 208 Academic Year: 2019-20

Course Objective:
1. To familiarize students with statistical tools and techniques to analyze data. (Compre-
hension based)
2. To familiarize students with the Advanced Excel/SPSS. (Comprehension based)
3. To enable students to apply different statistical techniques through MS Excel/ SPSS. (Ap-
plication based)
4. To enable students to make relevant interpretations from output sheets of software. (Ana-
lysis based)
5. To enable the students to prepare their projects in future through the application of statist-
ical technique using advanced softwares. (Synthesis based)
.

Course Outcomes:
Upon completion of this course, students will be able to:

CO1: Students will have better understanding and knowledge of importance of advanced
softwares like MS Excel/ SPSS. (Achieve course objective 1 and 2)
CO2: Students will have ability to test hypothesis by applying various statistical
techniques using softwares like MS Excel/SPSS. (Achieve course objective 1 and 3)
CO3: Students will have ability to make relevant and objective interpretations from
output sheets of softwares. (Achieve course objective 4)
CO4: Students will have ability to design and execute better research projects with the
help of softwares. (Achieve course objective 5)

41
RM LAB FILE December 30, 1899

Relevant
S.No. Questions Course
Outcomes

There were two types of Drugs (1 and 2) that were tried on some patients for reducing
weight. There were 8 adults who were subjected to Drug 1 and 7 adults who were
CO3
administered Drug 2. The decrease in weight (in KGs) is given below:-
1. Drug 1 10 8 12 14 7 15 13 11 &
Drug 2 12 10 7 6 12 11 12 CO4
Do the Drugs differ significantly in their effect on decreasing weights? Use 5% level of
significance to check the variance.
A company selects 8 salesmen random and their sales figures for the previous month are
recorded. Then they undergo a training course devised by a business consultant, and their CO3
sales figures for the following month are compared as shown in the table. Has the training
2. program caused in improvement in the salesmen’s ability? Use 5% level of significance. &
Prev Month 75 90 94 95 100 90 70 64 CO4
Foll. Month 77 101 93 92 105 88 76 68

42
RM LAB FILE December 30, 1899

Question - 1)

Step 1:-Create Hypothesis.

Null
There is no significance difference in Drugs tried on patient for reducing
Hypothesis H0 : D1 = D2 weight.
Alternate
There is significance difference in Drugs tried on patient for reducing
Hypothesis H1 : D1  D2 weight.

Step 2:- Identify the level of Significance & Reject Criteria:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

Step 3:- Apply Data Analysis Tools:

t-Test: Two-Sample Assuming Unequal Variances


Drug 1 Drug 2
Mean 11.25 10
7.92857142 6.33333333
Variance 9 3
Observations 8 7
Hypothesized Mean Difference 0
df 13
0.90784129
t Stat 9
P(T<=t) one-tail 0.19024175
1.77093338
t Critical one-tail 3
P(T<=t) two-tail 0.3804835
2.16036865
t Critical two-tail 2
2.81577190 2.51661147
Standard Deviation 6 8
0.99552570 0.95118973
Standard Error 5 1
Std. Error = Std dev/ sqrt(obs)

Step 4:- Compare p Value with Significance Value:

Result: p value > Sig value

43
RM LAB FILE December 30, 1899

0.38 > 0.05 Drugs do not differ significantly in their effect on decreasing weights

44
RM LAB FILE December 30, 1899

Step 5:- Interpretation:

Therefore, Accept Null Hypothesis


Therefore, it can be concluded that there is no significance difference in Drugs tried on patient for
reducing weight.

Question - 2)

Step 1:-Create Hypothesis.

Null H0 : S1 = S2 There is no significance difference between the sales figure of


Hypothesis Previous Month and Following Month.
Alternate H1: S1 S2 There is significance difference between the sales figure of Previous
Hypothesis Month and Following Month.

Step 2:- Identify the level of Significance & Rejection criteria:

Rejection : If the p value is less than sig. value than reject te null hypothesis
Sig Value 0.05

Step 3:- Apply Data Analysis Tools:

t-Test: Paired Two Sample for Means


Previous Following
Month Month
Mean 84.75 87.5
Variance 174.5 166
Observations 8 8
Pearson Correlation 0.935051198
Hypothesized Mean Diff. 0
df 7
t Stat -1.650294722
P(T<=t) one-tail 0.071434088
t Critical one-tail 1.894578604
P(T<=t) two-tail 0.142868177
t Critical two-tail 2.364624251

45
RM LAB FILE December 30, 1899

Step 4:- Compare p Value with Significance Value:

Result: p value > Sig value


0.14 > 0.05 The training program didn't improved the salesmen’s ability

Step 5:- Interpretation:

Therefore, Accept Null Hypothesis


Therefore, it can be concluded that there is no significance difference between the sales figure of
Previous Month and Following Month.

46
RM LAB FILE December 30, 1899

Institute of Information Technology and Management, New Delhi-58.

Exercise No- 7 Date:

Programme: BBA Semester: 4 Paper Code: BBA- 208 Academic Year: 2019-20

Course Objective:
1. To familiarize students with statistical tools and techniques to analyze data. (Compre-
hension based)
2. To familiarize students with the Advanced Excel/SPSS. (Comprehension based)
3. To enable students to apply different statistical techniques through MS Excel/ SPSS. (Ap-
plication based)
4. To enable students to make relevant interpretations from output sheets of software. (Ana-
lysis based)
5. To enable the students to prepare their projects in future through the application of statist-
ical technique using advanced softwares. (Synthesis based)
.

Course Outcomes:
Upon completion of this course, students will be able to:

CO1: Students will have better understanding and knowledge of importance of advanced
softwares like MS Excel/ SPSS. (Achieve course objective 1 and 2)
CO2: Students will have ability to test hypothesis by applying various statistical
techniques using softwares like MS Excel/SPSS. (Achieve course objective 1 and 3)
CO3: Students will have ability to make relevant and objective interpretations from
output sheets of softwares. (Achieve course objective 4)
CO4: Students will have ability to design and execute better research projects with the
help of softwares. (Achieve course objective 5)

47
RM LAB FILE December 30, 1899

Relevant
S.No. Questions Course
Outcomes

DJB wants to examine the water consumption between 3 different sizes of apartments. It
collects a sample of 5 for each of the apartments. Using the hypothetical data provided
below, test whether the mean water consumption is equal for each type of apartment. Use
CO3
significance value at 5%.
1. Apartments LIH MIG HIG &
Water Consumption 643 469 484
655 427 456 CO4
702 525 402
620 550 490
710 539 438
The following are the number of words per minute which a secretary typed on several
occasions on three different typewriters.
Test whether the difference among the mean of the three samples can be attributed to
chance. Use 5% level of significance.
Typewriter 1 71 78 70 69 77 72 65 69
2.
Typewriter 2 74 76 72 70 69 68 72 73

Typewriter 3 70 72 66 64 63 67 69 70

Compare the cholesterol contents of the four competing diet foods on the basis of the
following data (in milligrams per package) which were obtained for three randomly taken 6-
ounce packages of each of the diet foods:
Diet Food A 3.6 4.1 4.0

3. Diet Food B 3.1 3.2 3.9

Diet Food C 3.2 3.5 3.5

Diet Food D 3.5 3.8 3.8

Level of Significance is 5%

48
RM LAB FILE December 30, 1899

Question - 1)
Step 1:-Create Hypothesis:

Null Average water consumption for all apartments is statistically equal.


Hypothesis H0 (µ =µ µ )
1 2= 3

At least one mean is not statistically equal.


3 Cases:
Case 1:-µ ≠µ 1 2

Case 2:-µ ≠µ 2 3
Alternate
Case 3:-µ ≠µ 1 3
Hypothesis H1

Step 2:- Identify the level of Significance & Rejection Criteria:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

Step 3:- Apply Data Analysis Tools.

Anova: Single Factor


SUMMARY
Groups Count Sum Average Variance
LIH 5 3330 666 1499.5
MIG 5 2510 502 2729
HIG 5 2270 454 1290

ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 123573 2 61786.67 33.58884 0.00001 3.88529
1
Within Groups 22074 2 1839.5

1
Total 145647 4

49
RM LAB FILE December 30, 1899

Step 4:- Compare p Value with Significance Value.

Result: p value > Sig value


0.00001< 0.05 If p Value is less than the significance value i.e.0.05, Reject Null Hypothesis

Step 5:- Interpretation.

p Value : 0.00001
Sig Value @ 5% which is 0.05
Here, computed p Value (0.00001) is less than the Sig Value @ 5% level of significance (0.05).

Therefore, Reject Null Hypothesis


Therefore, it can be concluded that mean water consumption is not statistically equal for the three
different types of apartments.

Question - 2)

Step 1:-Create Hypothesis.

Average number of words per minute which a secretary typed on


Null several occasions on three different typewriters is statistically equal.
H0
Hypothesis (µ =µ µ )
1 2= 3

At least one mean are different.


3 Cases:
Case 1:-µ ≠µ1 2

Case 2:-µ ≠µ2 3


Alternate H1
Case 3:-µ ≠µ1 3
Hypothesis

Step 2:- Identify the level of Significance:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

50
RM LAB FILE December 30, 1899

Step 3:- Apply Data Analysis Tools:

Anova: Single Factor


SUMMARY
Groups Count Sum Average Variance
Typewriter 1 8 571 71.375 18.55357
Typewriter 2 8 574 71.75 7.071429
Typewriter 3 8 541 67.625 9.982143

ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 83.25 2 41.625 3.50702 0.04851 3.4668
2
Within Groups 249.25 1 11.8691

2
Total 332.5 3

Step 4:- Compare p Value with Significance Value:

Result: p value > Sig value


0.048513< 0.05 If p Value is less than the significance value i.e.0.05, Reject Null Hypothesis

Step 5:- Interpretation.

p Value : 0.048513
Sig Value @ 5% which is 0.05
Here, computed p Value (0.048513) is less than the Sig Value @ 5% level of significance (0.05)

Therefore, Reject Null Hypothesis


Therefore, it can be concluded that the number of words per minute which a secretary typed on
several occasions on three different typewriters is statistically not equal.

51
RM LAB FILE December 30, 1899

Question - 3)
Step 1:-Create Hypothesis.

An average cholesterol content of the four diet food is statistically


Null equal.
H0
Hypothesis (µ =µ µ µ )
1 2= 3= 4

At least one mean are different.


Cases:
Case 1:-µ ≠µ 1 2

Case 2:-µ ≠µ 1 3

Case 3:-µ ≠µ 1 4

Case 4:-µ ≠µ 2 3

Case 5:-µ ≠µ 2 4

Alternate Case 6:-µ ≠µ 3 2


H1
Hypothesis Case 7:-µ ≠µ 3 4

Step 2:- Identify the level of Significance & Rejection Criteria:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

Step 3:- Apply Data Analysis Tools.

Anova: Single Factor


SUMMARY
Groups Count Sum Average Variance
Diet Food A 3 11.7 3.9 0.07
Diet Food B 3 10.2 3.4 0.19
Diet Food C 3 10.2 3.4 0.03
Diet Food D 3 11.1 3.7 0.03

52
RM LAB FILE December 30, 1899

ANOVA
Source of Variation SS df MS F P-value F crit
Between Groups 0.54 3 0.18 2.25 0.15977 4.06618
Within Groups 0.64 8 0.08

1
Total 1.18 1

Step 4:- Compare p Value with Significance Value.

Result: p value > Sig value


0.159767< 0.05 If p Value is less than the significance value i.e.0.05, Reject Null Hypothesis

Step 5:- Interpretation.

p Value : 0.159767
Sig Value @ 5% which is 0.05
Here, computed p Value (0.159767) is greater than the Sig Value @ 5% level of significance
(0.05)

Therefore, Accept Null Hypothesis


Therefore, it can be concluded that the average cholesterol content of the four diet food is statistically
equal.

53
RM LAB FILE December 30, 1899

Institute of Information Technology and Management, New Delhi-58.

Exercise No- 8 Date: 18/5/2021

Programme: BBA Semester: 4 Paper Code: BBA- 208 Academic Year: 2020-21

Course Objective:
19. To familiarize students with statistical tools and techniques to analyze data. (Compre-
hension based)
20. To familiarize students with the Advanced Excel/SPSS. (Comprehension based)s
21. To enable students to apply different statistical techniques through MS Excel/ SPSS. (Ap-
plication based)
22. To enable students to make relevant interpretations from output sheets of software. (Ana-
lysis based)
23. To enable the students to prepare their projects in future through the application of statist-
ical technique using advanced softwares. (Synthesis based)
.

Course Outcomes:
Upon completion of this course, students will be able to:

CO1: Students will have better understanding and knowledge of importance of advanced
softwares like MS Excel/ SPSS. (Achieve course objective 1 and 2)
CO2: Students will have ability to test hypothesis by applying various statistical
techniques using softwares like MS Excel/SPSS. (Achieve course objective 1 and 3)
CO3: Students will have ability to make relevant and objective interpretations from
output sheets of softwares. (Achieve course objective 4)
CO4: Students will have ability to design and execute better research projects with the
help of softwares. (Achieve course objective 5)

54
RM LAB FILE December 30, 1899

Relevant
S.No. Questions Course
Outcomes

The following data refers to the Salary Packeages (in Lakhs) offered to MBA graduates
with different specializations and having studied at Four different business schools
Business Schools

Specialization One Two Three Four

6 4 8 6

Marketing 5 5 6 4
CO3
7 6 6 9
1. &
Finance 6 7 7 8
CO4
8 5 10 9

Operations 7 5 9 10

Test the hypothesis:


1. Whether the difference between the pay packages offered by different business
schools can be attributed to chance
2. Average pay packages by all specializations are equal
The following table presents the number of the defective pieces produced by three
workmen operating in turn on three different machines:
Machine 1 Machine 2 Machine 3

Workman 1 27 34 23

Workman 2 29 32 25
2.
Workman 3 22 30 22

Conduct a Two Way ANOVA to test at 5% level of significance, whether:-


1. The difference among the means obtained for the three workmen can be
attributed to chance
2. The difference among the means obtained for the three Machines can be
attributed to chance

3.
Compare the cholesterol contents of the four competing diet foods on the basis of the

55
RM LAB FILE December 30, 1899

following data (in milligrams per package) which were obtained for three randomly taken
6-ounce packages of each of the diet foods in three different laboratory.
Laboratory

Diet Food One Two Three

Diet Food A 3.6 4.1 4.0

Diet Food B 3.1 3.2 3.9

Diet Food C 3.2 3.5 3.5

Diet Food D 3.5 3.8 3.8

Perform a two way ANOVA using Level of Significance at 5%

56
RM LAB FILE December 30, 1899

Question - 1)

Step 1:-Create Hypothesis:

Null Average pay package for all institute are same.


(µ1=µ2=µ3= µ4)
Hypothesis H01
V1 Average pay package for all institute are not same.
Alternate
Business Schools Cases:
Hypothesis H11
(µ1≠µ2), (µ1≠µ3), (µ2≠µ4), (µ2≠µ1), etc.

Null Average pay package for all specialization are equal.


(µ1=µ2=µ3)
Hypothesis H02

V2 Average pay package for all specialization are not equal.


Alternate Specialization Cases:
(µ1≠µ2), (µ1≠µ3), (µ2≠µ2), (µ3≠µ1), etc.
Hypothesis H12

Step 2:- Identify the level of Significance & Rejection Criteria:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

Step 3:- Apply Data Analysis Tools:

Anova: Two-Factor With Replication


SUMMARY One Two Three Four Total
Marketing
Count 2 2 2 2 8
Sum 11 9 14 10 44
Average 5.5 4.5 7 5 5.5
Variance 0.5 0.5 2 2 1.714286

Finance
Count 2 2 2 2 8
Sum 13 13 13 17 56
Average 6.5 6.5 6.5 8.5 7
Variance 0.5 0.5 0.5 0.5 1.142857

Operations
Count 2 2 2 2 8

57
RM LAB FILE December 30, 1899

Sum 15 10 19 19 63
Average 7.5 5 9.5 9.5 7.875
Variance 0.5 0 0.5 0.5 4.125

Total
Count 6 6 6 6
Sum 39 32 46 46
Average 6.5 5.333333 7.666667 7.666667
Variance 1.1 1.066667 2.666667 5.066667

ANOVA
Source of Variation SS df MS F P-value F crit
Specialization 23.08333333 2 11.54167 16.29412 0.00038 3.885294
Business School 22.45833333 3 7.486111 10.56863 0.001098 3.490295
Interaction 17.91666667 6 2.986111 4.215686 0.016347 2.99612
1
Within 8.5 2 0.708333

2
Total 71.95833333 3

Step 4:- Compare p Value with Significance Value:

1) Hypothesis – 1 & Interpretation

Hypothesis 1
Criteria: If p value is less than the Sig value then reject the null hypothesis
0.000379 < 0.05
Therefore, Rejecting the null hypothesis - It can be concluded that the Average pay package
offered by different business schools are statistically not equal

2) Hypothesis – 2 & Interpretation

Hypothesis 2
Criteria: If p value is less than the Sig value then reject the null hypothesis
0.00109 < 0.05
Therefore, Rejecting the null hypothesis - It can be concluded that the average pay package
offered for different specialization are different

58
RM LAB FILE December 30, 1899

Question - 2)

Step 1:-Create Hypothesis.

Null Average numbers of defective piece produced by three


workmen are same.
Hypothesis H01
(µ1=µ2=µ3)
Alternate V1 At least two means are different.
Workman Cases:
Hypothesis H11
(µ1≠µ2), (µ1≠µ3), (µ2≠µ2), (µ3≠µ1), etc.

Null Average numbers of defective piece produced by three


machines are same.
Hypothesis H02 V2 (µ1=µ2=µ3)
Alternate Machine At least two means are different.
Cases:
Hypothesis H12
(µ1≠µ2), (µ1≠µ3), (µ2≠µ2), (µ3≠µ1), etc.

Step 2:- Identify the level of Significance & Rejection criteria:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

Step 3:- Apply Data Analysis Tools:

Anova: Two-Factor Without Replication


SUMMARY Count Sum Average Variance
Workman 1 3 84 28 31
Workman 2 3 86 28.66666667 12.33333333
Workman 3 3 74 24.66666667 21.33333333

Machine 1 3 78 26 13
Machine 2 3 96 32 4
Machine 3 3 70 23.33333333 2.333333333

59
RM LAB FILE December 30, 1899

ANOVA
Source of Variation SS df MS F P-value F crit
Worker 27.55556 2 13.77777778 4.96 0.08257366 6.944272
Machine 118.2222 2 59.11111111 21.28 0.00738064 6.944272
Error 11.11111 4 2.777777778

Total 156.8889 8

Step 4:- Compare p Value with Significance Value & Interpret the solution:

1) Hypothesis 1 & Interpretation

Hypothesis 1
Criteria: If p value is less than the Sig value then reject the null hypothesis
0.08257 > 0.05
Therefore, Accepting the null hypothesis - It can be concluded that the Average number of
defective products produced by three workers are the same

2) Hypothesis 2 & Interpretation

Hypothesis 2
Criteria: If p value is less than the Sig value then reject the null hypothesis
0.00738 < 0.05
Therefore, Rejecting the null hypothesis - It can be concluded that the average defective products produced
by three machines are statistically not equal

60
RM LAB FILE December 30, 1899

Question - 3)

Laboratory
Diet Food One Two Three
Diet Food A 3.6 4.1 4
Diet Food B 3.1 3.2 3.9
Diet Food C 3.2 3.5 3.5
Diet Food D 3.5 3.8 3.8

Step 1:-Create Hypothesis.

Null
Average cholesterol contents of the four diet foods are same
Hypothesis H01
(µ1=µ2=µ3=µ4)

V1 At least two means are not same.


Alternate Diet Food Cases:
(µ1≠µ2), (µ1≠µ3), (µ2≠µ4), (µ2≠µ1), etc.
Hypothesis H11

Null
Average cholesterol content in the three labs is same.
Hypothesis H02
(µ1=µ2=µ3)
V2 At least two means are not same.
Alternate Laboratory Cases:
(µ1≠µ2), (µ1≠µ3), (µ2≠µ2), (µ3≠µ1), etc.
Hypothesis H12

Step 2:- Identify the level of Significance & Rejection criteria:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

61
RM LAB FILE December 30, 1899

Step 3:- Apply Data Analysis Tools:

Anova: Two-Factor Without Replication


SUMMARY Count Sum Average Variance
Diet Food A 3 11.7 3.9 0.07
Diet Food B 3 10.2 3.4 0.19
Diet Food C 3 10.2 3.4 0.03
Diet Food D 3 11.1 3.7 0.03

One 4 13.4 3.35 0.056667


Two 4 14.6 3.65 0.15
Three 4 15.2 3.8 0.046667

ANOVA
Source of Variation SS df MS F P-value F crit
Diet 0.54 3 0.18 4.909091 0.04692 4.757063
Laboratory 0.42 2 0.21 5.727273 0.040619 5.143253
Error 0.22 6 0.036667

Total 1.18 11

Step 4:- Compare p Value with Significance Value

Hypothesis – 1 & Interpretation

Hypothesis 1
Criteria: If p value is less than the Sig value then reject the null hypothesis
0.04692 < 0.05
Therefore, Rejecting the null hypothesis - It can be concluded that the average cholesterol
content of four different diets are statistically not equal

Hypothesis – 2 & Interpretation

Hypothesis 2
Criteria: If p value is less than the Sig value then reject the null hypothesis
0.04092 < 0.05
Therefore, Rejecting the null hypothesis - It can be concluded that the average cholesterol
content of three different laboratory tests are statistically not equal

62
RM LAB FILE December 30, 1899

Institute of Information Technology and Management, New Delhi-58.

Exercise No- 9 Date: 31/5/2021

Programme: BBA Semester: 4 Paper Code: BBA- 208 Academic Year: 2020-21

Course Objective:
Relevant
S.No. Questions Course
Outcomes

A random sample of 395 people is surveyed and each person was asked to report the
highest education level they obtained. The data resulted from the survey is summarized in
the following table:-
High School Bachelors Masters Ph.D. Total CO3

1. Female 60 54 46 41 201 &


Male 40 44 53 57 194 CO4
Total 100 98 99 98 395

Test whether Gender is independent of education level at 5% level of significance.


The following table gives the number of good and defective parts produced by each of the
three shifts in a factory
Shifts Good Defective Total

Day 900 130 1030

2. Evening 700 170 870

Night 400 200 600

Total 2000 500 2500

Is there any association between the shifts and the quality of parts produced? Use 5%
level of significance.
3. A cigarette company interested in the relation between sex of a person and the type of
cigarettes smoked has collected the following data from a random sample of 150 persons:-
Gender
Cigarette
Male Female Total

63
RM LAB FILE December 30, 1899

A 25 30 55

B 40 15 55

C 30 10 40

Total 95 55 150

Test whether the type of cigarette smoked and the sex are independent. Use 5% level of
significance to test the hypothesis.

1. To familiarize students with statistical tools and techniques to analyze data. (Compre-
hension based)
2. To familiarize students with the Advanced Excel/SPSS. (Comprehension based)s
3. To enable students to apply different statistical techniques through MS Excel/ SPSS. (Ap-
plication based)
4. To enable students to make relevant interpretations from output sheets of software. (Ana-
lysis based)
5. To enable the students to prepare their projects in future through the application of statist-
ical technique using advanced softwares. (Synthesis based)
.

Course Outcomes:
Upon completion of this course, students will be able to:

CO1: Students will have better understanding and knowledge of importance of advanced
softwares like MS Excel/ SPSS. (Achieve course objective 1 and 2)
CO2: Students will have ability to test hypothesis by applying various statistical
techniques using softwares like MS Excel/SPSS. (Achieve course objective 1 and 3)
CO3: Students will have ability to make relevant and objective interpretations from
output sheets of softwares. (Achieve course objective 4)
CO4: Students will have ability to design and execute better research projects with the
help of softwares. (Achieve course objective 5)

64
RM LAB FILE December 30, 1899

Question - 1)

High School Bachelors Masters Ph.D. Total


Female 60 54 46 41 201
Male 40 44 53 57 194
Total 100 98 99 98 395

Step 1:-Create Hypothesis.

Null
There are no relationships between the Gender and Level of
Hypothesis H0
Education.
Alternate There are relationships between the Gender and Level of
Hypothesis H1 Education.

Step 2:- Identify the level of Significance & Rejection criteria:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

Step 3:- Calculate Expected Observation:

Expected Frequency
High School Bachelors Masters Ph.D. Total
Female 50.89 49.87 50.38 49.87 201
Male 49.11 48.13 48.62 48.13 194
Total 100 98 99 98 395

Step 4:- Calculate Chi-Square Value by applying formula:

Observed Frequency Expected Frequency


(Oi) (Ei) (Oi-Ei) (Oi-Ei)^2 (Oi-Ei)^2/E
1,1 60 50.89 9.11 82.99 1.63
1,2 54 49.87 4.13 17.06 0.34
1,3 46 50.38 -4.38 19.18 0.38
1,4 41 49.87 -8.87 78.68 1.58
2,1 40 49.11 -9.11 82.99 1.69
2,2 44 48.13 -4.13 17.06 0.35
2,3 53 48.62 4.38 19.18 0.39
2,4 57 48.13 8.87 78.68 1.63
Calculated Chi-Square Value 8

65
RM LAB FILE December 30, 1899

Step 5:-Compare Calculated Chi-Square Value with Critical Value:

Calculated Chi-Square Value: 8.00

Critical Value at 5%

Level of Significance, DOF= 3

Chi-Square Critical Value: 7.81

Rule:- If Calculated Chi-Square Value is greater than the Chi-Square Critical Value, Reject
Null Hypothesis.

Step 6:- Interpretation.

Calculated Chi-Square Value: 8.00


Chi-Square Critical Value: 7.81
Here, calculated Chi-Square Value (8.00) > Critical Value at 5% Level of Significance (7.81).

Therefore, Reject Null Hypothesis


Therefore, it can be concluded that Gender is dependent of education level at 5% level of
significance.

Question - 2)

Shifts Good Defective Total


Day 900 130 1030
Evening 700 170 870
Night 400 200 600
Total 2000 500 2500

Step 1:-Create Hypothesis.

Null There are no relationships between the shifts and the quality of parts
Hypothesis H0 produced.
Alternate There are relationships between the shifts and the quality of parts
Hypothesis H1 produced.

66
RM LAB FILE December 30, 1899

Step 2:- Identify the level of Significance & Rejection criteria:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

Step 3:- Calculate Expected Frequency:

Expected Frequency
Shifts Good Defective Total
Day 824 206 1030
Evening 696 174 870
Night 480 120 600
Total 2000 500 2500

Step 4:- Calculate Chi-Square Value by applying formula:

Observed Frequency Expected Frequency


(Oi) (Ei) (Oi-Ei) (Oi-Ei)^2 (Oi-Ei)^2/E
1,1 900 824 76 5776 7.01
1,2 130 206 -76 5776 28.04
2,1 700 696 4 16 0.02
2,2 170 174 -4 16 0.09
3,1 400 480 -80 6400 13.33
3,2 200 120 80 6400 53.33
Calculated Chi-Square Value 101.83

Step 4:- Calculate Chi-Square Value by applying formula:

Observed Frequency Expected Frequency


(Oi) (Ei) (Oi-Ei) (Oi-Ei)^2 (Oi-Ei)^2/E
1,1 900 824 76 5776 7.01
1,2 130 206 -76 5776 28.04
2,1 700 696 4 16 0.02
2,2 170 174 -4 16 0.09
3,1 400 480 -80 6400 13.33
3,2 200 120 80 6400 53.33
Calculated Chi-Square Value 101.83
Step 5:-Compare Calculated Chi-Square Value with Critical Value:

67
RM LAB FILE December 30, 1899

Calculated Chi-Square Value: 101.83

Critical Value at 5% Level of Significance, DOF=2

Chi-Square Critical Value: 5.991

Rule:- If Calculated Chi-Square Value is greater than the Chi-Square Critical Value, Reject
Null Hypothesis.

Step 6:- Interpretation.

Calculated Chi-Square Value: 101.83


Chi-Square Critical Value: 5.991
Here, calculated Chi-Square Value (101.83) is greater than the Critical Value at 5% Level of
Significance (5.991).

Therefore, Reject Null Hypothesis


Therefore, it can be concluded that there are relationships between the shifts and the quality of
parts produced.

Question – 3)

Step 1:-Create Hypothesis.

Null Hypothesis H0 The type of cigarette smoked and the sex are Independent.
Alternate Hypothesis H1 The type of cigarette smoked and the sex are Dependent.

Step 2:- Identify the level of Significance & Rejection criteria:

Rejection : If the p value is less than sig. value than reject the null hypothesis
Sig Value 0.05

68
RM LAB FILE December 30, 1899

Step 3:- Calculate Expected Frequency:


Expected Frequency
Gender
Cigarette
Male Female Total
A 34.83 20.17 55
B 34.83 20.17 55
C 25.33 14.67 40
Total 95 55 150

Observed Frequency Expected Frequency


(Oi) (Ei) (Oi-Ei) (Oi-Ei)^2 (Oi-Ei)^2/E
1,1 25 34.83 -9.83 96.6289 2.77
1,2 30 20.17 9.83 96.6289 4.79
2,1 40 34.83 5.17 26.7289 0.77
2,2 15 20.17 -5.17 26.7289 1.33
3,1 30 25.33 4.67 21.8089 0.86
3,2 10 14.67 -4.67 21.8089 1.49
Calculated Chi-Square Value 101.83

Step 5:-Compare Calculated Chi-Square Value with Critical Value:

Calculated Chi-Square Value: 12.01

Critical Value at 5% Level of Significance, DOF = 2

Chi-Square Critical Value: 5.991

Rule:- If Calculated Chi-Square Value is greater than the Chi-Square Critical Value, Reject
Null Hypothesis.

Step 6:- Interpretation.

Calculated Chi-Square Value: 12.01


Chi-Square Critical Value: 5.991
Here, calculated Chi-Square Value (12.01) is greater than the Critical Value at 5% Level of
Significance (5.991).

Therefore, Reject Null Hypothesis


Therefore, it can be concluded that the type of cigarette smoked and the sex are Dependent.

69

You might also like