Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
19 views39 pages

Harshil

Uploaded by

Kunal kapoor
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
19 views39 pages

Harshil

Uploaded by

Kunal kapoor
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 39

MAHARAJA SURAJMAL INSTITUTE

C4, JANAKPURI, NEW DELHI-110058


Affiliated to Guru Govind Singh Indraprastha University

Department of Business Administration


Business Research Methodology Lab

Submitted to: Submitted by:


Dr. Asha Choudhary Name: Harshil Sikka
Enrollment No: 04714901823
Class: BBA(B&I)
INDEX
S.No. Experiments Signature

1 Introduction

2 Creating Data file

3 Descriptive statistics: Crosstabs

Frequency distribution and Central


4
tendency

a. Correlation
5
b. t-test and p-value

6 Reliability Analysis

7 Regression
WHAT IS RESEARCH :

Research is defined as the creation of new knowledge and/or the use of existing
knowledge in a new and creative way so as to generate new concepts, methodologies
and understandings. Research is creative and systematic work undertaken to increase
the stock of knowledge. It involves the collection, organization,
and analysis of evidence to increase understanding of a topic, characterized by a
particular attentiveness to controlling sources of bias and error.
It is a systematic and methodical process of investigating a specific question or
problem to generate new knowledge or insights. It involves gathering, analyzing, and
interpreting data to answer questions, test hypotheses, or solve problems. Research
can be conducted in various fields, including the sciences, humanities, social sciences,
and more.

Research is crucial in advancing understanding, solving problems, and making


informed decisions in various fields, from medicine and technology to social sciences
and the arts.
RESEARCH PROCESS :

DEFINITION:
The research process is a set of ordered steps a researcher takes to ensure that all
parts of an investigation are completed to a high standard. Following the research
process allows the researcher to cover all angles and ensure that the information they
gather is reliable and effectively presented.

RESEARCH PROCESS:
1. Identify the Problem or Question: Determine what you want to study or what
problem you aim to solve. This involves defining a clear and specific research
question or hypothesis.

2. Conduct a Literature Review: Review existing research and literature related to


your topic. This helps you understand what has already been discovered and identify
gaps in the current knowledge.

3. Formulate a Hypothesis: Based on your understanding of the topic, develop a


hypothesis or a set of hypotheses. These are educated guesses or predictions about the
relationship between variables.

4. Design the Research: Plan how you will conduct your research. This includes
choosing a research design (e.g., experimental, observational, qualitative,
quantitative), selecting methods for data collection, and deciding how you will
analyze the data.

5. Collect Data: Gather information through various methods such as experiments,


surveys, interviews, or observations. Ensure that your data collection methods are
reliable and valid.

6. Analyze Data: Process and analyze the data you’ve collected. Use appropriate
statistical or qualitative analysis techniques to test your hypotheses or answer your
research questions.

7. Interpret Results: Evaluate the results of your analysis. Determine whether they
support your hypothesis or answer your research question. Consider the implications
of your findings and how they fit into the broader context of the existing literature.
8. Draw Conclusions: Summarize the main findings of your research. Discuss the
significance of your results and how they contribute to the field of study.

HOW TO LAUNCH SPSS?

1. Choose Start→ All Programs →IBM SPSS Statistics →SPSS Statistics 27. The
SPSS Welcome dialog shown here appears. This is where you can see what’s new in
the software, provide user feedback, and navigate to data files. You'll open one the
sample SPSS data files.

The SPSS Welcome dialog.

2. Click the Sample Files tab in the lower-left corner of the dialog.
3. Select the bankloan.sav data file and then click Open.

The bank loan dataset consists of 12 variables and 850 cases. The following figure
shows the first few lines of the dataset. Note the structure of the data. Respondents
who participated in the research make up the rows, and information about the
respondents such as age and income constitute the columns.
Typically, SPSS uses not rows and columns for the data, but cases and variables,
respectively. Cases represent the units of analysis, and variables represent the items
that have been measured.
The bank loan dataset in SPSS, the Data Editor window makes up only one of the
three main windows in the program. The others follow:

● Data Editor: Contains the data to be analysed

● Output Viewer: Displays the results of an analysis or graph

● Syntax Editor: Contains the code used to modify or create variables, as well as the
commands to run analyses or graphs

The Data Editor window is comprised of two views:


● Data view: Displays data with cases in rows and variables in columns
● Variable view: Displays detailed variable information, with variables represented
in rows and variable attributes represented in columns

The data view of the Data Editor window has various menus:
● File: Opens various types of files
● Edit: Performs the standard cut, copy, and paste Windows functions
● View: Changes fonts and gridlines
● Data: Performs data manipulations that modify the number of cases
● Transform: Performs data manipulations that modify the number of variables
● Analyze: Runs reports and statistical tests
● Graphs: Creates charts
● Utilities: Improves efficiency
● Extensions: Allows for SPSS to be used with programming languages
● Window: Toggles between windows
● Help: Provides assistance
Scales of measurement is how variables are defined and categorised. Psychologist
Stanley Stevens developed the four common scales of measurement: nominal,
ordinal, interval and ratio. Each scale of measurement has properties that determine
how to properly analyse the data. The properties evaluated are identity, magnitude,
equal intervals and a minimum value of zero.

The four scales of measurement


By understanding the scale of the measurement of their data, data scientists can
determine the kind of statistical test to perform.
1. Nominal scale of measurement
The nominal scale of measurement defines the identity property of data. This scale
has certain characteristics, but doesn’t have any form of numerical meaning. The data
can be placed into categories but can’t be multiplied, divided, added or subtracted
from one another. It’s also not possible to measure the difference between data points.
Examples of nominal data include eye colour and country of birth.
Nominal data can be broken down again into three categories:
● Nominal with order: Some nominal data can be sub-categorised in order, such as
“cold, warm, hot and very hot.”
● Nominal without order: Nominal data can also be sub-categorised as nominal
without order, such as male and female.
● Dichotomous: Dichotomous data is defined by having only two categories or
levels, such as “yes’ and ‘no’.

2. Ordinal scale of measurement


The ordinal scale defines data that is placed in a specific order. While each value is
ranked, there’s no information that specifies what differentiates the categories from each
other. These values can’t be added to or subtracted from.
An example of this kind of data would include satisfaction data points in a survey,
where ‘one = happy, two = neutral and three = unhappy.’ Where someone finished in a
race also describes ordinal data. While first place, second place or third place shows
what order the runners finished in, it doesn’t specify how far the first-place finisher
was in front of the second-place finisher.
3. Interval scale of measurement

The interval scale contains properties of nominal and ordered data, but the
difference between data points can be quantified. This type of data shows both the
order of the variables and the exact differences between the variables. They can be
added to or subtracted from each other, but not multiplied or divided. For
example, 40 degrees is not 20 degrees multiplied by two.
This scale is also characterised by the fact that the number zero is an existing
variable. In the ordinal scale, zero means that the data does not exist. In the
interval scale, zero has meaning – for example, if you measure degrees, zero has a
temperature.
Data points on the interval scale have the same difference between them. The
difference on the scale between 10 and 20 degrees is the same between 20 and 30
degrees. This scale is used to quantify the difference between variables, whereas
the other two scales are used to describe qualitative values only. Other examples
of interval scales include the year a car was made or the months of the year.

4. Ratio scale of measurement

Ratio scales of measurement include properties from all four scales of


measurement. The data is nominal and defined by an identity, can be classified in
order, contains intervals and can be broken down into exact value. Weight, height
and distance are all examples of ratio variables. Data in the ratio scale can be
added, subtracted, divided and multiplied. Ratio scales also differ from interval
scales in that the scale has a ‘true zero’. The number zero means that the data has
no value point. An example of this is height or weight, as someone cannot be zero
centimetres tall or weigh zero kilos – or be negative centimetres or negative kilos.
Examples of the use of this scale are calculating shares or sales. Of all types of
data on the scales of measurement, data scientists can do the most with ratio data
points.
FOLLOWING MUST BE KEPT IN MIND

 MUST START WITH A LETTER


 CAN’T END WITH A DOT OR UNDERSCORE
 HAS TO BE UNIQUE , DUPLICATIONS NOT ALLOWED
 BLANKS AND SPECIAL CHARACTERS CANT BE USED
 LENGTH OF NAME CANT EXCEED 64 BYTES / 64 CHARACTERS (MOST
CASES)
 RESERVE WORDS CANT BE USED AS VARIABLE NAMES
 NAMES ARENT CASE SENSITIVE
ASSIGNMENT 1

➢ Make a data file with the following variables and their information value label

Age
1 <20
2 20-30
3 30-40
4 40-50
5 >50
Gender
1 Male
2 Female
Occupation
1 Student
2 Service
3 Business
4 Other
Min. Income
1 <5000
2 5000-15000
3 15000-25000
4 25000-50000
5 >50000
Computer Brand Preference
1 Apple
2 HP
3 Dell
4 Lenovo
5 Other

Steps:
1. Open SPSS software.
2. Add mentioned variables under the ‘name’ column
3. Add their respective label and values.
4. Select the type of scale measurement as per the data entered.

 Data View
 Variable View
ASSIGNMENT 2

 Creating a descriptive statistics of the data.


Age
1 <20
2 20-25
3 25-30
4 30-40
5 >40
Gender
1 Male
2 Female
Education
1 High School
2 Graduate
3 Graduate with professional degree

4 Post Graduate
5 Post Graduate with Professional
degree

Work (years)
1 <1
2 1-5
3 5-10
4 10-20
5 >20
Steps:
1. Add the data and choose appropriate scale of measurement.
 Variable view:

 Data view:
2. Complete data on data view window of SPPS.

 Prepare cross tabulation between following :

1. Age and Gender


2. Gender and Work
3. Gender and Education
3. Now, Analyze- Descriptive statistics and then from the drop down menu
select Crosstabs.
5. Following output will be presented on window after running above
mentioned steps in SPSS.

 Crosstab between Age and Gender

 Crosstab between Gender and Work


 Crosstab between Gender and Education
ASSIGNMENT 3

 Calculate the frequency distribution and measures of


central tendency for following data:

Gender
1 Male
2 Female

Steps:
1. Insert all the given data on their respective
fields. Data view:
Variable View:

2. Click on Analyze- descriptive statistics – frequencies.


3. Choose statistics from the above window opened and select all
the boxes or selected boxes whatever u want to find and display.
4. Click OK.
Following output will be displayed showing the frequency distribution and
measures of central tendency:
P Value

A p-value, or probability value, is a statistical measurement that indicates how likely it


is that observed data occurred by chance, assuming a null hypothesis is true. P-values
are used in hypothesis testing to help decide whether to reject the null hypothesis.

Here are some of the characteristics of p-value

❖ Smaller p-values are more significant


A smaller p-value indicates a greater statistical difference between the two samples
and a greater likelihood of rejecting the null hypothesis. A p-value of 0.05 or
lower is generally considered statistically significant.

❖ P-values are calculated automatically


Statistical programs usually calculate p-values automatically. You can also use
tables to estimate p-values online.

❖ P-values are contextual


The definition of a p-value depends on what the experimenter planned to do, even
in situations that didn't occur.

❖ P-values are influential


P-values can determine which studies are published, which projects receive
funding, and which university faculty members become tenured. However, p-
values are often misinterpreted.
One Tailed Test

In hypothesis testing, when we test (check) whether the Sample parameter is Higher or
Lower (< or >) than the population parameter, it is known as the One-Tailed Test.

● To reject the null hypothesis, the sample parameter should be either


greater or less than the population parameter.
● The region of rejection (critical region) lies on one side of the sampling
distribution.
● It represents whether the estimated test parameter is less or greater than the
critical value.
● It has the entire 5% of the significance level on one side.
● Null and Alternate Hypotheses must be established before the test.

● It is often use when you want to test your hypothesis in both the direction

It may be either:

Left-Tail: When the population parameter is lower than the assumed value.

Right-Tail: When the population parameter is greater than the assumed

value. Two Tailed Test

In hypothesis testing, when we test whether the sample is greater or less than a certain
range of values, it is referred to as a two-tailed test.

● The region of rejection lies on both the side of the probability distribution
● It is used for Null Hypothesis Testing.
● The significant level is split in both directions.
ASSIGNMENT 4
(CORRELATION)
 Sandhya Gupta wants to see the scatter plot of information
availability (cause) and Online ordering (effect) by the respondents
INFORMATION : 1( Not Important), 2( Less Important), 3(
Important), 4( Very Important), 5( Extremely Important)
ORDER : 1(Never ), 2(Occasionally ), 3(Considerably ), 4( Almost
always ), 5(Always)

Steps:
1. Insert all the given data on respective
fields. Data view:
Variable view:

2. Click on Analyze- Correlate and select Bivariate


Output:

Interpretation of Data-
According to the results of the correlation test , the data is 99% consistent and
the margin of error is only 1%.
The data is highly positively correlated.
ASSIGNMENT 5
t-test:
A t-test is a type of inferential statistic used to determine if there is a significant
difference between the means of two groups, which may be related in certain features.
The t-test is one of many tests used for the purpose of hypothesis testing in statistics.
Calculating a t-test requires three key data values. They include: the difference
between the mean values from each data set (called the mean difference), the standard
deviation of each group, and the number of data values of each group.

There are two types of t-test that can be performed depending on the data and type of
analysis required. They are:

One Sample T-Test – The one-sample t-test is a statistical hypothesis test used to
determine whether an unknown population mean is different from a specific value.
You can use the test for continuous data. Your data should be a random sample from a
normal population.

Paired Sample T-Test – The paired t-test is performed when the samples typically
consist of matched pairs of similar units, or when there are cases of repeated
measures. For example, there may be instances of the same patients being tested
repeatedly— before and after receiving a particular treatment. In such cases, each
patient is being used as a control sample against themselves. This method also applies
to cases where the samples are related in some manner or have matching
characteristics, like a comparative analysis involving children, parents, or siblings.
Correlated or paired t-tests are of a dependent type, as these involve cases where the
two sets of samples are related.

Hypothesis:

H₀ (Null Hypothesis) = There exists no significant relationship between engine


efficiency between the current and previous trial.

H₁ (Alternate Hypothesis) = There exists significant relationship between engine


efficiency between the current and previous trial.
Ques: Indian oil has developed a formulation with increased use of ethanol in petroleum
products, which increases engine efficiency with less harmful emissions. 30 cars were
test driven with and without ethanol, and the number of kilometers per liter was
recorded. The cars that were used were having automatic or manual transmission.

 Create a data file with 30 respondents.

To find out t-test with one sample:

1. Click Analyze> Compare means> One-sample t-test


2. This will open the dialogue box > select the test variables and shift it in the
variable list

3. Click OK.

Output:

Interpretation:

The t-test results for the sample "WITHOUT ETHANOL" show that the calculated t-
value is 15.911 with 29 degrees of freedom (df = 29), and the p-value for a two-sided
test is less than 0.001 (p < .001). This indicates that the mean value of 14.83 is
significantly different from zero at a high confidence level (99.9% confidence level).
The 95% confidence interval for the mean difference ranges from 12.93 to 16.74,
which confirms that the average engine efficiency without ethanol is significantly
above zero.

Thus, we can conclude that there is a statistically significant engine efficiency for
vehicles running without ethanol, rejecting the null hypothesis that the mean
efficiency would be zero in this condition.
To find out paired sample t-test:

1. Click analyze > Compare means > paired sample T test

2. This will open the dialogue box > select the test variables and shift them in the
variable list
3. Click OK to see the output.

Output:

Interpretation:

The t-test results for the paired samples "WITHOUT ETHANOL" and "WITH
ETHANOL" show that the calculated t-value is -1.813 with 29 degrees of freedom.
The two-sided p-value is 0.080, which is slightly above the conventional significance
level of 0.05. This indicates that the observed difference in means may not be
statistically significant at the 5% level.The 95% confidence interval for the mean
difference ranges from -2.838 to 0.171. This interval includes zero, further suggesting
that the observed difference may not be statistically significant.
ASSIGNMENT 6
(RELIABILITY ANALYSIS)
Ques: A Manager conducted a training and development program for the employees of
ABC Ltd. He is interested in determining the effectiveness of the training and
development program and to check whether it leads to enhancement of employee’s
performance or not. Responses were collected from employees to check the employee
satisfaction from the training and development program.
 Create a data file with 50 respondents.

 Assign Values:
Gender: 1 – Male 2 – Female
Experience: 1 – 0-5 Years
2 – 5-10 Years
3 – More than 10Years
 Responses of all questions on Likert 5 – Point Scale:
1 – Strongly Disagree
2 – Disagree
3 – Neutral
4 – Agree
5 – Strongly Agree
 To check reliability following steps are to be performed:

1. Click Analyze> scales> Reliability analysis


2. The Relaibility analysis box will open . Import 12 questions to item . Then
click on statistics.

3. Select Scale and Scale if item deleted . Click continue and then ok.
Output:

Interpretation:

The value of Cronbach’s Alpha should should be or more to report reliability of the data
. As it can be seen that the value of Cronbachs Alpha is .823 which signifies that the
given data is reliable.
ASSIGNMENT 7
(REGRESSION)

1. Enter the data on the data and variable view.

2. After entering data go to Analyze> Regression> Linear.


3. The linear regression dialogue box appears from there, score in
midterm 1 of students is taken as dependent variable and the year
in school of students is taken as independent.
Output:

Interpretation:
The significance of .000 means that there is almost no significant
relationship between the year in school of the students and the score
obtained by them in midterm 1.

You might also like