Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
7 views37 pages

SFM A1.1

This report analyzes SPSS data from 501 observations, focusing on data sources, collection methods, and statistical analysis techniques. It discusses the pros and cons of existing data sources and statistical studies, as well as methods for data cleaning and analysis, including descriptive and inferential statistics. Key findings include insights on accommodation types, pricing, and cleanliness ratings, highlighting significant variations in data.

Uploaded by

thienphuongvu
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views37 pages

SFM A1.1

This report analyzes SPSS data from 501 observations, focusing on data sources, collection methods, and statistical analysis techniques. It discusses the pros and cons of existing data sources and statistical studies, as well as methods for data cleaning and analysis, including descriptive and inferential statistics. Key findings include insights on accommodation types, pricing, and cleanliness ratings, highlighting significant variations in data.

Uploaded by

thienphuongvu
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 37

I.

Introduction
This report aims to gather information about the SPSS data, draw conclusions, and make
predictions. We have presented the findings in different formats. In this study, we looked
at 501 different observations. These observations had values that fell between 5800 and
6300.

II. Major findings


Part A:
1. Data sources and data collection methods
According to research, in the field of Business and Economics, we have and classify three
types of data sources (Anderson, 2018).
 Existing source
Existing data sources can be easily observed and found in the data department needed for
a number of specific application platforms in an organization or enterprise and can be
considered as existing in a single database. several shapes. In business subjects, systems
will be clearly divided so that they can contain information systems about employees,
customers, transactions, commercial activities or records. The statistics are provided by
all types of organizations from governmental and non-governmental. Currently, under the
development of the Internet and computing technologies that store a lot of information,
Internet users can access and read data through basic search channels and online
platforms.

Pros Cons
- Maximum cost control - The selection of headings and the
- Time saving content of the data is sometimes not
clear and specific
- The available data will sometimes
be old and not updated in a timely
manner, creating inaccurate
information.
- Some data is only rough, without
thorough analysis.

 Statistical studies
In data research, it follows the situation to track the development of the plan and is
presented in the form of data with variables of interest so that analysis and statistics can
be performed based on it. According to the Australian Bureau of Statistics in 2022, there
are two types of research currently experimental and observational (without testing
impact). Before analyzing the different components of the study, we choose a variable
and assess how it affects the other variables involved. When we are studying certain
domains like population, we don't need to control variables as much because the natural
variability of the domain itself is already taken into account.

Pros Cons
Experimental - The data is controlled - The data may be corrupted by
effectively and personal error
maximally. - When you draw a single card
- The data will help to repeatedly, it can often result
find cause and effect in ongoing conflicts.
effectively, cause and - The parameters of the data can
effect relationship sometimes make predictions
difficult.
Observatio - Real world data will be - Difficulty finding cause and
n
more realistic effect relationships and the
(Non
experiment possibility of variables
al)
appearing.
- The data being studied will
produce results that may
affect the human observer as
well as the study itself.

Pros and cons of Existing source and Statistical studies:


Existing source Statistical studies

Pros - Costs are controlled and optimized - The data provided is


- Time saving in-depth and diverse
in many aspects
Cons - The details and choices to create data are - Old and erroneous
not clear and specific data can lead to false
- Old and outdated information that has not results and
been corrected and lacks depth and detail. erroneous research.
- Cost a lot of time

2. Method of data analysis.


 Descriptive statistics
Descriptive statistics are a collection of numerical coefficients that provide a complete
and detailed overview of data data for both the population and the sample (Adam Hayes,
2022). Descriptive statistics are used to help explain and thoroughly analyze data
collection through the form of summary analysis of sample measures with data. Those
statistics are not too complicated when using basic mathematical formulas such as mean,
mean, and mode. With the average value of a data series, the data after being clearly
classified will be added together and given the number. The calculated mean will have
the effect of processing raw data and is formalized and presented in the form of tables
and charts for easy reading and analysis.

 Inferential statistics
Trochim (2021) explains that inferential statistics is used to draw conclusions about a
larger group of people by analyzing data collected from a smaller sample of that group.
William M.K. Trochim (2022) explains that there are two types of statistics: descriptive
and inferential. When we use inferential statistics, we need to make sure that the samples
we choose are a good reflection of the whole population. This is really important to make
sure our results are reliable.When it comes to sampling, possibility is seen as the
standard.
 Differences between descriptive and inferential statistics
According to research, descriptive and inferential statistics have in common that they
both operate on a fixed set of data (Byrne.G, 2007). These two methods have many
differences in data analysis techniques and in-depth analysis. Descriptive statistics help
analyze and clarify the details of the data. Inferential statistic has the strength of being
able to in-depth analysis of the conclusions and results given from the given data. That
means that the methods will only be effectively applied to situations that really fit the
method.
Descriptive statistics requires selective group selection, measurement with all members
of the experimental group, and careful analysis. Inferential statistics requires the
determination of the population, presentative sampling, and calculations because it is
very difficult to measure the data with certainty and accuracy and it is not possible to take
a few single data and samples to be able to do this. evaluated in general.
3. Data collection method
Collecting data is an important task that involves gathering and analyzing different types
of information such as customer feedback, website traffic, and contact details like names
and emails. The main goal of this process is to help plan better marketing strategies,
improve our understanding of the people we want to reach, and make the most efficient
use of our resources (Jotform, 2022). Collecting data will help to synthesize detailed
information, from which it is possible to analyze and evaluate the strengths and
weaknesses of policies and plans, thereby making changes and improvements to solve
problems. outstanding problems. In addition, brands also need to collect data to be able to
assess customer perceptions, opinions, or identify trends and trends so that they can come
up with new plans or ideas that bring effectiveness
 Quantitative data collection
- Survey:
Jotform (2020) stated that surveys are widely used to collect quantitative data. When we
use a closed-ended survey format, we ask questions that have a limited set of answer
options to choose from. This can include questions with multiple-choice answers or
questions that only have two options, like yes or no. Surveys can help gather information
about people's age, gender, income, and employment status. Next, we'll ask participants
to rate a statement on a scale of one to ten in a survey question with limited answer
options.
- Document review and secondary data collection:
When scholars review documents, they look at numbers and information that come from
original sources like public records and personal papers. They do this to figure out how
important the information is. (Gorman, K., & Johnson, D.E., 2013). Secondary data
collection refers to the process of gathering and analyzing information from sources that
are not considered primary sources. You can use various sources to gather information,
such as the Internet, newspapers, magazines, company records, official documents,
academic journals, and books. The books, magazines, and newspapers that are created by
the government are known as governmental publications.
 Qualitative data collection
- Interview and observation
Interview questions are always set up with topics and issues related to the business.
During the interview process, the interviewee can completely provide more relevant
information at the discretion of the interviewer as well as the interviewer can fully exploit
many aspects of the question to ask more questions. ask.
Observation is widely used by not only academics but even managers in small and
medium enterprises because of its large and simple ability to collect information
(Grbich.C, 2012). This approach requires the researcher to observe carefully over a
period of time and take notes over time. Monitoring can be done with many supporting
tools such as movies, recordings or many tools that have the function of storing
information over time such as mementos.
- Focus group
A focus group usually includes a small group of about eight to ten people, but the exact
number may vary depending on what the researcher needs. According to Grbich (2012),
if researchers want to understand how a specific group of people interprets a particular
topic, focus group discussions can be a useful tool. The researchers looked at the variety
of opinions that were presented and then resolved.
Part B:

1. Data cleaning

Statistics
host_is_superho
room_type room_shared room_private st biz multi
N Valid 501 501 501 501 501 501
Missing 0 0 0 0 0 0

According to the data presented in the table above, there are all 501observations, none of
which are missing.

2. Qualitative analyzed

the type of the accommodation


Cumulative
Frequency Percent Valid Percent Percent
Valid Entire home/apt 208 41.5 41.5 41.5
Private room 293 58.5 58.5 100.0
Total 501 100.0 100.0
In the data table and pie chart presented above, private room is the type with a larger
percentage than the rest, entire home with 58.5% or 293 out of 501 compared to 41.5%.
which is 208 out of 501.

dummy variable (Whether the room is shared or not)


Cumulative
Frequency Percent Valid Percent Percent
Valid 0 501 100.0 100.0 100.0

In the table and the pie chart above, 100% of the rooms or 501 rooms are not shared
rooms.
dummy variable (Whether the room is private or not)
Cumulative
Frequency Percent Valid Percent Percent
Valid 0 208 41.5 41.5 41.5
1 293 58.5 58.5 100.0
Total 501 100.0 100.0

In the data table and pie chart above, a total of 501 units or 501 rooms are counted as
data. In which, there are 208 rooms out of 501 rooms ie 41.5% are not private. The
remaining 293 rooms, or 58.5% are private rooms.

dummy variable (Whether the host is a superhost or not)


Cumulative
Frequency Percent Valid Percent Percent
Valid 0 399 79.6 79.6 79.6
1 102 20.4 20.4 100.0
Total 501 100.0 100.0

According to the data in the statistics table and pie chart above, 399 hosts of rooms over
501 hosts account for 79.6% of non-superhosts. The remaining 102 accounts for 20.4%
are superhosts.

dummy variable (Whether the listing is for multiple rooms or


not)
Cumulative
Frequency Percent Valid Percent Percent
Valid 0 360 71.9 71.9 71.9
1 141 28.1 28.1 100.0
Total 501 100.0 100.0

According to statistics from the data table and pie chart above, 360 out of 501 account for
71.9% of the list of not for multiple rooms. The rest is 141, accounting for 28.1% of
rooms are for multiple rooms.

dummy variable (Whether the listing is for business


purposes or not)
Cumulative
Frequency Percent Valid Percent Percent
Valid 0 313 62.5 62.5 62.5
1 188 37.5 37.5 100.0
Total 501 100.0 100.0
According to statistics from the data table and pie chart above, 313 out of 501 rooms,
respectively 62.5%, are not for business purposes. The remaining 188 rooms, equivalent
to 37.5%, are for business purposes

3. Quantitative analyzed

Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation
the full price of 501 11994.40 82.55 12076.95 361.6789 587.87133
accommodation for two
people and two nights in
EUR
Valid N (listwise) 501
Extreme value outliers

In the table above, we can clearly see that in a total of 501 data collected on the price of
accommodation for two people and two nights by AirBnB, the average room price of the
main business is 361,6789 Euros. Besides, the lowest room price is only 82.55 Euro and
the highest price room is 12076.95 Euro. It is because of that huge difference that the
price gap between the cheapest and most expensive room is 11994.4 Euros and the
deviation in the exchange rate is 587,87133 Euros.
In the graph above, we can see that most of the accommodation has a reasonable rate,
mainly from 0 to 2000 Euro and gradually decreasing (inversely proportional to the
increase in price).
In the Boxplot table section, we have data to be able to explain the huge variation and
disparity by the existence of some extreme and outliers values. In total, there are very few
rooms with prices from 2000 Euro upwards and only a very few rooms with high prices
at 12000 Euro, the rest consists of rooms with prices ranging under 2000 Euro. Besides,
observations 193, 491, 460, 472,, 295 and 413 are extreme values and observations 318,
366, 315, 344, 184, 310 are outliers in the rents. The graph above tends to be completely
skewed to the left

Valid Missing Total


N Percent N Percent N Percent
the full price of 501 100.0% 0 0.0% 501 100.0%
accommodation for two
people and two nights in
EUR

In the above data table, we can see that the complete data is not missing any part.

Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation
the maximum number of 501 4 2 6 2.81 1.244
guests that can stay in the
room
Valid N (listwise) 501
In the table above, we can clearly see that the average number of people that can stay in a
room is 2.81 with the minimum number of guests in the room being 2 and the maximum
number of guests in the room being 6. That means is the range of the two units is 4
people because the deviation is 1,244 people.
On the graph, we can see that most of the rooms (more than 300 rooms) are 2 people in 1
room. With the number of people from 3, 4, 5, 6 people in 1 room, they are all under 100
with the number of rooms with 3 and 4 people slightly higher than rooms with 5 and 6
people. The room with 4 people has more number than the room with 3 people, the room
with 6 people has the number of rooms with 5 people. The graph above tends to be
skewed to the left

Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation
cleanliness rating (Max of 501 8 2 10 9.24 .951
10)
Valid N (listwise) 501
In the table above, we can see that AirBnB's cleanliness rating is very good with an
average score of 9.24. The highest rating for AirBnB is 10 and the lowest score is 2. The
range between the two frames is 8 and the deviation is 0.951.
In the graph section, we can see that the number of absolute 10 points for AirBnB
cleanliness is a lot. Nearly 250 reviews have scored 10 for cleanliness here and about 170
reviews have given it 9 for cleanliness. Reviews with a fairly good level from an above
average cleanliness scale of 6 to 8 are yes but not too much when the number of 8 points
is about 50, 7 is about 20 and 6 is less than 10. There is no 3 rating for AirBnB
cleanliness and there are a few reviews with a low score of 2 to 5 but not significant. The
graph above tends to skew to the right

Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation
overall rating of the listing 501 80 20 100 91.60 8.471
(Max of 100)
Valid N (listwise) 501
In the table above, the average rating of the listing of AirBnB is 91.60. The highest rating
is 100 and the lowest is 20, which means the range of the two metrics is 80. The deviation
is 8,471.
On the chart above, AirBnB's 100 rating of the listing scores the largest with about 100
votes. Besides, the number of ratings for AirBnB over 80 points excluding the 100 point
mark is also high with the number from about 18 to 70 votes. Ratings with low scores of
80 or less have all numbers below 15, and there are only a few ratings with scores of 60
or less with a negligible number. The graph above tends to skew to the right

Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation
number of bedrooms (0 for 501 8 0 8 1.12 .620
studios)
Valid N (listwise) 501
Based on the data of the above data table, the mean of the number of bedrooms is 1.12
beds, with the minimum number of bedroom being 0 (for studio) and a maximum of 8
bedroom. The Deviation is 0.620 bedrooms and the range is 8 bedrooms.
Based on the data of the above graph, most of AirBnB's rooms have 1 bedroom with the
number of nearly 400 rooms. The second largest metric is the number of two-bedroom
suites with about 60 rooms. AirBnB also has a number of studio rooms with 0 bedroom
with about 20 rooms. None of the rooms have 6 and 7 bedrooms. The number of rooms
with 3, 4, 5 and 8 bedrooms is not significant, all under 10 rooms with the number of
rooms with 3 bedrooms being the most. The graph above tends to be skewed to the left

Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation
distance from city centre in 501 15.408 .041 15.448 5.26999 2.588208
km
Valid N (listwise) 501
According to the data in the table above, the average distance of AirBnB rooms from the
city center is 5.26999km. The room with the furthest distance from the city center is
15.448km and the closest is 0.041km. The range between the two figures is 15,408km
and the deviatation is 2,588208km.
According to the graph above, the number of apartments close to the city center (under
5km) is quite a lot with the number of apartments 5km away being the most with about
60 apartments. The number of apartments from 0 to the landmark less than 5km has the
number of about 9 to 35 apartments. The number of apartments over 5km to 10km from
the city center is also relatively large with uneven order, from less than 50 apartments to
10 apartments with landmarks. From the distance of 10km onwards, the number of
apartments is much less. There are only a few apartments far from the city center with a
distance from the 10km mark to the 15km mark with a negligible number. The graph
above tends to be skewed to the left

Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation
distance from nearest metro 501 6.402 .033 6.435 .93917 1.030232
station in km
Valid N (listwise) 501

According to the data in the table above, the average distance of apartments to the nearest
gas station is 0.93917km. The farthest distance of an apartment to a gas station is
6.435km and the nearest is 0.033km. The range between the two quantifiers is 6.402km
and the deviation is 1.030232km.
According to the graph, most of the apartments are less than 2km from the nearest gas
station, with the number of apartments less than 1km away, the largest number with about
130 apartments. From the distance of 2km to the distance of 6km, the number of
apartments is negligible, all less than 15 apartments. The graph above tends to be skewed
to the left

Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation
attraction index (0-100) 501 90.619 5.430 96.049 21.13352 14.360858
Valid N (listwise) 501
According to the data of the table above, the average attraction index is 21,13352
listwise. The largest AirBnB attraction metric is 96,049 and the smallest is 5,430. The
range between 2 metrix is 90.619 and the deviation is 14.360858.
According to the graph above, the data of attraction index has a large concentration
ranging from 12 to 40 indexes with 16 index points with a large number of over 125
apartments. From after the 40-index mark to 100-index at the same point, 8 indexes have
a small quantity and are scattered with a negligible amount. The graph above tends to be
skewed to the left

Descriptive Statistics
N Range Minimum Maximum Mean Std. Deviation
restaurant index (0-100) 501 66.738 2.861 69.599 12.08715 9.726296
Valid N (listwise) 501
According to the above data, the average AirBnB restaurant index score is 12,08715 with
the lowest score of 2,861 and the highest index score of 69,599. The range between the 2
matrices is 66,738 and the deviation is 9,726296.
According to the above graph, except for the lowest index point with a negligible
number, the restaurant index points from 0 to 20 restaurant index points have a large
number with a minimum of 30 apartments or more and a maximum of about 170
apartments. From the 20 restaurant index mark onwards, the number of apartments with
scores higher than 20 is not too large, the milestones do not exceed 15 apartments. The
graph above tends to be skewed to the left

Descriptive Statistics
Std.
N Range Minimum Maximum Mean Deviation Variance Skewness
Statistic Statistic Statistic Statistic Statistic Statistic Statistic Statistic Std. Error
the full price of 501 11994.40 82.55 12076.95 361.6789 587.87133 345592.695 16.158 .109
accommodation for two
people and two nights in
EUR
the maximum number of 501 4 2 6 2.81 1.244 1.547 1.351 .109
guests that can stay in
the room
cleanliness rating (Max 501 8 2 10 9.24 .951 .905 -2.114 .109
of 10)
overall rating of the 501 80 20 100 91.60 8.471 71.759 -2.265 .109
listing (Max of 100)
number of bedrooms (0 501 8 0 8 1.12 .620 .385 3.802 .109
for studios)
distance from city centre 501 15.408 .041 15.448 5.26999 2.588208 6.699 .200 .109
in km
distance from nearest 501 6.402 .033 6.435 .93917 1.030232 1.061 2.668 .109
metro station in km
attraction index (0-100) 501 90.619 5.430 96.049 21.13352 14.360858 206.234 2.186 .109
restaurant index (0-100) 501 66.738 2.861 69.599 12.08715 9.726296 94.601 2.734 .109
Valid N (listwise) 501

Based on the data in the table above, the skewness statistic of the full price of
accommodation for two people and two nights in EUR is 16,158, and is the right
skewness. Statistic skewness of the maximum number of guests that can stay in the room
is 1,351 and is right skewness. Statistic skewness of the cleanliness rating is -2.114 and is
left skewness. Statistic skewness of the overall rating of the listing is - 2,265 and is left
skewness. Statistic skewness of number of bedrooms is 3,802 and is right skewness.
Statistic skewness of distance from city is 0.200 and is symmetric. Statistic skewness of
distance from nearest metro station is 2,668 and is right skewness. Statistic of attraction
index is 2,186 and is right skewness. Statistic of restaurant index and right skewness.
4. Correlation coefficient

Correlations
the full price of
accommodation
for two people and
two nights in EUR
the full price of accommodation Pearson Correlation 1
for two people and two nights in Sig. (2-tailed)
EUR N 501
cleanliness rating (Max of 10) Pearson Correlation -.072
Sig. (2-tailed) .108
N 501
overall rating of the listing (Max Pearson Correlation -.030
of 100) Sig. (2-tailed) .507
N 501
distance from city centre in km Pearson Correlation -.188**
Sig. (2-tailed) .000
N 501
distance from nearest metro Pearson Correlation -.108*
station in km Sig. (2-tailed) .015
N 501
attraction index (0-100) Pearson Correlation .208**
Sig. (2-tailed) .000
N 501
restaurant index (0-100) Pearson Correlation .175**
Sig. (2-tailed) .000
N 501
**. Correlation is significant at the 0.01 level (2-tailed).
*. Correlation is significant at the 0.05 level (2-tailed).

Based on the data of the above table, the correlation index of the rating of the listing of
AirBnB is -0.030. Since this indicator is negative, it is negative. The pearson index -
0.030 is in the range -0.0 to -0.3 so it is calculated as negligible correlation.

Based on the data of the above table, the correlation index of cleanliness rating of
AirBnB is -0.072. Since this indicator is negative, it is negative. The pearson index -
0.072 is in the range -0.0 to -0.3 so it is calculated as negligible correlation.
Based on the data of the above table, the correlation index of the distance from city center
of AirBnB is -0.188. Since this indicator is negative, it is negative. The pearson index -
0.188 is in the range -0.0 to -0.3 so it is calculated as negligible correlation. In addition,
the pearson index of this item has 2 asterisks, which is a significant parameter.

Based on the data of the above table, the correlation index of the distance from nearest
metro station of AirBnB is -0.108. Since this indicator is negative, it is negative. The
pearson index -0.108 is in the range -0.0 to -0.3 so it is calculated as negligible
correlation. In addition, the pearson index of this item has an asterisk, which is a
significant parameter.
Based on the data of the above table, the correlation index of AirBnB's attraction index is
0.208. Since this indicator is negative, it is positive. The pearson index of 0.208 is in the
range of 0.0 to 0.3 so it is considered negligible correlation. In addition, the pearson
index of this item has two asterisks, which is a significant parameter.

Based on the data of the above table, the correlation index of AirBnB restaurant index is
0.175. Since this indicator is negative, it is positive. The pearson index is 0.175 and
ranges from 0.0 to 0.3 so it is considered negligible correlation. In addition, the pearson
index of this item has 2 asterisks, which is a significant parameter.

5.
the full price of accommodation for
two people and two nights in EUR
Standard
Mean Deviation
the type of the Entire home/apt 516.56 271.38
accommodation Private room 251.73 714.38
dummy variable (Whether 0 516.56 271.38
the room is private or not) 1 251.73 714.38
dummy variable (Whether 0 377.44 644.93
the host is a superhost or 1 300.01 258.83
not)
dummy variable (Whether 0 380.44 670.34
the listing is for multiple 1 313.77 280.66
rooms or not)
dummy variable (Whether 0 333.52 706.66
the listing is for business 1 408.56 295.54
purposes or not)

Based on the data of the above table, we can see that entire home is the higher priced
home of the two compared with entire home mean of 516.56 and deviation of 271.38.
AirBnB's private room type has a mean of 251.73 and a deviation of 714.38. We can see
that the data of mean and deviation of the two types of houses have a significant
difference and that proves that the prices of these two types of houses are closely
dependent.
With the data of the if the room is private or not section, there is data similar to the
acccommondation type, so the analysis and evaluation will be similar to the above.
With the data of the rest, we can clearly see that the mean index is not too different and
the deviation index has a big difference with the mean of the entire home without
superhost being 377.44 and of the private room with superhost being 300.01 and the
same. with deviations of 644.93 and 258.83 respectively; mean of entire home without
multiple rooms is 380.44 and of private room with multiple rooms is 313.77 with
deviation of 670.34 and 280.66 respectively; mean of entire room for non-business
purpose is 333.52 and of private room for business purpose is 408.56 with deviation of
706.66 and 295.54 respectively, then it can be confirmed that these two components are
not related and be independent with each other.

6.
When we want to visually represent the qualitative attributes of data, we can use a frequency
table along with either a pie chart or a bar chart. We use descriptive tables, histograms, and
boxplots to show numerical data. To get the most accurate comparison between two quantitative
variables, it's common to use a correlation table and a scatter plot.

Part C

- T test:

Group Statistics
dummy variable (Whether the
room is private or not) N Mean Std. Deviation Std. Error Mean
the full price of 0 208 516.5621 271.38441 18.81712
accommodation for two 1 293 251.7277 714.38036 41.73455
people and two nights in
EUR

Independent Samples Test


Levene's Test for Equality of Variances

F Sig. t df Sig. (2-ta


the full price of accommodation Equal variances assumed .722 .396 5.091 499
for two people and two nights in Equal variances not assumed 5.785 399.499
EUR

Before we compare the costs of private and shared accommodations, it's important to
make sure that all the basic features are the same. Both of these ideas need more
examination.
o Equality and Variances:
H0: σ_private2 = σ_(not private)2
H1: σ_(private )2 ≠ σ_(not private)2
Sig(1) = 0.396 > α = 0.05
 Accept H0
 Private room and not private room has equal variances.

o Equality and Means:


H0: μ_private = μ_(not private)
H1: μ_private ≠ μ_(not private)
Sig( 2 – tailer) = 0.000 < α = 0.05
 Accpet H1
 Private room and not private room has no equal means of price.

o Equality and Variances:


H0: σ_( for business purposes)2 = σ_(not for business purposes)2
H1: σ_( for business purposes )2 ≠ σ_(not for business purposes)2
Sig(1) = 0.981 > α = 0.05
 Accept H0
 For business purposes and not for business purposes has equal variances

o Equality and Means:


H0: μ_ for business purposes = μ_(not for business purposes)
H1: μ_ for business purposes ≠ μ_(not for business purposes)
Sig (2 – tailer) = 0.167 > α = 0.05
 Accept H0
 For business purposes and not for business purposes has equal means and price.

- The regression model of price on all other quantitative variables

Model Summary
Adjusted R Std. Error of the
Model R R Square Square Estimate
a
1 .354 .125 .111 554.30037
a. Predictors: (Constant), restaurant index (0-100) , number of
bedrooms (0 for studios), overall rating of the listing (Max of 100),
distance from nearest metro station in km , the maximum number of
guests that can stay in the room , cleanliness rating (Max of 10),
distance from city centre in km, attraction index (0-100)
According to the data table above, the r square parameter of 0.125 is equal to 12.5% of
the full price. The above parameter is described by 8 quantiative variables belonging to
predictors including restaurant index (0-100) , number of bedrooms (0 for studios),
overall rating of the listing (Max of 100), distance from nearest metro station in km , the
maximum number of guests that can stay in the room , cleanliness rating (Max of 10),
distance from city center in km, attraction index (0-100). The remaining 87.5% of the full
price variable are the possibilities for the full price variable described by other variable
types.

Coefficientsa
Model Unstandardized Coefficients Standardized t Sig.
Coefficients
B Std. Error Beta
1 (Constant) -114.202 325.290 -.351 .726
the maximum number of 38.299 22.776 .081 1.682 .093
guests that can stay in the
room
cleanliness rating (Max of -54.011 42.771 -.087 -1.263 .207
10)
overall rating of the listing 6.009 4.801 .087 1.252 .211
(Max of 100)
number of bedrooms (0 for 218.573 45.294 .231 4.826 .000
studios)
distance from city centre in -5.646 18.578 -.025 -.304 .761
km
distance from nearest metro -37.189 29.062 -.065 -1.280 .201
station in km
attraction index (0-100) 9.309 5.125 .227 1.816 .070
restaurant index (0-100) -5.027 6.675 -.083 -.753 .452
a. Dependent Variable: the full price of accommodation for two people and two nights in EUR

Real sum = -114.202 + 38.299 x the maximum of guest – 54.011 x the cleanliness rating
+ 6.009 x rating of the listing + 218.573 x number of bedrooms – 5.646 x distance from
city centre – 37.189 x distance from nearest metro station + 9.309 x attraction index –
5.027 x restaurant index.

From the above data table, we can have the following conclusions:
 The maximum number of guests that can stay in the room increases by 1 person,
its full price will increase to 38,299 Euro.
 The cleanliness rating increases by 1 rate, the full price of accommodation will
decrease by 54,011 Euro
 If the overall rating of the listing increases by 1 rate, the full price of the house
will increase by 6,009 Euros.
 If the number of bedrooms increases to 1 bedroom per accommodation, the full
price will increase by 218,573 Euro.
 If the distance from city center increases the distance to 1km, the full price will be
reduced by 5,646 Euros.
 If the distance from nearest metro station increases the distance by 1km, the full
price will be reduced by 37,189 Euro.
 If the attraction index increases by 1 index, the full price will increase by 9,309
Euros.
 If the restaurant index increases by 1 index, the full price will decrease by 5,027
Euros.

 Coefficients testing
- The maximum number of guests that can stay in the room
H0: β=0
H1: β≠0
Sig ( The maximum number of guests that can stay in the room )= 0.093 > α = 0.05
 Accept H0
 The relationship between the price and the maximum of guest is no significant

- The cleanliness rating


H0: β=0
H1: β≠0
Sig (The cleanliness rating) = 0.207 > α = 0.05
 Accept H0
 The relationship between the price and the cleanliness rating is no significant

- The overall rating of the listing


H0: β=0
H1: β≠0
Sig (The overall rating of the listing) = 0.211 > α = 0.05
 Accept H0
 The relationship between the price and the overall rating of the listing is no
significant.

- The number of bedrooms


H0: β=0
H1: β≠0
Sig( The number of bedrooms) = 0.000 < α = 0.05
 Accept H1
 The relationship between the price and the number of bedrooms is significant

- The distance from city centre


H0: β=0
H1: β≠0
Sig( The distance from city centre) =0.761 > α = 0.05
 Accept H0
 The relationship between the price and the distance from city centre is no
significant

- The distance from nearest metro station


H0: β=0
H1: β≠0
Sig( The distance from nearest metro station) = 0.201 > α = 0.05
 Accept H0
 The relationship between the price and the distance from the nearest metro station
is no significant

- The attraction index


H0: β=0
H1: β≠0
Sig( The attraction index) = 0.070 > α = 0.05
 Accept H0
 The relationship between the price and the attraction index is no significant

- The restaurant index


H0: β=0
H1: β≠0
Sig( The restaurant index) = 0.452 > α = 0.05
 Accept H0
 The relationship between the price and the attraction index is no significant

Below is the data table of the full price and the number of bedrooms

Model Summary
Adjusted R Std. Error of the
Model R R Square Square Estimate
a
1 .258 .067 .065 568.46178
a. Predictors: (Constant), number of bedrooms (0 for studios)

Coefficientsa
Standardized
Unstandardized Coefficients Coefficients
Model B Std. Error Beta t Sig.
1 (Constant) 86.352 52.600 1.642 .101
number of bedrooms (0 for 245.007 40.990 .258 5.977 .000
studios)
a. Dependent Variable: the full price of accommodation for two people and two nights in EUR
After the above analysis, we can evaluate these 2 tables that are considered the most
workable model

III. REFERENCES LIST


1. Hayes, A. (2022) Descriptive statistics: Definition, Overview,
types, example,
Investopedia. Investopedia. Available at:
https://www.investopedia.com/terms/d/descriptive_statistics.asp
2. Gorman, K. and Johnson, D.E., 2013. Quantitative analysis
3. Australian Bureau of Statistics (2022). Statistical Language -
Quantitative and
Qualitative Data. Available at:
https://www.abs.gov.au/websitedbs/D3310114.nsf/Home/Statistic
al+Language
+-+quantitative+and+qualitative+data#:~:text=What%20are
%20quantitative
%20and%20qualitative,symbol%2C%20or%20a%20number
%20code.
(Accessed: 20 November 2022)
4. Kabir, S. M. S., 2016. Methods of Data Collection. In: Basic Guidelines for
Research: An Introductory Approach for All Disciplines. 1st ed. Chittagong:
Book Zone Publication, pp. 201-275.
5. Simonsohn, U., Simmons, J. P., & Nelson, L. D. (2019). Specification curve:
Descriptive and inferential statistics on all reasonable specifications. Available at
SSRN 2694998
6. Trochim, W.M.K Inferential statistics, Research Methods
Knowledge Base.
Conjointly. Available at: https://conjointly.com/kb/inferential-
statistics/

You might also like