Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
102 views19 pages

Civ 7101 Assignment 1 2023 Katende Abdulazziz

The document contains data from multiple sources including number of orders received over 40 days, speeds of 45 motorists, yield strength of steel bars, lengths of 100 steel rods measured, major highway defects observed over 90km, and functionality of deep boreholes. For each dataset, the document lists 7-9 questions to analyze the data through frequency tables, histograms, probability distributions, summary statistics, and confidence intervals. This would allow understanding of descriptive statistics and their applications.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
102 views19 pages

Civ 7101 Assignment 1 2023 Katende Abdulazziz

The document contains data from multiple sources including number of orders received over 40 days, speeds of 45 motorists, yield strength of steel bars, lengths of 100 steel rods measured, major highway defects observed over 90km, and functionality of deep boreholes. For each dataset, the document lists 7-9 questions to analyze the data through frequency tables, histograms, probability distributions, summary statistics, and confidence intervals. This would allow understanding of descriptive statistics and their applications.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 19

Civ 7101 Assignment 1 2023

KATENDE ABDULAZZIZ

The table below shows the number of orders received by Company X for a period of 40 days

24 13 28 15 25 29 15 46
9 10 17 22 23 17 16 32
11 12 18 20 13 27 18 22
20 14 26 14 19 19 40 31
17 21 23 26 18 24 21 27

i. Summarize the above data in a suitable frequency table.


ii. Draw the histogram, frequency polygon and ogive
iii. From the shape of the frequency polygon, what is the assumed probability distribution of the
data?
iv. What are the properties/characteristics of the assumed probability distribution?
v. Calculate the summary statistics and tabulate your results as shown below
N Statistic Formul Calculation Result
o a
1 Mean
2 Median
3 Mode
4 Range
5 Variance
6 Standard deviation
7 Coefficient of Variation
8 Skewness
9 Kurtosis

vi. Construct a 95% and 99% Confidence Interval for the data given.
vii. If on a certain day 35 orders were received, is the difference in the orders significant?

…………………………………………………………………………………………………………………………………..

MUHWEZI GILBERT

The road safety transport is concerned about the speed motorists are driving on Jinja road.

Below are the speeds of 45 motorists in miles per hour

15 32 45 46 42 37 68 47 18
31 48 49 56 52 39 48 69 61
44 42 38 52 55 58 63 58 48
56 58 48 47 52 37 64 29 55
38 29 68 49 69 18 64 55 49

i. Summarize the above data in a suitable frequency table.


ii. Draw the histogram, frequency polygon and ogive
iii. From the shape of the frequency polygon, what is the assumed probability distribution of the
data?
iv. What are the properties/characteristics of the assumed probability distribution?
v. Calculate the summary statistics and tabulate your results as shown below
N Statistic Formul Calculation Result
o a
1 Mean
2 Median
3 Mode
4 Range
5 Variance
6 Standard deviation
7 Coefficient of Variation
8 Skewness
9 Kurtosis

vi. Construct a 95% and 99% Confidence Interval for the data given.
vii. If a vehicle was randomly taken and was found to be moving at 20 miles per hour, test for
significance

……………………………………………………………………………………………………………………………………….

AGREY NDIKUBWIMANA

The table below shows data of yield strength of steel bars of Y 20 as tested by Roko Construction

665 498 494 463 627


554 531 497 491 636
562 474 611 646 492
598 577 513 552 543
612 605 576 510 483
479 594 532 563 667
495 522 643 609 588
515 469 589 621 643
527 633 624 555 561
486 590 670 484 494
i. Summarize the above data in a suitable frequency table.
ii. Draw the histogram, frequency polygon and ogive
iii. From the shape of the frequency polygon, what is the assumed probability distribution of the
data?
iv. What are the properties/characteristics of the assumed probability distribution?
v. Calculate the summary statistics and tabulate your results as shown below
N Statistic Formul Calculation Result
o a
1 Mean
2 Median
3 Mode
4 Range
5 Variance
6 Standard deviation
7 Coefficient of Variation
8 Skewness
9 Kurtosis

vi. Construct a 95% and 99% Confidence Interval for the data given.
vii. If a bar was randomly taken and tested and its compressive strength was found to be 600 MPa,
is the difference significant?

…………………………………………………………………………………………………………………………………………….

KATENDE MICHAEL

The table below shows lengths of 100 steel rods (mm) measured at a construction site. Use the table to
answer questions as you demonstrate your understanding of descriptive statistics.

144 146 154 146


151 150 134 153
145 139 143 152
154 146 152 148
157 153 155 157
157 150 145 147
149 144 137 155
141 147 149 155
158 150 149 156
145 148 152 154
151 150 154 153
155 145 152 148
152 146 152 142
144 160 150 149
150 146 148 157
147 144 148 149
155 150 153 148
157 148 149 153
153 155 149 151
155 142 150 150
146 156 148 160
152 147 158 154
143 156 151 151
151 152 157 149
154 140 157 151

i. Summarize the above data in a suitable frequency table.


ii. Draw the histogram, frequency polygon and ogive
iii. From the shape of the frequency polygon, what is the assumed probability distribution of the
data?
iv. What are the properties/characteristics of the assumed probability distribution?
v. Calculate the summary statistics and tabulate your results as shown below
N Statistic Formul Calculation Result
o a
1 Mean
2 Median
3 Mode
4 Range
5 Variance
6 Standard deviation
7 Coefficient of Variation
8 Skewness
9 Kurtosis

vi. Construct a 95% and 99% Confidence Interval for the data given.
vii. If a bar was randomly taken and its diameter was found to be 200 mm, is the difference
significant?
………………………………………………………………………………………………………………………………………………..

KIBUULE IVAN

Raw data collected on highway for major defects at intervals of 1km for 90km in preparation for repair
works.

2 1 1 3 1 1 0 0 0 1
1 3 2 0 0 2 4 0 1 1
2 1 1 2 4 2 1 1 3 4
1 1 1 1 2 4 2 2 1 3
1 5 5 4 5 3 2 1 1 1
0 0 1 0 1 0 0 1 0 0
3 2 2 3 1 3 2 0 1 0
2 3 2 2 1 1 0 2 1 2
0 0 0 2 0 1 1 2 1 1

i. Summarize the above data in a suitable frequency table.


ii. Draw the histogram, frequency polygon and ogive
iii. From the shape of the frequency polygon, what is the assumed probability distribution of the
data?
iv. What are the properties/characteristics of the assumed probability distribution?
v. Calculate the summary statistics and tabulate your results as shown below
N Statistic Formul Calculation Result
o a
1 Mean
2 Median
3 Mode
4 Range
5 Variance
6 Standard deviation
7 Coefficient of Variation
8 Skewness
9 Kurtosis

…………………………………………………………………………………………………………………………………………………….
YUSUF SSENTEZA
The data below was collected from Lamwo District Water Department, which indicates the
record of deep boreholes functionality in Lokung sub-county in December, 2020.

S/ Borehole Sub- Completio


Village Location Parish Eastings Northings Status
n Number County n Date
Not
DWD 21406 Aweno Olwi Agur P.7 Dibolyec Lokung 2008 472427 411290
1 Functional
Not
DWD 22929 Aweno Olwi Market Dibolyec Lokung 2006 471837 411350
2 Functional
3 DWD 22901 Aweno Olwi Wange pa Ajik Dibolyec Lokung 2006 471042 411445 Functional
4 DWD 58306 Aweno Olwi Police Post Dibolyec Lokung 2016 472936 413355 Functional
5 DWD 58307 Aweno Olwi Barracks Dibolyec Lokung 2016 475612 413491 Functional
6 DWD 22930 Aweno Olwi Aweno Olwi Dibolyec Lokung 2006 471936 411486 Functional
Not
DWD 31447 Ber Lobo Ber Lobo Dibolyec Lokung 2011 477424 411849
7 Functional
Not
DWD 31395 Latigiriu Latigiro Dibolyec Lokung 2010 470013 412810
8 Functional
Lugwak Not
DWD 4040 Tee Owor Dibolyec Lokung 2008 470898 404580
9 Central Functional
Lugwak Not
DWD 25732 Dibolyec H/C Dibolyec Lokung 2007 471237 404796
10 Central Functional
Not
DWD 35204 Lugwak East Ogwang Gere Dibolyec Lokung 2011 473839 405690
11 Functional
Lukwak Corner Tee
DWD 32569 Dibolyec Lokung 2010 471200 405352 Functional
12 Central Olam
13 DWD 22353 Pateke Kicenga Dibolyec Lokung 2003 479531 412345 Functional
Not
DWD 27566 Pateke Lodeg Ayii Dibolyec Lokung 2004 479459 412087
14 Functional
15 DWD 28688 Ywaya East Dibolyec P.7 Dibolyec Lokung 2009 477012 408918 Functional
16 DWD 31372 Ywaya East Ywaya East Dibolyec Lokung 2010 477882 409668 Functional
17 DWD 45236 Ywaya West Pii Pe Dibolyec Lokung 2015 476597 408285 Functional
18 DWD 27992 Lela Pwot Lela Pwot Lela Pwot Lokung 2009 467468 401186 Functional
Not
DWD 27410 Lelabul East Lelabul East Lela pwot Lokung 2009 461687 410651
19 Functional
20 DWD 29370 Lelabul West Lela Pwot P/S Lela Pwot Lokung 2009 467231 401917 Functional
Lelapwot
DWD 28684 Lelapwot West Lela Pwot Lokung 2010 467544 401179 Functional
21 West
22 DWD 31363 Omer-kongo Lacara Lela pwot Lokung 2009 468717 401544 Functional
23 DWD 27615 Tedo Pe Kwon Odong Lela Pwot Lokung 2008 467658 403854 Functional
24 DWD 29682 Tedo pe Akwangiro Lela pwot Lokung 2009 465550 406437 Functional
25 DWD 29685 Tedo pe Lelabul P/S Lela pwot Lokung 2009 464539 406781 Functional
26 DWD 52229 Tedo Pe Akwera lelapwot lokung 2016 463173 408780 Functional
Not
DWD 12227 Apuk Apuk Licwar Lokung 2005 450160 408068
27 Functional
Not
DWD 12226 Apuk lagwe Tadi Licwar Lokung 1997 446256 408263
28 Functional
29 DWD 44935 Apuk A lot neko nga Licwar Lokung 2014 453187 407457 Functional
Not
DWD 31389 Apuk Apuk Licwar Lokung 2010 451611 408103
30 Functional
31 DWD 33300 Ghana Ghana Licwar Lokung 2011 461930 405332 Functional
DWD Lakwala
Lakwala Licwar Lokung 2013 456059 402606 Functional
32 31447B West
Lakwala
DWD 31130 Dog Gudi Licwar Lokung 2010 457823 402097 Functional
33 west
34 DWD 21501 Licwar West Ogwal wor Licwar Lokung   460240 397203 Functional
35 DWD 21500 Licwar West Dyang Onyono Licwar Lokung 2007 460258 396650 Functional
36 DWD 12231 Licwar West Larobi Licwar Lokung   463081 400383 Functional
Ngomoromo Not
DWD 10450 Ngomoromo Licwar Lokung   453847 408235
37 yoke Functional
Ngomoromo
DWD 22915 Ngomoromo Licwar Lokung 2006 453933 407941 Functional
38 Sch. 2
39 DWD 41898 Ngomoromo Border Market Licwar Lokung 2013 453878 409234 Functional
Not
DWD 22928 Ngomoromo Ngomoromo Licwar Lokung 2006 454295 407849
40 Functional
Ngomoromo Not
DWD 22927 Ngomoromo Licwar Lokung 2006 454327 407795
41 HC Functional
42 DWD 21317 Ngomoromo Ngomoromo Licwar Lokung 2006 454294 407851 Functional
Ngomoromo
DWD 58308 Ngomoromo Licwar Lokung 2016 454403 407495 Functional
43 HC
Akeli Kongo Akeli Kongo Not
CD 3394 Pangira Lokung   457175 395848
44 West West Functional
Akelikongo Akelikongo
DWD 31434 Pangira Lokung 2010 458209 398924 Functional
45 Central Central
Source: Water Department Lamwo District

i. Summarize the above data in a suitable frequency table.


ii. Draw the histogram, frequency polygon and ogive
iii. From the shape of the frequency polygon, what is the assumed probability distribution of the
data?
iv. What are the properties/characteristics of the assumed probability distribution?
v. Calculate the summary statistics and tabulate your results as shown below
N Statistic Formul Calculation Result
o a
1 Mean
2 Median
3 Mode
4 Range
5 Variance
6 Standard deviation
7 Coefficient of Variation
8 Skewness
9 Kurtosis
…………………………………………………………………………………………………………………………………………

OSWAHA MATHEW JOSEPH ODIONGO

In the following set of data, y represents the number of annual claims for flood damage received by an
insurance company (in thousands of dollars) and x represents the annual rainfall (in centimeters) over a
period of 10 years

y x
0 110
2.5 250
2.2 250
0 150
19.5 450
2.5 200
2 210
2 230
3.1 290
0 100

i. Draw the scatter plot


ii. Calculate the correlation coefficient.
iii. Determine the coefficient of determination. What does it mean?
iv. Determine the linear regression equation using the least squares method.
v. Use the regression equation to predict the number of flood damage claims in a year with
350 cm of rainfall. Comment on the likely accuracy of your prediction
vi. Perform a t-test for the correlation coefficient (is there evidence of a linear relationship
between x and y?
vii. Test the regression slope for significance.
viii. List down the most common assumptions of linear regression.

……………………………………………………………………………………………………………………………………………….

ABAASA PATRICK

The following data have been collected regarding sales and advertising expenditure
Advertising
expenditure, Thousands
Sales, million USD of USD
8.5 210
9.2 250
7.9 290
8.6 330
9.4 370
10.1 410

i. Draw the scatter plot


ii. Calculate the correlation coefficient.
iii. Determine the coefficient of determination. What does it mean?
iv. Determine the linear regression equation using the least squares method.
v. Use the regression equation to predict the sales when the advertising expenditure was 350
thousand USD. Comment on the likely accuracy of your prediction
vi. Perform a t-test for the correlation coefficient (is there evidence of a linear relationship
between x and y?
vii. Test the regression slope for significance.
viii. List down the most common assumptions of linear regression.

……………………………………………………………………………………………………………………………………………………………

KAYABYA WILSON

A University is investigating the relationship between performance in Advanced Mathematics Course and
hours studied per week and the general level of intelligence of a student. The data on 10 students is
tabulated below:

Studen Examinatio
t Hours IQ n mark
x1 x2 y
1 9 99 56
2 6 100 45
3 12 119 80

4 14 95 73
5 11 110 71
6 6 117 55
7 19 98 95
8 16 101 86
9 3 100 34
10 9 115 66
i. Calculate the multiple correlation coefficient.
ii. Determine the multiple coefficient of determination. What does it mean?
iii. Determine the multiple linear regression equation using the least squares method.
iv. Use the multiple regression equation to predict the score of a candidate who has worked
for 13 hours per week and who has an IQ of 102. Comment on the likely accuracy of your
prediction
v. Perform a multicollinearity test (is there evidence of a correlation among variables?
vi. Test the multiple regression slopes for significance.
vii. List down the most common assumptions of multiple linear regression.

…………………………………………………………………………………………………………………………………………………………

KAWOOYA JONATHAN

Kampala Capital City Authority is currently implementing upgrade of city drainages in Kampala
City under Lots 1,2,3 with supervision of UB Consulting Engineers. One of the specific tasks
under quality control aspect is concrete testing. One of the questions most frequently asked by
local leaders during site meetings are: If we require more channel in our community, how much
can we expect to invest in form of PPP? The Client has been asked to develop some guidelines
regarding cost estimates. Three variables are thought to relate to the project costs: (1) the mean
monthly rainfall, (2) the number of days affected by the rains considered as delays, and (3) time
so far consumed by the project. To investigate, KCCA selected a random sample of 20 recently
constructed channel in the vicinity.

Channel Mean rainfall Delays


Cost(106) Duration Elapsed
No. (mm) (days)
(weeks)
1 250 35 3 6
2 360 29 4 10
3 165 36 7 3
4 43 60 6 9
5 92 65 5 6
6 200 30 5 5
7 355 10 6 7
8 290 7 10 10
9 230 21 9 11
10 120 55 2 5
11 73 54 12 4
12 205 48 5 1
13 400 20 5 15
14 320 39 4 7
15 72 60 8 6
16 272 20 5 8
17 94 58 7 3
18 190 40 8 11
19 235 27 9 8
20 139 30 7 5

i. Calculate the multiple correlation coefficient.


ii. Determine the multiple coefficient of determination. What does it mean?
iii. Determine the multiple linear regression equation using the least squares method.
iv. Use the multiple regression equation to predict the cost of a channel when the mean rainfall
was 30mm, causing a delay of 4 days and elapsed duration of 6 weeks. Comment on the
likely accuracy of your prediction
v. Perform a multicollinearity test (is there evidence of a correlation among variables?
vi. Test the multiple regression slopes for significance.
vii. List down the most common assumptions of multiple linear regression.

………………………………………………………………………………………………………………………………………….

MAFUKO GEORGE

Non-linear growths eg Exponential growth

The exponential function takes the form


x
y=ab

Where y is the variable to be predicted.

A and b are constants

X denotes the number of period.

If b is more than 1, then there is exponential growth; if b is less than 1, there is exponential decay.
The exponential function can be reduced to a linear form by taking logarithms of the function and
becomes

log y=loga + xlogb

Or

log y= A +Bx
Where, A=log a∧B=log b

In the example below, the growth of sales were recorded as shown in the table below

Sales,
Million
Year USD
1989 100
1990 150
1991 225
1992 337.5
1993 506.25

i. Show that the exponential equation is given by


x x
y=a b =100∗1.5
ii. What are the properties or characteristics of exponential functions?

………………………………………………………………………………………………………………………………….

MUTUMBA MOSES NSEREKO

In Nakaseke district, the number of deep boreholes drilled dry per financial year (FY) were recorded over
a period of 14 years from FY 2005/6 to FY 2018/19 as shown in the following table:

FY 2005/6 2006/7 2007/8 2008/9 2009/10 2010/11 2011/12 2012/13 2013/14


# Drilled 5 18 10 12 12 12 12 7 7
# Drilled 0 0 0 3 1 2 2 1 0
Dry
FY 2014/15 20015/16 2016/17 2017/18 2018/19

# Drilled 14 10 10 9 9
# Drilled 1 0 1 2 3
Dry

The data could have been presented as follows:

# of dry deep 0 1 2 3 >3


boreholes per FY , x
Frequency, f 5 4 3 2 0

i. Determine the summary statistics of the data.


ii. What are the properties or characteristics of Poisson distributions?

…………………………………………………………………………………………………………………………………………………

NAMYALO MARY
Rebound hammer test was used to conduct a condition assessment of a commercial building in
Kampala City. The values below are the corresponding compressive strength results obtained
from the rebound hammer tests on columns of the structure.
38 34 38 38 37 38 39 41 41 40 39 38 38 40 38 40 39 40 39 39 39 44 43 43 37 37
39 42 38 33 34 38 39 39 44 39 38 39 40 35 40 38 37 37 38 38 39 40 37 39 39 40
36 36
i. Summarize the above data in a suitable frequency table.
ii. Draw the histogram, frequency polygon and ogive
iii. From the shape of the frequency polygon, what is the assumed probability distribution of the
data?
iv. What are the properties/characteristics of the assumed probability distribution?
v. Calculate the summary statistics and tabulate your results as shown below
N Statistic Formul Calculation Result
o a
1 Mean
2 Median
3 Mode
4 Range
5 Variance
6 Standard deviation
7 Coefficient of Variation
8 Skewness
9 Kurtosis

vi. Construct a 95% and 99% Confidence Interval for the data given.
vii. If another test was done and the corresponding compressive strength was found to be 35 MPa,
is that difference significant?

………………………………………………………………………………………………………………………

LOWUM SIMON PETER


Hypothesis testing

1. The target thickness for silicon wafers used in a certain type of integrated circuit is 245  m.
A sample of 50 wafers is obtained and the thickness of each one is determined, resulting in a
sample mean thickness of 246.18  m and a sample standard deviation of 3.60  m.
Does this data suggest that true average wafer thickness is
something other than the target value?
2. A manufacturing process produces TV. tubes with an average life
m=1200 hours and s = 300 hours. A new process is thought to give tubes a higher average life.
And out of a sample of 100 tubes we find that they have an average life = 1265 hours. Is the
new process really any better than the old?
3. Urban storm water can be contaminated by many sources, including discarded batteries. When
ruptured, these batteries release metals of environmental significance.
The article “Urban Battery Litter” (J. of Environ. Engr., 2009: 46–57) presented summary data for
characteristics of a variety of batteries found in urban areas around Cleveland.
A sample of 51 Panasonic AAA batteries gave a sample mean zinc mass of 2.06g and a sample
standard deviation of .141g.
Does this data provide compelling evidence for concluding that the population mean zinc mass
exceeds 2.0g?

………………………………………………………………………………………………..
MUHAIRWE ARNOLD
a. The fuel pump at ROKO head office dispenses 80 Litres into the director’s car. The driver
believes the average amount of flue is not 80 litres. Using 40 fuel card samples, he measures
the average amount dispensed by the machine to be 78 litres with a standard deviation of 2.5.
At a 95% confidence level, is there enough evidence to support the idea that the pump is not
working well?
b. ROKO block yard manufactures pavers with an average life span of 2 or more years. An engineer
at Raxio Data Centre construction site believes this value to be less. Using a sample of 10 pavers,
he measures the average life span to be 1.8 years with a standard deviation of 0.15. At a 99%
confidence level, is there enough evidence to discard the null hypothesis?

c. ROKO construction limited is doing the proposed head office for ministry of finance. While
casting the roof slab on Saturday cast various test cubes for testing compressive strengths. The
cubes were measured during the test in May and June 2022 and the results were recorded as
shown in the table below. Use a 5% significance level to test for significance.

CUBE LABELS MAY (psi) JUNE (psi)


TEST CUBE 1 185 169
TEST CUBE 2 192 187
TEST CUBE 3 206 193
TEST CUBE 4 177 176
TEST CUBE 5 225 194
TEST CUBE 6 168 171
TEST CUBE 7 256 228
TEST CUBE 8 239 217
TEST CUBE 9 199 204
TEST CUBE 10 218 195

………………………………………………………………………………………………….

KABUNGA ARTHUR
A given set of data is assumed to be exponential and is described fully by the probability density
function (pdf) as below:
−t
λ
e
f ( t , λ )=
λ

where t is the time taken for some event to occur and λ is a constant rate.
i. What is the likelihood function?
ii. Determine the log likelihood function.
iii. Determine the maximum log likelihood function.
……………………………………………………………………………….

Suppose we have three data points that have been generated by a process that is adequately
described by a Gaussian distribution. The points are 9, 9.5 and 11. Attempt to calculate the
Maximum Likelihood Estimates of the parameter values of the Gaussian (normal) distribution.
……………………………………………………………………………………………………………………………………….

SSEMAKULA PHEASTUS
Gulu District Local Government awarded a contract for Road construction to Kinkizi Traders to
construct a new road from Otema Public – Adak – Awalkok – Idobo – Chome, 31 km within
which the contractor is expected to supply and install the following Concrete culverts:

S/N Diameter Size Number Required


(mm)
01 600 546
02 900 324
03 1200 176
Source: Gulu District Local Government (contracts Document)
It is also required that the minimum compressive strength of these concrete culverts be minimum
45N/ mm2 after 28 days
A sample of 40, of 600 mm Diameter concrete Culverts were randomly selected from the
supplier yard and tested using Insitu Scmidth rebounching Hammer Test and yielded the
following results:
Compressive strength of 600mm Diameter Culvert from Latong and Sons Company Limited in
N/ mm2
52 46 38 44 47 57 53 47 46 48
36 51 42 50 48 56 51 44 45 44
42 58 47 47 46 40 45 46 40 37
56 37 37 46 46 47 46 44 47 48
Source: Gulu District local Government (contract management file)
a. Was this sample size adequate for hypothesis testing? Justify your answer
b. Determine the Confidence Level of the above data.
c. If a culvert was randomly picked and its strength was found to be 50 N/mm2, was this
difference significant?
………………………………………………………………………………………………………………………………………………………

TANYOOMWA RICHARD
a. The following data shows the assessed values and the selling prices (in thousands of
dollars) of eight houses, constituting a random sample of all houses sold recently in the
metropolitan area:

Assessed Value, x Selling Price, y


70.3 114.4
102.0 169.3
62.5 106.2
74.8 125.0
57.9 99.8
81.6 132.1
110.4 174.2
88.0 143.5

i. Draw the scatter plot


ii. Determine the coefficient of correlation, r,
iii. Determine the coefficient of determination, r2. What does the coefficient of
determination you calculated mean?
iv. Write the regression equation.
………………………………………………………………………………………………………………………………………………………….

WILSON WASSWA

a. Test at the α = :05 significance level whether the mean of a random sample of size
n = 16 is statistically significantly less than 10 if the distribution from which the sample was
taken is normal, x = 8:4 and s2= 10:24.
b. A city health department wishes to determine if the mean bacteria count per unit
volume of water at a lake beach is within the safety level of 200. A researcher collected 10 water
samples of unit volume and found the bacteria count to be:

175, 190, 215, 198, 184, 207, 210, 193, 196, 180:

Does the data indicate that the bacteria count is within the safety level? Test at the α = :01 level.
You may assume that the measurements constitute a sample from a normal population.
………………………………………………………………………………………………………………………………

OKEYA JACKOB

1. JOHESA CONSTRUCTION AND ENGINEERING Ltd is a construction company that deals


in building apartments for the purpose of domestic, renting, farm houses, business as well as
industrial shelter on behalf of JOMAYI PROPERTY CONSULTANTS. Over the recent years
it has constructed several shelters for the above mentioned purposes and realized that those it has
built for domestic selling and renting have largely been influenced by the factors independent
variables or predictor variables of:
i. Cost of land in the area of location.
ii. Distance from the nearest town.
iii. Number of rooms in the house
iv. Construction unit rate cost per square meter.
v. Average rent amount per double room in location.
vi. Duration of travel time from Kampala city.

The following are the estimates of the above mentioned factor tabulated that can be used to estimate the
dependent Variables of :
i. The total cost of buying such a house or owning such a facility.
ii. The amount of rent that a tenant may pay monthly
iii. The approximate demand for the constructed households in the area.
iv. The business viability depending on the location and inputs.

NUMBER AVERAGE APPROX.


COST OF DISTANCE OF CONSTRUCTION RENT PER TRAVEL VALUE
LAND FROM ROOMS UNIT RATE PER DOUBLE TIME FROM OF
No. LOCATION [100X100] NEAREST IN HOUSE SQUARE METRE ROOM IN KAMPALA HOUSE
TOWN [UGX] AREA
1.0 Nakawa 48.5m 2.0km 3 32,000 400,000 20mins 350m
2.0 Najera 32.0m 1.5km 5 30,000 280,000 40mins 260m
3.0 Mukono 39.5m 2.0km 4 30,000 300,000 60mins 320m
4.0 Seeta 35.0m 0.5km 4 29,000 295,000 50mins 300m
5.0 Namugongo 42.0m 1.0km 3 28,000 350,000 40mins 400m
6.0 Busega 20.0m 2.5km 5 25,000 270,000 60mins 250m
7.0 Rubaga 38.0m 1.0km 3 29,800 320,000 30mins 300m
8.0 Kisasi 24.0m 1.5km 4 28;000 275,000 45mins 280m
9.0 Mutundwe 25.0m 3.0km 3 30,000 250,000 30mins 240m
10.0 Bweyogerere 33.0m 1.4km 3 32,000 340,000 40mins 350m
11.0 Kawempe 28.8m 0.5km 4 31,000 300,000 30mins 320m
12.0 Wakiso 27.5m 3.5km 4 25,000 260,000 60mins 275m
13.0 Kasangati 32.0m 2.0km 3 28,700 280,000 45mins 290m
14.0 Mengo 40.0m 0.5km 3 32,500 300,000 20mins 340m
15.0 Ntinda 35.0m 1.8km 4 34,000 320,000 25mins 450m
Source [JOHESA Construction and Engineering Ltd and JOMAYI PROPERTY
CONSULTANTS Annual Report(s), 2016]

Note: 1. The above are only time estimates for travel but are certain to change with constraints of
jam and time of the day.
2. The cost of land is also subject to other constraints of topography, nearness to swamp
etc but we have considered an average price of a plot on flat land in the area.

i. Calculate the multiple correlation coefficient.


ii. Determine the multiple coefficient of determination. What does it mean?
iii. Determine the multiple linear regression equation using the least squares
method.
iv. Use the multiple regression equation to predict the cost of a channel when the
mean rainfall was 30mm, causing a delay of 4 days and elapsed duration of 6
weeks. Comment on the likely accuracy of your prediction
v. Perform a multicollinearity test (is there evidence of a correlation among
variables?
vi.Test the multiple regression slopes for significance.
vii. List down the most common assumptions of multiple linear regression.

You might also like