Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
15 views2 pages

R Practical Sheet

The document outlines a series of exercises involving data analysis and statistical calculations using R. It includes tasks such as grouping weights of candy bars, cleaning and analyzing student visa data, examining vehicle registration data in relation to population, and analyzing wages of prisoners. Additionally, it involves calculating sample statistics for telephone call data and assessing the applicability of the Poisson distribution.

Uploaded by

Abu Obaidha Arin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views2 pages

R Practical Sheet

The document outlines a series of exercises involving data analysis and statistical calculations using R. It includes tasks such as grouping weights of candy bars, cleaning and analyzing student visa data, examining vehicle registration data in relation to population, and analyzing wages of prisoners. Additionally, it involves calculating sample statistics for telephone call data and assessing the applicability of the Poisson distribution.

Uploaded by

Abu Obaidha Arin
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

R practical Sheet

Exercise-

Before Halloween two bags of miniature Clark bars were purchased and each bar was weighed (in
grams), 19 in the first bag and 23 in the second bag, yielding the following weights;

19.8 20.3 20.2 19.6 20.0 19.4 15.4 19.9 20.5 20.5 19.8
15.5 21.8 20.0 15.4 20.6 15.7 20.4 21.3 16.7 15.6 14.2
15.9 16.8 15.2 15.0 16.0 16.4 15.4 16.0 16.1 17.0 16.0
15.1 14.1 15.8 15.7 16.6 15.2 15.5 14.9 15.3

(i) Group these weights using as class boundaries 13.95-14.95, 14.95-15.95 and so on and construct
a relative frequency histogram.
(ii) Calculate the values of the sample mean and sample standard deviation .
(iii) Locate on your histogram .
(iv) Give an interpretation about the distribution of the data.

Exercise-

The .csv file of “Destination wise of Indian national students going abroad on student visa” is
given. The file gives the data for three consecutive years 2029, 2020 and 2021.

All the students must do the following:


1) Clean the data i.e. remove rows with no data present. Store it in a new file with proper headers.

2) Extract data for the following groups


a. Range of students going abroad in 2019 0 – 50
b. Range of students going abroad in 2019 51 – 100
c. Range of students going abroad in 2019 101 – 500
d. Range of students going abroad in 2019 501 – 1000
e. Range of students going abroad in 2019 1001 – 5000
f. Range of students going abroad in 2019 5001 – 10000
g. More than 10,000

1) Pick up the subdata from b to g.


2) Analyse the data of your subdata.
3) Pick out the top ranked universities in the countries of your subdata.

Exercise 3
- Go to https://data.gov.in/catalog/total-number-registered-motor-vehicles-india
- Click on the CSV file of Total Number of Registered Motor Vehicles in India during
1951-2012
- You may save the file as Registered_motors.csv
2

- Is the number of total vehicles registered in India correlated with the population? (You will
have to search for the population of India)

Exercise 4
- Go to https://data.gov.in/catalog/details-wages-payments-prisoners
- Click on the CSV file of Details of wages payments of prisoners during 2001-2012
- You may save the file Convict_wages.csv
- Go to https://data.gov.in/catalog/state-wise-literacy-rates
- Click on the Excel file of State-wise Literacy Rates (1951-2001)
- Convert it to .csv file

Exercise 5

Let X equals the number of telephone call per hour that are received by 911 between midnight and noon and
reported in the Holland Sentinel. On October 29 and October 30, the following numbers of calls were reported:

(a) Calculate the sample mean and sample variance for these data? Are they approximately equal to each other?
(b) With draw a probability histogram for the Poisson distribution and a relative frequency histogram of the
data on the same graph.
(c) Could Poisson distribution with be a reasonable probability model based on these limited data?

You might also like