Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
7 views6 pages

Course+Template Syllabus

CSL227 Applied Computational Statistics is a 4-credit open elective course offered by the Department of Computer Science and Engineering, focusing on statistical methods and their applications. The course covers topics such as descriptive statistics, probability, discrete and continuous distributions, inferential statistics, and Bayesian statistics, with a combination of lectures and practical lab work. Students will learn to analyze datasets, calculate probabilities, and apply statistical models to real-world problems.

Uploaded by

ishika.ag
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
7 views6 pages

Course+Template Syllabus

CSL227 Applied Computational Statistics is a 4-credit open elective course offered by the Department of Computer Science and Engineering, focusing on statistical methods and their applications. The course covers topics such as descriptive statistics, probability, discrete and continuous distributions, inferential statistics, and Bayesian statistics, with a combination of lectures and practical lab work. Students will learn to analyze datasets, calculate probabilities, and apply statistical models to real-world problems.

Uploaded by

ishika.ag
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

CSL227 Applied Computational Statistics

COURSE TEMPLATE
1. Department:
Department of Computer Science and Engineering
3. Course Code 4. L-T-P 5. Credits
2. Course Name: Applied Computational
Statistics CSL 227 3-0-2 4

6. Type of Course
(Check one): Programme Core Programme Elective ✓ Open Elective

7. Pre-requisite(s), if any: Programming for Datascience (Python)

8. Frequency of offering (check one): Odd Even ✓ Either semester Every semester

9. Brief Syllabus:

Introduction to Statistics,Types of Data (Quantitative, Qualitative, Logical),Level of measurement of data),


Fundamentals of Descriptive Statistics (moments- Measures of Central Tendency, Measure of spread, Measure of
Shape), Data Visualisation(Histogram, Scatter plots, Box plots,Pie Charts and Line Chart, Bar Plot).Introduction to
probability,Applications,Events (Mutually Exclusive, Disjoints, Independent), Counting Methods (Permutations and
Combinations), Joint, Conditional Probability,Introduction, Probability Mass Function, Cumulative Distribution
Function, Geometric Distribution, Binomial Distribution, Poisson Distribution,Probability Density Functions,
Cumulative Distribution Function,NormalDistribution,Tdistribution,Chi Square Distribution,Sample Statistics and
Population Parameters, Sampling methods, Estimations (Point and Intervals- Confidence intervals with means,
sample, proportions),Central Limit Theorem,Introduction to hypothesis Testing, Confidence Intervals, Critical Value
based approach, P-value based approach, Z Tests, T Tests, the chi-square distribution, ANOVA/ANCOVA.,Bayes’s
Theorem, Computational Statistics, Estimation 1, Estimation 2, Odds, Decision Analysis, Prediction, Observer Bias,
Two Dimensions, Approximate Bayesian Computation, Baysian, Evidence, A Hierarchical Model, Dealing with
Dimensions.
Total lecture, Tutorial and Practical Hours for this course (Take 15 teaching weeks per semester): 90 hours

Practice
Lectures: 45 hours
Tutorials :NA Lab Work: 30 hours
10. Course Outcomes (COs)
On successful completion of this course students will be able to:

CO 1 Apply appropriate descriptive statistical and exploratory methods in the analysis of datasets

Calculate probabilities, and derive the marginal and conditional distributions of bivariate random
CO 2
variables.
Understand the probability mass function and various discrete distributions through application on
CO 3
real world examples
Understand the probability density function and various continuous distributions through application
CO 4
on real world examples
CO 5 Understand and interpret statistical hypothesis test
CO 6 Translate real-world problems into probability models using Bayesian Statistics.
CSL227 Applied Computational Statistics

11. UNIT WISE DETAILS No. of Units:6

Unit Number: 1 Title: Introduction to Classical (Frequentist) Statistics No. of hours: 8

Content Summary:
Introduction to Statistics,Types of Data (Quantitative, Qualitative, Logical),Level of measurement of data),
Fundamentals of Descriptive Statistics (moments- Measures of Central Tendency, Measure of spread, Measure of
Shape), Data Visualisation(Histogram, Scatter plots, Box plots,Pie Charts and Line Chart, Bar Plot).

Unit Number: 2 Title: Probability and Combinatorics No. of hours: 6

Content Summary:
Introduction to probability,Applications,Events (Mutually Exclusive, Disjoints, Independent), Counting Methods
(Permutations and Combinations), Joint, Conditional Probability.
Unit Number: 3 Title: Discrete Distributions No. of hours: 8

Content Summary:
Introduction, Probability Mass Function, Cumulative Distribution Function, Geometric Distribution, Binomial
Distribution, Poisson Distribution.
No. of hours: 10
Unit Number: 4 Title: Continuous Distributions

Content Summary:
Probability Density Functions, Cumulative Distribution Function,NormalDistribution,Tdistribution,Chi Square
Distribution.

No. of hours: 10
Unit Number: 5 Title: Inferential Statistics

Content Summary:
Sample Statistics and Population Parameters, Sampling methods, Estimations (Point and Intervals- Confidence
intervals with means, sample, proportions),Central Limit Theorem,Introduction to hypothesis Testing, Confidence
Intervals, Critical Value based approach, P-value based approach, Z Tests, T Tests, the chi-square distribution,
ANOVA/ANCOVA.
No. of hours: 3
Unit Number: 6 Title: Bayesian Statistics

Content Summary:
Bayes’s Theorem, Computational Statistics, Estimation 1, Estimation 2, Odds, Decision Analysis, Prediction,
Observer Bias, Two Dimensions, Approximate Bayesian Computation, Baysian, Evidence, A Hierarchical Model,
Dealing with Dimensions.
12. Brief Description of Self-learning components by students (through books/resource material etc.): NA

13. Books Recommended :

Textbooks:
1. Business Statistics: For Contemporary Decision Making, Ken Black, 9th Edition ,Wiley Publisher,2017
2Head First Statistics,.Dawn Griffiths, O’Reilly media Inc., 2019

Reference Books:
1. Introduction to Statistical Learning using Python,Thomas Haslwanter,Springer,2016
CSL227 Applied Computational Statistics

2.Timothy C Urdan, Statistics in Plain English, Taylor and Francis Publisher, 2010

Reference Websites: (nptel, swayam, coursera, edx, udemy, lms, official documentation weblink)
• https://www.coursera.org/learn/probability-intro/

• https://www.coursera.org/learn/bayesian/
• www.lms.ncuindia.edu/lms/

Practice (Tutorial/Case Studies/ Industry Visit/Field Work) Content

Sr. No. Topic Unit


covered
1. Calculate Descriptive Statistics from the data such as mean, median, mode, standard
1
deviation, skewness, kutosis.
2. Find Interquartile range, Quarter 1, Quarter 2, Quarter 3 in data. 1
3. A card is selected at random from an ordinary deck of 52 playing cards. If A is the
event that the selected card is an ace and H is the event that selected card is a heart, 2
then find the probability of both the events.
4. A problem is given to three persons P, Q, R whose respective of solving it are 2/7, 4/7,
2
4/9. Find the probability that problem is solved.
5. (i)Basketball player LeBron James makes a free throw shot about 74% of the time,
Find the probability that the first free throw shot LeBron makes occurs on the thirs or
fourth attempt. 2
(ii) Find the probability that LeBron makes his first free throw shot before his third
attempt.
6. From a standard deck of cards, you pick a card, note whether it is club or not, and
replace the card. Repeat the experiment five times. The outcome of each trial classified 3
into two categories S = selecting a club and F = selecting another suit.
7. Microfracture knee surgery has a 75% chance of success on patients with
degenerative knees. The surgery is performed on three patients. Find the probability of 3
the surgery being successful on exactly two patients.
8. A survey indicates that 41% of women in the US consider reading their favorite leisure-
time activity. You randomly select four US women and ask them of reading is their
3
favorite leisure-time activity. Find the probability that - a) exactly two of them respond
yes. b) at least two of them respond yes. c) fewer than two of them respond yes.
9. In Pittsburgh, Pennsylvania, about 56% of the days in a year are cloudy. Find the
mean, variance, standard deviation for the number of cloudy days during the month of 3
June. Interpret the results and determine any unusual values.
10. The mean number of accidents per month at a certain intersection is three. Find the
3
probability that in any given month four accidents will occur at this intersection.
11. Consider an experiment that consists of counting the number of alpha particles given
off in a one-second interval by one gram of radioactive material. If we know from past
3
experience that, on the average, 3.2 such alpha particles re given off, what is a good
approximation to the probability that no more than 2 alpha particles will appear?
12. Suppose you are expecting a message at some time past five pm. From experience
you know that X, the number of hours after five pm, until the message arrives, is a
random variable with pdf:
3
f(x) = 1/1.5 if 0<x<1.5
0 otherwise
Find its expectation.
13. Suppose that a number of miles that a car can run before its battery wears out is
3
exponentially distributed with an average values of 10,000 miles. If a person desires to
CSL227 Applied Computational Statistics

take a 5,000 mile trip, what is the probability that she will be able to complete her car
battery?
14. If X is a random variable with mean three and variance sixteen, find
a) P{ X<11 }
4
b) P { X>-1 }
c) P{ 2<X<7 }
15. The mean of candy for population in each candy is ten and variance is one. Someone
bought thirty pockets. The average number of candies per packet in his sample is 8.5. 4
Find the probability of happening this scenario again.
16. A principal at a certain school claims that the students in his school are above average
intelligence. A random sample of thirty students IQ scores have a mean score of 112. 4
Is there sufficient evidence to support the principal’s claim? The mean population IQ is
100 with a standard deviation of 15.
17. Blood glucose levels for obese patients have a mean of 100 with a standard deviation
of 15. A researcher thinks that a diet high in raw cornstarch will have a positive or
negative effect on blood glucose levels. A sample of 30 patients who have tried the raw 4
cornstarch diet have a mean glucose level of 140. Test the hypothesis that the raw
cornstarch had an effect.
18. A public opinion poll surveyed a simple random sample of 1000 voters. Respondents
were classified by gender (male or female) and by voting preference (Republican,
Democrat, or Independent). Results are shown in the contingency table below

Voting Preferences Row total


Rep Dem Ind 5
Male 200 150 50 400
Female 250 300 50 600
Column total 450 450 100 1000
Is there a gender gap? Do the men's voting preferences differ significantly from the
women's preferences? Use a 0.05 level of significance.
19. If dangerous fires are rare (1%) but smoke is fairly common (10%) due to barbecues,
and 90% of dangerous fires make smoke then what is the probability of dangerous fire 5
when there is smoke?
20. You are planning a picnic today, but the morning is cloudy

Oh no! 50% of all rainy days start off cloudy!


5
But cloudy mornings are common (about 40% of days start cloudy)
And this is usually a dry month (only 3 of 30 days tend to be rainy, or 10%)
What is the chance of rain during the day?
21. You might be interested in finding out a patient’s probability of having liver disease if
they are an alcoholic. “Being an alcoholic” is the test (kind of like a litmus test) for liver
disease.

A could mean the event “Patient has liver disease.” Past data tells you that 10% of
patients entering your clinic have liver disease. P(A) = 0.10. 6
B could mean the litmus test that “Patient is an alcoholic.” Five percent of the clinic’s
patients are alcoholics. P(B) = 0.05.
You might also know that among those patients diagnosed with liver disease, 7% are
alcoholics. This is your B|A: the probability that a patient is alcoholic, given that they
have liver disease, is 7%.
22. In a particular pain clinic, 10% of patients are prescribed narcotic pain killers. Overall,
five percent of the clinic’s patients are addicted to narcotics (including pain killers and
6
illegal substances). Out of all the people prescribed pain pills, 8% are addicts. If a
patient is an addict, what is the probability that they will be prescribed pain pills?
CSL227 Applied Computational Statistics

Practical Content

Sr. No. Title of the Experiment Software based Unit Time


cove Required
red
Write a python program to visualise your data using 2hrs
1. Python (Jupyter) 1
seaborn library.
Write a python program to use boxplot for finding the 2hrs
2. Python (Jupyter) 1
outliers in your data.
Write a python program to find descriptive statistics 2hrs

3. like mean, median, mode, standard deviation, Python (Jupyter) 1


skewness.

4. Write a python program to bootstrap your data. Python (Jupyter) 3 2hrs


Write a python program to draw probability density
5. Python (Jupyter) 3 2hrs
function.
Write a python program to find cumulative
6. Python (Jupyter) 3 2 hrs
distributive function.
Write a python program to find confidence interval
7. Python (Jupyter) 3 2 hrs
for known variance.
Write a python program to find confidence interval
8. Python (Jupyter) 3 4 hrs
for unknown variance.
Write a python program to find hypothesis test for
9. Python (Jupyter) 3 4 hrs
known variance.
Write a python program to find hypothesis test for
10. Python (Jupyter) 3 4 hrs
unknown variance.

Value Added Experiments

Write a program to plot histogram, pdf, cdf using


1. Python (Jupyter) 4 2 hrs
thinkstats library.
Write a program to compute bayes theorem using
2. Python (Jupyter) 6 2 hrs
thinkbayes library.
CSL227 Applied Computational Statistics

Project (To be done as individual/in group): NA

Evaluation Scheme

ALLOTTED
TYPE OF COURSE PARTICULAR RANGE OF PASS CRITERIA
MARKS

Minor Test 15%

Major Test 35%

Continuous Must Secure 30% Marks Out of


Evaluation Through
Class Combined Marks of Major Test Plus
Theory+ Practical 10%
(L-T-P/L-0-P) Tests/Practice/Assig Minor Test with Overall 40% Marks in
nments/Presentation/
Quiz Total.

Online Quiz 5%

Lab Work 35%

PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2 PSO3
CO1 2 1 1 2 3 2 1 3 2 1 1 2 3 1 2
CO2 2 2 1 2 3 2 1 3 2 1 1 3 2 1 2
CO3 2 3 2 2 3 2 2 3 2 2 2 3 3 1 2
CO4 2 1 2 2 3 2 2 3 3 3 2 3 3 1 2
CO5 3 2 2 3 3 3 2 3 3 3 3 3 3 1 2
CO6 2 1 1 2 3 2 1 3 2 1 1 2 3 1 2

You might also like