Institute of Engineering & Technology, Devi Ahilya University, Indore, (M.P.), India.
(Scheme Effective from July 2016)
Devi Ahilya University, Indore, India IVYear B.E. (Computer Engg.)
Institute of Engineering & Technology (Full Time)
Subject Code & Name Instructions Hours per Credits
Week
CER8C3 L T P L T P Total
Data Sciences 3 1 2 3 1 1 5
Duration of Theory
Paper: 3 Hours
Learning Objectives:
This course will introduce students to data science basic principles and tools as well as its
general mindset.
Students will learn concepts, techniques and tools they need to deal with various facets of
data science practice, including data collection and integration, exploratory data analysis,
predictive modeling, descriptive modeling, data product creation, evaluation, and e
ective communication.
Prerequisites: Basic knowledge of algorithms and sufficient programming experience and
familiarity with basic linear algebra, probability and statistics.
COURSE OF CONTENTS
Unit 1 : Introduction to Data Management
What is Data Science?, Data Science Languagues, Data Warehousing & OLAP, Data Preparation,
Data Wrangling etc.
Unit 2 : Statistics and EDA
Descriptive Statistics, Inferential Statistics, Exploratory Data Analysis, Hypothesis Testing etc.
Unit 3 : Machine Learning
Linear Regression, Supervised Classification – K-Nearest Neighbors, Clustering – K-Means etc.,
Decision Trees, Support Vector Machine and Neural Networks etc.
Unit 4 : Big Data Analytics
Introduction to Big Data and Hadoop, Managing Big Data, Introduction to SPARK, Big Data
Analysis etc.
Unit 5 : Advance Topics
Mining Social Nework graphs; Privacy, Security and Ethical Issues in Data Science; Data
Visualization; Recommended Systems etc.
RECOMMENDED BOOKS
[1] Cathy O'Neil and Rachel Schutt. Doing Data Science, Straight Talk From The Frontline.
O'Reilly. 2014.
[2] Jiawei Han, Micheline Kamber and Jian Pei. Data Mining: Concepts and Techniques, Third
Edition. ISBN 0123814790. 2011.
Institute of Engineering & Technology, Devi Ahilya University, Indore, (M.P.), India.
(Scheme Effective from July 2016)
[3] Trevor Hastie, Robert Tibshirani and Jerome Friedman. Elements of Statistical Learning,
Second Edition. ISBN 0387952845. 2009.
[4] Foster Provost and Tom Fawcett. Data Science for Business: What You Need to Know about
Data Mining and Data-analytic Thinking. ISBN 1449361323. 2013.
[5] Allen Downey. Think Python. Oreilly. First Edition. ISBN 144933072X. 2012.
[6] Joel Grus. Data Science from Scratch. Shroff. ISBN 9352130960. 2015.
Learning Outcomes:
At the conclusion of the course, students should be able to:
Describe what Data Science is and the skill sets needed to be a data scientist.
Use Python to carry out basic statistical modeling and analysis.
Apply EDA and the Data Science process in a case study.
Apply basic machine learning algorithms (Linear Regression, k-Nearest Neighbors (k-
NN),K-Means) for predictive modeling.
Create e
ffective visualization of given data, reason around ethical and privacy issues in data science
conduct and apply ethical practices.