Kathmandu University
Department of Computer Science and Engineering
Subject: Introduction to Data Science Course Code: AICC 202
Level: BTech in AI 2nd year 1st semester Credit Hours: 3
Type: Core [Theory + Practical]
Course Description:
The course helps students to understand the basic concepts of Data Science like information extraction
from the vast amounts of data using different scientific methods. It explains how the data is
manipulated and processed to unravel useful underlying information from the raw and unstructured
data. The course follows an example-based approach in terms of providing a better understanding on
how data science techniques are applied in the real-world problems. Furthermore, the course provides
insights to the ethical use of data, which has become very crucial with the rampant and abundant
generation of data.
Course Objective:
• To introduce the basic concepts and techniques of Data Science.
• To give an insight to data visualization
• To understand the current scenario and application of Data Science.
• To understand how Data Science techniques are used in real-world problems.
Prerequisites
It is expected that students have prior knowledge of mathematical preliminaries such as Probabilities
and Statistics, Calculus and Linear Algebra. Besides, students should have the knowledge of high-
level programming languages like C and C++ or Python to understand the concepts of Data Science
and implement projects.
Course Evaluation:
Internal Examination: 50%
Final Examination: 50%
Chapter 1: Introduction [8 Hrs.]
• Computer Science, Data Science, and Real Science
• Current Trends in Data Science
• Data Types and Properties
• Data Science Hierarchy
§ Data Engineering
§ Data Analytics
§ Data Science
§ Business Intelligence
• Data Science Lifecycle
• Data Science Applications
§ Classification, Regression and Clustering
Chapter 2: Data Wrangling [8 Hrs.]
• Languages for Data Science
• Data Discovery
• Data Structuring
• Data Cleaning
• Data Sampling Techniques
• Data Enriching
• Feature Selection
• Data Validating
• Data Publishing
Chapter 3: Scores and Rankings [4 Hrs.]
• The Body Mass Index (BMI)
• Developing Scoring Systems
• Advanced Ranking Techniques
• Arrow's Impossibility Theorem
Chapter 4: Visualizing Data [6 Hrs.]
• Exploratory Data Analysis (EDA)
• Developing a Visualization Aesthetic
• Chart Types
• Data Visualization Perspectives
• Reading Graphs
• Interactive Visualization
• Common tools for Data Visualization
Chapter: Data Analysis [9 Hrs.]
• Descriptive Analysis
• Diagnostic Analysis
• Predictive Analysis
• Prescriptive Analysis
• Case Studies
Chapter 5: Big Data [6 Hrs.]
• Introduction to Big Data
• Algorithmics for Big Data
• Parallelism
• MapReduce
Chapter 6: Future Trends in Data Science [4 Hrs.]
• Current Operational trends
• Next-Generation Data Scientists
• Cultivating Soft Skills
• Being an Ethical Data Scientist
Text Books
1. Steven S. Skiena. The Data Science Design Manual. Springer Cham, 2017
2. Cathy O'Neil, Rachel Schutt. Doing Data Science. O'REILLY Media, Inc., 2013
Reference Books:
1. Joel Grus. Data Science from Scratch 2e: First Principles with Python. O’REILLY, 2019
2. Ivo D. Dinov. Data Science and Predictive Analytics. Springer, 2018