MTechSyllabus CSE DataScience
MTechSyllabus CSE DataScience
Institute Department
National Institute of Technology Computer Science and Engineering
Agartala
P.O. NIT Agartala Dr. A.P.J. Abdul Kalam Block,
Tripura (West), India, 799046 NIT Agartala, Tripura (West)
Tel:(0381) 2346630 / 2348511 India, 7990046
Fax: (0381) 2346360
Email: [email protected]
http://www.nita.ac.in
M.Tech. in Data Science and Engineering
1
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
PO1: To develop the ability to apply knowledge of mathematics, engineering sciences for
conducting independent research/investigation for solving practical problems.
PO2: To develop the ability to identify, formulate, conduct experiments, interpret data,
synthesize information, and analyze engineering problems by writing and presenting an
effective technical report/document.
PO3: To develop the ability to demonstrate mastery over the area as per the program's
specialization. The knowledge should be at a level higher than the requirements in the
appropriate bachelor's program.
PO4: To develop problem-solving ability to design solutions for complex engineering
problems in the context of societal and environmental commitments.
PO5: To demonstrate the capability of functioning effectively as a member or team leader in
software projects considering multidisciplinary environments, thus solving real-world
multifaceted problems.
2
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
PO6: To develop design thinking capabilities for innovation and contribute to technological
knowledge and intellectual property development.
3
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Title of Curriculum
M.Tech. in Data Science and Engineering
Under Computer Science and Engineering Department, NIT Agartala
Objectives of the Data Science & Engineering (DSE) Specialization
The past two decades has witnessed the involvement of IT enabled services in every sector. With the
proliferation of social media services, the dynamics of the World Wide Web has shifted from data
consumption to a data generation environment. The social media services have enabled not only
organizations but also individual users as the content providers. The Internet traffic is increasing
exponentially and so is the volume of data. Applications such as social media, healthcare, e-commerce,
weather forecast, traffic monitoring, etc., are producing massive amounts of data, the so-called “BIG
DATA”, at an unprecedented scale. This has led to a critical need for skilled professionals, popularly
known as Data Scientists, who can mine and interpret the data. Making sense of this massive data is an
exceedingly difficult challenge for scientific, technological, and industrial disciplines. Unfortunately,
there is a gap between the demand and supply of data scientists and technologists due to the following
reasons:
• Due to the generic nature of Undergraduate courses, they fail to address the issues in this area
in a focused manner.
• There are not many postgraduate courses that focus explicitly on Data Science.
Keeping these factors in mind, the Department of CSE at NIT Agartala, proposes a two-year Masters in
Technology (M.Tech.) program in Data Science and Engineering.
Career opportunities:
This program would provide students an opportunity to learn both foundational and experimental
components of DSE with application of Machine Learning and Deep Learning techniques. A student, on
completion of this program, will be able to undertake industry careers involving innovation and
problem-solving and join the industry as a Data Scientist/Data Analyst/Data Engineer. Along with
courses that provide specialization in DSE, students will also have option to explore some applied
domains such as computer vision, natural language processing, robotics, and software analysis.
Detail Syllabus Annexure -I
Classrooms available YES
Labs available YES
Number of existing Faculty in the areas of Data Science and allied fields in 8
NITA CSE Department
Duration of Program 2 year (4 Semester)
Total Number of Intake 8
Academic eligibility Annexure - A
4
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
5
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Total 0 0 10 10 100
Total 0 0 20 20
Cumulative credit of the course
Semester-I 17 3 5 25 28 800
Semester -II 14 2 9 25 28 800
Semester -III 0 0 10 10 Full 100
Semester -IV 0 0 20 20 Full 300
Total 31 5 44 80 2000
6
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Annexure - A
Eligibility
7
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Annexure- I
Detail Syllabus
Course structure for M. Tech in Data Science and Engineering,
Department of CSE, NIT Agartala
Semester I
LT P
3 ,1, 0: 4 Credits Prerequisites: None
Course Objectives:
1. The course is intended to provide the foundations of the practical implementation
and usage of Algorithms and Data Structures.
2. One objective is to ensure that the student evolves into a competent programmer
capable of designing and analyzing implementations of algorithms and data
structures for different kinds of problems.
3. Another objective is to expose the student to the algorithm analysis techniques, to
the theory of reductions, and to the classification of problems into complexity
classes.
Detailed syllabus:
MODULE I
Introduction to advanced data structures, Fundamentals of the analysis of algorithms,
Algorithms, Performance analysis- time complexity and space complexity, Asymptotic
Notation-Big Oh, Omega and Theta notations, Complexity Analysis Examples. Data
structures-Linear and nonlinear data structures, ADT concept, Linear List ADT, Recurrences:
The substitution method, Recursive tree method, Masters Method, Probabilistic analysis,
Amortized analysis, Randomized algorithms, Mathematical aspects, and analysis of
algorithms.
MODULE II
Divide and Conquer technique, Binary search tree, AVL-trees, red-black trees, B and B+-
trees, Finding the minimum and maximum, Merge sort, Quick sort, Strassen’s matrix
multiplication. Splay Trees, Binomial Heaps, Fibonacci Heaps, Application of k-D tree (k-
dimensional tree) in range searches and nearest neighbor searches.
MODULE III
Greedy algorithms: Introduction, Knapsack problem, Job sequencing with deadlines,
Minimum cost spanning trees, Kruskal’s algorithm, Prim’s algorithm, Optimal storage on
8
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
tapes, Optimal merge pattern, Subset cover problem, Container loading or Bin packing
problem.
MODULE IV
Dynamic algorithms: Introduction Dynamic algorithms, All pair shortest path, 0/1 knapsack,
Travelling salesman problem, Coin Changing Problem, Matrix Chain Multiplication, Flow
shop scheduling, Optimal binary search tree (OBST), Analysis of All problems, Introduction
to NP-Hard And NP- Complete Problems
More algorithms: Dynamic programming, graph algorithms: DFS, BFS, topological sorting,
shortest path algorithms, network flow problems.
MODULE IV
String Matching: The naïve string-matching algorithm, Rabin Karp algorithm,
KnuthMorrisPratt algorithm (KMP), longest common subsequence (LCS), Fractional
cascading, suffix trees, geometric algorithms.
References:
1. Cormen, Leiserson, Rivest and Stein, Introduction to algorithms (Main textbook)
2. Kleinberg and Tardos , Algorithm Design
3. Mark Weiss, Data structures and algorithm analysis in C++ (Java)
4. Aho, Hopcroft and Ullman, Data structures and algorithms
5. S. Sahni, Data Structures, Algorithms, and Applications in C++, Silicon Press
9
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
CO-PO Mapping:
10
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objective:
1. To understand Data Mining in Knowledge discovery process, and its applications.
2. To understand different data attribute types and apply different data preprocessing
techniques.
3. To understand how to identify association among data objects by learning various
association mining algorithms.
4. To understand the various classification techniques, their applications in different
domains.
5. To understand the various clustering techniques, their applications in different
domains.
6. To learn various data visualization techniques for data analysis.
Detailed syllabus:
MODULE I
Introduction: Data Mining, Motivation, Application, Data Mining—On What Kind of Data?
Data Mining Functionalities, Data Mining Task Primitives, Major Issues in Data Mining.
Data pre-processing: Attribute types, Similarity & Dissimilarity measures.
MODULE II
Data Preprocessing: Data Cleaning, Data Integration, Data Reduction, Data Transformation &
Discretization.
MODULE III
Mining Frequent Patterns: Basic Algorithms, Association Rule Mining, Apriori Algorithm, FP
tree growth Algorithm, Advanced Pattern Mining Techniques.
MODULE IV
Classification Techniques: Decision Tree, Bayes Classification, Bayesian Belief Networks,
Support Vector Machines, Classification Evaluation Techniques, Classification Accuracy
improvement Techniques.
MODULE V
Clustering Techniques: Partitioning algorithms, Hierarchical algorithms, Density-Based
algorithms, Grid-Based algorithms, Evaluation of Clustering. Outlier Detection Techniques.
11
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
MODULE VI
Applications and Trends in Data Mining: Applications, Advanced Techniques, Web Mining,
Web Content Mining, Structure Mining.
Text Books:
1. J. Han and M. Kamber. Data Mining: Concepts and Techniques. 3rd Edition, Morgan
Kaufman. Pang Ning Tan, Introduction to Data Mining, 2nd Edition, Pearson.
2. M. H. Dunham. Data Mining: Introductory and Advanced Topics. Pearson Education.
Roiger & Geatz, Data Mining, Pearson Education
3. A.K.Pujari, Data Mining, University Press
References Books:
1. Charu C. Aggarwal, Data Mining: The Textbook, Springer.
2. I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and
Techniques. Morgan Kaufmann.
3. D. Hand, H. Mannila and P. Smyth. Principles of Data Mining. Prentice, Hall.
12
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
CO-PO Mapping:
13
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
Detailed syllabus:
MODULE I
Basics of Linear Algebra: Representation of vectors; Linear dependence and independence;
vector space and subspaces (definition, examples, and concepts of basis); linear
transformations; range and null space; matrices associated with linear transformations;
special matrices; eigenvalues and eigenvectors with applications to data problems; Least
square and minimum normed solutions.
MODULE II
Matrices in Machine Learning Algorithms: projection transformation; orthogonal
decomposition; singular value decomposition; principal component analysis and linear
discriminant analysis.
MODULE III
Gradient Calculus: Basic concepts of calculus: partial derivatives, gradient, directional
derivatives, Jacobian, Hessian matrix.
MODULE IV
Optimization: Convex sets, convex function, and their properties, Unconstrained and
Constrained Optimization, Numerical Optimization Techniques for
Constrained/Unconstrained Optimization, Derivative-Free methods (Golden Section,
Fibonacci Search Method, Bisecting Method), Methods using Derivatives (Newton’s Method,
Steepest Descent Method), Penalty Function Methods for Constrained Optimization.
MODULE V
Probability: Basic concepts of probability, conditional probability, total probability,
independent events, Bayes’ theorem, random variable, Moments, moment generating
functions, some useful distributions, Joint distribution, conditional distribution,
transformation of random variables, covariance, correlation.
14
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
MODULE VI
Statistics: Random sample, sampling techniques, statistics, sampling distributions, mixture
models.
Text Books:
1. M. P. Deisenroth, A. A. Faisal, C. S. Ong, Mathematics for Machine Learning,
Cambridge University Press (1st edition)
2. S. Axler, Linear Algebra Done Right. Springer International Publishing (3rd edition)
3. J. Nocedal and S. J. Wright, Numerical Optimization. New York: Springer
Science+Business Media
4. E. Kreyszig, Advanced Engineering Mathematics, John Wiley and Sons, Inc., U.K. (10th
Edition)
5. R. A. Johnson, I. Miller, and J. E.Freund, "Miller & Freund’s Probability and Statistics
for Engineers", Prentice Hall PTR, (8th edition)
6. C. Mohan and K. Deep: “Optimization Techniques”, New Age Publishers, New Delhi.
Course Outcomes:
CO-No. Course Outcome
1 To acquire knowledge on various Mathematical concepts to be used in
Machine Learning and Data Science.
2 To apply the concepts of probability and Statistics.
3 To solve the various problems using optimization problems.
4 To solve various problems on data science using different algorithms.
Where Levels: 1: Slight (LOW) 2: Moderate (MEDIUM) 3: Substantial (HIGH) and for NO
CORELATION “--”
15
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Outcomes:
1. Describe how arrays, records, linked structures, stacks, queues, trees, and graphs are
represented in memory and used by algorithms [ABET (a, b, c, i)].
2. Describe common applications for arrays, records, linked structures, stacks, queues,
trees, and graphs [ABET (a, b, c)] .
3. Write programs that use arrays, records, linked structures, stacks, queues, trees, and
graphs [ABET (a, c) ]
4. Demonstrate different methods for traversing trees [ABET (a)].
Programme Outcomes:
Experiment 3 (Hashing)
I. WAP to store k keys into an array of size n at the location computed using a hash
function, loc = key % n, where k<=n and k takes values from [1 to m], m>n. To handle
the collisions use the following collision resolution techniques, a. Linear, Quadratic,
16
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Experiment 4 (BST and Threaded Trees) Experiment 5 (AVL Trees and Red,Black Trees)
Experiment 6 ( B,Trees)
Experiment 7 (Min,Max Heaps, Binomial Heaps and Fibonacci Heaps )
Experiment 8 (Disjoint Sets) Experiment 9 (Graphs Algorithms)
CO3 Write programs that use arrays, records, linked structures, stacks,
queues,trees, and graphs[ABET (a, c) ]
CO4 Demonstrate different methods for traversing trees [ABET (a)].
Experiment 10 (String Matching)
CO-PO Mapping:
17
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objective:
1. Become familiar with basic Python libraries such as NumPy, Pandas, Matplotlib, Scikit-
Learn.
2. To understand and investigate the statistical nature of the data.
3. To understand the importance of data preprocessing techniques.
4. To implement basic data mining algorithms.
List of Experiments
MODULE I
1. Study of Python data types and functions.
2. Study of Python NumPy library to create multi-dimensional arrays and find its shape
and dimension, create a matrix full of zeros and ones, reshape and flatten data in the
array, append data vertically and horizontally, apply indexing and slicing on array.
3. To implement dot and matrix product of two arrays, compute the Eigen values of a
matrix, solve a linear matrix equation, Compute the multiplicative inverse of a matrix,
Compute the rank of a matrix, and compute the determinant of an array.
MODULE II
1. Study of Python Pandas library.
2. Loading data from CSV and Excel file, Compute the basic statistics of given data -
shape, no. of columns, mean, standard deviation.
3. Visualization of the data distribution.
MODULE III
1. To understand the problem of data preprocessing.
2. Load data, describe the given data and identify missing, outlier data items, find
correlation among all attributes, visualize correlation matrix.
3. Apply data transformation techniques- data discretization (binning etc.), data
normalization ((MinMaxScaler or MaxAbsScaler).
MODULE IV
1. Implementation of association rule mining algorithms.
2. Implementation of frequent pattern mining algorithms.
18
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
CO-PO Mapping:
CO1 2 1 2 1 1 --
CO2 2 2 3 1 1 --
CO3 2 2 2 2 1 --
CO4 3 3 3 2 1 1
Total 9 8 10 6 4 1
19
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Semester II
2.1. Machine Learning
L T P
3, 1, 0: 4 Credits Prerequisites: None
Course Objectives:
Detailed Syllabus:
MODULE I
Introduction: Definition of learning systems. Goals and applications of machine learning.
Aspects of developing a learning system: training data, concept representation, function
approximation. The concept learning task. Concept learning as search through a hypothesis
space. General-to-specific ordering of hypotheses. Finding maximally specific hypotheses.
Version spaces and the candidate elimination algorithm. Learning conjunctive concepts. The
importance of inductive bias.
MODULE II
Supervised Learning: Classification vs. Regression, Linear and Logistic Regression, Gradient
Descent, Support Vector Machines, Kernels, Decision Trees, ML and MAP Estimates, K-
Nearest Neighbor, Naive Bayes, Introduction to Bayesian Networks, Artificial Neural
Networks.
MODULE III
Unsupervised Learning: Partitioning based methods, Hierarchical methods, Density based
methods, Gaussian Mixture Models, Learning with Partially Observable Data (EM).
Dimensionality Reduction and Principal Component Analysis.
MODULE IV
Optimization Techniques: Bias-Variance tradeoff, Regularization, Evaluation techniques for
supervised and unsupervised learning.
MODULE V
20
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
MODULE VI
Recommender System: Recommender system functions, understanding ratings, Applications
of recommendation systems, Issues with recommender system, Collaborative Filtering,
Content based recommendation.
Textbooks:
1. T. Mitchell, Machine Learning, McGrawHill.
2. Ethem Alpaydın, Introduction to Machine Learning 3rd Edition, MIT Press
3. Kevin Murphy, Machine Learning: A Probabilistic Perspective, MIT Press, 2012
References:
1. Marc Peter Deisenroth, A. Aldo Faisal and Cheng Soon Ong, Mathematics for
Machine Learning, Cambridge University Press, 2020.
2. Shwartz and David, Understanding Machine Learning: From Theory to Algorithms,
Cambridge University Press.
3. C.M. Bishop, Pattern Recognition and Machine Learning, Springer, 2006.
4. Andrew Ng, Machine Learning Yearning.
5. Other online material.
CO-PO Mapping:
Course Outcome PO-1 PO-2 PO-3 PO-4 PO-5 PO-6
CO-1 2 2 2 1 1 --
CO-2 2 2 1 2 2 --
CO-3 3 3 3 3 2 1
CO-4 3 3 2 1 2 2
Total 10 10 8 7 7 3
Average 2.5 2.5 2 1.75 1.75 0.75
Attainment 3 3 2 2 2 1
21
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
Detailed syllabus:
MODULE I
Introduction to big data: Introduction to Big Data Platform; Challenges of Conventional
Systems, Intelligent data analysis; Nature of Data, Analytic Processes and Tools, Analysis vs
Reporting, the four dimensions of Big Data: volume, velocity, variety, veracity, Drivers for Big
Data, Introducing the Storage, Query Stack, Revisit useful technologies and concepts, Real-
time Big Data Analytics.
MODULE II
Mining data streams: Introduction to Streams Concepts; Stream Data Model and
Architecture, Stream Computing, Sampling Data in a Stream; Filtering Streams; Counting
Distinct Elements in a Stream; Estimating Moments; Counting Oneness in a Window;
Decaying Window, Real time Analytics Platform (RTAP) Applications, Case Studies, Real Time
Sentiment Analysis, Stock Market Predictions.
MODULE III
Distributed File Systems: Hadoop Distributed File System History of Hadoop- the Hadoop
Distributed File System; Components of Hadoop Analyzing the Data with Hadoop- Scaling
Out- Hadoop Streaming- Design of HDFS-Java interfaces to HDFS Basics- Developing a Map
Reduce Application-How Map Reduce Works-Anatomy of a Map Reduce Job Run-Failures-
Job Scheduling-Shuffle and Sort; Task execution, Map Reduce Types and Formats- Map
Reduce Features Hadoop environment. Data Consistency.
MODULE IV
Overview of Spark Ecosystem, Understanding Spark Cluster Modes on YARN, RDDs (Resilient
Distributed Datasets), General RDD Operations: Transformations & Actions, Common Spark
22
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Use Cases, Data Frames and Spark SQL, Analyzing Data with Pig, NoSQL and HBase
MODULE V
Scalable Algorithms: Mining large graphs, with focus on social networks and web graphs.
Centrality, similarity, a 11-distances sketches, community detection, link analysis, spectral
techniques. Map-reduce, Pig Latin, and NoSQL using MongoDB, Algorithms for detecting
similar items, Recommendation systems, Data stream analysis algorithms, clustering
algorithms, Detecting frequent items.
MODULE VI
Frameworks and Big Data Issues: Applications on Big Data Using Pig and Hive; Data
processing operators in Pig; Hive services; HiveQL; Querying Data in Hive, fundamentals of
HBase and ZooKeeper, IBM InfoSphere BigInsights and Streams. Privacy, Visualization,
Compliance and Security, Structured vs Unstructured Data.
Text Books:
1. Ohlhorst, Frank J. Big data analytics: turning big data into big money. Vol. 65. John
Wiley & Sons, 2012.
2. Russom, Philip. "Big data analytics." TDWI best practices report, fourth quarter 19,
no. 4 (2011): 1-34.
3. Marr, Bernard. Big Data: Using SMART big data, analytics and metrics to make better
decisions and improve performance. John Wiley & Sons, 2015.
4. LaValle, Steve, Eric Lesser, Rebecca Shockley, Michael S. Hopkins, and Nina
Kruschwitz. "Big data, analytics and the path from insights to value." MIT sloan
management review 52, no. 2 (2011): 21-32.
5. Leskovec, Jure, Anand Rajaraman, and Jeffrey David Ullman. Mining of massive data
sets. Cambridge university press, 2020.
6. Michael Berthold, David J. Hand, “Intelligent Data Analysis”, Springer, 2007.
7. Tom White “Hadoop: The Definitive Guide” Third Edition, O’reilly Media, 2012.
8. Chris Eaton, Dirk De Roos, Tom Deutsch, George Lapis, Paul Zikopoulos,
“Understanding Big Data: Analytics for Enterprise Class Hadoop and Streaming Data”,
McGrawHill Publishing, 2012.
9. Arshdeep Bahga, Vijay Madisetti, “Big Data Science & Analytics: A Hands On
Approach “,VPT, 2016
23
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
CO-PO Mapping:
CO-1 1 1 1 1 1 --
CO-2 1 2 2 1 1 --
CO-3 2 2 2 1 1 1
CO-4 3 2 2 1 2 1
CO-5 3 3 2 2 2 1
Total 10 10 9 6 7 3
Average 2 2 1.8 1.2 1.4 0.6
Attainment
Eq. Average 2 2 2 1 1 1
Attainment
24
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
LT P Prerequisites: None
0,0,3: 2Credits
Course Objective:
Detailed Syllabus:
MODULE I
Data preprocessing: Introduction to NumPy, Pandas, matplotlib, Scikit-learn.
MODULE II
Supervised Learning: Implementation of Linear and logistic regression, Naïve bayes, Decision
Tree, Support Vector Machines, Neural Networks.
MODULE III
Unsupervised Learning: Implementation of k-means, Agglomerative, DBSCAN,
Dimensionality Reduction and Principal Component Analysis.
MODULE IV
Optimization Techniques: Bias-Variance tradeoff, Cross-validation, Regularization, Precision,
Recall and F-measure.
MODULE V
Other Learning techniques: Implementation of Reinforcement Learning, Recommender
Systems, Anomaly Detection.
MODULE VI
Applications of Machine Learning: Texts, Image, Time-series data.
Textbooks:
25
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
1. Andreas C. Müller and Sarah Guido, Introduction to Machine Learning with Python: A
Guide for Data Scientists, O’Reilly.
2. Other online material.
CO-PO Mapping:
26
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objective:
List of Experiments
MODULE I
Study and implementation of various dimensionality reduction techniques.
1. Implementation of feature selection techniques: Missing values, Low Variance
Filter, High Correlation Filter, Random Forest, Backward Feature Elimination,
Forward Feature Selection.
2. Implementation of Factor Analysis techniques: Principal Component Analysis
(PCA), Independent Component Analysis, Linear Discriminant Analysis, Singular
Valued Decomposition (SVD).
3. Implementation of Projection techniques: Isometric mapping (ISOMAP), t-
distributed Stochastic Neighbor Embedding (t-SNE), Uniform Manifold
Approximation and Projection (UMAP).
MODULE II
Study and implementation of various Vectorization techniques.
1. Implementation of term-frequency-inverse-document-frequency (tf-idf).
2. Implementation of Word2Vec embeddings.
3. Implementation of GloVe embeddings.
4. Implementation of FastText embeddings.
5. Other vectorization techniques.
MODULE III
Study and implementation of classification techniques.
1. Implementation of Decision Tree.
2. Implementation of Naïve Bayes.
MODULE IV
Study and implementation of clustering techniques.
1. Implementation of partitioning-based clustering algorithms: k-means.
27
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
CO-PO Mapping:
CO1 2 2 2 2 1 0
CO2 2 2 3 2 1 1
CO3 2 3 3 3 1 0
CO4 3 3 3 3 1 1
Total 9 10 11 10 4 1
28
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
1. Understand the Database and Big data revolution.
2. To learn about NoSQL databases and their concepts.
3. To comprehend and apply columnar and distributed database patterns.
4. To learn how to use various data models for a wide range of databases.
Detailed syllabus:
MODULE I
Database Revolutions -- System Architecture, Relational Database, Database Design Data
Storage, Transaction Management, Data warehouse and Data Mining, Information Retrieval.
MODULE II
Big Data Revolution -- CAP Theorem, Birth of NoSQL, Document Database - XML Databases,
JSON Document Databases, Graph Databases.
MODULE III
Column Databases -- Data Warehousing Schemes, Columnar Alternative, Sybase IQ, C Store
and Vertica, Column Database Architectures, SSD and In-Memory Databases, In Memory
Databases, Berkeley Analytics Data Stack and Spark.
MODULE IV
Distributed Database Patterns -- Distributed Relational Databases, Non-relational Distributed
Databases, MongoDB, Sharing and Replication, HBase, Cassandra Consistency Models, Types
of Consistency, Consistency MongoDB, HBase Consistency, Cassandra Consistency.
MODULE V
Data Models and Storage -- SQL, NoSQL APIs, Return SQL, Advance Databases—PostgreSQL,
Riak, CouchDB, NEO4J, Redis, Future Databases, Revolution Revisited Counter
revolutionaries, Oracle HQ, Other Convergent Databases, Disruptive Database Technologies.
Text Books:
1. Abraham Silberschatz, Henry F. Korth, S. Sudarshan, “Database System Concepts”,
Sixth Edition, McGrawHill.
2. Guy Harrison, “Next Generation Databases”, Apress, 2015.
29
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
3. Eric Redmond, Jim R Wilson, “Seven Databases in Seven Weeks”, LLC. 2012.
4. Dan Sullivan, “NoSQL for Mere Mortals”, Addison-Wesley, 2015.
5. Adam Fowler, “NoSQL for Dummies “, John Wiley & Sons, 2015.
CO-PO Mapping:
30
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
1. Understand the need for system models that capture random behavior to assess the
risk of undesirable outcomes.
2. Be able to model several important industrial and service systems and analyze those
models to improve system performance.
3. Be able to construct algorithmic solution strategies to explore system models that
have been developed.
Detailed Syllabus:
MODULE I
Introductory Probability: Defining Random Variables (RVs) Events, Measurability,
Independence Sample Spaces, Events, Measures, Probability, Independence, Conditional
probability, Bayes’ theorem Random Variables. RVs: Bernoulli, Binomial, Geometric, Poisson,
Uniform, Exponential, Normal, Lognormal, Expectations, Moments and Moment generating
functions Random Vectors. Random Vectors: Joint and Marginal distributions, Dependence,
Covariance, Copulas, Transformations of random vectors, Order statistics.
MODULE II
Intermediate Probability: Manipulating RVs Conditioning RVs. Conditional Distribution of a
RV, Computing probabilities and expectations by conditioning, RVs Distributions.
Inequalities: Markov, Chebyshev, Jensen, Holder, Convergence of RVs: Weak and Strong
laws, Central limit theorem, Distributions of extreme.
MODULE III
Stochastic Processes: Indexing RVs Markov Chains, Markovian property and Transition
probabilities, Irreducibility and Steady, State probabilities.
Generic Applications: Hidden Markov Chains Exponential Distribution and Poisson Process,
Construction of Poisson Process from Exponential Distribution, Thinning and Conditional
Arrival Times, Service Applications: Waiting Times Normal Distribution and Brownian
Process, Construction of Brownian Process from Normal Distribution, Hitting Times and
Maximum Values, Finance Applications: Option Pricing and Arbitrage Theorem
31
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
References:
1. Introduction to Stochastic Processes. S.M. Ross. Adventures in Stochastic Processes.
S. Resnick. Birkhauser
2. Comparison Methods for Stochastic Models and Risks. A. Muller and D. Stoyan. John
Wiley & Sons Mathematical Theoy of Reliability. R.E. Barlow and F. Proschan.
Course Outcome:
32
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objective:
1. Teach students the leading trends and systems in natural language processing.
2. Make them understand the concepts of morphology, syntax, semantics, and
pragmatics of the language and that they can give the appropriate examples that will
illustrate the mentioned concepts in the syllabus.
3. Teach them to recognize the significance of pragmatics for natural language
understanding.
4. Enable students to be capable to describe the application based on natural language
processing and to show the points of syntactic, semantic, and pragmatic processing.
Detailed syllabus:
MODULE I
Sound: Biology of Speech Processing; Place and Manner of Articulation; Word Boundary
Detection; Argmax based computations; HMM and Speech Recognition.
MODULE II
Words and Word Forms: Morphology fundamentals; Morphological Diversity of Indian
Languages; Morphology Paradigms; Finite State Machine Based Morphology; Automatic
Morphology Learning; Shallow Parsing; Named Entities; Maximum Entropy Models; Random
Fields.
MODULE III
Structures: Theories of Parsing, Parsing Algorithms; Robust and Scalable Parsing on Noisy
Text as in Web documents; Hybrid of Rule Based and Probabilistic Parsing; Scope Ambiguity
and Attachment Ambiguity resolution.
MODULE IV
Meaning and pragmatics: Lexical Knowledge Networks, Wordnet Theory; Indian Language
Wordnets and Multilingual Dictionaries; Semantic Roles; Word Sense Disambiguation; WSD
and Multilinguality; Metaphors; Coreferences. Discourse, Dialogue and Conversational
agents, Natural Language Generation, Machine Translation.
MODULE V
Web 2.0 Applications: Sentiment Analysis; Text Entailment; Robust and Scalable Machine
Translation; Question Answering in Multilingual Setting; Cross Lingual Information Retrieval
33
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
(CLIR).
References:
1. Speech and Language Processing by Daniel Jurafsky, James H. Martin, Second Edition,
Prentice Hall
2. James Allen, "Natural Language Understanding", 2/E, Addison-Wesley, 1994
3. Foundations of Statistical Natural Language Processing by Christopher D. Manning,
Hinrich Schutze, MIT Press.
4. Statistical Language Learning by Charniack, Eugene, MIT Press, 1993.
5. The Handbook of Computational Linguistics and Natural Language Processing,
Alexander Clark, Chris Fox, Shalom Lappin.
6. Steven Bird, Natural Language Processing with Python, 1st Edition, O'Reilly, 2009.
CO No Course Outcome
CO1 Understand the fundamental concept of NLP, Regular Expression, Finite State Automata
along with the concept and application of word tokenization, normalization, sentence
segmentation, word extraction, spell checking in the contextof NLP.
CO2 Understand the concept of Morphology such as Inflectional and Derivational Morphology
and different morphological parsing techniques and scope of ambiguityand it’s resolution.
CO3 Understand the concepts of pragmatics, lexical semantics, lexical dictionary such as
WordNet, lexical computational semantics, distributional word similarity and concepts
related to the field of Information Retrieval in the context of NLP.
CO4 Understand the concepts of Semantic Roles; Word Sense Disambiguation; Multilinguality;
Metaphors; Coreferences. Discourse, Dialogue and Conversational agents, Natural
Language Generation, Machine Translation.
CO5 Understand the concepts related to language modeling with introduction to N-grams,
chain rule, smoothing, spelling and word prediction and their evaluation along with the
concept of Markov chain, HMM, Forward and Viterbi algorithm, POS tagging.
CO6 Describe and apply concepts of discourse machine translation, summarization and
question answering to solve problems in NLP.
34
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
CO1 1 1 1 1 1 1
CO2 2 2 2 2 1 2
CO3 2 2 3 2 1 3
CO4 2 3 3 3 3 3
CO5 3 3 3 3 3 3
CO6 3 3 3 3 3 3
Total 13 14 15 14 12 15
35
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
4. Soft Computing
L T P
4 , 0 , 0 : 4 Credits Prerequisites: None
Course Objectives:
MODULE I
Introduction to Fuzzy sets, Fuzzy t- and s- norms, projection, cylindrical extension, Fuzzy
relations, Implication relations, Fuzzy relational equations, Possibilistic reasoning, Fuzzy
pattern recognition, Introduction to Fuzzy control and Fuzzy databases.
MODULE II
Boltzmann machine and Mean field learning-Combinational optimization problems using
recurrent Neural network. Competitive Learning, Self-organizing maps, Growing cell
structure, Principal component analysis.
MODULE III
Genetic Algorithm: Binary and real codes, Genetic programming, Particle swarm
optimization, Differential Evolution, Bacterial Foraging
MODULE IV
Hybridization of neuro-fuzzy, neuro-GA, neuro-swarm, neuro-evolution algorithms.
Applications in Pattern Recognition, Robotics, and Image Processing.
MODULE V
Belief Networks: Pearl's Model for Distributed Approach of Belief Propagation and Revision
in a causal network, Concepts of D-separation, Bayesian Belief Networks, Dempster-Shafer
theory for Orthogonal summation of Beliefs, Data Fusion techniques, Uncertainty
management using Belief Networks.
MODULE VI
Visual Perception: Marr's 2- and 1/2-Dimensional Vision, 3-D Vision, Camera Model,
36
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
MODULE VII
Advanced Models of Reasoning: Soundness and Completeness issues of Resolution based
Proof procedures in propositional and predicate logic, Herbrand's theorem and Lifting
Lemma, Herbrand interpretation, Temporal Logic, Reasoning with Space and Time,
Distributed Models of Reasoning using Petri Nets, and other graph theoretic approaches.
Text Books:
1. A. Konar, Computational Intelligence: Principles, Techniques, and Applications,
Springer 2005
2. A. P. Engelbrecht, Computational Intelligence
References:
CO-PO Mapping
CO1 3 3 2 3 2 2
CO2 2 2 3 2 1 2
CO3 3 3 2 2 2 2
CO4 2 2 2 2 2 3
CO5 3 3 3 3 3 3
Total 13 13 12 12 10 12
EquivalentAverage 3 3 2 2 2 2
Attainment
37
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
5. REINFORCEMENT LEARNING
LT P
3 - 1 - 0: 4 Credits Prerequisites: None
Course Objectives:
1. Learn how to define RL tasks and the core principals behind the RL, including policies,
value functions, deriving Bellman equations.
2. Understand and work with tabular methods to solve classical control problems.
3. Understand and work with approximate solutions (deep Q network based algorithms).
4. Learn the policy gradient methods from vanilla to more complex cases.
5. Explore imitation learning tasks and solutions.
Detailed Syllabus:
MODULE I
Foundation: Introduction and Basics of RL, Defining RL Framework and Markov Decision
Process, Polices, Value Functions and Bellman Equations, Exploration vs. Exploitation, Code
Standards and Libraries used in RL (Python/Keras/Tensorflow)
MODULE II
Tabular methods and Q-networks: Planning through the use of Dynamic Programming and
Monte Carlo, Temporal-Difference learning methods (TD(0), SARSA, Q-Learning), Deep Q-
networks (DQN, DDQN, Duelling DQN, Prioritised Experience Replay)
MODULE III
Policy optimization: Introduction to policy-based methods 10. Vanilla Policy Gradient 11.
REINFORCE algorithm and stochastic policy search 12. Actor-critic methods (A2C, A3C) 13.
Advanced policy gradient (PPO, TRPO, DDPG)
MODULE IV
Recent Advances and Applications: Model based RL, Meta-learning, Multi-Agent
Reinforcement Learning, Partially Observable Markov Decision Process, Ethics in RL, Applying
RL for real-world problems
Text Books:
1. Sutton, Richard S., and Andrew G. Barto. “Reinforcement learning: An introduction,”
First Edition, MIT press.
2. Sugiyama, Masashi. “Statistical reinforcement learning: modern machine learning
approaches,” First Edition, CRC Press.
38
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
3. Boris Belousov, Hany Abdulsamad, Pascal Klink, Simone Parisi, and Jan Peters
“Reinforcement Learning Algorithms: Analysis and Applications,”First Edition,
Springer
Reference Books:
Course Course
Outcome Outcome
No
CO1 Learn how to define RL tasks and the core principals behind the RL, including
policies, value functions, deriving Bellman equations.
CO2 Understand and work with tabular methods to solve classical control
problems.
CO3 Understand and work with approximate solutions (deep Q network-based
algorithms)
CO4 Learn the policy gradient methods from vanilla to more complex cases.
CO5 Explore imitation learning tasks and solutions.
CO-PO Mapping:
CO1 2 2 1 1 0 0
CO2 2 2 2 1 0 0
CO3 2 2 1 1 0 1
CO4 2 2 1 1 1 1
CO5 2 2 2 1 3 1
Total 10 10 7 5 4 3
Eq. AverageAttainment 2 2 1 1 1 1
39
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
1. To introduce concepts in intrusion detection systems
2. To study and analysis, the different Intrusion Detection System Models.
3. To investigate the tools and methods of information assurance.
4. To investigate and simulate network and application security.
5. To explore the nature of secure intrusion detection system.
Detailed Syllabus:
MODULE-I
Introduction to IDS: Intruder types, intrusion methods, processes and detection, message
integrity and authentication, honey pots
MODULE-II
IDS Models: General IDS model and taxonomy, data mining based IDS, Denning model,
Framework for constructing features, and different models for intrusion detection systems,
SVM, probabilistic, and statistical modelling, evaluation of IDS, cost sensitive IDS
MODULE-III
Network Security Threat Detection: NBAD, specification based and rate based DDOS,
scans/probes, predicting attacks, network based anomaly detection, stealthy surveillance
detection; defending against DOS attacks in scout, signature-based solutions, snort rules
MODULE-IV
Host based Threat Detection: Host-based anomaly detection, taxonomy of security flaws in
software, self-modelling system calls for intrusion detection with dynamic window size.
MODULE-V
Secure Intrusion Detection Systems: Network security, secure intrusion detection
environment, secure policy manager, secure IDS sensor, alarm management, intrusion
detection system signatures, sensor configuration, signature and intrusion detection
configuration, IP blocking configuration, intrusion detection system architecture.
40
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Text Books:
CO1 Apply the intrusion detection system concepts for basic data
science problem
CO2 Utilize the different Intrusion Detection System Models for data
science network security and analysis.
Utilize the different open-source tools and methods information
CO3
assurance for data science.
CO4 Demonstrate intrusion detection system using network security
tool.
Implement Firewall design principles and identify various
CO5 intrusion detection systems and be able to achieve highest system
security
CO-PO Mapping:
41
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
7. COMPUTER VISION
LT P Prerequisites: None
4, 0, 0: 4 Credits
Course Objectives:
Detailed Syllabus:
MODULE I
Digital Image Formation and low, level processing: Overview and State of the art,
Fundamentals of Image Formation, Transformation: Orthogonal, Euclidean, Affine,
Projective, etc, Fourier Transform, Convolution and Filtering, Image Enhancement,
Restoration, Histogram Processing. Depth estimation and Multi camera views: Perspective,
Binocular Stereopsis: Camera and Epipolar Geometry, Homography, Rectification, DLT,
RANSAC, 3D reconstruction framework, Auto calibration.
MODULE II
Feature Extraction: Edges , Canny, LOG, DOG, Line detectors (Hough Transform), Corners ,
Harris and Hessian Affine, Orientation Histogram, SIFT, SURF, HOG, GLOH, Scale, Space
Analysis, Image Pyramids and Gaussian derivative filters, Gabor Filters and DWT.
Image Segmentation: Region Growing, Edge Based approaches to segmentation, Graph, Cut,
Mean, Shift, MRFs, Texture Segmentation, Object detection.
MODULE III
Motion Analysis: Background Subtraction and Modeling, Optical Flow, KLT, Spatio, Temporal
Analysis, Dynamic Stereo, Motion parameter estimation.
Shape from X: Light at Surfaces, Phong Model, Reflectance Map, Albedo estimation,
Photometric Stereo, Use of Surface Smoothness Constraint, and Shape from Texture, color,
motion and edges.
MODULE IV
Miscellaneous: Applications: CBIR, CBVR, Activity Recognition, computational photography,
42
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Text Books:
1. Szeliski, R., Computer Vision: Algorithms and Applications, Springer, Verlag London .
2. Forsyth, A., D. and Ponce, J., Computer Vision: A Modern Approach, Pearson
Education.
References:
1. Hartley, R. and Zisserman, A., Multiple View Geometry in Computer Vision Cambridge
University Press.
2. Fukunaga, K., Introduction to Statistical Pattern Recognition, Academic Press, Morgan
Kaufmann.
CO-PO Mapping:
43
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
8. Information Retrieval
L T P
4 , 0 , 0 : 4 Credits Prerequisites: None
Detailed Syllabus:
MODULE I
Basic Concepts of IR, Data Retrieval & Information Retrieval, IR system block diagram.
Automatic Text Analysis, Luhn's ideas, Conflation Algorithm, Indexing and Index Term
Weighing, Probabilistic Indexing, Automatic Classification, Measures of Association, Different
Matching Coefficient, Classification Methods, Cluster Hypothesis. Clustering Algorithms,
Single Pass Algorithm, Single Link Algorithm, Rochhio's Algorithm and Dendograms.
MODULE II
File Structures, Inverted file, Suffix trees & suffix arrays, Signature files, Ring Structure, IR
Models, Basic concepts, Boolean Model, Vector Model, and Fuzzy Set Model. Search
Strategies, Boolean search, serial search, and cluster based retrieval, Matching Function.
MODULE III
Performance Evaluation, Precision and recall, alternative measures reference collection
(TREC Collection), Libraries & Bibliographical system, Online IR system, OPACs, Digital
libraries , Architecture issues, document models, representation & access, Prototypes,
projects & interfaces, standards.
MODULE IV
Taxonomy and Ontology: Creating domain specific ontology, Ontology life cycle Distributed
and Parallel IR: Relationships between documents, Identify appropriate networked
collections, multiple distributed collections, parallel IR, MIMD Architectures, Distributed IR,
Collection Partitioning, Source Selection, and Query Processing.
MODULE V
Multimedia IR models & languages, data modelling, Techniques to represent audio and
44
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
visual document, query languages Indexing & searching, generic multimedia indexing
approach, Query databases of multimedia documents, Display the results of multimedia
searches, one dimensional time series, two-dimensional color images, automatic feature
extraction.
MODULE VI
Searching the Web, Challenges, Characterizing the Web, Search Engines, Browsing, Mata
searchers, Web crawlers, robot exclusion, Web data mining, Metacrawler, Collaborative
filtering, Web agents (web shopping, bargain finder,..), Economic, ethical, legal and political
Issues.
Text Books/References:
1. C.D. Manning, P. Raghavan, H. Schütze., Introduction to Information Retrieval.
Cambridge UP, 2008.
2. R. Baeza-Yates, B. Ribeiro-Neto., Modern Information Retrieval. Addison-Wesley,
1999.
3. D.A. Grossman, O. Frieder., Information Retrieval: Algorithms and Heuristics.,
Springer, 2004.
4. I.H. Witten, A. Moffat, T.C. Bell., Managing Gigabytes., Morgan Kaufmann, 1999.
5. C.J. van Risjbergen., The Geometry of Information Retrieval., Cambridge UP, 2004.
CO-PO Mapping:
CO PO1 PO2 PO3 PO4 PO5 PO6
CO1 1 1 1 1 -- --
CO2 2 2 2 2 1 1
CO3 2 2 2 1 1 1
CO4 2 2 3 3 1 2
CO5 3 3 3 2 2 1
Total 11 11 11 9 5 5
Average 2.2 2.2 2.2 1.8 1 1
Eq. Avg. Attainment 2 2 2 2 1 1
45
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
9. Recommender Systems
L T P
4 , 0 , 0 : 4 Credits Prerequisites: None
Course Objective:
Detailed Syllabus:
MODULE I
Basic concepts for recommender systems, detailed taxonomy of recommender systems,
Evaluation of recommender systems
MODULE II
Collaborative filtering algorithms: User-based nearest neighbour recommendation, Item-
based nearest-neighbour recommendation, Model based and pre-processing based
approaches, Attacks on collaborative recommender systems.
MODULE III
Content-based recommendation: High level architecture of content-based systems,
Advantages and drawbacks of content based filtering, Item profiles, Discovering features of
documents, Obtaining item features from tags, Representing item profiles, Methods for
learning user profiles, Similarity based retrieval, Classification algorithms.
MODULE IV
Knowledge based recommendation: Knowledge representation and reasoning, Constraint
based recommenders, Case based recommenders
MODULE V
Hybrid approaches: Opportunities for hybridization, Monolithic hybridization design: Feature
combination, Feature augmentation, Parallelized hybridization design: Weighted, Switching,
Mixed, Pipelined hybridization design: Cascade Meta-level, Limitations of hybridization
strategies
MODULE VI
Evaluating Recommender System: General properties of evaluation research, Evaluation
46
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Reference Book:
1. Charu Aggarwal “Recommender Systems: The Textbook,” First Edition, Springer
2. Francesco Ricci, Lior Rokach, and Bracha Shapira “Recommender Systems
Handbook,” First Edition, Springer
3. Rounak Banik “Hands-On Recommendation Systems with Python,” First Edition, Packt
Publishing
4. Kim Falk “Practical Recommender Systems,” First Edition, Manning Publications
5. Deepak Agarwal and Bee-Chung Chen “Statistical Methods for Recommender
Systems,” First Edition, Cambridge University Press
CO-PO Mapping:
47
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
Detailed syllabus:
MODULE I
Introduction: Artificial Intelligence and Deep Learning-a historical perspective, Artificial
neural networks, Shallow neural networks, Deep neural networks, gradient descent,
forward and backpropagation, computational graphs, linear and non-linear activation
functions.
MODULE II
Optimization techniques: Regularization, Dropout, Batch Normalization,
Vanishing/Exploding gradients, Mini-batch gradient, Gradient descent with momentum,
RMSprop, Adam optimization, Learning rate decay, Local optima, Global optima.
Hyperparameter tuning,
MODULE III
Convolutional Neural Networks: Basic operations: padding, stride, pooling; Classic
convolutional models: LeNet-5, AlexNet, VGG, Modern Deep Convolutional models: ResNet,
GoogleNet; Inception Network, 1-D convolutions, Object detection and Face Recognition
with CNN.
MODULE IV
Recurrent Neural Networks: Sequence modelling, Types of Recurrent Neural Networks,
Backpropagation through time, Language modelling and sequence generation, Word
Embeddings, vanishing gradients with RNNs, Long-Short Term Memory (LSTM), Gated
Recurrent MODULEs (GRU), Bidirectional LSTMs, Sequence-to-Sequence model, Attention
Mechanism, Transformer Network.
MODULE V
Advanced topics: Deep Reinforcement Learning, Generative Adversarial Networks,
Generative vs. Discriminative models, Deep Convolution GANS, Autoencoders.
48
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
References:
1. Charu C. Aggarwal, Neural Networks and Deep Learning- A textbook, 2018, Springer.
2. Ian Goodfellow, Yoshua Bengio, Aaron Courville, ”Deep Learning (Adaptive
Computation and Machine Learning series”, MIT Press.
3. Nikhil Buduma, Nicholas Locascio, “Fundamentals of Deep Learning: Designing Next
Generation Machine Intelligence Algorithms”, O'Reilly Media.
4. Other online resources and research publications.
CO-PO Mapping:
CO PO1 PO2 PO3 PO4 PO5 PO6
CO1 2 2 2 1 1 --
CO2 2 2 1 2 2 --
CO3 3 3 3 3 2 1
CO4 3 3 2 2 2 2
Total 10 10 8 8 7 3
Average Attainment 2.5 2.5 2 2 1.75 0.75
Eq. Average Attainment 3 3 2 2 2 1
49
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
1. This course is all about data visualization, the art and science of turning data into
readable graphics.
2. How to design and create data visualizations based on data available and tasks to be
achieved.
3. This process includes data modeling, data processing (such as aggregation and
filtering), mapping data attributes to graphical attributes, and strategic visual
encoding based on known properties of visual perception as well as the task(s) at
hand.
4. Students will also learn to evaluate the effectiveness of visualization designs, and
think critically about each design decision, such as choice of color and choice of
visual encoding.
Detailed Syllabus:
MODULE I
Foundation: Importance of analytics and visualization in the era of data abundance, 2-D
Graphics, 2-D Drawing, 3-D Graphics, Photorealism, Non-Photorealism, The Human Retina,
Perceiving Two Dimensions, Perceiving Perspective.
MODULE II
Visualization of Numerical Data: Data Mapping, Charts, Glyphs, Parallel Coordinates, Stacked
Graphs, Tufte's Design Rules, Using Colours.
MODULE III
Visualization of Non-Numerical Data: Graphs and Networks, Embedding Planar Graphs,
Graph Visualization, Tree Maps, Principal Component Analysis, Multidimensional Scaling,
Packing.
MODULE IV
Visualization Dashboard: Visualization Systems, Database Visualization, Visualization System
Design.
50
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
REFERENCE BOOKS
CO-PO Mapping:
51
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
1. To provide exposure to the Data Science within the context of its importance in
biology.
2. To learn various methodologies and techniques in biology using Data Science.
3. To learn various tools for bioinformatics data analytics.
4. To learn deep learning approaches for bioinformatics applications.
5. To learn and apply various data science models in biology.
Detailed Syllabus:
MODULE I
Need for Data Science in Biology and Healthcare, Visualization tools for biological and
bioinformatics datasets, data handling, transformations of data.
MODULE II
Data Science in genomics, from genetics to genomes, Alignment, and phylogenetic trees.
MODULE III
Structural bioinformatics, Proteomics, Protein structure prediction, integrative structural
modeling, and structure-based drug design.
MODULE IV
AI algorithms, statistical tools, graph algorithms for bioinformatics data analytics.
MODULE V
Deep learning algorithms in perspective of bioinformatics applications, GANs for biological
applications, Whole-cell modeling approaches.
Text Books:
52
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
CO-PO Mapping:
53
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objective:
Detailed Syllabus:
MODULE-I
Fundamentals of Analytics: Introduction to data-driven decision making; general
introduction to data driven strategy and its importance; use of examples and mini-case
studies to illustrate the role of statistical analysis in decision making.
MODULE-II
Basic Data Analysis: Various types of data that are commonly collected by firms; methods to
be used and inferences/insights that can be obtained depending on the type of data that are
available (stated versus revealed preference, level of aggregation, cross- sectional, time
series, panel data and so forth); use of frequency distributions, mean comparisons, and
cross tabulation; statistical inferences using chi-square; t-test and ANOVA.
MODULE-III
Experimental Design and Natural Experiments: Issues of design of experiments and internal
and external validity; case studies in marketing; economics; and medicine etc.; A-B testing;
and circumstances that provide us with “natural” experiments.
MODULE-IV
Decision making tools: Regression analysis and its applications; use of regression output in
forecasting; promotional planning and optimal pricing; multivariate analysis (unsupervised
learning) cluster analysis; factor analysis decision trees; elastic nets and random forests.
MODULE-V
Case Studies: To understand the problem at an intuitive level; use of simple data analysis
and visualization to verify (or falsify) the intuition; use of appropriate statistical analysis to
present your arguments.
54
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Text Books:
1. F.S. Hillier and G.J. Liberman “Introduction to Operations Research” Tata McGraw
Hill Education Private Limited.
2. Gregory S. Parnel, Terry A. Bresnick, Steven N. Tani, Eric R. Johnson “Handbook of
Decision Analysis”, Wiley.
3. Emily Moberg and Igor Linkov “Multi-Criteria Decision Analysis: Environmental
Applications and Case Studies”, CRC Press, Taylor and Francis group.
4. Adiel Teixeira de Almeida, Emel Aktas, Sarah Ben Amor, João Luis de Miranda
“Advanced Studies in Multi-Criteria Decision Making“, CRC Press.
CO-PO Mapping:
55
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
Detailed Syllabus:
MODULE-I
Social Network Analysis: Preliminaries and definitions, Erdos Number Project, Centrality
measures, Balance and Homophily.
MODULE-II
Random graph models: Random graphs and alternative models, Models of network growth,
Navigation in social Networks.
MODULE-III
Network topology and diffusion, Contagion in Networks, Complex contagion, Percolation
and information, Epidemics, and information cascades.
MODULE-IV
Cohesive subgroups, Multidimensional Scaling, Structural equivalence, Roles and positions,
Ego networks, Weak ties, Structural holes.
MODULE-V
Small world experiments, small world models, Origins of small world, Heavy tails, Small
Diameter, Clustering of connectivity
MODULE-VI
The Erdos-Renyi Model, Clustering Models, Preferential Attachment
MODULE-VII
Navigation in Networks Revisited, Important vertices and page rank algorithm, Towards
rational dynamics in networks, Basics of game theory.
MODULE-VIII
Coloring and consensus, biased voting, network formation games, network structure and
equilibrium, behavioral experiments, Spatial and agent-based models
56
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Text Books:
CO-PO Mapping:
57
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
Course Objectives:
1. To learn the concept and properties of time series data
2. To learn Autoregressive models and forecasting for time series data.
3. Analyzing time series data using R programming.
4. Learn various models of time series data.
Detailed syllabus:
MODULE-I
Basic Properties of time-series data: Distribution and moments, Stationarity,
Autocorrelation, Heteroscedasticity, Normality.
MODULE-II
Autoregressive models and forecasting: AR, ARMA, ARIMA models.
Random walk model: non-stationarity and unit-root process, Drift and Trend models.
MODULE-III
Regression analysis with time-series data using R programming.
Principal Component Analysis (PCA) and Factor Analysis.
MODULE-IV
Conditional Heteroscedastic Models: ARCH, GARCH. T-GARCH, BEKK-GARCH.
Introduction to Non-linear and regime-switching models: Markov regime-switching models,
Quantile regression, Contagion models
MODULE-V
Introduction to Vector Auto-regressive (VAR) models: Impulse Response Function (IRF),
Error Correction Models, Co-integration. Introduction to Panel data models: Fixed-Effect and
Random-Effect models.
Text Books:
1. Chris Brooks “Introductory Econometrics for Finance,” Fourth Edition, Cambridge
University Press.
2. Ruey S. Tsay “Analysis of Time-series data,” Third Edition, Wiley
3. John Fox and Sanford Weisberg “An R Companion to Applied Regression,” Third
Edition, SAGE
4. Yves Croissant and Giovanni Millo “Panel Data Econometrics with R,” First Edition,
Wiley
58
CSE Department, NIT Agartala
M.Tech. in Data Science and Engineering
CO-PO Mapping:
CO-1 1 1 1 2 1 1
CO-2 2 2 1 2 1 1
CO-3 2 2 2 1 1 1
CO-4 2 3 2 2 2 1
Total 7 8 6 7 5 4
Average 1.75 2 1.5 1.75 1.25 1
Attainment 2 2 2 2 1 1
***************************************************************************
59
CSE Department, NIT Agartala