Reference Books
4. Cloud Security: A Comprehensive Guide to Secure Cloud Computing, 2010
Ronald L. Krutz, Russell Dean Vines, Wiley-India(ISBN: 978-0-470-
58987-8)
1. Subject Code: SE416 Course Title: Big Data Analytics
2. Contact Hours: L: 3 T: 1 P: 0
3. Examination Duration (ETE) (Hrs.): Theory 3 Hrs Practical 0
4. Relative Weightage: CWS 25 PRS 0 MTE 25 ETE 50 PR 0
5. Credits: 4
6. Semester: VIII
7. Subject Area: DEC
8. Pre-requisite: Database management systems.
9. Objective: Understand the fundamentals of various big data analysis techniques, Hadoop
structure, environment and framework.
10. Details of Course
S.No. Contents Contact
Hours
1. INTRODUCTION TO BIG DATA : Introduction to Big Data Platform – 8
Challenges of Conventional Systems - Intelligent data analysis – Nature of
Data - Analytic Processes and Tools - Analysis vs Reporting - Modern Data
Analytic Tools - Statistical Concepts: Sampling Distributions - Re-Sampling -
Statistical Inference - Prediction Error.
2. MINING DATA STREAMS : Introduction To Streams Concepts – Stream 8
Data Model and Architecture - Stream Computing - Sampling Data in a
Stream – Filtering Streams – Counting Distinct Elements in a Stream –
Estimating Moments – Counting Oneness in a Window – Decaying Window -
Real time Analytics Platform(RTAP) Applications - Case Studies - Real Time
Sentiment Analysis, Stock Market Predictions.
3. HADOOP: History of Hadoop- The Hadoop Distributed File System – 10
Components of Hadoop- Analyzing the Data with Hadoop- Scaling Out-
Hadoop Streaming- Design of HDFS-Java interfaces to HDFS- Basics-
Developing a Map Reduce Application-How Map Reduce Works-Anatomy of
a Map Reduce Job run-Failures-Job Scheduling-Shuffle and Sort – Task
execution - Map Reduce Types and Formats- Map Reduce Features
4. HADOOP ENVIRONMENT :Setting up a Hadoop Cluster - Cluster 8
specification - Cluster Setup and Installation - Hadoop Configuration-Security
in Hadoop - Administering Hadoop – HDFS - Monitoring-Maintenance-
Hadoop benchmarks- Hadoop in the cloud
5. FRAMEWORKS: Applications on Big Data Using Pig and Hive – Data 8
processing operators in Pig – Hive services – HiveQL – Querying Data in
Hive - fundamentals of HBase and ZooKeeper - IBM InfoSphere BigInsights
and Streams. Visualizations - Visual data analysis techniques, interaction
techniques; Systems and applications
Total 42
11. Suggested Books
S.No. Name of Books / Authors/ Publishers Year of
Publication/
Reprint
Text Books
1. Michael Berthold, David J. Hand, “Intelligent Data Analysis”, Springer, 2007
2007.
2 Tom White “ Hadoop: The Definitive Guide” Third Edition, O’reilly 2012
Media, 2012.
3 Chris Eaton, Dirk DeRoos, Tom Deutsch, George Lapis, Paul 2012
Zikopoulos, “Understanding Big Data: Analytics for Enterprise Class
Hadoop and Streaming Data”, McGrawHill Publishing, 2012
4 Anand Rajaraman and Jeffrey David Ullman, “Mining of Massive 2012
Datasets”, Cambridge University Press, 2012.
Reference books:
5 Bill Franks, “Taming the Big Data Tidal Wave: Finding Opportunities in 2012
Huge Data Streams with Advanced Analytics”, John Wiley & sons, 2012.
6 Glenn J. Myatt, “Making Sense of Data”, John Wiley & Sons, 2007 2007
7 Pete Warden, “Big Data Glossary”, O’Reilly, 2011. 2011
8 Jiawei Han, Micheline Kamber “Data Mining Concepts and Techniques”, 2008
Second Edition, Elsevier, Reprinted 2008.
9 Da Ruan,Guoquing Chen, Etienne E.Kerre, Geert Wets, Intelligent Data 2007
Mining, Springer,2007
10. Paul Zikopoulos ,Dirk deRoos , Krishnan Parasuraman , Thomas Deutsch 2012
, James Giles , David Corrigan , Harness the Power of Big Data The IBM
Big Data Platform, Tata McGraw Hill Publications, 2012
11 Michael Minelli (Author), Michele Chambers (Author), Ambiga Dhiraj 2013
(Author) , Big Data, Big Analytics: Emerging Business Intelligence and
Analytic Trends for Today's Businesses,Wiley Publications,2013
12 Zikopoulos, Paul, Chris Eaton, Understanding Big Data: Analytics for 2011
Enterprise Class Hadoop and Streaming Data, Tata McGraw Hill
Publications, 2011
1. Subject Code: SE418 Course Title: Wireless and Mobile Computing