0% found this document useful (0 votes)

115 views9 pages

Data Science & Analytics Overview

The document provides an overview of data analytics concepts. It defines analytics as using data, statistics, modeling and fact-based analysis to drive decisions. It lists common terms in data analytics like data types, data sources, data warehousing, data mining and machine learning algorithms. It distinguishes statistics as quantifying data, data mining as finding patterns, machine learning as using models to predict outcomes, and artificial intelligence as using reasoning to behave intelligently. The document also describes types of analytics reports, data science as transforming data into actionable predictions, and machine learning as using mathematical functions trained on data to produce outputs.

Uploaded by

sumit

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

115 views9 pages

Data Science & Analytics Overview

Uploaded by

sumit

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Data Analytics

What is Analytics ?
Analytics is the extensive use of data, statistical and quantitative analysis,
exploratory, predictive models, and fact based management to drive
decisions and actions.”

Analytics can be defined as “the analysis of data to draw hidden insights to

aid decision making”.

…… and many more !!!

Frequently used terms
Data Analytics Data Analysis Big Data Data Types

Data Warehouse Data Mining Data Cleansing Data Definition

Data Manipulation Data Transformation Data Wrangling Databases

Data Sources Data Forms Raw and Processed Data

Data Collection Statistics Statistical measures Mathematics

Linear Algebra
Artificial Intelligence Normalization R / Python Hadoop
Text Analytics Algorithms Predictions Patterns

Supervised learning Unsupervised learning Clustering

etc….
Definitions
Statistics is just about the numbers, and quantifying the data. There are many tools for finding relevant
properties of the data but this is pretty close to pure mathematics.

Data Mining is about using statistics as well as other programming methods to find patterns hidden in the
data so that you can explain some phenomenon. Data Mining builds intuition about what is really happening
in some data and is still little more towards math than programming, but uses both.

Machine Learning uses Data Mining techniques and other learning algorithms to build models of what is
happening behind some data so that it can predict future outcomes. Math is the basis for many of the
algorithms, but this is more towards programming.

Artificial Intelligence uses models built by Machine Learning and other ways to reason about the world
and give rise to intelligent behavior whether this is playing a game or driving a robot/car. Artificial
Intelligence has some goal to achieve by predicting how actions will affect the model of the world and
chooses the actions that will best achieve that goal. Very programming based.

Statistics quantifies numbers

Data Mining explains patterns
In short Machine Learning predicts with models
Artificial Intelligence behaves and reasons
https://stats.stackexchange.com/questions/5026/what-is-the-difference-between-data-mining-statistics-machine-learning-and-ai
Types of Analytics
Types of report, analytics
and query Focus

Optimization What’s the best that can happen ?

Prediction What will happen next ?

Analytics

Forecasting What if this trend continues ?

Statistical Analysis Why is this happening ?

Alerts What actions are needed ?

Query and Reports

Drilldown reports Where is the problem ?

Ad-hoc reports How many, how often ?

Standard Reports What happened ?

Data Science
• Art of transforming hypotheses and data into actionable predictions

• For example, we can use models and data to

 Predict who will win an election
 What products will sell well together (Apriori / Market-Basket analysis)
 Who is likely to default on loans
 Which advertisements will be clicked on
 etc.

• Tools used (but not restricted to)

Empirical Sciences Statistics Business Intelligence Databases Data Warehousing Visualization

Expert Systems Analytics Machine Learning Big Data Data Mining Reporting

• Central goal of Data Science

To deploying effective decision-making models to a production environment
What distinguishes data science itself from the tools and techniques is the central goal of deploying
effective decision-making models to a production environment.
Data Science
These systems share a lot of features:

• Amazon’s product recommendation systems

• Google’s advertisement valuation systems
• Linkedin’s contact recommendation system
• Twitter’s trending topics
• Walmart’s consumer demand projection systems

Built on a large dataset Most of the systems are live or online

Allowed to make mistakes Not concerned with any cause

Machine Learning
• The ability to write a mathematical function that will read an input and produce output

• We provide the function – machine does not pick its own function

• ML considerations
 Training data (lots of it)
 Model
 Cost function (eg: Ordinary Least Squares)
 Optimisation (eg: Gradient descent)

Why is learning possible?

 Generalisation is possible
eg: if dataset contains travel time between places A and B, function would not generalise if we
predict travel distance between A and C

 IID (independent and identical distribution) of data

That’s why gradient descent needn’t go through the entire dataset, since data is similar
… Eventually data will surpass in oil and water in importance

Thank You !!!

Din V 18599-3
No ratings yet
Din V 18599-3
81 pages
In The Mountains (Form 4)
50% (2)
In The Mountains (Form 4)
6 pages
"US Crisis to Empire: Key Events"
No ratings yet
"US Crisis to Empire: Key Events"
38 pages
Nortel GSM R Solution Brief
No ratings yet
Nortel GSM R Solution Brief
10 pages
LTE PIM Issues in 800/1800 Bands
No ratings yet
LTE PIM Issues in 800/1800 Bands
9 pages
Alessandra Lemma - Minding The Body - The Body in Psychoanalysis and Beyond (2014, Routledge)
80% (5)
Alessandra Lemma - Minding The Body - The Body in Psychoanalysis and Beyond (2014, Routledge)
211 pages
RPG Xna
No ratings yet
RPG Xna
388 pages
Data Science Essentials for Beginners
No ratings yet
Data Science Essentials for Beginners
203 pages
Graphs and Networks 2nd Edition PDF
No ratings yet
Graphs and Networks 2nd Edition PDF
476 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
32 pages
Handwritten vs. Email: Pros and Cons
100% (1)
Handwritten vs. Email: Pros and Cons
3 pages
Importance of Technology Transfer
80% (10)
Importance of Technology Transfer
6 pages
NMS U2000 Training
0% (1)
NMS U2000 Training
150 pages
Case Study On Anthropology
No ratings yet
Case Study On Anthropology
4 pages
Car Template Proposal 4g
No ratings yet
Car Template Proposal 4g
37 pages
New Site Kpi Optimization
No ratings yet
New Site Kpi Optimization
7 pages
Microsoft Excel VBA and Macros (Office 2021 and Microsoft 365) - 131-201-57-71 Chapter 5 Vba
No ratings yet
Microsoft Excel VBA and Macros (Office 2021 and Microsoft 365) - 131-201-57-71 Chapter 5 Vba
15 pages
Pulmonary Function Tests
No ratings yet
Pulmonary Function Tests
65 pages
TSEL LTE OMC Guideline - V9
No ratings yet
TSEL LTE OMC Guideline - V9
131 pages
PEGA 02 Material Total
100% (8)
PEGA 02 Material Total
223 pages
Contemporary Social Issues of Tamil Nadu
No ratings yet
Contemporary Social Issues of Tamil Nadu
8 pages
TEMS Discovery 4.0 User Guide
100% (3)
TEMS Discovery 4.0 User Guide
366 pages
Huawei GSM-R Solution: Huawei Technologies Co., LTD
No ratings yet
Huawei GSM-R Solution: Huawei Technologies Co., LTD
16 pages
Carl Schlechter Wins With White - 191 Games
100% (1)
Carl Schlechter Wins With White - 191 Games
73 pages
Telecom Site Acceptance Guide
No ratings yet
Telecom Site Acceptance Guide
6 pages
Huawei 2G/3G Network Optimization: Sharing Session Bandung, January 21-23, 2015
No ratings yet
Huawei 2G/3G Network Optimization: Sharing Session Bandung, January 21-23, 2015
174 pages
Dedicated Mobile Communications For High-Speed Railway
No ratings yet
Dedicated Mobile Communications For High-Speed Railway
354 pages
Ramen Product Cost Analysis
No ratings yet
Ramen Product Cost Analysis
4 pages
Transformer Training for Engineers
100% (2)
Transformer Training for Engineers
8 pages
Review Paper by Wendosen Seife
No ratings yet
Review Paper by Wendosen Seife
4 pages
ONDM2019 Tutorial 5G Networks Technologies Challenges and Tools
No ratings yet
ONDM2019 Tutorial 5G Networks Technologies Challenges and Tools
39 pages
Iso 45009
No ratings yet
Iso 45009
30 pages
Key Performance Guarantee Steps For UMTS Swapping V2.0
100% (1)
Key Performance Guarantee Steps For UMTS Swapping V2.0
58 pages
Case Conclusion
No ratings yet
Case Conclusion
3 pages
GSM Handover Management
No ratings yet
GSM Handover Management
11 pages
Standard 8
No ratings yet
Standard 8
1 page
CR - 4G - CWJ - MC - Bundang - Huawei CQI Improve - 2020 - 07 - 08
No ratings yet
CR - 4G - CWJ - MC - Bundang - Huawei CQI Improve - 2020 - 07 - 08
8 pages
LTE2100 Hygiene Report Analysis
No ratings yet
LTE2100 Hygiene Report Analysis
44 pages
Intelligent Transportation System - I: Lecture Notes in Transportation Systems Engineering
No ratings yet
Intelligent Transportation System - I: Lecture Notes in Transportation Systems Engineering
81 pages
00-Guide To Features (eRAN15.1 - 05)
No ratings yet
00-Guide To Features (eRAN15.1 - 05)
102 pages
Imananger PRS V100R016 Main Slide For CVM PDF
No ratings yet
Imananger PRS V100R016 Main Slide For CVM PDF
49 pages
KPI Definition
No ratings yet
KPI Definition
75 pages
LTE - Events MANTAP
No ratings yet
LTE - Events MANTAP
81 pages
Railway Dhruv
100% (1)
Railway Dhruv
41 pages
Zamani Swap Project Final Report - 0922 - v3
No ratings yet
Zamani Swap Project Final Report - 0922 - v3
50 pages
Aircon Marketing Plan for Parents
100% (1)
Aircon Marketing Plan for Parents
16 pages
Unit III: Concept Description: Characterization and Comparison
No ratings yet
Unit III: Concept Description: Characterization and Comparison
53 pages
New Site For LTC 601665 - Klapanunggal - 3G: Huawei RNP/RNO Team
No ratings yet
New Site For LTC 601665 - Klapanunggal - 3G: Huawei RNP/RNO Team
41 pages
Designing Lte R
100% (1)
Designing Lte R
7 pages
GSM-R Implementation and Global Expansion
No ratings yet
GSM-R Implementation and Global Expansion
14 pages
Garki and Environs (Garki Modern Market, Garki Village and Other Parts of Garki) Pre - and Post-Optimization Complaint DT Report - 18.07.2013
No ratings yet
Garki and Environs (Garki Modern Market, Garki Village and Other Parts of Garki) Pre - and Post-Optimization Complaint DT Report - 18.07.2013
24 pages
Cht13 Cht13 Transportation Demand Ali Transportation Demand Ali Analysis Analysis
No ratings yet
Cht13 Cht13 Transportation Demand Ali Transportation Demand Ali Analysis Analysis
43 pages
Data Science - PPT
No ratings yet
Data Science - PPT
45 pages
An Overview of Research Topics and Challenges For 5G Massive MIMO Antennas
No ratings yet
An Overview of Research Topics and Challenges For 5G Massive MIMO Antennas
4 pages
Data To Knowledge To Results Rev4
No ratings yet
Data To Knowledge To Results Rev4
21 pages
Sabrimalai Ayyappan
No ratings yet
Sabrimalai Ayyappan
2 pages
Chapter - 3 - Key Notes - Election and Representation
No ratings yet
Chapter - 3 - Key Notes - Election and Representation
2 pages
Article - Introdcution of MTRC PDF
No ratings yet
Article - Introdcution of MTRC PDF
2 pages
Industry 4.0 Big Data Strategies
No ratings yet
Industry 4.0 Big Data Strategies
14 pages
Primitives
100% (1)
Primitives
3 pages
E. Nursing Diagnosis
No ratings yet
E. Nursing Diagnosis
2 pages
4g - Concheck - (Siteid and Sitename) v3
No ratings yet
4g - Concheck - (Siteid and Sitename) v3
28 pages
Introduction To Business Management
0% (1)
Introduction To Business Management
4 pages
VEGA v. SSS
No ratings yet
VEGA v. SSS
4 pages
KTMB GSM-R
No ratings yet
KTMB GSM-R
14 pages
Sreelakshmi Report
No ratings yet
Sreelakshmi Report
21 pages
The Night Before Christmas: Inside
No ratings yet
The Night Before Christmas: Inside
2 pages
Ivy - Data Science and Data Visualization Certification Course
100% (1)
Ivy - Data Science and Data Visualization Certification Course
10 pages
Railway, Airport & Harbour Engg.
No ratings yet
Railway, Airport & Harbour Engg.
3 pages
Step 1: Login U2000 and Link To Maintenance Client: Huawei Technologies Co., Ltd. Huawei Confidential
No ratings yet
Step 1: Login U2000 and Link To Maintenance Client: Huawei Technologies Co., Ltd. Huawei Confidential
5 pages
Huawei LTE Network Configuration Guide
No ratings yet
Huawei LTE Network Configuration Guide
35 pages
4G Wireless Technology: Biniwale Aditi.M. 5 Sem Computer Piet
No ratings yet
4G Wireless Technology: Biniwale Aditi.M. 5 Sem Computer Piet
20 pages
IDIOMS tiếng Anh thường dùng cho chủ đề BUSINESS
No ratings yet
IDIOMS tiếng Anh thường dùng cho chủ đề BUSINESS
2 pages
Tutorial U2000: Query Result
No ratings yet
Tutorial U2000: Query Result
8 pages
Lee Criteria Model Tuning White Paper Web
No ratings yet
Lee Criteria Model Tuning White Paper Web
3 pages
RS Power Adjustment Scripts
No ratings yet
RS Power Adjustment Scripts
10 pages
Standards of Future Railway Wireless Communication in Korea-Good Reference
No ratings yet
Standards of Future Railway Wireless Communication in Korea-Good Reference
8 pages
Development of A Digital Twin For Prediction of Rail Surface Damage in Heavy Haul Railway Operations
No ratings yet
Development of A Digital Twin For Prediction of Rail Surface Damage in Heavy Haul Railway Operations
27 pages
Huawei CEM INFRA Job Types Overview
No ratings yet
Huawei CEM INFRA Job Types Overview
12 pages
3GPP Highlights Issue 9 WEB Download
No ratings yet
3GPP Highlights Issue 9 WEB Download
32 pages
CMR Bda Why Data Analytics
No ratings yet
CMR Bda Why Data Analytics
108 pages
My Beloved Charioteer
No ratings yet
My Beloved Charioteer
41 pages
Edetino Case
No ratings yet
Edetino Case
11 pages
Scheduling Algorithms For 5G Networks and Beyond Classification and Survey
No ratings yet
Scheduling Algorithms For 5G Networks and Beyond Classification and Survey
20 pages
Mod1 DM
No ratings yet
Mod1 DM
9 pages
(Reading Certificate) Egemen Türedi 16 Oct 2025
No ratings yet
(Reading Certificate) Egemen Türedi 16 Oct 2025
2 pages
Class 2 Bridge Course
No ratings yet
Class 2 Bridge Course
6 pages
JUDICIAL AFFIDAVIT OF BELLE VERAS ANNEX 12 Answer For Forcible Entry Civil Case No. 890
No ratings yet
JUDICIAL AFFIDAVIT OF BELLE VERAS ANNEX 12 Answer For Forcible Entry Civil Case No. 890
4 pages
DS RC
No ratings yet
DS RC
92 pages

Data Science & Analytics Overview

Uploaded by

Data Science & Analytics Overview

Uploaded by

Data Analytics

Analytics can be defined as “the analysis of data to draw hidden insights to

…… and many more !!!

Data Warehouse Data Mining Data Cleansing Data Definition

Data Manipulation Data Transformation Data Wrangling Databases

Data Sources Data Forms Raw and Processed Data

Data Collection Statistics Statistical measures Mathematics

Supervised learning Unsupervised learning Clustering

Statistics quantifies numbers

Optimization What’s the best that can happen ?

Prediction What will happen next ?

Forecasting What if this trend continues ?

Statistical Analysis Why is this happening ?

Alerts What actions are needed ?

Drilldown reports Where is the problem ?

Ad-hoc reports How many, how often ?

Standard Reports What happened ?

• For example, we can use models and data to

• Tools used (but not restricted to)

• Central goal of Data Science

• Amazon’s product recommendation systems

Built on a large dataset Most of the systems are live or online

Allowed to make mistakes Not concerned with any cause

Why is learning possible?

 IID (independent and identical distribution) of data

Thank You !!!

You might also like