INTRODUCTION
TO DATA SCIENCE
AND DATA
ANALYTICS
Understanding the Basics and Using Excel
CHAPTER 1: OVERVIEW OF DATA, DATA
SCIENCE ANALYTICS, AND TOOLS
• About Data Science
• About Data Analytics and its types
• Data, Data Sources and Data Types
• Data Analytics Process
• Excel as Data Analytics Tool
• Understanding the MS Excel Interface
• Creating and Saving Workbooks
• Working with Worksheets and Data Entry
• Formulas and Functions
2
OBJECTIVES
OBJECTIVES
• What is Data Science?
• Data Analytics overview and its types
• Data, Data Sources and Data Types
• Data Analytics Process
• Excel as Data Analytics Tool
• Understanding the MS Excel Interface
• Creating and Saving Workbooks
• Working with Worksheets and Data Entry
• Formulas and Functions
4
DATA SCIENCE
WHAT IS DATA SCIENCE?
• Data Science is a multidisciplinary field that uses scientific methods, processes, algorithms, and
systems to extract knowledge and insights from structured and unstructured data. It combines
techniques from statistics, computer science, and domain expertise to analyze and interpret complex
data.
• Key Components of Data Science
• Data Collection:
• Gathering raw data from various sources, such as databases, surveys, sensors, and online platforms.
• Ensuring data quality and relevance for subsequent analysis.
• Data Analysis:
• Applying statistical methods and algorithms to clean, process, and analyze data.
• Identifying patterns, trends, and relationships within the data.
• Machine Learning:
• Using algorithms and statistical models to build predictive or classification models based on historical data.
• Techniques include supervised learning, unsupervised learning, and reinforcement learning.
• Data Visualization:
6 • Creating visual representations of data to make complex information more accessible and understandable.
• Tools include charts, graphs, dashboards, and maps.
DATA ANALYTICS OVERVIEW AND ITS TYPES
Data Analytics is the process of examining datasets to draw conclusions
about the information they contain. It involves techniques and tools to
1. What is Data analyze raw data and extract meaningful insights that can inform business
Analytics decisions. Data Analytics encompasses various methods such as statistical
analysis, data mining, and predictive modeling to uncover patterns, trends,
and relationships.
Difference Between Data Science and Data Analytics
Scope:
• Data Science: A broader field that includes data analytics as one of its
components. It combines data analysis with machine learning, data
engineering, and advanced computational methods.
• Data Analytics: Focuses specifically on analyzing data to extract
insights and make data-driven decisions. It generally involves less
emphasis on machine learning and more on statistical analysis and data
interpretation.
Objectives:
• Data Science: Aims to develop predictive models, automate processes,
and create algorithms that can handle large volumes of data and
7 complex problems.
• Data Analytics: Primarily aims to interpret historical data to understand
TOOLS AND TECHNIQUES
Data Science: Utilizes tools like Python, R, and big data technologies (e.g.,
Hadoop, Spark) along with advanced machine learning algorithms.
Data Analytics: Often involves tools like Excel, SQL, and BI software (e.g.,
Tableau, Power BI) with a focus on descriptive and diagnostic analytics.
8
IMPORTANCE OF DATA ANALYTICS IN DECISION
MAKING
• Informed Decisions:
• Data analytics provides actionable insights based on empirical
evidence rather than intuition, helping organizations make more
informed and accurate decisions.
• Identifying Trends and Patterns:
• By analyzing historical data, organizations can identify trends and
patterns that help in forecasting future outcomes and strategic
planning.
• Improving Efficiency:
• Analytics can uncover inefficiencies in processes and operations,
enabling organizations to optimize workflows and reduce costs.
• Customer Insights:
• Understanding customer behavior and preferences through data
analytics allows businesses to tailor their offerings, improve customer
satisfaction, and drive growth.
• Competitive Advantage:
• Leveraging data analytics can give organizations a competitive edge
9 by enabling them to adapt quickly to market changes and emerging
trends.
TYPES OF DATA
ANALYTICS
• Descriptive Analytics: What happened?
• Example: Sales reports
• Diagnostic Analytics: Why did it happen?
• Example: Cause of sales decline
• Predictive Analytics: What is likely to
happen?
• Example: Sales forecasts
• Prescriptive Analytics: What should be
done?
• Example: Recommendations for marketing
strategies