DATA SCIENCE ANALYTICS
1. Title of the Module
Chapter 1: INTRODUCTION TO DATA SCIENCE ANALYTICS
2. Introduction
This course covers the basics of data science analytics and its tools. It gives an
overview of data science and data analytics, the types of data analytics, and data
together with its sources and types. It also covers the data analytics process and the
use of Microsoft Excel as a tool for data analysis.
3. Learning Outcome
At the end of this module, students should be able to:
1. Differentiate Data Science and Data Analytics;
2. Explain the data analytics process; and
3. Demonstrate proficiency in fundamental Excel functions such as data
entry, formatting, formula creation, and use of Excel functions.
4. Learning Content
Topics for module 1: Overview of Data, Data Science Analytics, and Tools
Topic 1: Data Science vs. Data Analytics
Topic 2: Types of Data Analytics
Topic 3: Data Science
Topic 4: Data Analytics Process
5. Teaching and Learning Activities
Topic 1: Data Science vs. Data Analytics
Data Analyst
An individual who examines large datasets to identify trends, develop forecasts,
and create data visualizations that tell a compelling story through actionable insights.
Data Scientist
An individual who is tasked with designing and constructing new
processes for data modeling using algorithms, predictive analytics, and statistical
analysis.
Their Differences
In Education Requirements, a data analyst typically holds a degree in data analytics,
data science, computer science, applied mathematics, or statistics. A data scientist
similarly holds a degree in computer science or mathematics, but possibly also an
advanced degree in data science.
In Work Experience, a data analyst works with data in a business context, while
a data scientist works with data in a mathematical and statistical context.
In Technical Skills, data analysts use programming languages like SQL, R, and
Python, along with Microsoft Excel, and Tableau and Power BI for data visualization.
Data scientists possess skills in statistical analysis, machine learning, coding, and Hadoop.
In Professional Skills, a data analyst possesses soft skills such as data storytelling,
analytical thinking, critical thinking, and interpersonal skills. A data scientist possesses
the same soft skills, with the addition of business intuition.
Data Analytics reviews raw data and draws meaningful insights to solve business
problems.
Types of Data Analytics
● Descriptive analytics. Answers the question “What has happened in the past and
what is happening right now?” by using current and historical data to provide a
snapshot of trends and patterns.
● Diagnostic analytics. Answers the question “Why are these trends and patterns
happening?” by focusing on the trend data to discover the factors or reasons for
past performance.
● Predictive analytics. Answers the question “What is likely to happen in the
future?” by utilizing machine learning and artificial intelligence (AI) to build
predictive and statistical models that forecast future outcomes.
● Prescriptive analytics. Answers the question “What do we need to do?” through
testing and other techniques to recommend specific solutions that will drive a
desired outcome.
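The four types above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the sales figures are hypothetical, the "diagnostic" step only computes month-over-month changes as a starting point for asking why, the forecast simply extends the average change (real predictive analytics would use a proper statistical or machine learning model), and the recommendation rule and its stock level are invented.

```python
from statistics import mean

# Hypothetical monthly sales figures (illustrative only, not real data).
monthly_sales = [120, 135, 150, 160, 180, 195]


def describe(values):
    """Descriptive analytics: summarize what has happened so far."""
    return {
        "total": sum(values),
        "average": mean(values),
        "minimum": min(values),
        "maximum": max(values),
    }


def month_over_month_changes(values):
    """Diagnostic starting point: show where the series rose or fell."""
    return [later - earlier for earlier, later in zip(values, values[1:])]


def naive_forecast(values):
    """Predictive analytics (very simplified): extend the average change."""
    return values[-1] + mean(month_over_month_changes(values))


def recommend(forecast_value, stock_on_hand):
    """Prescriptive analytics (toy rule): suggest an action from the forecast."""
    if forecast_value > stock_on_hand:
        return "increase stock"
    return "keep current stock"


summary = describe(monthly_sales)
changes = month_over_month_changes(monthly_sales)
forecast = naive_forecast(monthly_sales)
action = recommend(forecast, stock_on_hand=200)  # 200 is a hypothetical stock level

print("Descriptive:", summary)
print("Diagnostic (changes):", changes)
print("Predictive (next month):", forecast)
print("Prescriptive:", action)
```

Each function maps to one question: what happened, where did it change, what is likely next, and what should be done about it.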
Data Science focuses on building models and designing frameworks that will gather
and analyze large datasets. It typically includes data mining, statistical methods, and
machine learning algorithms.
• Unstructured Data
Unorganized information that is hard to use directly for analysis. It does not
follow a predefined data model or organized format. It often contains text, images,
audio, videos, or other complex data.
• Statistical Methods
Once the data is collected, there can be many variables to consider. Regression
analysis is one statistical method that allows data scientists to explore the
relationship between these variables. Correlation analysis is also used for both
qualitative and quantitative data.
• Machine Learning Algorithms
Data scientists use machine learning algorithms to predict, categorize, and
classify data while keeping the chance of error as low as possible. There are three
main types of machine learning algorithms: supervised, unsupervised, and
reinforcement learning.
Data science is an iterative process in which hypotheses are formed and tested with data.
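As a small illustration of the statistical methods described above, the following Python sketch fits a simple linear regression and computes a correlation coefficient. It is only a sketch under stated assumptions: the advertising and sales numbers are hypothetical, the function names are invented for this example, and Pearson correlation is used, which applies to quantitative data only.

```python
from math import sqrt

# Hypothetical paired observations (illustrative only, not real data):
# advertising spend and resulting sales, in arbitrary units.
ad_spend = [1.0, 2.0, 3.0, 4.0, 5.0]
sales = [2.0, 4.1, 6.2, 7.9, 10.0]


def average(values):
    """Arithmetic mean of a list of numbers."""
    return sum(values) / len(values)


def simple_linear_regression(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope * x + intercept."""
    x_bar, y_bar = average(xs), average(ys)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    slope = s_xy / s_xx
    intercept = y_bar - slope * x_bar
    return slope, intercept


def pearson_correlation(xs, ys):
    """Return Pearson's r, a value between -1 and 1 measuring linear association."""
    x_bar, y_bar = average(xs), average(ys)
    s_xy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    s_xx = sum((x - x_bar) ** 2 for x in xs)
    s_yy = sum((y - y_bar) ** 2 for y in ys)
    return s_xy / sqrt(s_xx * s_yy)


slope, intercept = simple_linear_regression(ad_spend, sales)
r = pearson_correlation(ad_spend, sales)

print(f"sales is approximately {slope:.2f} * ad_spend + {intercept:.2f}")
print(f"Pearson correlation: {r:.4f}")
```

The slope estimates how much sales change per unit of spend, while the correlation indicates how closely the two variables move together. Neither shows on its own that one causes the other.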
Data Analytics Process
As the data available to companies continues to grow both in amount and complexity,
so too does the need for an effective and efficient process by which to harness the
value of that data. The data analytics process typically moves through several iterative
phases.
1. Identify the business question you’d like to answer. What problem is the
company trying to solve? What do you need to measure, and how will you
measure it?
2. Collect the raw data sets you’ll need to help you answer the identified question.
Data might come from internal sources, like a company’s customer
relationship management (CRM) software, or from secondary sources, like
government records or social media application programming interfaces (APIs).
3. Clean the data to prepare it for analysis. This often involves purging duplicate
and anomalous data, reconciling inconsistencies, standardizing data structure
and format, and dealing with white spaces and other syntax errors.
4. Analyze the data. By manipulating the data using various data analysis
techniques and tools, you can begin to find trends, correlations, outliers, and
variations that tell a story. During this stage, you might use data mining to
discover patterns within databases or data visualization software to help
transform data into an easy-to-understand graphical format.
5. Interpret the results of your analysis to see how well the data answered your
original question. What recommendations can you make based on the data?
What are the limitations to your conclusions?
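The cleaning and analysis steps above can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed method: the records, field names, and function names are hypothetical, and dropping rows with a missing amount is only one of several possible ways to handle incomplete data.

```python
# Hypothetical raw records (illustrative only) with typical quality problems:
# stray white space, inconsistent capitalization, a duplicate, a missing value.
raw_records = [
    {"customer": " Alice ", "region": "north", "amount": "100"},
    {"customer": "Bob", "region": "North", "amount": "250"},
    {"customer": "alice", "region": "NORTH", "amount": "100"},  # duplicate once cleaned
    {"customer": "Carol", "region": "south ", "amount": ""},    # missing amount
    {"customer": "Dan", "region": "South", "amount": "175"},
]


def clean(records):
    """Step 3: standardize format, drop incomplete rows, and remove duplicates."""
    cleaned = []
    seen = set()
    for record in records:
        customer = record["customer"].strip().title()
        region = record["region"].strip().title()
        amount_text = record["amount"].strip()
        if not amount_text:
            continue  # assumption: rows without an amount are discarded
        row = (customer, region, float(amount_text))
        if row in seen:
            continue  # skip exact duplicates after standardization
        seen.add(row)
        cleaned.append({"customer": customer, "region": region, "amount": row[2]})
    return cleaned


def total_by_region(records):
    """Step 4: a simple aggregation that reveals a pattern across regions."""
    totals = {}
    for record in records:
        region = record["region"]
        totals[region] = totals.get(region, 0.0) + record["amount"]
    return totals


cleaned_records = clean(raw_records)
region_totals = total_by_region(cleaned_records)

print(f"{len(raw_records)} raw rows -> {len(cleaned_records)} clean rows")
print(region_totals)
```

Interpreting the totals (step 5) still requires judgment. For example, the discarded row means the South total may be understated, which is a limitation worth reporting alongside the result.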