Applied Business Statistics
Master of Business Administration
( MBA )
Dr. Hosni Elwakeel
Business Analytics in the
Age of Generative AI
Data is a critical component of
decision making, helping businesses
and organizations gain key insights
and understand the implications of
their decisions at a granular level.
Visual analytics, in the form of
interactive dashboards and
visualizations, are essential tools for
anyone - from students to CEOs -
who needs to analyze data and tell
stories with data
What is SPSS? Definition, Features, Types, and Use Case
SPSS Statistics is a statistical software suite developed by IBM for
data management, advanced analytics, multivariate analysis,
business intelligence, and criminal investigation.
Long produced by SPSS Inc. it was acquired by IBM in 2009. Versions
of the software released since 2015 have the brand name IBM SPSS
Statistics.
The software name originally stood for Statistical Package for the Social
Sciences (SPSS),reflecting the original market, then later changed
to Statistical Product and Service Solutions.
What are the Core Features of SPSS?
The core functionalities offered in SPSS are:
➢ Statistical program for quantitative data analysis – It includes
frequencies, cross-tabulation, and bivariate statistics.
➢ Modeler program that allows for predictive modeling. It enables
researchers to build and validate predictive models using advanced
statistical procedures.
➢ Text analysis helps you derive insights from qualitative inputs through
open-ended questionnaires.
➢ Visualization Designer allows researchers to use their data for a variety of
visual representations.
Is SPSS Enough for Data Analysis
From basic descriptive statistics to advanced multivariate
analysis, SPSS can handle various data analysis tasks, making it
a valuable tool for researchers and analysts.
Conclusion
SPSS helps scientists, researchers, business analysts, managers, and
marketers during the entire analytics process, starting from planning, data
collection, data analysis, report creation, and deployment.
No matter what your business objectives are if you have a bunch of data that
you want to analyze.
SPSS is one of the best statistical analysis tools that you can use.
Chapter 1
Introduction To Statistics
What is meant by Statistics?
The word statistics is used to describe the collection , organization
( representation or visualization of data in tables and graphs ) then
interpretation of data ( make numerical measures ) that lead to conclusions
( make decision ).
Organization of
Collection of Numerical
Data Decision
Data Measures
(Tables,Graphs)
Types of Data ( variables )
QUALITATIVE
- Are categorical data that measure attributes.
- Each individual falls in one and only one category.
- Categories are mutually exclusive.
Example :
Gender , social status .
QUANTITATIVE
- Assume only numerical values and it has two types .
Discrete Continuous
No decimal and its
Uncountable values
values are countable
EX: Time , weight
EX: number of children
Types of Data ( variables )
Discrete Continuous
No decimal and its
Uncountable values
values are countable
EX: Time , weight
EX: number of children
Sources of Data ( variables )
Two sources of data
Primary source
when data is collected for the first time for the research purpose .
Data may be collected via surveys, laboratory experiments and observations .
➢ A survey : is to let some individuals answer a set of questions, called “Questionnaire”
Collected answers constitute the data to be analyzed .
➢ Laboratory experiments: such as results obtained in pharmacological , medical,
agriculture research .
➢ Observations : is used to collect data by observing, for example, the behavior of some
individuals .
Sources of Data ( variables )
Secondary source
When data has been collected by others and for different reasons .
This type of data includes data collected via :
➢ any site on the internet.
➢ journals.
➢ books.
When data id downloaded and saved, it is called “ Soft Copy”, but when
same data is printed , it is called “ hard Copy”.
Sources of Data ( variables )
There are several International Secondary sources, to name a few :
➢ CAPMAS: Central Agency for Public Mobilization and Statistics : Official data source in
Egypt.
➢ ILO : International Labor organization.
➢ WHO: World Health Organization.
➢ UNESCO: United Nations for Education, Science, and Culture Organization.
Some Definitions
Population:
It is the set of all items, objects or individuals of interest .
EX:
- All students in AAST.
- All licensed private cars in Cairo.
Sample:
Is a subset ( portion ) of the population.
EX:
- A few lamps selected for testing .
- 500 students selected randomly from a school.
Some Definitions
Census:
It is the process of recording ( collecting ) the entire set of measurements taken from the
whole population.
Sample survey:
It is the process of recording ( collecting ) the entire set of measurements taken from the
sample.
Parameter:
Is the numerical measure computed from the population.
Statistic:
Is the numerical measure computed from the sample data.
Branches of statistics
(a) Descriptive statistics : احصاء وصفيه
Collection of
data
Organization of data
( table ,graphs)
using
Numerical Population or To take a
Which is true ( right )
measures sample decision
Branches of statistics
(b) inferential statistics : احصاء استنتاجيه
Collection of
data
Organization of data
( table ,graphs)
using
To take a Which is not true
Numerical
Sample only decision about
measures
the population Probability of Errors
Branches of statistics
inferential statistics : احصاء استنتاجيه
- The conclusion or making decisions concerning a population based only on sample data.
- Sample statistic used to estimate population parameter.
Branches of statistics
Branches of statistics
Difference between Data Analysis and Data
Science
Data analysis and data science are often used interchangeably, but they're not exactly the
same thing. Think of them as two overlapping circles, with some distinct differences.
Data Analysis
Data analysis is the process of extracting insights from data to answer specific questions or
solve a particular problem. It's a more focused, tactical approach that typically involves:
1.Descriptive analytics: Examining historical data to understand what happened.
2.Diagnostic analytics: Identifying why something happened.
3.Data visualization: Presenting findings in a clear, visual format.
Data analysts typically work with structured data, using tools like Excel, SQL, or statistical
software (e.g., SPSS,R, Python). Their primary goal is to provide actionable insights to
stakeholders, often to inform business decisions.
Data Science
Data science is a broader, more strategic field that encompasses data analysis, but also involves:
1.Predictive analytics: Using data to forecast what might happen in the future.
2.Prescriptive analytics: Recommending actions based on predictions.
3.Machine learning: Developing algorithms to improve model performance over time.
4.Data engineering: Building data pipelines and architectures to support large-scale data
processing.
Key differences
1.Scope: Data analysis is typically narrower in scope, focusing on specific problems or questions.
Data science has a broader scope, often involving multiple stakeholders and business units.
2.Methodologies: Data analysis tends to focus on descriptive and diagnostic analytics, while data
science incorporates predictive, prescriptive, and machine learning techniques.
3.Tools and techniques: Data analysts often use traditional statistical software, while data
scientists work with a wider range of tools, including big data platforms and machine learning
frameworks.
4.Deliverables: Data analysts typically deliver reports, dashboards, or visualizations. Data scientists
often deliver models, algorithms, or entire data products.
Data measurement levels
INTERVAL RATIO
NOMINAL ORDINAL Data with ranking
Data with ranking
and measurement and measurement
Data with NO Data with ranking ,
ranking , NO NO measurement EX: EX:
measurement Temperature Weight , distance
EX: EX: For quantitative For quantitative
Names, state of Military ranks , Data data
birth, colours levels of education
Zero does not Zero value means
For qualitative data For qualitative data mean NONE NONE
Data measurement levels
Qualitative Quantitative
Scale
True/ False questions
check if each of the following statements is True (T) or False (F) with justification :
1. There is no difference between descriptive statistics and inferential statistics.
2. Survey results are considered primary data .
3. When collected data about all new graduates, data is considered population data.
4. The different categories of banks (National, Foreign and joint ) are an example of discrete
variables.
5. CAPMAS provides researchers with secondary data.
Non
Random
Random
Sample
Sample
Random Sample
Simple random sampling (SRS) :
➢ Each element has the same chance of being selected.
➢ The results are easily analyzed and biased results are
avoided.
➢ The randomness of the selection can result in a sample that
doesn’t reflect the population.
This can be done in one of two ways:
➢ The lottery : in the lottery method, you choose the sample
at random by “drawing from a hat” .
➢ Use a table of random numbers.
➢ To obtain a representative sample from a population that is
homogeneous.
➢ For some situations, the most practical way to sample is to
choose every Kth item from a list. This type of sampling is
called systematic sampling.
➢ Systematic sampling, therefore, only selects the first unit
randomly, while the remaining units are chosen at fixed
intervals.
N 2500
➢ Let N=2500 n=250 K = = = 10
𝑛 250
selects the first unit randomly = 8 (from 1 to 10 )
The sample is 8,18,28 ……….,2498
- Value of any number = first unit + k(n-1)
- Where n is the order of the number
➢ To obtain a representative sample from a population that
is not homogeneous, a stratified sampling technique is
generally applied.
➢ The stratified sampling method breaks the population into
sub-populations greater in homogeneity than the total
population (the different sub-populations are called strata)
and then we take items from each stratum to constitute a
sample.
Let N=2500 N1=1500 Male N2=1000 Female n = 250
𝑛 250
P= = = 0.1
𝑁 2500
From males we select 1500*0.1 = 150
From females we select 1000*0.1 = 100
Total sample size = 250
➢ To take a sample from a large area, it is convenient to divide
the area into small, non-overlapping bits (usually called
clusters).
➢ In cluster sampling, the total population is divided into a
number of relatively small subdivisions, which themselves
are clusters of even smaller units, and then some of these
clusters are selected at random for inclusion in the sample
as a whole.
Non Random Sample
Judgement ➢ With this method, sampling is done based on previous ideas of
sampling population composition and behavior.
➢ An expert with knowledge of the population decides which units
in the population should be sampled.
➢ In other words, the expert purposely selects what is considered
to be a representative sample.
➢ large biases can be introduced .
➢ However, it can be useful in exploratory studies, for example in
selecting members for focus groups or in-depth interviews to test
specific aspects of a questionnaire.
Quota sampling
➢ This is one of the most common forms of non-probability
sampling.
➢ For example, if there are 100 men and 100 women in the
population and a sample of 20 are to be drawn, 10 men and 10
women may be interviewed.
➢ Quota sampling is somewhat similar to stratified sampling,
However, it differs in how the units are selected.
➢ In probability sampling, the units are selected randomly while in
quota sampling a non-random method is used
Convenience
sampling ➢ Convenience sampling involves using respondents who are
“convenient” to the researcher.
Convenience sampling method: How and when to use it?
➢ Having a low budget to conduct research or budgetary
constraints.
➢ You don't have access to the full target population for a
representative sample.
Applications of Statistics in Medical Field
Applications of Statistics in Economics
Applications of Statistics in Agriculture
Applications of Statistics in Machine Learning
What is geostatistics?
Applications of Statistics in Military Field
Applications of Statistics in Business
Data analyst in Oil and Gas Field
What data analyst do in oil
and gas?
The Data Analyst
is responsible for
collecting and analyzing
data to create reports or
data models to provide
insights for business
decision making.
Oil-Gas Industry and Big Data Analytics:
How Data Analytics is Impacting Oil & Gas Industry
The intelligent use of big data analytics in the oil
and gas industry can help perform various
operations such as seismic processing, reservoir
modeling, and sensor calibration that aid in
different stages of the oil exploration, production,
transportation, and delivery processes.
Key impacts include:
Exploration and Production
➢ Big data improves predictions about oil reserves,
reducing exploration costs and increasing drilling
efficiency.
➢ Predictive Maintenance: Analyzing equipment data
allows companies to predict and prevent failures,
reducing downtime and maintenance expenses.
Top Ten Oil-Producing Countries (2020)
1.United States: 744.7 million metric tons
2.Russia: 512.8 million metric tons
3.Saudi Arabia: 500.7 million metric tons
4.Canada: 253.3 million metric tons
5.Iraq: 202 million metric tons
6.China: 195 million metric tons
7.United Arab Emirates: 165.6 million metric tons
8.Brazil: 159.2 million metric tons
9.Iran: 142.7 million metric tons
10.Kuwait: 130.1 million metric tons
Natural Gas Strategist Engineer
Key Responsibilities
As a successful candidate you will be required to perform the following:
•Collect, process, and analyze data and information on the natural gas industry, using various sources and
methods such as databases, reports, surveys, models, and software.
•Monitor and forecast the trends and dynamics of the natural gas market, including supply and demand drivers,
price movements, market structure, competition, energy transition and regulatory changes.
•Evaluate the economic feasibility and profitability of potential natural gas projects, such as exploration,
production, transportation, storage, distribution, export and generate net present value, internal rate of return
and other economic yardsticks
•Conduct cost-benefit analysis and risk assessment of various natural gas scenarios and strategies, considering
technical, environmental, social, and political factors.
•Prepare and present clear and concise reports, charts, tables, dashboards and presentations to communicate
the results of the economic analysis and research to internal and external stakeholders.
•Collaborate with other economists, analysts, engineers, consultants, and experts in the natural gas industry to
exchange information and share best practices.
The Importance of Statistics in The Academic Research
Chapter 1 Research Framework
Chapter 2 Literature Review
Chapter 3 Research Methodology
Chapter 4 Research Results
Chapter 5 Discussion,conclutions and recommendations
Statistical Analysis software
SPSS : Statistical Package for the Social Science.