Thanks to visit codestin.com
Credit goes to www.scribd.com

0% found this document useful (0 votes)
8 views62 pages

Lecture (1) Chapter

The document provides an overview of applied business statistics, focusing on the importance of data in decision-making and the use of SPSS software for statistical analysis. It covers various statistical concepts, types of data, sampling methods, and the applications of statistics in fields such as business, economics, and healthcare. Additionally, it distinguishes between data analysis and data science, highlighting their methodologies and tools.

Uploaded by

kerbrosea
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
8 views62 pages

Lecture (1) Chapter

The document provides an overview of applied business statistics, focusing on the importance of data in decision-making and the use of SPSS software for statistical analysis. It covers various statistical concepts, types of data, sampling methods, and the applications of statistics in fields such as business, economics, and healthcare. Additionally, it distinguishes between data analysis and data science, highlighting their methodologies and tools.

Uploaded by

kerbrosea
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 62

Applied Business Statistics

Master of Business Administration


( MBA )

Dr. Hosni Elwakeel


Business Analytics in the
Age of Generative AI
Data is a critical component of
decision making, helping businesses
and organizations gain key insights
and understand the implications of
their decisions at a granular level.

Visual analytics, in the form of


interactive dashboards and
visualizations, are essential tools for
anyone - from students to CEOs -
who needs to analyze data and tell
stories with data
What is SPSS? Definition, Features, Types, and Use Case

SPSS Statistics is a statistical software suite developed by IBM for


data management, advanced analytics, multivariate analysis,
business intelligence, and criminal investigation.

Long produced by SPSS Inc. it was acquired by IBM in 2009. Versions


of the software released since 2015 have the brand name IBM SPSS
Statistics.

The software name originally stood for Statistical Package for the Social
Sciences (SPSS),reflecting the original market, then later changed
to Statistical Product and Service Solutions.
What are the Core Features of SPSS?

The core functionalities offered in SPSS are:

➢ Statistical program for quantitative data analysis – It includes


frequencies, cross-tabulation, and bivariate statistics.

➢ Modeler program that allows for predictive modeling. It enables


researchers to build and validate predictive models using advanced
statistical procedures.

➢ Text analysis helps you derive insights from qualitative inputs through
open-ended questionnaires.

➢ Visualization Designer allows researchers to use their data for a variety of


visual representations.
Is SPSS Enough for Data Analysis

From basic descriptive statistics to advanced multivariate


analysis, SPSS can handle various data analysis tasks, making it
a valuable tool for researchers and analysts.
Conclusion

SPSS helps scientists, researchers, business analysts, managers, and


marketers during the entire analytics process, starting from planning, data
collection, data analysis, report creation, and deployment.

No matter what your business objectives are if you have a bunch of data that
you want to analyze.

SPSS is one of the best statistical analysis tools that you can use.
Chapter 1

Introduction To Statistics
What is meant by Statistics?
The word statistics is used to describe the collection , organization
( representation or visualization of data in tables and graphs ) then
interpretation of data ( make numerical measures ) that lead to conclusions
( make decision ).

Organization of
Collection of Numerical
Data Decision
Data Measures
(Tables,Graphs)
Types of Data ( variables )

QUALITATIVE
- Are categorical data that measure attributes.
- Each individual falls in one and only one category.
- Categories are mutually exclusive.
Example :
Gender , social status .

QUANTITATIVE
- Assume only numerical values and it has two types .

Discrete Continuous
No decimal and its
Uncountable values
values are countable
EX: Time , weight
EX: number of children
Types of Data ( variables )

Discrete Continuous
No decimal and its
Uncountable values
values are countable
EX: Time , weight
EX: number of children
Sources of Data ( variables )

Two sources of data


Primary source
when data is collected for the first time for the research purpose .
Data may be collected via surveys, laboratory experiments and observations .

➢ A survey : is to let some individuals answer a set of questions, called “Questionnaire”


Collected answers constitute the data to be analyzed .

➢ Laboratory experiments: such as results obtained in pharmacological , medical,


agriculture research .

➢ Observations : is used to collect data by observing, for example, the behavior of some
individuals .
Sources of Data ( variables )

Secondary source

When data has been collected by others and for different reasons .
This type of data includes data collected via :
➢ any site on the internet.
➢ journals.
➢ books.

When data id downloaded and saved, it is called “ Soft Copy”, but when
same data is printed , it is called “ hard Copy”.
Sources of Data ( variables )

There are several International Secondary sources, to name a few :

➢ CAPMAS: Central Agency for Public Mobilization and Statistics : Official data source in
Egypt.

➢ ILO : International Labor organization.

➢ WHO: World Health Organization.

➢ UNESCO: United Nations for Education, Science, and Culture Organization.


Some Definitions

Population:
It is the set of all items, objects or individuals of interest .
EX:
- All students in AAST.
- All licensed private cars in Cairo.

Sample:
Is a subset ( portion ) of the population.
EX:
- A few lamps selected for testing .
- 500 students selected randomly from a school.
Some Definitions
Census:
It is the process of recording ( collecting ) the entire set of measurements taken from the
whole population.

Sample survey:
It is the process of recording ( collecting ) the entire set of measurements taken from the
sample.

Parameter:
Is the numerical measure computed from the population.

Statistic:
Is the numerical measure computed from the sample data.
Branches of statistics

(a) Descriptive statistics : ‫احصاء وصفيه‬


Collection of
data

Organization of data
( table ,graphs)
using

Numerical Population or To take a


Which is true ( right )
measures sample decision
Branches of statistics

(b) inferential statistics : ‫احصاء استنتاجيه‬


Collection of
data

Organization of data
( table ,graphs)
using

To take a Which is not true


Numerical
Sample only decision about
measures
the population Probability of Errors
Branches of statistics

inferential statistics : ‫احصاء استنتاجيه‬

- The conclusion or making decisions concerning a population based only on sample data.

- Sample statistic used to estimate population parameter.


Branches of statistics
Branches of statistics
Difference between Data Analysis and Data
Science
Data analysis and data science are often used interchangeably, but they're not exactly the
same thing. Think of them as two overlapping circles, with some distinct differences.
Data Analysis
Data analysis is the process of extracting insights from data to answer specific questions or
solve a particular problem. It's a more focused, tactical approach that typically involves:

1.Descriptive analytics: Examining historical data to understand what happened.


2.Diagnostic analytics: Identifying why something happened.
3.Data visualization: Presenting findings in a clear, visual format.
Data analysts typically work with structured data, using tools like Excel, SQL, or statistical
software (e.g., SPSS,R, Python). Their primary goal is to provide actionable insights to
stakeholders, often to inform business decisions.
Data Science
Data science is a broader, more strategic field that encompasses data analysis, but also involves:
1.Predictive analytics: Using data to forecast what might happen in the future.
2.Prescriptive analytics: Recommending actions based on predictions.
3.Machine learning: Developing algorithms to improve model performance over time.
4.Data engineering: Building data pipelines and architectures to support large-scale data
processing.

Key differences
1.Scope: Data analysis is typically narrower in scope, focusing on specific problems or questions.
Data science has a broader scope, often involving multiple stakeholders and business units.
2.Methodologies: Data analysis tends to focus on descriptive and diagnostic analytics, while data
science incorporates predictive, prescriptive, and machine learning techniques.
3.Tools and techniques: Data analysts often use traditional statistical software, while data
scientists work with a wider range of tools, including big data platforms and machine learning
frameworks.
4.Deliverables: Data analysts typically deliver reports, dashboards, or visualizations. Data scientists
often deliver models, algorithms, or entire data products.
Data measurement levels
INTERVAL RATIO
NOMINAL ORDINAL Data with ranking
Data with ranking
and measurement and measurement
Data with NO Data with ranking ,
ranking , NO NO measurement EX: EX:
measurement Temperature Weight , distance

EX: EX: For quantitative For quantitative


Names, state of Military ranks , Data data
birth, colours levels of education
Zero does not Zero value means
For qualitative data For qualitative data mean NONE NONE
Data measurement levels

Qualitative Quantitative

Scale
True/ False questions

check if each of the following statements is True (T) or False (F) with justification :

1. There is no difference between descriptive statistics and inferential statistics.

2. Survey results are considered primary data .

3. When collected data about all new graduates, data is considered population data.

4. The different categories of banks (National, Foreign and joint ) are an example of discrete
variables.

5. CAPMAS provides researchers with secondary data.


Non
Random
Random
Sample
Sample
Random Sample
Simple random sampling (SRS) :

➢ Each element has the same chance of being selected.


➢ The results are easily analyzed and biased results are
avoided.
➢ The randomness of the selection can result in a sample that
doesn’t reflect the population.

This can be done in one of two ways:

➢ The lottery : in the lottery method, you choose the sample


at random by “drawing from a hat” .

➢ Use a table of random numbers.


➢ To obtain a representative sample from a population that is
homogeneous.

➢ For some situations, the most practical way to sample is to


choose every Kth item from a list. This type of sampling is
called systematic sampling.

➢ Systematic sampling, therefore, only selects the first unit


randomly, while the remaining units are chosen at fixed
intervals.
N 2500
➢ Let N=2500 n=250 K = = = 10
𝑛 250
selects the first unit randomly = 8 (from 1 to 10 )
The sample is 8,18,28 ……….,2498
- Value of any number = first unit + k(n-1)
- Where n is the order of the number
➢ To obtain a representative sample from a population that
is not homogeneous, a stratified sampling technique is
generally applied.

➢ The stratified sampling method breaks the population into


sub-populations greater in homogeneity than the total
population (the different sub-populations are called strata)
and then we take items from each stratum to constitute a
sample.

Let N=2500 N1=1500 Male N2=1000 Female n = 250


𝑛 250
P= = = 0.1
𝑁 2500
From males we select 1500*0.1 = 150
From females we select 1000*0.1 = 100
Total sample size = 250
➢ To take a sample from a large area, it is convenient to divide
the area into small, non-overlapping bits (usually called
clusters).
➢ In cluster sampling, the total population is divided into a
number of relatively small subdivisions, which themselves
are clusters of even smaller units, and then some of these
clusters are selected at random for inclusion in the sample
as a whole.
Non Random Sample
Judgement ➢ With this method, sampling is done based on previous ideas of
sampling population composition and behavior.

➢ An expert with knowledge of the population decides which units


in the population should be sampled.

➢ In other words, the expert purposely selects what is considered


to be a representative sample.

➢ large biases can be introduced .

➢ However, it can be useful in exploratory studies, for example in


selecting members for focus groups or in-depth interviews to test
specific aspects of a questionnaire.
Quota sampling
➢ This is one of the most common forms of non-probability
sampling.

➢ For example, if there are 100 men and 100 women in the
population and a sample of 20 are to be drawn, 10 men and 10
women may be interviewed.

➢ Quota sampling is somewhat similar to stratified sampling,


However, it differs in how the units are selected.

➢ In probability sampling, the units are selected randomly while in


quota sampling a non-random method is used
Convenience
sampling ➢ Convenience sampling involves using respondents who are
“convenient” to the researcher.

Convenience sampling method: How and when to use it?

➢ Having a low budget to conduct research or budgetary


constraints.

➢ You don't have access to the full target population for a


representative sample.
Applications of Statistics in Medical Field
Applications of Statistics in Economics
Applications of Statistics in Agriculture
Applications of Statistics in Machine Learning
What is geostatistics?
Applications of Statistics in Military Field
Applications of Statistics in Business
Data analyst in Oil and Gas Field
What data analyst do in oil
and gas?

The Data Analyst


is responsible for
collecting and analyzing
data to create reports or
data models to provide
insights for business
decision making.
Oil-Gas Industry and Big Data Analytics:
How Data Analytics is Impacting Oil & Gas Industry

The intelligent use of big data analytics in the oil


and gas industry can help perform various
operations such as seismic processing, reservoir
modeling, and sensor calibration that aid in
different stages of the oil exploration, production,
transportation, and delivery processes.
Key impacts include:
Exploration and Production

➢ Big data improves predictions about oil reserves,


reducing exploration costs and increasing drilling
efficiency.
➢ Predictive Maintenance: Analyzing equipment data
allows companies to predict and prevent failures,
reducing downtime and maintenance expenses.
Top Ten Oil-Producing Countries (2020)

1.United States: 744.7 million metric tons

2.Russia: 512.8 million metric tons

3.Saudi Arabia: 500.7 million metric tons

4.Canada: 253.3 million metric tons

5.Iraq: 202 million metric tons

6.China: 195 million metric tons

7.United Arab Emirates: 165.6 million metric tons

8.Brazil: 159.2 million metric tons

9.Iran: 142.7 million metric tons

10.Kuwait: 130.1 million metric tons


Natural Gas Strategist Engineer
Key Responsibilities

As a successful candidate you will be required to perform the following:

•Collect, process, and analyze data and information on the natural gas industry, using various sources and
methods such as databases, reports, surveys, models, and software.
•Monitor and forecast the trends and dynamics of the natural gas market, including supply and demand drivers,
price movements, market structure, competition, energy transition and regulatory changes.
•Evaluate the economic feasibility and profitability of potential natural gas projects, such as exploration,
production, transportation, storage, distribution, export and generate net present value, internal rate of return
and other economic yardsticks
•Conduct cost-benefit analysis and risk assessment of various natural gas scenarios and strategies, considering
technical, environmental, social, and political factors.
•Prepare and present clear and concise reports, charts, tables, dashboards and presentations to communicate
the results of the economic analysis and research to internal and external stakeholders.
•Collaborate with other economists, analysts, engineers, consultants, and experts in the natural gas industry to
exchange information and share best practices.
The Importance of Statistics in The Academic Research

Chapter 1 Research Framework

Chapter 2 Literature Review

Chapter 3 Research Methodology

Chapter 4 Research Results

Chapter 5 Discussion,conclutions and recommendations


Statistical Analysis software

SPSS : Statistical Package for the Social Science.

You might also like