Basic Statistics For Business & Economics - 10th-CH1
Basic Statistics For Business & Economics - 10th-CH1
What Is Statistics? 1
image
Kelvin Wong/Shutterstock
LEARNING OBJECTIVES
When you have completed this chapter, you will be able to:
LO1-1 Explain why knowledge of statistics is important.
LO1-2 Define statistics and provide an example of how statistics is
applied.
LO1-3 Differentiate between descriptive and inferential statistics.
LO1-4 Classify variables as qualitative or quantitative, and discrete or
continuous.
LO1-5 Distinguish among nominal, ordinal, interval, and ratio levels of
measurement.
LO1-6 List the values associated with the practice of statistics.
page 2
INTRODUCTION
Suppose you work for a large company and your supervisor asks you to
decide if a new version of a smartphone should be produced and sold. You
start by thinking about the product’s innovations and new features. Then,
you stop and realize the consequences of the decision. The product will
need to make a profit, so the pricing and the costs of production and
distribution are all very important. The decision to introduce the product is
based on many alternatives. So how will you know? Where do you start?
image
Gregor Schuster/Getty Images
Without experience in the industry, beginning to develop an intelligence
that will make you an expert is essential. You select three other people to
work with and meet with them. The conversation focuses on what you need
to know and what information and data you need. In your meeting, many
questions are asked. How many competitors are already in the market? How
are smartphones priced? What design features do competitors’ products
have? What features does the market require? What do customers want in a
smartphone? What do customers like about the existing products? The
answers will be based on business intelligence consisting of data and
information collected through customer surveys, engineering analysis, and
market research. In the end, your presentation to support your decision
regarding the introduction of a new smartphone is based on the statistics
that you use to summarize and organize your data, the statistics that you use
to compare the new product to existing products, and the statistics to
estimate future sales, costs, and revenues. The statistics will be the focus of
the conversation that you will have with your supervisor about this very
important decision.
As a decision maker, you will need to acquire and analyze data to
support your decisions. The purpose of this text is to develop your
knowledge of basic statistical techniques and methods and how to apply
them to develop the business and personal intelligence that will help you
make decisions.
image
Courtesy of Domo, Inc.
If you look through your university catalogue, you will find that statistics is
required for many college programs. As you investigate a future career in
accounting, economics, human resources, finance, business analytics, or
other business area, you also will discover that statistics is required as part
of these college programs. So why is statistics a requirement in so many
disciplines?
A major driver of the requirement for statistics knowledge is the
technologies available for capturing data. Examples include the technology
that Google uses to track how Internet users access websites. As people use
Google to search the Internet, Google records every search and then uses
these data to sort and prioritize the results for future Internet searches. One
recent estimate indicates that Google processes 20,000 terabytes of
information per day. Big-box retailers like Target, Walmart, Kroger, and
others scan every purchase and use the data to manage the distribution of
products, to make decisions about marketing and sales, and to track daily
and even hourly sales. Police departments collect and use data to provide
city residents with maps that communicate information about crimes
committed and their location. Every organization is collecting and using
data to develop knowledge and intelligence that will help people make
informed decisions, and track the implementation of their decisions. The
graphic to the left shows the amount of data generated every minute
(www.domo.com). A good working knowledge of statistics is useful for
summarizing and organizing data to provide information that is useful and
supportive of decision making. Statistics is used to make valid comparisons
and to predict the outcomes of decisions.
In summary, there are at least three reasons for studying statistics: (1)
data are collected everywhere and require statistical knowledge to make the
information useful, (2) statistical techniques are used to make page 3
professional and personal decisions, and (3) no matter what your
career, you will need a knowledge of statistics to understand the world and
to be conversant in your career. An understanding of statistics and statistical
methods will help you make more effective personal and professional
decisions.
image
STATISTICS IN ACTION
This question can be rephrased in two, subtly different ways: what are
statistics and what is statistics? To answer the first question, a statistic is a
number used to communicate a piece of information. Examples of statistics
are:
• The inflation rate is 2.3%.
• Your grade point average is 3.5.
• The price of a new Tesla Model S sedan is $81,190.
Each of these statistics is a numerical fact and communicates a very limited
piece of information that is not very useful by itself. However, if we
recognize that each of these statistics is part of a larger discussion, then the
question “what is statistics” is applicable. Statistics is the set of knowledge
and skills used to organize, summarize, and analyze data. The results of
statistical analysis will start interesting conversations in the search for
knowledge and intelligence that will help us make decisions. For example:
• The inflation rate for the calendar year was 2.3%. By applying
statistics we could compare this year’s inflation rate to the past
observations of inflation. Is it higher, lower, or about the same? Is
there a trend of increasing or decreasing inflation? Is there a
relationship between interest rates and government bonds?
• Your grade point average (GPA) is 3.5. By collecting data and
applying statistics, you can determine the required GPA to be admitted
to the Master of Business Administration program at the University of
Chicago, Harvard, or the University of Michigan. You can determine
the likelihood that you would be admitted to a particular program. You
may be interested in interviewing for a management position with
Procter & Gamble. What GPA does Procter & Gamble require for
college graduates with a bachelor’s degree? Is there a range of
acceptable GPAs?
• You are budgeting for a new car. You would like to own an electric car
with a small carbon footprint. The price, according to Kelley Blue
Book, for a 2020 Tesla Model S sedan is $81,190. By collecting
additional data and applying statistics, you can analyze the
alternatives. For example, another choice is a hybrid car that runs on
both gas and electricity. A 2020 Toyota Prius Prime can be purchased
for about $27,750. Another hybrid, the 2020 Chevrolet Bolt EV, costs
$37,495. What are the differences in the cars’ specifications? What
additional information can be collected and summarized so that you
can make a good purchase decision?
Another example of using statistics to provide information to evaluate
decisions is the distribution and market share of Frito-Lay products. Data
are collected on each of the Frito-Lay product lines. These data include the
market share and the pounds of product sold. Statistics is used to present
this information in a bar chart in Chart 1–1. It clearly shows Frito-Lay’s
dominance in the potato, corn, and tortilla chip markets. It also shows the
absolute measure of pounds of each product line consumed in the United
States.
These examples show that statistics is more than the presentation of
numerical information. Statistics is about collecting and processing
information to create a conversation, to stimulate additional questions, and
to provide a basis for making decisions. Specifically, we define statistics
as:
page 4
image
CHART 1–1 Frito-Lay Volume and Share of Major Snack Chip
Categories in U.S. Supermarkets
In this book, you will learn the basic techniques and applications of
statistics that you can use to support your decisions, both personal and
professional. To start, we will differentiate between descriptive and
inferential statistics.
TYPES OF STATISTICS
LO1-3
Differentiate between descriptive and inferential statistics.
Descriptive Statistics
Masses of unorganized data—such as the census of population, the weekly
earnings of thousands of computer programmers, and the individual
responses of 2,000 registered voters regarding their choice for president of
the United States—are of little value as is. However, descriptive statistics
can be used to organize data into a meaningful form. We define descriptive
statistics as:
page 5
Inferential Statistics
Sometimes we must make decisions based on a limited set of data. For
example, we would like to know the operating characteristics, such as fuel
efficiency measured by miles per gallon, of sport utility vehicles (SUVs)
currently in use. If we spent a lot of time, money, and effort, all the owners
of SUVs could be surveyed. In this case, our goal would be to survey the
population of SUV owners.
image
STATISTICS IN ACTION
Where did statistics get its start? In 1662 John Graunt published an
article called “Natural and Political Observations Made upon Bills of
Mortality.” The author’s “observations” were the result of a study
and analysis of a weekly church publication called “Bill of
Mortality,” which listed births, christenings, and deaths and their
causes. Graunt realized that the Bills of Mortality represented only a
fraction of all births and deaths in London. However, he used the
data to reach broad conclusions or inferences about the impact of
disease, such as the plague, on the general population. His logic is an
example of statistical inference. His analysis and interpretation of
the data are thought to mark the start of statistics.
image
So, the sample of six SUVs represents evidence from the population that we
use to reach an inference or conclusion about the average MPG for all
SUVs. The process of sampling from a population with the objective of
estimating properties of a population is called inferential statistics.
SELF-REVIEW 1–1
image
The answers are in Appendix D.
TYPES OF VARIABLES
LO1-4
Classify variables as qualitative or quantitative, and discrete or continuous.
There are two basic types of variables: (1) qualitative and (2) quantitative
(see Chart 1–2). When an object or individual is observed and recorded as a
nonnumeric characteristic, it is a qualitative variable or an attribute.
Examples of qualitative variables are gender, beverage preference, type of
vehicle owned, state of birth, and eye color. When a variable is qualitative,
we usually count the number of observations for each category and
determine what percent are in each category. For example, if we observe
variable eye color, what percent of the population has blue eyes and what
percent has brown eyes? If the variable is type of vehicle, what percent of
the total number of cars sold last month were SUVs? Qualitative variables
are often summarized in charts and bar graphs (Chapter 2).
image
LEVELS OF MEASUREMENT
LO1-5
Distinguish among nominal, ordinal, interval, and ratio levels of
measurement.
image
Ron Buskirk/Alamy Stock Photo
There are four levels of measurement: nominal, ordinal, interval, and
ratio. The lowest, or the most primitive, measurement is the nominal level.
The highest is the ratio level of measurement.
Nominal-Level Data
For the nominal level of measurement, observations of a qualitative
variable are measured and recorded as labels or names. The labels or names
can only be classified and counted. There is no particular order to the labels.
page 8
Ordinal-Level Data
The next higher level of measurement is the ordinal level. For this level of
measurement a qualitative variable or attribute is either ranked or rated on a
relative scale.
ORDINAL LEVEL OF MEASUREMENT Data recorded at the
ordinal level of measurement is based on a relative ranking or rating of
items based on a defined attribute or qualitative variable. Variables
based on this level of measurement are only ranked or counted.
1. Tennessee
2. Virginia
3. Alabama
4. Texas
5. Utah
6. Indiana
7. South Carolina
8. Florida
9. Nevada
10. Arkansas
image
Why is the “size” scale an interval measurement? Observe that as the size
changes by two units (say from size 10 to size 12 or from size 24 to size
26), each of the measurements increases by 2 inches. To put it another way,
the intervals are the same.
There is no natural zero point for dress size. A “size 0” dress does not
have “zero” material. Instead, it would have a 24-inch bust, 16-inch waist,
and 27-inch hips. Moreover, the ratios are not reasonable. If you divide a
size 28 by a size 14, you do not get the same answer as dividing a size 20
by a size 10. Neither ratio is equal to two, as the “size” number would
suggest. In short, if the distances between the numbers make sense, but the
ratios do not, then you have an interval scale of measurement.
Ratio-Level Data
Almost all quantitative variables are recorded on the ratio level of
measurement. The ratio level is the “highest” level of measurement. It has
all the characteristics of the interval level, but, in addition, the 0 point and
the ratio between two numbers are both meaningful.
page 11
image
SELF-REVIEW 1–2
image
(a) The mean age of people who listen to talk radio is 42.1 years. What level
of measurement is used to assess the variable age?
(b) In a survey of luxury-car owners, 8% of the U.S. population own luxury
cars. In California and Georgia, 14% of people own luxury cars. Two
variables are included in this information. What are they and how are
they measured?
EXERCISES
The answers to the odd-numbered exercises are in Appendix C.
1. What is the level of measurement for each of the following
variables?
a. Student IQ ratings.
b. Distance students travel to class.
c. The jersey numbers of a sorority soccer team.
d. A student’s state of birth.
e. A student’s academic class—that is, freshman, sophomore, junior,
or senior.
f. Number of hours students study per week.
2. Slate is a daily magazine on the Web. Its business activities can be
described by a number of variables. What is the level of
measurement for each of the following variables?
a. The number of hits on their website on Saturday between 8:00
a.m. and 9:00 a.m.
b. The departments, such as food and drink, politics, foreign policy,
sports, etc.
c. The number of weekly hits on the Sam’s Club ad.
d. The number of years each employee has been employed with
Slate.
3. On the Web, go to your favorite news source and find examples of
each type of variable. Write a brief memo that lists the variables and
describes them in terms of qualitative or quantitative, discrete or
continuous, and the measurement level.
4. For each of the following, determine whether the group is a sample
or a population.
a. The participants in a study of a new cholesterol drug.
b. The drivers who received a speeding ticket in Kansas City last
month.
c. People on welfare in Cook County (Chicago), Illinois.
d. The 30 stocks that make up the Dow Jones Industrial Average.
page 12
ETHICS AND STATISTICS
LO1-6
List the values associated with the practice of statistics.
(statements.cornell.edu/2018/20180920-statement-provost-michael-
kotlikoff.cfm)
Professor Wansink resigned from his position at Cornell. Note the series of
findings that describe the nature of unethical statistical practices. Clearly,
the unethical conduct of research has serious consequences for the
researcher and the consumers of the false research outcomes.
Another instance of unethical behavior occurred in 2008 and affected
the lives of thousands of people. Wall Street money manager Bernie Madoff
created a Ponzi scheme designed to swindle billions of dollars from
investors. His misrepresentation of financial information resulted in a
sentence of 150 years in jail. Financial misrepresentations by Enron, in
2001, and Tyco, in 2002, also underscore the need for business students to
understand that these events were based on the misrepresentation of
financial information. In each case, people within each organization
reported financial information to investors that indicated the companies
were performing much better than they actually were. When the true
financial information was reported, the companies were worth much less
than advertised. The result was many investors lost all or nearly all the
money they had invested.
In April 2018, the American Statistical Association approved “Ethical
Guidelines for Statistical Practice” (https://www.amstat.org/ASA/Your-
Career/Ethical-Guidelines-for-Statistical-Practice.aspx). The
Association advises us to practice statistics with integrity and honesty and
urges us to “do the right thing” when collecting, organizing, summarizing,
analyzing, and interpreting data and information. The real contribution of
statistics to society is a moral one. Financial analysts need to provide
information that truly reflects a company’s performance so as not to mislead
individual investors. Information regarding product defects that may be
harmful to people must be analyzed and reported with integrity and honesty.
The guidelines further indicate that when we practice statistics, we need to
maintain an independent and principled point of view when analyzing and
reporting findings and results.
As you progress through this text, we will highlight ethical issues in the
collection, analysis, presentation, and interpretation of statistical
information. We also hope that as you learn about using statistics, you will
become a more informed consumer of information. For example, you will
question a report if the data do not fairly represent the population, if the
report does not include all relevant statistics, or if the presentation of the
report does not include the limitations of the statistical analyses and
possible sources of error.
image
Source: Microsoft Excel
image
Tutorial
#20 in
Connect
Throughout the text, we will encourage the use of computer software to
summarize, describe, and present information and data. To demonstrate the
application of statistical analysis in Excel, Connect includes tutorials
referenced with the icon shown in the left margin. In addition, Connect also
includes data files for many exercises that can be opened directly with
statistical software and spreadsheets.
page 14
CHAPTER SUMMARY
CHAPTER EXERCISES
image
image
14. Using data from such publications as the Statistical Abstract of the
United States, Forbes, or any news source, give examples of
variables measured with nominal, ordinal, interval, and ratio
scales.
15. The Struthers Wells Corporation employs more than 10,000 white-
collar workers in its sales offices and manufacturing facilities in
the United States, Europe, and Asia. A sample of 300 U.S.
workers revealed 120 would accept a transfer to a location outside
the United States. On the basis of these findings, write a brief
memo to Ms. Wanda Carter, Vice President of Human Services,
regarding all white-collar workers in the firm and their willingness
to relocate.
16. AVX Home Entertainment Inc. recently began a “no-hassles”
return policy. A sample of 500 customers who recently returned
items showed 400 thought the policy was fair, 32 thought it took
too long to complete the transaction, and the rest had no opinion.
On the basis of this information, make an inference about
customer reaction to the new policy.
17. image MarkLines is an online portal (www.marklines.com) that
reports automotive industry data. The table below shows the
number of cars sold in the United States in 2018 and 2019. The
top 22 manufacturers, plus others, are listed here. Sales data often
are reported in this way to compare current sales to last year’s
sales.
This image icon indicates that the data are available in Connect.
You will be able to download the data directly into statistical
software from this site.
Car Sales
Manufacturer Jan.– Jan.–
Dec.2019 Dec.2018
GM (Est.) 2,877,590 2,951,200
Ford 2,406,188 2,485,222
Toyota 2,383,349 2,426,674
FCA (Est.) 2,203,663 2,235,204
Honda 1,608,170 1,604,828
Nissan 1,345,681 1,493,877
Hyundai 710,007 677,946
Subaru 700,117 680,135
Kia 615,338 589,673
VW 363,322 354,064
Mercedes (includes 357,730 354,137
Sprinter)
BMW 324,826 311,014
Mazda 278,552 300,325
Audi 224,111 223,323
Tesla (Est.) 178,950 126,150
Mitsubishi 121,046 118,074
Volvo 108,234 98,263
Land Rover 94,736 92,143
Porsche 61,568 57,202
Car Sales
Manufacturer Jan.– Jan.–
Dec.2019 Dec.2018
MINI 36,092 43,684
Jaguar 31,051 30,483
Others 16,724 19,346
smart 680 1,276
page 16
DATA ANALYTICS
20. image Refer to the North Valley Real Estate data, which report
information on homes sold in the area last year. Consider the
following variables: selling price, number of bedrooms, township,
and mortgage type.
a. Which of the variables are qualitative and which are
quantitative?
b. How is each variable measured? Determine the level of
measurement for each of the variables.
21. image Refer to the Baseball 2019 data, which report information
on the 30 Major League Baseball teams for the 2019 season.
Consider the following variables: number of wins, payroll, season
attendance, whether the team is in the American or National
League, and the number of home runs hit.
a. Which of these variables are quantitative and which are
qualitative?
b. Determine the level of measurement for each of the variables.
22. image Refer to the Lincolnville School District bus data, which
report information on the school district’s bus fleet.
a. Which of the variables are qualitative and which are
quantitative?
b. Determine the level of measurement for each variable.
PRACTICE TEST
There is a practice test at the end of each chapter. The tests are in two
parts. The first part includes 10 to 15 objective questions, usually in a
fill-in-the-blank format. The second part includes problems. In most
cases, it should take 30 to 45 minutes to complete the test. The
problems will require a calculator. Check your answers against those
provided in Appendix C in the back of the book.
Part 1—Objective
1. The science of collecting, organizing, presenting, analyzing, and
interpreting data to assist in making more effective decisions is
referred to as _______.
2. Methods of organizing, summarizing, and presenting data in an
enlightening way are called _______.
3. The methods used to estimate a value of a population on page 18
the basis of a sample are called _______.
4. A portion, or part, of the group of interest is referred to as a
_______.
5. The entire set of individuals or objects of interest or the
measurements obtained from all individuals or objects of interest
is known as a _______.
6. With the _______ level of measurement, the data are sorted into
categories with no particular order to the categories.
7. The _______ level of measurement has a significant zero point.
8. The _______ level of measurement presumes that one
classification is ranked higher than another.
9. The _______ level of measurement has the characteristic that the
distance between values is a constant size.
10. Is the number of bedrooms in a house a discrete or continuous
variable? _______.
11. The jersey numbers on baseball uniforms are an example of the
_______ level of measurement.
12. What level of measurement is used when students are classified by
eye color? _______.
Part 2—Problems
1. Thirty million pounds of snack food were eaten during a recent
Super Bowl Sunday. The chart below describes this information.
image