Data Insight Overview
Data Insight section consists of 20 questions with time limit of 45 minutes. There are 5 types of
questions in this section:
1. Graphics Interpretation: Evaluating the ability to read and infer from graphical data.
2. Two-Part Analysis: Probing problem-solving skills through linked questions.
3. Table Analysis: Assessing the capacity to extract relevant information from sortable tables.
4. Multi-Source Reasoning: Evaluating the ability to integrate information from multiple sources.
5. Data Sufficiency: Testing quantitative reasoning and data adequacy assessment.
In this section, you will handle realistic numbers. Rather than seeing nice round, small numbers,
you’ll actually see things like 0.738 billion or have to divide by 1.25%. Therefore, an on-screen
calculator is provided for this module.
Statistical Terms Review
Average/ Mean/ Arithmetic Mean
The sum of 𝑛 number
Average (arithmetic mean) =
𝑛
6+4+7+10+4 31
Example: The average of 6, 4, 7, 10, and 4 is = = 6.2
5 5
Median
For sets containing an odd number of values, the median is the unique middle value when the
data are arranged in increasing (or decreasing) order.
For sets containing an even number of values, the median is the average of the two middle
values when the data are arranged in increasing (or decreasing) order.
Example 1: The median of {4, 4, 6, 7, 10} is 6
The median of {4, 4, 6, 7, 10, 12} is 6.5
Example 2: Consider a set of 1,000 numbers, in which:
+ 200 numbers from 1 to 2
+ 100 numbers from 2 to 3
+ 300 numbers from 3 to 4
+ 400 numbers from 4 to 5
In which range will the median of the set lie?
Range
Range is defined as the greatest value in the numerical data minus the least value.
Example: The range of {11, 10, 5, 13, 21} is 21 − 5 = 16
Mode
The mode of a list of numbers is the number that occurs most frequently, at least twice.
Example: The mode of {3, 4, 4, 6, 6, 6, 9, 9, 2} is 6
The modes of {3, 4, 4, 4, 6, 6, 6, 9, 9, 2} are 4 and 6
Set {1, 2, 3, 4, 6, 7, 11} has no mode
Standard Deviation
The more the data is spreaded away from the mean, the greater the standard deviation.
A small SD indicates that a set is clustered closely around the average (mean) value.
A large SD indicates that the set is spread out widely, with some points appearing far
from the mean.
Example: The standard deviation of {3, 4, 5} is smaller than that of {2, 4, 6}
Rounding to the nearest x
Rounding a value to the nearest x means finding the nearest multiple of x. Note that if a value is
at exactly midpoint of two multiples, we will take the higher multiple.
Example 1: Rounded to the nearest 5 percent: 11.2% → 10%; 24% → 25%; 32.5% → 35%
Example 2: Rounded to the nearest 10,000:
123,456 → 120,000
4,236,524 → 4,240,000
165,000 → 170,000
Positive/ Negative Correlation
Graphic Interpretation
1. Introduction
Graphic Interpretation presents information in a graph or pictures with descriptions
Charts can be line graph, bar chart, stacked bar chart, flowchart, scatterplots …
Test takers are given 2 incomplete statements, each contains a drop-down list, or 1 incomplete
statement with 2 drop-down lists
The task will always be the same for all Graphic Interpretation problems: Select answer from
each drop-down list to complete the statement(s)
2. General strategies
Step 1: Carefully study the chart & explanation before answering questions
Step 2: To check the understanding about the given chart, pick a random point on the chart and
explain the meaning of that data point
Step 3: Read the statement(s) and define which part of the chart need a closer look
Step 4: Investigate that part and answer question
3. Practice
Question 1
The graph shows the average mature tree heights of each of species A - C at each of the Tree
Farms W - Z.
Based on the information provided, select from each drop-down menu the option that creates the
most accurate statement.
For the 4 tree farms, the sum of the average mature heights of Species A, B, and C is greatest for
Tree Farm (A) ____.
For the 3 species, the standard deviation of the 4 average mature heights given on the graph is
greatest for Species (B) ____.
(A) W (B) A
X B
Y C
Z
Question 2
In Country Q, 6 different models of passenger automobiles are assembled. The graph shows the
number of each model assembled, in thousands, for each of the years 2006–2010.
Select the option from each drop-down menu that creates the most accurate statement based on
the information provided.
Rounded to the nearest 5 percent, the number of Model Y cars assembled in Country Q increased
by (A) ____ percent from 2006 to 2010.
From 2007 to 2008, the increase in the number of cars assembled in Country Q was greatest for
Model (B) ____.
(A) 35 (B) U
40 V
45 W
50 X
55 Y
60 Z
Question 3
The chart shows a firm's complete time-and-task schedule for the marketing team,
planning team, and development team involved in the launching of a new product.
Based on the information in the graph, select from each drop-down menu the option that
creates the most accurate statement.
To the nearest 5 percent, the number of weeks the development team has at least one
task scheduled is (A) percent of the total duration represented by the chart.
Week (B) is the only week during which three different tasks are
scheduled to begin.
(A) 35 (B) 6
40 7
50 8
55 9
60 10
Question 4
In the diagram, 14 nodes are represented by circles, and relationships among the nodes are
indicated by arrows. Each arrow points from one node (the parent node) to another node (the
child node). A positive numerical value is assigned to each node so that the value assigned to a
parent node is exactly double the value assigned to its child node. The nodes labeled A through E
are the principal nodes.
Use the drop-down menus to create the most accurate statement based on the information
provided.
The value assigned to the principal node labeled (A) ____ must be exactly double the value
assigned to the principal node labeled (B) ____.
(A) A (B) A
B B
C C
D D
E E
Two-Part Analysis
1. Introduction
Two-part Analysis presents a scenario and ask 2 linked questions. Your task is to find the correct
answer to each part, keeping in mind that the answers are interconnected.
These questions can be divided into 4 smaller types:
Quant-like
Critical Reasoning-like
Reading Comprehension-like
Logic game
2. General strategies
Investigate the scenario carefully. Usually the scenario is quite long, so it takes many practices to
familiarize with the complexity of scenarios in this question type.
Summarize the scenario by notes, formula, or flowchart
Usually, the scenario presents general rules, and the questions are specific cases. Match the cases
asked with the logic in general rules.
3. Practice
Question 1
For each positive integer 𝑛, the quantity 𝑠𝑛 is defined such that 𝑠𝑛+2 = (𝑠𝑛 )2 − 𝑠𝑛+1. In
addition, 𝑠2 = 1.
Select values for 𝑠1 and 𝑠4 that are jointly compatible with these conditions. Select only two
values, one in each column.
𝑠1 𝑠4
- 12
-7
-3
-1
0
Question 2
In an experiment, one thousand nine-year-old children were allowed to choose whether to
participate in a program in which the researchers taught them dance during the daily break from
their lessons. Four hundred of the children chose to participate for at least one year. At the end of
the year, researchers found that the children who had participated had significantly better
balance, on average, than those who had not. The researchers hypothesized that dancing resulted
in a sustained improvement in the children’s sense of balance.
It would be most helpful in evaluating the researcher’s hypothesis to know whether the
researchers ___1___ prior to having ___2___.
Select Researchers for the phrase that fills the blank labeled 1 in the given statement, and select
Prior to for the phrase that fills the blank labeled 2 in the given statement to create the most
accurate statement on the basis of the information provided. Make only two selections, one in
each column.
1 2
Researchers Prior to
tested the children’s ability to dance
designed a second experiment
divided the children into the two experimental groups
tested the children’s sense of balance
taught dance to the children through the dance program
Question 3
A dance show will include exactly five pieces, and each piece will be presented only once:
Requiem, Smooth Step, Trampoline, Unleashed, and Waltzy. Requiem will be presented
immediately before Smooth Step. Unleashed will be presented earlier than Trampoline, with at
least one piece in between. Waltzy will not be the final piece presented.
In the table, select a piece that cannot be the second piece presented and a piece that cannot be
the third piece presented. Make only two selections, one in each column.
Cannot be presented second Cannot be presented third
Requiem
Smooth Step
Trampoline
Unleashed
Waltzy
Question 4
Klaus: Closing our neighborhood’s school will mean a major hassle for neighborhood families.
Students will have to travel farther to and from school, which means they’ll have less time for
homework. That will likely affect them academically. Additionally, a vacant school building in
our neighborhood is sure to attract crime and drag down home values.
Rena: Our neighborhood’s school never had a great reputation anyway. Closing it means
students will get to go to a larger school—with a better reputation—just a few kilometers away,
where they’ll have more opportunities to participate in extracurricular activities. That should
make homes here more valuable to potential buyers. And the vacant school building could be
converted into a much-needed community center.
Select Klaus for the phrase that best describes a factor that is specifically addressed by Klaus but
not by Rena, and select Rena for the phrase that best describes a factor that is specifically
addressed by Rena but not by Klaus. Make only two selections, one in each column.
Klaus Rena
The vacant school building
The distance students must travel to school
Extracurricular activities
Real-estate values
Students’ academic performance
“Verbal” Data Sufficiency
Question 1
Does it benefit a company overall to invest in employee training?
(1) Companies that invest in comprehensive employee training programs often observe improved
job performance among their workforce.
(2) Well-trained employees are likely to excel in their roles and take the company forward.
Question 2
A research project has a successful outcome if its research is reported in a blind peer-reviewed
academic publication. Can interdisciplinary collaborations on research projects at least
sometimes produce a successful outcome?
(1) Some interdisciplinary research groups experience conflict and rivalry.
(2) A peer-reviewed and well-regarded interdisciplinary review of published papers on urban
greening trends found that there was little if any evidence that planting more trees in an urban
area significantly improved air quality there
Question 3
What is an example of how investment in healthcare resources might sometimes create a
challenge?
(1) Investment in both advanced medical technology and specialized treatments can improve
patient outcomes for certain conditions. Nevertheless, it may put a strain on healthcare budgets.
(2) When it comes to allocating resources in the healthcare sector, directing money toward the
purchase of advanced medical equipment could potentially limit access to other essential services
and treatments.
Question 4
Five coffee-chicory blends—J, K, L, M, and N—are all priced differently. The blends are such
that the greater the proportion of coffee, the more expensive the blend, and the greater the
proportion of chicory, the stronger the flavor of the blend. If the five blends were ranked in order
of cost per kilogram, which would fall in the middle?
(1) Blend L has a higher proportion of coffee than the other blends except for Blend M.
(2) Blend K and Blend N both have a stronger flavor than Blend J
Table Analysis
1. Introduction
Table Analysis presents data in a table and this table can be sorted by any column. However, you
cannot use any other Excel functions like filtering or writing formulas.
For a majority of Table Analysis problems, 3 statements related to the table are provided and test
takers must identify each statement as true or false.
2. General strategies
Step 1: Study the table first. Do not focus on the detailed figure, but make sure to understand the
meaning of each data column.
Step 2: To double check your understanding, pick a random line and explain the meaning of
each figure in that line.
Step 3: Read the question and understand the task
Step 4: Read each statement and define relevant data
Step 5: Choose appropriate sort option, then look at the data and answer
3. Practice
Table Analysis Sample Questions
Multi-Source Reasoning
1. Introduction
Multi-Source Reasoning present data in two or three tabs. Test takers can toggle between tabs,
but can only view 1 tab at a time
Data in each tabs can be in multiple format: texts, table, diagram, chart …
Test takers must answer 3 consecutive questions related to the tabs provided. Questions can
involve making direct inferences from given information, doing some calculation, or
weakening/strengthening arguments.
2. General strategies
Step 1: Skim through data tabs to understand the overall information in each tab. Take note of
the core information for future use.
Usually, the first tab presents overall scenario, and other tab(s) provide extra aspects of the
scenario. You should study the first tab carefully, and find the connection between the first tab
with other tab(s)
Step 2: For each question, use logic to determine the relevant data tab(s).
Step 3: Study those tabs carefully to pick up formula or logic needed to answer the question.
Keep in mind that some questions might involve only 1 data tab.
3. Practice
Question Group 1
KJ Reservoir
When the KJ Reservoir (KJR) was built, a single, well-known species of fish was introduced for
its role in the new ecosystem. KJR is a popular destination for a public fishing season that
extends from June through August each year. Each May 1, the area's wildlife management team
generates an estimate of the number of these fish in the reservoir. The team has determined that
the fish's population should be no less than 1,500 fish at the start of the fishing season—less than
1,500, and the team acquires and releases enough fish to make up the difference.
The fish of this species reproduce once a year in mid-September. Due to environmental factors,
of the species's total population at the end of September, 80% will survive until the following
May. It is estimated that KJR cannot support a healthy, reproductive fish population greater than
2,000.
Methodology
To estimate the total number of individuals of an animal species in a particular area, some
researchers use the following method:
The reseachers first capture a number (M) of individual animals of the species (the capture
group) without doing harm. Each is marked, released, and allowed to disperse back into its
general population. A number (N) of individual animals are then captured in a second group (the
recapture group), some of which will have been captured in both groups, provided both the
capture group and recapture group are sufficiently large. Researchers then use the equation
below to estimate P, which represents the total number of animals in the area:
number of fish common to both group M
=
N P
Fish Reproduction
To predict the annual increase in fish population in KJR due to reproduction, the wildlife
management team uses the following model developed from past population data from KJR. The
model expresses population increase in terms of:
+ the fish population at the start of September (P)
+ the maximum supported healthy fish population (C)
+ a constant chosen to fit the model to the data (r).
Question 1
On May 1 last year, KJR’s wildlife management team marked the 60 fish comprising the capture
group. The recapture group had 100 fish, including exactly 5 from the capture group.
Based on the information provided, which one of the following was the team’s estimate of KJR’s
fish population
(A) 1,000
(B) 1,200
(C) 1,400
(D) 1,500
(E) 2,000
Question 2
Assume that KJR’s fishing season will continue for the foreseeable future and that, on May 1
next year, the wildlife management team plans on 60 fish in the capture group and 60 fish in the
recapture group.
For each of the following scenarios, select Restock if the scenario would indicate that the team
will need to restock the reservoir. Otherwise, select Do not restock.
Restock Do not restock Exactly 2 fish were in both the capture and recapture groups.
Restock Do not restock Exactly 4 fish were in both the capture and recapture groups.
Restock Do not restock Exactly 6 fish were in both the capture and recapture groups.
Question 3
Assume that at the beginning of next year’s fishing season the fish population will be at the
minimum level for the start of the fishing season. For each of the following numbers of fish
removed from KJR during that fishing season, select Yes if the model for population increase
predicts that the fish population will increase by at least 40 fish due to reproduction. Otherwise,
select No.
Yes No 100 fish removed
Yes No 200 fish removed
Yes No 600 fish removed
Question Group 2
Designs
A certain electric utility is considering exactly two designs (Designs 1 and 2) for a new power
generation plant. Each design would result in a plant that would generate the same amount of
power annually; but Design 1 would use a single Type A power generator, whereas Design 2
would use three Type B power generators.
Type A generators cost less than Type B generators to fuel per unit of power generated and vent
through taller chimneys, which better prevent pollutants from concentrating near the plant.
However, Type A generators produce solid waste that is difficult and expensive to dispose of
safely.
Type B generators produce no solid waste and, as compared to Type A generators, vent
approximately half as much Pollutant X, and even less Pollutant Y, per unit of power generated
annually. However, Type B generators vent through much shorter chimneys.
Pollutants by Phase
Both Type A and Type B generators emit (vent) significantly more of Pollutant Y during the
startup phase—when a generator is brought to its normal power-generating phase from
nonoperation. During startup, unhealthy levels of Pollutant Y can become concentrated at ground
level near the power plant. The following table shows, for a single Type A or Type B generator,
Pollutant Y emissions, in kilograms per hour (kg/hr), and the resultant contributions from those
generators to the ground-level concentration of Pollutant Y, in micrograms per cubic meter of air
(mcg/m3), near the plant during the startup and normal generating phases.
Pollutant Y
Power generator type Operation phase Emissions Increases ground-level
(kg/hr) concentration by (mcg/m3)
A startup 744 61
normal 218 13
B startup 203 47
normal 6 3
Question 1
Suppose that the utility chooses Design 1 over Design 2. For each of the following motivations,
select Yes if the information in the tabs suggests that it would help explain this choice.
Otherwise, select No
Yes No A desire to use the type of generator that costs less to fuel per unit of
power generated
Yes No A desire to avoid having to dispose of solid waste
Yes No A desire to emit less Pollutant X per unit of power generated
annually
Question 2
For each of the following statements, select Yes if the statement accurately reflects the
information given in the tabs. Otherwise, select No
Yes No During the startup phase, the Design 1 power plant would emit more
Pollutant Y per hour than would the Design 2 power plant.
Yes No During the normal generating phase, the emissions per hour of
Pollutant Y from the Design 2 power plant would be less than 5% of
those from the Design 1 power plant.
Yes No During the normal generating phase, the contribution to the
ground-level concentration of Pollutant Y from the Design 2 power
plant would be less than 25% of that from the Design 1 power plant.
Question 3
Suppose that the startup phase for Type A generators takes exactly one hour. Given this and
based on the information in the tabs, which one of the following would be closest to the startup
time, in hours, for Type B generators that would result in equal emissions of Pollutant Y for both
power plant designs during a single startup phase?
(A) 3.67
(B) 0.27
(C) 0.81
(D) 1.22
(E) 36.33