The Neural Basis of Attentional Control in Visual Search: Martin Eimer
The Neural Basis of Attentional Control in Visual Search: Martin Eimer
Martin Eimer
Abstract
How do we localize and identify target objects among distractors in visual scenes? The role
of selective attention in visual search has been studied for decades, and the outlines of a
general processing model are now beginning to emerge. Attentional processes unfold in real
time, and this review describes four temporally and functionally dissociable stages of
attention in visual search (preparation, guidance, selection, and identification). Insights from
neuroscientific studies of visual attention suggest that our ability to find target objects in
visual search is based on processes that operate at each of these four stages in close
association with working memory and recurrent feedback mechanisms.
Keywords: visual search, selective attention, top-down control, visual cortex, working
memory, recurrent feedback
3
Introduction
In visual environments where multiple objects compete for attention, the challenge
is to find relevant information and to ignore objects and events that are unrelated to
current task goals (Figure 1). Many studies of visual spatial attention have investigated how
prior knowledge about the position of target objects in the visual field facilitates behavioural
performance and neural processing [1,2]. However, the fundamental problem for visual
search is the absence of precise advance information about target locations. In many
laboratory-based visual search tasks, target locations are determined randomly on each trial
and are therefore completely unpredictable. In real-world search, attention may benefit
from contextual spatial information (e.g., kitchen knives are typically found on kitchen
counters), but the exact locations of target objects are still unknown. To understand our
ability to find known target objects at uncertain locations, it is useful to consider how
attentional processes in visual search operate in real time. Based primarily on
neuroscientific studies of attention in the human and monkey brain, this review describes
four successive stages of attentional selectivity in visual search (Figure 2). Each of these
stages performs a specific function, and each is characterized by a particular neural
signature. Within this framework, ‘attention’ is not seen as a single functionally and
anatomically distinct control system, but as emerging from the coordinated operation of a
set of neurocognitive mechanisms in real time.
Preparation
Search starts by deciding which object or feature to look for, and representing this
search goal in memory. William James [3] believed that such preparatory “images in the
mind” are the single most important aspect of selective attention. Representations of
search goals in working memory (attentional templates [4]) can be activated before the
relevant visual scene is physically present, and are assumed to control subsequent stages of
the search process in a goal-directed fashion [5,6].
How are James’ “images in the mind” implemented at the neural level? The sensory
recruitment model of working memory (see Box 1) suggests that visual target objects for a
4
search task are represented in posterior visual-perceptual brain regions. There is indeed
evidence that visual cortical areas are activated in a goal-selective fashion during the
preparation phase of visual search. The activity of neurons in inferior temporal cortex (IT)
that selectively respond to a particular object is enhanced in a sustained fashion while
monkeys prepare to find this object in an upcoming search display (Figure 3a), and such
preparatory “baseline shifts” of neural activity level may reflect an activated attentional
template [7]. Similar target-selective preparatory activation patterns have been found in
human event-related brain potential (ERP) and fMRI experiments [8-14]. Some fMRI studies
have demonstrated increased activity in colour- or motion-selective visual areas when
observers prepare for targets defined in these dimensions [9-11]. Others have found more
globally distributed goal-sensitive activation patterns in visual cortex during the preparation
of search for specific target shapes [12], or for target object categories in real-world visual
scenes (e.g., people, cars, houses, faces [13,14]). The exact locus of preparatory activity
patterns within the visual processing hierarchy may depend on the nature of the current
search goal [12], with lower-level visual areas responsible for representing simple target
features [15], and higher-level regions involved in the preparation for more abstractly
defined targets [13].
While the existence of goal-selective activity modulations in visual cortex during the
preparation phase is well documented, it remains unclear whether these modulations
causally affect subsequent stages of visual search. The existence of correlations between
the target-selectivity of preparatory pre-stimulus activation patterns in visual cortex and the
quality of subsequent target detection performance [10,12-14,16] suggests that preparatory
modulations of visual activity might indeed act as attentional templates that are causally
involved in the control of visual search. However, this conclusion is by no means universally
accepted [17-19]. When the location of targets is uncertain, target-selective visual activation
patterns elicited during the preparation for visual search should represent search goals in a
position-independent fashion (Box 1). Although spatially global working memory
representations do exist in visual cortex [15], the question whether preparatory goal-
selective patterns of visual activity are generally position-invariant, and whether this is a
necessary requirement for their role as attentional templates still needs to be systematically
addressed.
5
In line with the involvement of prefrontal cortex in visual working memory (Box 1),
preparatory activation patterns that are sensitive to current task goals have also been
observed in prefrontal areas [13,20,21]. These effects may be linked to top-down aspects of
attentional preparation, with prefrontal cortex controlling target-selective preparatory
modulations of visual activity, but could also reflect explicit representations of search goals.
Because many object representations in prefrontal cortex are position-invariant, they could
act as location-independent preparatory attentional templates that control subsequent
stages of visual search when no precise spatial information about target locations is
available.
Guidance
While preparation takes place prior to the arrival of visual input, guidance and
selection operate once a search display has been encountered. Models of visual search [22-
24] assume that information about the presence of task-relevant features is accumulated in
parallel (guidance) and is then used to control the allocation of spatial attention to possible
target objects (selection). When target locations are uncertain, guidance processes may
operate globally across the entire visual field.
A plausible neural basis of spatially global attentional guidance in visual search has
been identified in studies of feature-based attention. When monkeys search for target
events defined by a particular feature (such as orientation or motion direction), neurons in
visual areas V4 or middle temporal (MT) cortex that are selective for this task-relevant
feature increase their activity, while neurons with opposite feature preferences are
inhibited [25,26]. Critically, these activity modulations are also elicited in response to stimuli
at task-irrelevant unattended locations (Figure 3b), suggesting that feature-based attention
is a spatially global phenomenon. During search for colour- or shape-defined target objects,
V4 neurons that prefer target-defining features increase their activity when a target object
is present in their receptive field, even when monkeys fixate elsewhere, fail to detect the
target, and shift gaze to another object [27]. Such observations underline the fact that
feature-based attention operates in parallel across the visual field, independent of the
current focus of spatial attention. Analogous spatially global feature-based attentional
6
modulations of visual activity have been found in human electrophysiological and fMRI
experiments [28-30]. When observers attend to target features in one visual hemifield,
objects in the other irrelevant hemifield elicit enhanced visual responses when their
features match the features that are currently task-relevant.
Because feature-based attentional modulations of visual processing operate in a
spatially global fashion, they can provide guidance signals for the subsequent allocation of
spatial attention to candidate target objects. Models of visual attention [18,24] postulate
that during the parallel analysis of visual input, processing is selectively weighted in favor of
target-defining features. The neural mechanisms of feature-based attention can implement
such task-dependent attentional biases in a spatially global fashion across the visual field.
Feature-based attentional guidance is not necessarily limited to simple visual attributes such
as orientation, color, or movement direction, but can also operate in a spatially global
fashion when search goals are more abstractly defined, such as during search for target
objects from a particular target category in real-world visual scenes [31]. If guidance
depends on the mechanisms of feature-based attention, the fact that visual features differ
considerably in their ability to guide spatial attention [32] could be related to systematic
differences in the ability of these features to produce task-dependent spatially global
modulations of visual activity during the guidance phase of visual search.
Feature-based attentional guidance mechanisms are likely to be closely linked to the
processes that operate during the preceding preparation stage. If preparatory goal-selective
baseline shifts of visual activity operate in a position-independent fashion, they could be
directly responsible for the emergence of spatially global feature-based attention effects
during the parallel processing of visual input. For example, spatially global working memory
representations of target features [15] may remain active after search display onset,
resulting in feature-selective modulations of sensory responses across the visual field. The
observation that feature-based attention effects can spread to currently empty regions of
visual space [30] suggests that preparation and guidance might interact in this way. In
addition, top-down signals from position-invariant representations of search targets in
prefrontal cortex to visual areas may also play a role in the control of feature-based
attention [33]. Despite such close functional links between preparation and guidance, these
two stages are not only temporally distinct (i.e., one operates before and the other after the
onset of visual stimulation), but are also functionally dissociable. For example, preparation is
7
Selection
observation that electrical stimulation of the FEF triggers activity in spatially corresponding
regions of visual area V4 [44] suggests that these two areas are indeed causally linked. An
important and controversial question is whether recurrent signals from priority maps to
visual areas always address one particular location at a time or whether multiple locations
can be addressed simultaneously. Does object selection in visual search operate serially or
in a parallel fashion (see Box 3)?
The spatial selection of candidate target objects is reflected by enhanced neural
responses in ventral visual cortex that start around 150-200 ms post-stimulus [7,45]. In ERP
studies of visual search, target selection is marked by the emergence of the N2pc
component at around 180 ms after search display onset. The N2pc is an enhanced negativity
at posterior electrodes contralateral to candidate target objects in visual search displays
[46,47]. This component is generated during the spatially selective enhancement of target
processing in ventral visual cortex [39], which is controlled by recurrent signals from higher-
level attentional control areas such as FEF [48]. Because it tracks the operation of selective
attention on a millisecond-by-millisecond basis, the N2pc can provide unique insights into
the time course of attentional object selection in visual search (Box 3 and Figure 4).
Identification
and several target objects are presented simultaneously or in rapid succession [53-55] also
point towards capacity limitations at an identification stage that follows object selection.
Selection and identification are associated with distinct ERP markers in visual search
(Figure 5). The N2pc component reflects target selection, and a later sustained posterior
contralateral negativity is elicited during the subsequent object identification stage [56,57].
This sustained negativity is equivalent to the CDA component observed during the delay
period of working memory tasks [58], and its presence during the identification of target
objects in visual search demonstrates the involvement visual working memory [18,59].
Working memory is required for object identification because spatially specific
enhancements of target processing that emerge during the preceding selection stage
remain transient unless they are sustained by recurrent input from higher-order control
areas to visual cortex [18]. Such sustained feedback loops could represent the neural basis
of working memory maintenance [60]. They may be critical for integrating features into
object files [61], and for matching perceptual representations of selected objects with
stored representations of search targets during the identification stage. Visual areas such as
superior intraparietal sulcus and the lateral occipital complex that are sensitive to the
number of memorized objects and their complexity [62,63] are likely to be involved in the
maintenance of possible target objects in visual search.
Concluding comments
overlap in time. For example, the identification of selected objects may be a relatively slow
process that could operate in parallel with the attentional selection of other candidate
target objects [24]. Identification involves the comparison of visual object representations
with representations of current search goals (see Box 1), which implies that attentional
templates in working memory that are set up during the preparation stage remain active
throughout the search process. Furthermore, when targets are not detected on the basis of
a single guidance/selection/identification cycle, search will become iterative, with each new
iteration of this cycle initiated by a mismatch between selected object representations and
current search goals at the identification stage. Search is likely to be based on complex
interactions between serial and parallel mechanisms, and the simple serial architecture
illustrated in Figure 2 is not intended to be a fully realistic representation of these
mechanisms. However, because visual search unfolds in real time, and because each of the
stages described here depends on the output of operations that take place at preceding
stages, the attentional control processes that contribute to successful search performance
retain important serial characteristics, which were highlighted in this review.
Another important aspect of visual search that was not discussed here concerns the
role of eye movements. Spatial attention and saccade programming are known to be closely
linked [41,64]. Selection does not only control the access of task-relevant objects to working
memory, but also provides spatial coordinates for upcoming eye movements. Research on
saccades and microsaccades [65-67] has provided important insights into the control of
selective attention during search in real-world visual scenes. For example, semantic and
spatial expectations linked to particular scene contexts strongly constrain which parts of a
scene will be visually examined [65,68], which demonstrates that high-level world
knowledge plays an important role for attentional guidance and selection. The implicit
acquisition of contextual information can guide spatial attention even in simple search
displays [69]. Such observations show that visual search does not always operate in the
complete absence of prior information about likely target locations, and raise important
questions for the control of attentional guidance and selection processes. Instead of
operating in a spatially unconstrained global fashion across the entire visual field, these
processes may sometimes be confined to spatially restricted attentional windows [34] that
are linked to context-dependent expectations about target locations. Furthermore, the
concept of preparatory attentional templates may have to be extended to include not only
11
representations of target features or objects, but also of scene contexts and likely target
positions.
Visual search performance varies greatly across task contexts [70]. It is unlikely that
such differences can all be attributed to one particular stage (such as serial selection [71]).
In this review, search is described as a process that unfolds in real time, and involves
successive attentional mechanisms at multiple stages of processing. The efficiency of visual
search for known targets at uncertain locations is determined by the complex interplay of all
of these mechanisms.
12
features also exist in visual cortex [15]. If attentional templates are based on position-
invariant or spatially global representations of search targets, the observation that these
templates are strongly capacity-limited suggests that only one position-invariant
representation can be active during the preparation stage of visual search.
Visual processing does not operate in a strictly hierarchical bottom-up fashion, with
lower-level visual areas representing simple features at specific locations projecting to
neurons at higher levels that progressively code more complex properties of visual objects
in a position-invariant fashion. Such feedforward connections are accompanied by parallel
feedback projections from higher to lower levels of the visual processing hierarchy [35]. This
recurrent architecture allows top-down control signals to modulate visual processing in a
flexible task-dependent fashion [84]. According to the Reverse Hierarchy Theory [85], visual
input is transmitted in parallel to a high level of processing where complex visual properties
and object categories are represented. Information about the presence of potentially task-
relevant objects is then fed back to lower visual areas where a detailed analysis of these
objects takes place.
Recurrent feedback plays a central role in the object selection and identification
stages of visual search [86,87]. During identification, recurrent feedback loops between
higher-level control regions and visual cortex are responsible for the maintenance of
spatially selective representations of possible target objects in visual working memory
[59,60]. During the preceding selection stage, the initial transient activation of these
representations is triggered by recurrent signals from control regions such as the frontal eye
fields [43,48] where the locations of task-relevant features are represented in priority maps
[41]. The importance of recurrent feedback connections is illustrated by the phenomenon of
object substitution masking [88]. Target detection is strongly impaired when targets are
immediately replaced by another visual stimulus, and therefore can no longer be addressed
by recurrent feedback signals. The existence of fast recurrent pathways from category-
selective areas to visual cortex [85] can also explain why category-guided object selection is
often remarkably rapid [89].
14
The transition from spatially global guidance to spatially focal selection could
coincide with the transition from parallel to serial attentional processing in visual search. In
line with this assumption, several models of visual search claim that candidate target
objects are selected in a strictly serial fashion. According to Feature Integration Theory
[22,71], spatial attention is allocated serially to one object at a time, such that the
attentional selection of a new object is always preceded by a de-allocation of attention from
its previous location. In Guided Search [24], guidance and object identification are modelled
as parallel processes, but object selection is described as a serial attentional bottleneck. In
contrast to such serial selection accounts, other models of visual attention assume that
object selection can operate in parallel at multiple locations in the visual field [5,18]. Along
similar lines, the ability to track multiple moving objects has been explained by assuming
that spatial attention can be allocated independently and in parallel to different objects in
the visual field [90], which has been supported by ERP studies of multiple object tracking
[91,92].
Although serial and parallel selection scenarios are often regarded as mutually
exclusive accounts of visual search, it is possible that these two types of attentional
selection are employed in different task contexts. This is illustrated by two ERP studies
[93,94] that both used the N2pc component to distinguish serial and parallel selection
mechanisms in visual search (Figure 4). Woodman and Luck [93] obtained evidence that
15
targets are selected in a serial fashion when observers search for difficult-to-discriminate
target objects in crowded visual search displays. In contrast, Eimer and Grubert [94] found
that multiple targets are selected in a parallel and independent fashion when the target
identification task (letter/digit discrimination) is highly practiced and search displays contain
few competing distractor items. The fact that these two N2pc studies draw opposite
conclusions about the serial versus parallel nature of visual search suggests a more
ecumenical view of attentional object selection. Parallel and serial selection could both be
available options in visual search, with the choice between these selection strategies
determined by the nature of a particular search task. Serial selection may be the preferred
option when a task imposes high demands on object selection and identification
mechanisms, while a parallel strategy is chosen under conditions where selection and
identification requirements are less challenging.
16
Outstanding Questions
Glossary
Baseline shift: Sustained increase in the baseline activity of neurons that are selective for a
task-relevant feature or object during the preparation phase of visual search.
Spatial attention: Allocation of selective attention to specific locations within the visual
field. During the selection phase of visual search, spatial attention is reflected by spatially
specific processing enhancements for candidate target objects at particular locations.
Acknowledgements
This work was supported by the Economic and Social Research Council (ESRC), UK. Thanks to Anna
Grubert and John Towler for comments, and to Joanna Parketny for creating Figure 1.
18
References
1. Posner, M. I. et al. (1980). Attention and the detection of signals. J Exp Psychol Gen,
109, 160-174.
2. Moran, J. and Desimone, R. (1985). Selective attention gates visual processing in the
extrastriate cortex. Science, 229, 782-784.
3. James, W. (1890). The Principles of Psychology. Harvard University Press.
4. Duncan, J. and Humphreys, G. W. (1989). Visual search and stimulus similarity.
Psychol Rev, 96, 433.
5. Desimone, R. and Duncan, J. (1995). Neural mechanisms of selective visual attention.
Annu Rev Neurosci, 18, 193-222.
6. Olivers, C. N. L. et al. (2011). Different states in visual working memory: when it
guides attention and when it does not. Trends Cogn Sci, 15, 327-334.
7. Chelazzi, L. et al. (1998). Responses of neurons in inferior temporal cortex during
memory-guided visual search. J Neurophysiol, 80, 2918–2940.
8. Carlisle, N. B. et al. (2011). Attentional templates in visual working memory J
Neurosci, 31, 9315-9322.
9. Chawla, D. et al. (1999).The physiological basis of attentional modulation in
extrastriate visual areas. Nat Neurosci, 2,671–676.
10. Giesbrecht, B. et al. (2006). Pre-target activity in visual cortex predicts behavioral
performance on spatial and feature attention tasks. Brain Res, 1080, 63–72.
11. Shibata. K. et al. (2008) The effects of feature attention on prestimulus cortical
activity in the human visual system. Cereb Cortex, 18, 1664–1675.
12. Stokes, M. et al. (2009). Shape-specific preparatory activity mediates attention to
targets in human visual cortex. Proc Natl Acad Sci U S A, 106, 19569–19574.
13. Peelen, M.V. and Kastner, S. (2011). A neural basis for real-world visual search in
human occipitotemporal cortex. Proc Natl Acad Sci U S A, 108, 12125–12130.
14. Soon, C. S. et al. (2013). Preparatory patterns of neural activity predict visual
category search speed. NeuroImage, 66, 215-222.
15. Ester, E. F. et al. (2009). Spatially global representations in human primary visual
cortex during working memory maintenance. J Neurosci, 29, 15258-15265.
16. Puri, A.M. et al. (2009). Category expectation modulates baseline and stimulus-
evoked activity in human inferotemporal cortex. Brain Res, 1301, 89–99.
17. Fannon, S. P. et al. (2007). Baseline shifts do not predict attentional modulation of
target processing during feature-based visual attention. Front Hum Neurosci, 2, 7.
18. Bundesen, C. et al. (2005). A neural theory of visual attention: bridging cognition and
neurophysiology. Psychol Rev, 112, 291.
19. McMains, S. A. et al. (2007). Mechanisms of feature-and space-based attention:
response modulation and baseline increases. J Neurophysiol, 98, 2110.
20. Warden, M. R. and Miller, E. K. (2010). Task-dependent changes in short-term
memory in the prefrontal cortex. J Neurosci, 30, 15801-15810.
21. Stokes, M. G. et al. (2013). Dynamic coding for cognitive control in prefrontal cortex.
Neuron, 78, 364-375.
22. Treisman, A. and Sato, S. (1990). Conjunction search revisited. J Exp Psychol Hum
Percept Perfor, 16, 459.
23. Wolfe, J.M. (1994). Guided Search 2.0: A revised model of visual search. Psychon B
Rev, 1, 202–238.
19
24. Wolfe, J.M. (2007). Guided Search 4.0: Current Progress with a model of visual
search. In W. Gray (Ed.), Integrated Models of Cognitive Systems (pp. 99-119). New
York: Oxford.
25. Martinez-Trujillo, J. C. and Treue, S. (2004). Feature-based attention increases the
selectivity of population responses in primate visual cortex. Curr Biol, 14, 744–751.
26. Cohen, M. R. and Maunsell, J. H. (2011). Using neuronal populations to study the
mechanisms underlying spatial and feature attention. Neuron, 70, 1192-1204.
27. Bichot, N. P. (2005). Parallel and serial neural mechanisms for visual search in
macaque area V4. Science, 308, 529–534.
28. Zhang, W. and Luck, S. J. (2009). Feature-based attention modulates feedforward
visual processing. Nat Neurosci, 12, 24-25.
29. Saenz, M. et al. (2002). Global effects of feature-based attention in human visual
cortex. Nat Neurosci, 5, 631–632.
30. Serences, J. T. and Boynton, G. M. (2007). Feature-based attentional modulations in
the absence of direct visual stimulation. Neuron, 55, 301–312.
31. Peelen, M. V. et al. (2009). Neural mechanisms of rapid natural scene categorization
in human visual cortex. Nature, 460, 94-97.
32. Wolfe, J. M. and Horowitz, T. S. (2004). What attributes guide the deployment of
visual attention and how do they do it? Nat Rev Neurosci, 5, 495-501.
33. Maunsell, J. H. and Treue, S. (2006). Feature-based attention in visual cortex. Trends
Neurosci, 29, 317-322.
34. Theeuwes, J. (2010). Top–down and bottom–up control of visual selection. Acta
Psychol, 135, 77-99.
35. Kravitz, D. J. et al. (2013). The ventral visual pathway: an expanded neural
framework for the processing of object quality. Trends Cogn Sci, 17, 26-49.
36. Franconeri, S. L. et al. (2013). Flexible cognitive resources: competitive content maps
for attention and memory. Trends Cogn Sci, 17, 134-141.
37. Anllo-Vento, L. and Hillyard, S. A. (1996). Selective attention to the color and
direction of moving stimuli: electrophysiological correlates of hierarchical feature
selection. Perc Psychophys, 58, 191-206.
38. Liu, T. et al. (2007). Comparing the time course and efficacy of spatial and feature-
based attention. Vision Res, 47, 108-113.
39. Hopf, J. M. et al. (2004). Attention to features precedes attention to locations in
visual search: Evidence from electromagnetic brain responses in humans. J Neurosci,
24, 1822–1832.
40. Itti, L. and Koch, C. (2001). Computational modelling of visual attention. Nature Rev
Neurosci, 2, 194-203.
41. Fecteau, J. H. and Munoz, D. P. (2006). Salience, relevance, and firing: a priority map
for target selection. Trends Cogn Sci, 10, 382-390.
42. Gottlieb, J. P. et al. (1998). The representation of visual salience in monkey parietal
cortex. Nature, 391, 481-484.
43. Schall, J. D. (2004). On the role of frontal eye field in guiding attention and saccades.
Vision Res, 44, 1453-1467.
44. Moore, T. and Armstrong, K. M. (2003). Selective gating of visual signals by
microstimulation of frontal cortex. Nature, 421, 370-373.
45. Chelazzi, L. et al. (2001). Responses of neurons in macaque area V4 during memory-
guided visual search. Cereb Cortex, 11, 761-772.
20
46. Luck, S. J. and Hillyard, S. A. (1994). Spatial filtering during visual search: Evidence
from human electrophysiology. J Exp Psychol Hum Percept Perform, 20, 1000–1014.
47. Eimer, M. (1996). The N2pc component as an indicator of attentional selectivity.
Electroen Clin Neuro, 99, 225–234.
48. Cohen, J. Y. et al. (2009). On the origin of event-related potentials indexing covert
attentional selection during visual search. J Neurophysiol, 102, 2375.
49. Horowitz, T. S. et al. (2007). Tracking unique objects. Percept Psychophys, 69, 172-
184.
50. Ghorashi, S. et al. (2010). Spatial selection and target identification are separable
processes in visual search. J Vis, 10, 7.
51. Xu, Y. and Chun, M. M. (2009). Selecting and perceiving multiple visual objects.
Trends Cogn Sci, 13, 167-174.
52. Huang, L. and Pashler, H. (2007). A Boolean map theory of visual attention. Psychol
Rev, 114, 599.
53. Duncan, J. (1980). The locus of interference in the perception of simultaneous
stimuli. Psychol Rev, 87, 272.
54. Raymond, J. E. et al. (1992). Temporary suppression of visual processing in an RSVP
task: An attentional blink? J Exp Psychol Hum Percept Perform, 18, 849.
55. Duncan, J. et al. (1994). Direct measurement of attentional dwell time in human
vision. Nature, 369, 313-315.
56. Mazza, V. et al. (2007). Attentional selection and identification of visual objects are
reflected by distinct electrophysiological responses. Exp Brain Res, 181, 531–536.
57. Jolicœur, P. et al. (2008). Dissociation of the N2pc and sustained posterior
contralateral negativity in a choice response task. Brain Res, 1215, 160-172.
58. Vogel, E. K. and Machizawa, M. G. (2004). Neural activity predicts individual
differences in visual working memory capacity. Nature, 428, 748-751.
59. Chun, M. M. and Johnson, M. K. (2011). Memory: Enduring traces of perceptual and
reflective attention. Neuron, 72, 520-535.
60. Luck, S. J. and Vogel, E. K. (2013). Visual working memory capacity: from
psychophysics and neurobiology to individual differences. Trends Cogn Sci, 17, 391-
400.
61. Kahneman, D. et al. (1992). The reviewing of object files: Object-specific integration
of information. Cogn Psychol, 24, 175-219.
62. Todd, J. J. and Marois, R. (2004). Capacity limit of visual short-term memory in
human posterior parietal cortex. Nature, 428, 751-754.
63. Xu, Y. and Chun, M. M. (2006). Dissociable neural mechanisms supporting visual
short-term memory for objects. Nature, 440, 91-95.
64. Krauzlis, R. J. et al. (2013). Superior colliculus and visual spatial attention. Ann Rev
Neurosci, 36
65. Henderson, J. M. (2003). Human gaze control during real-world scene perception.
Trends Cogn Sci, 7, 498-504.
66. Rayner, K. (2009). Eye movements and attention in reading, scene perception, and
visual search. Q J Exp Psychol, 62, 1457-1506.
67. Martinez-Conde, S. et al. (2013). The impact of microsaccades on vision: towards a
unified theory of saccadic function. Nat Rev Neurosci, 14, 83-96.
21
68. Hollingworth, A. (2009). Two forms of scene memory guide visual search: Memory
for scene context and memory for the binding of target object to scene location. Vis
Cogn, 17, 273-291.
69. Chun, M. M. (2000). Contextual cueing of visual attention. Trends Cogn Sci, 4, 170-
178.
70. Wolfe, J. M. (1998). What can 1 million trials tell us about visual search? Psychol Sci,
9, 33-39.
71. Treisman, A. and Gelade, G. (1980). A feature-integration theory of attention.
Cognitive Psychol, 12, 97–136.
72. Baddeley, A. (2012). Working memory: theories, models, and controversies. Ann Rev
Psychol, 63, 1-29.
73. Fuster, J. M. and Alexander, G. E. (1971). Neuron activity related to short-term
memory. Science, 173, 652-654.
74. Curtis, C. E. and D'Esposito, M. (2003). Persistent activity in the prefrontal cortex
during working memory. Trends Cogn Sci, 7, 415-423.
75. Harrison, S. A. and Tong, F. (2009). Decoding reveals the contents of visual working
memory in early visual areas. Nature, 458, 632-635.
76. Ikkai, A. et al. (2010). Contralateral delay activity provides a neural measure of the
number of representations in visual working memory. J Neurophysiol, 103, 1963.
77. Postle, B. R. (2006). Working memory as an emergent property of the mind and
brain. Neuroscience, 139, 23-38.
78. D'Esposito, M. (2007). From cognitive to neural models of working memory. Philos T
Roy Soc B, 362, 761-772.
79. Cowan, N. (2001). The magical number four in short-term memory: A
reconsideration of mental storage capacity. Behav Brain Sci, 24, 87-114.
80. Houtkamp, R. and Roelfsema, P. R. (2009). Matching of visual input to only one item
at any one time. Psychol Res, 73, 317–326.
81. Lewis-Peacock, J. A. et al. (2012). Neural evidence for a distinction between short-
term memory and the focus of attention. J Cogn Neurosci, 24, 61-79.
82. Chun, M. M. et al. (2011). A taxonomy of external and internal attention. Ann Rev
Psychol, 62, 73-101.
83. Anderson, D. E. et al. (2013). A common discrete resource for visual working memory
and visual search. Psychol Sci, 24, 929-938.
84. Gilbert, C. D. and Li, W. (2013). Top-down influences on visual processing. Nat Rev
Neurosci, 14, 350-363.
85. Hochstein, S. and Ahissar, M. (2002). View from the top: Hierarchies and reverse
hierarchies in the visual system. Neuron, 36, 791-804.
86. Di Lollo, V. et al. (2001). The preattentive emperor has no clothes: a dynamic
redressing. J Exp Psychol Gen, 130, 479.
87. Hamker, F. H. (2005). The reentry hypothesis: the putative interaction of the frontal
eye field, ventrolateral prefrontal cortex, and areas V4, IT for attention and eye
movement. Cereb Cortex, 15, 431-447.
88. Enns, J. T. and Di Lollo, V. (2000). What’s new in visual masking? Trends Cogn Sci, 4,
345-352.
89. Nako, R. et al. (2014). Rapid guidance of visual search by object categories. J Exp
Psychol Hum Percept Perform, 40, 50-60.
22
90. Cavanagh, P. and Alvarez, G. A. (2005). Tracking multiple targets with multifocal
attention. Trends Cogn Sci, 9, 349-354.
91. Drew, T. and Vogel, E. K. (2008). Neural measures of individual differences in
selecting and tracking multiple moving objects. J Neurosci, 28, 4183-4191.
92. Drew, T. et al. (2009). Attentional enhancement during multiple-object tracking.
Psychon B Rev, 16, 411-417.
93. Woodman, G. F. and Luck, S. J. (2003). Serial deployment of attention during visual
search. J Exp Psychol Hum Percept Perform, 29, 121-138.
94. Eimer, M. and Grubert, A. (2014). Spatial attention can be allocated rapidly and in
parallel to new visual objects. Curr Biol, 24, 193-198.
95. Pashler, H. (1994). Dual-task interference in simple tasks: Data and theory. Psychol
Bull, 16, 220-244.
23
Figure Legends
Figure 3. Neural correlates of preparation (a) and guidance (b) during visual search. (a):
Stimulus setup used by Chelazzi et al. [7] during a memory-guided visual search task (top
panel), and neural responses recorded in monkey IT cortex during the preparation phase
(bottom panel). Monkeys remembered a target object presented at the start of each trial
during a delay period, and moved their eyes to the location of this target in the subsequent
search display. When the target object was an effective stimulus for a particular IT neuron,
an initial transient sensory response was followed by a sustained increase (baseline shift) of
neuronal activity throughout the preparation period. Reproduced with permission from [7].
(b): Top panel: Stimulus setup used by Martinez-Trujillo and Treue [25]. Monkeys attended
to the direction of moving dots in one visual hemifield and ignored another set of dots in
the opposite hemifield that moved in the same direction (“attend to direction”). In a
baseline condition, they attended to the central fixation spot and ignored both dot arrays
(“attend to fix spot”). Bottom panel: Response of a MT neuron with a receptive field on the
unattended dot array. Attention to a specific movement direction in the opposite visual field
increased the neural response to the unattended movement when it matched the preferred
24
movement of this neuron, while attention to the opposite (“null”) direction produced
inhibition. Reproduced from [25], as printed in [33].
Figure 4. Electrophysiological evidence for serial and parallel object selection in visual
search. (a): Top panel: Woodman and Luck [93] instructed participants to search for targets
defined by a particular colour (e.g., red) and shape (gap at the top). Each search array
contained two target-colour objects. One of these possible targets appeared on the vertical
meridian and the other on the horizontal meridian, either near or far from fixation. Because
the N2pc is a contralateral ERP component, it is not triggered by possible targets on the
vertical midline, and therefore provides a pure measure of the attentional selection of the
other (horizontal) target. Bottom panel: The N2pc measured at lateral posterior electrodes
to near possible targets on the horizontal midline emerged 200 ms after stimulus onset, and
preceded the N2pc to far possible targets by 150 ms. There was no temporal overlap
between these two N2pc components, indicating that spatial attention was allocated serially
first the near and then to the far target-colour object. Reproduced with permission from
[93]. (b): Top panel: In the Eimer and Grubert study [94], two search displays that contained
a colour-defined (red) target and a distractor on opposite sides were presented in rapid
succession with a stimulus onset asynchrony (SOA) of 100 ms or 10 ms. One display
contained a horizontal target, and the other a target on the vertical midline. Bottom panel:
ERPs at lateral posterior electrodes contralateral and ipsilateral to the horizontal target, and
N2pc difference waveforms obtained by subtracting ipsilateral from contralateral ERPs.
When the SOA between the two targets was 100 ms, the N2pc to a horizontal target in the
second array (H2) emerged 100 ms after the N2pc to a horizontal target in the first array
(H1). When the SOA was reduced to 10 ms, these two N2pc components were triggered
within 10 ms of each other, and overlapped in time. These results show that two target
objects can be selected in parallel, with each selection process following its own
independent time course. Reproduced with permission from [94].
singleton diamond among uniformly coloured distractors were presented for 150 ms (top
panel). Participants either had to report the location of this singleton (localisation task) or its
detailed shape (cut on the left or right side; discrimination task). ERP waveforms measured
at posterior electrodes contralateral and ipsilateral to the target and corresponding
contralateral-ipsilateral N2pc difference waveforms (bottom panels) demonstrate that N2pc
components (reflecting target selection) were identical in both tasks. In contrast, the
subsequent sustained posterior contralateral negativity (SPCN) that is generated during the
identification stage when selected objects are maintained in working memory [58] was
reliably triggered only in the shape discrimination task, demonstrating that selection and
identification are separable stages of visual search. Data from [56], reproduced in a different
format.