Native Speakers and Task Performance Com
Native Speakers and Task Performance Com
Parvaneh Tavakoli
London Metropolitan University
This article argues that a native-speaker baseline is a neglected dimension of studies into
second language (L2) performance. If we investigate how learners perform language
tasks, we should distinguish what performance features are due to their processing an L2
and which are due to their performing a particular task. Having defined what we mean
by “native speaker,” we present the background to a research study into task features
on nonnative task performance, designed to include native-speaker data as a baseline
for interpreting nonnative-speaker performance. The nonnative results, published in this
journal (Tavakoli & Foster, 2008) are recapitulated and then the native-speaker results
are presented and discussed in the light of them. The study is guided by the assumption
that limited attentional resources impact on L2 performance and explores how narrative
design features—namely complexity of storyline and tightness of narrative structure—
affect complexity, fluency, accuracy, and lexical diversity in language. The results show
that both native and nonnative speakers are prompted by storyline complexity to use
more subordinated language, but narrative structure had different effects on native and
nonnative fluency. The learners, who were based in either London or Tehran, did not
differ in their performance when compared to each other, except in lexical diversity,
where the learners in London were close to native-speaker levels. The implications of
the results for the applicability of Levelt’s model of speaking to an L2 are discussed, as
is the potential for further L2 research using native speakers as a baseline.
The study was funded by the Economic and Social Research Council of the United Kingdom, award
reference RES-000-22-1155, and was the subject of a presentation by Pauline Foster at PacSLRF
2006 in Brisbane.
Correspondence concerning this article should be addressed to Pauline Foster, St. Mary’s
University College, Waldegrave Road, Twickenham, Middlesex, TW1 4SX, UK. Internet:
[email protected]
familiarity with language tests (which a native speaker may lack completely).
Added to this is an unease with using the “native speaker” as the gold standard
for language proficiency tests, especially for a language like English, which has
a multitude of native varieties. Davies (2003) showed how the definition of the
term native speaker involves a complex of sociolinguistic and psycholinguistic
attributes—some inherited, others acquired—and Rampton (1990) advocated
the use of the term language expert over native speaker, as that would allow
highly proficient L2 users to be counted as equal to first language (L1) users.1
For L2 research that uses L2 performance to illuminate developmental or
instructional factors, these concerns are perhaps less of an issue. This research
is usually undertaken not to take a measure of proficiency but to see how dimen-
sions of L2 performance are affected by variables such as type of input, type
of interactional pattern, type of task, the presence or absence of planning time,
or the presence or absence of performance time pressure. Overwhelmingly,
such studies involve participants who are not “expert users” of the L2 but are
learners of low to high intermediate level, who are very obviously distinguish-
able from native speakers. What variety of English they are learning is also
not a matter for concern, as long as they are all learning the same one (which,
normally, they are). For the native-speaker element, it is sufficient that these
participants be first language users of whatever variety of the target language
the L2 participants are learning. One can exclude expert L2 learners from this
category on the grounds that we cannot be sure they are indistinguishable from
L1 users, so it is best to avoid the possible confound. One could add that as it
would be much harder to locate a group of very highly proficient L2 users than
a group of L1 users, it is best to proceed with the latter.
As we argued earlier, using a native-speaker baseline in research into L2
performance gives greater validity to any claims that this is affected by the
independent variable(s) under scrutiny. If L1 performance is shown to be af-
fected in the same way, then something other than L2 processing is the cause.
However, to date, very little data of this type has been collected. Foster (2001)
illustrated that it is worth doing. Using native-speaker participants, the study
found that, broadly, planning time before a task conveyed similar performance
benefits on native speakers, as was shown in a parallel study on nonnative
speakers (Foster & Skehan, 1996). This indicates that the fluency and complex-
ity of language generated in task performance depends on more than ease of
language processing and is affected by the design of the task itself. The field
is wide open for replications of, for example, the pretask and online planning
literature (see Ellis, 2005), which would add native-speaker baseline data and
set up comparisons of these with data from advanced and intermediate learners.
(Schneider & Shiffrin, 1977; Shiffrin & Schneider, 1977), they gradually de-
mand less. Performance speeds up, accuracy increases, and other things that
need attention, like carrying on a conversation, can be undertaken at the same
time. Skehan (1998) applied this model to SLA; an imperfectly learned L2
imposes a large burden on the learner’s attention because it involves, among
other things, meaning and form, fluency and accuracy. A learner is someone
who has not yet developed fully automatized control over L2 morphosyntax or
articulation and so makes choices as to where to allocate attention, prioritizing
one aspect of performance, such as being syntactically accurate, over another,
such as being phonologically accurate.3
The concept of competing claims on limited attention lies behind much
research into task-based L2 performance (Ellis, 1987; VanPatten, 1990). The
thrust of the research has been to determine how language learners are likely to
allocate attention and what task features can be said to influence this. Planning
studies (Crookes, 1989; Foster & Skehan, 1996; Ellis & Yuan, 2005; Ortega,
1999) show a consistent effect for planning time, in terms of increased com-
plexity and fluency in performance. Some show increases in accuracy (Foster
& Skehan; Mehnert, 1998; Wigglesworth, 2001) although Foster and Skehan
suggested that the results indicate a trade-off between accuracy and complex-
ity, as competing demands on limited resources mean that some things get
higher priority. Task design has also figured in this line of research. Tasks that
are deemed to be cognitively less demanding allow greater attention to lan-
guage form because they take less for language meaning. Research suggests
that features affecting the cognitive complexity of a task include such things
as the task-taker’s familiarity with the content and/or format, the clarity of the
information given, the amount of computation involved, the number of other
interlocutors involved, and time allowed for completion (see Robinson, 2001b,
and Skehan & Foster, 2001, for a fuller exploration of this).
In the study reported in Tavakoli and Foster (2008), we explored how
the demands of the task being carried out affect the allocation of a language
learner’s limited attentional resources and impact on language performance,
as displayed by its fluency, accuracy, complexity, and lexical diversity. Our
particular interest was in two narrative task characteristics that previous studies
have suggested affect performance: storyline complexity and inherent structure.
Storyline complexity refers to whether a narrative has background as well as
foreground events, with a narrative with background events classified as more
complex than one with only foreground events. Thus, in the study we report
here, the Walkman task (see the Appendix) is categorized as a narrative with
storyline complexity because although the foreground action is about a man
taking a walk about town, there are things going on simultaneously—a car
crash, a bank robbery, an escaped tiger—that the narrator must include. The
Journey task, on the other hand, is categorized as without storyline complexity,
because beyond the foreground action of a couple cycling from place to place,
there are no other actions to include. Narrative structure refers to how the
episodes within the narrative connect to make the whole. A narrative with an
inherently loose structure, such as Walkman, can have its episodes reordered
without loss of coherence. (It does not matter if the bank robbery comes before
or after the escaped tiger.) A narrative with an inherently tight structure cannot
be reordered in this way. In the Football task, each frame must stay where it is
for the narrative to make sense.
Our line of enquiry was prompted by a post hoc finding in previous research.
Tavakoli and Skehan (2005) concluded that the unexpectedly high syntactic
complexity scores in two of the narrative tasks they used could be ascribed
to the narrative structure. An inspection of the data transcripts pinpointed
increased syntactic complexity at moments when the participants were trying
to connect foreground and background events. As Matthiessen and Thompson
(1988) and Harris and Bates (2002) both noted, in English the conditions,
reasons, and purposes of an event are signaled through syntactic subordination,
so it is to be expected that anyone connecting background to foreground events
in a narrative will be prompted to do so through subordinate clauses.
Two other studies (Foster & Skehan, 1996; Skehan & Foster, 1997) explored
the extent to which familiar and unfamiliar task content would impact on L2
task performance. In general, the results went as predicted: that talking in
an L2 about well-known information resulted in a more fluent and accurate
performance, whereas talking about new and unfamiliar information entailed
a performance that was less fluent and less accurate (but more complex).
Intriguingly, a post hoc analysis of the data from the two studies together
showed that a variable other than familiarity of content might be involved. The
most fluent performances across the two studies were observed in tasks that did
not have the same degree of familiar information. With the addition of planning
time (another variable under investigation), these two tasks returned the most
accurate performance. The interpretation offered was that the key variable was
structure: Both of these tasks dealt with events that had a clear time sequence
from beginning to middle and end and this had eased the L2 processing burden
and enabled greater attention to accuracy and fluency in performance. This was
tested out by Skehan and Foster (1999) and upheld. A narrative task with a
tightly structured storyline was associated with greater fluency and accuracy
than a loosely structured narrative. If a narrative is tightly sequenced from
i n he r e n t n a r r a t i ve s t r u c t u r e
− background n = 50 n = 50
+ background n = 50 n = 50
Total n = 100. Participants did either Journey and Football tasks, or Walkman and Picnic
tasks. The task order was counterbalanced. The native speaker study reported here used
exactly the same design, with n = 20 in each cell, and total N = 40.
Hypotheses
Essentially the same hypotheses were used as in the nonnative-speaker part of
the study.
Design
This replicates the design used in the nonnative-speaker phase of the research
(Figure 1), with the number of participants in this phase being the only dif-
ference. It has a 2 × 2 factorial design with storyline complexity and inher-
ent narrative structure as the two independent variables. Storyline complexity
was operationalized as number of storylines; the narratives with one storyline
(Football and Journey) contained only foreground information, whereas nar-
ratives with two storylines (Picnic and Walkman) contained both foreground
and background information. Inherent narrative structure was defined in terms
of whether the narratives were loose or tight. In the two loosely structured
narratives (Journey and Walkman) the events could be reordered without com-
promising the main story, and in two tightly structured narratives (Football
and Picnic) this was impossible. The participants told either Journey and Foot-
ball or Walkman and Picnic. This design allowed for a within-participants
comparison for inherent narrative structure and a between-participants com-
parison for the presence or absence of background events (see Figure 1). These
tasks are from Heaton (1966), Jones (1980) and Swan and Walter (1990) (see
Appendix).
Participants were 40 native speakers of English,6 all first-year students
studying Literature or Psychology at a university in London. They were between
18 and 60 years of age. The four narratives were typical of picture stories
used in English as a foreign language (EFL) classrooms. Each comprised six
cartoon frames. Previous trialing had shown the narrative in each was engaging,
clear, and unlikely to baffle. Each participant met individually with one of the
researchers in a quiet room. They were asked to look over the first picture story
for up to 3 min in order to understand the narrative and then they were asked
to tell it, as if to someone who could not see the pictures. Their performance
was recorded. When the first task was completed, the same procedures were
followed for a second story. To control for any practice effect, the order of
the stories was counterbalanced. Inspections of the transcripts showed that all
of the participants accomplished the tasks as expected, telling the stories as
presented in the pictures. No one did the tasks in an idiosyncratic way or came
up with an anomalous story.
the effect of inherent structure on the native speakers’ performance did not
reach significance (F = 2.10, ns, η2 = .149). Neither was there a significant
difference in the interaction of inherent structure and storyline complexity (F =
2.24, ns, η2 = .158). The only difference to reach statistical significance was
the effect of storyline complexity (F = 3.25, p < .05, η2 = .213), indicating
that the native speakers’ performance on foreground and background narratives
is significantly different from that on tasks with foreground events only. The
effect size for this difference indicates a noticeable effect (Tabachnic & Fidell,
1996) of storyline complexity variable on the native speakers’ performance. To
test Hypotheses 1 to 4, t-tests were run to compare the mean scores on measures
of fluency, complexity and lexical diversity.
M SD
M SD
longer MLU, with the native speakers consistently producing longer syntactic
runs when narrating Walkman and Picnic. The results from these two measures
of complexity lend support for H1.
Hypothesis 4, which predicts no difference in the effects of task design on
native and nonnative speakers, is also supported here. As reported by Tavakoli
and Foster (2008), nonnative performance on these same narrative tasks was
affected in the same way by storyline complexity. The narrative tasks where
the storyline contained background as well as foreground information resulted
in learners attempting significantly more complex language to describe them,
both in Tehran (Picnic vs. Football: F = 4.13, p < .001; Walkman vs. Journey:
F = 2.02, p < .05) and in London (Picnic vs. Football: F = 4.30, p < .001;
Walkman vs. Journey: F = 2.85, p < .01).
In terms of MLU there was also an effect for storyline complexity; in tasks
with background events, learners produced significantly longer MLUs, both in
Tehran (Picnic vs. Football: F = 4.63, p < .001; Walkman vs. Journey: F =
2.87; p < .01) and in London (Picnic vs. Football: F = 3.78, p < .001; Walkman
vs. Journey: F = 4.82, p < .001). The effect of storyline complexity is the same
for both learners and native speakers. Figures 2 and 3 illustrate how the data for
all three groups (native speakers, learners in London, and learners in Tehran)
follows the same general pattern across tasks, although it also indicates that,
for the Picnic task, the nonnative-speaker data from London converges with
the native-speaker data.
2.00
1.90
Syntactic Subordination
1.80
1.70
1.60 NNS Tehran
1.50 NNS London
1.40 NS
1.30
1.20
1.10
1.00
Journey Football Walkman Picnic
Narrative Tasks
11
10
9 NNS Tehran
MLU
NNS London
8 NS
6
Journey Football Walkman Picnic
Narrative Tasks
both repair fluency (reformulation, false start, replacement, and repetition) and
breakdown fluency (number of pauses occurring in the middle and at the end
of clauses, total amount of silence in the middle and at the end of clauses).
Table 3 shows the results of the t tests comparing measures of repair fluency
and Table 4 shows the results of t tests comparing breakdown fluency.
The figures in Table 3 are all very low, indicating that native speakers, by and
large, do not need to repair their speech. The scores are, with one exception,
higher for the loosely structured narratives (Journey and Walkman) than on
Reformulations 0.2 (0.52) 0.35 (0.67) 0.90 .370 ns 0.30 (.80) 0.50 (.51) 0.94 .359 ns
False starts 1.00 (2.00) 1.45 (2.00) 0.96 .345 ns 0.80 (.89) 0.85 (1.00) 0.20 .847 ns
Replacements 0.65 (1.2) 0.85 (1.38) 0.94 .359 ns 0.45 (.51) 0.25 (.55) 1.28 .214 ns
Repetitions 0.4 (1.00) 0.55 (1.23) 0.82 .419 ns 0.25 (.55) 0.50 (.88) 0.142 .171 ns
Note. Standard deviations are given in parentheses.
Language Learning 59:4, December 2009, pp. 866–896
tight ones (Picnic and Football), but there is no significant difference for any
measure. We can conclude that the inherent structure of a narrative does not
affect any measures of repair fluency in native speakers’ performance.
Table 4 shows that though the native speakers consistently produce fewer
pauses and a smaller amount of silence on tightly structured narratives (Football
and Picnic) than on loosely structured narratives (Journey and Walkman), these
differences are far from significant. Taken together these results do not support
H2 and suggest that native-speaker fluency is not affected by inherent narrative
structure.
For H4, concerning the comparison of effects on non-native speaker perfor-
mance, we use the results reported in Tavakoli and Foster (2008). These showed
that inherent task structure had a very limited effect. The only measure of repair
fluency to reach statistical significance was false starts in the Tehran data for
the Walkman/Picnic comparison (3.63 vs. 6.23, F = 3.49, p < .01). This is as
predicted, but in the absence of any other significant results for repair fluency,
it cannot be taken as strong support for it. For the learners in London, no sig-
nificant results were reached for any measure of repair fluency. On measures
of breakdown fluency, performances were generally more fluent for narratives
with tight structure, but only two measures reached significance: These were
number of pauses midclause for the learners in Tehran for the Football/Journey
comparison (9.05 vs. 11.46, F = 3.38, p < .01) and total amount of silence mid-
clause for the learners in London for the Walkman/Picnic comparison (8.13 vs.
9.89, F = 2.67, p < .05). Taken together, the nonnative and native results show
no effect on native speakers for inherent narrative structure, whereas nonnative
performance, especially in breakdown measures such as pausing, is affected
to some small degree. H4, then, receives weak support.10 Figure 4 shows the
pattern of results for all three groups of participants. As reported earlier, a task
effect is discernible but weak, with a little less pausing on tasks with inherent
structure. What is most noticeable is that the native speakers pause very much
less midclause than their nonnative counterparts. Figure 5 shows an opposite
pattern, with end-of-clause pauses very much common in the nonnative data,
although with the learners in London converging on the native speaker levels
for the Picnic task.
14
12
No. of mid clause pauses
10
NNS Tehran
8
NNS London
6
NS
4
0
Journey Football Walkman Picnic
Narrative Tasks
10
9
No. of end-clause pauses
8
NNS Tehran
7
NNS London
6
NS
5
3
Journey Football Walkman Picnic
Narrative Tasks
50.00
45.00
D: Lexical Diversity
40.00
NNS Tehran
35.00 NNS London
NS
30.00
25.00
20.00
Journey Football Walkman Picnic
Narrative Tasks
shows how similar the patterns of results are. In both the London and the
Tehran data, the Picnic task, which does have background events, gives rise to
less lexical diversity than the Football task, which does not. However, Walkman,
the other task with background events, gives rise to greater lexical diversity
than Journey, which has no background. As there was no significant effect for
storyline complexity in either the London or the Tehran data, we can conclude
that this narrative feature does not influence lexical diversity in either native or
nonnative performance.
We summarize here that fluency and lexical diversity in native-speaker
narratives are not affected by narrative design and that native speakers maintain
fluency regardless of whether they are narrating a story with a tight narrative
structure or a loose one. It is also notable that incidences of repair fluency were
uniformly few (Table 2) and that when native speakers did pause, it was far
more likely to be on a syntactic boundary than between them (Table 3). Lexical
diversity in native speakers was unaffected by task design with similar scores
for D reported for single- and dual-storyline tasks. There was, as predicted,
an effect on syntactic complexity for storyline complexity, with the ratio of
clauses to AS units significantly higher for the narratives with dual storylines
and MLU significantly higher also for dual narratives with an inherently tight
structure.
When comparing task effects on native and nonnative speakers, it is clear
that storyline complexity operates in the same way on syntactic complexity
regardless of whether the speaker is using English as an L2 or a native language.
However, whereas a tight inherent narrative structure exerts no influence on
native-speaker fluency, it does seem that it helped nonnative speakers to perform
more fluently in the L2. In neither group did narrative structure have a significant
effect on lexical diversity.
Interestingly the data presented earlier indicate that the variance among
native and nonnative speakers’ performance was not greatly different. The
standard deviation figures show that there is no particular difference between
the two groups in the variance in scores for fluency, length of utterance, and
lexical diversity.
Discussion
The study reported here adds a native-speaker dimension to the nonnative
speaker study reported by Tavakoli and Foster (2008) in order to see whether
effects that might be ascribed to the demands of processing an imperfectly
known language are better ascribed to the demands of processing a task. The re-
sults show quite clearly that native speakers, whose L1 skills rest on a secure
and implicit knowledge of its systems, maintain their fluency regardless of the
structure or complexity of the narrative they are telling and that when they
pause (as they must, for a seamless performance would be highly unnatural
and very hard on the listener), they are much more likely to do so at syn-
tactic boundaries, indicating that they plan and deliver their stream of speech
in clause-length runs (Pawley & Syder, 1983). Our data show that pausing in
nonnative speakers’ performance occurs more often in the middle of clauses.
They are less secure in planning and delivering a whole clause of the L2 at a
time, but, as Tavakoli and Foster (2008) showed, the processing burden can be
eased by a narrative such as Picnic and Football, in which the events follow on
from each other in a predictable way. The native-speaker data presented here
shows that L1 fluency is something that does not rest on this aspect of task
design, whereas L2 fluency does, at least in terms of degree and positioning of
pausing.
For complexity, the conclusions are different. Tavakoli and Foster (2008)
showed that for nonnative speakers, greater storyline complexity resulted in
greater syntactic complexity. This pattern of results was obtained here too with
the native speakers and indicates that increased syntactic complexity is, at least
partly, a task effect and not wholly a language processing effect. Dual storylines
provoke greater use of subordinated clauses in English; single storylines do
not. A native speaker selects subordination to weave two storylines together,
not because a secure knowledge of English subordination allows it but because
the nature of the narrative like Picnic and Walkman requires it. Thus, a learner
with an insecure control of English subordination (such as relative clauses,
if/when clauses, unless/although/in case clauses) has a harder task telling such
a narrative and may be less accurate or less fluent than on a narrative with only
a single storyline.
In Tavakoli and Foster (2008), we suggested that, for oral testing purposes,
care had to be taken when choosing narratives for an exam. The native-speaker
data presented in this part of the study reinforce such a conclusion.
The lexical diversity results are most interesting. The comparison of the
mean scores of the three groups of participants’ lexical diversity of performance
shows that, unlike on any other measure, the learners in London are far closer
to the native speakers than they are to the learners in Tehran. This was not
predicted by any of the original hypotheses of the study, so it does invite a post
hoc analysis of variance to compare the three groups across the four tasks. This
is shown in Table 6.
We see from Table 6 that across all four tasks the learners in London are
producing language that is significantly more lexically diverse than that of the
learners in Tehran and at the same time is not significantly less diverse than
that of the native speakers. The first part of this result was already clear in
Tavakoli and Foster (2008) where the t test to compare D scores from the
London and Tehran learners on all tasks reported significant results in favor of
greater diversity in the London data (Football p < .01; Journey p < .02; Picnic
p < .003; Walkman p < .006). The ANOVA in Table 6 shows the remarkable
extent of this difference. The learners in London use a diversity of vocabulary
that is closing in on the native speakers. When we consider ANOVAs for all
other dimensions of performance, we see this result on only one other variable:
total seconds silence at end of clause, where across all tasks the London data
are significantly different from the Tehran data, and not significantly different
from the native-speaker data (Football p < .001, η2 = .322; Journey p < .02,
η2 = .335; Picnic p < .001, η2 = .282; Walkman p < .002, η2 = .175). We can
see these results as linked in some way. Knowledge of lexis, especially the
kind that is gained implicitly through frequent exposure, is not just knowledge
of individual words but of chunks of words that occur regularly in the same
patterns.12 A wider knowledge of words in phrases, such as would be gained
through living within the target culture and hearing the language on a daily
basis, would increase the learner’s ability to plan and execute phrase by phrase,
not word by word, which a lesser exposure, say only through classroom contact,
might allow. Hence, the learners in London have acquired lexical knowledge,
which means they pause more naturally at clause boundaries than within them.
Again, the native-speaker data shows this aspect of L2 performance is closing
in on native-speaker patterns of fluency. The results suggest that living inside
the target language community conveys a considerable lexical advantage, with
learners in London able to draw upon an enriched and networked lexicon.
A further qualitative analysis of the narrative transcripts undertaken of each
narrative, frame by frame, examined the lexical choices of the learners and the
native speakers in the face of the same picture prompts and showed that the
learners in London do indeed have a greater familiarity with nativelike lexical
selections (i.e., they were much more likely than the learners in Tehran to make
lexical choices similar to the native speakers), and that they use many more
lexicalized phrases13 than the learners in Tehran. The London data contained
twice as many of these as the Tehran data, although only half as much as
the native speaker data (see Foster, 2009). Crucially, however, there is no
grammatical advantage. As shown by Tavakoli and Foster (2008), in terms of
accuracy of performance the two groups of learners are not distinct.
Such results pose interesting questions for how a model of L2 speaking
fits Levelt’s model of L1 speaking, discussed earlier. We could perhaps safely
assume that in the prelinguistic conceptualization phase, native and nonnative
speakers are on a level playing field, sharing whatever conceptual demands
the speaking task imposes. However, in the formulation stage, during which
linguistic coding of the message takes place, the playing field is not even. Native
speakers have greater knowledge of and faster access to the linguistic code and
can formulate information even as conceptualization feeds it in. Nonnative
speakers with less knowledge (of say, lexical relationships or relative clauses)
and slower retrieval may have to conceptualize and formulate utterances in
alternation rather than in tandem due to insufficient attentional capacity and may
also be going through an intermediary stage whereby formulation occurs first in
the structures of their L1 and then translated before it can move to articulation.
Both such delays would account for the higher incidence of midclause pausing
that characterizes L2 performance (and, of course, may be exacerbated by a
less than automatic neuromuscular control of L2 phonemes). However, greater
use of nativelike lexical phrases in the London learner data (Foster, 2009)
would also account for the different pausing patterns (Figures 4 and 5) between
learners in London and Tehran. The learners in London, with a greater store
of chunked phrases, were able to plan and execute more clause-length runs
because their L2 formulator was taking the same processing shortcuts as native
speakers. We have reported here that whereas the tightness or looseness of
narrative structure made no difference to native-speaker fluency, it had affected
nonnative-speaker performance, with tight narrative structure associated with
increased fluency and accuracy. We can see this task feature in terms of easing
attentional demands on the conceptualizer, because the direction of a tight
narrative requires little thought or interpretation, and so for the L2 formulator
and articulator, there is more attention available. For native speakers, who
make few errors beyond an occasional slip of the tongue or loss of syntactic
cohesion, neither the formulator nor the articulator require much attentional
capacity to start with, so the extra space afforded by a tight narrative has no
discernible effect.
Although one might claim that with a few modifications to account for depth
of knowledge and speed of retrieval, the Levelt model is directly applicable to
L2 performance (see Kormos, 2006), research evidence is needed to put some
flesh on these bones. This article has gone some way to providing a small
amount, and clearly there is scope for a great deal more work. Especially,
it would be interesting to compare native-speaker performance to different
levels of nonnative-speaker performance. It would be very useful to be able to
characterize nonnative-speaker performance at a level approaching nativelike
to explore in which dimensions their performance comes to resemble that of
native speakers and, ultimately, to contrast this with high- and low-intermediate
and beginning learners. Our data suggest that intermediate learners with greater
exposure to the target language might be relying more on lexical processing
than grammatical processing. One question that presents itself is whether it
would be possible to see at what point in their development, learners integrate
lexical and syntactic processes as they approach higher levels of proficiency.
Coupled to this would be explorations of task variables, such as the design
of content or implementation conditions, to illuminate further which variables
do or do not affect native-speaker performance and at what levels of nonnative
proficiency we see a convergence with native-speaker results. Although most of
this research is envisaged to be based on comparisons of groups of learners and
Notes
1 See Skehan and Foster (2009) for a full discussion of the definition of native and
nonnative speakers.
2 Such as simultaneously rotating your left foot clockwise and your left hand
anticlockwise or speaking on a mobile phone while driving through heavy traffic.
3 Robinson (2001a, 2001b) used work in psychology by Navon (1989) and Neuman
(1996) to challenge limited-capacity processing, proposing a different model of
attention in which there are multiple attentional “pools.” For him, depletion of
attention in one pool has no effect on the amount remaining in another, so language
learners do not need to prioritize one dimension of performance over any other. In
his revised Triadic Framework (Robinson & Gilabert, 2007), accuracy and
syntactic complexity depend on whether the cognitive complexity of a task is
driven by resource-directing or resource-dispersing variables (pp. 166–167).
4 As we note earlier, this study is a replication of the study by Foster and Skehan
(1996) using native rather than nonnative speakers.
5 Full details of these, and the statistical procedures used, are given in this article and
are compared to the parallel native-speaker results.
6 All had learned English from early childhood; none were proficient in any other
language.
7 AS units and clauses divide the data into complete and dependent syntactic units
and enable a syntactic complexity measure to be calculated as a ratio of clauses to
AS units. Foster, Tonkyn, and Wigglesworth (2000) give a full and detailed
definition of how these are identified.
8 MLU was expressed as mean number of words per AS unit.
9 Unlike a straightforward type-token ratio, the measure D corrects for length of text
and is therefore more valid.
10 The very different patterns of results in London and Tehran are discussed further
below.
11 We thank Jeanine Treffers-Daller for this observation.
12 See, for example, Weinert (1995).
13 These are defined here as either fully or partially prefabricated chunks of language
that constitute single choices for retrieval by the formulator.
References
Alderson, C. (1980). Native and non-native speaker performance on cloze test.
Language Learning, 30, 59–76.
Clark, J. (1977). The performance of native speakers of English on the Test of English
as a Foreign Language (TOEFL Report No 1). Princeton, NJ: Educational Testing
Service.
Crookes, G. (1989). Planning and interlanguage variation. Studies in Second Language
Acquisition, 11, 367–383.
Davies, A. (2003). The native speaker: Myth and reality. Clevedon, UK: Multilingual
Matters.
Elder, C., Iwashita, N., & McNamara, T. (2002). Estimating the difficulty of oral
proficiency tasks: What does the test-taker have to offer. Language Testing, 19(4),
343–368.
Ellis, R. (1987). Interlanguage variability in narrative discourse: Style shifting in the
use of the past tense. Studies in Second Language Acquisition, 9, 12–20.
Ellis, R. (2005). Planning and task performance in a second language. Amsterdam:
Benjamins.
Ellis, R., & Yuan, F. (2005). The effects of careful within-task planning on oral and
written task performance. In R. Ellis (Ed.), Planning and task performance in a
second language (pp. 167–193). Amsterdam: Benjamins.
Foster, P. (2000). Attending to message and medium: The effects of planning time on
the task-based language performance of native and non-native speakers.
Unpublished doctoral dissertation, King’s College, London.
Foster, P. (2001). Rules and routines: a consideration of their role in the task-based
language production of native and non-native speakers. In M. Bygate, P. Skehan, &
M. Swain (Eds.), Researching pedagogic tasks: Second language teaching, learning
and testing (pp. 75–95). London: Longman.
Foster, P. (2009). Lexical diversity and native-like selection: The bonus of studying
abroad. In H. Daller, D. Malvern, P. Meara, J. Milton, B. Richards, and J.
Treffers-Daller (Eds.), Vocabulary studies in first and second language acquisition:
The interface between theory and application (pp. 91–106). Basingstoke, UK:
Palgrave-Macmillan.
Foster, P., & Skehan, P. (1996). The influence of planning and task type on second
language performance. Studies in Second Language Acquisition, 18, 299–323.
Foster, P., Tonkyn, A., & Wigglesworth, G. (2000). Measuring spoken language.
Applied Linguistics, 21(3), 354–375.
Givon, T. (1985). Function, structure, and language acquisition. In D. Slobin (Ed.), The
crosslinguistic study of language acquisition (Vol. 1, pp. 1008–1025). Hillsdale, NJ:
Lawrence Erlbaum.
Harris, C., & Bates, E. (2002). Clausal background and pronominal reference: A
functionalist approach to c-command. Language and Cognitive Processes, 17(3),
237–269.
Tabachnic, B. G., & Fidell, L. S. (1996). Using multivariate statistics. New York:
Harper Collins.
Tavakoli, P., & Foster, P. (2008). Task design and second language performance:
The effect of narrative type on learner output. Language Learning, 58(2),
439–473.
Tavakoli, P., & Skehan, P. (2005). Strategic planning, task structure and performance
testing. In R. Ellis (Ed.), Planning and task performance in a second language (pp.
239–277). Amsterdam: Benjamins.
VanPatten, B. (1990). Attending to form and content in the input: An experiment in
consciousness. Studies in Second Language Acquisition, 12, 287–301.
Weinert, R. (1995). The role of formulaic language in second language acquisition: A
review. Applied Linguistics, 16(2), 181–205.
Whiteson, V. (1979). A study of proficiency and linguistic variety of non-native EFL
teachers. Unpublished doctoral dissertation, Indiana University, Bloomington.
Wigglesworth, G. (2001). Influences on performance in task-based oral assessments.
In M. Bygate, P. Skehan, & M. Swain (Eds.), Researching pedagogic tasks: Second
language teaching, learning and testing (pp. 186–209). London: Longman.
Appendix
Figure A3 Walkman task (Swan & Walter, 1990). Reprinted with permission.