Discourse Analysis

The document outlines the definitions and distinctions between Text Linguistics and Discourse Analysis, emphasizing their overlapping yet distinct focuses on text-internal and text-external criteria. It discusses the evolution of these fields, highlighting their multidisciplinary nature and the importance of context in discourse studies. The text also notes the integration of various linguistic disciplines and approaches that contribute to a comprehensive understanding of language use in social contexts.

Uploaded by

KimberlyAnne95


Chapter outline:

• Defining text and discourse.
• Defining Text Linguistics and Discourse Analysis.
• Evolution of Text Linguistics and Discourse Analysis through time.
• Approaches to the phenomenon of discourse.
• The job and interests of discourse analysts.

1.1 Defining text and discourse. What is Text Linguistics? What is Discourse
Analysis?
To define and describe the scope of study of Text Linguistics and Discourse Analysis and to establish
the differences between them both is not an easy task. Suffice it to say that the terms text and discourse
are used in a variety of ways by different linguists and researchers: there is a considerable number of
theoretical approaches to both Text Linguistics and Discourse Analysis and many of them belong to very
different research traditions, even when they share similar basic tenets.
In everyday popular use it might be said that the term text is restricted to written language, while
discourse is restricted to spoken language. However, modern Linguistics has introduced a concept of text
that includes every type of utterance; therefore a text may be a magazine article, a television interview, a
conversation or a cooking recipe, just to give a few examples.
Crystal (1997) defines Text Linguistics as “the formal account of the linguistic principles governing
the structure of texts”. De Beaugrande and Dressler (1981) present a broader view; they define text as a
communicative event that must satisfy the following seven criteria:

1) Cohesion, which has to do with the relationship between text and syntax. Phenomena such as
conjunction, ellipsis, anaphora, cataphora or recurrence are basic for cohesion.
2) Coherence, which has to do with the meaning of the text. Here we may refer to elements of
knowledge or to cognitive structures that do not have a linguistic realization but are implied by
the language used, and thus influence the reception of the message by the interlocutor.
3) Intentionality, which relates to the attitude and purpose of the speaker or writer.
4) Acceptability, which concerns the preparation of the hearer or reader to assess the relevance or
usefulness of a given text.
5) Informativity, which refers to the quantity and quality of new or expected information.
6) Situationality, which points to the fact that the situation in which the text is produced plays a
crucial role in the production and reception of the message.
7) Intertextuality, which refers to two main facts: a) a text is always related to some preceding or
simultaneous discourse; b) texts are always linked and grouped in particular text varieties or
genres (e.g.: narrative, argumentative, descriptive, etc.) by formal criteria.

In spite of the considerable overlap between Text Linguistics and Discourse Analysis (both of them
are concerned with the notion of cohesion, for instance) the above criteria may help us make a distinction
between them.
Titscher et al. (2000) explain that the first two criteria (cohesion and coherence) may be defined as
text-internal, whereas the remaining criteria are text-external. Those approaches oriented towards ‘pure’
Text Linguistics give more importance to text-internal criteria, while the tradition in Discourse Analysis
has always been to give more importance to the external factors, for they are believed to play an essential
role in communication.
Some authors, such as Halliday, believe that text is everything that is meaningful in a particular
situation: “By text, then, we understand a continuous process of semantic choice” (1978:137). In the
“purely” text-linguistic approaches, such as the cognitive theories of text, texts are viewed as “more or
less explicit epi-phenomena of cognitive processes” (Titscher et al., 2000: 29), and the context plays a
subordinate role.
It could be said that the text-internal elements constitute the text, while the text-external ones
constitute the context. Schiffrin points out that all approaches within Discourse Analysis view text and
context as the two kinds of information that contribute to the communicative content of an utterance, and
she defines these terms as follows:
I will use the term “text” to differentiate linguistic material (e.g. what is said, assuming a verbal channel)
from the environment in which “sayings” (or other linguistic productions) occur (context). In terms of
utterances, then, “text” is the linguistic content: the stable semantic meanings of words, expressions, and
sentences, but not the inferences available to hearers depending upon the contexts in which words,
expressions, and sentences are used. […] Context is thus a world filled with people producing utterances:
people who have social, cultural, and personal identities, knowledge, beliefs, goals and wants, and who
interact with one another in various socially and culturally defined situations. (1994: 363)

Text in context

Thus, according to Schiffrin, Discourse Analysis involves the study of both text and context. One might
conclude, then, that Text Linguistics only studies the text, while Discourse Analysis is more complete
because it studies both text and context. However, as has been shown, there are definitions of text (like de
Beaugrande’s) that are very broad and include both elements, and that is why it would be very risky to
talk about clear-cut differences between the two disciplines. De Beaugrande’s (2002) definition of Text
Linguistics (hereinafter TL) as “the study of real language in use” does not differ from many of the
definitions of Discourse Analysis (hereinafter DA) presented by Schiffrin within its functional approach,
some of which are the following:
The study of discourse is the study of any aspect of language use (Fasold, 1990: 65).

The analysis of discourse is, necessarily, the analysis of language in use. As such, it cannot be restricted to
the description of linguistic forms independent of the purposes or functions which these forms are designed
to serve in human affairs (Brown & Yule, 1983: 1).

Discourse… refers to language in use, as a process which is socially situated (Candlin, 1997: ix).

Thus, we see that the terms text and discourse are sometimes used to mean the same and therefore one
might conclude that TL and DA are the same, too. It can be said, nevertheless, that the tendency in TL
has been to present a more formal and experimental approach, while DA tends more towards a functional
approach. Formalists are apt to see language as a mental phenomenon, while functionalists see it as a
predominantly social one. As has been shown, authors like Schiffrin integrate both the formal and the
functional approaches within DA, and consequently, DA is viewed as an all-embracing term which would
include TL studies as one approach among others.
Slembrouck points out the ambiguity of the term discourse analysis and provides another broad
definition:
The term discourse analysis is very ambiguous. I will use it in this book to refer mainly to the linguistic
analysis of naturally occurring connected speech or written discourse. Roughly speaking, it refers to
attempts to study the organisation of language above the sentence or above the clause, and therefore to study
larger linguistic units, such as conversational exchanges or written texts. It follows that discourse analysis is
also concerned with language use in social contexts, and in particular with interaction or dialogue between
speakers. (2005:1)

Another important characteristic of discourse studies is that they are essentially multidisciplinary, and
therefore it can be said that they cross the Linguistics border into different and varied domains, as van
Dijk notes in the following passage:
…discourse analysis for me is essentially multidisciplinary, and involves linguistics, poetics, semiotics,
psychology, sociology, anthropology, history, and communication research. What I find crucial though is
that precisely because of its multi-faceted nature, this multidisciplinary research should be integrated. We
should devise theories that are complex and account both for the textual, the cognitive, the social, the
political and the historical dimension of discourse. (2002: 10)

Thus, when analyzing discourse, researchers are not only concerned with “purely” linguistic facts;
they pay equal or more attention to language use in relation to social, political and cultural aspects. For
this reason, discourse is not only within the interests of linguists; it is a field that is also studied by
communication scientists, literary critics, philosophers, sociologists, anthropologists, social psychologists,
political scientists, and many others. As Barbara Johnstone puts it:
… I see discourse analysis as a research method that can be (and is being) used by scholars with a variety of
academic and non-academic affiliations, coming from a variety of disciplines, to answer a variety of
questions. (2002: xi)

As noted above, not all researchers use and believe in the same definition of text and discourse. In
this book, we are going to adopt the general definition of DA as the study of language in use, and we
shall follow Schiffrin in including both text and context as parts of discourse, in which case we will
consider the term text in its narrow sense, not in the broad sense that could place it on a par with the term
discourse.

1.2. Origins and brief history of Text Linguistics and Discourse Analysis
Parallel to the Chomskyan Generative School (whose starting point is considered to be the publication
of Syntactic Structures in 1957), other schools emerged in different parts of the world that supported
different and even opposing ideas to those of Chomsky.
All these new schools believed that a good linguistic description should go beyond the sentence, and
pointed to the fact that there are certain meanings and aspects of language that cannot be understood or
embraced if its study is limited to the syntactic analysis of sentences.
Thus, in the twentieth century, the following new disciplines emerged within the field of Linguistics:
• Functionalism (functional grammars)
• Cognitive Linguistics
• Sociolinguistics
• Pragmatics
• Text Linguistics
• Discourse Analysis

All these new disciplines are interrelated, and sometimes it is very difficult to distinguish one from the
other, due to the fact that all of them have common denominators. Bernárdez (1999: 342) explains the
basic tenets of these disciplines, which are summarized here as follows:
a) Language only exists in use and communication. It always fulfils certain functions in human
interaction.
b) Language use is necessarily social.
c) Language is not autonomous. It shares some characteristics with other social and cognitive
phenomena.
d) The description of language must account for the real facts of language. It should not postulate
hidden entities only motivated by the needs of the formal system utilized.
e) Linguistic structures should be closely linked to the conditions of language use.
f) Language is natural and necessarily vague and inaccurate; therefore any prediction can only be
probabilistic.

When performing DA, then, researchers may also engage themselves in Functional Grammar,
Sociolinguistics, Pragmatics or Cognitivism, because all these fields are interrelated and have common
tenets. As regards TL and DA, we may speak of a progressive “integration” of both disciplines, for, if we
observe the evolution of language research through time, it will be noticed that many scholars have
moved from TL into DA as part of the natural flow of their beliefs and ideas, as is the case with van Dijk,
who, in his biographical article of 2002, explains how his research evolved from Text Grammar to
Critical Discourse Analysis2. This author points out that the main aim of his studies in the 1970s was to
give an explicit description of the grammatical structure of texts, and the most obvious way of doing so
was by accounting for the relationship among sentences. A very important concept for Text Grammar at
that time was the introduction of the notion of macrostructure (van Dijk, 1980). Another fundamental
notion was that of coherence and the idea that texts are organized at more global descriptive levels than
that of the sentence. Later on, and under the influence of the cognitive theories, the notion of strategic
understanding was developed, which attempted to account for what the users of a language really do
when they understand a given text. Van Dijk also notes how several other new concepts were introduced
in TL studies, such as socio-cultural knowledge and mental models (Johnson-Laird, 1983), as well as all
the ideas and concepts coming from the field of Pragmatics. In his particular case, he took interest in the
study of power and ideology, which places him within the DA stream-of-thought known as Critical
Discourse Analysis3.
Thus, after the early and uniform stage of “Text Grammar”, TL went through a series of more open
and diversified stages. The “textuality” stage emphasized the global aspects of texts and saw the text as a
functional unit, larger than the sentence. This stage led into the “textualization” or “discourse processing”
stage, where analysts “set about developing process models of the activities of discourse participants in
interactive settings and in ‘real time’” (de Beaugrande, 1997: 61-62).
The current aim in DA is to describe language where it was originally found, i.e. in the context of
human interaction. In this respect, it is important to point out that this interaction often involves other
media besides language. Examples of these other semiotic systems may be gesture, dance, song,
photography or clothing, and it is also the discourse analyst’s job to explain the connection between these
systems and language. In order to achieve these aims, different researchers have taken different
approaches. We now turn to them.

1.3. Approaches to the phenomenon of discourse

Current research in DA, then, flows from different academic fields. This is one of the reasons why the
terms discourse and discourse analysis are used to mean different things by different researchers.
Schiffrin et al. note that all the definitions fall into three main categories:

1) Anything beyond the sentence
2) Language use
3) A broader range of social practice that includes non-linguistic and non-specific instances of
language. (2001: 1)

Authors such as Leech (1983) and Schiffrin (1994) distinguish between two main approaches: 1) the
formal approach, where discourse is defined as a unit of language beyond the sentence, and 2) the
functional approach, which defines discourse as language use. Z. Harris (1951, 1952) was the first
linguist to use the term discourse analysis and he was a formalist: he viewed discourse as the next level in
a hierarchy of morphemes, clauses and sentences. This view has been criticized due to the results shown
by researchers like Chafe (1980, 1987, 1992), who rightly argued that the units used by people in their
speech cannot always be categorized as sentences. People generally produce units that have a semantic
and an intonational closure, but not necessarily a syntactic one.

2 Another example can be found in de Beaugrande (1997: 68) when he comments on how his concepts of text and
discourse evolved over a series of studies and expanded beyond the linguistic focus he first encountered.
3 This approach is presented and studied in Chapter 10.

Functionalists give much importance to the purposes and functions of language, sometimes to the
extreme of defending the notion that language and society are part of each other and cannot be thought of
as independent (Fairclough, 1989; Foucault, 1980). Functional analyses include all uses of language
because they focus on the way in which people use language to achieve certain communicative goals.
Discourse is not regarded as one more of the levels in a hierarchy; it is an all-embracing concept which
includes not only the propositional content, but also the social, cultural and contextual contents.
As explained above, Schiffrin (1994) proposes a more balanced approach to discourse, in which both the
formal and the functional paradigms are integrated. She views discourse as “utterances”, i.e. “units of
linguistic production (whether spoken or written) which are inherently contextualized” (1994: 41). From
this perspective, the aims for DA are not only sequential or syntactic, but also semantic and pragmatic.

Bodily hexis

Within the category of discourse we may include not only the “purely” linguistic content, but also
sign language, dramatization, or the so-called ‘bodily hexis’ (Bourdieu, 1990), i.e. the speaker’s disposition
or the way s/he stands, talks, walks or laughs, which has to do with a given political mythology. It can
thus be concluded that discourse is multi-modal because it uses more than one semiotic system and
performs several functions at the same time.
Wetherell et al. (2001) present four possible approaches to DA, which are summarized as follows:

1. The model that views language as a system, in which it is therefore important for
researchers to find patterns.
2. The model that is based on the activity of language use, more than on language in
itself. Language is viewed as a process and not as a product; thus researchers focus
on interaction.
3. The model that searches for language patterns associated with a given topic or
activity (e.g. legal discourse, psychotherapeutic discourse, etc.).
4. The model that looks for patterns within broader contexts, such as “society” or
“culture”. Here, language is viewed as part of major processes and activities, and as
such the interest goes beyond language (e.g. the study of racism or sexism through
the analysis of discourse).

In spite of these categorizations, it would not be unreasonable to say that there are as many approaches
to discourse as there are researchers devoted to the field, for each of them proposes new forms of analysis
or new concepts that somehow transform or broaden previous modes of analysis. However, it would also
be true to say that all streams of research within the field are related to one another, and sometimes it is
difficult to distinguish among them. Precisely with the aim of systematizing the study of discourse and
distinguishing among different ways of solving problems within the discipline, different traditions or
schools have been identified. It would be impossible to embrace them all in only one work, and for that
reason, in this book we are only going to concentrate on the main ideas and practices within some of the
best-known schools, which are the following:

1. Pragmatics (Chapter 3)
2. Interactional Sociolinguistics (Chapter 4)
3. Conversation Analysis (Chapter 5)
4. The Ethnography of Communication (Chapter 6)
5. Variation Analysis and Narrative Analysis (Chapter 7)
6. Functional Sentence Perspective (Chapter 8)
7. Post-structuralist Theory and Social Theory (Chapter 9)
8. Critical Discourse Analysis and Positive Discourse Analysis (Chapter 10)
9. Mediated Discourse Analysis (Chapter 11)
A common characteristic of all these schools of thought is that they do not focus on language as an
abstract system. Instead, they all tend to be interested in what happens when people use language, based
on what they have said, heard or seen before, as well as in how they do things with language, such as
express feelings, entertain others, exchange information, and so on. This is the main reason why the
discipline has been called “Discourse Analysis” rather than “language analysis”.

1.4 What do discourse analysts do?

Broadly speaking, discourse analysts investigate the use of language in context; thus they are
interested in what speakers/writers do, and not so much in the formal relationships among sentences or
propositions. Discourse analysis, then, has a social dimension, and for many analysts it is a method for
studying how language “gets recruited ‘on site’ to enact specific social activities and social identities”
(Gee 1999: 1).
Even when a discipline is hard to delimit, as is the case with DA, we can learn a great deal about its
field of concern by observing what practitioners do. If we look at what discourse analysts do, we will
find they explore matters such as:

• Turn-taking in telephone conversations
• The language of humor
• Power relationships in doctor/patient interviews
• Dialogue in chat rooms
• The discourse of the archives, records or files of psychoanalysts
• The conversation at a dinner table
• The scripts of a given television program
• The discourse of politicians
• The study of racism through the use of discourse
• How power relations and sexism are manifested in the conversation between men and women
• The characteristics of persuasive discourse
• Openings and closings in different types of conversations
• The structure of narrative
• Representations of black/white people (or any race) in the written media (magazines,
newspapers, etc.)
• The strategies used by speakers/writers in order to fulfil a given discourse function
• The use of irony or metaphor for certain communicative aims
• The use of linguistic politeness
• The discourse of E-mail messages
• Legal discourse used in trials
• How people create social categories like “boy” or “immigrant” or “lady” as they talk to, about,
or among each other
• And a long etcetera…

These are just a few examples reflecting the concerns of discourse analysts, but they are sufficient to
demonstrate that researchers in DA are certainly concerned with the study of language in use. As
students/readers progress through the different chapters of this book, they will encounter several other
examples of possible DA areas of interest.
It is worth noting that, as Johnstone (2002) remarks, the discipline is called discourse analysis (and
not, for instance, “discourseology”) because it “typically focuses on the analytical process in a relatively
explicit way” (2002: 3). This analysis may be realized by dividing long stretches of discourse into parts
or units of different sorts, depending on the initial research question, and it can also involve looking at the
phenomenon under study in a variety of ways, by performing, for instance, a given set of tests.
Thus, discourse analysts have helped (and are helping) to shed light on how speakers/writers organize
their discourse in order to indicate their semantic intentions, as well as on how hearers/readers interpret
what they hear, read or see. They have also contributed to answering important research questions which
have led, for instance, to the identification of the cognitive abilities involved in the use of symbols or
semiotic systems, to the study of variation and change, or to the description of some aspects of the
process of language acquisition.
In order to carry out their analyses, discourse analysts need to work with texts. Texts constitute the
corpus of any given study, which may consist of the transcripts of a recorded conversation, a written
document or a computerized corpus of a given language, to name a few possibilities. The use of corpora
has become a very widespread practice among discourse researchers, and for that reason it is necessary
for any discourse analyst to acquire some basic knowledge of how to handle the data and how to work
with corpora. Chapter 2 is devoted to this enterprise.

1. The terms text and discourse have been –and still are– used ambiguously, and they are defined in
different ways by different researchers. In this book we are going to use the term text to refer to the
‘purely’ linguistic material, and we are going to consider discourse in a broader sense, defining it as
language in use, composed of text and context.
2. Text Linguistics and Discourse Analysis share some basic tenets and, while some authors make a
distinction between them, others use both terms to mean the same. However, it may be said that
“purely” Text Linguistic studies are more concerned with the text-internal factors (i.e. cohesion
and coherence), while Discourse Analysis focuses its attention more on the text-external factors,
without disregarding the text-internal ones. The history of these disciplines shows that research has
evolved, in many cases, from the narrower scope of Text Grammar (and later, Text Linguistics) into
the broader discipline of Discourse Analysis, and therefore both disciplines have merged. For this
reason and for clarifying and practical purposes, we shall consider DA as a macro-discipline that
includes several sub-approaches, among which the ‘purely’ text-linguistic ones can also be found.
3. In this book we are going to touch on the main theoretical and practical tenets of the following
traditions identified within discourse studies: Pragmatics, Conversation Analysis, Interactional
Sociolinguistics, Ethnography of Communication, Variation Analysis and Narrative Analysis,
Functional Sentence Perspective, Post-structural and Social Theory, Critical Discourse
Analysis/Positive Discourse Analysis and Mediated Discourse Analysis.
4. In order to learn about a given discipline, it is useful to look at what practitioners do. Discourse
analysts explore the language of face-to-face conversations, telephone conversations, e-mail
messages, etc., and they may study power relations, the structure of turn-taking, politeness strategies,
the linguistic manifestation of racism or sexism, and many, many other aspects of language in use.
The sky is the limit.
5. Discourse analysts are interested in the actual patterns of use in naturally-occurring texts. These
natural texts, once transcribed and annotated, are known as the corpus, which constitutes the basis for
analysis. Thus, discourse analysts necessarily take a corpus-based approach to their research.

Choose the answer that best suits the information given in Chapter 1.

1. Modern Linguistics has introduced a concept of text that…
a) is very restrictive.
b) includes all types of utterances.
c) includes only written discourse.

2. De Beaugrande and Dressler (1981)…

a) view Text Linguistics from a broader perspective than that of Crystal’s (1997).
b) define text in terms of three main criteria.
c) define text as a grammatical category.

3. According to Titscher et al. (2000), the first two criteria that define text (De Beaugrande &
Dressler, 1981)…
a) are text-external.
b) belong only to “pure” Text Linguistics.
c) are text-internal.

4. The tradition in Discourse Analysis has always been to…
a) give more importance to the text-external criteria of intentionality, acceptability, informativity,
situationality and intertextuality.
b) give more importance to the text than to the context.
c) consider context as playing a subsidiary role.

5. According to Schiffrin (1994) and other authors, Discourse Analysis…

a) involves only the study of context.
b) is devoted to the study of text.
c) includes the analysis of both text and context.

6. De Beaugrande & Dressler’s (1981) definition of Text Linguistics…
a) differs widely from Schiffrin’s (1994) definition of Discourse Analysis.
b) does not substantially differ from Schiffrin’s (1994), Fasold’s (1990), Brown & Yule’s (1983) or
Candlin’s (1997) definition of Discourse Analysis.
c) is exactly the same as Schiffrin’s (1994).

7. The tendency in Text Linguistics has been to…
a) present a more formal approach than that of Discourse Analysis.
b) present a more functional approach than that of Discourse Analysis.
c) be less formal than any other approach.

8. Functionalists see language…

a) mainly as a mental phenomenon.
b) as a predominantly social phenomenon.
c) as an acoustic phenomenon.

9. Many discourse analysts, like Schiffrin or Slembrouck …

a) integrate both the formal and functional approaches in their study of discourse.
b) do not mix the formal with the functional approach.
c) prefer the formal to the functional approach.

10. Discourse studies are…
a) restricted to the field of Linguistics.
b) devoted mainly to social phenomena.
c) essentially multidisciplinary.

11. Functionalism, Cognitive Linguistics, Sociolinguistics, Pragmatics, Text Linguistics and
Discourse Analysis are…
a) all relatively new disciplines which are interrelated.
b) completely different from one another.
c) very easily distinguished from one another.

12. Many scholars’ studies, like those of van Dijk or de Beaugrande…
a) have not changed substantially with time.
b) have evolved from Text Linguistics to Discourse Analysis.
c) do not show a natural flow of beliefs or ideas.

13. The current and main aim in Discourse Analysis is to…
a) study the formal aspects of texts.
b) discover the functions of language.
c) describe language in the context of human interaction.

14. Zellig Harris (1951, 1952)…

a) was a functionalist.
b) was the first scholar that used the term Discourse Analysis.
c) criticized Chafe’s view of Discourse Analysis.

15. According to Schiffrin (1994), utterances are…
a) written or spoken linguistic units that are inherently contextualized.
b) units which are essentially and only sequential and syntactic.
c) purely linguistic units.

16. Discourse is multi-modal because it…
a) embodies one semiotic system.
b) includes laughter in its study.
c) uses more than one semiotic system.

17. Wetherell et al. (2001)…

a) write about four possible approaches to Discourse Analysis.
b) write about only two models of analysis.
c) do not distinguish between models.

18. Discourse analysts are…
a) more interested in the grammatical aspects of language than in the details of its context.
b) more concerned with the actions of speakers or writers than with the formal relationships
between sentences.
c) not particularly interested in body language.

19. In general, we may say that discourse analysts are…
a) only interested in different types of conversations.
b) not interested in the written language.
c) mainly concerned with the study of language in use.

20. In order to carry out their analyses, discourse analysts…

a) very frequently use linguistic corpora as their data.
b) work only with written documents.
c) always use recorded conversations as texts.

A) READING: After reading the contents of this chapter, choose ONE of the following chapters
from books on Discourse Analysis and read it:

• Chapter 2 (“Definitions of discourse”) and chapter 10 (“Text and context”) in Deborah

Chapter Outline:
• Techniques of data collection and annotation.
• The ethics of data collection.
• Defining the term corpus.
• Different types of corpora.
• The use of corpora for Discourse Analysis.
• Available tools for corpus querying.

2.1. Data collection

One of the first problems we encounter when facing discourse analytic
research concerns the data to be used. Some questions arise, such as: What
type of discourse are we going to analyze? How are we going to collect
the data we need? And, in the case of spoken discourse, how are we going
to transcribe and annotate the data in such a way that we can show the
features of both text and context as faithfully as possible?
The answer to the first question depends completely on the objectives
the researcher has in mind, which, in turn, depend on the research
question. S/he may want, for example, to analyze spoken or written
language, or both, or s/he may want to focus on a given genre or register:
there are innumerable possibilities here. But once we know the type of
discourse we want to analyze, we have to figure out how to collect the
data; i.e. we need to decide upon the best possible way of getting a
linguistic corpus which will provide the basis for our research. As Taylor
remarks, “one of the processes by which material becomes data is
selection” (2001: 24), and there may be several different criteria for
selecting a sample. As noted above, these criteria depend on the goals of
research. Schiffrin explains how the different approaches to Discourse
Analysis (DA) take different perspectives and have different beliefs about
methods for collecting and analyzing data:

For example, some approaches focus intensively on a few fragments of talk
(e.g. interactional sociolinguistics), others focus on distributions of
discourse items across a wide range of texts (e.g. variationists). Some
require a great deal of social, cultural, and personal information about
interlocutors and may use interlocutors as informants in analysis of their
own talk (e.g. ethnography of communication); others assume an idealized
speaker/hearer whose specific social, cultural, or personal characteristics
do not enter into participant strategies for building text at all (e.g.
pragmatics). Methodological differences such as these are due, partially,
to different theoretical assumptions –assumptions that are based on the
different origins noted above (1994: 13).

Thus, when it comes to data collection, our goals will guide us in the
selection process and they are likely to lead us to choose different
procedures, such as recording and transcribing spoken discourse, keying
texts in, scanning, using texts which are stored in machine-readable form,
downloading material from the internet, etc.

2.2. Transcribing the data

A very important aspect of data collection in research involving talk or
spoken discourse is transcription. By means of the process of transcription
the researcher turns the spoken discourse in question into a document
called a transcript. Transcribing is not an easy task and it is very
time-consuming. If the researcher aims at some degree of objectivity (another
difficult –if not impossible– task), s/he should try to use a system of
transcription that shows, as faithfully as possible, all the variables that
intervene in the phenomenon under study. There is no such thing as a totally
neutral transcription, and, to date, it has not been possible to create
a system so perfect as to represent all variables and aspects. However,
discourse analysts have always made attempts to contrive annotation
systems that best suit the aims of their research, and that allow them to
obtain reliable results to a reasonable extent. For instance, a conversation
analyst who views talk as interaction would argue that the data will
include not only the words, but also other aspects of the conversation, such
as the sequential organization of the utterances of the different
participants, as well as the interruptions and pauses. Another important
requirement would be to work with a sample of ‘naturally occurring’ 1 talk,
rather than with data collected by means of research interviews 2. Some
analysts include information about the text, such as genre, date and
place of publication, etc. Others include information about the
pronunciation and intonation patterns, or about the speakers (sex, age,
occupation, social class, etc.). They can also assign labelled brackets to
each constituent of a sentence (parsing) or signal some features of spoken
language such as laughter, interruptions or hesitations. In general, and as
Johnstone (2005: 20) notes, it is crucial to be able to uncover the many
ways in which texts are shaped by contexts and the many ways in which
texts shape contexts.
For the purpose of illustration, we will now examine the attempts made
by a few authors to annotate their data.

1 Taylor explains that, in the most idealized form, naturally occurring talk “would
probably refer to informal conversation which would have occurred even if it was
not being observed or recorded, and which was unaffected by the presence of the
observer and/or recording equipment” (2001: 27).

2.2.1. Transcription conventions used by some discourse analysts

There is no single accepted way to transcribe or represent speech on the
page. Each analyst chooses and annotates the features that best suit the
purposes of his/her research, and consequently it can be said in all fairness
that there are almost as many ways to transcribe speech as there are
researchers who set about doing the task. Leech (2004) discusses this issue
in the following passage:
Any type of annotation presupposes a typology — a system of
classification — for the phenomena being represented. But linguistics, like
most academic disciplines, is sadly lacking in agreement about the
categories to be used in such description. Different terminologies abound,
and even the use of a single term, such as verb phrase, is notoriously a
prey to competing theories. Even an apparently simple matter, such as
defining word classes (POS), is open to considerable disagreement.
Against this background, it might be suggested that corpus annotation
cannot be usefully attempted: there is no absolute ‘God’s truth’ view of
language or ‘gold standard’ annotation against which the decision to call
word x a noun and word y a verb can be measured.

However, this does not mean that there is no consensus at all. For
example, when dealing with some syntactic categories, such as nouns,
verbs and so on, people performing transcription will normally agree on
how to label them, whereas they will disagree on less clear cases. Hence
the ideal procedure would be for the analyst to start from a consensual set
of categories and only use his/her own for the cases in which there is no
agreement whatsoever. In order to do this, it is first necessary to examine
previous systems of annotation designed by other researchers.

2 Research interviews are supposed to be a more conventional method of data
collection, by means of which the researcher initiates talk ‘about’ something and
conducts an interview for the specific purpose of the research. The interviewer
usually works with a prepared questionnaire or list of topics.

[Figure: two speakers, each labelled with social class, sex, occupation and age (labels shown: Upper Middle Class, Upper Class, Female, University Professor, Artist, 55, 42).]
Some important information about the speakers, necessary for analyzing their
discourse.

There is no ideal transcription system which suits all purposes: highly
detailed transcripts include more information but are often hard to read,
whereas transcripts with less detail are easier to read but contain less
information about the whole discourse situation. Transcripts including too
many details may provide more information than people are able to
process, and thus it is advisable not to include too much distracting
extraneous detail in order to be able to concentrate only on the crucial
elements or features of the speech event that will help us answer the
research question.
According to Edwards (1993) three main principles should be observed
when designing a transcription system, taking into account its readability
for a human researcher or a computer:
1. Categories should be:

• systematically discriminable
• exhaustive
• systematically contrastive

2. Transcripts should be readable (to the researcher)

3. For computational tractability, mark-up should be:

• systematic
• predictable.

In addition, the designer should make a decision as to whether the
transcription is to be orthographic, prosodic, or phonetic, or more than
one of these. Also, s/he will have to decide on how to represent non-verbal
data, such as contextual information, paralinguistic features, pauses, and
overlaps. This means that more than one level of transcription must be
aligned, which will have obvious implications for mark-up of the data. As
explained above, it is always useful to examine previous systems of
annotation created by other researchers, and this is what we shall do in the
next section of this chapter.
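
As a rough illustration of what systematic, predictable mark-up with more than one aligned level might look like in machine-readable form, the following Python sketch stores an orthographic and a prosodic tier for each utterance, together with non-verbal events drawn from a closed set of categories. All names (`Utterance`, `NONVERBAL_CATEGORIES`, `validate`) and the category set itself are assumptions made for this illustration; they do not reproduce Edwards' proposal or any existing tool.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Closed set of non-verbal categories (an assumption made for this sketch).
NONVERBAL_CATEGORIES = {"pause", "overlap", "laughter", "cough", "gaze"}


@dataclass
class Utterance:
    speaker: str                      # speaker label, e.g. "b"
    orthographic: str                 # plain orthographic tier
    prosodic: Optional[str] = None    # optional prosodic tier for the same utterance
    events: List[str] = field(default_factory=list)  # non-verbal events


def validate(utterance: Utterance) -> List[str]:
    """Return the event labels that are not in the closed category set."""
    return [e for e in utterance.events if e not in NONVERBAL_CATEGORIES]


u = Utterance(
    speaker="b",
    orthographic="she was having her lunch",
    prosodic="she was ^having her l\\unch#",
    events=["pause", "sigh"],
)
print(validate(u))  # labels that fall outside the agreed set
```

Keeping the categories in one closed set is one simple way of making the mark-up predictable: any label outside the set is reported instead of being silently accepted.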

2.2.1.1. Notation used in the London Lund Corpus (Svartvik & Quirk,
1980)

The London-Lund Corpus is a computerized corpus of spoken English
which has been widely used by linguists and discourse analysts in different
studies. It consists of 87 texts which are arranged in text groups
(face-to-face conversation, telephone conversation, etc.) and, apart from the
symbols used in their annotation, in some of the texts the authors provide
the possible users of the corpus with some extra information about the
speakers, concerning their age and occupation.
Let us now examine a fragment of one of the texts in this corpus with
respect to its notation conventions:
Text S.11.1 Public, unprepared commentary, demonstration, oration. A trial (legal
discourse)
11 1 1 10 1 1 a 11 ^Mr P=otter# /
11 1 1 20 1 1 a 11 ^did y/ou# - - /
11 1 1 30 1 1 a 11 ar^r\/ive# /
11 1 1 40 1 1 a 11 a^bout !two o`cl\ock# /
11 1 1 50 1 1 a 11 ^on [dhi] . !S\unday# . /
11 1 1 60 1 1 a 11 the ^date the 'will was . s\igned# . /
11 1 1 70 1 1 b 11 ^y/es# - - /
11 1 1 80 1 1 a 11 and . did ^you . g\/o# /
11 1 1 90 1 1 a 11 and ^see your 'mother :straight aw/ay# /
11 1 1 100 1 1 b 11 ^y\es I _did# /
11 1 1 110 1 1 a 11 ^what was she 'then d\oing# . /
11 1 1 120 1 1 b 11 she was ^having her l\unch# - - - /
11 1 1 130 1 1 a 11 ^what a'bout the :br\andy 'bottle# /
11 1 1 140 1 1 a 11 ^where was th\at# - - /
11 1 1 150 1 1 b 11 ^I 'don`t kn\ow# /
11 1 1 160 1 1 b 12 ^I 'didn`t [s] . ![lu?] ^I 'didn`t !s\ee# /
11 1 1 170 1 1 a 11 you ^didn`t s\/ee _it# /
11 1 2 180 1 1 b 11 ^w\ell# . /
11 1 2 190 1 1 b 11 ^n\o {I ^d\idn`t#}# /
11 1 2 200 1 1 b 14 ^I ^I ^I ^all I kn/ow# /
11 1 2 210 1 1 b 11 was ^my !mother was :having her !l\unch# . /
11 1 2 220 2 1 b 21 when *I* /
11 1 2 230 1 1 a 20 *((and))* /
11 1 2 220 1 1(b 11 ar^r\ived# - /
11 1 2 240 1 1 a 11 ^how did she !s\eem {^th\en#}# - /
11 1 2 250 1 1 a 11 ((at)) ^two o`cl\ock# - /
11 1 2 260 1 1 b 11 ^w/ell# . /
11 1 2 270 1 1 b 11 she ^seemed 'all r/ight# /
11 1 2 280 1 1 b 11 I ^think she was a :little t\/ired# - - - /
11 1 2 290 1 1 a 11 and ^how 'long did it :t\ake# /
11 1 2 300 1 1 a 11 ^for her to com'plete her l\unch# - - - /
11 1 2 310 1 1 b 11 oh ^I would !th=ink# - - - /
11 1 2 320 1 1 b 11 ^pr=obably# . /
11 1 2 330 1 1 b 11 ^f/ifteen 'minutes# - /
11 1 3 340 1 1 a 12 ^was it /any a ^meal of any s/ubstance# . /
11 1 3 350 1 1 b 11 she ^had [@:m] . :ch\icken# /
11 1 3 360 1 1 b 11 she ^didn`t 'eat very :m\uch of it# - - - /
11 1 3 370 1 1 a 11 ^did !you s\/it with 'her# /
11 1 3 380 1 1 a 11 ^wh=ilst# . /
11 1 3 390 1 1 a 11 she com^pleted the m\/eal# - /
11 1 3 400 1 1 b 11 I was ^in the r=oom# /
11 1 3 410 1 1 b 11 ^while she was :h\/aving _it# /
11 1 3 420 1 1 b 11 ^y\es# . /
11 1 3 430 1 2 a 12 and ^then [@] ( . coughs) - did she ^have it on a /
11 1 3 430 1 1 a 12 tr/ay# - . /
11 1 3 440 1 1 b 11 ^y/es# /
11 1 3 450 1 1 a 11 ^somebody took the !tr\ay out . {pre^s\umably#}# . /
11 1 3 460 1 1 b 11 [@:] ^my !w\ife 'took it 'out# - /
11 1 4 470 1 1 a 11 and [?] . ^that`s . 'then a'bout 'two fift\een# - -/
11 1 4 480 1 1 b 11 [@] ^y/es# /
11 1 4 490 1 1 b 11 [i?] ^y/es# /
11 1 4 510 1 1 a 11 were ^y/ou 'then# . /
11 1 4 520 1 1 a 11 a^lone w/ith 'her# - - /
11 1 4 500 1 1 b 11 it ^w\ould be# . /
11 1 4 530 1 1 b 11 [@m] I was a^lone with m/other# /
11 1 4 540 1 1 b 11 ^y\es# /
11 1 4 550 1 1 b 21 ^after . my :wife left - - *[@m]* /
11 1 4 560 1 1 a 11 ^*what* !took pl\ace# /
11 1 4 570 1 2 a 11 ^after your :wife 'left with the tr\ay . {be^tween /
11 1 4 570 1 1 a 11 'you and your m\other#}# . /
11 1 4 580 1 1 b 11 well my ^mother \asked 'me# /
11 1 4 590 1 1 b 11 ^when I "!g\ot th/ere# . /
11 1 4 600 1 1 b 11 ^if 'I had br\ought# - /
11 1 4 610 1 1 b 11 [@] this ^draft of her ":w\ill# . /
11 1 4 620 1 1 b 11 and I ^said I h\ad# - - /

Transcription conventions:
A) PROSODY

#         End of tone group
^Yes      Beginning of tone group

Tones
Y\es      FALL
Y/es      RISE
Y\/es     FALL-RISE
Y/\es     RISE-FALL
Y=es      LEVEL

Pitch
:Yes      Higher than the previous syllable
!Yes      High
!!Yes     Very high

Stress
'Yes      Normal
"Yes      Strong

Pauses
Yes - -   Each dash is a unit pause of one stress unit or “foot”
Yes +     Brief pause

B) SPEAKERS

A Speaker identity
(A) Speaker continues where s/he left off
A, B A and B
VAR Various speakers
? Speaker identity unknown
a (lower-case letter) Non-surreptitious speaker

As can be seen, the specification of the notation used helps us learn
many details, not only about the text, but also regarding the context of this
fragment of legal discourse. There is both prosodic and pragmatic
information, which allows us, for instance, to learn that both a and b are
non-surreptitious speakers, which tells us that they knew they were being
recorded. Also, by looking at the tones, pitch, stress, and other prosodic
features used by the speakers, we may, among other things, infer
information having to do with their attitude (e.g. if they are upset, or trying
to be ironic, etc.).
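
Because the notation is systematic, part of this prosodic information can also be read off automatically. The Python sketch below lists the tones marked in the text field of a corpus line and counts its tone groups. It relies only on the symbols given in the conventions above; the function names are hypothetical, and the assumption that a line-final slash merely closes the line (and is not a tone mark) is ours, based on the fragment quoted.

```python
import re

# Tone symbols as listed in the conventions above.
# Two-character symbols come first so that "\/" is not read as "\" plus "/".
TONE_PATTERN = re.compile(r"\\/|/\\|\\|/|=")
TONE_NAMES = {
    "\\/": "FALL-RISE",
    "/\\": "RISE-FALL",
    "\\": "FALL",
    "/": "RISE",
    "=": "LEVEL",
}


def nuclear_tones(text: str) -> list:
    """List the tones marked in the text field of one corpus line."""
    text = text.rstrip()
    if text.endswith("/"):        # assumed: the final slash only closes the line
        text = text[:-1]
    return [TONE_NAMES[m] for m in TONE_PATTERN.findall(text)]


def tone_groups(text: str) -> int:
    """Count tone groups by counting their end marker '#'."""
    return text.count("#")


print(nuclear_tones(r"ar^r\/ive# /"))
print(nuclear_tones(r"^Mr P=otter# /"))
print(tone_groups(r"^n\o {I ^d\idn`t#}# /"))
```

A count of this kind says nothing about attitude by itself; it only gives the analyst a quick overview of which tones each speaker uses, which must then be interpreted in context.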

2.2.1.2. Notation used by Deborah Schiffrin

The following data have been taken from D. Schiffrin and R. Lakoff’s
Data Packet for their “Discourse” class at Berkeley and Georgetown
Universities (Spring 1998). As will be noticed, this notation has its
peculiarities and is different from that used in the London Lund Corpus
above. For example, this author uses square brackets ([]) to signal speech
overlap, and a dot (.) to represent a falling intonation followed by a pause.

Debby: D Zelda: Z

D: (1) What does your uh daughter in law call you?

Z: (2) Well, that’s a sore spot.
D: hhhh
Z: (3) My older daughter in law does call me Mom.=
D: Uh huh.
Z: (4) My younger daughter in law right now is up to nothing.
(5) She [had said-
D: [Oh
Z: (6) We had quite a discussion about it.
(7) We did bring it out in the open.
(8) She said that um…that she- just- right now, she’s:- it’ll take her time.
(9) Now they’re marrie:d, it’s gonna be uh… I think eh…five years,=
D: Um hmm.
Z: (10) that they’ll be married.
(11) and she said that eh it was very hard t’s:-call someone else Mom beside her mother.
(12) so I had said to her, “That’s Okay!”
(13) I said, “If you- if you can’t say Mom, just call me by my [first name!
D: [Umhm
Z: (14) So, we had quite a discussion about it.
(15) It was a little heated, at one time.=
D: Yeh
Z: (16) She said, “All right”, she’ll call me Zelda.
(17) But she still can’t bring herself to say Zelda.
(18) so she calls me nothing!
(19) She do- but we’re very cl- we’re on very good terms,=
Z
D: Yeh.
Transcription conventions

. Falling intonation followed by noticeable pause (as at end of declarative sentence)
… Noticeable pause or break in rhythm without falling intonation.
bold Self interruption with glottal stop
CAPS Very emphatic stress
[ ] Speech overlap
= Continuity of previous line of text (when lack of space prevents continuous speech from being presented on a single line of text)
Z When speech from B follows speech from A without perceptible pause, then Z links the end of A with the beginning of B

When speech from B occurs during what can be heard as a brief silence
from A, then B’s speech is under A’s silence:

A: I can’t wait to go to the party! It’ll be fun.

B: Oh yeh!
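
For readers who keep their transcripts in plain text files, the following Python sketch shows one possible way of turning lines written in this notation into simple records. It only recognizes the speaker label, the line number in parentheses, the overlap bracket and the continuity sign described above; the function name `parse` and the record fields are hypothetical, and the sample lines are copied from the fragment quoted earlier in this section.

```python
import re

# Optional "Z:" speaker label, optional "(4)" line number, then the text itself.
LINE = re.compile(r"^(?:(?P<speaker>[A-Z]):)?\s*(?:\((?P<number>\d+)\))?\s*(?P<text>.*)$")


def parse(lines):
    """Turn transcript lines into dicts; unlabelled lines keep the last speaker."""
    records, current = [], None
    for raw in lines:
        m = LINE.match(raw.strip())
        current = m.group("speaker") or current
        text = m.group("text")
        records.append({
            "speaker": current,
            "number": int(m.group("number")) if m.group("number") else None,
            "text": text,
            "overlap": "[" in text,           # speech overlap marker
            "continues": text.endswith("="),  # continuity marker
        })
    return records


sample = [
    "Z: (4) My younger daughter in law right now is up to nothing.",
    "(5) She [had said-",
    "D: [Oh",
    "Z: (9) Now they’re marrie:d, it’s gonna be uh… I think eh…five years,=",
]
for r in parse(sample):
    print(r["speaker"], r["number"], r["overlap"], r["continues"])
```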
2.2.1.3. Other annotation practices

The examples in 2.2.1.1. and 2.2.1.2. show only two possible ways of
annotating corpora. Other authors have chosen different symbols or have
taken into account some other, additional, variables. For instance,
Jefferson (1979) marks the gaze of the speaker with a line above the
utterance and the gaze of the addressee with a line below it. The line
indicates that the interlocutor marked is gazing toward the other, while the
lack of a line indicates the absence of gaze. Commas are used to indicate
the dropping of gaze. Besides, some movements like head nodding are
marked when they occur:

Ann: ____________________________________
Karen has this new hou:se. en it’s got all this

Beth: ______________________ ,,, ((Nod))

Jefferson also marks applause by using strings of X’s with lower- (for
quiet applause) and uppercase (for loud applause) letters. In the following
example, the amplitude of the applause increases at the end:

Audience: xxxxxxxxxXXXXXXXXXXXXXXXXX
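
A notation of this kind is easy to summarize automatically. The short Python sketch below splits an applause string into runs of quiet and loud applause and reports the length of each run; it assumes that the string contains only the letters x and X, and the function name `applause_runs` is hypothetical.

```python
from itertools import groupby


def applause_runs(marks: str):
    """Split an applause string into (loudness, length) runs."""
    runs = []
    for is_upper, group in groupby(marks, key=str.isupper):
        runs.append(("loud" if is_upper else "quiet", len(list(group))))
    return runs


print(applause_runs("xxxXXXX"))  # [('quiet', 3), ('loud', 4)]
```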

Deborah Tannen uses left arrows to highlight key lines, as in the
following fragment taken and adapted from her well-known book You Just
Don’t Understand. Women and Men in Conversation (1990: 197):

STEVE: I think it’s basically done damage to children. That what good
it’s done is outweighed by the damage

DEBORAH: Did you two grow up with television?

The examples of notation conventions presented in this section display
only a few of the innumerable possibilities. As noted above, each
researcher may choose his/her own conventions (which normally depend
upon the needs and objectives of the analysis), provided they are explained
and made clear to the reader.
2.3. Ethics of data collection
Another important aspect to be considered when doing discourse analysis
is the ethics of the research, which necessarily affect the process of data
collection.
Even though in an ideal world all participants in a project should have
equal rights and power, it is a fact that, in general, the researcher has more
power than the other participants in an experiment. This power may come,
as Taylor notes, “from holding the status associated with being an
academic and, supposedly, an expert” (2001: 20). In addition, the
researcher has more information about the experiment than the subjects, a
fact which also contributes to his/her power. Thus, for example, the discourse
analyst will know that the publication of a given conversation might bring
some negative consequences to the participants of the conversation if their
identities are revealed, and therefore s/he should never publish the real
names of the participants without their consent 3. Researchers ought not to
abuse their power. It is an ethical requirement that the researcher obtain
the consent of the participants, not only to be involved in the study but
also to use the data they provide. As a general rule, researchers have the
obligation to a) protect all participants, b) not harm them in any way, and
c) always observe their legal rights.
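
When participants have not consented to the publication of their names, one practical step is to replace the names with neutral labels before a transcript is circulated. The Python sketch below is a minimal illustration of that step only: the function name `pseudonymize` is hypothetical, the sample lines are invented, and replacing names does not by itself guarantee anonymity or replace the participants' consent.

```python
import re


def pseudonymize(transcript: str, pseudonyms: dict) -> str:
    """Replace each real name with its agreed label (whole words only)."""
    for real, alias in pseudonyms.items():
        transcript = re.sub(rf"\b{re.escape(real)}\b", alias, transcript)
    return transcript


# Invented example data: neither the names nor the lines come from a real study.
text = "Maria: I told Tom about it.\nTom: Maria never told me."
print(pseudonymize(text, {"Maria": "Speaker A", "Tom": "Speaker B"}))
```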

2.4. Corpus Linguistics: The use of corpora for DA

Corpus is defined by Crystal as “a collection of LINGUISTIC DATA,
either written texts or a TRANSCRIPTION of recorded speech, which can
be used as a starting-point of linguistic description or as a means of
verifying hypotheses about a LANGUAGE (corpus linguistics)” (1997:
95).
Corpus linguistics, thus, has to do with the practice and the principles
of using corpora in language study. Biber et al. note that the essential
characteristics of corpus-based analysis are:
• Empiricism (it analyzes the actual patterns of use in
natural texts);
• Utilization of a large and principled collection of natural
texts, known as a “corpus”, as the basis for analysis;
• Extensive use of computers for analysis, using both
automatic and interactive techniques;
• Use of both quantitative and qualitative analytical
techniques. (1998: 4)

3 However, the principle of anonymity is not observed in DA when the intention of
the analyst is to denounce and/or condemn the speech or writing of, for example, a
given politician or institution. We find numerous instances of this type of analysis
within the approach called Critical Discourse Analysis (Chapter 10).

Corpora are excellent tools for discourse analysts, for they facilitate the
investigation of language in use. Studies of language use require empirical
analyses of large databases of authentic texts, a requirement that corpus
linguistics has made it possible to meet. Using
corpora allows researchers to analyze patterns of use, i.e. how some
linguistic features are used in association with other linguistic and non-
linguistic features. Linguistic and non-linguistic association patterns
interact; they are not independent (Biber et al., 1998). For instance, if we
consider the lexical associations for thin, skinny and slim, we can also
consider their distribution across different registers. Thus, corpus-based
studies aim at characterizing registers, dialects, etc. in terms of their
linguistic association patterns.
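
The idea of looking at the distribution of near-synonyms across registers can be sketched in a few lines of Python. The example below counts thin, skinny and slim in a handful of sentences, each tagged with a register, and reports a rate per 1,000 tokens so that registers of different sizes can be compared. The sentences and register labels are invented toy data, not corpus findings, and the variable names are chosen only for illustration.

```python
from collections import Counter

# Invented toy data: (register, text) pairs standing in for a real corpus.
samples = [
    ("conversation", "she looks so skinny in that photo"),
    ("conversation", "he is thin and tall"),
    ("fiction", "a thin line of smoke rose over the slim tower"),
    ("news", "the party won by a slim margin"),
]
targets = {"thin", "skinny", "slim"}

counts = {}          # register -> Counter of target words
totals = Counter()   # register -> number of tokens
for register, text in samples:
    tokens = text.lower().split()
    totals[register] += len(tokens)
    counts.setdefault(register, Counter()).update(t for t in tokens if t in targets)

for register in counts:
    for word, n in sorted(counts[register].items()):
        rate = n / totals[register] * 1000   # occurrences per 1,000 tokens
        print(f"{register:12} {word:7} {n} ({rate:.1f} per 1,000 tokens)")
```

With a real corpus the same counts would be made over thousands of texts, and only then could any conclusion about register preferences be drawn.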
Although some scholars (especially generative grammarians) have
pointed to the limitations of corpus-based analysis (e.g. that it is limited to
samples of performance only, or that no corpus can contain information
about all areas of language), it cannot be denied that the use of corpora has
proved to present considerable advantages when analyzing discourse: it
has allowed researchers to deal with larger and more varied texts, bringing
about a reliability of analysis never reached before; it has enabled them to
make more objective and accurate descriptions of usage than would be
possible through mere introspection. It also allows them, for instance, to
come to reliable conclusions based on frequency of use of a given
linguistic feature or pattern, to make comparative analyses about usage in
different varieties, or to arrive at a total account of the linguistic features in
any of the texts contained in the corpus. And, most important of all, a
well-constructed general corpus can be an inexhaustible source of
hypotheses about the way language works.
All the above advantages have been mainly made feasible thanks to the
construction, in modern times, of computerized corpora, which permit the
storage and analysis of a much greater number of natural language texts
than would be possible if we had to store and analyze them by hand.
However, the first large corpus of English-language data was entirely
transcribed by hand and stored on index cards which were processed
manually. This corpus was originally known as the Survey of English
Usage, a project which started in the 1960s and which consisted of a
million words comprising 200 texts of spoken and written material of
5,000 words each. The whole survey has now been computerized, and is
currently known as the London-Lund Corpus 4.

2.4.1. Computer corpora and concordance programs

The first computerized corpus in the history of linguistics was the Brown
University Corpus of American English. It was created in the 1960s by
Henry Kučera and W. Nelson Francis, and it aimed to represent a wide
range of genres of published written text in American English produced
during a single year. The Lancaster-Oslo/Bergen (LOB) Corpus of British
English was compiled in the 1970s to match the Brown corpus using
British English texts.
Ever since the 1980s, increasingly large corpora have been compiled
(especially of English) and are used in different fields, such as in the
development of natural language processing software and in applications,
including lexicography, machine translation, speech recognition, etc.
Three examples of modern corpora are The British National Corpus
(BNC), The International Corpus of English (ICE) and The Bank of
English. Some online corpora can be found, such as the Experimental BNC
Website (which offers a BNC online service allowing everyone with
access to the internet to register for an account on the BNC server) or the
Shakespeare Online Corpus. In addition, researchers can now benefit from
concordance programs, i.e. programs which turn the electronic texts into
databases which can be searched. Some examples of these programs are
the Word Cruncher (which you get, for example, when you buy the
ICAME corpora of modern and medieval English), TACT (a well-known
freeware program), SARA (specifically made for searches of the BNC) and
WordSmith Tools (a program widely used by linguists, lexicographers and
discourse analysts nowadays, which offers several possibilities, such as
querying, searching for word combinations within a specified range of
words, looking up substrings or parts of words, or accessing collocates and
frequency lists).
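
To give an idea of what a concordance program does, the following Python sketch produces a simple keyword-in-context (KWIC) listing: every occurrence of a search word is shown with a few words of context on either side. It is a minimal illustration under our own assumptions (a crude tokenizer, a hypothetical function named `kwic`) and does not reproduce the behavior of any of the programs mentioned above. The sample text is adapted from the words of the courtroom fragment in 2.2.1.1, with the prosodic marks removed.

```python
import re


def kwic(text: str, keyword: str, width: int = 3):
    """Return (left context, keyword, right context) for every hit."""
    tokens = re.findall(r"\w+(?:['’`]\w+)?", text.lower())
    hits = []
    for i, token in enumerate(tokens):
        if token == keyword.lower():
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, token, right))
    return hits


text = "She was having her lunch. All I know was my mother was having her lunch when I arrived."
for left, key, right in kwic(text, "lunch"):
    print(f"{left:>25} | {key} | {right}")
```

Aligning the search word in a central column, as the print statement does, is what allows the analyst to scan the contexts quickly for recurrent patterns.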

4 See 2.2.1.1.

2.4.2. A possible classification of corpora

Once we have decided to use a computer corpus, we should decide what
type of corpus will best suit our aims. We will not use the same corpus if
we want, for example, to analyze spoken language as if we want to
analyze written language (unless the corpus we choose is mixed and has
samples of both). Reich (1998) offers the following taxonomy, which
classifies corpora according to medium, national varieties, historical
variation, geographical/dialectal variation, age, genre, open-endedness
and availability:

• Medium: spoken corpora (e.g. London-Lund corpus) vs. written
corpora (e.g. Lancaster Oslo/Bergen corpus (LOB)) vs. mixed
corpora (British National Corpus (BNC) or Bank of English)
• National varieties: British corpora (e.g. Lancaster Oslo/Bergen
corpus) vs. American corpora (e.g. Brown corpus) vs. an
international corpus of English.
• Historical variation: diachronic corpora (Helsinki corpus, cf.
the ICAME home page) vs. synchronic corpora (Brown, LOB,
BNC) vs. corpora which cover only one stage of language
history (corpus of Old or Middle English, Shakespeare corpora)
• Geographical variation/dialectal variation: corpus of dialect
samples (e.g. Scots) vs. mixed corpora (The BNC spoken
component includes samples of speakers from all over Britain)
• Age: corpora of adult English vs. corpora of child English
(English components of CHILDES)
• Genre: corpora of literary texts vs. corpora of technical English
vs. corpora of non-fiction (e.g. news texts) vs. mixed corpora
covering all genres
• Open-endedness: closed, unalterable corpora (e.g. LOB,
Brown) vs. monitor corpora (Bank of English)
• Availability: commercial vs. non-commercial research corpora,
online corpora vs. corpora on ftp servers vs. corpora available
on floppy disks or CD-ROMs

This taxonomy takes into account most of the types of corpora which are
currently available, but it is not entirely comprehensive. Other variables
might be considered depending on the research aims, which might bring
about new types.
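
A taxonomy of this kind can be used as a checklist when choosing a corpus. The Python sketch below records three of the dimensions (medium, national variety and open-endedness) for some of the corpora named in the list and selects those that match a given requirement. The class and function names are hypothetical, and only values stated in the taxonomy above are filled in; `None` means that the list does not state a value for that corpus.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Corpus:
    name: str
    medium: Optional[str] = None        # "spoken", "written" or "mixed"
    variety: Optional[str] = None       # national variety
    open_ended: Optional[bool] = None   # True for monitor corpora


# Only values stated in the taxonomy above are filled in.
CORPORA = [
    Corpus("London-Lund", medium="spoken"),
    Corpus("LOB", medium="written", variety="British", open_ended=False),
    Corpus("Brown", variety="American", open_ended=False),
    Corpus("BNC", medium="mixed"),
    Corpus("Bank of English", medium="mixed", open_ended=True),
]


def select(corpora, **criteria):
    """Keep the names of the corpora that match every given criterion."""
    return [c.name for c in corpora
            if all(getattr(c, key) == value for key, value in criteria.items())]


print(select(CORPORA, medium="mixed"))
print(select(CORPORA, open_ended=False))
```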

Choose the answer that best suits the information given in Chapter 2.

1. The type of discourse the analyst is going to study depends
mainly on…
a) the research question.
b) what the researcher likes to do.
c) how the data are collected.

2. The different approaches to Discourse Analysis…

a) all share the same methods of collecting data.
b) take different views about collecting and analyzing data.
c) all work with naturally-occurring language.

3. Downloading material from the internet…
a) may be a procedure for data collection.
b) is a method of Discourse Analysis.
c) is the best method for data collection.

4. A transcript is…
a) a process of data collection.
b) a document that reflects the spoken discourse to be analyzed.
c) a complex type of discourse research.

5. Transcriptions …
a) are always completely neutral and objective.
b) try to show the different variables that intervene in the discourse
studied.
c) always include contextual factors.

6. Each analyst…
a) uses various notation systems.
b) uses the notation system that best suits his/her objectives.
c) includes tone groups in the notation used.
7. Transcription conventions…
a) should always be explained and made clear to the reader.
b) should always be used in the same way by all researchers.
c) should be different for each study.

8. It is a well-known fact that, in an experiment…

a) the participants and the researcher have equal power.
b) there are no power issues.
c) the researcher has more power than the other participants.

9. It is an ethical requirement in discourse analysis that…

a) the researchers protect all the participants.
b) the researchers pay for the services of the participants.
c) the participants hide their names.

10. Corpus-based analysis …
a) normally uses both quantitative and qualitative techniques of
analysis.
b) does not normally have an empirical nature.
c) is always theoretical in nature.

11. The use of corpora …

a) allows us to work with the spoken language only.
b) does not favor the analysis of small fragments of discourse.
c) allows the researcher to analyze patterns of use in language.

12. Corpus-based analysis…

a) has brought a reliability of analysis never reached before.
b) has many disadvantages.
c) can only be carried out by means of computers.

13. Concordance computer programs…
a) turn electronic texts into talk.
b) transform the texts into databases that can be searched.
c) are not used much by linguists nowadays.

14. The BNC is a corpus…
a) that has been classified in terms of a national variety.
b) of spoken British English.
c) showing mainly historical variation.

15. The aims of research…

a) have nothing to do with the corpus chosen for analysis.
b) will vary if we use different corpora in the analysis.
c) will determine the type of corpus used in the analysis.

A) COLLECTING SAMPLES OF SPOKEN DISCOURSE:

a) SEARCHING THE WWW: THINK of a type of spoken discourse

you would like to analyze in English (e.g. interviews,
conversations, discussion panels, etc.) and SEARCH the World
Wide Web to get samples of such a type. You can find them, for
instance, on radio or TV programs, or on the YouTube website.

b) ANNOTATING THE DATA: USE conventions for annotating

the recorded data with the features you consider important for
your future research, e.g. information about the text (genre, date
of publication, place of publication); information about intonation
patterns or pronunciation; information about the speakers (sex,
age, occupation, social and geographical origin); parsing (i.e.
assigning labelled brackets to each constituent of a sentence);
discourse information of spoken material (laughing, interruptions,
hesitations, etc.). Reading “Appendix 2: Transcription
Conventions” of D. Schiffrin’s Approaches to Discourse 5 may
help you in this enterprise.

c) DISCUSSION: SEND or HAND IN the sample data to your tutor

(in case you are following a course) and DISCUSS the
procedures and annotation used. Justify your decisions.

5 See References at the back of this book.

