Download pdf - T.C. SÜLEYMAN DEMİREL ÜNİVERSİTESİ BATI DİLLERİ VE ...tez.sdu.edu.tr/Tezler/TS02710.pdf · Abdul Jabbar Fadh ıl Jameel AL-JAMEEL ... anlamı, anlamın ifrelenmi ve ifrelenmemi

T.C.

SÜLEYMAN DEMİREL ÜNİVERSİTESİ

SOSYAL BİLİMLER ENSTİTÜSÜ

BATI DİLLERİ VE EDEBİYATI ANABİLİM DALI

METİN TUTARLILIĞININ ÇIKARIMSAL BİR MODELİ

Abdul Jabbar Fadhıl Jameel AL-JAMEEL

1630224013

YÜKSEK LİSANS TEZİ

DANIŞMAN

Prof. Dr. Hüseynağa RZAYEV

ISPARTA-2018

T.R.

SULEYMAN DEMIREL UNIVERSITY

INSTITUTE OF SOCIAL SCIENCES

DEPARTMENT OF WESTERN LANGUAGE AND LITERATURE

AN INFERENTIAL MODEL OF TEXT COHERENCE

Abdul Jabbar Fadhıl Jameel AL-JAMEEL

1630224013

MASTER’S THESIS

ADVISOR

Prof. Dr. Hüseynağa RZAYEV

ISPARTA 2018

iii

ACKNOWLEDGEMENT

I would like to express my gratitude to my advisor Prof. Dr. Hüseynağa

RZAYEV for his invaluable guidance and useful comments throughout this study.

I would also like to thank all my graduate course instructor; Prof. Dr. Ömer

ŞEKERCİ, Ph. D. Carrier Önder ÇAKIRTAŞ, also amember of the advisory committe.

My thanks and appreciation İlker ÖZÇELİK.

Abdul Jabbar Fadhil Jameel AL-JAMEEL

2018

iv

(AL-JAMEEL, Abdul Jabbar Fadhıl Jameel, Metin Tutarlılığının Çıkarımsal Bir

Modeli, Yüksek Lisans Tezi, Isparta, 2018)

ÖZET

Metin düzeninde, tutarlılığın rolüne geleneksel bakışı sorgulayarak bu araştırma,

tutarlılığın çıkarımsal modeli vasıtasıyla gerçekleştirilen “araştırma-sonra-anlam-uyum”

yöntemine bilişsel bir yaklaşım önermektedir. Önerilen yaklaşım varsaymaktadır.

Tutarlılık sadece dilde (statik ürün olarak) var olan bir şey değildir; aynı zamanda dili

kullanıcılarının zihninde var olan bir şeydir ki bunun mantığı (i) metindeki altta yatan

(şifrelenmemiş) içerik ile dilsel olarak sunulan bilgiyi ayırmayı, (ii) bütünselliği,

sayesinde bir cümlenin bütün çıkarımsal ilişkilerinin anlamı oluşturmak için birleştiği

bir kavram olarak düşünmeyi, makul kılar. Bu bağlamda, herhangi bir dil öğesinin

anlamı, anlamın şifrelenmiş ve şifrelenmemiş yönlerinin karşılıklı ilişkisinden oluşur ki

bunun yorumlaması, sırasıyla, içerik açısından ve kasıtlı olarak uygun çıkarımlar

oluşturmaya bağlıdır. Kabul edilen yaklaşımın gelişmesi için, çıkarımın farklı türlerinin,

farklı iletişim türlerinde nasıl belirlendiğini ve nasıl “davrandığını” açığa kavuşturmak

gerekmektedir. Çıkarım-temel-araştırma-sonra-anlam-uyumun dil kullanımının farklı

düzeylerinde ne derece ilintili olduğu da bu araştırmanın temel ilgi odaklarındandır.

Key Words: Metin Düzeni, Tutarlılık, Cümle içi-/Cümleler arası çıkarım temelli

bütünlük, evrensel tutarlılık, çıkarım türleri, araştırma-sonra-anlam-birlik.

v

(AL-JAMEEL, Abdul Jabbar Fadhil Jameel, An Inferential Model of Text

Coherence, Master’s Thesis, Isparta, 2018)

ABSTRACT

Challenging the traditional view of the role of coherence in text organization, the

research proposes a cognitive approach to “search- after- meaning- unity” procedure

realized by means of an inferential model of coherence. The suggested approach

assumes that coherence is not only something that exists in the language (as a static

product) , but also something that exists in the brain of the language user, the logic of

which makes it reasonable to : (i) distinguish between the underlying ( non - encoded)

content and the linguistically represented information in the text, and (ii) consider

meaning holism a concept which is the relevant unity of the non-encoded inferential

relations and the physically expressed meaning of a sentence or an intersentential level.

In this sense, a meaning of any language unit is composed of the interrelationship of the

encoded and non - encoded aspects of meaning, the interpretation of which, in turn,

depends on generating contextually and intentionally appropriate inferences, What the

accepted approach necessitates in furtherance is to clarify how the different types of

inference get determined in different genres of communication. To what extent the

“inference-based-search-after-meaning- unity” is relevant at different levels of language

use is also among the major concerns of the research.

Key Words: text organization, coherence, intrasentential, intersentential, /

inference - based coherence, global coherence, types of inferences, search - after –

meaning – unity.

vi

TABLE OF CONTENTS

TEZ SAVUNMA SINAV TUTANAĞI .......................................................................... i

YEMİN METNİ .............................................................................................................. ii

ACKNOWLEDGEMENT ............................................................................................. iii

ÖZET ............................................................................................................................... iv

ABSTRACT ..................................................................................................................... v

TABLE OF CONTENTS ............................................................................................... vi

LIST OF FIGURES ..................................................................................................... viii

LIST OF TABLES ......................................................................................................... ix

LIST OF ABBREVIATIONS ........................................................................................ x

INTRODUCTION ........................................................................................................... 1

CHAPTER ONE

1. COHERENCE AND TEXT ORGANIZATION

1.1. FROM SENTENCE TO TEXT .............................................................................. 6

1.2. COHERENCE AND COHESION ....................................................................... 11

1.3. CRITICISM OF THE COHESION VIEW OF COHERENCE ........................... 13

1.4. PRAGMATIC INFERENCE AND COHERENCE IN THE RELEVANCE

THEORY ............................................................................................................. 18

1.5. INFERENCE RESEARCH METHODS .............................................................. 24

CHAPTER TWO

2. THE TAXONOMY OF INFERENCES

2.1. GENERAL PRINCIPLES: INFERENCE NECESSITATING FACTORS ......... 28

2.2. THE TAXONOMY OF INFERENCE ................................................................. 33

2.2.1. Logical- Semantic- Pragmatic Inferences ...................................................... 43

2.2.2. Inference Favoring / Influencing Conditions ................................................. 61

CHAPTER THREE

3. INFERENCING IN DIFFERENT GENRES

3.1. BACKGROUND .................................................................................................. 75

3.2. INFERENCE - BASED COHERENCE SCIENCE ............................................. 76

vii

3.3. INFERENCE –BASED COHERENCE IN A POLITICAL TEXT ..................... 83

CONCLUSION .............................................................................................................. 92

POSSIBLE PEDAGOGICAL IMPLICATIONS OF THE RESULTS AND

FINDINGS OF THE RESEARCH .............................................................................. 94

PERSPECTIVES .......................................................................................................... 95

REFERENCES .............................................................................................................. 96

ÖZGEÇMİŞ ................................................................................................................. 104

CURRICULUM VITAE (CV) ................................................................................... 105

viii

LIST OF FIGURES

Figure 1. Propositions ..................................................................................................... 36

Figure 2. “What Time is it? Simple Inferential Structure of Eleven Propositions .......... 37

Figure 3. Adapted from Menmara and Magliano 2009................................................... 48

Figure 4. Types of Inferences (Adapted from Rzayev: Unpublished Paper 3) ............... 48

ix

LIST OF TABLES

Table 1. Types of Inferences ........................................................................................... 54

Table 2. Graesser et al.: Production Rules for Inferences (1994:381) ............................ 58

Table 3. Distinction between Different Types of Inferences( Kispal 2008: 10) ............. 59

Table 4. Inferences (Kispal 2008; 11) ............................................................................. 60

Table 5. “Holland is flat” (Taken from Zvolensky 2015: 167). ...................................... 63

Table 6. Types of Inference (adapted from Kispal 2008: 22) ......................................... 71

x

LIST OF ABBREVIATIONS

CDA : critical discourse analysis

Cf : Confer (Latin) / See (English)

CP : Cooperative Principle

CV : Curriculum Vitae

EFL : English as foreign language

GDP : Gross Domestic Product

LTM : long-Term Memory

NCT : National Child birth Trust

RT : Relevance Theory

SCA : Socio-Cognitive approach

SFG : Systemic Functional Grammar

1

INTRODUCTION

Given that pragmatic inference has been richly studied in different fields of

science, the main concern of the present research is to investigate the impact and role of

inferencing (as a vehicle) in different genres in terms of distinguishing between the

underlying (non-encoded) and the linguistically represented semantic content in

language use. Also proceeding from the undeniable explanatory power and divergence

of inferential mechanisms from the perspective of cognition and philosophy, as well as

psychology, some linguistics claim that such kind of distinction (especially their

proportional distribution) directly leads to understanding how text organization

influences text comprehension (Brewer 1980: 229-233; Brewer and Lichtenstein 1982:

476- 477; Bock and Brewer 1985: 55-60; Ohtsuka and Brewer 1988: 3-14. Brewer

(1987: 187-197) which rests on the claim that discourse comprehension can hardly be

isolated from the mental model of the world in discourse. And it is this framework

within which the reader constructs his/her own mental model not breaking away from

the writer’s original mental model (cf: Johnson laird 1983). As to the nature and

working mechanisms of inference, both cognitive linguistics and the Relevance theory

have given more than enough space to them. Even some inference researchers, such as

Tendahl and Gibbs (2008) and Zhang Hui (2005) etc., have tried to combine these two

approaches together to form a more comprehensive inference theory, since, for them,

the relevance theory does not figure out the interconnectedness of inferential processes

with coherence, which is of crucial importance for appropriate comprehension in detail.

Thus, the paper will firstly make a brief summary of the studies on coherence (in terms

of its contribution to text organization) and pragmatic inference under the existing

theories, which will, in turn, lead us to the coherence providing power of inference.

The contribution of different types of inference in comprehension of examples

chosen from different genres is also among the prominent “missions’’ of the research.

Accordingly, this research focuses on the impact of the underlying (i.e., non-

encoded) content on the text, which, in turn, can be revealed by applying the inferential

mechanisms suggested by the constructionist theory (Graesser et al 1994) and Rzayev

(the unpublished paper), according to which, no author will provide full information in

2

the text structure and thus make the reader generate bridging inferences to recover the

underlying context of the non- expressed events. E.g. consider the text borrowed from

Noordman and Vonk (2012: 37): There were municipal elections yesterday. Because

the majority of the lower town voted for the local party, there was a shift toward the left

in the city council. But the right-wing party was not completely disappointed. It has

expected to lose much more.

But detecting any non-encoded meaning in the given text becomes possible via

addressing the explicitly represented meaning of the text. Taking into account this

principle, we can infer in the text above that the city council was more right oriented;

that there is a contrast between the shift toward the left and not being completely

disappointed, and, besides, there are other reasons (worse expectations) why the right-

wing party was not so disappointed; and that ‘’it’’ refers to the right-wing party and not,

for example, to the city council. All these pieces of information are not stated explicitly

in the text, but, we, as readers, do understand them due to the so-called “inferences’’.

Assuming the role of inferential semantics for sentences, Devitt (1996: 10)

defines meaning holism as the view that the inferential relations a sentence contains

together with the observable part constitute its whole meaning. To make it more precise,

theory of making the meanings of sentences determines which inferences the speaker, or

thinker, accepts and proceeds from. More than that, the relation that holds between two

or more sentences is not only semantic. It, also, depends on the hearer. Given such non-

semantic restricting facts, it is up to the theory to judge how the inferential properties in

the “text-reader” interrelationship gets determined. That is, as Pagin assumes, the

sentences involved must be assigned meaning to the effect that the accepted inferences

come out as valid (2008; 219).

Our approach: is both qualitative and partially quantitative in the sense that (i)

the derivation of the speaker’s prior knowledge-based inferential meaning (s) can

qualitatively differ from those of the hearer; (ii) Ignorance of the validity (i.e., truth

fullness) of the inferential meaning can lead us to false or inappropriate conclusions. To

put it otherwise, the qualitative approach assumes that the system of inference the

essential feature of which is the preservation of truth is sound if and only if it never

permits the inference of a false conclusion come from true premises; and (iii)

3

disagreement between the premises and the conclusion can create an ambiguity or

misunderstanding. As for the partial quantitative nature of our approach, inferential

meanings, as out-text components, (can) / (do) change the scope of the explicitly

expressed meaning essentially, without destroying its central idea.

We mainly take the stance of the Relevance theory which assumes that the gap

between the encoded and non-encoded meanings of a definite description contributes to

propositional content in a particular context which, in turn, is bridged by relevance-

guided pragmatic inference (Carston and Powell 2008: 347). Also enhancing the

Constructionist Theory’s basic principle “search after meaning” (Bartlett 1932; Berlyne

1949; Spiro 1980; Stein and Jrabasso 1985; Graesser et al 1994), our approach implies

the following critical assumptions:

1. The reader, as an inferencer, constructs his / her own meaning of representation

that arises as a result of his/her combining the explicitly expressed meaning with its

deep level comprehension adequate to a definite referential situation;

2. The inference constructs a coherent meaning representation at both local and

global levels.

3. Studying the question which kinds of semantics can be reasonably seen as

engendered by inferences, we, as stated above, restrict ourselves to the question of the

feasibility of seeing the meanings of the explicitly expressed representations in terms of

their belonging to the classical logic in an inferential way.

The Significance of the research: is due to the following gaps it aims to fill in:

1. The research extensively investigates the inference-based discourse

comprehension with respect to texts in different genres:, scientific texts and political

texts;

2. Given that little attention has been paid to (a) how local and global causal-

consequence coherence relations are achieved at different levels, as well as (b) the role

and potential of different types of inferences in the above-mentioned text types, the

study focuses on the cognitive processes involved in comprehending the

4

interrelationships of the explicitly expressed and non-encoded meanings in different

genres;

3. Applying a new causal-consequence inference-Maker model (Rzayev,

unpublished paper: 5) at intrasentential, intersentential and extratextual levels in

Grammar, translation, semantics, literature, writing and other classes will enable

students to find an adequate causal explanation for the integration of in-text and out-text

meanings represented by different types of inference, due to which, causally activating

the inferential meaning and connecting it with the text- base meaning, the student will

extract the text as a set of inference-based integrated ideas:

Structuring of the research: In accord with the aim and other specific features of

the research paper, the “labour division ‘’ has been distributed as follows:

The introduction part provides brief information about the basic aim, scope,

approach, methodology, significance as well as the structural organization of the

research.

Chapter 1 is concerned with the inference-based coherence and text

comprehension including a critical overview of the existing theories on coherence and

inferential models of communication. “Methods in inferential research of text

comprehension” part is also amongst the major concerns of Chapter 1.

The second Chapter mainly deals with the taxonomy of inferences as well as the

inference influencing factors.

Chapter 3 is concerned with both theoretical and empirical aspects of inferencing

in different genres of language use. The ways different types of inference provide

coherence in different genres constitute the major task of Chapter3.

Conclusion “mirrors’’ the results and findings of the research while

“Pedagogical implications’’ part focuses on the implementation of the principle findings

of the study in improving teaching of English as a foreign language at the universities of

Turkish Republic.

5

The perspectives part emphasizes those aspects of coherence and inference

which are still open to discussion in terms of the ways of benefiting from the realization

of causal and predicting inferences in the processing and comprehension of text types in

the unexplored genres as well as in spontaneous spoken discourse. References include

two types of sources: the directly used/cited literature and the supporting citation

references.

6

CHAPTER ONE

1. COHERENCE AND TEXT ORGANIZATION

1.1. FROM SENTENCE TO TEXT

Text organization, as a complex process, reflects in itself ( a ) constructing a

cognitive representation of the content conveyed by the text ( as a whole) (Meyer 1984;

Murray 1995; Jones 2012; Widdowson 1973; Harris 1952, 1964; Labov 1972; Bee

2010;Baker 2005; Bhatia 1995; Bibereta 2007) as well as (b) the pragmatic-dialectical

contributions of the different syntactic-semantic devices within the text (Kitsch and Van

Dijk 1978;Sanders and Norman 2000;Liu 2009;Martin 1992;Schegloff

2007;Schiffrinetal 2004;Scol-Lon 2007; Van Dijk 2008; Widdowson 2007;Martin

1985;1992).

People who approach texts from the point of its organization and functioning

usually focus on four different perspectives based on four different ideas about what

text is. Some have taken a formal approach to text, seeing it simply as language unit

above the level of a clause or sentence (Harris; 1952:1). What Harris aimed at here was

to throw light on how sentences are put together to form texts (Harris 1952 in Jones

2012:36).

The idea that texts could be analyzed compositionally was actually very popular

in the early and mid-twentieth century (under the influence of the dominant structural

approach to language), even before Harris introduced the term “discourse analysis”.

E.g., a group of Russian Formalist, tried to apply the principles of the grammar

of sentences to analyzing stories and novels. Vladimir Propp (1986) was well-known for

coming up with ‘’grammar of stories” in studying Russian folk tales. The “distributional

analysis” proposed by Harris, in fact, was not much different from the widely used

grammatical analysis of a separately taken sentence. The idea was to identify particular

linguistic features and determine how they occur in texts relative to other units, or “in

the same environment” with them.

7

This approach gives rise to questions which are crucial not only from the point

of information structuring within the frame of communication development, but also in

terms of behaviour of text structures in different functional styles as well as in certain

kinds of social practices: How could such knowledge explain whether “old” refers to

only “men” or it modifies both “men” and” women” in the sentence, or how can the

logic of sentence-level grammar remain the same within longer stretches of text without

causing unavoidable complexities in comprehension (problem1) and how does language

correlate with” culture” (i.e. linguistic and non-linguistic behaviour)? (Harris 1952/

1964: 356).

For Widdowson (1973-in Jones 2012: 137), Harris simply establishes formal

patterns without any reference to meaning, what contradicts the basic function of

language in terms of its communicating a definite meaning. For Halliday, the

distributional analysis also demonstrates how the formal properties of different pieces of

language correlate with the social situations in which they occur.

Widdowson, unlike Harris, argues that the analysis of discourse or text must

reveal not only how texts are put together but also how people use language to perform

social actions. Sharing this argument, William Labov, advanced the idea that “the object

of linguistics must ultimately be the instrument of communication used by the speech

community, and if we are not talking about that language, there is something trivial in

our proceedings”( Labov 1972: 187) .

Jones (2012: 138), using the term “discourse analysis” also refers to the way

linguistic elements are put to communicative use in the performing of social action,

whose purpose is to discover what sentences count as utterances and what it is that

provides a discourse with its coherence “as a piece of communication”. Widdowson,

contrasting Harris’s formal view of discourse with how language performs particular

communicative acts, associates the perspective of Harris more with the study of

cohesion and the perspective of Labov(discourse as language in use) more with the

study of coherence of discourse “ as a piece of communication”.

For J. P. Gee (2010: 28-29). “People build identities and activities not just

through language, they also use the resources of language to project themselves as a

8

certain kind of person (in accord with the assumption “language is indexical)”, in

different circumstances (to protect your “social face”, etc). Thus, you behave as a

different identity at a formal dinner party than you do at the family dinner table and,

though these are both dinner, they are nonetheless different practices or activities

(different “games”/ “roles)”.

As Bhatia et al (2008:1) state, having undergone a “discursive turn”, social

sciences have become increasingly interested in the creation of the reality that surrounds

us. This interest, in turn, has given rise to new theories and methods for the study of

language use and its role in human society. Being of interdisciplinary nature, discourse

analysis has acquired the status, stability, significance and integrity of a well-established

discipline. Unlike the 1960s, linguistics has extended its boundaries on an account of

the limits of individual sentences, shifting its focus to the meaning constructed and

interpreted specifically under the impact of particular social contexts.

This definition stems from two relevant features: (a) language (in use) can be

analyzed both on the level of (a) sentence/clause, and (b) on the level of the text.

Accordingly, language ought to be analyzed not as an abstract set of rules, but as a

dynamic tool for social action. Although early conceptualizations of text organization

(e.g. the structuralist / formal approaches) were seen as offshoot of language in use,

their present versions have moved to a focus on “ language in use,” benefiting from

sociology, psychology, semiotics, communication studies, rhetoric as well as disciplines

such as business and accountancy, organizational studies, law and information

technology, to name only a few. Due to this enrichment, the use of language in a variety

of institutional, academic, workplace and professional settings has become more

rational and efficient.

The roots of the view of text in terms of” language in use” are best traced to the

work of Wittgenstein (1951 / 1972), for whom it is language through which people

construct not only “forms of life”, but also communicate to others and their

surroundings Bhatia et al 2008: 2). Less than two decades later with the publication of

Austin’s 1962 classic. ‘How to do things with words’’; the notion that the study of text

should involve more than just its structure but also the way it is used and the way

structure shapes and gives rise to it became more prominent. Later, Foucault and

9

Derrida, though diverging considerably from the tradition of Austin, also made

language, and in particular “discourse”/ text central to their understanding of social

practice. Social scientific disciplines particularly concerned with social practice also

began to recognize the centrality of language in much of what they were studying.

Anthropologist Gregory Bateson and psychiatrist Jurgen Ruesch (Ruesch and Bateson

1951) argued that social and psychological phenomena cannot be separated from the

matrix of communication in which they occur. They were followed by a host of social

and behavioural scientists, among them Goff-man (1959) and Garfinkel (1967), who,

focused on the role of language in social behaviour and social formations. By the 1970s,

psychology, sociology, and anthropology had all taken decidedly ‘’discursive’’ turn

influenced not just by the structuralist linguistics of de Saussure, but also by a new

group/generation of linguists who were becoming more and more aware of the

relationship of language to social actions and to the socio-cultural worlds of those who

use it.

In America this concern had given rise to the ‘’linguistic relativism’’ of Edward

Sapir and Benjamin Whorf while in Europe this new concern in use was first

exemplified by the Prague linguistic circle( also called Prague functional school)

representatives whose major “purpose” was to identify the potential of language units

(no matter to what levels they belonged) from the point of their contribution to language

use in the process of communication in different contextual situations for different

purposes. Proceeding from the function of language in the social structure” (1973: 65),

Halliday and Hasan named their approach “Systemic Functional Grammar (SFG)”

which has had a profound influence on many contemporary schools of discourse/text

linguistics, including Critical Discourse Analysis (CDA), Conversation Analysis and

Multimodal Discourse Analysis.

The analytical tools of the Prague functional school ( the oppositional analysis,

the thematic progression and its types in accord with the purposive and contextual

aspects of communication) and Halliday’s grammar as well as Functional grammar

established by Dik, have been found rather productive in tracking participants in

different basic sentence types, logical as well as communicative relations, processes,

10

qualities and evaluations of these by language users as they develop throughout a given

text or across group of texts (cf. Martin and Rose2003) .

These major discourse/text analytical approaches, to varying degrees, represent a

concern for language use in the social world and focus on widely differing aspects of its

use and often define the adaptability of communication (i.e., text processing and

comprehension) to/from the immediate conversational context to the larger political,

social or economic context...

As the title of this research suggests, we are not interested in surveying all the

available literature that has been done using these various approaches. In doing so we

hope not only to understand what is specific about these approaches, but also to identify

where future possibilities for convergence and interdisciplinary are opening up.

However, it is first necessary to understand the main questions upon which these

approaches diverge and the different roads they have taken from common intellectual

roots. It is doubtless that those different approaches diverge on two of the most basic

issues in this formulation: (a) the question what text is, and (b) how the affects text

organization. These differences can be seen (in one possible sense) as different factors

contribution of multiple disciplines to the development of text /discourse studies. E.g.

sociologists and anthropologists have viewed the use of language as a function of the

(social-cultural) context, whereas linguistics is mainly concerned with focusing on text

structure with context in the background.

In contrast to restricting our approach with the requirements of this or that

trend, we think it reasonable to consider and benefit from where they might be going by

(enriching each other) and how their trajectories (both horizontally and vertically) might

converge to elaborate the findings as “ingredients” of the whole concerned with the

study of text in terms of its coherent constituents as opposed to randomly used

sentences which would not be considered a text (Carrell 1982: 479).Many researchers

(as mentioned earlier) have been working hard trying to understand the fundamental

properties of text and some theoretical accounts of them have been proposed. Often

these accounts are connected with textual analysis techniques which parallel sentence

analysis techniques. These approaches are even sometimes called” text grammars”.

11

Among others, in this sense, have been the American structuralist Charles Fries (1952),

the first American transformationist Zelling Harris (1952 / 1970), the tagmemicists

Kenneth Pike (1967) and Robert Longacre (1968, 1972), as well as a propositional

analyst Walter Kintsch (1974), macrostructulist Teun Van Dijk (1972, 1977), and story

grammarians Mandler and Johnson (1977), Stein and Glenn (1976), Thorndyke (1977)

and Rumelhart (1975). The most influential of the widely used textual analysis

techniques, however, in applied linguistics, has been the approach of Michael Halliday

and Rugaya Hasan (1976), also known as cohesion theory.

The reason of the organization of this part of chapter 1 is threefold. The first

section of the paper reviews Halliday and Hasan’s (1976) cohesion concept in terms of

textual coherence, while the second section is concerned with criticisms of the cohesive

view of coherence and relevance.

1.2. COHERENCE AND COHESION

In spite of its being one of the central notions in the study of text, the concept of

coherence has many different and often incompatible definitions and connotations. For

text linguists or psycholinguists, coherence is predominantly matter of semantics and

domain knowledge, while various brands of speech act and dialogue analysis describe

coherence in terms of intentions and context dependencies.

We will argue in this section that cohesion can hardly be taken as an attribute of

the “connected and referential continuity”of text. We begin with the role of

connectedness between /among the sentences of a text and how it can be provided in

languages use.

As some scholars (Sanders and Maat 2006: 591) point out, discourse is in no

way a random set of utterances: it should show connectedness. Accordingly, a central

objective of any linguistic work is to characterize this connectedness. Linguists have

traditionally approached this problem in terms of presence of observable language

devices and structures. For instance, Strube and Hann (1999: 309) argue that

establishing referential coherence in discourse can be characterized as the presence of a

proper antecedent of a given anaphoric expression in the current or the preceding

utterances and the rendering of both as semantically identical. This task, as the

12

interpretation suggests, can also put general constraints on the grammatical

compatibility of the expressions involved (cf: Haddock1987; Alshawi 1992). Linguists

(from different approaches) have also made considerable efforts to identify the

emerging and contribution of the scope and “behaviour” of intersentential anaphora

within the representation theory (Kamp and Reyle 1993)or for intersentential anaphora

within the Binding theory (Chomsky 1981), or within the context of the discourse

representation theory (Kamp and Reyle 1993). Unfortunately, these approaches fail to

differentiate coherence and cohesion, as a consequence of which, referentially

ambiguous interpretations have to appear when several alternatives fulfil all the required

syntactic and semantic constraints. It proves that syntactic and semantic criteria

constitute necessary but by no means sufficient conditions for identifying cohesion and

coherence in terms of text processing and comprehension activities.

For Rider et al (2011: 455), cohesion as a set of relations combines together

structures and lexical items to form a text. Pronouns like” she,” for example,

conjunctions like “but|, and linking adverbs like “therefore” usually perform this or that

cohesive role in a text. Rhetorical patterns, as Meyer (1985:21) states, apply to Macro-

propositional semantic relations, which include “collection”, “causation,” response’’

comparison” and “description’ also combining in a wide variety of ways to provide

possibilities for the subordinate relational structures of exposition.

For Recento (1987: 68 -.69), cultural/values/attitudes approach is based on

analysis of paragraph orderings and definitely plays a role in ranking thematic material

in cases in which there is thematic ambiguity or more than one macro-proposition in

text.

For Rzayev et al (2016: 236), connectedness provided by both cohesiveness and

coherence functions to make sure if: (a) all the parts of composition are closely related

to each other in explaining one common idea and (b) if they are relevant and ranged in a

clear order, ask and answer the appropriate questions. For Carrell (1982: 480), Halliday

and Hasan’s (1976) treatment of cohesion as a device contributing to coherence,

overlaps with the tradition of text grammars which usually treat text primarily as only a

structure-based linguistic phenomenon. For Halliday and Hasan, “cohesion does not

concern what a text means; it concerns how the text is constructed as a semantic

13

edifice” (1976: 26). Assuming that cohesion exists within text between a presupposed

item and a presupposing item, Halliday and Hasan (1976:4) also suggest that: ‘’The

concept of ties makes it possible to analyze a text in terms of its cohesive properties and

give a systematic account of its pattern of texture” (ibid: 4).

However, as Blakemore (1998:85) states, meaning relations are not always

realized explicitly for a discourse to have coherence.e.g. both the causal connection in

“he was tired and so he went to bed” and the temporal connection in” he wrote a letter

and then went to bed” could have been conveyed implicitly i.e., without “so” in the 1st

and “then” in the 2nd case.

Moreover, even a cohesive tie can combines two sentences/clauses to go beyond

the encoded linguistic resources to recover an interpretation. For example “he” could in

principle refer to either John or Bill in the sentences borrowed from Hobbs (1979):

John can open Bill’s safe. He knows the combination.

This, in fact, suggests that linguistic connectivity alone can hardly provide

content connectivity. Hobbs (1978), for example, argues that the coherence of a text or

discourse can be defined in terms of a set of structural binary relations between its

segments, which, in turn, depend on their propositional content.

A speaker who wishes to be understood must ensure that his utterance stands in

one of these relations to the preceding text, and a hearer who wishes to understand the

utterance must recognize which particular relation it bears to the preceding text (Blake -

more 1998: 85

1.3. CRITICISM OF THE COHESION VIEW OF COHERENCE

The other sources of criticism concerning the cohesion view of textual coherence

come from even those researchers who share several different perspectives (Brown and

Attardo 2008; Feathers 1981; Morgan and Sellner 1980, Tierney and Mosenthal 1981;

Free body and Anderson 1981; Steffensen 1981; Carrell 1982).

14

E.g. Karen Feathers (1981) assumes that it would be more rational and

reasonable to analyze a text into its propositional units and then look for cohesive ties

between the propositions, rather than the surface structures.

E g., having examined the relationship of cohesion to coherence, the report

below results consistent with Morgan and Sellner’s theory - based criticism of Halliday

and Hasan.

Tierney and Mosenthal (1981) experimented this by correlating the propositional

use of cohesive ties by 12th grade students in written essays with holistic coherence

rankings given by their instructors. The students, elicited under controlled conditions,

with several other variables of interest in the study (e.g., Topic and familiarity)

controlled that” topic” or” content”, as Morgan and Sellner argued, appears to affect the

options a writer has for using cohesive items. Summarizing the findings of the

experiment, Tierney and Mosenthal conclude: “What was found is that cohesive ties are

pervasive in text and are patterned across topics. But ties are pervasive almost by

definition since reference, conjunction, and lexical cohesion include a large proportion

of any text… such pervasiveness severely diminishes the usefulness of the cohesion

concept as an index of coherence… This study argues against using cohesion analysis as

an index or predictor of a text’s coherence. There appears to be no causal relationship

between proportional measures of cohesive ties and coherence rankings with a topic.

The present study indicates that a cohesion index is causally unrelated to a text’s

coherence” (Tierney and Mosenthal 1981: 24-25).

Freebody and Anderson (1981), empirically studying the effect of three different

levels of cohesion on reader’s comprehension, looked at the effects of vocabulary

difficulty, which had a dramatic effect on comprehension, but the amount of cohesion

did not. Hagerup Nelson in (1977) indicates that lack of connectives does not seriously

damage comprehension because readers are usually able to make bridging

inferences”(1981:19).

To reveal the interactive effects of both cohesive ties and cultural background

knowledge on reader’s processing of short prose texts, Steffensen (1981) reads a

15

comparable passage from both their native culture and from a foreign culture, and asked

them to write their recalls of the passages.

Analyzing the results, Steffensen found that causal cohesive elements were

recalled better by readers from the passage of their own native culture than from the

passage of the foreign culture. These findings suggest that when there is mismatch in

cultural background knowledge between the reader and that assumed by the text, there

will be a loss of textual coherence. For Steffensen, textual coherence can be fully

realized only when a reader appropriately identifies the schema underlying a text. In

other words, if a reader does not have or fails to access the appropriate background

schema underlying the text, all the cohesive ties in the world won’t help that text cohere

for that reader.

For Brown and Attardo, (2008) coherence is the overall meaning of a text.

Accordingly, you may think of it as its” point” or” main” idea”; or as the part of its

meaning that makes it all fit together. Happening at the semantic level, coherence may,

but does have to be, explicitly expressed in the text itself. Besides by cohesion,

coherence may be established by any of the following means:

1. The setting up and fulfilment of expectations in the text. e.g.; we start talking

about a couple engaged to be married and then we describe getting married;

2. The cooperative principle, i.e., by implicature, for example, one asks for walking

directions to a given address, only to be told that the address is five miles away; the

answer assumes the implicature that the question is being asked with the purpose of

going to the address.

3. Reasoning , I.e., by inference, for example, if we say John wants to the

swimming pool and he asks to borrow Mary’s car, we infer that he is doing so in order

to reach the swimming pool;

4. By the activation of our knowledge of a common situation .e.g., if we are

discussing a trip to the restaurant, we will expect to find mention of food, whereas if we

are discussing “the stars in Hollywood,” the word “star” is most likely to be interpreted

as” famous people” rather than” heavenly bodies” whereas in with a telescope , you can

16

see many stars in the sky” the context selects the other interpretations as more likely

(Brown and Attardo 2008: 52,68).

The presence in a text of cohesive devices, as they claim, does not guarantee that

it is also coherent, although usually coherence and cohesion go together. To show that

cohesion can neither be sufficient nor necessary to provide consider the following two

paragraphs:

(A) John likes to swim. Mary is fond of sky diving. Ann is a progolfer what

athletic children have.

(B) John likes to swim. It is a very good sport, from an exercising point of view.

Exercise is a good way to lose weight. Weight loss is the number one reason for dieting.

In (A) there are no cohesive ties (unless you count the fact that John, Mary, and

Ann are potentially children’s names) and yet coherence is easily achieved by invoking

the frame for ‘family’’ which tells us that one may have three children.

In (B) there are cohesive ties between each sentence and the following one, yet

the paragraph fails to be coherent because there is no unifying theme. (Brown and

aflardo 2008: 53-54).

Proceeding from the assumption that the cohesion approach to connectedness is

inadequate, Sanders and Maat argue: “the dominant view has come to be that

connectedness of discourse is a characteristic of the mental representation of the text

rather than of the text itself. The connectedness thus conceived is often called

coherence.” (Sanders and Maat 2006: 592)

Generally speaking, they distinguish two respects in which texts can cohere:

1. Referential coherence: smaller linguistic units (often nominal groups) may relate

to the same mental referent, i.e., discourse anaphor

2. Relational coherence: text segments (most often conceived of as clauses) are

connected by coherence relations like cause -consequence between them.

What they emphasize is: “we need to realize that coherence phenomena may be

of a cognitive nature, but their reconstruction is often based on linguistic signals in the

17

text. Both coherence phenomena under consideration-referential and relational

coherence-have clear linguistic indicators” (Sanders and Maat 2006:592). They also

claim: “For referential coherence these are devices such as pronouns and demonstratives

while for relational coherence these are connectives and (other) lexical markers of

relation, such as cue phrases and signalling phrases. A major research issue in this case

is the relation between the linguistic surface code (what Givon 1995 calls “grammar as a

processing instructor)” and aspect of the discourse representation” (ibid: 593).

Referential coherence relation can be illustrated by different referential devices

which correspond to different degrees of activation for the referent in question. For

instance, a discourse topic may be mentioned in the first sentence but once the referent

has been identified, pronominal forms suffice. This can hardly be taken as a

coincidence. Many linguistic have noted this regularity (e.g. Ariel 1990 Givon 1992;

Chafe 1994).

Ariel (1990, 2001), for instance, in her “Accessibility Theory” argues that high

accessibility markers use little linguistic material and signal the default choice of

continued activation. By contrast, low accessibility markers contain more linguistic

materials and signal the introduction of a new referent’s.

If to consider coherence relations in terms of their functioning, we are to state

that they are also termed as rhetorical relations” (Mann and Thompson 1986; 1988;

1992) or” clause relations” which constitute” discourse patterns” at a higher text level

(Hoey 1983). Cohesive relations, being meaning relations, connect two or more

sentences, cohesion’s contribution to this property of discourse implying that sentences

mean more together than apart (Hoey 1996:13).

The other defining characteristic for these relations is that they provide more

information than is provided by the sum of the segments taken in isolation (Sanders and

Maat 2006:593). For Rzayev et al (2016:523), each discourse element is classified

according to the communicative function it permits in relation to the discourse as a

whole.

The patterning of relations and elements in a discourse is described as the

discourse Macro- pattern, that is, the overall composition of discourse is determined by

18

the order in which each of its elements occurs. For example, as Crombie (1985; 58-59)

and Maurer (1999: 182-183) argue, a discourse consisting of four elements (situation,

problem, solution and evaluation) is usually constructed on the three binary discourse

relations (situation-problem; problem-solution and solution-evaluation).

For Sanders and Matt (2006:593), although relations like cause-consequence are

conceptual, they can but need not be expressed explicitly.

Recently, a significant part of research on coherence relations, focusing on the

question of how different sets of relations should be organized (Hoey 1990: Knott and

Dale 1994; Sanders etal 1992), has started to define the “relations among relations”

relying on the intuition that some coherent relations are more alike than others. For

instance, the relations in (A),(B) and (C), all express a certain type of causality; they

express relations of cause-consequence volitional result: A-argument-claim; conclusion

(B) and speech act causality (c): “This is boring watching this stupid bird all the time. I

propose we go home now”. The relations expressed in (D) and (E), however, do not

express causal, but rather additive relations. Furthermore, a negative relation is

expressed in (D). All other examples express positive relations and (E) expresses an

enumeration relation”.

A. The eagle was looking for prey. The bird was soaring in the air for hours.

B.The bird has been soaring in the air for hours now. It must be an eagle.

C. The eagle has been soaring in the air for hours now. Let’s finally go home.

D. The eagle was soaring in the air for hours. Yesterday we did not see it all day.

E. The eagle was soaring in the air for hours. There was a peregrine falcon in the

area, too.

1.4. PRAGMATIC INFERENCE AND COHERENCE IN THE

RELEVANCE THEORY

In everyday communication, the addressee often does not take the time to

exactly capture the speaker’s intended meaning, since his / her role includes both taking

in what is said (communicative meaning or informative intention) and inferring the

19

speaker’s intended meaning (i.e., communicative intention), for what the addressee is to

understand to infer what is behind the addressor’s utterance, which is usually called

pragmatic/ inference. What stimulates the inferential activity is the fact that there is a

lack of information and the reasons of such kind of asymmetry between the

linguistically encoded meaning and the communicatively intended meaning are rather

various:

(1) The economical and asymmetric principle of language use;

(2) The speaker’s “cloudy” use of language (intentional or unintentional)

(3) the listener’s incompetence in comprehending, inferring and so on, in which

situation the addressee may still fail to get the real meaning (either both the encoded and

intended meanings or only the intended meaning). Here, only the first reason will be

discussed.

The informative intention of an utterance is much easier to comprehend than the

communicative intention behind it for the latter needs more “information” and the

language (i.e., the economically expressed meaning) is far from enough. Relevance

theory mainly is concerned with the ways how people get information from an utterance

by inferring (deductively mostly) it in a cognitive context (Heziran 2006). Context

being definitely something indispensable to pragmatic inference (as well as to the whole

communication), the relevance theory takes communication as a process of inferring in

a specific context. Although the existing opinions are widely divided in terms of the

definition of the term context” (and it is still an open question), relevance theory has

made great progress by coming up with the notion of “cognitive context”, which differs

from the traditional context.

Meanwhile, different scholars focus specifically on different aspects of the

cognitive context, justifying this paradox with an assumption like the following:

Context is neither objective entity nor something predetermined; it is something coming

out of dynamic inferring process (Cai Yun 1997), which, for example, in relevance

theory, is also called cognitive assumptions, including the logical information,

encyclopaedic knowledge, and lexical representation; it is a’’ psychological construct ;”

it is part of the assumptions about the world; it is based on the real world and more than

20

that as well. The process of contextual assumption must be inferred while the inference

is an intellectual enquiry process. In Ostensive-inferential communicative model, the

speaker shows the listener his informative and communicative intention by means of

ostensive behaviours, thus providing necessary grounds of judgment of inference.

Meanwhile the addressee makes inference and forms contextual assumptions according

to the addressor’s ostensive behaviours ,the final aim is to reach optimal relevance and

communicate successfully, that is, to get adequate effect for no unjustifiable effort. So

to infer is to search relevance. The inferential mechanism of relevance theory is closely

connected with the concepts of maximal relevance and cognitive context. For example,

in the beginning of an American film titled” pretty woman” Edward quarrelled with his

girlfriend on the phone and she complained:” I speak to your secretary more than I

speak to you”. Then Edward met his former girl friend Susan and the following

conversation takes place:

Edward: when you and I were dating, did you speak to my secretary more than

you spoke to me?

Susan: She was one of my bride maids.

Prescriptively, Susan’s answer is irrelevant with Edward’s question, because

Edward expects an answer with “yes” or” no”; but Susan doesn’t answer Edward’s

question directly. To understand the implied meaning, some relevance must be

established and from what Susan has said, the following assumptions may be formed in

Edward’s mind:

(a) If one wants to be Susan’s bridesmaid, one must be very familiar with her.

(Inference: Edward’s secretary was very familiar with Susan.)

(b) Edward’s secretary became Susan’s bridesmaid, so she must be Susan’s good

friend.

(c). It is true that Susan spoke to Edward’s secretary more than he spoke to her.

In this way Edward, understanding what Susan means, can achieve the optimal

relevance. Thus, Sperber and Wilson’s ostensive and inferring model is impossible

21

without inferring (Xu Shenghuan 2007: 3). It can also be said that inference is the core

of the relevance theory. E.g., the communicative intention of the addressor is not

exactly encoded in literal language; nor can it be accessed through sentence phrase.

Accordingly, comprehending as a non-demonstrative inferential process can be divided

into two stages: assumptions formation and testing (Ran Yongping 2002: 52).

Being a universal necessity (in terms of economic and rational use of language),

inferencing is, in particular, of crucial importance in indirect speech acts. The following

example depicts an episode from a bus crash.

A: so taking bus is not safe.

B: 130 is a plane, which you can take.

As seen from B’s answer,” 130 is a plane” is a typical metaphor in the context of

A’s utterance. But we realize that it is a metaphor. “130” is, in fact, the number of a bus,

which is only known to people who live in this district where bus no.”130” runs. And

this is part of shared knowledge manifested between A and B. Furthermore, each of

them knows that planes have wings and are of super speed; with this in mind, it can be

inferred that B is implying that bus no.130 is too speed and not safe enough to take,

thus” do not take it” is what he really intends to say.

While cognitive content, as a dynamic process is applied to test assumptions,

inferring process, being cognitively ever-changing, functions as a “bridge” between the

linguistically encoded meaning and the non coded meaning, which, for Rzayev

(unpublished paper: 2) consists of two constituents: 1) the decoded informative

meaning, and (2) the intentional meaning. As also seen from the example above, only

“do not take bus no. 130” is the intentional constituent of the inferred information which

in turn becomes relevant only after inferencing the non coded informative information

(“bus no. 130”is too speedy and not safe enough to take”). This cognitively ever-

changing nature might play an essential role in both processing and comprehension of

communication.

For Hofweber (2010: 1-26), there is also an undeniably strong unity between the

intended meaning and inferential role in our language. And the expressions, to exhibit

this connection are inherently related to other sentences in a certain way (Hofweber

22

2010:2). But this assumption does not require that speaker is disposed to prefer this but

not the other way, or that they do it to Show that inferential way is best in this case. The

expressions in question, including the predicate logic (the semantic level), can have an

inferential role which should in no way break away from the intended meaning. To see

how this happens, what kind of inferential role the connectives in connection with the

truth predicates are supposed to have.

E.g. a crucial difference between the inferential behaviour of expressions like

“and” and “dog” is evident to everybody and in all cases “dog” figures in all kinds of

valid inferences (whether literal or metaphorical), which, as the truth of the premises,

guarantee the truth of the conclusion. “Fido” is a dog” implies” Fido is an animal”. But

“and’’ in contrast is characterized not by an inferential relation to various other

sentences with or without “and” in them. The inferential role of “and” is captured by a

simple Schema: And in this Schema everything is left Schematic except for the “and”.

This is paradigmatically done (for economizing purposes) with the natural deduction,

inclusion and/or elimination rules.

Yus (2006:1) goes still further in arguing that “… the development of

pragmatics, especially cognition-cantered pragmatics, has made us aware that what is

coded in human communication (spoken utterances, written texts, nonverbal behaviour,

images) highly underdetermines the information that the Sender actually intends to

communicate, that is, coded messages are far less informative than the thoughts that the

Sender really wants to communicate with them”.

For Yus and other proponents of the Relevance theory, within the more

appropriate “inferential model” of communication, although coding is necessary in

communication (we do need utterances, written texts, etc. to transmit information to one

another), it is inference that plays a major role in transforming the schematically coded

messages into fully satisfactory interpretations.

If to assume that the point of a philosophy of presence is to accomplish certain

things not only in the text but beyond the text (Said 1978: 681), then we are forced to

say that the events in the text demand an active, questioning interpretation which must

23

be approached on two levels: first, from within the text and second, from the

perspective of the reader (Gooze 1983: 337).

The inferentialism beginning with the work of Paul Grice (1989: part 1) treats

utterances / evidences about the communicator’s intentions and assumes that “the

borderline between decoding and inference coincides with the borderline between

explicit and implicit communication, so that the proposition and attitude expressed

would be recovered purely by decoding and the role of inference would be restricted to

the calculation of implicature’ (Wilson 1998:5), what, in fact, overlaps with the Grician

cooperative principle and maxims for whose opinion inferential intention is

characteristic for implicit communication. If, for example, your friend asks you if you

think her new boyfriend is good looking, but you do not think so, you might express

your opinion saying “He has a lovely personality, “violating the maxim of relevance

(her question was not about his personality, or you might say something rather vague

meaning. Showing that you do not think he is very good looking thus avoiding saying

this explicitly (violating the maxim of manner). The obvious questions are: why do

people do this? Why don’t they simply communicate what they mean directly?

For Jones (2012; 54), one reason is that implicative way of language use allows

us to be politer or to avoid hurting someone’s feelings. We also use implicature not to

make ourselves too accountable for what we have said-in other words, to say something

without “really saying” it. Most scholars distinguish two kinds of inferences. The first

kind is an inference used to generate new information. For example, from the sentence

“John selected his strantson because there was much mud” the reader can infer that

apparently strantson is a material or brand that has advantages if there is much mud.

Given that the reader is not familiar with strantson, the reader cannot know that the

conjunction” because’ is correctly used. However, by assuming that the sentences make

sense, the reader can derive the inference as new information. The second kind of

inference is a knowledge activating one. Examples from the text (about municipal

elections) are that “it” refers to the right - wing party, and that, given the contrasted

relation indicated by “but”, a shift toward an opponent party is in general a reason for

disappointment. Although this kind of inference in general is not recognized in

everyday language, most of the psycho-linguistic research can hardly neglect it.

24

The other unchallenged thing (especially concerning the second type of

inference) is that in understanding a text/discourse, the number and variety of derivable

inferences seem to be almost unlimited. Yet, as the comprehension is accomplished too

quickly for many time-consuming inferences to be made, the first issue is how to

account for the control of inferences: which inferences are made and which ones are not

made? The second issue concerning the inference as a process is; what is the process

that allows us to make an inference? How does the inferred information get activated

and deactivated? How does the information in the text interact with the reader’s

knowledge? Although until recently, inferences were considered as a rather isolated

phenomenon, our research considers inferences as one of the components of the

comprehension process. The newly developed methods to measure brain activity also

lead to a better understanding of the relation between inferences and the brain.

1.5. INFERENCE RESEARCH METHODS

When and what kind of inference is generated due to the assumption claiming

that the inferences stemming from the mental text representation, are similar to

information that is expressed explicitly by the text. Therefore, they require fulfilment of

the tasks such as reproduction, recognition, and verification. In a production task, the

reader reproduces the information that was not stated explicitly in the text. In a

recognition task, the reader has to judge to what extent particular words or sentences

contribute to inference generation; the necessity of the recognition task is similar to that

of the reproduction task.

Many people who had studied the sentences “John was trying to fix the bird-

house. He was pounding the nail when his father came out to watch him and to help him

to do the work” incorrectly recognized the sentence “John was using the hammer to fix

the bird house when his father came out to watch him and to help him do the work”.

This indicates that an instrumental inference (hammer) was made (Bransford and John-

son 1973). But reproduction and recognition tasks, indicating whether the inferences are

made or not, but in general cannot distinguish whether the inferences are made during

reading (at encoding) or during the measurement (at retrieval). In verification tasks, the

generated inferences verify whether the content of a sentence is true or false with

respect to the content of the text. Off-line or after-reading methods, used in combination

25

with online or during reading methods answer the question of when inferences are

made.

Online methods, aiming to detect the ongoing inference process immediately,

improve comprehensionsto the extent that the reader constructs more levels of

representation and more inferences at each level (Graesser et al 1994: 373). To illustrate

the multiple levels of representation, consider the following short text:

The truck driver saw the policeman hold up his hand. The truck driver’s vehicle

stopped, but a car rear- ended the truck driver.

The text-base level (i.e. predicate-argument-calculus representation) would

include a propositional description of the explicit text (Kintsch 1992; Kintsch and van

Dijk 1978). Accordingly, the first sentence would have the following propositional

representation:

Proposition 1: saw (truck driver, Proposition 2)

Proposition 2: hold-up (policeman, hand)

Each proposition has a predicate of its own (i.e. verb and/or connective) and one

or more arguments (i.e. noun or embedded noun-equivalent proposition). The text base

level would also connect the first / sentence would be connected to the second sentence

by the overlapping argument “truck driver.”

Thus, the text- base, providing a shallow representation of the explicit text, also

computes a referential specification for each noun, while deeper comprehension is

achieved only with the reader’s constructing motives which explain “why” events and

actions occurred. Readers would infer that an abrupt stop of the truck caused the car to

rear end the truck, even though the text never states that there was an abrupt stop! The

reader would also infer that the truck driver had the goal of stopping the truck and

performed some intentional action to stop it, even though this was never explicitly

stated.

Deeper comprehension stems from the inference-based global message or point,

of the text, such as” accidents occur even when people follow society’s rules. However,

26

this level of representation depends on the pragmatic context of the text, such as who

wrote the text, why it was written, who read the text, and why it was read. Nevertheless,

according to the inherently relevant inferential components of comprehension, readers

would be able to generate inferences at all of these levels. Otherwise most readers

would not / would not be able to construct a detailed description of what the truck driver

executed to stop the break (i.e. he moved his foot to the brake pedal, he pumped the

brake, and he calculated the distance between the truck and the policeman).Although

these details are usually omitted on the text-base level (to avoid wordiness and irrational

use of experience knowledge), yet it might be important to construct these details when

an insurance agent is trying to settle an insurance claim for the accident. Thus, the goals

of the reader and the pragmatic context of the message must also be considered, since it

is not sufficient to build a theory of comprehension on the basis of the physically

encoded text alone.

Some researchers, to cope with the narrow understanding, have enriched the

definition of comprehension by adopting a “systemic” perspective that appeals to the

notion of “harmony” (i.e. congruity, compatibility, and synchrony). In one sense,

harmony addressing the global coherence of the text comprehension succeeds when

there is unification between both explicit and implicit ideas within the text (Britton and

Eisenhart 1993). A second sense of harmony addresses the compatibility among the

three major components of communication system; the author, the text, and the reader

(Britton and Gulgoz 1991; Rosen blah 1978); Tierney and shanahan 1991). That is,

comprehension succeeds to the extent that there is harmony among three

representations; (a) the explicit texts; (b) the author’s intended meaning in the text; and

(c) the reader’s constructed meaning of the text.

Writers usually compose the text in Accord with their communication goals,

whereas readers attempt to recover the writer’s goals during comprehension.

Comprehension breaks down when there is a gap among the meaning suggested by the

author, the explicit text, and the reader’s generated meaning and thus it fails to drive a

transmission of information from author to reader properly. When the reader draws

appropriate inferences, the comprehension succeeds. For this purpose, the reader

should/can ask relevant questions to tap potential knowledge gaps, anomalies, and

27

contradictions. When the reader’s answers to such questions are correct and

informative, he can paraphrase the message and generate valid inferences. In fact,

inference generation, question asking, question answering and paraphrasing have

traditionally been an indicative test of whether readers can understand text without

using inferential procedures (Kass 1992; Lehnert, Dyer, Johnson, Young, and Harley

1983; Schank and Abolson 1977). Comprehension also succeeds when the reader is able

to reveal whether an incoming statement in the text involves a contradiction, an

anomaly, or irrelevancy with respect to the earlier information (Glenberg, Wilkinson

and Epstein 1982; Otero and Kintsch 1992; Graesser and Mcmahen 1993; Markman

1979).

28

CHAPTER TWO

2. THE TAXONOMY OF INFERENCES

2.1. GENERAL PRINCIPLES: INFERENCE NECESSITATING

FACTORS

The reason giving rise to so much research in psycho-linguistics and discourse

processing /comprehension on inferences since the 1970s is connected with finding an

answer to the following question: which inferences should be generated while others

cannot be neglected. This question was taken in terms of the classification of inferences

also in accord with their contribution to text comprehension. (Clark 1977; Graesser and

Kreuz 1993; Graesser, Singer and Trabasso 1994; Magliano and Graesser 1991;

Nicholas and Trabasso 1981; Reiger 1957; Singer 1988) , but a consensus has hardly

emerged, e.g. For Noordman and Vonk (2015:39); a common distinction is between

necessary and elaborative inferences while Muñoz divides inferences into text-based,

knowledge- based and causal groups (Muñoz 2014:15-2) .

Inference generation is obligatory if they provide the so-called text coherence.

Inferences that are not necessary for coherence are called elaborative, optional, or

embellishing. Two aspects of coherence can be distinguished (Noordman and Vonk:

2015:39): Inference making, as some scholars state, allows the reader to extract

information that is not explicitly described in the text (e.g. Baretta et al 2009) ; Cain

and Oakhill 1999; Cain et al 2001; Van den Broek 1989); as well as comprehension

monitoring, which helps readers to verify their understanding of what they have read (

baker 1989; Kinnunen and Vauras 2010; Vorstius et al 2013; Wagoner 1983; Graesser

et al 1994; Noordman and Vonk 2015); and the updating of information, in order to

replace outdated information with new information that is more consistent with the

context (Carrtti, Belacchi, and cornoldi 2010; Carretti, Borella, Cornoldi, and De Beni

2009; O’Brien et al 2010; radvansky and Copeland 2010).

Inference making is crucial in establishing the role and hierarchy of these three

high-level cognitive processes in comprehension and constructing a coherent situation

29

model because each of them provides the necessary connections in extracting the

unexpressed ideas of a text. It is just for this reason that most of comprehension models

appeal to the process of inference generation (Albrecht and O’Brien 1993; Gernsbacher

1990; 1997; Graesser et al 1994; Kintsch 1988; 1998; Linderholm et al 2004; Magliano

et al 1999; Myer and O’Brien 1998; Trabasso et al 1989; Van der broek et al 2005;

Zwaan and Radvansky 1998).In addition to the multiple taxonomies and the brief

information describing inference making (e.g. Graesser et al 1994; Zwaan and

Radvansky 1998; Noordman and Vonk 2015; Muñoz 2014) several properties, usually

identified as the “back bone of inferences”, also deserve careful consideration:

In text comprehension inferencing has three important functions: to provide coherence,

integration and causality. Inferences, as some scholars claim (Graesser et al 1994;

Magliano et al 1999; Zwaan and Radvansky 1998; Muñoz 2014; Noordman and Vonk

2015), are of crucial importance in providing coherence to the mental representation of

the text during reading comprehension. Muñoz (2014: 10) also argues that text

coherence is gained through establishing connection between the presented information

and both prior parts of the text and / or prior knowledge. In fact, when coherence

breaks, readers are forced to “repair” their comprehension through appealing to

inference making procedure.

For Hewett (2014; 11), inference generation allows us to “dig out” the implicit

information and integrate it with explicit information from within the text or by

incorporating world knowledge with textual information (Cain and Oakhill 2007). For

other scholars, inferences are also used to derive a high-level, understanding of the state

of affairs implicitly described throughout a discourse (Cain and Oakhill 2008; Gould

2008).

According to Gersbacher (1990), at least four sources of inference providing

coherence improve the establishment of connection in reading comprehension:

referential coherence relates to the existing connection between something or someone

that is mentioned, and a previous reference of that thing/person (e.g.: we got some beer

out of the truck. The beer was warm); temporal coherence secures necessary relatedness

to the time (e.g. “I arrived of the start time at 7:45 a.m. The marathon was scheduled to

begin at 8:00… At eight o’clock sharp the starter fired his pistol”.). Locational

30

coherence maintains consistency in relation to the space (e.g. “Mike and I were standing

in the hallway near my office. We were enthusiastically discussing some new data. In a

nearby office, people had difficulty in concentrating”) and causal coherence explains

why something happens or what will happen next(e.g. “Brown punched George. George

called the doctor”). Hence, inference making plays a critical role in constructing

coherent representations of the text. More than that, text information and background

knowledge must be integrated in order to shed light on the main idea of a text. Thus, the

degree of such relevant integration also depends on inferential connections (Magliano et

al 1999; Zwaan, Langston, and Graesser 1995). Interestingly, the construction-

integration model (Kintsch 1988, 1998) has distinguished two phases: the construction

phase, which activates both the current text information and the related prior

knowledge; and the integration phase, which, in turn,

(1) activates the more semantically interconnected concepts, and

(2) deactivates the more disconnected ideas. Therefore, the process of properly

integrating information into the situation model also requires the suppression of

irrelevant information (Muñoz 2014: 12). Moreover, several studies have highlighted

the importance of inference making in establishing causality (Mills and Graesser 1994;

Singer et al 1992; Walsh and Johnson-Laired 2009; Thagard 2004). The relation

between causality and inference making goes in both directions: causality facilitates the

generation of inferences, and inferences help to establish causal relationship - the causal

network model of comprehension (Trabasso et al 1989; Roberts and Aitken PG no.3).

Focusing mainly on the role of causal/abductive relationships, bind different types of

information in a text. As Muñoz states in (2014: 13), readers must be involved in

effortful and deep reading comprehension in order to generate inferences. Only in this

way, readers can draw proper inferences (McNamara and Magliano 2009).

However, inference making is likely to depend on different levels of processing

going from low demand to high demand of resources (Perfetti et al 2005).In this sense,

an example of an easily-made inference could be an anaphoric resolution, e.g., in the

sentences “Ann predicted that Pam would lose the track race, but she came in first very

easily”, the pronoun “she” is easily mapped into the antecedent “Pam” instead of

‘’Ann’’.(Corbetl and Chang 1983); on the other hand, a high demanding inference could

31

be caused by a causal relationship between information that is poorly connected, e, g., in

the sentences;’’Joey went to a neighbor’s house to play. The next day, Joey’s body was

covered in bruises “(Keenan et al 1984 Consequently, inferences may be represented in

a continue of processing depending on the resources that are necessary to generate

them:

1. Direction of processing. Inferences can be driven (as already mentioned earlier)

by ‘bottom-up “or” top-Bottom/down” processes. Despite the existing disagreement

between comprehension models to determine the direction by which comprehension

occurs (Albrech and Myers 1995; Graesser et al 1994; Magliano and Radvansky 2001;

Myers and O’Brien 1998; Kispal 2008), both text-driven (bottom-up) and goal-directed

(top-down) processes directly participate in inference making. For example, when the

text primes reader’s world knowledge without difficulty (“The monkey received a piece

of fruit as treat”), readers generate automatic inference associated with bottom-up

processes (e.g. idea of banana’). In contrast, when readers do not have sufficient prior

knowledge and the text is difficult or unfamiliar (e.g., “Neutrinos are created as a result

of certain types of radioactive decay, or nuclear reactions such as those that take place

in the sun, in nuclear reactors, or when cosmic rays hit atoms”), they generate more

controlled inferences associated with top-down processes, although this does not

necessarily demand resource-heavy processing. Therefore, the direction of inference

making in text comprehension, depends on the characteristics of the text (Muñoz

2014:14).

2. Knowledge-based information: Coherence integration, sometimes established by

connecting textual information, requires activating knowledge not present in the text

(prior knowledge). Explaining information expressed implicitly or in the text make

readers trigger inferences that go beyond text information; accordingly, the importance

of inferences in the interaction between text coherence and prior knowledge is of no

doubt. As some scholars (Mcnamara et al 1996; Azuru et al 2009) state, readers with

low prior knowledge achieve better comprehension when the text is highly coherent,

whilst readers with high prior knowledge only benefit from their knowledge when the

text is low in coherence. But in both cases, accessibility of prior knowledge is of great

necessity to the reader in order to elaborate inferences (Perfetti and Adlof 2012), which,

32

in turn, means: inference making depends on memory processes. According to this

assumption, the Resonance model proposes that current text information and its

comprehension reactivate prior parts of the text and or knowledge from long-term

memory (Albrecht and O’Brien 1993: Myers and O’Brien 1998; Myers et al 1994). As

some theories assume, the existence of a long-term working memory helps text

comprehension(Ericsson 1995).What it means is that the mental representation of a text

requires inference making by means of activating memory processes and prior

knowledge.

and generic knowledge that are relevant to the text. The specific knowledge For

Graesser et al (1994; 374), the background knowledge involves both specific includes

memory representations from other texts, and previous excerpts within the same text

while the generic knowledge structures include schemata (Mandler 1984; Rumelhart

and Orton 1977), scripts (Bower et al 1979; Schank and Abelson 1977) frames (Minsky

1975) , stereo types (Wyer and Gordon 1984), and other structured packets of generic

knowledge. Most background knowledge coming from life experience, or learning

organized by hierarchic relations (e.g., a script of eating at a restaurant), are meaningful

and contextually rich. These rich structures, activated, furnish much of the content

needed to interpret, explain, predict, and understand narrative events. However, in

different genres, the background and knowledge structures can be rather abstract and

decontextualized, such as the Schema for the rhetorical format of a fairy tale (Mandler

1984; Stein and Glenn 1979). E.g. a novel inference, being a product of several

cognitive cycles of searching memory, accumulates information from multiple sources

(Just and Carpenter 1992). The precise mechanisms of constructing these novel

knowledge-based inferences are not well understood, although several researchers have

offered speculations (Graesser and Clark 1985; Graesser et al 1994; Johnson- laird

1983; Schank and Abelson 1977; Wilensky 1983). These novel knowledge-based

inferences place more burden on the working memory and a potential inference has a

lower likelihood of being generated on-line to the extent that its generation imposes

greater working memory (Graesser et al 1994: 374).

33

2.2. THE TAXONOMY OF INFERENCE

Much research has been done concerning the analysis of types of the inference

that exist in their author’s view of comprehension. The literature has been prolific in

distinguishing various types and categories of inference, ranging from thirteen,

described in Graesser et al (1994), nine in Pressley and Afflerbach (1995), to the more

usual two, adopted by many more researchers. Even amongst those scholars who have

identified essentially the same single distinction between two types of inference, there is

an assortment of labelling. Commenting on this variety in the naming (and

characterizing) of inferences, Graesser et al (1994: 374) concluded: Researchers in

psycholinguistics and discourse processing have proposed several taxonomies of

inferences... but a consensus has hardly emerged” e.g., “forward” and “backward”

inferences (Mckoon and Ratcliff 1989; Singer and Ferreira 1983; Van den Broek 1990).

For van den Broek (1999: 88), the forward inferences are less frequent than backward

inferences due to the latter’s playing a more essential role in coherence-building.

Forward inferences (in fact limiting themselves with the cohesive relations based on the

so-called intralinguistic relational references) are drawn but only when the preceding

text provides compelling semantic constraints (Klin and Myer 1993;Van den Broek and

Huang 1995; Vonk and Noordman 1990) .

The landscape model severely restricts the activity of forward inferences, the

generation of which depends not only on the presence but also the timing of adequate

constraints. On the one hand, constraints take time to accumulate: Retrieval from

episodic memory as well as activation of background knowledge is a slow process

(Balota and lorch 1986; Bloom et al 1990; Kintsch 1988; Till et al 1988). On the other

hand, constraints decline when the supporting activation position / Vector dissipates, for

example, as the time since the last reading cycle increases, this opportunity enables all

the required constraints allow the forward inference (