II.
Scientific
Precursors
1.
STRUCTURALISM AND LINGUISTICS
1.1 Many diverse trends have carried the label "structuralism,"
sometimes against their will. Different structuralist approaches shared some
consistent, stable tendencies, yet often disagreed vigorously about the nature
of their enterprise. Their investigations of language or language-related
domains were dominated by the search for WELL-STRUCTUREDNESS — a common
ambition of scientists (1.1.3). If a "structure" is two or more
elements with a relation obtaining between them (1.4.5), structuralism is the
exploration of formal units and their patterns. In some domains, the nature of
those units and patterns, and the methods for uncovering them, appear
comparatively straightforward. But language offers considerable latitude for
controversy.
1.2
Ferdinand de Saussure's pioneering "course in general linguistics"
(1916) proved enormously influential and conformed to the conventional ambitions
of scientists depicted in 1.1. Because it was published by his pupils after his
death, the basic tenets of his theory remained open to differing interpretations
(cf. Wells, 1947a; Godel, 1966). Saussure defined what he called
"langue" "as a purely relational structure, as a pattern, as
opposed to the usage […] in which this pattern is accidentally
manifested" (Hjelmslev, 1947: 73);1 [I translate into English
any quotations that require it.] the "usage" was termed
"parole." In effect, Saussure was selecting the virtual system
over its actualizations (I.4.5) and inaugurating an abstraction that
became a fundamental methodological step for many schools of modern linguistics.
Saussure also proposed several further abstractions:
1.2.1
Language should be studied as synchronic system: a configuration of
elements captured at a single point in time; the diachronic perspective
{44} of the system evolving and changing in historical time was relegated to a
separate field (termed “philology” or “historical linguistics”).1[1
“Philology” usually denotes language inquiry (e.g. Delbrück, 1904) as
developed before the rise of modern linguistics. “Historical linguistics” is
usually the extension of modern methods to language history.]
1.2.2
Language was to be considered a system of signs, i.e., perceivable traces with a
conventional meaning and purpose. Each sign would arbitrarily relate an
“acoustic image,” later called a “signifier,” to a “concept,” later
called a “signified.” Saussure (1916: 98ff) hoped this approach would
supplant the popular view (going back at least to the account in the Book of
Genesis) that a language item is a “name” for a “real thing” in the
world. However, as Helbig (1974: 39f) remarks, Saussure himself fell back into
this view by proving the arbitrariness of sound-meaning relations on the grounds
that the same real object has different names in different languages (cf.
Gardiner, 1932: 29ff; Benveniste, 1939: 24). The alternative would have been to
recognize that meaning can be constituted during usage (Schmidt, 1969; cf.
1.4.6f; III.2.26); and Saussure had already excluded usage from linguistic
inquiry (II.1.2).
1.2.3
As a system, language was to be described via the notion of opposition:
each sign or element would be defined by how it differs from all others signs.
To be sure, exact recordings and measurements reveal that every utterance is
physically different from all others, even when the same person repeats the same
words. However, linguistics should be concerned with only those differences that
are SYSTEMIC, i.e., essential to the definition of the system. (For example,
Arabic has two “glottal stops” one pronounced in the back of the throat
(transcribed ‘q’) and one in the front (transcribed ‘k’), whereas
English treats both as the same sound: the difference is systemic in Arabic, but
not in English.) Systemic differences rest upon distinctive features. (For the
Arabic example, the two sounds differ in the distinctive features of “back”
vs. “front”, yet share other features, e.g., the “stop” of the air
flow.) The sounds of a language form a system based on these opposed features
(I.1.5).
1.3
These abstractions — synchronic systems of signs defined by mutual oppositions
and distinctive features — became the cornerstone of structuralist approaches,
both inside and outside linguistics. Such abstractions lend considerable
importance to the reconstructive actions of the investigator: not just to
uncover units and features, but to decide what constitutes a feature and which
features should be postulated. Martinet (1962: 59) surmised that the structures
were “nothing but a frame invented by the linguist in order to help him
classify the data. In other words, a structuralist is not one who discovers
structures, but one who makes them” (emphasis added). As Hartmann
(1963a, 1963b, 1964) has emphatically demonstrated, signs and sign patterns have
reality only as actual occurrences. To have an abstract system, the speakers of
a language (including a linguistic investigator) {45} must first synthesize real
experiences and invent their underlying principles (I.4.12). This factor was
overlooked in a “science” whose “empiricism” “suppresses the
constructive role of operations”; yet “in ‘operating’ on the objects,
the subject alone, by his action, elaborates structures” (Piaget, 1976: 132).
The “objectivity” of the human sciences is the result, rather than the
exclusion, of cognitive (subjective) activities among the scientists themselves.
Cognition is controlled by one’s prior model of the world.
1.4 It is perhaps fair to say that the central enterprise of modern
linguistics has consisted of assigning to language theoretical substrates of
structure (Beaugrande, 1982d, 1983a). This trend brought a steady increase in
the theoretical well-structuredess of language and thus rendered it a more
convincing object of scientific investigation (cf. I.1.3). However, the
scientific climate led many structuralists to act as if the structures were
situated in the object of inquiry. As long as the human processes of discovering
and utilizing structures were carried out without acknowledgement by the
structuralists themselves (cf. I.1.3), the most interesting aspect of the
enterprise remained hidden from view. As structuralism moved toward more complex
domains of language, unity and consensus steadily diminished, because the
investigators had to supply more diffuse, elaborate contexts.
1.5
The history of structural linguistics supports this contention. The most
successful project was phonology, the study of sound systems. The
positions of the vocal organs during the production of language sounds were used
for classifying the sound system into units called phonemes: the smallest sound
units capable of determining meaning. Phonemes were “systemized” according
to whether the vocal cords vibrate and how the tongue, lips, teeth, gums,
palate, etc. participate. The catalog of “distinctive features” included
“voiced,” “unvoiced,” “dental,” “palatal,” “glottal,” and so
on (cf. Trubetzkoy, 1939). Jakobson, Fant, and Halle (1952) reaffirmed
Saussure’s outlook (II. 1.2.3) by situating all features in binary oppositions
such as “ + /- voiced,” “ + /- dental,” etc. The binary opposition
became a further hallmark of structuralism,’ and obscured the recognition of
gradations (cf. I.4.15).1 [The binary opposition is vacuous unless
both sides are relevant alternatives. Otherwise, we could put “ + /-” in
front of anything because its presence is trivially the opposite of its absence
(cf. Werth, 1976). Linguists tended to use the “ + /-” symbol as a trademark
for making vague concepts look “scientific.”]
1.6
Phonology proved to be the most enduring success of structuralist linguistics
precisely because its categories are closely allied to observable sound events.
The problem of discovering structures vs. creating them was not acute as long as
researchers agreed on units and features-easy enough in regard to sounds. This
success brought enough credibility to structuralism that researchers widely
accepted and applied its basic assumptions; later structuralist explorations of
language attempted to generalize the principles {46} that had worked for
phononogy (Pike, 1967: 95; Koch, 1973-74: xvi; Quirk, 1976: v). Yet the
correspondence between sound systems and the “deeper levels” of language,
such as meaning and purpose, is far from obvious. The processing actions that
investigators had to perform in order to discover structures were much more
complex than what was needed for sound events (II. 1.4). By asserting arbitrary
ties between meanings and words, structuralism precluded a study of meaning in
terms of conceptual events (II. 1.2.2). They failed to appreciate the difference
between the total units and features that analysis can abstract vs. the relevant
features which are actually consulted and utilized in language processing. The
latter are most essential for theories of communication and cognition, even in
phonology (Beaugrande, 1978b).
1.7
The structuralist offshoot known as “Russian Formalism” can illustrate my
point. Around 1916, a circle of Russian scholars, including at various time
Roman Jakobson, Lev Jakubinskij, Viktor Sklovskij, Boris Eichenbaum, Boris
Tomasevskij, Viktor Vinogradov, Jurij Tynjanov, and Viadimir Propp, merged into
a “society for the study of poetic language.” These “Formalists,” as
they soon came to be known (cf. Eichenbaum, 1965 [1927]),2 I place in
square brackets original reference dates that bear on the discussion. The
Formalists wrote mostly in Russian, so that their impact on Western methods was
long delayed until translations were made (e.g. Todoroy [Ed.], 1965; Matejka
& Pomorska [Eds.], 1971).They analyzed units and features in such literary
text types as poems, short stories, folktales, and novels. Literature was
defined by its additional “structuration” (cf. I.4.5) most obvious in poetry
that “de-automatizes” language processing and hence intensifies and
diversifies literary communication (cf. VI. I. If, 7). Jakobson’s detailed
analyses (e.g. Jakobson & Levi-Strauss, 1962; Jakobson & Jones, 1970)
“surprise the examiner himself by unexpected, striking symmetries and
asymmetries, balanced structures, efficient accumulation of equivalent forms and
salient contrasts” (Jabobson, 1968: 602f). However, it is hard to tell how
many of these structures are relevant to, or experienced in, the act of
producing or receiving the poem, and how many are derived from the language
system as such (cf. discussions in Posner, 1971; Lotman, 1976; Werth, 1976;
Beaugrande, 1978c, 1978d; Dillon, 1978; Koch, 1978). We need to know how
representative the refined methods of the Formalists in finding structures might
be for literary communication at large. Some literary scholars have at least
expressed a self-awareness of their own role as processors (e.g. Riffaterre,
1966; Fish, 1970, 1976, 1980; Iser, 1980). Ultimately, only general empirical
studies among authors and audiences can uncover how people make structures (cf.
Wienold, 1972; Groeben, 1980; Schmidt, 1982). The same demand holds for ordinary
communication as much as for literature.
1.8 Soon after the Formalists went to work, a “Linguistic
Circle” assembled in Prague to investigate language. {47} At the First
International Congress of Slavists in 1929, the structuralist approach was
affirmed (1929 report cited in Jakobson, 1973: 11f):
Were we to comprise the leading idea of present-day science in
its most various manifestations, we could hardly find a more appropriate
designation than structuralism. Any set of phenomena examined by contemporary
science is treated not as a mechanical agglomeration but as a structural whole,
and the basic task is to reveal the inner, whether static or developmental, laws
of this system. […] Therefore, immanent structural considerations of language
were predestined to take a prominent place in the debates of the Congress.
The
Prague School, dating from 1926, developed a variant of structuralism that
sought to relate structures to their functions (cf. surveys in Mathesius, 1947;
Trnka, 1957; Garvin [Ed.], 1964). However, an operational definition of
“functions” requires a greater consensus about the nature of the system and
its elements than the Saussurian abstractions allowed. Research was mostly
limited to phonology and word order (survey in Firbas & Golkova, 1976; cf.
IV.2.63).
1.9
A rival school, convening in Copenhagen in 1933, preferred the designation
“glossematics” (cf. Hjelmslev, 1943, 1953; Siertsema, 1955; Uldall, 1957).
The Danish scholars claimed to follow Saussure’s theories most faithfully.1
[A letter to Hjelmslev is extant from Charles Bally, Saussure’s successor in
Geneva, certifying that the Copenhagen school had indeed most fully understood
Saussure’s injunction to study “language in and of itself” (cf. Helbig,
1974: 62).] Like Formalism, Glossematics suffered from a language barrier-Hjelmslev’s
fundamental book was in Danish.] They introduced a still greater abstraction by
electing to study only “form” and to disregard “substance” (whereas
phonology was allied to substantive sound events). Language was seen as a system
composed entirely of abstract relationships (“functives”) among elements
(“terminals”) that could be either “constants” or “variables” (cf.
Uldall, 1957: 9f). This austerity made glossematics the most radical version of
structuralism and placed extraordinary demands on the linguist’s powers of
structural analysis. Not surprisingly, glossematics offered few attempts to
describe real languages (e.g. Togeby, 1951). Nonetheless, the glossematists
Hjelmslev (1943) and Uldall (1957) recognized that the text is systemic (cf. Gülich
& Raible, 1977: 92ff; 1.4.5).
1.10
In the United States, structuralism came under the dominance of physicalism in
science (see II.2.25). American research, far from working through difficult
philosophical abstractions, sought language in observable events in order to
describe it in “physical terms” (Bloomfield, 1936; II.2.1ff). Again, the
dilemma arose of knowing what “observable events” mean in communication.
Though once a pupil of Sapir’s (1921) “mentalist” approach, Bloomfield
(1933: 162f) averred that linguistics must “always start from the form and not
from the meaning,” because “meanings cannot be defined in terms of our
science.” Meaning resulted from the “speaker’s {48} situation” and the
“hearer’s response”; hence, “a scientifically accurate definition of
meaning for every form in a language” would require “a scientifically
accurate knowledge of everything in a speaker’s world” (1933: 139; cf.
IV2.65). So Bloomfield (1933: 77) elected to use only “differential meaning”
— a tenet fully in line with the structuralist notion of opposition
(II.1.2.3). The investigator would proceed by asking “informants” (naive
native speakers) whether or not two linguistic forms mean the same thing.
Bloomfield’s successors (e.g. Bloch, 1948; Harris, 1951; Fries, 1954; even
Chomsky, 1957) felt that if meaning could not be exactly defined, then it should
not even be consulted in the study of language. Harris (1951: 3) proposed
instead to found linguistic inquiry wholly on “the distribution or arrangement
within the flow of speech of some parts or features relative to each other.”
Meaning was to be used only as a “short-cut,” under the condition that a
purely formal analysis of distributions would eventually arrive at the same
results.
1.11 This reduction of language to physical events and formal patterns
could not be upheld in practice. Bloomfield’s followers “bypassed the
mention of meaning without bypassing the use of meaning” (Pike, 1967: 61).
Linguists must consult meaning in order to decide which “parts or features”
in the “flow of speech” should be studied in their distribution; which pairs
of items should be presented to informants; and so on. (For example, linguists
would isolate ‘imp-’ as a segment in ‘impish’, but not in ‘impious’,
although ‘-ish’ and ‘-ious’ are both common adjective endings; and
‘impart’ and ‘implore’ would never be construed as the ‘art’ and the
‘lore’ produced by some ‘imp’-precisely because linguists know what the
words reasonably mean. No purely formal analysis, however involved, not even
listing every known use of all English words beginning with ‘imp-’, forces
us to the conclusion we arrive at so effortlessly by relying on meaning; the
“short-cut” (I.1.10) is the only road. Hence, the interesting object for
study would be the cognitive processes of linguists or informants who discuss,
analyse, or give opinions about language. These processes determine the value,
consistency, and reliability of the whole enterprise and would reveal whether
agreement or disagreement is systemic (in the sense of II. 1. 2.3) or purely
accidental.
1.12 Morphology closely resembled phonology, but dealt with the
forms, rather than the sounds, of words (cf. de Courtenay, 1972 [1888]; Sapir,
1921; Nida, 1947) (cf. II. 1.6).1 [The first account of
“morphology” in linguistics appears to be that of de Courtenay in the
1880’s. For a time, this area was eclipsed by the great success of phonology,
until its importance for fieldwork brought it back into prominence, thanks to
Sapir, Nida, Pike, and their co-workers. To limit the cataloguing,
“morphemes” were eventually restricted to function words and affixes, while
the content words were called “lexemes” (cf. Weinrich, 1976). An unresolved
difficulty in the “synchronic” approach (II. 1.2. 1) is that many morphemes
in English are recognizable as word-parts only from a historical standpoint,
e.g., ‘en-’ in ‘enchant’, or ‘-fy’ in ‘satisfy’. Again, the
structure varies according to who is doing the analysis, a professional linguist
or an everyday language user.] In analogy to a “phoneme” (II. 1.5), {49} a morpheme
was defined as the smallest unit of form that could determine meaning. Every
word should contain at least one morpheme, plus any morphemes made of affixes:
parts added either before (prefixes), inside (infixes), or after (suffixes) the
word stem. Hence, a word like ‘unbeliever’ would contain three
“morphemes.” An utterance would be a sequence of morphemes, and hence a
framework of slots that could be occupied by given classes of fillers. For
example, the subject slot of the sentence takes the filler class “noun
phrase.” The distribution of a language element, i.e., the sum of all the
slots it can fill (Harris, 1951; Gleason, 1955; cf. II. 1. I 0), should give an
exhaustive account of its use and meaning. This postulate is essentially potent
(Beaugrande, 1980a: 7), but it is hardly practicable to assemble all the
occurrences of a word or word-part. Linguists had to be content with a few
occurrences which they (or their informants) considered representative by virtue
of knowing the language.
1.13 A popular structuralist approach was immediate constituent analysis
(cf. Wells, 1947b; Fries, 1952; Gleason, 1955; Hockett, 1958). A sentence was
dissected into steadily smaller pieces until every unit was classified as a part
(I.e. an “immediate constituent”) of a larger unit. To start, the sentence
would be cut into clauses, and each clause cut into subject vs. predicate, which
raises porblems That all-time linguists’ evergreen in (4) is easy to dissect,
whereas in (4a) the predicate is partly before and partly after the subject:
NP
VP
(4)
The man | hit the ball.
Art n
v art n
VP
NP
VP
(4a)
In the ninth inning | the man | hit the ball
prep art mod n
art n
v art n
KEY
art: article; mod: modifier; n: noun; NP: Noun Phrase; prep: preposition; v:
verb; VP: Verb Phrase
As
we see, the method works fine when grammatical units are continuous,
i.e., if their constituents are directly adjacent (IV 1.3), but leads to
difficulties if they are discontinuous. Though most linguists asserted that
speech was primary over writing (e.g. Saussure, 1916; Bloomfield, 1933; V.3.1),
{50} and the success of phonology (II. 1.6) firmly anchored speech sounds in the
theoretical layout of the discipline, constituent analysis usually took the
short-cut of using written sentences already segmented by the investigator into
written words. This discrepancy was later far more pronounced in the terminology
of transformational grammar (e.g., “right-branching” vs.
“left-branching” is oddly metaphorical unless sentences are written down)
(cf. II.3.13). Indeed, the written modality is generally better than speech for
defining the “sentence” (cf. II.3.13; IV.2.17; V2.18, 30, 33, 45; V.3.7ff;
VI.1.10).
1.14 Despite these difficulties in both theory and practice (I.1.11-13),
American structuralism made important contributions to language exploration by
conducting fieldwork on remote languages about which very little was
known in wider circles. In order to record and describe a language without a
grammar or dictionary (and sometimes without even a bilingual interpreter) the
linguist had to take account of communicative and cultural settings. The tagmemics
developed by Pike and his associates (cf. Pike, 1967; Pike & Pike, 1975;
also Longacre, 1964, 1976) expressly went beyond sentence patterns to apply the
slot-and-filler approach (I. 1. 12) to human behavior at large (II.2.25). The
“tagmeme” itself was defined as a slot plus the class of items that could
fill the slot. It made no difference if the slot appeared in a sentence or in a
football game. The linguist was to be a “human observer” who “demands the
right of multiple perspective, the recognition of parallel, simultaneous
reaction to crisscrossing, intersecting vectors of experience, mental tools,
values, and psychological pre-sets” (Pike, 1967: 10). 1[Unfortunately,
the slot-and-filler approach is especially unsuited to representing the
“simultaneous” processing envisioned here (cf. IV. 1.7).] This broad scope
made tagmemics the most successful method for linguistic fieldwork. Under the
auspices of the Summer Institute of Linguistics, the Pikes and their students
described literally hundreds of little-known languages, many of them already
disappearing. However, the slot-and-filler method was too narrow a basis for a
“unified theory of the structure of human behavior.” It fails to emphasize
communicative participants’ intentions, plans, and cognitive processes
responsible for patterns of behavior. In practice, the meaning and purpose of
discourse were informally supplied by the fieldworkers themselves (cf. I.1.3;
II.1.3), who relied on commonsense knowledge about social interaction. (I.15).
Over the years, linguistic structuralism covered word sounds (phonology), word
forms (morphology), and word order (syntax). These domains were generally called
LEVELS (cf. I.4.2). Some early researchers (e.g. Hockett, 1942: 20; Moulton,
1947: 225; Trager & Smith, 1951: 81; Hall & Trager, 1953: 57) insisted
that the levels be independently studied and described. A complete account of
language would be the sum of the descriptions of all its levels. However, the
actual result was an extreme fragmentation {51} (cf. I.1.3) — language taken
to pieces until its multi-level unity in communication faded from view
altogether. Too much after-the-fact analysis obscured the synthesis that creates
language samples in the first place (cf. 1.4.3, 14). Predictably, the ban on
level-mixing soon evaporated, at least in practice. Yet a preference for
"modular" research on language has only recently yielded to an
"interactive" approach (cf. III.1.27; III.2.3ff).
1.16 Syntax became the decisive testing grounds for structuralism. To see
why this level resisted a transfer of methods from phonology and morphology, we
must bear in mind that a taxonomy is expected to offer an exhaustive catalog of
elements in a system (1.4.3). This requirement posed no obstacle to phonology,
because the sound systems of languages are fairly compact. In morphology, most
languages have, besides their basic vocabulary of content words, a limited
number of function words and/or affixes (cf. I.4.6) But in syntax, the
feasibility of an exhaustive catalog of phrases and sentences seemed very
doubtful. They might well be too numerous for a linguist to list.
1.17 This critical impasse was resolved with Chomsky's (1957) notion of
"transformation" borrowed from Zellig S. Harris (1952). Harris's
earlier (and barely noticed) paper had suggested using transformations for
distilling out "equivalent distributions of items in a discourse.1
[For example, Harris (1952: 27) "transformed" 'satisfied customers'
into 'customers are satisfied' to make the phrase "equivalent" to 'You
will be satisfied’] Chomsky's theory postulated a small set of rules to make
basic sentences, plus a larger set of rules to "transform" basic
patterns into any other possible sentence of the language. This solution
eliminated the need for an exhaustive catalog of sentence patterns while
preserving the characteristic scientific ambitions of linguistics: well-structuredness,
formality, and abstraction. Any sample sentence could be described by stating
the patterns and rules which would “generate" it, that is, assign to it a
structural description (cf. II.3.5, 9).
1.18 It was probably inevitable that this new direction, though
temporarily upholding structuralism, would eventually transcend the confines of
the latter. Research was shifting from the structures themselves over to
structure-building operations. Naturally, the question of whether the operations
of the grammar were also those carried out by speakers and hearers could not
remain unnoticed indefinitely. The field of "psycholinguistics" for
testing the correspondence between human processes and linguistic theories was
already well on its way (cf. Osgood & Sebeok [Eds.], 1954). The contact of
linguistic structuralism with other aspects of human behavior and mind
profoundly influenced the direction of language research. The next sections of
this chapter retrace that evolution.
1. 19 Structuralism had its weaknesses, and its critics have been
numerous. Yet we should not overlook its substantial contributions. The rigorous
definition of theoretical language units, and of procedures for discovering {52}
those units, helped to render language research consistent, reliable, and
cumulative. The systemic nature of language came into new focus, along with the
role of differentiation within systems. If structuralism failed to recognize the
contribution of human processing to the structures of language, it was only
following the spirit of the times (I.1.3f). Placed in a proper context, the
structuralist heritage can continue to serve us in future explorations.
2.
PHYSICALISM AND BEHAVIORISM
2.1
The prestige of physics as a science (I.I.Iff) understandably encouraged
researchers to treat language as an agglomerate of physical mechanisms that can
be externally observed and measured — an outlook we can call PHYSICALISM. The
philosophical direction known as “positivism” (cf. John Stuart Mill, 1843)
had in part prefigured this outlook. Karl Spencer Lashley, at a 1948 symposium
fittingly titled “Cerebral Mechanisms Of Behavior” (proceedings in Jeffress
[Ed.], 1951), stated the credo: “Our common meeting ground is the faith to
which we all subscribe, I believe, that the phenomena of behavior and mind are
ultimately describable in the concepts of the mathematical and physical
sciences.” Willard van Ormen Quine (1971: 142) asserted in a 1968 lecture
“that knowledge, mind, and meaning are part of the same world that they have
to do with, and that they are to be studied in the same empirical spirit that
animates the natural sciences.” The commonsense dictionary definitions cited
in I.1.1 imply that "science” itself is pre-empted, at least in America,
by the “natural sciences” that “deal with objectively measurable
phenomena.”
2.2 It was thus not surprising that American linguistics and psychology
sought to gain credibility by looking for physical events. Margaret Floy
Washburn ( 916: 11f) depicted speech as a “successive movement system,”
i.e., as “a combination of movements so linked together that the stimulus
furnished by the actual performance of certain movements is required to bring
about other movements.” John Broadus Watson (1920) viewed thought as inaudible
movements of the vocal organs, and Louis Max (1937) saw consciousness as
“motor activity.” Agnes Thorsen (1925) concluded that thought processes
could be detected by studying tongue movements (cf. VI. 31). B[urrhus[
F[rederick] Skinner has, for almost 50 years, advertised his “conditioning”
theory as scientific because its “results may be formulated in centimeters,
grams and seconds” (1982: 48) — as iof science were a part of the Olympic
Games.
2.3 Physicalist approaches tend in two directions. The first direction is
to reduce all human actions, including those of cognition and communication {53}
to detectable body functions, notably those of the nervous system. We
might strive to record which parts of the brain perform given operations,
consulting (a) patterns of excitation measured with an electroencephalograph (a
brain-wave recording device); (b) concentrations of blood flow measured with
positron emission tomagraphy (a technique for displaying proportions of
radioactively labeled substances in the body); (c) dysfunctions resulting from
brain damage, e.g. aphasia (language impairment caused by a lesion or tumor);
and so forth. These directions have developed along with recent advances in
neurology and neuroscience, but fundamental obstacles persist:
2.3.1
The correspondences between patterns of neural excitation and language events
are not clearly defined. A recording from an electroencephalograph does not by
itself suffice to reveal the organization of a specific language event. The
location of the neural activity is also inconclusive. The finding that language
is dominantly centered in the left hemisphere of most people’s brains is too
simple to yield a theory of discourse. Glassner (1981: 152) found from
electroencephalographs that creative (“exploratory”) text production causes
higher activity in the right hemisphere, while the reporting of familiar
material has the same effect on the left. Yet this finding does not provide us
with a detailed theoretical model of writing.
2.3.2
Recorded activities of the nervous system could hardly be matched up with the
many cognitive and communicative events that are not observable. The recordings
themselves are the only evidence. We cannot watch, for example, a writer’s
brain considering several language options and eventually selecting one of them.
We couldn’t use the pattern of brain waves or blood flow to identify the
options and the selective processes. We have only the final word choice, plus
the writer’s introspective self-commentary evidence after the fact, not of the
fact.
2.3.3
The effects of brain damage are typically gross, yielding dysfunctions mostly in
large complexes of language behavior. The location of the damage accounts for
some differences. Damage to Broca’s area (in the inferior frontal gyrus of the
language-dominant brain hemisphere) impairs text production, while damage to
Wernicke’s area (in the posterior part of the temporal lobe of the
language-dominant brain hemisphere) impairs reception (cf. Broca, 1865; Wernicke,
1874). But text production and reception are themselves constituted by elaborate
complexes of activity that could be sorted out only if impairments were much
more specific.
2.3.4 Finally, the human nervous system is enormously adaptive and
flexible, not just for different persons, but also for the same person at
different times. Early researchers (e.g. Eberstaller, 1890; Cunningham, 1892)
already found that the organization of brain surfaces is far from identical {54}
for everybody — a finding borne out by recent studies of language localization
(Whitaker & Selnes, 1975).
2.4 For all these reasons, it seems implausible that a theory of
cognition and communication can be strictly derived from neurological
observations. Already proposed theoretical models of language operations can be
tested, but hardly created, by observing the activities of the nerves. The
differences between brain hemispheres might account for aptitudes of language
learning during maturity (cf. Lenneberg, 1967; Scovel, 1969); for inclinations
toward induction vs. deduction (Winterowd, 1979); for creative vs. stereotyped
expression (Glassner, 1981, 1982; cf. II.2.3.1); or for sound/letter
correspondences in spelling (V1.26). But only a theoretical model of text
processing can effectually suggest why any of this should be so.
2.5 The other direction for a physicalist approach to cognition and
communication would be to consider the human a simple mechanism determined by
the environment and its visible objects. Manipulating these things should then
provide all needful insights about the human. This outlook was the basis for the
research paradigm known as BEHAVIORISM (due especially to Watson, 1920; Pavlov,
1927; Skinner, 1938). Its central project was to observe an "organism"
receiving an external STIMULUS and making a RESPONSE. By REINFORCING the
response with either a positive or a negative reaction, the investigator could
CONDITION the organism to behave in desired patterns. The conditioning could
make the organism select a given action and perform it with FREQUENCY (number of
acts and rate of occurrence) and FLUENCY (the smoothness of transitions among
stages of the act). in accord with the ambitions of the time, the
stimulus-response mechanism was claimed to follow a strict "law" of
effect (Skinner, 1938; Keller & Schoenfeld, 1950), like the "laws"
of the natural sciences, notably physics (cf. I.1.1; II.2.1).
2.6 LEARNING takes place if the conditioned organism increases the
frequency and fluency of the desired action. TRANSFER occurs if the activities
of the test session carry over to typical behavior (cf. Greeno, James, DaPolito,
& Poison, 1978). A DISTRACTOR is a task that interferes with actions and
learning. In classical conditioning, the experimenter takes the
initiative by stimulating the organism to do something, and replaces the
organism's natural response with an arbitrarily induced response (cf. Pavlov,
1927, and his famous dogs). In operant conditioning, on the other {55}
hand, the experimenter waits for the organism to display some bit of naive
behavior and then uses reinforcement to shape and control (cf. Skinner, 1938,
1953, 1957; Bachrach [Ed.], 1962; Wolpe, Salter, & Reyna [Eds.], 1964;
Krasner & Ulimann [Eds.], 1965).
2.7 The clearest successes of behavioral conditioning were obtained with
animals of limited reasoning abilities, such as rats, cats, dogs, and pigeons.
With enough training, these animals could be taught to carry out tasks like
activating levers or running mazes (see Bachrach, 1964, for one spectacular
case). The animal had to be periodically rewarded (usually with food) or
punished (usually with an electric shock) to show what sorts of behavior were or
were not desired. The studies of “visceral learning” by Neal Miller and
co-workers (cf. Jonas, 1973) suggested that even involuntary acts such as
heartbeat could be shaped by expert conditioning.
2.8 This reasonably secure body of research was then extrapolated (in the
sense of I.1.12) to the study of human language. Some experimental techniques
derived from conditioning became standard fixtures of research. In
paired-associate learning, test persons are given a list of word-pairs to
remember, then “stimulated” with a word, and required to “respond” with
its paired word (cf. Murdock, 1963). The pairing can be varied according to
whether the two words are similar or different in sound, word-class, meaning,
and so forth (cf. Deese, 1965; Kintsch, 1968; Kintsch & Buschke, 1969).
Today, the learning is attributed not to a simple strengthening of an
undifferentiated association, but to the selection of some aspect of the
stimulus that sets it apart from others (Greeno et al., 1978). Hence, learning
word lists without context may be harder and shallower (in the sense III.1/11)
of than learning from contextualized discourse (cf. Kircher, 1971; Brown,
Campione, & Barclay, 1978), because the latter provides much richer means of
differentiation (cf. IV.1.8f). In principle, the correlation between natural and
experimental conditions is far from clear (cf. III.1.1, 3; III.2.40; IV.I.9;
IV.2.16, 21; V1.13f, 19).
2.9 The stimulus-response scheme (II.2.5) became the basis for
behaviorist theories of language. Bloomfield (1933) depicted discourse as a
response to an event or situation, with visible objects in the main focus (cf.
II.2.23). In his twee example, Jill is stimulated to hunger upon seeing an apple
tree, but alas, she is not sufficiently athletic. Instead of struggling with the
fence and tree, she made a few small movements in her throat and mouth, which
produced a little noise. At once, Jack began to make her reactions for her; he
performed actions that were beyond Jill’s strength, and in the end, Jill got
the apple (Bloomfield, 1933: 24). This reductionism converts discourse into a
“few small movements” in the “throat and mouth” and “a little
noise.” The cognitive processes whereby Jill formed a plan to attain her goal
and constructed an appropriate utterance action, and the contexts that make the
action meaningful, are ignored. {56}
The food reward reinforcing her act is the classic enticement of the animal
conditioner (II.2.7).1 [1 The intrusion of the apple tree in place of
the hill-and-water-pail scheme may be a vague Biblical allusion (Genesis
3:6), though Bloomfield’s sexism impels him to give the initiative to the man
rather than to the woman.] Jack’s agency is limited to “enacting Jill’s
response” when stimulated by “movements” and “noise” to obtain food at
the risk of breaking his crown. He could be replaced by a chimpanzee, a superior
tree climber, had that animal not been eschewed by American behaviorists as too
intelligent.2 [Kintsch
(1977: 440f) remarks on the ominous consequences of selecting particular animals
for learning experiments. Thorndike (1911) put cats in cages with levers and
concluded from observations that rote repetition and trial-and-error should
become the foundations for American learning theory-which they did, for decades
(cf. Das et al., 1979: 3ff). Kohler’s (1925) chimpanzee showed substantial
reasoning powers, but these tests were mostly ignored by American educators.
2.10 Like most reductionist theories, behaviorism presented as general
cases only those special cases it could deal with (I.1.3). It also obscured the
difference between external and internal actions, between performance and
knowledge (cf. I.2.8.2; I.3.3, 11; III.3.2.2; IV.1.12, 48; V1.14f; VI.3.22).
Some situations do stimulate people to perform discourse as a response — a
special case I call situation monitoring (I.4.11.5). But this monitoring
is normally mediated by the cognitive organization of the text producer and by
the exigencies of text production (what to mention, how to describe it, what
motives to follow, etc.) (cf. I.4.6). If discourse is mediated by the intent to
steer the situation toward one’s goals, we have situation management
(I.4.11.5; cf. Beaugrande & Dressier, 1981: 163ff). By trying to stretch
monitoring to include management, behaviorist language theories like
Bloomfield’s deny the importance of plans, goals, and world-models in human
communication. No explanation can be found for how humans decide what
environmental events are relevant, and what response might be appropriate.
2.11
Hobart Mowrer (1954) thought of the sentence as a conditioning device: the
subject is the conditioned stimulus and the predicate is the unconditioned one.
Understanding the sentence means “pairing” the subject and predicate
in a new “association” (cf. II.2.8). If a text producer sees Tom stealing
money, the utterance
(5)
Tom is a thief.
takes
“Tom” as a known point of departure and impels the hearer to pair “Tom”
with “thief.” The effort required to learn this new association varies
according to the text receiver’s situational role, e.g. Tom’s mother vs. a
policeman (Osgood, 1963: 748). With only the stealing event in view, Mowrer’s
example is pure situation monitoring. Still, Mowrer recognized the distinction
between given vs. new (here, context vs. observed event) in a sentence or
utterance (IV.2.63), whereas Bloomfield did not. However, having visible {57}
objects be the “stimulus” for the utterance-the favorite example for
behaviorists (cf. II.2.5, 9, 23; IV2.56) is just a special case. It’s hard to
see any stimulus-response scheme in a text such as:
(6)
What a piece of work is a man! How noble in reason! How infinite in faculties!
In form and moving how admirable! In action how like an angel! In apprehension
how like a god! (Hamlet, II, ii)
A
strict behaviorist would predict that Hamlet should say:
(6a)
What a piece of conditioned behavior is a man! How observable in response! How
law-governed in faculties! In form and moving how statistical! In action how
like a mechanism! In apprehension how like a rat!
And
even (6a) is still not a direct “response” to a situation, but a statement
mediated by the goal of propagating a reductive theory.
2.12
The most ambitious behaviorist theory of language was Skinner’s (1957) account
of “verbal behavior” derived from his work with operant conditioning
(II.2.6) of animals (Skinner, 1938, 1953). He classified utterances as types of
“verbal operants.” A “mand” was “a verbal operant in which the
response is reinforced by a characteristic consequence,” e.g., the “mand”
‘Pass the salt’ being reinforced by getting the salt (Skinner, 1957: 350. A
“tact” was “a verbal operant in which a response of a given form is evoked
(or at least strengthened) by a particular object or event or a property of an
object or event,” e.g., ‘How do you do?’ for a greeting and ‘Thank
you’ for a service (1957: 81f). “Mands” and “tacts” (derived by
dismantling the words “commands” and “contacts”) were central to
Skinner’s approach because they are special cases where discourse actions fit
the stimulus-response scheme most closely.
2.13
To appreciate how such a bias abrogates communicative relevance (II.2.10),
consider Skinner’s (1957: 253f) droll list of “all the available
techniques” for “evoking verbal behavior,” e.g., for eliciting the word
‘pencil’. We can: (a) “mand the response by saying to the subject
‘Please say “pencil’”; (b) “make sure that no pencil or writing
instrument is available,” and offer the subject “a handsome reward for a
recognizable picture of a cat”; (c) “provide echoic stimuli (a phonograph in
the background occasionally says ‘pencil’) and textual stimuli (signs on the
wall read ‘PENCIL’)”; (d) “produce interverbal responses: the phonograph
occasionally says ‘pen and —; and there are other signs reading ‘PEN AND
____; or puts very large or unusual pencil in an unusual place clearly in
sight-say, half-submerged in a large aquarium or floating freely in the air near
the ceiling.” The word ‘pencil’ seems to be a “mand” in (a) and (b),
and a “tact” in the other cases, especially (e). Only (b) has any
motivational relevance to the speaker’s goals. “Echoic stimulation” and
“interverbal responses” are special cases with the least resemblance to
creative communication. Though methods (a) through (c) should elicit the word
‘pencil’ from the most rat-brained test-person (unless the latter, not
surprisingly, runs away, believing that the experimenter has lost his mind and
may be dangerous), they are far from hardly represent the natural conditions for
saying ‘pencil’, let alone for composing utterances like Hamlet’s in (6).
A composition teacher who tried to “evoke” student essays with such methods
could reach the age of retirement before one paper got finished.
2.14 Most of Skinner’s examples were either one-word utterances like
‘pencil’ or else standing phrases in “mands” and “tacts” (cf
II.2.12). He barely touched on the complex activities of “textual behavior.”
Since physicalists stress the environment over the human being (II.2.5), Skinner
(1957: 31 1) saw the text producer as a mere “interested bystander” impelled
not by creativity, but by the stimulus-response mechanism. In Skinner’s (1957:
346) view, a text might be (a) made of “memorized sentences emitted as purely
interverbal sequences”; (b) “reproduced as echoic of textual behavior”; or
(c) “blended” from “a few key responses with stock patterns.” To engage
in “composition,” the text producer could use: (a) “the essential operants,”
(b) “the interverbals possibly arising from these operants in the course of
emission (often composing thematic groups of responses),” and (c) “the
autoclitic framework” (1957: 353f). Thus, the formatting of texts depended “autoclities”:
“responses within behavior which is based upon or depends upon other verbal
behavior,” such as: “informing the listener of the kind of verbal operant it
accompanies”; “describing the state of strength of a response”;
“indicating the emotional or motivational condition of the speaker”; and so
on (1957: 315f). Grammar and syntax would be “autoclitic processes” that
signal the relationships among the “manded” responses to “a particular
object or event or a property of an object or event” (II.2.12). This account
implied that the “key” content words (nouns, verbs, and modifiers) are
selected before function words, and that the main vocabulary is chosen before
the phrasing to put it in (cf. III.2.30; IV.2.10, 15, 26ff).
2.15 Still more unsatisfying and reductive was Skinner’s (1957: 438fn)
treatment of “thinking,” “verbal fantasy,” and “rationalization.”
Here, he could only feebly assert that text producers are “automatically
reinforcing themselves.” Skinner thus abandoned his only firm ground: the
externalized stimulus-response scheme. Chomsky (1959: 37f) objected here “that
a person can be reinforced though he emits no response at all and the
reinforcing ‘stimulus’ need not impinge upon the ‘reinforced person’ (it
is sufficient that it be imagined or hoped for.” Skinner had finally tossed
out the evidence on which the behaviorist approach had been originally founded.
The extrapolation from the animal-conditioning laboratory to human behavior
became a leap of faith, a palpable metaphor, or an “outrageous hyperbole”
(Lazarus, 1982: 43) (cf. I.1.3, 12). Without a unifying theory of communication
as such, Skinner could only transpose the notions of operant conditioning
wherever he saw a vague resemblance between discourse events and
stimulus-response chains. Chomsky (1959: 430) concluded: {59}
Skinner
[…] utilizes experimental results as evidence for the scientific character of
his system of behavior, and analogic guesses (formulated in terms of a
metaphoric extension of the technical vocabulary of the laboratory) as evidence
for its scope. This creates the illusion of a rigorous scientific theory with a
very broad scope, although in fact the terms used in the description of
real-life and of laboratory behavior may be mere homonyms, with at most a vague
similarity of meaning.
2.16
In the behaviorist view, the proper place to look for meaning was not the human
mind, but the environment, (II.2.5). Bloomfield (1933: 144) equated meaning with
the “speaker’s situation” and the “hearer’s response” (II.1.10;
II.2.9). Skinner’s (1957) “tacts and “mands” also discount meaning
itself in favor of its external consequences. “Tacts,” being evoked by
objects, events, and their properties (II.2.12), fall under the general notion
of REFERENCE: the capacity of language to point to objects and situations in a
(real or envisioned) world (I.4.11.2; III.2.22). This notion was central to the
philosophical trend of positivism, with its credo that natural science was the
only legitimate vehicle for knowing about the world (II.2. 1). Thus, the meaning
of a statement became equivalent to the state of the world that makes it true or
false.1 [In philosophy, this outlook has been called
“truth-conditional semantics,” and
variously applied in “model-theoretical”, “extensional,” and
“truth-functional semantics” (cf. Lyons, 1976, for discussion). The analogy
between world/statement vs. stimulus/response made the same fixation on
“truth” appears like a common shibboleth of behaviorism and mentalism.]
A unification of behaviorism with the theory of reference was indeed undertaken
by Willard van Ormen Quine (1960, 1968, 1971). His approach depicted “knowing
a word” as having “a phonetic” part” “achieved by observing and
imitating other people’s behavior,1 [The “phonetic part” would
seem to be the simple word-sounds, in contrast to the “phonemic” value of
those sounds in a language system. Later in the same paper, Quine switches to
the “phonemic” standpoint (1971: 152).] and a “semantic part” whose
“paradigm case” is the word’s reference “to some visual object” (Quine,
1971: 143). A comparison with Saussure’s “acoustic image” and
“concept” (II. 1.2.2) at first suggests that Quine has regressed to the pre-structuralist
standpoint in which words are names for things.
2.17 However, Quine mistrusted this approach to reference because the
relationship between a word and the thing it stands for is a mental construct
(cf. II.3.1). He wanted to replace the things themselves with people’s
responses to things. He asserted that “there are no meanings, nor likenesses
nor distinctions of meaning, beyond what are implicit in people’s dispositions
to overt behavior” (1971: 143). We best understand speakers by “seeing what
is stimulating” them (ibid.). The clearest case would be “direct ostension”:
“pointing your finger” at “an opaque surface”; if you point in order to
illustrate not the object itself, but its abstract properties (e.g. pointing at
grass to illustrate ‘green’), you engage in “deferred ostension” (1971:
148f). Reference (surprise!) becomes “darker” when we move away from
ostension {60} toward
“denotation” (the capacity of words to mean things). Quine (1971: 153)
hinted at the hope that all other uses of language are a “grand and ingenious
permutation” of the basic “denotation” of “familiar terms.” If we
explain the meaning of one term by using another term, and the meaning of the
latter by using still another term, we are threatened by an infinite regress “
we could eventually break by pointing to an object” (1971: 153).
2.18 Ultimately, Quine conceded that reference is (for his approach, at
least) “inscrutable,” because “reference is nonsense except relative to a
coordinate system” (1971: 153). What a word means depends on its place in a
system of meaning (cf. I.4.5; III.2.22). Quine (1968) therefore had to be
content with an “ontological relativity” that permitted suspiciously many
“ingenious permutations” and “compensatory adjustments in the
interpretations” of words. Here, Quine seemed to unintentionally swing into
line with the structuralist conception of “systems of opposition” that make
language meaningful (II.1.2.3). In fact, however, he demanded not a theory of
abstract systems, but “a relational theory of objects” to explore “not
what the objects of a theory are, absolutely speaking, but how one theory of
objects is interpretable or reinterpretable in another” (1971: 153f). The
meanings of words would be explained by a, theory of how objects are relatable
in pace,” “position,” “velocity,” etc. in a “co-ordinate system”
(ibid.), such that people can translate object systems into language systems,
and, on that basis, from one person’s language system to another person’s
(cf. Osgood’s scheme in II.2.23).
2.19 Quine’s demand clearly transcended the narrow framework of
orthodox behaviorism set down by Skinner (1957). In effect, Quine admitted that
people’s “dispositions to overt behavior” do not arise from a simple
stimulus-response chain, but from the position of any stimulus event or object
in a relational system. Experiments have indeed shown that the language used to
refer to visible objects depends on what the text producer considers necessary
and relevant in the current context, for example: how many objects are similar
or different, and in what ways (Krauss & Weinheimer, 1967; Olson, 1970;
Osgood, 1971); how an object is located in a system of classes subsuming, or
subsumed by, other classes (Collins & Quillian, 1972; Smith, Shoben, &
Rips, 1974; Osgood, May, & Miron, 1975); what traits of an object are
thought more or less essential to its nature (Rosch & Mervis, 1975; Rosch,
1977); and so on. Incorporating such factors makes behaviorist approaches to
communication seem less narrow, though still incomplete.
2.20 Charles E. Osgood refined his model with the notion of MEDIATION —
in my definition, the degree to which a processor imposes its own {61} outlook
(organization, priorities, beliefs) upon a situation (cf. I.1.7; I.4.6;
I.4.11.5; II.2.10). Osgood’s approach interposed mediation between the
stimulus and its response, for instance, to explain why a person doesn’t react
to a word the same way as to the thing the word refers to: “the mediating
reaction to the sign is not the same as the reaction to the significate, but
rather consists of those most readily conditionable, least effortful, and least
interfering components of the total original reaction” (Osgood, 1963: 740).
Osgood comes close to saying that word meanings can be dynamically constituted
on each occasion (I.4.6); but he would attribute the effect to behavioral
impulses, whereas cognitive decision-making gives to human processing at least
as much initiative as to the environment.
2.21 Osgood’s mediation theory sought to resolve the matter of
initiative by introducing PROBABILITIES into the stimulus-response mechanism in
place of the natural “laws” envisioned by orthodox behaviorists (II.2.5). He
argued that “decision in behavior is simply selection of the momentarily most
probable alternative”; and that “control over behavior is simply the way in
which combination and patterning” modify “the momentary probabilities” of
the alternatives (1963: 741). Lounsbury (1954) extended Hull’s (1943) work on
habits and suggested that utterances are controlled by “habit family
hierarchies” determining what actions or reactions are probable. Probabilities
can be “reshuffled” in context, so that the same stimulus can elicit
different responses. If this view holds, each successive word in an utterance
should be more probable than the one before it, as context continually narrows
down the likely choices (cf. Goldman-Eisler, 1958a; cf. IV.1.1, IV2.15).
However, this strict correlation was not found (Tannenbaum, Williams, &
Hillier, 1965). In fact, the end stretches of utterances are more often devoted
to new or unexpected material (IV,2.63f).
2.22 Still, Osgood’s model allowed a more refined outlook on discourse
than orthodox behaviorist ones. He broke down the simple stimulus-response
concept into three “stages” that were carried out during both “decoding”
and “encoding” (Osgood, 1963: 740). A presented text would be first
registered as a “pattern of sensory signals,” then “integrated” (or
“chunked”, I.4.5), and finally assigned a “representation” (or meaning);
to produce a reply text, an “intention” to be “meaningful” would be
relayed to the “skills” of “integration” and from there to
“execution” (1963: 741). Hence, the whole process is both psychologically
and physiologically reversible: production and reception are mirror images of
each other (but cf. III.1.9). The person’s vocabulary is stored in a huge
“word form pool,” a catalogue of “surface expressions” (the forms of
words, not their meanings).1 [On how word forms are accessed, see
V.1.34ff.]These forms are passed to a “semantic key sort” that decides what
meaning is most probable, where “meaning” is defined by Osgood as {62} a
“bundle of simultaneous semantic features.2 [“Semantic
features,” an ingenious extrapolation from sound features (II.1.5f), enjoyed
some Vogue, especially as a way to make static objects out of word meanings. But
nobody could devise a catalog of them for more than a handful of contrived
examples (usually objects like chairs, whose parts or traits made obvious
“features”), and the “ +/-mania soon reached bizarre extremes (cf. II 1).]
“In a sentence (or text), these feature bundles are integrated by a cognitive
mixer” that establishes “congruity” of meaning and eventually leads to the
“total response.” For example, to understand Mowrer’s sentence (II.2. I 1)
(5) ‘Tom is a thief’ a “semantic fusion” puts ‘Tom’ together with
‘thief’; if we had: (5a) ‘Tom is not a thief’, a “semantic
dissolution” would be done instead (Osgood, 1963: 748). The more complex a
text, the more “mixing” is needed for “fusion” and “dissolution.”
The “features” in these “bundles” were explored by an experimental
technique called “semantic differential,” where test persons rated words on
scales like “light vs. dark,” or “pleasant vs. unpleasant” (Osgood, Suci,
& Tannenbaum, 1957).3 [Though interesting, the “semantic
differential” approach is not a theory of meanings so much as a theory of
responses to meanings (connotations, attitudes, emotions).]
2.23 In his early work, Osgood (1963: 741) resolutely denied the need for
such notions as “cognition,” “purpose,” and “volition,” and
stretched his concept of “probabilities” to cover them all (cf. II.2.21). He
carefully reduced all meanings to “past experience with behavioral outcomes”
(ibid.). Later, Osgood (1971: 498ff) propounded a “non-linguistic cognitive
system” “shared” both by “perceptual signs” and by “linguistic
signs.” This system would be capable of the “representational mediation”
that allows “perceptuomotor behavior” to engender “symbolic processes.”
However, the treatment of meaning was still in terms of “patterns of
stimulation, including previously learned signs, which regularly and reliably
produce distinctive patterns of behavior” (1971: 523; cf. Quine’s scheme in
II.2.18). He sought to prove this view with his well-known experiment where
students watched him put or move objects on a table and “simply described”
the events to a fictitious “six year old boy just outside the door” (Osgood,
1971: 497). His students produced very similar sentences to describe a given
event, depending on the visibility, salience, and similarity of the objects
(e.g. moving rather than stationary) (cf. II.2.19; IV.2.56ff).
2.24 By seeking the basis of language in perceptual systems, Osgood in
effect adopted the empirical correlate of Quine’s philosophical account. Though
both scholars strove to found meaning upon “overt behavior” and on responses
to visible objects (Quine, 1971: 143; Osgood, 1971: 523, 497), they {63} both
had to expand the behaviorist-physicalist framework by postulating complex
systems that determined the real significance of external events or objects.
Ineluctably, the “stimulus/response” mechanism became a commonplace (and
hence unnoticed) metaphor for causality. For the sake of orthodoxy, researchers
would say “stimulate” instead of “present,” “make happen,”
“cause,” “enable,” and so on. Such usage often had little relation to
the original context of animal-conditioning (cf. II.2.7, 12, 15).
2.25 Among structuralist trends, American descriptivism and tagmemics
proved the most congenial to behaviorism, e.g., in Bloomfield’s account of
meaning (II.1.10; II.2.9, 16). Pike’s tagmemics leaned toward behaviorism as a
vehicle for fieldwork (II.1.14). The fieldworker immediately sees “the
interdependence of language and non-language behavior” and must “correlate
particular sequences of sounds” with the “particular environment in which
they are uttered and with the particular non-language activity which is going
on” (Pike, 1967: 29). However, this “unified” approach was rejected by
most linguists in both European structuralism and American mentalism, mainly
because they hoped to separate “pure” language from everything else. Thus,
language as behavior sustained no common ground with language as an abstract
system, because the two reductions of language went off to the opposite
extremes. Unfortunately, this divergence prevented a dialogue or integration
that would have benefited both sides.
2.26 Behaviorism had a profound impact on education and child development
programs.1 [All behaviorist educational programs inherit the
contradiction between making learners self-reliant and creative vs. applying a
theory best suited for making them pliable and “conditioned”. The tactics
are well-known: rote repetition, punishment (“negative reinforcement”), and
a simplistic equation of observed behavior, especially errors, with the
child’s true abilities (cf. I.3.3, 9).] Behaviorists
found in children’s language good support for their views. Skinner’s (1957)
“mands” and “tacts” (II.2.12), Quine’s (1960) progression from
pointing to objects over to using language to refer (II.2.17), and Osgood’s
(1971) “conditions for sign learning,” were all illustrated with
children’s behavior. The motive is clear: children fit the behaviorist
world-view much better than adults, allowing this extrapolation: animals =>
children => all humans. For instance, the pairing of words with visible
objects (a type of “paired-associate learning,” II.2.8) is really feasible
only for a young child with a minimal vocabulary (Gough & Hillinger, 1980:
183).
2.27 It follows that behaviorist learning techniques work best for
training young children (or the mentally retarded) in rudimentary, emergent
skills, when complexity has not yet imposed severe demands on cognitive
organization. Later, the techniques become steadily less helpful, because they
ignore more elaborated processing capacities in favor of wasteful, eventually
unmanageable methods of rote repetition and reinforcement. Whether anything
useful is learned and retained from behaviorist schooling {64} is mainly
pure chance, as long as the educators don’t understand the processes of
cognitive reorganization that, according to Mandler (1979), are the necessary
preconditions for learning from repeated experiences (cf. II.3.31; III.1.28;
IV.2.38). Thus, behaviorist education, like Darwinism, weeds out children for
whom it doesn’t happen to work; and any inequities are rationalized as results
of “genetic intelligence” and “aptitude” (I.3.Iff).
2.28 Nonetheless, education is concerned with behavior in the project of
improving human skills, including writing. It is unwise to lose sight of the
fact that writing is an action. Porter (1962) criticized composition classes if
they violate these basic conditions for improving behavior: (a) the behavior has
consequences, (b) learning doesn’t have to be done in a single leap, and (c)
overzealous punishment does not suppress behavior altogether. Similarly,
Zoellner (1969) advocated his “behavioral pedagogy of composition” because
writing instruction so often flouts learning theory by being intransigent toward
errors, disregarding students’ “here-and-now behavior,” failing to
appreciate and work with approximations, and attempting to impose conformism
(cf. also Zoeliner, 1971; I.2.17ff; 1.3.7ff). Zoellner (1969) extrapolated from
“operant conditioning” (II.2.6) to advise that the instructor should wait
for students to “emit” a desired “bit of behavior” and should then
“shape” and “reinforce” it with frequent reactions. Since behaviorism
offers no standards of value or communicative relevance, Zoellner focused on
fluency: the rate and ease at which students write (cf. II.2.5; II.3.21, 38).
Students were to “blurt” something out”1 [Zoellner uses the
term “visceral blurt” as a facetious metaphor alluding to Miller’s
“visceral learning” (II.2.7).] and instantly record it on a blackboard or
drawing pad. This record serves as an approximation to be used in later stages
of the writing. The student must not be asked to “think,” lest there ensue
“an immediate drop in the density and variety of external behavior”: the
student’s “body tenses up, he [sic] becomes motionless, his face takes on a
wooden expression, and his vocalization drops to zero” (1969: 298).
2.29 This emphasis on fluency suggested that fast, easy writing equals
good writing (compare Zoellner, 1969: 270ff). Yet good writers are often not
rapid or fluent (Boyle, 1969; Elbow, 1975; Lowenthal & Wason, 1977; Murray,
1978; Atlas, 1979; Odell, 1980; Wason, 1980), mostly because they expend intense
effort. Zoellner extended over to writing the popular view (I.2.8.3) that fluent
speech is a hallmark of literacy. However, due to differences in rates and
processing conditions of the two modalities, a transfer of the rhythm and tempo
of speech over to writing can be an impediment to composition as much as an aid
(V.3.10-37). Student writers rush along, inserting fillers, trivialities,
needless words, and restatements, just to keep moving. And since speech is not
preserved, little use is made of the opportunity {65} writing offers to look
back and build directly upon what was already said.
2.30 Zoellner contended that students can talk out their message in a
sustained, rapid-fire segment of sound stream” that “communicates quickly
and effectively” (1969: 273), though their writing may be muddled. His
appraisal of the need to transfer speech skills over to writing was sound, but he
failed to spell out the steps in the transition. He had to presuppose that
“the student has already thought the rhetorical problem out, and that it is
only necessary to get him to say what he already knows” (1969: 299). As in
operant conditioning, the experimenter/teacher begins the “shaping” only
after the animal/student emits some “naive behavior” (II.2.6). For Zoellner
(1969: 303f), a failure to write what one wants to say is like a pathological
deviation between a brain impulse vs. a motor action, sometimes called
“schizokinesis” (Gantt, 1953) — as if writers’ problems are comparable
to mental illness. By decrying “thinking” (II.2.28) and taking for granted
the resolving of “rhetorical problems,” Zoeliner neglected the cognitive
processes that control organize, and evaluate the content of writing.
2.31 Because the stimulus-response scheme abridges relevance and creative
initiative (II.2.10, 13, 28), educational behaviorism at large deals only with
gross externalizations of crucial cognitive developments (II.2.27). The status
of any observed event as an indicator of underlying skills is drastically
over-simplified. No doubt Zoellner himself evaluates student papers by standards
wholly unrelated to operant conditioning. He publicly opposed the “behavioral
objectives” that clutter up education as disguised demands for
“middle-class” uniformity and “cultural coercion” (Zoellner, 1971:
231f), e.g., Lois Caffyn’s (1969: 72) goals of habitually using
“preferred” language, listening eagerly and courteously, and attending
community meetings and clubs. (I know I don’t do these things all that
much, yet I can’t see how my writing is any the worse for it.) Yet the
individualism Zoeliner (1971) favored is irreconcilable with his extrapolation
from operant conditioning which, if anything, massively steers each individual
toward predetermined ends -to be “conditioned” means to conform. The
“shaping” can be tailored to the student, but the standards for judging what
to write and how to react to it cannot be derived from the rat-training sessions
Zoeliner extolled.
2.32 Even Caffyn’s more convincing “objectives” of discussing
issues, sharing experiences, and responding to fine distinctions, are mere
labels for undefined cognitive and social developments. Writing is a genuine
mode of human behavior, involving manifest actions, and it is organized in a
real time progression (cf. Matsuhashi, 1979, 1981, 1982; Bereiter, 1980; Flower
& Hayes, 1980, 1981). But conventional behaviorism is too simple and
reductive to be more than a precursor for a science of composition. In future
work, we must find new approaches, but we should also bear in mind some
behaviorist tenets: {66}
2.32.1 A theory of human language should be an integral part of a general
theory of human behavior (Skinner, 1957; Osgood, 1963; Pike, 1967). We must keep
in view the “broader field” of “the behavior of a most complex creature in
contact with a world of endless variety” (Skinner, 1957: 456).
2.32.2 “What needs to be explained is the total speech episode,”
including “all relevant events in the behavior of both speaker and listener in
their proper temporal order” (Skinner, 1957: 36). In this “total event,”
“language behavior and non-language behavior are fused” (Pike, 1967: 26).
2.32.3 “The listener” (or audience) is “an essential part of the
situations in which verbal behavior is observed”; “the change which is thus
brought about in the behavior of the listener is appropriately called
‘instruction’” (Skinner, 1957: 172, 362; cf. 1.4.6; III.2.26).
2.32.4 Language education should be based upon “the actual here-and-now
behavior of a given student or a given classroom of students” (Zoeliner, 1971:
232).
2.32.5 Learning works best as a series of successive approximations
moving steadily and confidently toward an end result (Sidman, 1962).
2.33 Cari Frederiksen (personal communication) remarks that experimental
psychology is inherently behaviorist to the extent that it observes and measures
the responses of test persons to induced or manipulated events. Teaching has the
same orientation to judge performance in terms of tasks and results. Thus, teaching
is research (I.4.17; III.2.40; VI.3.25) by virtue of this analogy. However,
we need a comprehensive model of writing that shows how behavioral evidence
relates to the larger context of cognition and communication. Not until then
‘can we co-ordinate daily practices with the totality of our enterprise.
3.
MENTALISM AND TRANSFORMATIONAL GRAMMAR
3.1
Whereas PHYSICALISM seeks explanations in observable mechanisms, MENTALISM looks
to the mind and its capacities. Mentalists explore the ideas, idealizations,
abstractions, intuitions, and so on, that underlie external events and
behavioral dispositions. In American language research, mentalism had been
represented only in a few studies (e.g. Bloomfield 1914; Sapir, 1921; Whorf
1965; [19441]. Bloomfield’s (1933, 1936) vigorous conversion to physicalism
determined the course of structuralist linguistics for decades (II.1.10). Even
in the study of meaning, mentalism was lamented as “vitiating” and
“pernicious” (Quine, 1971: 142). Though physicalism and behaviorism were
eventually recognized as inadequate, they were difficult to revise, as we saw
from the efforts of Quine and Osgood (II.2.16-23). The terminology lived on, but
more as a set of orthodox metaphors than as a description of human communication
(II.2.15, 24). {67}
3.2 The rehabilitation of mentalism in American linguistics and
psychology came when formal logic was conspicuously introduced into theory and
method. One of the early projects of positivism (II.2.1, 16) was to build a
perfectly consistent and complete logical basis for mathematics and science
(Russell & Whitehead, 1903). Gödel’s (1934) proof that every logical
system allows true statements you can’t prove and false ones you can’t
disprove put an end to that project, but not to the longings of scientists for
such a system. In effect, the argumentative procedures of science continued to
resemble those of axiomatic logic (survey in Stegmüller, 1976). Moreover, logic
commands the respect of the general public who only vaguely understands its
intents and limits (cf. Beaugrande, 1980a, 1981b). Logical languages fall far
short of the flexibility, complexity, and subtlety of “natural” (i.e.
non-artificial) languages like English, and fail to account for the
participation of producer and receiver, and the informativity and situationality
of the text. Logic presupposes well-defined classes, subclasses, and instances.
Its conclusions are often so trivial and predictable that real people wouldn’t
utter them in everyday communication. Hence, the use of logic as a model for
human thought and language entails a different, though hardly less radical,
reduction, than that of physicalism (II.2.9f), and is thus open to principled
objections. A more interesting issue is how people (including philosophers)
perform processes they consider “logical,” e.g., defining classes and
drawing conclusions.
3.3 In common parlance, “logic” designates reasoning procedures that
emphasize truth, consistency, completeness, and causality. This kind of logic is
popularly thought to be the ideal mode of thinking and communicating (cf. Olson,
1977a, 1977b). For example, composition teachers watch for violations of logic
in students’ papers (I 317). However, too much stress upon logic can be
inhibiting to creative learning processes whose approximations and diversities
are wrongly looked upon as embarrassing. Seymour Papert (cited in McCorduck,
1979: 290ff; cf. Papert, 1971) remarks:
We
all know that we have all these horrible moments of confusion when we begin a
new project, that nothing looks clear and everything looks awful, that we work
our way out using all sorts of odd little rules of thumb, by going down blind
alleys and coming back again, and so on, but since everyone else seems to be
thinking logically, or at least they claim they do, then we figure we must be
the only ones in the world with such murky thought processes. We disclaim them,
and make believe we think in logical, orderly ways.
The
logical ideal thus plays down the intuitions, inferences, defaults, and
approximations of everyday reasoning (cf. Piaget, 1976; Collins, 1978; Papert,
1980), and may mislead learners to simply give up trying. The real role of logic
in cognition and communication is more likely that of a community of specialized
processes (I.4.8; III.1.23); this “logic specialist” can be consulted as
needed, but doesn’t dictate what a person can think, say, or mean. {68}
3.4 The popular wishful thinking about logic helped to case the entry of
mentalism into linguistics and psychology. Chomsky’s (1957) “grammar”
promised to relieve a crisis in taxonomic structuralism: the unwieldy complexity
and variety of language syntax (II.1.16f). He suggested that a natural language
like English is composed of an infinite set of sentences, in the same way as
logic permits unlimited embedding (inserting one instance of a structure inside
another) and recursion (repeating a formative operation over and over) (cf.
IV.2.13, 70; IV.3.37). Thus, no exhaustive catalog of surface patterns could or
should be undertaken. His own system was an extrapolation from “axiomatic
logic” (also called a “calculus”): a logic founded on a set of
“strings” (sequences of symbols or words) called “axioms” because they
are accepted a priori. The axioms are created by means of “formation rules.”
All the strings of the logic can then be made by applying “derivation rules”
to the axioms. Only the strings so derived count as “well-formed.” The
meaning of a string is determined by applying “interpretation rules.” Only
well-formed strings can be interpreted — ill-formed ones are considered
meaningless. Two or more strings can be linked with “connectives”:
‘and’, ‘or’, ‘if […] then’, and ‘if and only if’ (also written
‘iff’). Whether a string so connected is as a whole “true” or
“false” is decided by consulting “truth value tables.”
3.5 At first glance, all these notions suited the purposes of logic for
building formal proofs. The analogy between a logical language and a natural
language also seemed reasonable. The “strings” were the sentences. The
“axioms” were kernels (or “kernel sentences”). The “formation rules”
were called “phrase-structure rules”, and the “derivation rules” were
“transformational rules”. A “well-formed string” would be a
“grammatical sentence” (or “well-formed sentence”). The logic itself was
the grammar for the language in question, e.g. English. The “transformational
grammar” propagated by Chomsky is one type of generative grammar: one capable
of assigning a structural description to all the strings it allows (cf. II. 1.
17). Other grammars, both transformational and generative, were proposed once
the trend started, but Chomsky stipulated most of the “standard”
assumptions. For example, he rejected the notion of probability in linguistics
on the grounds that every sentence has close to zero probability — logically
true, but operationally false. Though the detailed surface structure of a
complete sentence may often be unique, participants in communication have
elaborated expectations about what concepts, goals, and actions are more or less
probable in a given context. (II.3.14; IV.1.2; IV.2.15, 54).
3.6 Despite common talk of a “Chomskyan revolution,” the
new grammar actually upheld many contemporary attitudes in science and
linguistic theory. The place of logic in positivism and popular opinion
(II.3.2f) made it a good candidate for extrapolation. The already central notion
of the “sentence” attained an even higher theoretical status when it became
axiomatic, i.e. postulated a priori (II.3.4f). The notion of “deep
structure” lent {69} new impetus to the established striving for well-structuredness
by adding theoretical substrates of structure (cf. I.1.3; II.1.4), e.g., to
distinguish between superficially similar sentences like (Chomsky’s, 1965:
22):
(7)
I persuaded John to leave.
(8)
I expected John to leave.
Thus,
the overall scientific climate favored the new grammar despite its innovative
terminology.
3.7 This success determined the priorities of language study. First, the
primacy of syntax was established not only over meaning and purpose, but also
over phonology and morphology (cf. Householder, 1965; Dik, 1967). Researchers
would blithely affirm things like:
The
fluent speaker’s ability to use and understand speech involves a basic
mechanism that enables him to construct formal syntactic structures […] and
two subsidiary mechanisms: one that associates a set of meanings with each
formal syntactic structure and another that maps such structures into phonetic
representations. [emphasis added] (Katz and Postal, 1964: 2)
Second,
the independence of syntax from the rest of language was implied by the design
of the logical model. Third, the independence of language from sociological and
cognitive contexts had to be postulated if the grammar alone was to stipulate
which sentences were well-formed or grammatical. The abstraction of the language
system from language use — prefigured in Saussure’s (1916) dichotomy of
“langue vs. parole” (II.1.2) — was renewed as “competence vs.
performance.” Fourth, language was defined as a means of predication, i.e.,
the assigning of “arguments” to “predicates” to form “propositions”
(e.g. Jarvella, 1977). The meaning of “propositions” depended on the
conditions under which they are true (Chomsky, 1965: 22).
3.8 Chomsky soon reached out beyond the domain of formal grammar.
According to his own testimony (1975), he felt provoked by the “intellectual
scandal” of physicalist and behaviorist theories, especially Skinner’s.
Transformational grammar appeared suitable for the role of a rival theory of
human processing. Admittedly, there was no hard evidence that humans produce and
understand utterances the same way that the grammar handled sentences. Yet
psycholinguistics, a field striving to merge linguistics with psychology
(II.1.18), readily took this correlation as a working hypothesis (e.g. Miller,
1962; Mehler, 1963). Soon after, Chomsky’s (1965) version of the grammar
ventured into extensive psychological implications. Even in his early Skinner
review, Chomsky (1959: 56) vacillated: on the one hand, he warned that “the
construction of a grammar that enumerates sentences in such a way that a
meaningful structural description can be determined for each sentence does not
in itself provide an account of this actual behavior”; on the other hand, he
considered the “grammar” to be “a statement of the {70} integrative
processes and generalized patterns imposed upon the specific acts that
constitute an utterance,” and the “rules of the grammar” to be “the
selective mechanisms involved in the production of a particular utterance.”
3.9 This ambivalence persisted throughout transformational grammar and
engendered a double-faced terminology whose formal definitions in the theory
were at variance with their commonsense meanings: “competence,”
“generate,” “speaker-hearer,” “language acquisition device,” and so
forth. Ironically, Chomsky fell prey to the same predicament he diagnosed in
Skinner (II.2.15): “the terms used in the description of real life” and
those of the abstract grammar were often “mere homonyms.” The grammatical
terms became pervasive, hardly noticed metaphors (cf. I.1.3).1 [Chomsky
(1965: 9) acknowledged that his terminology had created a “continuing
misunderstanding,” but saw “no reason for a revision.” The
“misunderstanding” was probably an advantage.] The correlation between the
grammar and the human mind, like that between logic and language, was simply
asserted and accepted as an act of faith. The finding that more complex
sentences are harder to process would be predicted by almost any grammar, not
just by Chomsky’s.2 [ Specifically, Chomsky’s model predicts that
processing times reflect the number of transformations people are doing in their
heads to work down to, or up from, “deep structures”; this “derivational
complexity hypothesis” would appear in some form for virtually any model (cf.
Kintsch, 1974; Levelt, 1978).]
3.10 Chomsky himself apparently considered empirical discovery via
observation wholly inessential. He averred (1965: 15, 19f) that “the
description of speech habits” and the discovery of “patterns” in “a
corpus of observed speech” could “preclude the development of a theory.”
“Reliable operational criteria for the deeper and more important notions of
linguistics” would never “be forthcoming”; the “sharpening of the data
by more objective tests” was “a matter of small importance,” since
“objectivity can be pursued with little consequent gain in insight and
understanding.” A better source of evidence was the linguist’s
“intuitions” as a “native speaker”—an “enormous mass of
unquestionable data.”
3. 11 Though intuitions play some role in most sciences (cf. I.18),
linguistics was the only discipline to proclaim openly that the “intultions”
of the investigator are its “unquestionable data.” The supplying of
contexts, previously unacknowledged in scientific procedure (I.1.3; II.1.3, 7,
11, 18), came out in the open, but with important restrictions. The bias toward
mentalism and formal logic prevented an examination of the really broad contexts
of communication in social interaction. On the contrary, Chomsky’s approach
obscured rather than clarified the correlation between theory vs. empirical
evidence observed in everyday language (cf. Beaugrande, 1982d). First, such
evidence was “performance,” and the theory was directed to “competence”;
“psycholinguistics” was expected to explain away discrepancies between
performance and competence (Slobin, 1971). {71} Second, the theory focused on
“deep structure,” whereas real language events provided no evidence except
“surface structure,” whose nature was repeatedly redefined in
“extensions” of the “standard” theory (cf. Chomsky, 1965 vs. 1972 and
1976).1 [In the “standard theory” (Chomsky 1965), surface
structure was given no role in the interpretation of a sentence. In the
“extended standard theory” (Chomsky, 1972), both deep and surface structure
were admitted to the interpretation. More recently still, in “trace theory”
(Chomsky, 1976), the representation is a single level of surface structure
annotated by “traces” that in effect preserve the predicate/argument
structure of the sentence as a proposition.] Third, contrary evidence could be
accounted for by adding new rules and leaving the theory itself untouched.
Fourth, the role of context in defining grammatical categories was disregarded
(I.4.6; V.2.27; Lenneberg, 1975); the more “abstract” the grammar, the less
secure its constructions.
3.12 It would have been necessary to test the more fundamental claims
implied by transformational grammar: (a) that the structures of various
sentences are derivable from each other by applying formal rules; and (b) that
simple structures are more basic to language (and to knowledge of a language)
than complex ones. Though seemingly reasonable, these claims may be either
trivial or misleading, depending on how we interpret them:
3.12.1 The two claims are trivial in the sense that the grammar itself
stipulates the same categories for all structural descriptions. Hence, any
sentence structure can by definition be derived from any other by manipulating
shared categories. The number and elaboration of categories and rules also
decide what counts as simple or complex. The two claims are thus inherent in the
grammar itself, though so far, no complete grammar of this kind has been
designed for any natural language.
3.12.2 The two claims are misleading in the sense that the decision to
create one sentence structure in preference to any other, e.g. an active vs. a
passive, is guided chiefly by meaning and purpose, e.g. the desire to express an
event with a particular focus (cf. IV2.56, 61; VI.I.22); syntactic rules only
specify the options. Hence, the operational relationships among different
sentence structures are semantic and pragmatic, not just syntactic.
Consequently, purely syntactic rules yield no realistic measure of processing
complexity (cf. II.3.2Iff). What is experienced as simple or complex depends on
the total processing demands within the current context, including a person’s
language skills. “The job of syntax is not central but rather peripheral in
ordinary language-merely accommodating lexical decisions made on the basis of
the fleeting interests and motivations entertained by speakers” (Osgood, 1971:
529). No one has yet proven that the categories and structures posited by the
grammar play a prominent part in normal communication.
3.13 The problem of empirical reality can be illustrated with the common
assumption that the sentence is the primary unit of speech production and
comprehension (Ohmann, 1964a; Bever, Lackner, & Kirk, 1969; cf. survey {72}
in Levelt, 1978). This assumption is not borne out by everyday speech (cf.
I.4.11.2; ]V2.17; V.2.18, 30, 33, 45; V.3.7ff; VI.I.10), where the boundaries of
phrases and clauses often reflect intonational and conceptual units, not just
syntactic ones. This discrepancy obliges the proponents of abstract grammars to
manipulate evidence, e.g., to (a) convert it themselves into well-formed
sentences; (b) assert that well-formed sentences “underlie” the evidence;
(c) exclude any evidence that doesn’t happen to be a recognizable sentence;
(d) invent their own examples as needed; (e) rely only on a written sampling,
where sentences are usually complete and their boundaries well-marked (what
Abercrombic, 1965, calls “spoken prose”; cf. II.1.13);” or (f) use the
term “sentence” inconsistently to mean not just a grammatical structure, but
also a predication, a proposition, an utterance action, a speech act, and so
forth (cf. Beaugrande, 1980a: 11ff).
3.14
All these recourses can be found in mentalist linguistics. As a result, evidence
was subordinated to theory in unprecedented ways. Real discourse came to be
regarded as deformed and degenerate (e.g. by Ohmann, 1975: 298; Stockwell, 1977:
168). Paradoxically, this attitude is just as evaluative as that of traditional
prescriptive grammar, though for different reasons. The old grammarians
unabashedly defended their own usage as “good” and “proper,” as meeting
the demands of social prestige, elegant diction, and rhetorical effectiveness.
The new grammarians propagated their own usage by allowing their language
attitudes and social commitments to influence their judgements of
grammaticality. People most readily recognize a banal sentence as grammatical,
not because they have “internalised” a grammar of “well-formedness,” but
because they can easily imagine a context where the sentence could be used (Bolinger,
1968; McCawley, 1976; Snow & Meijer, 1977). Therefore, the old
grammarians’ samples taken from outstanding writers and orators of the day
could in fact seem deviant, marginal, or ill-formed to the new grammarians cf.
VI.1.If), who presented English as a flurry of inane sentences about‘John’
and ‘Mary’ — the star-crossed pair whose ideal, abstract exploits provided
the content of linguistic samples for years to come. English was converted from
a vehicle of memorable style into “ infinite set of banalities”.
3.15
With so many empircal limitations, standard transformational grammar can be no
more than a partial theory of literal paraphrase. Humans would create
sentences in ways comparable to the grammar if they had to recast a sentence
into a close paraphrase using the same word stems. This occasion is are in
comparison to looser paraphrasings, {73} reports, gists, and so on; and even
rarer among the totality of language activities. Chomskyan linguistics was thus
forced to exclude major aspects of language and insist on a version of
“competence” so narrow as to be irrelevant.
3.16 However, “transformations” could be useful in a broader science of text and discourse not centered on abstract syntax. Configurations mapping from one language level or processing phase to another are presumably “transformed,” e.g., when an array of goals of concepts is projected onto a linear surface text, or vice-versa (cf. IVI.7-9). According to Wundt (1970 [1900]), text production takes a mental configuration (an “idea” in the sense of III.2.14) of simultaneously activated components, organizes it into grammatically or logically related segments within an orderly hierarchy, and finally transforms it into the surface sequence in the text. Other types of transformations have been envisioned that convert: (a) a spatial representation into linear syntax (Degerman & Mather, 1972); (b) a “logical form” into a sentence, as in “generative semantics,” a counter-movement inside transformational grammar (cf. papers in Steinberg & Jakobovits [Eds.], 1971); (c) one gestalt-like configuration into another (Piaget, 1976: 130f); (d) the order in which concepts are activated into the order in which those concepts are expressed (Osgood & Bock, 1977: 91); (e) an underlying content hierarchy into a story, as in the “story grammars” developed by cognitive psychologists (see especially Johnson & Mandler, 1980; survey in Beaugrande, 1982d); or (f) a text into an elaborate network of logical representations, as in János Petöfi’s (1980) “text-structure/world-structure theory.” Some of these transformations also go in reverse. The proposals all concur that transformations need not convert one syntactic pattern into another, but may rather a non-linear pattern into a linear one (or vice-versa). This approach to “transformation” dese