II.
Toward a Science of Text and Discourse
II.A.
Paradigms for the study of language
1. If we define a text
as a communicative event (I.34), a discourse
would be a set of interconnected texts, the primary instance being the
conversation. It would follow that text and discourse are the main channels for
people to encounter language. If so, all work for cultivating or studying
language would have been at least implicitly or indirectly concerned with texts,
even though only a few were explicitly or directly so. Before moving ahead, we
can look back on past disciplines in this perspective.
2. Each discipline is circumscribed by a °schema° or °scenario° of shared concepts and methods regarding what questions to ask and where to seek the answers. The currently fashionable term is the paradigm, which the °philosophy of science° attributes to each normal science performing its routine work of solving problems (‘puzzles’). Periodically, a paradigm shift occurs, most radically in the scientific revolution. To cover its broad range of concerns, a °science of text and discourse° must ‘shift’ to a transdisciplinary paradigm that does not just displace some older paradigm with a new one, e.g., ‘sentence linguistics’ with ‘text linguistics’. Instead, ours should be a °meta-paradigm that integrates multiple paradigms° in order to situate their concepts and findings in wider contexts where disciplines can interact in full solidarity to develop a detailed °ecologist program for sustainable social progress° (I.60; II.132). To encompass the range of rich,interconnected issues we face today, we must transcend the philosophers’ popular but outmoded scenario of competing paradigms locked into a Darwinian struggle for survival (cf. III.24, 181; VII.199). In that spirit, it is essential to build upon our precursors and to appreciate the problems they faced and the solutions they attempted.
II.B.
Textuality in grammar, rhetoric, and logic
3. The oldest and richest tradition belongs to the maintenance and interpretation of prestigious texts, typically official, legal, ritual, sacred, or poetic. Evidently, the text, particularly the written inscription, was widely esteemed a potent social instrument for expressing and transmitting the important knowledge of the culture. This esteem is most evident when prestigious texts are cited in legitimizing discourse about what is right or binding, witness these two examples from the 15th and 20th centuries [8-9] (i.a.).
[8] I am talking of the statutes that have the force of canons and of decretals that are universally binding on the church […] since it
is useful to introduce any reform on the basis of precedent, I would submit an
imperial letter which is credited to
Constantine (Nicholas of Cusa, ‘On the supremacy of general councils in church
and empire’, 1433)
[9] [Prime Minister ] Begin
reviewed the text in detail, and
finally concluded, ‘This document is
not a proper basis for negotiations’ […] He focused on words
and their meaning […] In all, the
American team prepared 23 versions of
the Framework for Peace. I wrote the original Sinai agreement personally, and there were eight different texts
before we finished (Jimmy Carter, Keeping
Faith, 1982)
Evidently,
many people believe having written text settles things for good.
4. Yet the elite rank
and specialized training of many experts on prestigious texts — say, for the
Vedic priests or the Kabbalists of the Torah, or for today’s attorneys of
corporate law — suggests that the text can be a highly problematic social
instrument. Paradoxically, the experts apply specialized skills and procedures
to construct a meaning that has an adaptive
value
for them while purporting to have only registered
and presented the one
correct or
true meaning
. The authority of
the text is all too readily co-opted for authoritarian
moves of power (cf. II.129; III.10; V.14).
The °reifying ‘formalist’ myth of the self-sufficient text with a stable
and determinate meaning°, which inflates the individual over the social (cf.
I.39; II.110; III.107), serves the
dual agenda of enhancing the prestige of texts and securing the rights of
interpretation for elite groups (V.80; VII.199).
5. So in most historical settings, textual skills decide whether °access to knowledge promotes either power and inequality or else solidarity and equality°. Discoursal power moves are easy to °mystify as ‘literal’ and ‘true’ statements about ‘reality’° because few people properly appreciate the °potential of discourse to construct and negotiate reality° (cf. VIII.75, 131). Conversely, discourses of power are hard to °demystify° because doing so requires us to raise our critical consciousness and suspend our easy routines of understanding talk in order to draw wider and subtler connections (cf. I.59). We need to critique such familiar expressions as ‘nature’ and ‘reason’, e.g. in [10], and recognize such discourse to be a direct precursor of today’s myth of ‘intelligence’ and ‘aptitude’ being pre-determined by nature (VII.64).
[10] Every constitution is founded on natural law [that] exists by nature in reason. [the] wiser and more excellent are chosen as rulers in order that, endowed with a naturally clear reason, […] they may choose just laws and by these govern others and hear cases. […] thus those who are strong in reason are by nature masters and rulers of others (Nicholas of Cusa)
The
mystification of discourses of power has been abetted by the disinterest among
language-related disciplines in explaining the relevant strategies and providing
an explicit theory for such a long-standing and important practice. Evidently,
powerful people are not eager to have it made public how they use discourse in
ways that are quite adaptive for them and quite maladaptive for the
disempowered.
6. Three main
disciplines have traditionally been concerned with language. The discipline of grammar
has sought to expound the organization of a language in terms of forms,
patterns, and rules. Typically, °prestigious texts° were chosen as sources,
which enhanced the prestige of grammar and encouraged qualified scholars to
study and cultivate it. Using such sources placed traditional grammarians fairly
close to ‘text linguists’ and kept them from anticipating the
20th‑century rift between ‘language study’ versus ‘text study’
(cf. II.24). Their own specialized education and ‘high literacy’ legitimized
the early grammarians’ claim to ‘know the language’ and encouraged many of
them to act as °language guardians°
by adopting conservative, elitist standards and by making °prescriptions
for ‘correct’
usage
and proscriptions
against ‘incorrect’
usage°. This strategy was highly adaptive in creating a lucrative and
self-perpetuating market for expertise about grammar. In practice, the standards
were largely constructed along social lines to accentuate the dual distinctions
between writing versus speech and between a prestigious
variety or dialect
versus one or more non-prestigious ones (cf. VII. 251-55).
The reliability and generality of such grammars for representing actual usage
remained uncertain, insofar as the things people were not supposed to say were precisely ones which many people did
say.
7. The conservative quality of grammars was reinforced by the strategy of recycling earlier grammars, even ones from other languages, e.g., when the English abbot Ælfric Grammaticus (ca. 955-1020) relied in [11] on the Latin grammarians Priscian of Caesarea (5th century A.D.) and Aelius Donatus (4th

century).
Like many later grammarians,
Ælfric assumed that English grammar should be legitimized by borrowing upon
Latin grammar and using the same descriptive terms. Fortunately, the models they
admired, such as Priscian’s Institutio
de arte grammatica and Donatus’ Grammatica
urbis Romae, took a very broad view of grammar, witness Donatus’ sections
on the ‘voice, letter, syllable, feet, tones, schemes, and tropes’ (de
voce, de littera, de syllaba, de pedibus, de tonis, de schematibus, de tropis).
Hence, Ælfric’s notion of ‘grammatica’ (Anglo-Saxon ‘stæfcræft’,
‘skill in letters [of the alphabet]’, but also ‘learning’, ‘study’)
covered the sounds of speech and the construction of meaning as well, in each
case foregrounding written language. In modern terms, the ‘grammar’ extended
into ‘phonology’, ‘intonation’, and ‘semantics’ (cf. II.29f, 48ff,
63).
8. Rhetoric was a more directly textual discipline whose social
function was to teach active and public skills, especially for oratory, rather
than to cultivate the passive and private skills gleaned from the grammatical
study of °prestigious texts°. The rhetoricians saw in language less a
catalogue of ‘correct’ forms and patterns than an armory of discourse
strategies for practical goals. This scope was not necessarily narrower than
grammar, because the rhetoricians, while not trying to cover all the points a
grammar might address, emphasized richer factors of context, e.g., how to
persuade particular audiences. Rhetorical effectiv
eness may not coincide with grammatical ‘correctness’;
some audiences might see an inappropriate power move in the use of a cultivated
style preferred by grammarians. Rhetoric (with the Greek root ‘eírein’
meaning ‘speak’) has therefore been more political and populist and centered
on oral discourse, whereas grammar (with the Greek root ‘gramma’ meaning
‘letter’) has been more academic and elitist and centered on written
discourse. Privileging grammar over rhetoric in education right up into modern
times reflects the policy of granting preferential knowledge access to reward
‘high literacy’ and ‘standard’ dialect rather than the skills of popular
communication (VII.162f, 169f, 212f).
9. Recognizing the public demand for training in rhetoric and the profits of the schools already offering it (e.g. the school of Isocrates), Aristotle expounded his famous Art of Rhetoric in search of a systematic philosophical and psychological foundation. His method of classifying into subtypes has proven vastly influential ever since, even among modern linguists (II.34; V.45). For example, he distinguished between the ‘deliberative’ rhetoric for reaching decisions in assemblies, the ‘forensic’ (or ‘litigious’) for arguing cases in court, and the ‘epideictic’ for displaying elegant or ornate language. His advice about the proper style or usage of language was ambivalent, due his admiration for poetry and stage drama (e.g. by the sonorous Euripides). He recommended ‘appropriate’ language and the ‘normal idiom’ but also ‘exotic’ and ‘distant’ language that is ‘artificial’ but seems ‘natural’. For writing, his ominous suggestion that content ranks below style has stirred considerable annoyance (cf. V.39).
[12] the virtue of style is to be clear, and to be
neither low nor endlessly sublime, but appropriate
. […]
the discourse must be made to sound exotic, for men are admirers of what is
distant, and what is admired is pleasant. […] one must do this without being
noticed and give the impression of speaking not artificially but naturally.
[…] the technique is well concealed by drawing words from the normal idiom,
[…] we should make little use of exotic, compound, and artificial ones; […]
they involve too great a change from what is appropriate. […] written speeches
have more effect through their style than through their intellectual content.
Despite
a brief plea for ‘correct’ grammar (agreement of Gender and Number), he
differed prominently from the grammarians (and even more from the logicians) by
describing how to build rational or ‘logical’ arguments and ‘proofs’
upon uncertain or subjective premises, and how to gain power by exploiting the
audience’s emotions (anger, contempt, gentleness, friendship, love, pity,
envy, jealousy). He classified people into ‘character types’ by age and by
social rank, noting that ‘the rich are arrogant and overbearing’ and
‘mindlessly happy’, for ‘their wealth is the highest measure of all
things’, and ‘they believe themselves privileged to rule’; — just what
the wealthy backers of today’s °New Right coalition° devoutly and mindlessly
believe (cf. VII.32).
10. Logic
was a discipline aspiring to complete the search for a universal system of
knowing in the discipline of ‘philosophy’ (from Greek roots, ‘love of
knowledge’). Whereas some philosophers invoked an absolute ‘first
principle’ such as ‘Nature’ or ‘God’, others turned to language as the
privileged mode for knowing, meaning, defining, asserting, and so on (whence
‘logic’ from the Greek root ‘logos’ meaning ‘word’ or ‘speech’
and, technically, ‘sentence’). Early logic, with strong affinities to
mathematics and geometry, sought to construct proofs on certain and objective
premises rather than on the uncertain and subjective premises of rhetoric. So
whereas the grammarians cultivated a body of prestigious texts, and the
rhetoricians composed tactical arguments for debates, the logicians sought a
system of principles (or ‘axioms’) whereby statements and arguments could be
constructed and proven true or false, independently of text type and context, or
of speaker and hearer and their beliefs, attitudes, or goals. The term
‘logic’ might therefore justly subsume all relationships and methods whereby
the value of a term can be strictly determined through formal steps from the
known value of other terms (cf. III.167f). In the idealized complete system,
every fact, premise, and conclusion would be either true or false, freed from
the rich and evolving contexts of everyday experience and ordinary discourse.
11. A favorite illustration has been the classical syllogism:
[13] major
premise: All men are mortal.
minor premise:
Socrates is a man.
conclusion:
Therefore, Socrates is mortal.
Yet
even this threadbare syllogism is not so simple and certain. The conclusion is valid
only from a sparse reading of the
major and minor premises by handling the words and phrases as mere counters or
placeholders, but is convincing from a
rich reading based on our knowledge of
what the words mean and how the world is organized. The validity holds whether
or not we know who Socrates was or what is meant by ‘man’ and ‘mortal’,
because the syllogism merely uses language to restate the truism of set theory
that a property holding for all members of a set also holds for each individual
member (cf. II.50; V.113).
But we are convinced because we do
know all that, and we might be upset if the wording were changed to ‘all men
are moral’ or ‘Xanthippe is a man’, even though the syllogism would still
hold. We could only dispute the ‘facts’, which the logic doesn’t claim to
supply. Nor is the logic relevant if somebody retorts that Socrates was ‘immortal’
because his life and works live on, or interprets ‘man’ in an unusually rich
sense, as when Antony says of Brutus ‘This was a man’ because ‘the
elements’ were ‘so mixed in him’ (Julius
Caesar, V, v, 72-75). Logic disdains mixed elements.
12. Logic
symptomatically underrates the essential difference between language versus
mathematics and geometry, whose exactness and universality hinge on being
disconnected from content and context. The ‘meaning’ of numbers and their
combinations is precisely the quantities they express; the ‘meaning’ of
lines and figures is precisely their dimensional and positional relations (cf.
III.158). These sparse conditions do not hold either for a ‘natural
language’ like English or for human knowledge of the world and society. A
disquieting trade-off impends: the more rigorous and precise the logic becomes,
the sparser the meaning and the lower the cognitive and social relevance of its
statements (cf. II.50, 57; III.168).
13. Still, over the centuries, the project for using logic to definitively capture truth and meaning has enthroned logic as the prime model for all human thought and reasoning (II.96; III.167), both in ‘idealism’ and in some radical versions of its archrival ‘realism’, such as ‘logical positivism’ holding that certain (‘positive’) knowledge is based on observable physical phenomena, and ‘verificationism’ holding that knowing the ‘meaning’ of statements equals knowing how to verify them by observing the reality they describe. Similarly, the science of language has at times used ‘formal language’ as the leading model for ‘natural language’ (cf. II.58, 99; III.167-73). So philosophy, language science, and psychology have projected their own procedures for designing formal models and representations over onto human processes of thought and language at large. I hold this projection to be a category mistake whereby a theory is ‘short-circuited’ onto the domain of practices it purports to describe or explain (cf. II.96; III.169, 180; V.124; VII.63; VIII.17). How far human thought or language might be formal or logical is a question to be empirically discovered and not just quietly pre-empted or taken on faith (II.79;III.173; IV.30).
4. Fig. II.1 sums up how grammar, rhetoric, and logic might be assessed as three approaches to

texts. Typically, a grammar has been derived from passive skills honed on a body of prestigious texts, mainly written; a rhetoric by practicing more active skills on oratory texts, mainly oral though sometimes transcribed and embellished; and a logic by formal skills with using notations and determining the validity and truth values of mainly artificial texts.
15. Despite their disparate perspectives on texts, grammar, rhetoric, and logic co-existed over the centuries in a ‘trivium’, a trinity of eminent scholastic domains (called ‘liberal arts’). Some early authorities saw no tension among them, e.g., when Aristotle sought to merge rhetoric with logic and treated grammar as well (II.9). Our ‘modern’ age has seen them split up; grammar has survived best, heavily represented in schooling at all levels (VII.K.2), whereas marginal areas are left to rhetoric in departments of speech or journalism and to logic in philosophy. The heritage of the trivium has been a studious respect for high standards of style, cogency, and clarity that education has appreciated without being properly equipped to impart the requisite skills in equality (Ch. VII). Perhaps a °science of text and discourse° can contribute to a productive reassessment of criteria for grammatical standards, rhetorical effectiv eness, or logical validity.
II.C.
Textuality in philology
16.
The notions of ‘language science’ or ‘modern
linguistics’ owe much to the field of philology, which was established in the nineteenth and century, and
whose chief concerns were: (a) historical
connections among successive stages of a language; (b) comparative connections among languages from related groups or
‘families’; and (c) geographical
connections among various ‘dialects’ of a language. All three investigations
have gathered their main evidence from written text-artifacts and devoted
scrupulous attention to problems and inconsistencies in transmission and
transcription. Historical studies, particularly by the ‘Neo-Grammarians’,
sought to formulate ‘laws’ of language change, but also to produce reliable
textual editions (e.g. the Hildebrandslied,
1812), and to cultivate the language skills for studying and using these —
whence such academic domains as ‘Old English’ and ‘ältere Germanistik’.
Comparative studies, particularly those of the ‘Indo-European’ group,
inquired which languages were related, and whether the relations were due to
common ancestry or to other factors like migration and borrowing. Geographical
studies, often labelled ‘dialectology’, focused on forms or pronunciations
that varied from region to region in regular ways. This work pioneered thorough
methods to describe dialects apart from the usual pejorative social attitudes
(cf. II.6; VII.49, 171; VIII.96, 150).
17.
One eminent success in philology was to describe
the great ‘sound shift’ that split off from the other ‘Indo-European’
languages the ‘Germanic’ branch that would later produce ‘Anglo-Saxon’
and later still ‘English’. For example, the ‘unvoiced stops’ (where the
air passage is stopped but the vocal cords are inactive, II.29) written as
‘p’, ‘t’, and ‘k’, got shifted to ‘fricatives’ (where the air
passes with friction) written as ‘f’, ‘th’, and ‘h’, whence Latin
‘pater’, ‘tres’, and ‘centum’ versus English ‘father’,
‘three’, and ‘hundred’. A shift could be strikingly regular, affecting
virtually all relevant words and even progressing to a precise geographical
line; unaffected items are probably later borrowings or frozen proper names,
ritual terms, and so on. So the ‘sound shift’ shed light on all three
branches of philology: describing the history of languages and their common
ancestry as well as the evolution of dialects. The discovery nicely fit the
19th-century concept of ‘natural laws’ reaching all across the sciences. Here already, the study of sound systems
fostered an impressively tidy theory of language, and detached the language from
the discourse practices of social agents (II.29; III.137).
18.
Philologists also pioneered the technique of
working with authentic data from a corpus
of language samples; and they carefully described what they found people saying
instead of °prescribing° what people should say in the manner of traditional
grammar. The key question was which forms were attested,
i.e., °textualized° in preserved inscriptions or transcriptions for reliable
identification and collation. Here, the textualizing of forms provided the means
not merely to determine their function and meaning — as for all language
(I.41) — but to decide which forms merited scrutiny at all. Although
‘Indo-European’ is commonly held to be a scholarly reconstruction and an
abstract system of forms rather than an actual language, some unrecorded
language(s) resembling it must once have been textualized in order to develop
and stabilize the formal regularities we can trace in its descendants. And the
recovery of genuine texts in any early offshoot of Indo-European is a
sensational event, witness the stir in our own century over the discovery of
Hittite going back 3,000 years.
19.
Recent research has raised again the prospect
that, contrary to the standard view among philologists, that Indo-European may
not be unrelated to other language families after all. Supportive evidence
includes the ‘-m-’ in First Person forms and the ‘-t-’ in Second Person
forms consistently appearing across the whole group called ‘Eurasiatic’ by
Joseph Greenberg, including also Uralic
, Altaic, Korean-Japanese-Ainu, Gilyak, Chukchi-Kamchatkan,
and Eskimo-Aleut. Some linguists, such as Morris Swadesh, have even revived the
prospect, once discounted by philologists, of a monogenesis
whereby all the
world’s languages descended from a single one. How they might have evolved
into separate groups is being actively discussed in the wake of startling
findings that populations with different language families also differ in their
genetics. These findings converge with burgeoning explorations of the
relationship between language and genetics.
20. Fig. II.2 shows how the branches of philology might be seen as three more approaches to texts.

The historical branch compiled and edited documents and inscriptions to trace the historical stages in a language. The comparative branch mapped the relations among languages or ‘language families’ indicated by patterns of attested forms complemented with painstakingly reconstructed forms. The geographical branch used surveys and questionnaires to elicit the dictation or transcription of regional varieties. In each of the three, the concept of ‘language’ varied to focus on evolution, comparison, and variation, respectively.
21.
Philology grappled valiantly with the problems of
analyzing authentic data and of
weighing textual evidence against conflicting or missing evidence, especially
when postulating non-attested forms. With living languages, in contrast,
informants can be consulted, and the obstacle may be not too little evidence but
too much — too many diverse opinions and too little consensus about the
vocabulary, grammar, or sound systems in various dialects or geographical areas.
Most techniques of taking dictation rarefy the factors not easily represented
through the conventions of writing (III.195). And if schoolmasters are the
informants, as for the monumental Deutscher Sprachatlas research, their usage may be more
self-conscious than the usage of ordinary speakers.
22. Despite such problems, philology inaugurated major advances in the directions later pursued by modern linguistics. The main concern was no longer grammatical, rhetorical, or logical standards, but the systematic properties of all the data that could be assembled in a corpus. Philologists appreciated the °prestigious texts° of literary and poetic discourse and scrupulously collected and edited them. But they also worked to uncover and preserve any surviving samples of non-literary discourse, such as statutes, proclamations, proverbs, prayers, incantations, personal letters, and so on. This concern prefigured the more resolute move of linguistics to affirm the centrality of spoken ordinary language over written elite language (cf. II.24ff). Still, many modern linguists have not properly acknowledged their debt to philology. They may have been motivated by rivalry or by competition within academic language programs treating ancient and medieval languages from a mainly literary viewpoint. Yet these programs did pave the way for modern language programs in the schools and universities. Perhaps a °science of text and discourse° can requite our own debt to philologists by providing further perspectives for the study of the early texts they have so diligently preserved.
II.D.
Textuality in modern linguistics
23.
The term modern linguistics designates a field of research and method that
emerged in the 19th century and was consolidated in the 20th. It resolved to be modern
by making a fundamentally new departure from the earlier approaches to language
outlined so far in this chapter. It would be scientific
by framing explicit theories of
language and implementing disciplined
practices. It would describe what
people do say rather than prescribing
or proscribing what people ought or ought not to say. It would provide comprehensive
coverage of the whole language
rather than selective coverage
of grammatical, rhetorical, or logical issues. And it would defend the
primacy of spoken language, e.g. by
developing elaborate methods for representing sounds more accurately than with
conventional orthographies (especially for English, VI.56); although written
language samples were often used and their visual clarity did some ‘cheap
explanatory work’ in identifying units and their boundaries (cf. II.29, 34,
38; III.195, 202; IV.1)
24.
However, the ratio between theory and practice in modern linguistics has remained
profoundly problematic (cf. IV.21) The human practices connected to language are
dauntingly vast and diverse, and certainly do not, by themselves, stipulate the
form or the content a theory of language ought to have. So, many linguists —
though by no means all, as we shall see — have attempted to stipulate a theory
from the ‘top down’, i.e., in purely theoretical terms, which would
stipulate the methods for connecting it to the practices of linguists and of
speakers. A famous project of this kind was inaugurated by Ferdinand de
Saussure’s ‘course in general linguistics’ (published in French in 1916),
which declared the ‘true and unique object of linguistics’ to be language
studied in and for itself’, termed
‘langue’ in French, and placed it in a stringent dichotomy against language
use, termed ‘parole’. Henceforth, the ‘classical’ program for °mainstream
linguistics° — as we find it reflected in influential works and in the
agendas of university departments, professional journals and conferences, and so
on — has typically constructed its theories of language according to tenets
like these:
(a) Language is a phenomenon distinct from other domains of human knowledge or activity.
(b) Language should be described apart from the conditions under which speakers use it.
(c) Language should be described by internal, language-based criteria.
(d)
Language should be considered a uniform,
stable, and abstract system in a single stage of its evolution, i.e., in a
‘synchronic’ perspective at the present time rather than in an
‘evolutionary’ or ‘diachronic’ one across historical time.
(e) The description should be stated at a high degree of generality — what applies to the entire language or even ‘universally’ to all languages.
In
the early stages, these tenets strategically suited the academic politics of
establishing a °monodisciplinary normal science° in a sparse and
self-conscious scientific climate. Disconnecting °language by itself° from
issues in neighboring fields, such as literature, history, folklore, philosophy,
psychology, and sociology, was expected to make the description more compact,
unified, and rigorous by imposing strict limits on which data get accredited,
i.e., accepted as worthy of investigation and placed into the established
categories.
25. But a theory of this type raises a serious problem for practice: language is not encountered by itself but only in use. Undaunted, linguists undertook to construct a language as a theoretical ‘system’ presumed to ‘underlie’ all of its practical uses, though the preferred term was discover to uphold °classical realism° by implying that this system is a reality waiting to be found. The chief practices of discovery have been data-handling moves like these:
(1) collating : the data samples in a °corpus° are compared and contrasted to see what they have in common, e.g., which word types frequently co-occur;
(2) consulting informants : native speakers are asked to judge or rate data samples of their language, e.g., which among several versions of an utterance they would be more or less likely to say;
(3) generalizing : certain aspects of the data are construed to be general ones, e.g., that the ‘Subject-Verb-Object’ order of a sample set of English Sentences is a typical pattern for the whole language;
(4) rarefying : the ‘rich’ data as we find them in real discourse are rendered ‘sparse’ by disregarding certain aspects or details, e.g., variations in the actual pronunciation of the ‘same’ language sound;
(5) decontextualizing : the data are removed from the observed context and treated as if they could occur in isolation or in a wide range of contexts, e.g., irrespective of the social status of groups or speakers;
(6) introspecting : the linguists base estimations on their own °intuitions° about the language, e.g., which sentences conform to ‘grammatical rules’.
These
practices were performed as °cognitive
moves° while organizing materials and constructing theories and models, and
as °discoursal and social moves
° while stating the results and informing colleagues through
reports, conferences, journals, books, and so on. Because our own language
strongly affects how we classify and label things, the discoursal aspects of
‘doing linguistics’ are far more influential and significant than has been
widely recognized.
26.
The practices for these data-handling moves
differed between two broad approaches to language. Fieldwork
linguists (as the term says) go ‘work’ in the ‘field’ of cultural
and social activities and carefully record a corpus
of °authentic data°: what
native speakers of a previously undescribed language or dialect are actually
observed to say. In contrast, homework
linguists (to coin a new matching term) ‘work’ at ‘home’ (or in the
office) with data that have been supplied from an outside source. The data may
be authentic, as in a computerized corpus of real discourse (II.64). But they
may also be °invented data° by the
linguists, who are fluent speakers and claim to represent the whole community of
native speakers. This tactic is highly problematic, and we need to consider why
(II.36; IV.21).
27.
In practice, fieldworkers never encounter °language
by itself°, even if the official theory purports to describe it; you confront
language in °rich connections with language use° and exploit modes of data
that are not just ‘linguistic’, such as the procedures of buying and selling
goods (II.33). Your work is always strongly °data-driven°,
and °practice-driven°, especially when you join in the social
practices of interaction and conversation. The practices supply continual tests: if your theory or your conclusions are wrong,
you’ll get corrected, misunderstood, teased, or ignored. In contrast, homework
can disconnect theory from practice by relying on your own introspection and intuition
to formulate and describe what native speakers are presumed to know about the
language by itself. The work becomes strongly °theory-driven°, especially when you pursue a construction that, by
definition, does not directly manifest itself in data or in practice, and when
you try to disconnect invented data from cognitive and social constraints
. Nor are your theory and conclusions tested by being put
into practice. Instead, theories and descriptions get evaluated mainly by standards
of design, such as formality, rigor, elegance, or compactness (cf. II.43,
92). In exchange, authentic data such as everyday conversations may look messy
or deviant, hardly worthy of attention (II.41; VII.133). So the quest to
discover the language-system as an underlying
reality can paradoxically lead radical homework linguists to replace
the language with a °technical
construction° which exists only in their own theorizing and can be
connected to language data only when these have been removed from practice and
translated into some °sparse notation°. Along the way, real speakers as
informants get marginalized and silenced, and a written ‘standard’ variety
parades as the language itself (VII.165; VIII.18) — the linguists’ version
of the °fundamental contradiction between inclusive theory (e.g. for covering
an ‘infinite set of sentences’) versus exclusive practice° (e.g. for
disregarding ordinary conversations) (cf. I.6; II.41). And cultural memory gets
erased to avoid questions about how concrete discoursal practices might evolve
and for whose benefit (III.113).
28.
After 70 or 80 years of research, the status of °language
by itself° is still uncertain. Since it is not a manifestation but a
theoretical construction, it cannot be verified by conventional observation of
the practices of °language use°. So we can move to a higher plane and observe
the progress of linguistics as a science
. There, our testable hypothesis might be: if ‘language’
is indeed a °uniform, stable, and abstract system we can describe by ‘purely
linguistic’ criteria°, then we should observe steady long-range increases on
three ‘test scales’: (a) coverage, i.e., how much language data have been described; (b) convergence,
i.e., how far various descriptions get corresponding results; and (c) consensus, i.e., how far linguists agree about how the description
should be stated and assessed. What we actually observe, however, is not a
steady overall increase but an uneven pattern of increases in some domains, and
stagnation or short-range decreases in other domains. How far linguistics might
count as a °normal science° (in the sense of II.2) is still not decided; its
‘normalcy’ fluctuates periodically and substantially (II.41).
29.
We observe the stablest increases
in the
description of language
sounds, which had already anchored the philologists’ concept of language being
governed by ‘natural laws’ (II.17). In the domain of phonology
(or ‘phonemics’), linguistics discovered a °uniform, stable, and abstract
system° of phonemes: theoretical
minimal units which correspond to the practical units of language sounds, and
whose quantity and nature can be precisely described in practice for any
language by the physical and mental criteria that ‘differentiate’ them (cf.
II.45, 48). Physically, each phoneme is connected
to articulatory events and locations,
e.g., a ‘voiced dental stop’ such as /d/ produced when the vocal cords
vibrate and the air flow is blocked by the teeth (cf. II.17); the concrete
physiological processes are investigated in the kindred domain of phonetics.
Mentally, each phoneme must be capable of
differentiating between elements (words or word-parts) that also differ in
meaning, e.g., /d/ versus /t/ in ‘hid’ versus ‘hit’ (III.92). These
two sets of criteria nicely converge in practice to give tidy results: complete
coverage and a high consensus among phonologists. The description also reaps
some ‘cheap explanatory work’ from the visual match between the notation of
many phonemes and written letters, or technically, the graphemes,
of the popular Roman alphabet — even though spoken language was the official
domain to be described (II.23).
30. This huge success
on our three test scales made the study of sound systems in phonology into the model
paradigm for early modern linguistics, whence the series of ‘‑eme’
terms (e.g., ‘morpheme’, ‘lexeme’, ‘syntagmeme’, ‘sememe’)
modelled on ‘phoneme’. Henceforth, °mainstream theories° confidently
projected language to be an array of °uniform, stable, and abstract°
‘subsystems’, usually called levels,
each consisting of a repertory of
theoretical minimal units which correspond
to the practical units in the language data and which constitute a system of
differences (cf. II.39, 45, 114,
123). A complete description of a
language would be the sum of the descriptions for each subsystem, supplied by
linguists investigating the several areas within a neat division of labor.
Research of any scope or in any area could contribute on its own terms, e.g., by
doing ‘pure phonology’ or ‘only syntax’. In this fashion, linguistics
would be ’mono-disciplinary’ and would also contain a set of smaller ‘monodisciplines’.
31. Impressive success was also attained in the domain of morphology (or ‘morphemics’). Here, the theoretical minimal units are the meaningful forms called morphemes, which correspond to the practical units of word-parts, such as word-stem, prefix, or suffix, and of non-segmentable words. This domain offered vital support for °fieldwork in ‘Peripheral’ regions on previously undescribed languages of lesser diffusion° that often °defamiliarize the Western fieldworker° by presenting much more elaborated systems of meaningful forms than do the familiar ‘Western’ languages of ‘Center’ regions (e.g. English and French) (cf. II. 37, 86; VIII.9). The ‘polysynthetic’ languages spoken, say, by Native Americans join morphemes into long word-like constructions that would be expressed by complex phrases in other languages, e.g., [14] from Paiute of south-western Utah (reported by Edward Sapir), or even by whole utterances, e.g., [15] from Aymara of Peru and Bolivia (reported by Martha J. Hardman) (cf. II.62).
[14]
wii - to
- kuchum-punku-rügani-yugwi
- va
-
ntü - m(ü)
knife-black-buffalo
- pet -
cut up-sit (plural)-future-participle-animate plural
‘they who are going to sit and cut up with a knife a black cow (or bull)’
[15]
aru-si
- kipa
- si -
p -
xa - ña - naka -sa -
ki - puni -
rak
- spa -
wa
speak-refl-bridge-prog-pluv-comp-nom-plun-
we -just-always-also- vrb-3rdp+des-aff
‘I
know it is desirable and necessary that all of us, including you also, keep
communicating’
The
piece-by-piece or ‘interlinear’ translations customarily provided in
fieldwork linguistics (cf. VI.73) combine ordinary language with specialized
functional labels (like ‘animate plural’) to suggest what the various
morphemes contribute. In the Aymara data [15], the morphemes convey not just the
content such as ‘speak’, but a rich palette of grammatical functions and
aspects, such as reflexive (refl) (i.e. ‘speak to each other’), progressive (prog),
plural of numerousness for the verb (pluv) and for the noun (plun),
completive (comp) indicating sententious certainty, a marker for the
third person of the desiderative aspect (3rdp+des),
a transition from the opening verb to a nominal (nom) and then back to a verbal (vrb), plus a final affirmation marker (aff)
that turns the whole cluster into a sentence type. The ‘idiomatic’ or
‘free’ translation shows English using ‘clausal framing’ to convey
aspects such as sententious certainty and desiderative (‘I know’, ‘it is
desirable and necessary’).
32. How has morphology
fared on our three test scales proposed in II.28? Coverage faces the tough
practical problem that the stock of minimal forms of a language could include
the entire vocabulary of indivisible words plus word-parts. The preferred
solution has been a theoretical division between the grammatical morphemes in stable, compact classes, e.g., the set of
all verb inflections for ‘present’, ‘past’, and so on, versus the lexical
morphemes in unstable, open classes, e.g., the set of all verbs or verb
stems. Morphology would seek to provide full coverage of the grammatical ones as
the proper ‘morphemes’, whereas the lexical ones would be lexemes
and would be consigned to lexicology,
a domain often left at the borders of linguistics (cf. II.38, 64). The ensuing
division between ‘grammar’ versus ‘lexicon’ became an accredited
dichotomy in °formal linguistics° (II.49), whereas °systemic functional
linguistics° has projected the fundamental unity of the lexicogrammar
apportioning the work of expression to grammatical or lexical resources in a
characteristic way for each language or language type (II.63).
33.
Morphology attained its best convergence through
the fieldwork described in II.31, where theory and practice are always
connected, and where all available clues must be exploited to discover the
meaningful parts in transcriptions of actually observed utterances. This job
requires acute skills in listening and transcribing, but also in relating
utterances or utterance-parts to °cognitive and social constraints°, whether
or not these might be described in ‘linguistic’ terms (II.27). Moreover, you
must meet the practical challenge of moving from being a total ‘outsider’
for the community of speakers over to being an ‘insider’ who can speak the
language well enough to interact in communicative practices (cf. VIII.9). Along
the way, you continually test and refine your description, and adjust your
theory accordingly.
34.
Consensus is supported by the emphasis on identifying
and isolating the morphemes in the
data, which can extract some ‘cheap explanatory work’ from the linear
distributions being visually displayed by a transcription into a reliable
‘phonetic alphabet’ (cf. II.23, 29, 38). One straightforward method is to
repeatedly segment the data until no
further meaningful subdivisions appear feasible, and then to classify the
segments and the patterns these constitute. This ‘immediate
constituent analysis’, if applied ‘in all observation of
word-structure’, was expected to eliminate any ‘inconsistency of
procedure’. You can, for instance, readily distinguish the bound
morphemes that can occur only as a part of a longer unit from the free
morphemes
that can occur by themselves.
35.
But even so, morphology is less congenial than
phonology for the °classical program of mainstream linguistics° because the
data are not nearly so tidy and compact. For example, Modern English has fairly
few indisputable bound morphemes, such as the endings for plurals of nouns and
for tenses and persons of verbs, or the endings such as ‘-able’ and
‘-ish’ for deriving words; in return, much of the ‘grammatical work’ is
now done with the so-called Function
Words (e.g., Articles, prepositions,
and conjunctions) which, like
bound morphemes in other languages, belong to small or closed sets and have key
phrase positions but sparse or indeterminate meanings. The ‘lexical work’ is
mainly done with the so-called Content
Words (e.g., nouns, verbs,
and modifiers) which belong to large or open sets and have
flexible phrase positions but richer and more determinate meanings. A Content
Word is more self-sufficient and more likely to be uttered alone (e.g.,
‘fire!’, ‘run!’, ‘terrible!’) than is a Function Word (e.g., ‘?the!’,
‘?at!’, ‘?unless!’),
except maybe to emphasize a contrast. Also, the Content Words are more likely to
take on the remaining bound morphemes (the ‘inflections’), e.g., to signal a
‘declension’ of nouns with singular versus plural or a ‘conjugation’ of
verbs with present versus past (IV.20, 47, 62, 82).
36.
English also illustrates the problems for
describing a language that has assiduously borrowed word-stems, prefixes, and so
on, e.g., from French, Latin, or Greek, which are no longer recognized as
meaningful units by many contemporary monolingual speakers. Should a
morphological description include not just the more obvious units like the
prefixes ‘in-’ and ‘im-’ for negation alongside ‘un-’, ‘non-’,
or ‘a-’ but also the erudite units like ‘-pter’ (‘wing’) in
‘helicopter’, where speakers would more likely identify the final ‘-er’
as an agentive suffix (compared, say, to ‘propeller’) on the verb ‘helicopt’,
which has in fact appeared in a recent English dictionary with the irritable
label ‘back formation by false analysis’? Such etymological data would bring
in language history and thus undercut the °mainstream program to describe
language in a single stage of its evolution° (II.24(d)).
37.
Still, these problems for theory have not kept
morphology from making huge practical contributions by describing hundreds of
previously undocumented languages through active fieldwork, such as that
sponsored by the Summer Institute of Linguistics and carried out by Kenneth,
Evelyn, and Eunice Pike, Robert Longacre, Joseph Grimes, and their many diligent
colleagues and students (cf. IV.41; VI.73). This work is undoubtedly the most
memorable and enduring achievement of modern linguistics. We have gained not
merely a deepened and defamiliarized sense of the striking morphological
diversity of languages, but also refined methods of investigation we can reapply
to the more familiar, worked-over languages (cf. II.86, 108). We can also rework
the classifications of languages begun by philologists like Rasmus Rask and
continued by linguists like Edward Sapir, Joseph Greenberg, and Morris Swadesh
(cf. II.19).
38.
Looking to reapply the successful methods of
morphology, the next domain in terms of unit size would logically be lexicology,
whose theoretical units, the lexemes,
correspond to the practical units of words; but this domain did not at all fit
the linguists’ beloved conception of a uniform, stable, and abstract system.
So the next domain taken up was actually the subsystem of syntax,
which concerns the organization of phrases and clauses in ‘structures’
or ‘constituents’. Here, fresh
problems cropped up. Consensus was hard to achieve about what the theoretical
units, prospectively called syntagmemes
(after the terms ‘phonemes’ and ‘morphemes’) or ‘phrase structures’,
might be, and how they might correspond to the practical units ranging from just
one word (e.g. ‘help!’) to a phrase and on up to an extended clause or
sentence. Since the repertory obviously would not consist of minimal units, some new mode of theory would need to deal with complex units, Nor does it seem feasible to give an exhaustive, precise
coverage of phrases and clauses; even the traditional division into
‘Subject’ and ‘Predicate’ can leave tricky residues, e.g., signals of
the speaker’s viewpoint like ‘frankly’ (cf. IV.28, 206). Once again, the
visual appearance of data written down does some ‘cheap explanatory work’ in
indicating some divisions between words and between phrases (cf. II.23, 29, 34).
But the linear sequence is not ‘rich’ enough: we can inspect the positions of items but not the relations
among them — we can see where
things are but not why they are there
or where else they could be.
39.
Evidently, the methods for segmenting data into
formal units and classifying these into repertories supported °convergence and
consensus° much better for phonology and morphology than for syntax. Linguists
began to cast about for other theories and methods, particularly ones that would
still treat language as a °uniform, stable, and abstract system°. So the
system was redefined to consist not of a repertory of theoretical minimal units
(II.30) but of a repertory of theoretical rules
for arranging simple units into complex units. Here, the correspondence
between theoretical units and practical units dramatically receded, insofar as
these rules would not produce (or describe) the
phrases and sentences themselves, but rather the ‘underlying structure’ of phrases and sentences. This new °paradigm°
therefore felt authorized to reject the °data-driven and practice-driven
methods of descriptive fieldwork°
for ‘classifying and organizing data’ and ‘extracting patterns from a
corpus of observed speech’, in favor of theory-driven
methods for designing a °generative homework
model of language by itself°. The model would be a generative
grammar: a highly °technical construction° whose rules would °generate all the
grammatical sentences of the language by assigning them structural descriptions°, and no ungrammatical
sentences (cf. II.105). The grammar does not specify how people actually produce real sentences or why they say what they do (although
discussions kept using terms like ‘producing sentences’); it only specifies
the abstract structure of possible
sentences. The homeworkers craftily declared it ‘unreasonable to demand of
linguistic theory’ that it stipulate a ‘discovery procedure’ ‘for
actually constructing the grammar, given a corpus of utterances’ (cf. VII.132,
326); ‘how one might have arrived at the grammar’ was ‘not relevant to the
program of research’ — ‘one may arrive at a grammar by intuition,
guess-work, all sorts of partial methodological hints, reliance on past
experience, etc.’ Of course, practical discovery procedures were just what
fieldworkers had always demanded of linguistic theory, and they were shocked to
be called ‘unreasonable’ by people using ‘hints’ and ‘guess-work’.
40. The ratio between theory and practice in modern linguistics became more elusive than ever (cf. II.24). The °dichotomy in the classical program of mainstream linguistics between language by itself (‘langue’) versus language use (‘parole’)° (II.24) was redrawn between competence, i.e., what speakers of the language know, versus performance, i.e., what speakers actually do or say. ‘Competence’ belonged to an ‘ideal speaker-hearer in a completely homogeneous speech-community who knows its language perfectly’, and determines only what is grammatical, i.e., described by the grammar; ‘performance’ would determine what is acceptable, i.e., approved by real native speakers. This line of reasoning installed the idealized grammar as the formal model for competence (cf. VII.K.2). It was easy to invent plainly ‘grammatical’ or ‘well-formed’ examples like [16] and plainly ‘ungrammatical’ or ‘ill-formed’ ones like [16a], while still others seemed ‘grammatical’ and yet disturbingly odd like [16b] (cf. IV.129).
[16] Revolutionary new ideas
appear infrequently.
[16a] New appear revolutionary
infrequently ideas.
[16b] Colorless green ideas sleep
furiously.
°Introspection
and intuition° were expected to underwrite an explicit formal statement of the
status of all such samples. But from that day to this, no such statement has
been produced for any language. In hindsight, it seems evident that
introspection and intuition cannot achieve in practice the task set down for it
by the theory (cf. II.27, 65). No strong evidence suggests that the native
speaker’s intuitive knowledge of the language is substantially more precise
and uniform than his or her actual utterances and can be disconnected to make
purely ‘grammatical’ judgements about invented sentences. Introspection is
opportunistic, coming into practice when data are presented and evolving to
adapt to viewpoint and focus, rather than subsisting as a stable theoretical
system when it’s not being used.
41.
At this stage in °mainstream linguistics° with
°syntax as the model paradigm° (cf. II.30, 49), its status of a °normal
science° fluctuated uneasily (II.28). The increase along the three °test
scales° proposed in II.28 stagnated and gradually turned into a decrease. In
theory, full coverage of language data
was built right into definition of the °grammar: assigning a structural
description to all the grammatical
sentences° (II.39). But in practice, only modest sets of sentences could be so
devised that their °grammaticality° was indisputable; the rest were sensitive
to varying contexts and interpretations, which are presumably affected by °performance
factors°. The homework linguists had once proclaimed that ‘descriptivist’
°data-driven practices° only ‘precluded the development of a theory’ (a
‘theory’ being of course a formalist grammar); the lesson now emerged, with
high poetic justice, that ‘generativist’ °theory-driven procedures°
preclude the description of adequate data in practice. The same grammar
purporting in theory to provide total
coverage of all the ‘well-formed
sentences of English’ might in practice provide almost no coverage of a corpus of observed speech in everyday
conversations, which would often look ‘ill-formed’ and which was always
produced by real speakers who are neither ‘ideal’ nor ‘homogeneous’ and
certainly don’t ‘know the language perfectly’ (cf. II.27, 54, 57, 66, 95).
A paradoxical scenario arose: reality being claimed for a °technical
construction° while rejecting the reality of human discourse — inclusive
theory with exclusive practice (cf. I.6; II.27). And instead of becoming a
genuine normal science, generativist
linguistics could only be a normalizing
quasi-science that cannot describe authentic data but only the data it has
expressly normalized, thereby rendering them empirically undecidable (cf.
III.172).
42.
Convergence among the descriptions of data was
impeded by the burgeoning theoretical apparatus of formal rules and notations
and innovative terms. True to the ‘grammar’s’ name, each fresh rule-set
‘transformed’ the data’s appearance, sometimes closer to the ‘surface
structure’ of sentences as we find them written down in ordinary
orthography and sometimes venturing at varying distances into the ‘deep
structure’ that was held to ‘underlie sentences’ and to ‘generate’
them through procedures of ‘transformation’,
e.g., to convert Active into Passive. The prerogative to ‘transform’ the
‘surface data’ often led to divergent results, especially if, as was freely
conceded, ‘surface structure’ is ‘unrevealing as to underlying deep
structure’, and if ‘the grammar does not, in itself, provide any sensible
procedure for finding a deep structure of a given sentence’.
43.
Consensus was concentrated in the early phase when
‘generative grammar’ still had a ‘standard’ model’. Soon, the leeway
for inventing theoretical rule-sets spurred proposals for competing models. The
fuzzy connection between theory and practice in such models prevented using a °corpus
of authentic data° to establish a consensus about the best model; instead,
models were hotly debated on theoretical grounds (cf. VIII.22). Over thirty °technical
constructions° still contend within the field, bearing trademark names like
‘case grammar’, ‘Montague grammar’, ‘lexical-functional grammar’,
‘X-bar theory’, or ‘government and binding’, and advocated on technical
and formal criteria of design (II.27), which again do not favor consensus.
44. Perhaps the evolution of modern linguistics outlined in this section could be retold as the search for constraints, with constraint being broadly defined as any factor making some items or patterns of a language more or less likely than others. The Saussurian conception of ‘a system wherein everything holds everything else in place’ (‘un système où tout se tient rigoureusement’) suggests that a °language by itself° (‘langue’) consists of a complete set of standing constraints. The °generative° conception further assumes that the range of those standing constraints is formally circumscribed within the borders of the ‘grammar’. An alternative conception, sponsored by ‘systemic functional linguistics’ and related methods, sees language as a system (or ‘multi-system’) of evolving interactions between standing versus emergent constraints, and among linguistic, cognitive, and social constraints. Language is interposed like a layer inside a cake (Fig. II.3): society uses language to understand and

appropriate theworld
, while the world passes via language into a socially
shared ‘world-model’. The °classical
program of °mainstream linguistics° would disconnect
language from this configuration, as if rolling the layers apart (Fig. II.3,
right side). °Language by itself° can be described in
‘purely linguistic’ terms
only if
it can hold firm and
continue to subsist and operate upon its own internal, standing constraints (cf.
II.54).
45. But can language by itself hold firm? It apparently can in some °sparse sectors° whose constraints are general and uniform and whose organization seems fairly °frozen°. In phonology, every °phoneme° is held precisely and uniquely in place by the physical constraints of articulation and by the mental constraints of being able to differentiate between elements that also differ in meaning (II.29). The physical constraints are the clearest, but the mental ones can be easily met by finding at least one contrastive data pair like ‘hid’ and ‘hit’, whose members do not mean the same thing; we needn’t state what they mean or how their meanings differ, we only need a ready consensus that the meanings do differ (cf. II.48).
46. Internal standing constraints also suffice for some fairly sparse and frozen sectors of morphology. For example, the ‘inflections’ of nouns and verbs can be
covered
and classified without having to state their often complex relations to
cognitive constraints (e.g., what might be the Agent of an Action) and social
constraints (e.g., how Pro-Nouns can signal power or solidarity). Also, the
distributions of morphemes can be firmly grasped by observing them in a °corpus
of authentic data°, where the speakers themselves were respecting cognitive and
social constraints (cf. II.33, 58, 75).
47.
In syntax, however, where the constraints are more
specific and diverse, the search was less successful. °Descriptive syntax°
could not progress very far beyond morphology, e.g., in ‘phrase structure
grammars’, and identified the linear positions of segments much better than
the constraints that would put them there. Syntacticians kept looking
‘deeper’, whence the shift from fieldwork over to homework methods
(II.39ff). Linguists apparently assumed that purely syntactic constraints corresponding to ‘rules’
would emerge most clearly from invented data out of context, since authentic
data in rich contexts obviously obey non-syntactic constraints too. Stating
which sequences (or ‘sentences’) can or cannot occur in theory should prove
easier than stating which ones actually did in practice. Predictably, the
ambition to make syntax stand on its own constraints highlighted the frozen
islands of language, i.e., the stabilized formal patterns of °sparse
standing constraints° such as ‘article + noun’ in English. Yet even in
languages which, like English, have a fair quantity, these frozen islands by no
means suffice for continuing increases in °coverage, convergence, and consensus°.
Without realizing it, generative syntax had taken on the unworkable task of freezing
the whole system, and each attempt necessarily got partial and divergent
results (IV.3). Even simple invented data (e.g., ‘John is eager to please’,
II.66) may refuse to freeze over: the more you fixate and analyze them, the
likelier you are to notice other aspects or interpretations.
48.
This predicament encouraged widening the search
for constraints to encompass semantics,
the investigation of the meanings of
language. Early theorizing had implicitly disconnected semantics from the rest
of linguistics by insisting that the relation between form and meaning is arbitrary,
i.e., not motivated by any natural bond. Intriguingly, this move deviated from
the °mainstream program° by appealing to the history
and evolution of language: how the
meanings came to correspond to the sound patterns we now observe. But the
relation between form and meaning is not
all arbitrary within the system and the consensus among speakers (II.62); it
eludes linguistics because it is always
evolving in multiple dimensions on several levels. In phonology, the
phonemes need merely differentiate meanings
(II.29, 45). In morphology, the morphemes grammaticalize
meanings, but the meaning may not be decided until a morpheme gets used,
e.g., the English Possessive ‘‑’s’ that might indicate many
relations beside possession, e.g., part (‘the car’s engine’), temporality
(‘tonight’s feature film’), locality (‘Hollywood’s fashion
industry’), and so on. In lexicology, the lexemes lexicalize
meanings that are typically stable though often adaptive, e.g. ‘freedom’
(VII.10, 18, 31). In syntax, the syntagmemes linearize
meanings that are also grammaticalized and lexicalized. This scheme, which
will be enriched later on (II.62; III.92, 203, 232ff), does not foresee a level
or domain for ‘meanings by themselves’, fully separated from these other
levels: such a level might indeed look quite ‘arbitrary’.
49. At all events,
semantics long led a shadowy life within linguistics, and was often left to the
philosophers. Some linguists even vowed that meaning should not be a part of
their ‘science’ at all. Others undertook to develop a semantics closely
modeled upon the better-known levels. When phonology was the °model paradigm°
(II.30), ‘structural semantics’ postulated a repertory of sparse-looking
theoretical minimal units called ‘sememes’
(or ‘semes’ or ‘semantemes’), such as ‘± Animate’ or ‘±
Human’. When syntax became the model paradigm (II.41), this scheme was taken
over into ‘semantic features’ (or
‘markers’) specified by formal rules and situated in the ‘lexicon’
rather than in the ‘grammar’ — these two being considered separate levels
or components, as they already had been in morphology (cf. II.32, 75; IV.14).
Neither approach fared well on the °test scales of coverage, convergence, and
consensus°. Research centered on programmatic demonstrations with meagre
examples, and even simple analyses (e.g. ‘kill = cause + die’) raised thorny
disputes. In retrospect, this outcome could have been predicted. Of all the
aspects of language, meaning is the hardest to ‘freeze’ and suffers the most
through °disconnecting language from people’s knowledge of world and society°
(II.74). We are left with no way to tell what practical units these theoretical
units might correspond to: ideas, human actions, real objects and events,
cultural artefacts, and so on. A ‘sememe’ or ‘feature’ like ‘+Human’
cannot be a stable, determinate unit: how complex or simple and how abstract or
concrete it is depends on the context, e.g., on whether it occurs in the
discourse of biology, anthropology, philosophy, religion, politics, law, or
medicine (cf. III.93). Trying to analyze meanings out
of context or else for all contexts
are two equally impossible jobs — the one having too few constraints and the
other too many — and linguists get vague or conflicting results (cf. II.53f;
III.32). Meanings can converge and linguists can reach a secure consensus only
by assembling a representative set of discourse contexts, where the constraints
have been applied by real speakers (II.65, 75).
50. Nor has it been easy to reach a consensus about how semantics should interact with syntax. After withdrawing his early claim that ‘grammar’ is ‘autonomous and independent of meaning’, Chomsky’s ‘standard model’ gave the ‘semantic component’ the job of supplying formal ‘rules’ for ‘interpreting’ and ‘disambiguating’ syntactic sequences that had already been ‘generated’ (‘structurally described’, II.39) by the syntax. Because this arrangement blocked the semantic constraints from helping to construct the sequence, a counter-proposal was soon made to give the ‘generative’ role to the semantics, and to postulate the ‘logical form’ for the ‘deep structure’ of the sentence before the syntax kicks in. But consensus also broke down about how to represent a ‘logical form’: logic pursues sparse issues like ‘truth’ and ‘validity’ rather than the rich ordinary meanings of words or utterances (cf. II.10ff). The ‘logical forms’ shown in sample analyses glibly retained undefined ordinary words, e.g., in this ‘semantic representation’ for a sentence from James McCawley:

The
formula states that it is true for all members (x’s)
— ‘"’ being a ‘universal quantifier’ — that each x
loves x’s wife. The formal
treatment covers only the assignments into sets, an issue logic can handle (II.11).
The rich meaning of the crucial ‘love’ is not semantically or logically
analyzed into units, and attempts to do so would hardly converge.
51.
Further constraints were plainly needed, and the
stage was set to welcome pragmatics,
which had been defined by linguistics and semiotics as the study of the uses
of language in a three-part scheme alongside ‘syntax’ and ‘semantics’.
Fended off by the °staid division between language by itself versus language
use°, pragmatics had made its academic home mainly in philosophy. Linguistics
now admitted pragmatics on the condition that, like semantics, it would respect
the ‘formality’ prescribed for syntax. So pragmatics was to supply further
sparse constraints in the guise of ‘rules’ for ‘interpreting’ and
‘disambiguating’ sentences by specifying what a sentence ‘implies’,
‘entails’, or ‘presupposes’, and how its meaning (sometimes!) depends on
the speaker’s intention and the hearer’s acceptance. What appears to be a
simple statement about reality, e.g., Rodolfo’s ‘c’è freddo fuori’
(‘it’s cold outside’) in Puccini’s La
Bohème, can be intended and accepted to be a proposal to stay inside;
Mimi’s answer ‘ti starò vicina’ (‘I’ll be near you’) reacts by
implying that she’ll keep him from feeling the cold outside.
52.
A catchy motto for pragmatic inquiries was ‘how
to do things with words’ (John Austin), and not just how words are formed
and arranged or what words mean. The notion of speech
act was adopted from ordinary
language philosophy, a term
contrasting with the more technical or formal language philosophy of logical and
mathematical systems (cf. III.167). Here, research sought to specify the
conditions under which an utterance act
or a locutionary act corresponds to a propositional
act of stating a content or message, an illocutionary
act of performing a discourse action (e.g. promising, threatening), and a perlocutionary
act of eliciting an effect on the hearer (e.g. getting compliance). Whereas
the propositional act foregrounds the constative
function of making statements about the world, the illocutionary act is most
prominent in the performative function
where saying and doing fully converge (e.g. ‘pronouncing this meeting
adjourned’).