7. Noam Chomsky1

 

 7.1 Both inside and outside the discipline, Chomsky's work has fundamentally affected views of what of linguistics is or should be, and reopened issues many linguists had long thought were settled. On the jacket of Aspects of the Theory of Syntax (hereafter AT), a reviewer calls the ‘approach’ ‘truly fresh and revolutionary’; and Chomsky often stresses how it is different from, and better than, various alternatives. Yet many of his ideas are conservative in that they derive from traditional philosophy, grammar, and logic. The most ‘revolutionary’ aspect lies in his claims about how these ideas apply to language and linguistics (cf. 7.78, 95).

7.2 A skilful public debater, Chomsky intensifies the forensic and polemical aspects of the discipline by using theoretical arguments about ‘the nature of language’ to fortify his positions against competitors (cf. 9.3; 122). He foregrounds points of contention even where he implicitly agrees with or borrows from his adversaries, and uses a highly confident rhetoric for his ‘tentative’ views and proposals (cf. 7.85, 94).2 His argumentation oscillates from intuitive reasoning and philosophical speculations on ‘the mind’, over to technical points drawn from formal language theory and from such sciences as biology and neurology. Due in part to this diversity of sources, his terminology and notation take on a strategic plurality of meanings (7.15, 28, 78, 83ff).

7.3 Chomsky turns away from ‘modern linguistics’ (cf. 7.5, 19, 30, 62, 75) and cites far earlier sources: Panini, Plato, and both rationalist and romantic philosophers, such as René Descartes (1647), Claude Fauré Vaugelas (1647), César Chesneau DuMarsais (1729), Denis Diderot (1751), James Beattie (1788), and Wilhelm von Humboldt (1836) (AT v, 4f, 49, 198f, 233f).3 He sees ‘the conception of linguistic structure that marked the origins of modern syntactic theory’ in the Grammaire générale et raisonnée (Lancelot & Arnaud 1660), which claimed that ‘aside from figurative speech’ the ‘“natural order of thoughts”‘ is ‘mirrored by the order of words'4 -- an idea Chomsky rejects as a ‘naive view of language structure’ blocking ‘a precise statement of regular processes of sentence formation’, though his notion of ‘deep structure’ being ‘interpreted’ via ‘semantic universals’ has a similar cast (AT 6, 117f, 137) (cf. 7.20, 71f; 11.84). Such allegiances brought ‘Cartesian linguisitics’ into fashion (cf. Chomsky 1966) in a discipline that had been rather hostile to philosophy (cf. 4.4ff, 19, 38, 51, 72; 6.3, 5, 13; 8.5, 17, 60, 814; 9.3ff, 31; 13.16).

7.4 Another chief source is ‘traditional grammars’, staunchly defended for having ‘exhibited’ ‘linguistic processes’, ‘however informally’, and ‘given a wealth of information concerning the structural descriptions of sentences’ (AT 5). For Chomsky, this ‘information is ‘without question substantially correct, and is essential to any account of how the language is used or acquired’ (cf. 2.6; 6.49; 12.41, 88; 13.7); that such ‘grammars’ ‘have been “long condemned by professional linguists”‘ is deemed ‘irrelevant’ (AT 63f, 194) (cf. Dixon 1963). Admittedly, these ‘grammars were deficient in that they left unexpressed many of the basic regularities of language’, and never went ‘beyond the classification of particular examples to the stage of formulation of generative rules on any significant scale’; but this ‘defect’ is also found in ‘structuralist grammars’ (AT 5). Chomsky's grammatical terms in both SS and AT are not attributed to any source; language philosophers are cited only in theoretical arguments.

7.5 A source Chomsky roundly repudiates is ‘structural linguistics’, his chief professional  and academic competitor for posts in the  then newly opening ‘Departments of Linguistics’ his team in fact managed to corral with the all razor skills of an intellectual mafia -- eventually, and to me incredibly, even Halliday's own when the latter retired.  His ‘discussion’ of work in this discipline leads the agenda of Syntactic Structures (hereafter SS) and is cited in AT as ‘unanswerable’ or at least ‘for the moment, not challenged’, whereupon he presumes that ‘the inadequacy’ of ‘structuralist grammars’ ‘for natural languages’ ‘has been established beyond any reasonable doubt’ (AT 54, 67). He simply ignores the merits of such grammars for describing a host of previously unrecorded languages (5.2; 13.56). He sees only their theoretical flaws and ‘no compensating advantage for the modern descriptivist reanalysis’, and even vows that ‘knowledge of grammatical structure cannot arise’ from the ‘operations’ used in that kind of ‘linguistics’ (AT 174, 57) (cf. 7.7, 24, 29, 33, 76, 96). Also, descriptivist fieldwork conflicts with his own plan to focus on English and to discount the formal diversity of languages as a ‘surface’ issue (cf. 7.18f, 41, 55, 62; 13.71).

7.6 Chomsky charges that ‘descriptivists’ rely unduly on induction: the ‘limitation-in-principle to classification and organization of data, to “extracting patterns” from a corpus of observed speech’ (AT 15) (cf. 4.7; 13.45). ‘Structural linguistics’ makes an ‘extremely strong demand’ on ‘discovery’ by ‘assuming that the techniques for discovering the correct hypotheses (grammar) must be based on procedures of segmentation and classification of items in a corpus’ (AT 202f) (cf. 13.30). Thus, ‘general linguistic theory’ seems to ‘consist only of a body of procedures’ which ‘determine’ the ‘restrictions on possible grammars’ and otherwise leave ‘the form of language unspecified’ (AT 52). ‘Proposals’ ‘attempt to state methods of analysis that an investigator might actually use’ ‘to construct a grammar of language directly from the raw data’ (e.g. Wells 1947; Bloch 1948:, Harris 1951, 1955; Hockett 1952; Chomsky 1953) (SS 52). ‘This goal’ can ‘lead into a maze’ of ‘complex analytic procedures’ unsuited to answer ‘many important questions about the nature of linguistic structure’ (SS 53).

7.7 In SS, Chomsky does ‘not deny the usefulness’ of ‘discovery procedures’ in ‘providing valuable hints to the practising linguist’, but only admonishes that ‘linguistic theory’ is not ‘a manual’ of ‘procedures’ (SS 55n, 59, 106) (cf. 6.38, 61; 83; 12.33). In AT, however, he pointedly devalues the discovery of data, saying that ‘no adequate formalizable techniques are known for obtaining reliable information concerning the facts of linguistic structure’; and that ‘there is no reason to expect that reliable operational criteria for the deeper and more important notions of linguistic theory’ ‘will ever be forthcoming’ (AT 19). ‘Knowledge of the language, like most facts of interest and importance, is neither presented for direct observation nor extractable from data by inductive procedures of any known sort’ (AT 18). ‘Allusions’ to ‘“procedures of elicitation” or “objective methods” simply obscure the actual situation in which linguistic work’ must ‘proceed’ (AT 19) (cf. 5.10). So ‘theoretical’ ‘investigation of the knowledge of the native speaker can proceed perfectly well’ without ‘operational procedures’. ‘The critical problem is not a paucity of evidence but rather the inadequacy of present theories of language to account for masses of evidence’ ‘hardly open to serious question’ (AT 19f). These arguments lead to a saving of labour: ‘once we have disclaimed practical discovery procedures’, ‘certain problems’ likely to stir up ‘intense methodological controversy simply do not arise’ (SS 56). Indeed.

7.8 ‘The relation between the general theory and the particular grammars’ is where Chomsky's ‘approach’ ‘diverges sharply from many theories’ (SS 50). In his view, ‘a discovery procedure’ whereby ‘the theory’ ‘provides a practical and mechanical method for actually constructing the grammar, given a corpus of utterances’, is an unduly ‘strong requirement’ (SS 50f). ‘A weaker requirement’ is ‘a decision procedure’ whereby the ‘method’ ‘determines whether or not a grammar is’ ‘the best grammar’ for the ‘corpus’, without asking ‘how this grammar was constructed’. ‘Even weaker’ is ‘an evaluation procedure’ for telling which of ‘two proposed grammars’ (or ‘small sets of grammars’) is ‘better’. He thinks ‘it is unreasonable to demand of linguistic theory’ ‘more than a practical evaluation procedure’ (SS 52f). Having one would ‘guarantee significance’ for linguistics as one of the ‘few areas of science’ seeking a ‘practical mechanical method for choosing among several theories, each compatible’ with ‘the data’.

7.9 So a ‘theory may not tell us in any practical way’ how to ‘construct the grammar, but it must tell us how to evaluate’ and ‘choose’ (SS 54). Chomsky again saves labour by ‘never considering the question of how one might have arrived at the grammar’, and declaring ‘such questions’ ‘not relevant to the programme of research’ (SS 56) (cf. 13.38). ‘One may arrive at a grammar by intuition, guesswork’, ‘partial methodological hints’, ‘past experience’, etc.’ We might ‘give an organized account of useful procedures of analysis’, but these cannot be ‘formulated rigorously, exhaustively, and simply enough’ to ensure valid ‘discovery’. Instead, his ‘ultimate aim is to provide an objective, non-intuitive way to evaluate a grammar once presented’.

7.10 As Chomsky sets the task, ‘the grammarian must construct a description, and, where possible, an explanation, for the enormous mass of unquestionable data concerning the linguistic intuition of the native speaker, often himself’ (AT 20, i.s.) (cf. 6.15). Against much of earlier American linguistics, Chomsky vows that ‘linguistic theory is mentalistic’, ‘concerned with discovering a mental reality underlying actual behaviour’ (AT 4) (cf. 4.8; 13.57). He decries ‘behaviourism’ for consulting only ‘data’ and ‘expressing a lack of interest in theory and explanation’, due to ‘certain ideas’ (like ‘operationalism’ and ‘verificationism’) ‘in positivist philosophy of science’ (AT 193f). Instead, ‘linguistic theory’ should ‘contribute to the study of human mental processes and intellectual capacity’ (AT 46) (cf. 3.10ff; 5.69; 6.6; 7.10; 12.17ff, 22; 13.22). ‘Observed use of language or hypothesized dispositions to respond, habits, and so on, may provide evidence’ about the ‘mental reality, but surely cannot constitute the subject-matter of linguistics’ as ‘a serious discipline’ (AT 4).

7.11 ‘Introspective judgments of the informant’ cannot be ‘disregarded’ ‘on grounds of methodological purity’ without ‘condemning the study of language to utter sterility’ (AT 194). In SS, however, Chomsky had warned against ‘asking the informant to do the linguists’ work’ by eliciting a ‘judgment about his behaviour’; such ‘opinions may be based on all sorts of irrelevant factors’ (SS 97n) (cf. 13.49). SS also announced that ‘the major goal of grammatical theory is to replace’ the ‘obscure reliance on intuition by some rigorous and objective approach’ (SS 94; cf. SS 5, 56).5 But in AT, the prospect that ‘giving such priority to introspective evidence’ and ‘intuition’ might ‘exclude’ ‘linguistics’ from ‘science’ is downplayed as a ‘terminological question’ with ‘no bearing at all on any serious issue’ (AT 20). Though ‘the successful sciences’ are ‘concerned’ with ‘objectivity’, the latter ideal can, as ‘the social and behavioural sciences’ show, ‘be pursued with little gain in insight and understanding’. Chomsky's preferred model is ‘the natural sciences’, which ‘have sought objectivity as a tool for gaining insight, for providing phenomena’ to ‘suggest or test deeper explanatory hypotheses’, rather than ‘as goal in itself’ (cf. 2.13; 4.8, 18; 7.16; 9.112; 12.14, 49, 99; 13.11, 18). Regarding whether ‘a wider range and more exact description of phenomena is relevant’, Chomsky asserts that ‘in linguistics’, ‘sharpening the data by objective test’ is ‘of small importance’ for ‘new and deeper understanding of linguistic structure’ (AT 20f). ‘Many questions’ ‘today’ ‘do not demand evidence’ ‘unattainable without significant improvements in objectivity of experimental technique’ (i.a.).

7.12 Chomsky makes the declaration, soon to be famous, that ‘linguistic theory is concerned primarily with an ideal speaker-hearer in a completely homogeneous speech-community, who knows its language perfectly, and is unaffected by such grammatically irrelevant conditions as memory limitations, distractions, shifts of attention and interest, and errors (random or characteristic) in applying his knowledge of the language in actual performance’ (AT 3) (cf. 7.96; 9.5; 11.69; 12.44; 13.14, 18, 36, 132). Chomsky thus draws an influential ‘distinction between competence, the speaker-hearer's knowledge of his language, and performance, the actual use of language in concrete situations’ (AT 4) (cf. 9.6; 13.39). ‘Only under the idealization’ of the ‘speaker-hearer’ ‘is performance a direct reflection of competence’, but not ‘in actual fact’. ‘A record of natural speech will show numerous false starts, deviations from rules, changes of plan in mid-course, and so on’. Chomsky's ‘distinction’ ‘is related to the langue-parole distinction of Saussure’ (2.20), but ‘rejects his concept of language as merely a systematic inventory of items, and returns to the Humboldtian conception of underlying competence as a system of generative processes’.

7.13 Chomsky subscribes to the ‘view’ ‘that the investigation of performance will proceed only so far as the understanding of underlying competence permits’ (AT 10). ‘The only concrete results’ and ‘clear suggestions’ so far for ‘a theory of performance’ have come from ‘models that incorporate generative grammars’ by making ‘assumptions about underlying competence’; and he finds it ‘difficult to imagine any other basis’ (AT 10, 15). Only within such models will ‘actual data of linguistic performance’ provide ‘evidence for determining the correctness of hypotheses about underlying structure, along with introspective reports by the native speaker or the linguist who has learned the language’ -- this being ‘the position universally adopted in practice’ (AT 18). He cites ‘observations concerning limitations on performance imposed by organization’ and ‘bounds of memory’, or ‘concerning exploitations of grammatical devices to form deviant sentences’ (AT 10) (cf. 7.41).

7.14 Chomsky stipulates that ‘a theory of linguistic intuition’, a ‘grammatical description’, or ‘an operational procedure’ ‘must be tested for adequacy’ ‘by measuring it against’ ‘the tacit knowledge’ it tries to ‘describe’ (AT 19ff) (cf. 7.24, 29, 91). But even if it is ‘the ultimate standard’ of ‘accuracy’ and ‘significance’ for both ‘grammars’ and ‘tests’, this ‘tacit knowledge may very well not be immediately available to the user of the language’ (13.49). Precisely due to the ‘elusiveness of the speaker's tacit knowledge’, the linguist is needed to ‘guide and draw it out’ in ‘subtle ways’ (AT 24). By ‘adducing’ examples, we can ‘arrange matters’ so that people's ‘linguistic intuition, previously obscured, becomes evident to them’. Chomsky's favoured tactic is to show that an isolated ‘sentence’ (like ‘“flying planes can be dangerous”’) can be ‘assigned’ more than one ‘interpretation’ ‘by the grammar’, although ‘in an appropriate’ ‘context, the listener will interpret it immediately in a unique way’ and ‘may reject the second interpretation’, if ‘pointed out to him, as forced or unnatural’ (AT 21) (cf. 7.53, 61, 79, 82; 13.39).

7.15 Along similar lines, Chomsky finds it ‘obvious’ that ‘every speaker of a language has mastered and internalized a generative grammar that expresses his knowledge of his language’ (AT 8). Indeed, Chomsky ‘uses the term “grammar” with a systematic ambiguity’ both for ‘the native speaker's internally represented “theory of his language”‘ and for ‘the linguist's account of this’ (AT 25) (cf. 7.28, 78; 13.45). Yet people may not be ‘aware of the rules’, or even able to ‘become aware’, nor are their ‘statements about their intuitive knowledge’ ‘necessarily accurate’ (AT 8) (13.49). ‘Any interesting generative grammar will be dealing, for the most part, with mental processes’ ‘far beyond the level of actual or even potential consciousness’ (AT 8; cf. AT 59; 216; 13.49). Since ‘the speaker's reports and viewpoints about his behaviour and competence may be in error’, a ‘grammar attempts to specify what the speaker knows, not what he may report’. Chomsky sees a ‘similarity’ here to ‘a theory of visual perception’ trying to ‘account for what persons actually see and the mechanisms that determine this rather than for their statements about what they see and why, though these statements may provide useful, in fact compelling evidence’ (AT 8f) (cf. 7.35, 89). However, seeing objects or scenes differs from ‘seeing’ language, if in ‘linguistics’ ‘the viewpoint’ ‘creates the object’, as Saussure asserted (2.9; 13.58).

7.16 Further analogies with other sciences are strategically drawn. Using a ‘grammar’ to ‘reconstruct formal relations among utterances in terms’ of ‘structure’ and to ‘generate all grammatically “possible” utterances’ is ‘analogous’ to using ‘chemical theory’ to ‘generate all physically possible compounds’ (SS 48) (cf. 2.82; 4.8; 5.28; 13.18). A ‘chemical theory’ ‘serves as a theoretical basis for techniques of qualitative analysis and synthesis of specific compounds, just as a grammar’ supports ‘the investigation’ of the ‘analysis and synthesis of particular utterances’. Or in physics, ‘any scientific theory is based on a finite number of observations’, which it ‘relates’ and ‘predicts’ ‘by constructing general laws in terms of such hypothetical constructs as’ ‘“mass and electron”‘ (SS 49) (4.8, 71; 5.66; 7.36; 6.62; 8.49; 13.43). ‘Similarly, a grammar of English is based on a finite corpus of utterances (observations)’, ‘contains grammatical rules (laws) stated in terms of phonemes, phrases, etc.’ ‘(hypothetical constructs)’ and ‘expresses structural relations among sentences of the corpus and the indefinite number of sentences generated by the grammar’ (‘predictions’). Such ‘physical’ analogies stand out in a mentalist approach, especially one whose main ‘metaphor’ seems to be mathematics (cf. 7.93; 13.10, 15, 18).

7.17 Chomsky proposes to ‘construct a formalized general theory of linguistic structure and to explore’ its ‘foundations’, hoping to ‘fix in advance for all grammars’ the way they are ‘related to a corpus of sentences’ (SS 5, 14). ‘We can attempt to formulate as precisely as possible both the general theory and the set of associated grammars’ (SS 50).6 His ‘motivation’ is not just a ‘mere concern for logical niceties or desire to purify well-established methods of linguistic analysis’ (SS 5). ‘Precisely constructed models’ ‘can play an important role, both negative and positive, in the process of discovery’ (cf. 7.98). ‘By pushing a precise but inadequate formulation to an unacceptable conclusion’ and ‘exposing the source’, we can ‘gain a deeper understanding of the linguistic data’. Or, ‘positively, a formalized theory may automatically provide solutions for problems’ for which it wasn't ‘originally designed’. To ‘determine the fundamental underlying properties of successful grammars’, ‘linguists’ should ‘recognize the productive potential’ of ‘rigorously stating a proposed theory and applying it strictly to linguistic material’ without ‘ad hoc adjustments or loose formulations’ (SS 11, 5). In any event, ‘neither the general theory nor the particular grammars are fixed for all time’ (SS 50). ‘Progress and revision may come’ from ‘new facts’ or ‘purely theoretical insights’, i.e., ‘new models for linguistic structure’.

7.18 In fine, Chomsky's priorities sound like Hjelmslev's:7 ‘a theory of language must state the principles interrelating theoretical terms’ and ‘ultimately must relate this system to potential empirical phenomena, to primary linguistic data’ (AT 208).8 We should ‘describe the form of grammars (equivalently, the nature of linguistic structure)’ and ‘the empirical consequences of adopting a certain model’ (SS 56). ‘The ultimate outcome’ ‘should be a theory of linguistic structure in which the descriptive devices utilized in particular grammars are presented and studied abstractly’, without ‘reference to particular languages’ (SS 11, 50) (cf. 6.11). Having ‘data’ from ‘relatively few languages’ ‘is not particularly disturbing’, because the ‘conditions’ of ‘a single language may provide significant support’ for ‘some formal property’ belonging to ‘general linguistic theory’ (AT 209). Chomsky sees ‘an important advance in the theory of language’ when an apparent ‘peculiarity of English’ is made ‘explicable’ by ‘a general and deep empirical assumption about the nature of language’, which can be ‘refuted if false’ by ‘grammars of other languages’ (AT 36). This step fits the hope of ‘replacing’ ‘assertions about particular languages’ with ones about ‘language in general’, so that ‘features of grammars in individual languages can be deduced’ (AT 46). Conversely, ‘the difficulty or impossibility of formulating certain conditions within’ a ‘theory of grammar’ may signal that they are ‘general’ for ‘the applicability of grammatical rules rather than aspects of the particular language’ (AT 209).

7.19 So against ‘modern linguistics’, Chomsky calls for ‘the grammar of a particular language’ ‘to be supplemented by a universal grammar’ ‘expressing deep-seated regularities’ (AT 6) (cf. 2.10; 3.67; 4.4, 71f, 74; 5.44; 6.5, 10, 34; 7.22, 29, 33f, 45, 55, 62, 65, 71, 78f, 91, 93, 710, 732, 739; 8.19, 60, 86; 9.3, 25, 60; 12.94; 13.18). Indeed, ‘the main task of linguistic theory must be to develop an account of linguistic universals’ (AT 28). He contests the ‘commonly held’ view ‘that modern linguistic and anthropological investigations have conclusively refuted the doctrines’ of ‘universal grammar’ (AT 118) (cf. 7.55, 62). ‘Linguistic theory’ may yet ‘develop an account of linguistic universals’ that reveals ‘the properties of any generative grammar’ and is ‘not falsified by the diversity of languages’ (AT 27f). An important (and labour-saving) corollary is that ‘universal’ ‘regularities’ ‘need not be stated in the grammar’ of a ‘language’ but ‘only in general linguistic theory as part of the definition of the notion “human language”‘ (AT 6, 117; cf. AT 35f, 112, 144, 168, 225; 7.65, 72).

7.20 ‘Universals’ fall into two kinds. The ‘substantive’ ones, a ‘traditional concern of general linguistic theory’, involve ‘the vocabulary for the description of language’ and state that ‘items of a particular kind in any language must be drawn from a fixed class’ (AT 28). The best case comes from phonology, the old standby of linguists, namely the ‘theory of distinctive features’, ‘each of which has a substantive acoustic-articulatory characterization independent of any particular language’ (cf. 2.70; 5.42f; 6.54; 7.71, 710; 836; 13.26). Another case is the assumption of ‘traditional universal grammar’ that ‘fixed syntactic categories (Noun, Verb, etc.) can be found in the syntactic representations of the sentences of any language’ and ‘provide the general underlying syntactic structures of each language’. ‘More abstract’ and ‘recently’ studied are the ‘formal linguistic universals’ stating ‘formal conditions’ such as ‘the character of the rules’ in ‘grammar’ and the ways’ they are ‘interconnected’ (AT 29). Such ‘deep-seated formal universals’ ‘imply that all languages are cut to the same pattern’, though without any ‘point-by-point correspondence’ (AT 30).9 These ‘formal constraints’ ‘may severely limit the choice’ ‘of a descriptive grammar’. Not surprisingly, Chomsky's favoured candidate here is ‘that the syntactic component of a grammar must contain transformational rules’; and that even ‘the phonological component’ ‘consists of a sequence of rules, a subset of which’ is ‘a transformational cycle’ (AT 29). This move too saves labour: if ‘the transformational cycle is a universal feature of the phonological component, it is unnecessary, in grammar of English, to describe’ the ‘functioning of phonological rules that involve syntactic structure’ (AT 35).10 A new rationale for separating levels?

7.21 Therefore, ‘our problem is to develop and clarify the criteria for selecting the correct grammar for each language, that is, the correct theory of this language’ (SS 49). ‘A linguistic theory is descriptively adequate’ and ‘empirically significant’ if it ‘makes available for each language’ a ‘grammar’ (or ‘a class of grammars’), ‘correctly’ ‘states the facts’, ‘describes the intrinsic competence of the idealized native speaker’ for ‘understanding arbitrary sentences’, and ‘accounts for the basis of that achievement’ (AT 24, 27, 34, 40, i.s.). ‘External conditions of adequacy’ hinge on whether ‘the sentences generated’ are ‘acceptable to the native speaker’, but apply to a ‘merely descriptive’ ‘theory’ (SS 49f, 13, 54). ‘Internal grounds’ constitute a ‘much deeper’ and more ‘principled’ ‘level’ of adequacy (SS 13; AT 41, 27). For the moment however, Chomsky simply ‘tests the adequacy’ of an ‘apparatus’ by ‘applying it directly to the description of English sentences’ (SS 34) (cf. 7.80). In fact, he makes do with the ‘weak test’ of ‘a certain number of clear cases’; a ‘strong test’ would require these to ‘be handled properly for each language by grammars all’ ‘constructed by same method’ (SS 14) (cf. 7.42; 13.40).

7.22 ‘The structural descriptions assigned to sentences by the grammar’ and its ‘distinctions between well-formed and deviant’ ‘must correspond to the intuition’ of the ‘speaker, whether or not he’ is ‘aware’ of it (AT 24) (cf. 7.11; 13.50). Yet a ‘descriptively adequate’ theory may ‘provide such a wide range of potential grammars’ that ‘no formal property distinguishes’ them, or may ‘leave unexpressed’ ‘the defining properties of natural languages’ that ‘distinguish’ them ‘from arbitrary symbol systems’ (AT 35f). So ‘the major endeavour of the linguist must be’ to ‘enrich the theory of linguistic form’ and ‘restrict the range of possible hypotheses by adding structure’, ‘constraints, and conditions’ to ‘the notion “generative grammar”‘ and ‘to reduce the class of attainable grammars’ until ‘a formal evaluation measure’ can be applied (AT 35, 46, 41). ‘This requires a precise and narrow delimitation of the notion “generative grammar” -- a restrictive and rich hypothesis’ about ‘universal properties that determine the form of language’ (AT 35). ‘Given a variety of descriptively adequate grammars’ we need to discover if they are ‘unique’ or share ‘deep underlying similarities attributable to the form of language as such’. Only the latter discovery yields ‘real progress in linguistics’.

7.23 Hence, Chomsky deems it ‘crucial for the development of linguistic theory’ to ‘pursue’ ‘much higher goals’ than descriptive adequacy’, even ‘utopian’ ones (AT 24f). He envisions ‘explanatory adequacy’ when ‘a linguistic theory succeeds in selecting a descriptively adequate grammar on the basis of primary linguistic data’ and in relating ‘an explanation of the intuition of the native speaker’ to ‘an empirical hypothesis about the innate predisposition of the child’ (AT 25ff; cf 7.27f). Instead of ‘gross coverage of a large mass of data’, which is not ‘an achievement of any theoretical interest or importance’ (7.6f, 10f), ‘linguistics’ should ‘discover a complex of data that differentiates between conflicting conceptions of linguistic structure’ by showing which ones ‘can explain’ the ‘data’ via ‘some empirical assumption about the form of language’ (AT 26). Even for ‘descriptive adequacy’, an ‘explanatory theory of the form of grammar’ provides a ‘main tool’, because ‘the choice’ is ‘always underdetermined by data’ and because ‘relevant data’ from ‘successful grammars for other languages’ can be collated (AT 41) (13.43). Though both ‘unrealized goals’, ‘descriptive’ and ‘explanatory adequacy’ are ‘crucial at every stage of understanding linguistic structure’ (AT 36, 46).

7.24 Since ‘all concrete attempts to formulate an empirically adequate linguistic theory’ leave ‘room for mutually inconsistent grammars’, an ‘evaluation measure’ might be drawn from ‘language acquisition’ (AT 37). For Chomsky, ‘a theory of language’ can in fact ‘be regarded as a hypothesis’ about the ‘“language-forming capacity” of humans’ and ‘language learning’ (AT 30). We can thus ‘formulate’ ‘problems of linguistic theory’ and ‘language learning’ ‘as questions about the construction of a hypothetical language acquisition device’ or ‘model’ (AT 47, 25). The ‘child’ or ‘device’ ‘constructs a theory’ that ‘specifies its tacit competence, its knowledge of the language’, by ‘devising a hypothesis compatible with presented data’ of ‘performance’ (AT 32 36, 4). This feat sounds like a task for a linguist, and Chomsky makes the parallel explicit (cf. 7.88f). On one side, ‘the theorist’ is ‘given an empirical pairing of collections of primary linguistic data with certain grammars’ ‘in fact constructed by people’; on the other side, the child confronts ‘primary linguistic data consisting of signals, classified as sentences and non-sentences’, that are ‘paired with structural descriptions’ (AT 38, 47, 32). For this purpose, ‘the child has developed and internally represented a generative grammar’: ‘a theory’ and ‘a system of rules that determine how sentences are to be formed, used, and understood’ (AT 25).

7.25 ‘In part, such data determine’ which ‘possible language’ ‘the learner is being exposed’ to (AT 33). But the child's ‘internalized grammar goes far beyond the presented primary linguistic data’, ‘and is in no sense an “inductive generalization”‘ (AT 33, 25) (cf. 7.5ff, 30, 34, 93). Because apart from occasional ‘corrections of learners’ attempts’ by ‘the linguistic community’, ‘no special care is taken to teach’ ‘children’, the latter ‘must have the ability to “invent” a generative grammar that defines well-formedness and assigns interpretations to sentences even though linguistic data’ are ‘deficient’ (AT 31, 201). Indeed, Chomsky's idealizations imply that data in ‘actual speech’ are not merely ‘finite’, ‘scattered’, and ‘restricted in scope’, but also ‘degenerate in quality’, replete with ‘non-sentences’, ‘fragments, and deviant expressions’ (AT 43f, 31, 58, 201, 25; cf. 7.12; cf. 9.4f). He compares this factor with the ‘traditional view’ that ‘“the pains”‘ of ‘“conversation”‘ lie in ‘“extricating”‘ ‘“the thought from the signs or words which often agree not with it” (Cordemoy 1667)’ (AT 201), but his own concept of well-formedness is unrelated to the deviousness or overelaboration Cordemoy probably had in mind.

7.26 The child again resembles a Chomskyan linguist if ‘as a precondition for language learning, he must possess a linguistic theory that specifies the form of the grammar of a possible human language’ plus ‘a strategy for selecting a grammar’ ‘compatible’ with the ‘data’ (AT 25). Chomsky envisions the ‘child’ applying ‘a class of possible hypotheses about language structure’ and ‘determining what each’ ‘implies for each sentence’ so as to ‘select one of the presumably infinitely many hypotheses’ ‘compatible with the given data’ (AT 30, 45). Perhaps ‘an acquisition model’ entails a ‘strategy for finding hypotheses’ by ‘considering ‘only grammars’ above ‘a certain value’ before ‘language learning can take place (AT 203).11 Or, ‘it is logically possible that the data’ are so ‘rich’ and ‘the class of potential grammars’ so ‘limited’ that only a ‘single permitted grammar will be compatible with the available data’ (AT 28). Or, even ‘a system not learnable by a language acquisition device’, which ‘is only one component of the total system of intellectual structures’ applied to ‘problem-solving and concept formation’, might be ‘mastered by a human some other way’; but Chomsky ‘expects ‘a qualitative difference’ in how ‘an organism’ ‘approaches’ ‘languagelike’ ‘systems’ (AT 56).

7.27 As ‘the most interesting and important reason for studying descriptively adequate grammars’ and for ‘formulating and justifying a general linguistic theory’, Chomsky states this ‘issue’: that ‘the general features of language structure reflect not so much the course of one's experience’ as the ‘capacity to acquire knowledge’ by ‘innate ideas’ and ‘principles’ (AT 59, 27, 78). Invoking ‘the traditional belief that “the principles of grammar form an important” “part of the philosophy of the human mind”‘ (Beattie 1788) (AT 59), he postulates a ‘structure’ that ‘pertains to the form of language in general’ and ‘reflects what the mind brings to the task of language acquisition, not what it discovers or invents’ during ‘the task’ (AT 59, 117). He hopes ‘linguistic theory’ can thereby ‘contribute to the study of human mental processes and intellectual capacity’ via ‘the abilities that make language learning possible under empirically given limitations of time and data’ (AT 46f, 31) (cf. 7.10; 13.22).

7.28 So ‘a long-range task for general linguistics’ is an ‘account of this innate linguistic theory’ forming ‘the basis for language learning’, although Chomsky short-circuits this task with another ‘systematic ambiguity’: using ‘“theory of language”‘ for ‘both the child's innate predisposition’ and for ‘the linguist's account’ (AT 25) (cf. 7.15, 78). A ‘hypothesis concerning the innate predisposition of the child’ can help ‘explain the intuition of the native speaker’ and can be ‘falsified’ when ‘it fails to provide’ a ‘grammar’ for ‘data from some other language’ (AT 25f). An ‘extremely strong’ ‘claim’ is involved here ‘about the innate capacity’ or ‘predisposition’ ‘of the child’ (AT 32) (cf. 7.93). It ‘is maintained’ ‘that the child has an innate theory of potential structural descriptions’ ‘rich and fully developed’ enough to ‘determine from a real situation’ ‘which structural descriptions may appropriate to a signal’ that ‘occurs’; and that this can be done ‘in advance of any assumption as to the linguistic structure of this signal’ (AT 32f) (cf. 66).

7.29 ‘A theory of language’ along these lines ‘should concern itself’ with ‘linguistic universals’, a ‘theory’ or ‘tacit knowledge’ of which is ‘attributed’ ‘to the child’ (or to the ‘language acquisition device’) (AT 30, 27, 55f) (cf. 7.23, 89). ‘The child approaches the data with the presumption that they are drawn from a language’ of an ‘antecedently well-defined type’, and ‘determines which of the humanly possible languages is that of the community’ (AT 27). A ‘theory’ with ‘universals’ ‘implies that only certain kinds of symbolic systems can be acquired and used as languages’, whereas ‘others are beyond it’ (AT 55). We thus arrive at ‘the question: what are the initial assumptions’ about ‘the nature of language that the child brings’ and how ‘specific is the innate schema (the general definition of “grammar”) that gradually becomes more explicit and differentiated as the child learns the language?’ (AT 27).

7.30 ‘Acquisition of language’ offers ‘a special and informative case’ for ‘a more general’ ‘discussion’ of ‘two lines of approach to the acquisition of knowledge’ (AT 47). ‘Empiricism’ (spurred by ‘18th century struggles for scientific naturalism’) ‘assumes that the structure of the acquisition device is limited to certain elementary “peripheral processing mechanisms”‘, e.g. ‘an innate “quality space”, or to a set of primitive unconditioned reflexes’ or (for ‘language’) a set of ‘“aurally distinguishable components”‘; plus ‘elementary’ ‘analytical data-processing mechanisms or inductive principles’, e.g., ‘weak principles of generalization’ or (for language) ‘principles of segmentation and classification’ like those of ‘modern linguistics’ (AT 47, 51, 58f, 207) (cf. 7.5ff, 25, 33, 96; cf. 13.26, 30). ‘A preliminary analysis of experience is provided’ by those ‘mechanisms’ as ‘concepts and knowledge are acquired’ through ‘inductive principles’ (AT 48). Such claims are said to portray ‘language’ as ‘an adventitious construct’ (AT 51). Moreover, Chomsky complains that ‘empiricist views have generally been formulated in such an indefinite way’ as to be ‘next to impossible to interpret’, ‘analyse, or evaluate’ (AT 204). Skinner's (1957) account (reviewed in Chomsky 1959) is ‘grossly, obviously counter to fact’ if ‘terms like “stimulus”, “reinforcement”, “conditioning”, etc.’ are used ‘as in experimental psychology’; if they are just ‘metaphoric extensions’, we get only ‘a mentalist account differing from traditional ones’ by ‘the poverty of the terminology’ and not by any ‘scientific’ quality.12

7.31 ‘Rationalism’ is a ‘different approach’ to the ‘acquisition of knowledge’ and holds that ‘innate ideas and principles’, ‘fixed in advance as a disposition of mind’, ‘determine the form of acquired knowledge’ in a ‘restricted and highly organized way’ (AT 48, 51). A main inspiration is ‘seventeenth-century rationalist philosophy’ (AT 49). Descartes (1647) said that ‘“innate ideas”‘ about ‘“movements and figures”‘, ‘“pain, colour, sound”‘, etc. -- as well as ‘such notions as that things equal to same thing are equal to each other’ -- arise from ‘“our faculty of thinking”‘, not ‘“from external objects”‘ (AT 48). ‘The Port-Royal Logic (Arnauld [& Nicole] 1662)’ made ‘the same point’: ‘no idea’ ‘in our minds has taken its rise from sense’, because ‘ideas’ have ‘rarely any resemblance to what takes place in the sense and in the brain’, and ‘some have no connection with any bodily image’ (AT 49f). ‘Lord [Edward] Herbert [of Cherbury] (1624) maintained that’ without ‘innate ideas and principles’, ‘“we should have no experience”‘ or ‘“observations”‘, nor ever ‘“distinguish between things or grasp any general nature”‘ (AT 49). Leibniz (1873[1702-03]) declared ‘the senses ‘necessary’ but ‘not sufficient’ for ‘actual knowledge’, furnishing only ‘examples, i.e. particular’ ‘truths’, whereas ‘the truths of numbers’, i.e. ‘all arithmetic and geometry, are in us virtually’ to ‘set in order what we already have in the mind’ (AT 50). ‘Necessary truths must have principles whose proof does not depend on examples nor consequently’ on ‘the senses’, and which ‘form the soul’ of ‘our thoughts’, ‘as necessary thereto as the muscles’ for ‘walking’.

7.32 Descartes’ claim that ‘corporeal movements’ ‘reach our mind’ ‘from external objects’ and ‘cause’ it to ‘envisage these ideas’ (AT 48) matches Chomsky's stipulation that ‘innate mechanisms’ must be ‘activated’ by ‘appropriate stimulation’ and are then ‘available for interpretation of the data of sense’ (AT 48, 51). ‘The rationalist view’ implies that ‘one cannot really teach language, but can only present the conditions under which it will develop spontaneously in the mind’ (cf. Humboldt 1836) (but cf. 12.17, 1111). ‘Thus the form of a language, the schema for its grammar, is to a large extent given’, though it must be ‘set into operation’ by ‘experience’ (AT 51).

7.33 ‘Empiricism’ is surprisingly accused of being not ‘“scientific”‘ but more ‘dogmatic and aprioristic’ than ‘rationalism’ in saying that ‘arbitrarily selected data-processing mechanisms’ ‘are the only ones available’ (AT 207). ‘Rationalism’, to which Chomsky finds it ‘difficult to see’ ‘an alternative’, is praised for holding ‘no preconceptions’ about ‘the internal structure’ of the ‘input-output device’, and only ‘studying uniformities in the output’, i.e. ‘universals’ (AT 48) (but cf. 7.93). If ‘rationalism’ does ‘not show how internal structure arises’, neither does ‘empiricism’ (AT 206). The rival is further outflanked by putting physics and biology, of all things, on the innatist side: we cannot ‘take seriously a position that attributes a complex human achievement to months or at most years of experience rather than to millions of years of evolution or to principles of neural organization’ ‘even more deeply grounded in physical law’ (AT 59).

7.34 Thus, even though ‘the empiricist notion’ is ‘the prevailing modern view’, Chomsky avers that ‘a hypothesis about initial structure’ able to ‘account for acquisition of language’ cannot fit ‘preconceptions’ from ‘centuries of empiricist doctrine’ -- ‘implausible to begin with’, ‘without factual support’, and ‘hardly consistent’ with how ‘animals or humans construct a ‘theory of the external world’ (AT 51, 58). He raises the prospect of ‘testing’ the two sets of ‘principles’ against ‘those we in fact discover’ in ‘real languages’, or gauging the ‘feasibility’ for ‘producing grammars within the given constraints of time and access’ and the ‘observed uniformity of output (AT 53f). But his own verdict is already decided: ‘empiricist theories about language acquisition are refutable wherever they are clear’, or else ‘empty’, just as ‘evidence’ about ‘language acquisition’ ‘shows clearly that taxonomic views’ are ‘inadequate’ because ‘knowledge of grammatical structure cannot arise’ by ‘inductive operations’ ‘developed in linguistics, psychology, or philosophy’ (AT 54, 57) (cf. 7.5ff, 24, 96; 13.45). In contrast, ‘the rationalist approach exemplified’ by ‘transformational grammar seems to have proven productive’ and ‘fully in accord with what is known about language’, and to offer the ‘hope of providing a hypothesis about the intrinsic structure of a language acquisition system’ (AT 54). Chomsky thereby preserves the hope that his ‘theory’ with its ‘linguistic universals’ may grow ‘rich and explicit enough to account for the rapidity and uniformity of language learning, and the remarkable complexity and range of generative grammars that are the product’, whereas ‘taxonomic linguistics’ can make no such ‘empirical claim’ (AT 28, 52f).

7.35 ‘The problem of mapping the intrinsic cognitive capacities of an organism and identifying the systems of belief and the organization of behaviour that it can readily attain should be central to experimental psychology’ (AT 56f). But due to ‘the atomistic and unstructured framework’ of ‘empiricist thinking’, ‘learning theory’ has instead ‘concentrated’ on the ‘marginal topic’ of ‘species-independent regularities in acquisition’ of ‘a “behavioural repertoire” under experimentally manipulable conditions’ (AT 205, 57). ‘Attention’ has focused on ‘tasks’ ‘extrinsic to cognitive structure’ ‘that must be approached in a devious, indirect, and piecemeal fashion’ (AT 57) (11.92). Chomsky is a bit inconsistent here. To discredit ‘empiricism’, he dismisses ‘comparisons with species other than man’ because ‘every species has highly specialized cognitive capacities’ and ‘language’ is ‘a human creation’ ‘reflecting intrinsic human capacity’ (AT 206, 59) (cf. 3.15; 4.28; 8.27; 12.10; 13.12, 18). The ‘analysis of stimuli’ ‘provided’ by ‘peripheral processing in the receptor systems or in lower cortical centres’ is ‘specific to the animal's life-space’ and ‘behaviour patterns’ (AT 205). Yet to discredit the ‘view that all knowledge is derived solely by the senses’ via ‘association and “generalization”‘, Chomsky denies that ‘man is’ ‘unique among animals’; and to make a point about ‘situational context’ (7.88), he turns to ‘animal learning’ (e.g., ‘depth perception in lambs’ ‘facilitated by mother-neonate contact’ upon which ‘the nature of the lamb's “theory of visual space”‘ does not ‘depend’) (AT 59, 34) (cf. 7.15).

7.36 All these arguments about mind, thought, and learning were absent from Chomsky's advocacy in SS. There, his ‘basic requirement’ for ‘any conception of linguistic structure’ was merely that ‘the grammar of English become’ ‘more simple and orderly’ and yet be able to ‘generate exactly the grammatical sentences’ (SS 68, 54n). The claim that his own approach can ‘simplify’ both ‘theory’ and ‘grammar’ has remained a major theme (SS 37, 41, 47, 55, 58, 65, 72, 106; AT 17, 87, 134, 136, 144, 202, 224) (cf. 7.39f, 42f, 47f, 50f, 54). He also stresses the ‘simplicity’ of very specific areas or tactics (usually ‘transformations’), and judges what is ‘simpler’ or the ‘simplest’ (SS 18, 84, 80; AT 62; SS 14, 56f, 74f, 85, 107, AT 55). Indeed, SS said Chomsky's ‘sole concern’ was ‘to decrease the complexity of the grammar’ ‘of English’ and make it ‘simpler than any proposed alternative’ (SS 83f, i.a.). Admittedly, ‘the notion of “simplicity”‘ ‘was left unanalysed’; Chomsky foresaw it being ‘defined within linguistic theory’, and he compared ‘choosing’ it to ‘determining the value of a physical constant’ as a ‘matter with empirical consequences’ (SS 103, AT 37f) (cf. 7.16). Instead of ‘simplifying one part’ and ‘complicating others’, ‘the right track’ lies where ‘simplifying one part leads to’ ‘simplifying others’ (SS 56). But SS proposed to ‘simplify the grammar’ ‘by formulating rules of a more complex type’ ‘than immediate constituent analysis’;13 and AT supports its arguments about ‘language learning’ by pointing to ‘the remarkable complexity’ of ‘generative grammars’ (SS 41; AT 28; cf. 7.50).

7.37 Simplicity was also a key point in Chomsky's original ‘contention that the conceptions of phrase structure are fundamentally inadequate and that the theory of linguistic structure must be elaborated’ through ‘transformational analysis’ (SS 69). Though not using ‘the strongest proof of the inadequacy of a linguistic theory’ (‘that it literally cannot apply to natural language’), Chomsky did apply the ‘weaker’ one that it ‘can apply only clumsily, in complex, ad hoc, unrevealing’ ways, and that ‘fundamental formal properties of natural language cannot be utilized to simplify grammars’ (SS 34). His plan was to compare ‘three models for linguistic structure’ and ‘their limitations’: a ‘communication theoretic model’, an ‘“immediate constituent”‘ or ‘phrase structure’ model, and a ‘transformational model’, and to show that the first two ‘leave gaps in linguistic theory’ and ‘cannot properly serve the purposes of grammatical description’, unless they are made ‘so hopelessly complex that they will be without interest’ (SS 6, 23, 41, 44, 18-49) (cf. 7.94).

7.38 The ‘communication theoretic model’, outlined by Shannon and Weaver (1949), implies only ‘a minimal linguistic theory’ (SS 18f, 34). ‘A machine’ ‘can be in any one of a finite number of different internal states’, and ‘switches’ among them ‘by producing a certain symbol’, e.g. ‘an English word’ (SS 18f). The ‘sequence’ from ‘an initial’ to ‘a final state’ is ‘a “sentence”‘ (i.r.). A ‘language produced in this way’ is ‘a finite state language’ and the ‘machine itself a finite-state grammar’ or a ‘“Markov process”‘ (SS 19ff). ‘To complete’ the ‘model’, ‘we assign a probability to each transition from state to state’, ‘calculate the uncertainty’, and thus ‘the “information content”‘ (cf. 7.90; 923). Plainly, ‘English is not a finite state language’. Its ‘symbols’ may not be ‘consecutive’, but ‘embedded’; and ‘Markov process models’ cannot ‘account for the ability of a speaker of English to produce and understand new utterances’ (SS 21ff). Imposing ‘arbitrary limitations’, e.g., that ‘sentences’ must be ‘less than a million words’, won't help, because in ‘English’ some ‘processes have no finite limit’.

7.39 A ‘description in terms of phrase structure’ is ‘more powerful’ than a ‘finite state’ one (SS 30f). Chomsky fits this second model to the ‘conception of linguistic structure’ of the last ‘half a century’, namely ‘the “taxonomic” view’ he wants to supplant (AT 88). He ‘does not know’ if ‘English itself is literally outside the range of analysis’ by ‘phrase structure’, but finds such grammars ‘certainly inadequate in the weaker sense’, e.g., in failing to ‘handle’ ‘discontinuous elements’ (SS 34, 38, 41, 75) (cf. 8.61; 13.28). Still, such ‘grammars’ are ‘quite adequate for a small part of the language’, while ‘the rest’ ‘can be derived by repeated application of a rather simple set of transformations to the strings given by the phrase structure grammar’ (SS 41fn) (cf. 7.62). In contrast, if we merely ‘extended phrase structure to cover the entire language directly, we would lose the simplicity of limited’ versions (SS 42).

7.40 We can thus attain ‘an entirely new conception of linguistic structure’ and ‘develop a certain fairly complex but reasonably natural algebra of transformations having properties we apparently require for grammatical description’ (SS 44) (cf. 2.82; 3.72f; 5.27, 86; 6.8, 29, 51, 60; 7.40, 718). This ‘transformational model’ is still ‘more powerful’, applying to ‘languages beyond the bounds of phrase structure’ and ‘accounting’ for ‘relations’ between sentences’ ‘in a natural way’ (SS 6, 47, 75). Here, ‘apparently arbitrary distinctions’ are shown to ‘have a clear structural origin’ and to belong to a ‘higher-level regularity’ (SS 75, 107). ‘A wide variety of apparently distinct phenomena all fall into place’; ‘linguistic behaviour that seems unmotivated and inexplicable in terms of phrase structure appears simple and systematic’ from a ‘transformational point of view’ (SS 68, 75). Now, ‘apparently irregular behaviour’ and ‘glaring and distinct exceptions’ ‘result automatically from the simplest grammar’ and from ‘our rules’ for ‘regular cases’, and emerge as ‘instances of a deeper underlying regularity’ (SS 63f, 66f, 68, 85, 88f, AT 190). This is a ‘remarkable indication of the fundamental character of this analysis’ (SS 66f).

7.41 The transformational model must be ‘a generative grammar’: ‘a system of rules that in some explicit and well-defined way assigns structural descriptions to sentences’ (AT 8). Here, ‘the linguist's task’ is ‘to produce a device of some sort (called a grammar)’ for ‘generating all grammatical sequences’ and no ‘ungrammatical ones’ (SS 85, 11, 13). Hence, ‘the fundamental aim in the linguistic analysis of language is to separate’ these two sets and ‘to study the structure of the grammatical sequences’ (SS 13f). The ‘task’ may ‘explicate’ the ‘intuitive concept’ not just of ‘“grammatical in English”‘ but of ‘“grammatical”‘ in ‘general’ (SS 13). To simplify the task, a distinction is drawn between ‘grammaticalness’ in ‘competence’ versus ‘acceptability’ in ‘performance’ (AT 11) (cf. 7.12). Sentences may be ‘high’ on one ‘scale’ and ‘low’ on the other. ‘Grammaticalness is only one of many factors’ that ‘determine acceptability’, along with ‘memory limitations, intonational and stylistic factors, “iconic” elements of discourse, and so on’ (AT 10f) (cf. 7.13, 56).14 These limits may be probed with ‘operational tests’ of ‘rapidity, correctness, and uniformity of recall and recognition, or normalcy of intonation’, whereas ‘the much more abstract’ and ‘important notions of grammaticalness’ may not. It may in fact be ‘quite impossible to characterize’ ‘unacceptable sentences in grammatical terms’, e.g. by ‘formulating rules’ ‘to exclude them’ or by ‘limiting the number of reapplications of grammatical rules’ (AT 11f). For Chomsky's ‘purposes’, ‘“acceptable” refers to utterances that are perfectly natural and immediately comprehensible without paper-and-pencil analysis, and in no way bizarre or outlandish’ (AT 10).

7.42 Chomsky's reasoning leaves it unclear just how ‘a study of performance’ might ‘investigate the acceptability’ of ‘sentences’ or how a grammar could test its claims about ‘speakers’ ‘rejecting’ ‘sequences as not belonging to the language’ (cf. AT 12, SS 23). He postpones the problem by relying on ‘clear cases’ (preferably ‘violating purely syntactic rules’ rather than ‘semantic or “pragmatic”‘ ones), by not ‘appealing’ to ‘farfetched contexts’, and by promising that for ‘intermediate cases’ we can ‘let the grammar itself decide’ once it is ‘set up in the simplest way’ to ‘include clear sentences and exclude clear non-sentences’ (SS 14 16, AT 76, 208) (cf. 4.67; 7.21; 13.40). But such a stage presupposes that ‘linguistic theory’ has ‘stated the relation between the set of observed sentences and the set of grammatical sentences’ (SS 14n, 55), which has proven far harder than Chomsky implies here. He expects ‘a systematic account of how application of the devices and methods appropriate to unequivocal cases can be extended and deepened’ for others (AT 77f). Meanwhile, he follows his own judgment in adducing sentences that ‘we do not have’, or that ‘many would question the grammaticalness of’, or that are ‘much less natural’ (SS 73, 35n).15 Or, he proposes ‘degrees of grammaticalness’, depending on how ‘completely we violate constituent structure’, and hopes this ‘can be developed in purely formal terms’ (SS 36n, 43nf) (cf. 7.72, 93; 13.40). Doubtful cases would be ‘interpreted’ ‘by analogies to nondeviant sentences’, at least by ‘an ideal listener’ (AT 76, 78, 149). The snappiest evasion of all, which Chomsky admits leaves ‘a serious gap in theory’, is to ‘assume that the set of sentences is somehow given in advance’ (SS 103, 85, 11, 18, 54).

7.43 These issues bear on how a ‘theory’ might ‘provide a general method for selecting a grammar, given a corpus of sentences’ (SS 11). Noting that the whole ‘set of sentences’ a language allows for ‘cannot be identified with any partial corpus obtained by the linguist in his field work’, which is necessarily ‘finite and somewhat accidental’ (13.43), Chomsky defines ‘a language’ as ‘an infinite set of sentences,16 each finite in length and constructed out of a finite set of elements’ (SS 15, 13). ‘All natural languages in their spoken or written form are languages in this sense, since each’ ‘has a finite number of phonemes or letters’, and ‘each sentence is representable as a finite sequence’ of these (even a ‘formal system of mathematics’ qualifies) (SS 13) (cf. 6.51, 56, 60). ‘The assumption that languages are infinite’ can ‘simplify the description’ by allowing the ‘grammar’ to have ‘recursive devices’ for ‘producing infinitely many sentences’ (SS 23f). ‘Hence, a generative grammar must be a system of rules that can iterate to generate an indefinitely large number of structures’ (AT 16).

7.44 Specifically, ‘the infinite generative capacity of the grammar arises from a formal property’ of ‘rules’: ‘inserting’ basic structures ‘into others’, ‘this process being iterable without limit’ (AT 142). ‘In this respect, a grammar mirrors the behaviour of the speaker who’ ‘can produce and understand an infinite number of new sentences’, and ‘knowledge of a language involves this implicit ability’ (SS 15, AT 15). This notion is said to capture the ‘creative aspect’ ‘all languages have’: they ‘provide the means for expressing indefinitely many thoughts and for reacting appropriately to an indefinite range of new situations’ (AT v, 5) (cf. 3.38; 7.67, 83; 8.18, 28, 43, 83; 12.56, 58). The ‘technical devices for expressing a system of recursive processes’, as developed in ‘mathematics’, not only account for ‘sequences’ ‘longer than any ever before produced’, but offer ‘an explicit formulation of creative processes’ (AT 8, SS 17n ). Yet saying a language (or a grammar) is ‘creative’ because it has no longest sentence is like saying mathematics is creative because it has no highest number. Iteration and recursion are the exact opposite of creative: they just churn out the same thing at fixed increments. The real creativity of language, as shown, say, in poetry, would fall in a trouble zone of Chomsky's theory, namely on the borders of the ‘grammatical’, and unsettle his constructs (‘rules’, ‘restrictions’, etc.) for stating what can or cannot be ‘generated’ (cf. 13.40f).

7.45 Because ‘the grammar of a language is a complex system with many and varied interconnections between its parts’, ‘the linguist's task’ should be ‘to describe language’ by ‘a universal system of levels of representation’ (cf. Chomsky 1955) (SS 60, 85, 11, AT 222) (cf. 4.71; 5.34f; 8.51f; 9.30; 11.16f, 35, 56; 12.82; 13.27). Indeed, SS saw ‘“linguistic level”‘ as ‘the central notion in linguistic theory’, a ‘level’ consisting of ‘descriptive devices’ ‘for the construction of grammars’ (SS 11). Borrowing (without acknowledgement) from structuralism, Chomsky proposes to ‘rebuild the vast complexity of the actual language more elegantly and systematically by extracting the contribution’ of ‘several linguistic levels, each’ ‘simple in itself’, although the higher ‘levels’ must attain ‘more powerful modes of linguistic description’ by ‘increasing complexity’ (SS 42n, 11).17 Perhaps ‘we can determine the adequacy of a linguistic theory by developing rigorously and precisely the form of grammar corresponding to a set of levels’ and applying them to ‘natural languages’ (SS 11). ‘Each level’ is in turn ‘a system’ having a ‘set’ (or ‘alphabet’) of ‘minimal elements’ (‘“primes”’), plus ‘relations’, ‘mappings’ to other ‘levels’, and ‘operations of concatination which form strings’ of ‘arbitrary finite length’ (AT 222, SS 109).18

7.46 ‘A hierarchy of linguistic levels’ in ‘a satisfactory grammar of English’ might subsume ‘phonetic, phonological, word, morphological, phrase structure’, and ‘transformational structure’ (AT 223, SS 11f, 18, 85).19 Chomsky dislikes ‘the idea that higher levels are literally constructed out of lower level elements’, though he still depicts a ‘sentence’ as a ‘sequence’ of ‘phonemes, morphemes, or words’ (SS 58f, 106, 18, 32f). Against earlier American linguists, he sees ‘no objection to mixing levels’ (SS 57, 59, 106) (cf. 5.35; 13.27). The ‘interdependence of levels’ -- for instance, ‘if morphemes are defined in terms of phonemes’ and ‘morphological considerations’ are ‘relevant to phonemic analysis’ -- does not entrain ‘linguistic theory’ in any ‘real circularity’; the ‘compatibility’ of ‘phonemes and morphemes’ helps ‘lead to the simplest grammar’ (SS 57) (cf. 4.50; 5.36, 45; 13.27). This insight, however, ‘does not tell how to find the phonemes and morphemes in a direct, mechanical way’, but then ‘no other’ ‘theory’ does ‘either, and there is little reason to believe’ any can. This argument is meant to refute the ‘commonly held view’ that ‘syntactic theory is premature’ when ‘problems’ ‘on the lower levels’ are ‘unsolved’ -- ‘a faulty analogy between the order of development of linguistic theory and the presumed order of operations in discovery’ (SS 59f). ‘Developing one part of grammar’ is rather aided by a ‘picture’ of ‘a completed system’ (cf. 5.28; 13.43); and working down from ‘higher levels’ can bypass ‘absurd’ and ‘futile tasks’ like ‘stating principles of sentence construction in terms of phonemes and morphemes’ (cf. 733).

7.47 On ‘the level of phrase structure’, ‘each sentence is represented by a set of strings, not by a single string’ or ‘sequence’ as ‘on the level of phonemes, morphemes, or words’ (SS 32f, 47). This factor makes ‘phrase structure’ a ‘level’ of ‘different and nontrivial characteristics’, not able to be ‘subdivided’ into some ‘set of levels’ ‘ordered from higher to lower’ (SS 32f, 47). ‘The break between’ ‘phrase structure and the lower levels’ is ‘not arbitrary’, because ‘similarities’ and ‘dissimilarities’ of ‘representation’, such as ‘rules with ‘different formal proper

7.48 In SS, Chomsky resolves the levels into ‘a picture of grammars’ with ‘a natural tripartite arrangement’, each part having its own set of ‘rules’: ‘phrase structure’, ‘transformational structure’, and ‘morphophonemics’ (SS 45f, 114).20 ‘A grammar’ provides ‘rules’ for ‘reconstructing’ ‘phrase structure’ and for ‘converting strings of morphemes into strings of phonemes’; these two rule sets are ‘connected’ by ‘a sequence of transformational rules’ for ‘carrying strings with phrase structure into new strings to which morphophonemic rules can apply’ (SS 107). ‘Phrase structure’ is created by means of ‘instruction formulas’ called ‘rewriting rules’ (SS 26, 29, 110, AT 66). For example, , ‘X --> Y is interpreted: “rewrite X as Y”‘, or ‘A --> Z/X => Y’ as ‘category A is realized as the string Z’ ‘in the environment’ ‘of X to the left and Y to the right’; in ‘context free rules’, ‘X and Y are null, so that the rules apply independently of context’ (SS 26, 110, AT 66) (cf. 7.73f; 13.39).21 These ‘rules’ apply to ‘only a single element’ and proceed in a ‘sequence’ from an ‘initial symbol’ to ‘a terminal string’, where ‘no further rewriting is possible’ (SS 29f, 26f, AT 66f).22 The sequence of strings is termed a ‘derivation’, and each sentence has a ‘history of derivation’ (SS 37, 107) (cf. 7.52). A ‘system of rewriting rules’ can ‘present’ ‘grammatical information’ ‘in the most natural way’, state ‘grammatical functions’, and ‘provide a simple method for assigning a unique and appropriate phrase marker to a terminal string, given its derivation’ (AT 72, 66f).

7.49 For Chomsky, ‘the most obvious formal property of utterances is their bracketing into constituents’ (AT 12) (cf. 3.22; 4.59; 5.21, 50, 62; 7.63; 9.33; 13.26). Hence, ‘constitutent analysis’ by ‘parsing’ is ‘customary’ for ‘linguistic description on the syntactic level’ (SS 26) (cf. 11.3, 14, 16, 33f, 77, 79, 94). As a better means for ‘assigning’ or ‘imposing constituent structure’ and obtaining ‘valuable, even compelling evidence’ about it, Chomsky proposes ‘transformations’ (SS 83, 73, 81). ‘A grammatical transformation’ is ‘a rule’ that ‘applies to a string with a particular structural description’ (AT 89). So ‘a transformation is defined by the structural analysis of the strings’ it ‘applies’ to and by ‘the change it effects’ on them (SS 111, 61, 91). Reciprocally, ‘the representation of a string on the level of transformations is given by the terminal string’ and the ‘transformations by which it is derived’ (SS 91).

7.50 A ‘transformational treatment’, as we saw, is claimed to enhance ‘simplicity’ (SS 62, 72, 81) (cf. 7.36ff). As a ‘general principle’ stressed in SS, ‘if we have a transformation that simplifies the grammar and leads from sentences to sentences in a large number of cases’, we can ‘simplify the grammar even further’ by ‘assigning constituent structure to sentences in such a way that this transformation always leads to grammatical sentences’ (SS 83). And we can gain still more by using ‘the same underlying transformational pattern’ for several constructions, e.g. ‘negatives and interrogatives’ (SS 64ff). But in AT, Chomsky says these ‘operations’ need no ‘a priori justification’ by being ‘the most simple or “elementary” ones’ (AT 55). ‘“Elementary operations”‘ are often not ‘transformations at all, while those that are may be ‘far from elementary’. ‘Transformations’ ‘manipulate substrings only in terms of their assignment to categories’, ‘independently’ of the ‘length or internal complexity of strings belonging to these categories’. Evidently, Chomsky's hopes for simplicity were waning (cf. 7.36).

7.51 In SS, however, he had proposed to ‘simplify the description of English and gain new insight’ by ‘limiting the direct description in terms of phrase structure to a kernel of basic sentences: simple, declarative active with no complex verb or noun phrases’ (SS 106f, 47f, 61) (cf. 4.58, 65). ‘All other sentences can be described more simply as transforms’, i.e. ‘strings’ ‘derived’ by ‘simply statable transformations’ (SS 80, 48). Chomsky promised ‘clear and easily generalizable considerations of simplicity’ for ‘determining which set of sentences belongs to the kernel, and what sorts of transformations’ can ‘account for the non-kernel sentences’ (SS 77). We thus establish ‘the transformational level'23 where ‘an utterance is represented’ ‘abstractly’ via ‘a sequence of transformations by which it is derived, ultimately from kernel sentences’ (SS 47). The ‘kernel of the language’ can be ‘defined’ ‘as the set of sentences produced by obligatory transformations’ (SS 45f, 61, 91), i.e., those needed to make a sentence at all. All others pass through a ‘transformational history’, which might be a concept for ‘determining sentence types in ‘general’ (SS 71, 89, 91f, AT 130) (cf. 7.48; 13.28).

7.52 Neither SS nor AT remotely attempts to give a comprehensive list of transformations. A handful are cited for demonstration, such as those for ‘negations’, ‘questions’, ‘emphatic affirmatives’, ‘nominalizations’, and ‘comparatives’ (SS 61f, 64, 66, 72, 111-14, AT 178). When ‘adjective modifiers’ are derived from whole sentences (‘converting “the boy is tall” into “the tall boy”’) or ‘relative clauses’ (i.e. ‘“the boy who is tall”’) (SS 72 AT 217), we see simpler constructions being derived from more complex ones, not the other way around (cf. 13.54). And these very cases are adduced as ‘independent support’ and ‘syntactic justification for a transformational analysis’ (SS 73, AT 189).

7.53 A ‘paradigmatic instance’ is ‘the passive transformation’ (SS 77). Having ‘both actives and passives in the kernel’ would make ‘the grammar’ ‘much more complex’ ‘than if passives are deleted and reintroduced by a transformation’ for ‘inverting subject and object’, and, where needed, ‘deleting the unspecified agent’ (SS 77, 72, 79; AT 128) (cf. 9.70). ‘“Quantificational” sentences such as ‘“everyone in the room knows at least two languages”‘ versus ‘“at least two languages are known by everyone in the room”‘ lead to the overstatement that ‘not even the weakest semantic relation (factual equivalence) holds in general between active and passive’, whose ‘relations’ would ‘not have come to light’ in terms of ‘synonymy’ -- a warning against ‘vague semantic clues’ (SS 100f).24 But the relation, I think, is semantic similarity plus difference in focus: in each sentence, focus goes to the identity and size of the set named in subject position (just which persons versus just which languages), while the set named via direct object or prepositional agent is not in focus (any two languages versus anyone in the room).

7.54 In SS Chomsky felt that despite possible ‘difficulties and complications’, ‘the order’ must be defined among the ‘rules’ and ‘transformations’ (SS 32, 111, 114, 66, 44, 35). But in AT he sees ‘no way’ to decide ‘a priori’ whether ‘the rules’ should be ‘ordered’ or ‘unordered’, e.g., in terms of ‘simplicity’ or ‘elegance’ (AT 39). And ‘the theory of transformation markers permits a great deal of latitude’ in ‘the ordering of transformations’ (AT 133).25 Yet Chomsky evokes ‘strong reasons’ for ‘assigning an internal organization and an inherent order of derivation’ (AT 125).26 He uses the term ‘sequential derivations’ for those ‘preserving ordering’ and considers them central to ‘empirical studies of transformational grammar’ (AT 67, 211).

7.55 Chomsky limits his ‘theory of transformations’ by not covering ‘the full range of possibilities for stylistic inversion’ and ‘reordering’ (AT 126f, 227f). ‘Richly inflected languages’ (e.g. Russian, Mohawk) ‘tolerate’ much ‘reordering’ unless it ‘leads to ambiguity’ -- maybe a ‘universal’ constraint (AT 126f) (cf. 3.27). Again saving labour while advancing his cause, Chomsky insists that ‘the rules of stylistic reordering are very different from grammatical transformations’, the latter being ‘much more deeply embedded in the grammatical system’ and using ‘markers drawn from a fixed, universal language-independent set’, while ‘stylistic’ ones are ‘peripheral’, apply to ‘performance’, and ‘have no apparent bearing’ ‘on the theory of grammatical structure’ (AT 127, 222f).

7.56 In contrast to SS, AT presents ‘grammar’ with a different group of three parts: ‘a syntactic’, ‘a semantic’, and ‘a phonological component’ (AT 141, 16) (cf. 7.48; 9.30). Morphemics fades, and semantics finally enters the picture. The exact ‘relation between syntax and semantics’ is a question Chomsky often raises and fails to resolve, regarding it as ‘a side issue’ and a ‘dangerous ground’ fraught with ‘difficulties’, ‘confusion’, and ‘speculation’ (SS 93, 101n, AT 75, 148). But he devotes explicit sections to it in both SS (92-105) and AT (14-163). In SS, he follows structuralists like his teacher Harris (1951) by asserting ‘the independence of grammar’ from ‘semantics’ and ‘concluding that only a purely formal basis can provide a firm and productive foundation for construction of grammatical theory’ (SS 13, 106, 100) (cf. 5.61; 13.54). His own ‘theory’ is ‘completely formal and non-semantic’, ‘meaning’ being no more relevant for ‘constructing a grammar’ than ‘the hair colour of speakers’ (SS 93). The idea that ‘one can construct a grammar with appeal to meaning’ is ‘widely accepted’ (cf. 3.40) but ‘totally unsupported’, and obscures ‘important generalizations about linguistic structure’ (SS 93, 101, AT 32, 78).

7.57 So SS proffers ‘a purely negative discussion of the possibility of finding semantic foundations for syntactic theory’, leading to the ‘conclusion that grammar is autonomous and independent of meaning’ (SS 93, 17). Chomsky denies having ‘acquaintance with any detailed attempt to develop a theory of grammatical structure in partially semantic terms’ (though the traditional grammars he praises in AT were just this, 8.5, 12.41) nor any ‘rigorous proposal for using semantic information in constructing or evaluating grammars’. He turns the tables by placing ‘the burden of proof’ ‘completely on linguists who claim’ to ‘develop some grammatical notion in semantic terms’ (SS 94)27 -- a bold move, since the burden of proof should rest on the challenger of ‘widely accepted’ views. Or, he advocates postponement, because ‘a decision as to the boundary separating syntax and semantics’ ‘is not a prerequisite for theoretical and descriptive study’ (AT 160). ‘Correspondences’ and ‘relations between semantics and syntax can only be studied after the syntactic structure has been determined on independent grounds’ and within ‘some more general theory’ (SS 17n, 102, 108). Semantics can wait, but syntax can't (7.46).

7.58 Moreover, ‘despite the undeniable interest and importance of semantic’ ‘studies of language’, ‘semantic notions’ like ‘reference, significance, and synonymity’ ‘have no direct relevance’ for ‘characterizing the set of grammatical utterances’, which is the true ‘problem of syntactic research’ (SS 17, 102f).28 His demonstration that ‘“grammatical” cannot be identified with “meaningful” or “significant”‘ hinges on the now-famous sentence ‘“colourless green ideas sleep furiously”‘ (SS 15, AT 149). This example reminds me of the grammatical but nonsensical examples adduced by structuralists to show the self-sufficiency of ‘grammatical categories’, like Chomsky's own ‘“Pirots karulize etalically”‘; but he finds their ‘notion of “structural meaning”‘ ‘quite suspect’ and doubts ‘that the grammatical devices’ are ‘used consistently enough so that meaning can be assigned to them’ (SS 104, 108, cf. SS 103n, 104f) (cf. 5.61).29

7.59 Chomsky favours moving in the reverse direction: ‘linguists with a serious interest in semantics’ should ‘deepen and extend syntactic analysis’ to supplant ‘unanalysed semantic intuition’ (AT 75) (cf. 75). A ‘purely formal investigation’ of ‘syntactic structure’ ‘isolated and exhibited by the grammar’ is ‘rated more highly’ when it ‘provides’ ‘insight into problems of meaning and understanding’ and ‘supports semantic description’ (SS 93, 102, 12). He believes his own ‘requirement’ of a ‘completely formal discipline is perfectly compatible’ for making ‘significant interconnections with a parallel semantic theory’ (SS 103). However ‘imperfect’ and ‘inexact’, the ‘striking correspondences’ between ‘syntactic structures’ ‘in formal grammatical analysis and specific semantic functions’ at least indicate that ‘syntactic devices’ are ‘used fairly systematically’ (SS 104, 101f, 108).

7.60 Such arguments help us to appreciate why AT adds a ‘semantic component’ and states that ‘a descriptively adequate grammar’ is also ‘a theory of sentence interpretation’ (AT 76). Yet Chomsky hedges his bets by requiring ‘that all information utilized in semantic interpretation must be presented in the syntactic component’ (AT 75). At one point, he unexpectedly says he sees ‘no way to decide’ ‘a priori’ ‘whether the burden of presentation should fall on the syntactic or semantic component’; ‘in fact, it should not be taken for granted that syntactic and semantic considerations can be sharply distinguished’ (AT 77f). But this indecision applies specifically to an issue he is reluctant to face anyway, namely ‘deviant sentences’. ‘The syntactic component’ could only treat them via ‘structural similarities to perfectly well-formed’ ones, while the ‘semantic’ one could provide ‘lexical items’ to ‘specify’ ‘rules’ for ‘determining the incongruity’ (cf. 7.72). Chomsky proposes ‘selectional rules’ that ‘impose a hierarchy of deviation from grammaticalness’ on ‘sentences’ ‘generated by relaxing’ ‘constraints while keeping the grammar unchanged’ (AT 152). These rules could be ‘dropped from the syntax’ and put in ‘the semantic component’ with ‘little violence to structure of the grammar’ (AT 153, 158).

7.61 In practice, the role of meaning is marginalized by sticking close to the literal wording of sentences or by replacing words with symbols, and suggesting thereby that the relations between ‘transforms’ depend on the identities of written words or symbols rather than on similarities of meaning (cf. 13.33). Emblematic here Chomsky's use of ‘ambiguity and similarity of understanding’ as ‘a motivation for’ the ‘levels’ of ‘phrase structures’ and ‘transformations’, e.g., to clear up constructions like ‘“the shooting of the hunters”‘ or ‘“flying planes can be dangerous”‘ (SS 86ff, AT 21). ‘Transformational grammar’ offers a way to make ‘ambiguity’ ‘analysable in syntactic terms’ by treating it as ‘constructional homonymity’ (SS 28, 33, 86fn, 107). An ‘ambiguity’ signalled by some ‘intuitive similarity of utterances’ points to ‘dual representations on some level’, while ‘synonymity’ points to a ‘similar or identical representation’ (SS 107). A very simple case is ‘a phoneme sequence’ that can be ‘ambiguously understood’ in terms of ‘morphemes’ (e.g. /neym/ as ‘“a name” or “an aim”’) (SS 85) (a prominent trait of English).

7.62 But the most elaborate and explicit means for absorbing semantics and meaning into ‘the syntactic component’ is by adding the conception of a ‘deep structure’ that ‘underlies’ the ‘actual sentence’ and ‘determines its semantic interpretation’ (AT 16, 23, 135, 138). This ‘deep structure’ ‘expresses’ ‘grammatical relations’ and gets ‘mapped’ by ‘transformational rules’ into ‘surface structure’ (AT 99, 16f, 123, 99, 135). Chomsky complains that the ‘syntactic theories’ in ‘modern structural (taxonomic) linguistics’ ‘assume that deep and surface structures are actually the same’ (AT 16) (cf. 7.80). Making the two ‘distinct’ is now ‘the central idea of transformational grammar’, with ‘surface structure determined by repeated application of formal operations called “grammatical transformations” to objects of a more elementary sort’ (AT 16f) (which are also more abstract than ‘kernels’ and look less like sentences). Against ‘immediate constituent analysis’, which ‘may be adequate’ for the ‘surface’, Chomsky's ‘concern’ is ‘primarily with deep structure’ as the site of ‘the significant grammatical relations of an actual sentence’, while he finds ‘surface structure’ ‘unrevealing’ and ‘irrelevant’ (AT 17, 220, 24) (7.84, 82; cf. 13.47). He warns how ‘surface similarities hide underlying distinctions’ which ‘no English grammar has pointed out’, nor are they ‘clear to the speaker’ (AT 24, 22) (cf. 13.49). ‘Universality is claimed’ only for ‘deep structures’, whereas ‘surface structures’ need have no ‘uniformity’, as ‘modern linguistics’ has indeed ‘found’; this argument keeps their ‘findings’ from conflicting with ‘the hypotheses of universal grammarians’ (AT 118) (cf. 7.18f, 55).30 Also, ‘a very general, and perhaps universal way’ for ‘defining the system of grammatical relations’ is ascribed to the ‘categorial component’, which ‘determines the ordering of elements in deep structures’ (AT 122f).

7.63 This new design enables ‘grammatical functions’, which are ‘relational’ and sometimes have ‘traditional’ ‘names’ like ‘“Subject” and “Object”‘, to be ‘sharply distinguished from ‘grammatical categories’ like ‘“Noun Phrase”‘ and ‘“Verb Phrase'“ (AT 68f, 104) (cf. 4.69; 5.55, 530; 6.49; 9.46, 70; 11.34; 12.70, 79).31 ‘The so-called “grammatical Subject” belongs to ‘surface structure’, while ‘functional notions’ like ‘“logical Subject”‘ belong to ‘deep structure’; thus, a ‘“grammatical Subject” may be a ‘“logical Object”‘ (AT 70, 23, 68, 163, 230). This ‘difference’ ‘provides the primary motivation and empirical justification for the theory of transformational grammar’ (AT 70) (cf. 7.79). ‘A sentence’ is assumed to have ‘a basis’ whose ‘phrase markers’ ‘directly represent’ ‘semantically relevant information concerning grammatical function’ (AT 70, 117, 230). Yet ‘the extension to surface structure of such functional notions’ is not ‘entirely straightforward’, and ‘different definitions are needed’ (AT 220f). For example, ‘Topic-Comment’ might be ‘the basic grammatical relation of surface structure corresponding (roughly) to the fundamental Subject-Predicate relation of deep structure; ‘often’ ‘Topic and Subject coincide’. Disregarding communicative functions, Chomsky ‘defines the Topic’ ‘as the leftmost NP immediately dominated by S [Sentence] in surface structure and the Comment’ as ‘the rest of the string’ (but cf. 9.57, 927; 11.45, 64). Also, ‘case’ is attributed to ‘the position of the Noun in surface structure rather than deep’ (AT 221f), an idea ‘case grammar’ was later to dispute (cf. 11.28, 41).

7.64 The home of ‘underlying’ or ‘deep structure’ is ‘the base of the syntactic component’ (AT 17). This is ‘a system of rules that generate a highly restricted (perhaps finite) set of basic strings, each with an associated structural description called a base Phrase marker’ -- the ‘elementary unit’ of ‘deep structure’ (AT 17, 135). ‘The basis of the sentence’, namely its ‘underlying’ ‘sequence of base phrase markers’, gets ‘mapped into the sentence by the transformational rules’, which ‘automatically assign to the sentence a derived phrase marker, ultimately, a surface structure’ (AT 17, 128, 135). To ‘simplify the exposition’, Chomsky assumes that ‘no ambiguities are introduced by rules of the base’ (AT 17). The ‘“kernel sentences”‘ of SS are briefly mentioned as having ‘a single phrase marker’ and ‘involving a minimum of transformational apparatus’; but they have a merely ‘intuitive significance’ (AT 18) (cf. 7.51, 53, 82).

7.65 Being associated with ‘deep structure’ and ‘the categorial component’, ‘much of the structure of the base’ ‘is common to all languages’ (AT 123, 117) (cf. 7.62). Chomsky finds it ‘natural to suppose that formal properties of the base will provide the framework for the characterization of universal categories’: either the ‘rules’ themselves, or ‘constraints’ on the choice’ of ‘rules’ and ‘elementary structures’, or ‘a fixed’ ‘alphabet’ of ‘category symbols’ (AT 117, 141f). And ‘the form of the categorial component’ may be ‘determined by universal conditions that define “human language”‘ (AT 120f). This ‘traditional view’ dates back ‘at least to the Grammaire generale et raisonnee’ and in light of ‘relevant evidence’ remains ‘true’ ‘today’ -- and convenient too: ‘aspects of base structure’ ‘not specific to a particular language’ ‘need not be stated in the grammar’ (AT 117) (cf. 7.19f, 72).

7.66 Therefore, ‘the account of the base rules’ ‘may not belong to the grammar of English any more than the definition’ ‘of “transformation”‘ (AT 117f). ‘Not all phrase markers generated by the base will underlie actual sentences and thus qualify as deep structures’ (AT 138). But ‘the transformational rules provide’ a ‘very simple’ ‘test’, namely whether the ‘rules can generate’ a proper ‘surface structure’ from a ‘deep’ one. This ‘filtering function’ of the ‘rules’ was ‘true of the earlier version’ (in SS) as well though ‘never discussed’ (AT 191, 139), since ‘deep structure’ hadn't been proposed then. Transformations which didn't filter would be useless, since they wouldn't supply the restrictions the formalism requires.

7.67 ‘The syntactic component’ is the ‘sole “creative” part’ of the grammar’, while the other ‘two are purely interpretive’ and ‘play no part’ in ‘generating sentence structures’ (AT 135f, 141, 16) (cf. 7.44; 13.32). ‘The deep structure of a sentence is submitted to the semantic component for semantic interpretation, and its surface structure enters the phonological component’ for ‘phonetic interpretation’ (AT 135, 141, 16, 138, 99).32 ‘The final effect of a grammar, then, is to relate a semantic interpretation to a phonetic’ one -- ‘that is, to state how a sentence is interpreted’ (AT 136). ‘When we define deep structures’ as ‘“generated by the base”‘, we ‘assume that the semantic interpretation of a sentence depends only on its lexical items and the grammatical functions and relations represented in the underlying structures’. ‘This is the basic idea that has motivated the theory of transformational grammar since its inception’, though not ‘clearly formulated’ until Katz and Fodor (1963) (cf. 5.76; 7.71).33

7.68 Though they are ‘not our concern’ and have not been ‘worked out’, Chomsky thinks his two ‘interpretive components function in parallel ways’ (AT 143). ‘Phonological rules’ ‘apply’ ‘first to minimal elements (formatives), then to the constituents of which they are parts’, ‘and so on, until the maximal domain of phonological processes is reached’. ‘In this way a phonetic representation of the entire sentence is formed’ via ‘the intrinsic abstract’ ‘properties of its formatives’ and ‘categories’ ‘in the surface structure’, and ‘related’ to a ‘signal’ (AT 143f, 16). ‘Similarly, the projection rules of the semantic component operate on the deep structure’ and ‘assign’ ‘a “reading” to each constituent on the basis’ of those ‘assigned to its parts’, all the way down to ‘the intrinsic semantic properties of the formatives and categories’ ‘in the deep structure’ (AT 144) (cf. 7.82; 13.59).

7.69 Chomsky's ‘syntactic component’ centred between the other two, is naturally the one AT is ‘concerned with’ (AT 3), and is assigned a whole range diverse and complex duties. It provides ‘rules that specify the well-formed strings of minimal syntactically functioning units (“formatives”) and assigns structural information to these strings and strings which deviate from well-formedness’. It ‘specifies the infinite set of abstract formal objects, each of which incorporates all the information relevant to a single interpretation of a particular sentence’ (AT 16). It ‘contains a lexicon’, with each ‘item specified’ by its ‘intrinsic’ ‘features’, such as ‘Animate’ and ‘Human’ (AT 78, 82f, 85f, 150f, 153, 226) (cf. 4.69; 7.72, 74).34 Though they are evidently borrowed over from reality, Chomsky insists these features belong to ‘purely syntactic rules’ and to the ‘syntactic component, no matter how narrowly syntax is conceived’ (AT 150f). Maybe their absence in traditional grammar worries him.

7.70 The ‘lexicon’ is designed to be not just a list of ‘dictionary definitions’ (AT 37), but an enormously diverse and complex complement for the grammar (cf. 13.52). It contains not words as such, but ‘formatives’: ‘minimal syntactically functioning units’, including both ‘lexical items (“sincerity, boy”) and grammatical items (“Perfect, Possessive”)’ (AT 87, 3, 65) (cf. 13.30). Since ‘a careful grammar’ ‘reveals that many formatives have unique or almost unique grammatical characteristics’, ‘substantial simplification’ is gained (and labour saved) by putting all these ‘properties’ into ‘the lexicon’, ‘where they most naturally belong’, and ‘excluding’ them ‘from the rewriting rules’ (AT 86f) (cf. 13.59). We thus have the ‘advantage’ of making ‘the lexical entries’ absorb all ‘idiosyncrasies’ and ‘irregularities of the language’ (AT 86f, 216, 142, 214) (cf. 4.49, 52; 13.59; Sweet 1931:31).

7.71 The ‘entries’ in ‘the lexicon’ ‘each consist of a descriptive feature matrix and a complex symbol’ (AT 164, 222). The ‘features’ are ‘phonological’, ‘syntactic’, and, in the ‘definition’, ‘purely semantic’ (cf. Katz & Fodor 1963) (AT 214). The ‘semantic features’ are ‘distinct but related’ to the ‘syntactic’ ones and ‘constitute a well-defined set in a given grammar: a feature belongs’ here if ‘it is not referred to by any rule of the phonological or syntactic component’ (AT 88, 120, 142). Lest he imply that ‘the system of dictionary definitions’ is without ‘intrinsic structure’ and hence ‘atomistic’, Chomsky invokes ‘relations of meaning (rather than relations of fact)’ that ‘cannot in any natural way be described within’ ‘independent lexical entries’ (AT 160). Examples include the ‘“field properties”‘ studied in ‘componential analysis’, or the ‘referential domains’ of terms that are ‘mutually exclusive’, e.g. ‘colour words’ (AT 229, 160f, 229) (cf. 4.22; 5.68; 7.32). Moreover, Chomsky foresees ‘universal language-independent constraints on semantic features’ -- ‘in traditional terms, the system of possible concepts’ (AT 160) -- presumably innate rather than extracted from reality (but cf. 7.69; 13.24). This hope again rests on a parallel to ‘the theory of phonetic distinctive features’, which gives ‘a language-independent significance to the choice of symbol’ for ‘lexical formatives’ (cf. 7.20; 13.26); Chomsky ‘assumes’ that ‘the grammatical formatives and the category symbols’ in the lexicon ‘too are selected from a fixed universal vocabulary, although this assumption’ has no ‘effect’ on his ‘descriptive material’ (AT 65f, 160). Only ‘our ignorance of the relevant psychological and physiological facts’ hides the ‘a priori structure’ in ‘the system of “attainable concepts”‘ (AT 160).

7.72 The ‘lexical entry’ gets the tough jobs of supplying ‘information’ for ‘the phonological and semantic components’ and for ‘the transformational part of the syntactic component’, and of ‘determining the proper placement’ ‘in sentences, and hence by implication degree and manner of deviation of strings’ not ‘generated’ (again relieving the ‘rules’) (AT 87ff). Some ‘features’ are ‘inherent’ to a ‘formative’, being ‘part of the complex symbol of the lexical entry’, while others are ‘contextual’, being ‘introduced by grammatical rules’ when the ‘item’ is ‘inserted into a phrase marker’ (AT 171, 122, 176f). ‘This analysis of a formative as a pair of sets of features’ is ‘tentatively proposed’ ‘as a linguistic universal’ on ‘slender evidence’, but with the (labour-saving) corollary that the ‘features need not actually be mentioned’ ‘in the rules of the grammar’ (AT 181, 141f) (cf. 7.19, 65). Also, a possible explosion of features in the lexicon is to be counteracted by ‘redundancy rules’ for ‘ predictable’ ‘feature specifications’ -- another carry-over from the ‘phonological’ domain to the ‘syntactic’ and a prompt new candidate for ‘universal notation’ (e.g. ‘[+ Human]’ again) (AT 168f, 222, 166).

7.73 ‘A lexical entry is substituted’ in ‘positions’ where ‘its contextual features match those of the symbol’ (AT 121). Since the ‘rules’ can ‘assign’ ‘contextual features’ also to the ‘frames in which a symbol appears’, we have ‘context-sensitive rules’ in the lexicon instead of ‘context-free’, and thereby get them too out of ‘the grammar’ (AT 121f) (cf. 7.48; 13.52). In exchange, the ‘features determined by context’ are ‘noninherent’ and, being ‘unspecified in underlying structures’ and ‘added’ by ‘redundancy rules’, can make ‘no contribution to sentence interpretation’ (AT 182) (cf. 7.82). Or, ‘determining the restrictions’ via ‘feature specifications’ might be ‘preferable, since it does not affect the structure of the lexicon’, leaving it ‘simply a list of entries’ (AT 188). Chomsky wonders whether to ‘list in the lexicon only the features’ for ‘frames in which the item’ ‘cannot appear’; or to list those in which it ‘can appear’ and to assume it is ‘automatically’ ‘specified negatively’ for ‘every contextual feature not mentioned’ (AT 110f, 166, 230). Or, a ‘distinct subpart of the base of the syntactic component’ could contain ‘lexical rules’ derived by a ‘general analysis of lexical items’ and empowered to ‘introduce lexical formatives’ (AT 68, 190, 74).

7.74 Beyond ‘the lexicon’, Chomsky's ‘base’ has a ‘categorial component’, ‘defined’ as a ‘system of rewriting rules’ (AT 123, 120) (cf. 7.48). ‘Among the rewriting rules’ ‘we can distinguish branching rules’, which ‘analyse a category symbol’ ‘into a string of (one or more) symbols’, either ‘terminal’ or ‘nonterminal’; versus ‘subcategorization rules’, which ‘introduce syntactic features and thus form or extend a complex symbol’ (AT 112). Again, these two types of ‘rule may be context-free’, ‘introducing inherent features’ (like ‘[Human]’); or ‘context-sensitive’, ‘introducing contextual features’ (like ‘[Transitive]’) by ‘environment’ (AT 112, 120, 90, i.r.).35 In ‘theory’, ‘a grammar is obviously more highly valued if subcategorization is determined by a set of contexts that is syntactically definable’ (AT 97f). But midway through AT, Chomsky decides it would be less ‘restrictive’ and more ‘flexible’ to change ‘the base component’ in his ‘earlier proposal’ by discarding ‘subcategorization rules’ as ‘rewriting rules and ‘assigning’ them too ‘to the lexicon’, thus leaving only ‘branching rules’, ‘all context-free’ (like in a ‘simple phrase structure grammar’), inside ‘the categorial component’ (AT 123, 120, 128, 122). Now, ‘strict subcategorial and selectional restrictions’ get ‘listed in lexical entries’ and ‘defined by transformational rules’ (AT 139).

7.75 The place of morphology in the tripartite AT model, lacking the older ‘morphophonemic level’ (7.48), is merely sketched. Chomsky ‘compares’ ‘the traditional method of paradigms’ whose ‘dimensions’ are ‘inflectional categories of gender, number, case, and declensional type’, against ‘the descriptivist method of morphemic analysis’, which Chomsky (after Hockett) classes as an ‘“item-and-arrangement” grammar’ (AT 170, 172, 232) (cf. 4.57; 5.50, 74, 512; 6.34; 7.75f; 8.57; 9.31, 911; 12.71). The older ‘paradigmatic description’ can be ‘restated’ or ‘incorporated directly’ in ‘the theory of syntactic features’, using ‘not + and - but integers’ ‘associated with traditional designations’ (AT 171f).36 The concept of ‘morphemes’ in ‘modern linguistics’, in contrast, is ‘clumsy’ and ‘inelegant’ to ‘represent’ by ‘a grammar based on rewriting rules or transformations’, and is ‘in fact designed’ ‘to exclude’ ‘all but the most elementary general rules’ (AT 173, 323). ‘The order of morphemes is quite arbitrary, whereas this arbitrariness is avoided in paradigmatic treatment, the features being unordered’ (AT 174). Also, ‘many morphemes are not phonetically realized’ and so are treated, ‘in particular contexts, as zero elements’, such that ‘each case’ needs ‘a specific context-sensitive rule’; Chomsky finds these rules ‘superfluous’, and his ‘feature analysis simply gives no rule’ (AT 173). So he judges ‘the modern descriptivist reanalysis’ ‘in terms of morpheme sequences’ ‘an ill-advised theoretical innovation’ (AT 174) (cf. 7.5).

7.76 But he doesn't quite throw it out: ‘within our framework, either paradigmatic analysis’ by ‘features or sequential morphemic analysis is available, whichever permits the optimal and most general statement’ (AT 174).37 He foresees no ‘difficulty in extending the theory of transformations’ to ‘formalize traditional rules’ for ‘inflectional systems’ (AT 175f). Yet he admits that morphemic ‘derivation processes’ are a bigger ‘problem’ for ‘generative’ grammar because ‘they are typically sporadic and only quasi-productive’, even when ‘the meaning’ is ‘to some extent predictable’ from ‘inherent semantic properties of the morphemes’ (AT 184, 186). Still, ‘all presently known theories of language fail’ here, so ‘quasi-productive processes’ need not ‘support an alternative theory of grammar’ (AT 235f) (argued by Bolinger 1961).

7.77 Though I have had to pass over some technical details about ‘rules’, ‘features’, and so on, the major ‘aspects’ of Chomsky's ‘theory of syntax’ should now be evident. The two books I have summarized are far from being the whole story. He and his associates produced a flood of additional books and papers revising numerous aspects of the model or refuting objections to it; in fact, the grammar got more and more ‘transformational’ because it kept getting transformed, even to the point where many ‘transformations’ themselves were dropped, along with ‘deep structure’ (Chomsky 1977). But AT and SS were and still are by far the most widely read of his works, and Chomsky himself later (1971[1968]:184f) called AT ‘the standard theory’. I have treated the two books mainly as two advocacies of one overall project, while pointing out the important innovations and greater development of AT over SS. The vacuum left by not acknowledging his debt to ‘modern’ linguistics (7.5, 19, 29, 57, 76) is filled in AT by garnering sources among philosophers of previous centuries (7.3) and by citing himself as a main source. AT contains 113 references to 18 works he had by that time authored or co-authored,38 and relies heavily on works of his associates and followers (e.g. Halle, Miller, Katz, Fodor, Postal, Lees, Bever).

7.78 Above all, AT reveals a marked change in Chomsky's estimation of the project. He now presents the approach not merely as a way to make English grammar ‘simpler’, more ‘adequate’, and so on, but as a leading candidate in the investigation of ‘linguistic competence’, ‘innate language capacity’, ‘language acquisition, ‘language universals’, and ‘philosophy of mind’. To retain much of the conceptions and terminology originally designed for ‘grammatical description’, he makes them ‘systematically ambiguous’, even though the correspondence between a speaker's or a child's knowledge and ‘the linguist's account’ is precisely what linguistics has to demonstrate (cf. 7.15, 28; 13.45). He thus lends a more exalted significance to the rather traditional centrepiece of his project: analysing, describing, and comparing phrases or sentences in terms of grammatical structure.

7.79 Since many grammatical relations and distinctions are not formally marked in English, Chomsky promises to offset the lack by postulating ‘underlying structures’, ‘markers’, and ‘features’. He stresses superfical similarity with underlying difference, and underlying similarity with superficial difference (cf. 7.14, 53, 61, 82; 13.7, 30, 43, 54). We are justified in ‘transforming’ linguistic data to put such cases into clearer forms, which competing ‘grammars’ fail to do. He nourishes the prospect that, on the ‘underlying’ (‘deep’, ‘base’, etc.) level, structure is far more precise, distinct, orderly, and general than everyday utterances indicate. Down there, categories are ‘universal’ (‘determined by the nature of language as such’), relationships are crystal-clear, constraints on what may or may not be constructed are exactly stated, meanings are rigorously decomposed, and ambiguities are fully resolved. Such demanding goals force Chomsky to be evasive about his deep level, or to advance peculiar claims. To limit ‘functional’ ‘issues’ to ‘surface structure’, he says ‘order’ ‘plays no role in determining grammatical relations in deep structures’, even though he has already said ‘the categorial component determines the ordering of elements in deep structures’ (AT 220f, 122ff). To keep his ‘deep structure’ simple, he says ‘features determined by context’, being ‘unspecified in underlying structure’, can make ‘no contribution to sentence interpretation’ (AT 182) (7.72); but what can ‘context’ be if not an influence on how people interpret?

7.80 Chomsky's evasions are predictably clustered at points of contact between his technical, underlying apparatus and the facts or patterns it purports to ‘account for’ or ‘explain’. One evasion, already in SS, is to insist that ‘the transformational part of grammar applies’ ‘properly’ to ‘the forms’ (‘the terminal strings’ or ‘phrase markers’) that ‘underlie’ sentences, yet to show them applying to the sentences themselves (SS 45, 47f, 66, 71, 73, 88f, 91, 107 vs. SS 35f, 42f, 74f, 78f, 80ff). In AT too, he may make ‘no careful distinction’ ‘between the basic string and sentence itself’ and proceed on the ‘simplifying and contrary to fact assumption that the underlying basic string is the sentence’, and the ‘base phrase marker is the surface structure as well as the deep structure’ (AT 18). This matter is by no means trivial if making the two levels ‘distinct’ is ‘the central idea of transformational grammar’ (AT 16f) (7.62). Surely more precision could be attained, especially if Chomsky is ‘concerned’ ‘primarily’ with ‘extremely simple sentences’ e.g., those ‘with a single element in the basis’ (AT 63, 18). He condemns ‘modern structural (taxonomic) linguistics’ for ‘assuming that deep and surface structures are actually the same’, yet says his ‘grammar does not, in itself, provide any sensible procedure for finding a deep structure of a given sentence’ (AT 16, 141) (cf. 6.38, 61; 7.7). The pathway between surface and deep is further obscured by Chomsky's habit of ‘leaving out quite a few transformations’, ‘refinements’, qualifications’, or ‘details of formalization’ he says aren't ‘relevant’ (AT 131, 92, 81f, 222).

7.81 Also, he makes use of scant data from English, and far less from other languages. Most examples are merely quoted, or are listed without further analysis as ‘grammatical’ or ‘deviant’. Only 28 invented sentences in SS and 24 in AT are analysed, i.e., broken into constituents, formalized, paraphrased or ‘transformed’. One sentence (‘“sincerity may frighten the boy”’) and its alterations gets analysed and discussed over 26 pages in the two books (SS 42f, 78, AT 63ff, 68f, 71, 73, 75f, 85f, 107, 111, 119, 149, 152f, 157, 165, 211, 228, 230) -- a lot of work for it (Humpty Dumpty would have paid it royally). ‘Diagramming’ a ‘“tree structure”‘, in which ‘a sequence of words is a constituent’ if it is ‘traced’ ‘back to a single point’ (SS 27f, AT 12), is done for one sentence in SS and 6 in AT. The trees are called ‘(base) phrase markers’ (AT 65, 69, 71, 86, 108f, 128, 130f, 171, 184), but one is called a ‘deep structure’ (AT 178) (compare the much simpler ‘deep structure’ of ‘Noun phrase -- Verb -- Noun phrase -- Sentence’ on AT 23).39 Chomsky calmly remarks that ‘the interpretation of such a diagram is transparent’, and that ‘the procedure for constructing a phrase marker’ from a ‘derivation’ is a ‘minor matter of appropriate formalization and involves nothing of principle’ (AT 164, 107) (cf. 7.86).

7.82 Yet another evasion regards the role of underlying entities in the process of language understanding. Chomsky proposes to ‘judge formal theories’ by ‘their ability to explain’ and ‘clarify’ ‘facts’ about how ‘sentences are used and understood (SS 102). In SS, he stipulates that ‘understanding a sentence’ requires ‘knowing the kernel sentences from which it originates (more precisely the terminal strings underlying these)’, and raises the hope that ‘the general problem of analysing the process of “understanding”‘ might be ‘reduced’ to ‘explaining how kernel sentences are understood’ (SS 87, 92, 104n, 107f). In AT, he announces (with no mention of his earlier statement) that ‘kernel sentences’ ‘play no distinctive role’ in the ‘interpretation of sentences’ (AT 18). In return, ‘phrase markers’ are now presented as ‘the elementary content elements from which semantic interpretations of actual sentences are constructed’ -- an ‘insight as old as syntactic theory itself’ (AT 117, 221) (though he didn't know it in 1957). ‘The manner of combination provided by the surface’ ‘structure is in general almost totally irrelevant to semantic interpretation, whereas the grammatical relations expressed in deep structure are in many cases just those that determine the meaning of the sentence’ (AT 162; cf. AT 135) (cf. 7.84; 11.100; 13.47). This ‘determining’ requires that ‘the meaning of a sentence be based on the meaning of its elementary parts and manner of their combination’ (AT 162f) (but cf. 2.29; 5.67, 75f; 11.36; 12.93; 13.59). The requirement is congenial for an approach based on analysing and assembling structures, but is violated even by his own handpicked samples. His interpretation of ‘“I had a book stolen”‘ depends not on the parts and their combination, but on ‘elaborations of the sentence’ to supply contexts (like ‘“I had a book stolen from his library by a professional thief who I hired”’) -- just the recourse to ‘entirely different constructions’ he later calls ‘irrelevant’ (AT 21, 161).

7.83 A similar evasion pertains to the production of sentences. In SS, Chomsky cautions against a ‘misunderstanding’ that ‘grammars’ ‘described’ as ‘devices for generating sentences’ are ‘concerned with the process of producing utterances rather than’ ‘analysing and reconstructing’ their ‘structure’ (SS 48). The ‘synthesis and analysis’ ‘speaker and hearer