Language by Edward Sapir

tyrannical hold on our mind. Intermingled with this scientific prejudice and largely anticipating it was another, a more human one. The vast majority of linguistic theorists themselves spoke languages of a certain type, of which the most fully developed varieties were the Latin and Greek that they had learned in their childhood. It was not difficult for them to be persuaded that these familiar languages represented the “highest” development that speech had yet attained and that all other types were but steps on the way to this beloved “inflective” type. Whatever conformed to the pattern of Sanskrit and Greek and Latin and German was accepted as expressive of the “highest,” whatever departed from it was frowned upon as a shortcoming or was at best an interesting aberration.[93] Now any classification that starts with preconceived values or that works up to sentimental satisfactions is self-condemned as unscientific. A linguist that insists on talking about the Latin type of morphology as though it were necessarily the high-water mark of linguistic development is like the zooelogist that sees in the organic world a huge conspiracy to evolve the race-horse or the Jersey cow. Language in its fundamental forms is the symbolic expression of human intuitions. These may shape themselves in a hundred ways, regardless of the material advancement or backwardness of the people that handle the forms, of which, it need hardly be said, they are in the main unconscious. If, therefore, we wish to understand language in its true inwardness we must disabuse our minds of preferred “values”[94] and accustom ourselves to look upon English and Hottentot with the same cool, yet interested, detachment.

[Footnote 93: One celebrated American writer on culture and language delivered himself of the dictum that, estimable as the speakers of agglutinative languages might be, it was nevertheless a crime for an inflecting woman to marry an agglutinating man. Tremendous spiritual values were evidently at stake. Champions of the “inflective” languages are wont to glory in the very irrationalities of Latin and Greek, except when it suits them to emphasize their profoundly “logical” character. Yet the sober logic of Turkish or Chinese leaves them cold. The glorious irrationalities and formal complexities of many “savage” languages they have no stomach for. Sentimentalists are difficult people.]

[Footnote 94: I have in mind valuations of form as such. Whether or not a language has a large and useful vocabulary is another matter. The actual size of a vocabulary at a given time is not a thing of real interest to the linguist, as all languages have the resources at their disposal for the creation of new words, should need for them arise. Furthermore, we are not in the least concerned with whether or not a language is of great practical value or is the medium of a great culture. All these considerations, important from other standpoints, have nothing to do with form value.]

We come back to our first difficulty. What point of view shall we adopt for our classification? After all that we have said about grammatical form in the preceding chapter, it is clear that we cannot now make the distinction between form languages and formless languages that used to appeal to some of the older writers. Every language can and must express the fundamental syntactic relations even though there is not a single affix to be found in its vocabulary. We conclude that every language is a form language. Aside from the expression of pure relation a language may, of course, be “formless”–formless, that is, in the mechanical and rather superficial sense that it is not encumbered by the use of non-radical elements. The attempt has sometimes been made to formulate a distinction on the basis of “inner form.” Chinese, for instance, has no formal elements pure and simple, no “outer form,” but it evidences a keen sense of relations, of the difference between subject and object, attribute and predicate, and so on. In other words, it has an “inner form” in the same sense in which Latin possesses it, though it is outwardly “formless” where Latin is outwardly “formal.” On the other hand, there are supposed to be languages[95] which have no true grasp of the fundamental relations but content themselves with the more or less minute expression of material ideas, sometimes with an exuberant display of “outer form,” leaving the pure relations to be merely inferred from the context. I am strongly inclined to believe that this supposed “inner formlessness” of certain languages is an illusion. It may well be that in these languages the relations are not expressed in as immaterial a way as in Chinese or even as in Latin,[96] or that the principle of order is subject to greater fluctuations than in Chinese, or that a tendency to complex derivations relieves the language of the necessity of expressing certain relations as explicitly as a more analytic language would have them expressed.[97] All this does not mean that the languages in question have not a true feeling for the fundamental relations. We shall therefore not be able to use the notion of “inner formlessness,” except in the greatly modified sense that syntactic relations may be fused with notions of another order. To this criterion of classification we shall have to return a little later.

[Footnote 95: E.g., Malay, Polynesian.]

[Footnote 96: Where, as we have seen, the syntactic relations are by no means free from an alloy of the concrete.]

[Footnote 97: Very much as an English _cod-liver oil_ dodges to some extent the task of explicitly defining the relations of the three nouns. Contrast French _huile de foie de morue_ “oil of liver of cod.”]

More justifiable would be a classification according to the formal processes[98] most typically developed in the language. Those languages that always identify the word with the radical element would be set off as an “isolating” group against such as either affix modifying elements (affixing languages) or possess the power to change the significance of the radical element by internal changes (reduplication; vocalic and consonantal change; changes in quantity, stress, and pitch). The latter type might be not inaptly termed “symbolic” languages.[99] The affixing languages would naturally subdivide themselves into such as are prevailingly prefixing, like Bantu or Tlingit, and such as are mainly or entirely suffixing, like Eskimo or Algonkin or Latin. There are two serious difficulties with this fourfold classification (isolating, prefixing, suffixing, symbolic). In the first place, most languages fall into more than one of these groups. The Semitic languages, for instance, are prefixing, suffixing, and symbolic at one and the same time. In the second place, the classification in its bare form is superficial. It would throw together languages that differ utterly in spirit merely because of a certain external formal resemblance. There is clearly a world of difference between a prefixing language like Cambodgian, which limits itself, so far as its prefixes (and infixes) are concerned, to the expression of derivational concepts, and the Bantu languages, in which the prefixed elements have a far-reaching significance as symbols of syntactic relations. The classification has much greater value if it is taken to refer to the expression of relational concepts[100] alone. In this modified form we shall return to it as a subsidiary criterion. We shall find that the terms “isolating,” “affixing,” and “symbolic” have a real value. But instead of distinguishing between prefixing and suffixing languages, we shall find that it is of superior interest to make another distinction, one that is based on the relative firmness with which the affixed elements are united with the core of the word.[101]

[Footnote 98: See Chapter IV.]

[Footnote 99: There is probably a real psychological connection between symbolism and such significant alternations as _drink_, _drank_, _drunk_ or Chinese _mai_ (with rising tone) “to buy” and _mai_ (with falling tone) “to sell.” The unconscious tendency toward symbolism is justly emphasized by recent psychological literature. Personally I feel that the passage from _sing_ to _sang_ has very much the same feeling as the alternation of symbolic colors–e.g., green for safe, red for danger. But we probably differ greatly as to the intensity with which we feel symbolism in linguistic changes of this type.]

[Footnote 100: Pure or “concrete relational.” See Chapter V.]

[Footnote 101: In spite of my reluctance to emphasize the difference between a prefixing and a suffixing language, I feel that there is more involved in this difference than linguists have generally recognized. It seems to me that there is a rather important psychological distinction between a language that settles the formal status of a radical element before announcing it–and this, in effect, is what such languages as Tlingit and Chinook and Bantu are in the habit of doing–and one that begins with the concrete nucleus of a word and defines the status of this nucleus by successive limitations, each curtailing in some degree the generality of all that precedes. The spirit of the former method has something diagrammatic or architectural about it, the latter is a method of pruning afterthoughts. In the more highly wrought prefixing languages the word is apt to affect us as a crystallization of floating elements, the words of the typical suffixing languages (Turkish, Eskimo, Nootka) are “determinative” formations, each added element determining the form of the whole anew. It is so difficult in practice to apply these elusive, yet important, distinctions that an elementary study has no recourse but to ignore them.]

There is another very useful set of distinctions that can be made, but these too must not be applied exclusively, or our classification will again be superficial. I refer to the notions of “analytic,” “synthetic,” and “polysynthetic.” The terms explain themselves. An analytic language is one that either does not combine concepts into single words at all (Chinese) or does so economically (English, French). In an analytic language the sentence is always of prime importance, the word is of minor interest. In a synthetic language (Latin, Arabic, Finnish) the concepts cluster more thickly, the words are more richly chambered, but there is a tendency, on the whole, to keep the range of concrete significance in the single word down to a moderate compass. A polysynthetic language, as its name implies, is more than ordinarily synthetic. The elaboration of the word is extreme. Concepts which we should never dream of treating in a subordinate fashion are symbolized by derivational affixes or “symbolic” changes in the radical element, while the more abstract notions, including the syntactic relations, may also be conveyed by the word. A polysynthetic language illustrates no principles that are not already exemplified in the more familiar synthetic languages. It is related to them very much as a synthetic language is related to our own analytic English.[102] The three terms are purely quantitative–and relative, that is, a language may be “analytic” from one standpoint, “synthetic” from another. I believe the terms are more useful in defining certain drifts than as absolute counters. It is often illuminating to point out that a language has been becoming more and more analytic in the course of its history or that it shows signs of having crystallized from a simple analytic base into a highly synthetic form.[103]

[Footnote 102: English, however, is only analytic in tendency. Relatively to French, it is still fairly synthetic, at least in certain aspects.]

[Footnote 103: The former process is demonstrable for English, French, Danish, Tibetan, Chinese, and a host of other languages. The latter tendency may be proven, I believe, for a number of American Indian languages, e.g., Chinook, Navaho. Underneath their present moderately polysynthetic form is discernible an analytic base that in the one case may be roughly described as English-like, in the other, Tibetan-like.]

We now come to the difference between an “inflective” and an “agglutinative” language. As I have already remarked, the distinction is a useful, even a necessary, one, but it has been generally obscured by a number of irrelevancies and by the unavailing effort to make the terms cover all languages that are not, like Chinese, of a definitely isolating cast. The meaning that we had best assign to the term “inflective” can be gained by considering very briefly what are some of the basic features of Latin and Greek that have been looked upon as peculiar to the inflective languages. First of all, they are synthetic rather than analytic. This does not help us much. Relatively to many another language that resembles them in broad structural respects, Latin and Greek are not notably synthetic; on the other hand, their modern descendants, Italian and Modern Greek, while far more analytic[104] than they, have not departed so widely in structural outlines as to warrant their being put in a distinct major group. An inflective language, we must insist, may be analytic, synthetic, or polysynthetic.

[Footnote 104: This applies more particularly to the Romance group: Italian, Spanish, Portuguese, French, Roumanian. Modern Greek is not so clearly analytic.]

Latin and Greek are mainly affixing in their method, with the emphasis heavily on suffixing. The agglutinative languages are just as typically affixing as they, some among them favoring prefixes, others running to the use of suffixes. Affixing alone does not define inflection. Possibly everything depends on just what kind of affixing we have to deal with. If we compare our English words _farmer_ and _goodness_ with such words as _height_ and _depth_, we cannot fail to be struck by a notable difference in the affixing technique of the two sets. The _-er_ and _-ness_ are affixed quite mechanically to radical elements which are at the same time independent words (_farm_, _good_). They are in no sense independently significant elements, but they convey their meaning (agentive, abstract quality) with unfailing directness. Their use is simple and regular and we should have no difficulty in appending them to any verb or to any adjective, however recent in origin. From a verb _to camouflage_ we may form the noun _camouflager_ “one who camouflages,” from an adjective _jazzy_ proceeds with perfect ease the noun _jazziness_. It is different with _height_ and _depth_. Functionally they are related to _high_ and _deep_ precisely as is _goodness_ to _good_, but the degree of coalescence between radical element and affix is greater. Radical element and affix, while measurably distinct, cannot be torn apart quite so readily as could the _good_ and _-ness_ of _goodness_. The _-t_ of _height_ is not the typical form of the affix (compare _strength_, _length_, _filth_, _breadth_, _youth_), while _dep-_ is not identical with _deep_. We may designate the two types of affixing as “fusing” and “juxtaposing.” The juxtaposing technique we may call an “agglutinative” one, if we like.

Is the fusing technique thereby set off as the essence of inflection? I am afraid that we have not yet reached our goal. If our language were crammed full of coalescences of the type of _depth_, but if, on the other hand, it used the plural independently of verb concord (e.g., _the books falls_ like _the book falls_, or _the book fall_ like _the books fall_), the personal endings independently of tense (e.g., _the book fells_ like _the book falls_, or _the book fall_ like _the book fell_), and the pronouns independently of case (e.g., _I see he_ like _he sees me_, or _him see the man_ like _the man sees him_), we should hesitate to describe it as inflective. The mere fact of fusion does not seem to satisfy us as a clear indication of the inflective process. There are, indeed, a large number of languages that fuse radical element and affix in as complete and intricate a fashion as one could hope to find anywhere without thereby giving signs of that particular kind of formalism that marks off such languages as Latin and Greek as inflective.

What is true of fusion is equally true of the “symbolic” processes.[105] There are linguists that speak of alternations like _drink_ and _drank_ as though they represented the high-water mark of inflection, a kind of spiritualized essence of pure inflective form. In such Greek forms, nevertheless, as _pepomph-a_ “I have sent,” as contrasted with _pemp-o_ “I send,” with its trebly symbolic change of the radical element (reduplicating _pe-_, change of _e_ to _o_, change of _p_ to _ph_), it is rather the peculiar alternation of the first person singular _-a_ of the perfect with the _-o_ of the present that gives them their inflective cast. Nothing could be more erroneous than to imagine that symbolic changes of the radical element, even for the expression of such abstract concepts as those of number and tense, is always associated with the syntactic peculiarities of an inflective language. If by an “agglutinative” language we mean one that affixes according to the juxtaposing technique, then we can only say that there are hundreds of fusing and symbolic languages–non-agglutinative by definition–that are, for all that, quite alien in spirit to the inflective type of Latin and Greek. We can call such languages inflective, if we like, but we must then be prepared to revise radically our notion of inflective form.

[Footnote 105: See pages 133, 134.]

[Transcriber’s note: Footnote 105 refers to the paragraph beginning on line 4081.]

It is necessary to understand that fusion of the radical element and the affix may be taken in a broader psychological sense than I have yet indicated. If every noun plural in English were of the type of _book_: _books_, if there were not such conflicting patterns as _deer_: _deer_, _ox_: _oxen_, _goose_: _geese_ to complicate the general form picture of plurality, there is little doubt that the fusion of the elements _book_ and _-s_ into the unified word _books_ would be felt as a little less complete than it actually is. One reasons, or feels, unconsciously about the matter somewhat as follows:–If the form pattern represented by the word _books_ is identical, as far as use is concerned, with that of the word _oxen_, the pluralizing elements _-s_ and _-en_ cannot have quite so definite, quite so autonomous, a value as we might at first be inclined to suppose. They are plural elements only in so far as plurality is predicated of certain selected concepts. The words _books_ and _oxen_ are therefore a little other than mechanical combinations of the symbol of a thing (_book_, _ox_) and a clear symbol of plurality. There is a slight psychological uncertainty or haze about the juncture in _book-s_ and _ox-en_. A little of the force of _-s_ and _-en_ is anticipated by, or appropriated by, the words _book_ and _ox_ themselves, just as the conceptual force of _-th_ in _dep-th_ is appreciably weaker than that of _-ness_ in _good-ness_ in spite of the functional parallelism between _depth_ and _goodness_. Where there is uncertainty about the juncture, where the affixed element cannot rightly claim to possess its full share of significance, the unity of the complete word is more strongly emphasized. The mind must rest on something. If it cannot linger on the constituent elements, it hastens all the more eagerly to the acceptance of the word as a whole. A word like _goodness_ illustrates “agglutination,” _books_ “regular fusion,” _depth_ “irregular fusion,” _geese_ “symbolic fusion” or “symbolism.”[106]

[Footnote 106: The following formulae may prove useful to those that are mathematically inclined. Agglutination: c = a + b; regular fusion: c = a + (b – x) + x; irregular fusion: c = (a – x) + (b – y) + (x + y); symbolism: c = (a – x) + x. I do not wish to imply that there is any mystic value in the process of fusion. It is quite likely to have developed as a purely mechanical product of phonetic forces that brought about irregularities of various sorts.]

The psychological distinctness of the affixed elements in an agglutinative term may be even more marked than in the _-ness_ of _goodness_. To be strictly accurate, the significance of the _-ness_ is not quite as inherently determined, as autonomous, as it might be. It is at the mercy of the preceding radical element to this extent, that it requires to be preceded by a particular type of such element, an adjective. Its own power is thus, in a manner, checked in advance. The fusion here, however, is so vague and elementary, so much a matter of course in the great majority of all cases of affixing, that it is natural to overlook its reality and to emphasize rather the juxtaposing or agglutinative nature of the affixing process. If the _-ness_ could be affixed as an abstractive element to each and every type of radical element, if we could say _fightness_ (“the act or quality of fighting”) or _waterness_ (“the quality or state of water”) or _awayness_ (“the state of being away”) as we can say _goodness_ (“the state of being good”), we should have moved appreciably nearer the agglutinative pole. A language that runs to synthesis of this loose-jointed sort may be looked upon as an example of the ideal agglutinative type, particularly if the concepts expressed by the agglutinated elements are relational or, at the least, belong to the abstracter class of derivational ideas.

Instructive forms may be cited from Nootka. We shall return to our “fire in the house.”[107] The Nootka word _inikw-ihl_ “fire in the house” is not as definitely formalized a word as its translation, suggests. The radical element _inikw-_ “fire” is really as much of a verbal as of a nominal term; it may be rendered now by “fire,” now by “burn,” according to the syntactic exigencies of the sentence. The derivational element _-ihl_ “in the house” does not mitigate this vagueness or generality; _inikw-ihl_ is still “fire in the house” or “burn in the house.” It may be definitely nominalized or verbalized by the affixing of elements that are exclusively nominal or verbal in force. For example, _inikw-ihl-‘i_, with its suffixed article, is a clear-cut nominal form: “the burning in the house, the fire in the house”; _inikw-ihl-ma_, with its indicative suffix, is just as clearly verbal: “it burns in the house.” How weak must be the degree of fusion between “fire in the house” and the nominalizing or verbalizing suffix is apparent from the fact that the formally indifferent _inikwihl_ is not an abstraction gained by analysis but a full-fledged word, ready for use in the sentence. The nominalizing _-‘i_ and the indicative _-ma_ are not fused form-affixes, they are simply additions of formal import. But we can continue to hold the verbal or nominal nature of _inikwihl_ in abeyance long before we reach the _-‘i_ or _-ma_. We can pluralize it: _inikw-ihl-‘minih_; it is still either “fires in the house” or “burn plurally in the house.” We can diminutivize this plural: _inikw-ihl-‘minih-‘is_, “little fires in the house” or “burn plurally and slightly in the house.” What if we add the preterit tense suffix _-it_? Is not _inikw-ihl-‘minih-‘is-it_ necessarily a verb: “several small fires were burning in the house”? It is not. It may still be nominalized; _inikwihl’minih’isit-‘i_ means “the former small fires in the house, the little fires that were once burning in the house.” It is not an unambiguous verb until it is given a form that excludes every other possibility, as in the indicative _inikwihl-minih’isit-a_ “several small fires were burning in the house.” We recognize at once that the elements _-ihl_, _-‘minih_, _-‘is_, and _-it_, quite aside from the relatively concrete or abstract nature of their content and aside, further, from the degree of their outer (phonetic) cohesion with the elements that precede them, have a psychological independence that our own affixes never have. They are typically agglutinated elements, though they have no greater external independence, are no more capable of living apart from the radical element to which they are suffixed, than the _-ness_ and _goodness_ or the _-s_ of _books_. It does not follow that an agglutinative language may not make use of the principle of fusion, both external and psychological, or even of symbolism to a considerable extent. It is a question of tendency. Is the formative slant clearly towards the agglutinative method? Then the language is “agglutinative.” As such, it may be prefixing or suffixing, analytic, synthetic, or polysynthetic.

[Footnote 107: See page 110.]

[Transcriber’s note: Footnote 107 refers to the paragraph beginning on line 3331.]

To return to inflection. An inflective language like Latin or Greek uses the method of fusion, and this fusion has an inner psychological as well as an outer phonetic meaning. But it is not enough that the fusion operate merely in the sphere of derivational concepts (group II),[108] it must involve the syntactic relations, which may either be expressed in unalloyed form (group IV) or, as in Latin and Greek, as “concrete relational concepts” (group III).[109] As far as Latin and Greek are concerned, their inflection consists essentially of the fusing of elements that express logically impure relational concepts with radical elements and with elements expressing derivational concepts. Both fusion as a general method and the expression of relational concepts in the word are necessary to the notion of “inflection.”

[Footnote 108: See Chapter V.]

[Footnote 109: If we deny the application of the term “inflective” to fusing languages that express the syntactic relations in pure form, that is, without the admixture of such concepts as number, gender, and tense, merely because such admixture is familiar to us in Latin and Greek, we make of “inflection” an even more arbitrary concept than it need be. At the same time it is true that the method of fusion itself tends to break down the wall between our conceptual groups II and IV, to create group III. Yet the possibility of such “inflective” languages should not be denied. In modern Tibetan, for instance, in which concepts of group II are but weakly expressed, if at all, and in which the relational concepts (e.g., the genitive, the agentive or instrumental) are expressed without alloy of the material, we get many interesting examples of fusion, even of symbolism. _Mi di_, e.g., “man this, the man” is an absolutive form which may be used as the subject of an intransitive verb. When the verb is transitive (really passive), the (logical) subject has to take the agentive form. _Mi di_ then becomes _mi di_ “by the man,” the vowel of the demonstrative pronoun (or article) being merely lengthened. (There is probably also a change in the tone of the syllable.) This, of course, is of the very essence of inflection. It is an amusing commentary on the insufficiency of our current linguistic classification, which considers “inflective” and “isolating” as worlds asunder, that modern Tibetan may be not inaptly described as an isolating language, aside from such examples of fusion and symbolism as the foregoing.]

But to have thus defined inflection is to doubt the value of the term as descriptive of a major class. Why emphasize both a technique and a particular content at one and the same time? Surely we should be clear in our minds as to whether we set more store by one or the other. “Fusional” and “symbolic” contrast with “agglutinative,” which is not on a par with “inflective” at all. What are we to do with the fusional and symbolic languages that do not express relational concepts in the word but leave them to the sentence? And are we not to distinguish between agglutinative languages that express these same concepts in the word–in so far inflective-like–and those that do not? We dismissed the scale: analytic, synthetic, polysynthetic, as too merely quantitative for our purpose. Isolating, affixing, symbolic–this also seemed insufficient for the reason that it laid too much stress on technical externals. Isolating, agglutinative, fusional, and symbolic is a preferable scheme, but still skirts the external. We shall do best, it seems to me, to hold to “inflective” as a valuable suggestion for a broader and more consistently developed scheme, as a hint for a classification based on the nature of the concepts expressed by the language. The other two classifications, the first based on degree of synthesis, the second on degree of fusion, may be retained as intercrossing schemes that give us the opportunity to subdivide our main conceptual types.

It is well to recall that all languages must needs express radical concepts (group I) and relational ideas (group IV). Of the two other large groups of concepts–derivational (group II) and mixed relational (group III)–both may be absent, both present, or only one present. This gives us at once a simple, incisive, and absolutely inclusive method of classifying all known languages. They are:

A. Such as express only concepts of groups I and IV; in other words, languages that keep the syntactic relations pure and that do not possess the power to modify the significance of their radical elements by means of affixes or internal changes.[110] We may call these _Pure-relational non-deriving languages_ or, more tersely, _Simple Pure-relational languages_. These are the languages that cut most to the bone of linguistic expression.

B. Such as express concepts of groups I, II, and IV; in other words, languages that keep the syntactic relations pure and that also possess the power to modify the significance of their radical elements by means of affixes or internal changes. These are the _Pure-relational deriving languages_ or _Complex Pure-relational languages_.

C. Such as express concepts of groups I and III;[111] in other words, languages in which the syntactic relations are expressed in necessary connection with concepts that are not utterly devoid of concrete significance but that do not, apart from such mixture, possess the power to modify the significance of their radical elements by means of affixes or internal changes.[112] These are the _Mixed-relational non-deriving languages_ or _Simple Mixed-relational languages_.

D. Such as express concepts of groups I, II, and III; in other words, languages in which the syntactic relations are expressed in mixed form, as in C, and that also possess the power to modify the significance of their radical elements by means of affixes or internal changes. These are the _Mixed-relational deriving languages_ or _Complex Mixed-relational languages_. Here belong the “inflective” languages that we are most familiar with as well as a great many “agglutinative” languages, some “polysynthetic,” others merely synthetic.

[Footnote 110: I am eliminating entirely the possibility of compounding two or more radical elements into single words or word-like phrases (see pages 67-70). To expressly consider compounding in the present survey of types would be to complicate our problem unduly. Most languages that possess no derivational affixes of any sort may nevertheless freely compound radical elements (independent words). Such compounds often have a fixity that simulates the unity of single words.]

[Transcriber’s note: Footnote 110 refers to the three paragraphs beginning on line 2066.]

[Footnote 111: We may assume that in these languages and in those of type D all or most of the relational concepts are expressed in “mixed” form, that such a concept as that of subjectivity, for instance, cannot be expressed without simultaneously involving number or gender or that an active verb form must be possessed of a definite tense. Hence group III will be understood to include, or rather absorb, group IV. Theoretically, of course, certain relational concepts may be expressed pure, others mixed, but in practice it will not be found easy to make the distinction.]

[Footnote 112: The line between types C and D cannot be very sharply drawn. It is a matter largely of degree. A language of markedly mixed-relational type, but of little power of derivation pure and simple, such as Bantu or French, may be conveniently put into type C, even though it is not devoid of a number of derivational affixes. Roughly speaking, languages of type C may be considered as highly analytic (“purified”) forms of type D.]

This conceptual classification of languages, I must repeat, does not attempt to take account of the technical externals of language. It answers, in effect, two fundamental questions concerning the translation of concepts into linguistic symbols. Does the language, in the first place, keep its radical concepts pure or does it build up its concrete ideas by an aggregation of inseparable elements (types A and C _versus_ types B and D)? And, in the second place, does it keep the basic relational concepts, such as are absolutely unavoidable in the ordering of a proposition, free of an admixture of the concrete or not (types A and B _versus_ types C and D)? The second question, it seems to me, is the more fundamental of the two. We can therefore simplify our classification and present it in the following form: _
I. Pure-relational _/ A. Simple
Languages \_ B. Complex
_
II. Mixed-relational _/ C. Simple
Languages \_ D. Complex

The classification is too sweeping and too broad for an easy, descriptive survey of the many varieties of human speech. It needs to be amplified. Each of the types A, B, C, D may be subdivided into an agglutinative, a fusional, and a symbolic sub-type, according to the prevailing method of modification of the radical element. In type A we distinguish in addition an isolating sub-type, characterized by the absence of all affixes and modifications of the radical element. In the isolating languages the syntactic relations are expressed by the position of the words in the sentence. This is also true of many languages of type B, the terms “agglutinative,” “fusional,” and “symbolic” applying in their case merely to the treatment of the derivational, not the relational, concepts. Such languages could be termed “agglutinative-isolating,” “fusional-isolating” and “symbolic-isolating.”

This brings up the important general consideration that the method of handling one group of concepts need not in the least be identical with that used for another. Compound terms could be used to indicate this difference, if desired, the first element of the compound referring to the treatment of the concepts of group II, the second to that of the concepts of groups III and IV. An “agglutinative” language would normally be taken to mean one that agglutinates all of its affixed elements or that does so to a preponderating extent. In an “agglutinative-fusional” language the derivational elements are agglutinated, perhaps in the form of prefixes, while the relational elements (pure or mixed) are fused with the radical element, possibly as another set of prefixes following the first set or in the form of suffixes or as part prefixes and part suffixes. By a “fusional-agglutinative” language we would understand one that fuses its derivational elements but allows a greater independence to those that indicate relations. All these and similar distinctions are not merely theoretical possibilities, they can be abundantly illustrated from the descriptive facts of linguistic morphology. Further, should it prove desirable to insist on the degree of elaboration of the word, the terms “analytic,” “synthetic,” and “polysynthetic” can be added as descriptive terms. It goes without saying that languages of type A are necessarily analytic and that languages of type C also are prevailingly analytic and are not likely to develop beyond the synthetic stage.

But we must not make too much of terminology. Much depends on the relative emphasis laid on this or that feature or point of view. The method of classifying languages here developed has this great advantage, that it can be refined or simplified according to the needs of a particular discussion. The degree of synthesis may be entirely ignored; “fusion” and “symbolism” may often be combined with advantage under the head of “fusion”; even the difference between agglutination and fusion may, if desired, be set aside as either too difficult to draw or as irrelevant to the issue. Languages, after all, are exceedingly complex historical structures. It is of less importance to put each language in a neat pigeon-hole than to have evolved a flexible method which enables us to place it, from two or three independent standpoints, relatively to another language. All this is not to deny that certain linguistic types are more stable and frequently represented than others that are just as possible from a theoretical standpoint. But we are too ill-informed as yet of the structural spirit of great numbers of languages to have the right to frame a classification that is other than flexible and experimental.

The reader will gain a somewhat livelier idea of the possibilities of linguistic morphology by glancing down the subjoined analytical table of selected types. The columns II, III, IV refer to the groups of concepts so numbered in the preceding chapter. The letters _a_, _b_, _c_, _d_ refer respectively to the processes of isolation (position in the sentence), agglutination, fusion, and symbolism. Where more than one technique is employed, they are put in the order of their importance.[113]

[Footnote 113: In defining the type to which a language belongs one must be careful not to be misled by structural features which are mere survivals of an older stage, which have no productive life and do not enter into the unconscious patterning of the language. All languages are littered with such petrified bodies. The English _-ster_ of _spinster_ and _Webster_ is an old agentive suffix, but, as far as the feeling of the present English-speaking generation is concerned, it cannot be said to really exist at all; _spinster_ and _Webster_ have been completely disconnected from the etymological group of _spin_ and of _weave (web)_. Similarly, there are hosts of related words in Chinese which differ in the initial consonant, the vowel, the tone, or in the presence or absence of a final consonant. Even where the Chinaman feels the etymological relationship, as in certain cases he can hardly help doing, he can assign no particular function to the phonetic variation as such. Hence it forms no live feature of the language-mechanism and must be ignored in defining the general form of the language. The caution is all the more necessary, as it is precisely the foreigner, who approaches a new language with a certain prying inquisitiveness, that is most apt to see life in vestigial features which the native is either completely unaware of or feels merely as dead form.]

Note.–Parentheses indicate a weak development of the process in question.

+—————-+—+—-+—+————–+———-+————–+ |Fundamental Type”II |III |IV |Technique “Synthesis “Examples | +—————-+—+—-+—+————–+———-+————–+ | A ” | | | ” ” | |(Simple Pure- “– |– |a |Isolating “Analytic “Chinese; | | relational) ” | | | ” “Annamite | | ” | | | ” ” | | “(d)|– |a,b|Isolating “Analytic “Ewe | | ” | | |(weakly ” “(Guinea Coast)| | ” | | |agglutinative)” ” | | ” | | | ” ” | | “(b)|– |a, |Agglutinative “Analytic “Modern Tibetan| | ” | |b,c|(mildly ” ” | | ” | | |agglutinative-” ” | | ” | | |fusional) ” ” | | ” | | | ” ” | | B ” | | | ” ” | |(Complex Pure- “b, |– |a |Agglutinative-“Analytic “Polynesian | | relational) “(d)| | |isolating ” ” | | ” | | | ” ” | | “b |– |a, |Agglutinative-“Polysyn- “Haida | | ” | |(b)|isolating “thetic ” | | ” | | | ” ” | | “c |– |a |Fusional- “Analytic “Cambodgian | | ” | | |isolating ” ” | | ” | | | ” ” | | “b |– |b |Agglutinative “Synthetic “Turkish | | ” | | | ” ” | | “b,d|(b) |b |Agglutinative “Polysyn- “Yana (N. | | ” | | |(symbolic “thetic “California) | | ” | | |tinge) ” ” | | ” | | | ” ” | | “c, |– |a,b|Fusional- “Synthetic “Classical | | “d, | | |agglutinative “(mildly) “Tibetan | | “(b)| | |(symbolic ” ” | | ” | | |tinge) ” ” | | ” | | | ” ” | | “b |– |c |Agglutinative-“Synthetic “Sioux | | ” | | |fusional “(mildly ” | | ” | | | “polysyn- ” | | ” | | | “thetic) ” | | ” | | | ” ” | | “c |– |c |Fusional “Synthetic “Salinan (S.W. | | ” | | | ” “California) | | ” | | | ” ” | | “d,c|(d) |d, |Symbolic “Analytic “Shilluk | | ” | |c,a| ” “(Upper Nile) | | ” | | | ” ” | | C ” | | | ” ” | |(Simple Mixed- “(b)|b |– |Agglutinative “Synthetic “Bantu | | relational) ” | | | ” ” | | “(c)|c, |a |Fusional “Analytic “French[114] | | ” |(d) | | “(mildly ” | | ” | | | “synthetic)” | | ” | | | ” ” | | D ” | | | ” ” | |(Complex Mixed- “b, |b |b |Agglutinative “Polysyn- “Nootka | | relational) “c,d| | | “thetic “(Vancouver | | ” | | | “(symbolic “Island)[115] | | ” | | | “tinge) ” | | ” | | | ” ” | | “c, |b |– |Fusional- “Polysyn- “Chinook (lower| | “(d)| | |agglutinative “thetic “Columbia R.) | | ” | | | “(mildly) ” | | ” | | | ” ” | | “c, |c, |– |Fusional “Polysyn- “Algonkin | | “(d)|(d),| | “thetic ” | | ” |(b) | | ” ” | | ” | | | ” ” | | “c |c,d |a |Fusional “Analytic “English | | ” | | | ” ” | | “c,d|c,d |– |Fusional “Synthetic “Latin, Greek, | | ” | | |(symbolic ” “Sanskrit | | ” | | |tinge) ” ” | | ” | | | ” ” | | “c, |c,d |(a)|Fusional “Synthetic “Takelma | | “b,d| | |(strongly ” “(S.W. Oregon) | | ” | | |symbolic) ” ” | | ” | | | ” ” | | “d,c|c,d |(a)|Symbolic- “Synthetic “Semitic | | ” | | |fusional ” “(Arabic, | | ” | | | ” “Hebrew) | +—————-+—+—-+—+————–+———-+————–+

[Footnote 114: Might nearly as well have come under D.]

[Footnote 115: Very nearly complex pure-relational.]

I need hardly point out that these examples are far from exhausting the possibilities of linguistic structure. Nor that the fact that two languages are similarly classified does not necessarily mean that they present a great similarity on the surface. We are here concerned with the most fundamental and generalized features of the spirit, the technique, and the degree of elaboration of a given language. Nevertheless, in numerous instances we may observe this highly suggestive and remarkable fact, that languages that fall into the same class have a way of paralleling each other in many details or in structural features not envisaged by the scheme of classification. Thus, a most interesting parallel could be drawn on structural lines between Takelma and Greek,[116] languages that are as geographically remote from each other and as unconnected in a historical sense as two languages selected at random can well be. Their similarity goes beyond the generalized facts registered in the table. It would almost seem that linguistic features that are easily thinkable apart from each other, that seem to have no necessary connection in theory, have nevertheless a tendency to cluster or to follow together in the wake of some deep, controlling impulse to form that dominates their drift. If, therefore, we can only be sure of the intuitive similarity of two given languages, of their possession of the same submerged form-feeling, we need not be too much surprised to find that they seek and avoid certain linguistic developments in common. We are at present very far from able to define just what these fundamental form intuitions are. We can only feel them rather vaguely at best and must content ourselves for the most part with noting their symptoms. These symptoms are being garnered in our descriptive and historical grammars of diverse languages. Some day, it may be, we shall be able to read from them the great underlying ground-plans.

[Footnote 116: Not Greek specifically, of course, but as a typical representative of Indo-European.]

Such a purely technical classification of languages as the current one into “isolating,” “agglutinative,” and “inflective” (read “fusional”) cannot claim to have great value as an entering wedge into the discovery of the intuitional forms of language. I do not know whether the suggested classification into four conceptual groups is likely to drive deeper or not. My own feeling is that it does, but classifications, neat constructions of the speculative mind, are slippery things. They have to be tested at every possible opportunity before they have the right to cry for acceptance. Meanwhile we may take some encouragement from the application of a rather curious, yet simple, historical test. Languages are in constant process of change, but it is only reasonable to suppose that they tend to preserve longest what is most fundamental in their structure. Now if we take great groups of genetically related languages,[117] we find that as we pass from one to another or trace the course of their development we frequently encounter a gradual change of morphological type. This is not surprising, for there is no reason why a language should remain permanently true to its original form. It is interesting, however, to note that of the three intercrossing classifications represented in our table (conceptual type, technique, and degree of synthesis), it is the degree of synthesis that seems to change most readily, that the technique is modifiable but far less readily so, and that the conceptual type tends to persist the longest of all.

[Footnote 117: Such, in other words, as can be shown by documentary or comparative evidence to have been derived from a common source. See Chapter VII.]

The illustrative material gathered in the table is far too scanty to serve as a real basis of proof, but it is highly suggestive as far as it goes. The only changes of conceptual type within groups of related languages that are to be gleaned from the table are of B to A (Shilluk as contrasted with Ewe;[118] Classical Tibetan as contrasted with Modern Tibetan and Chinese) and of D to C (French as contrasted with Latin[119]). But types A : B and C : D are respectively related to each other as a simple and a complex form of a still more fundamental type (pure-relational, mixed-relational). Of a passage from a pure-relational to a mixed-relational type or _vice versa_ I can give no convincing examples.

[Footnote 118: These are far-eastern and far-western representatives of the “Soudan” group recently proposed by D. Westermann. The genetic relationship between Ewe and Shilluk is exceedingly remote at best.]

[Footnote 119: This case is doubtful at that. I have put French in C rather than in D with considerable misgivings. Everything depends on how one evaluates elements like _-al_ in _national_, _-te_ in _bonte_, or _re-_ in _retourner_. They are common enough, but are they as alive, as little petrified or bookish, as our English _-ness_ and _-ful_ and _un-_?]

The table shows clearly enough how little relative permanence there is in the technical features of language. That highly synthetic languages (Latin; Sanskrit) have frequently broken down into analytic forms (French; Bengali) or that agglutinative languages (Finnish) have in many instances gradually taken on “inflective” features are well-known facts, but the natural inference does not seem to have been often drawn that possibly the contrast between synthetic and analytic or agglutinative and “inflective” (fusional) is not so fundamental after all. Turning to the Indo-Chinese languages, we find that Chinese is as near to being a perfectly isolating language as any example we are likely to find, while Classical Tibetan has not only fusional but strong symbolic features (e.g., _g-tong-ba_ “to give,” past _b-tang_, future _gtang_, imperative _thong_); but both are pure-relational languages. Ewe is either isolating or only barely agglutinative, while Shilluk, though soberly analytic, is one of the most definitely symbolic languages I know; both of these Soudanese languages are pure-relational. The relationship between Polynesian and Cambodgian is remote, though practically certain; while the latter has more markedly fusional features than the former,[120] both conform to the complex pure-relational type. Yana and Salinan are superficially very dissimilar languages. Yana is highly polysynthetic and quite typically agglutinative, Salinan is no more synthetic than and as irregularly and compactly fusional (“inflective”) as Latin; both are pure-relational, Chinook and Takelma, remotely related languages of Oregon, have diverged very far from each other, not only as regards technique and synthesis in general but in almost all the details of their structure; both are complex mixed-relational languages, though in very different ways. Facts such as these seem to lend color to the suspicion that in the contrast of pure-relational and mixed-relational (or concrete-relational) we are confronted by something deeper, more far-reaching, than the contrast of isolating, agglutinative, and fusional.[121]

[Footnote 120: In spite of its more isolating cast.]

[Footnote 121: In a book of this sort it is naturally impossible to give an adequate idea of linguistic structure in its varying forms. Only a few schematic indications are possible. A separate volume would be needed to breathe life into the scheme. Such a volume would point out the salient structural characteristics of a number of languages, so selected as to give the reader an insight into the formal economy of strikingly divergent types.]

VII

LANGUAGE AS A HISTORICAL PRODUCT: DRIFT

Every one knows that language is variable. Two individuals of the same generation and locality, speaking precisely the same dialect and moving in the same social circles, are never absolutely at one in their speech habits. A minute investigation of the speech of each individual would reveal countless differences of detail–in choice of words, in sentence structure, in the relative frequency with which particular forms or combinations of words are used, in the pronunciation of particular vowels and consonants and of combinations of vowels and consonants, in all those features, such as speed, stress, and tone, that give life to spoken language. In a sense they speak slightly divergent dialects of the same language rather than identically the same language.

There is an important difference, however, between individual and dialectic variations. If we take two closely related dialects, say English as spoken by the “middle classes” of London and English as spoken by the average New Yorker, we observe that, however much the individual speakers in each city differ from each other, the body of Londoners forms a compact, relatively unified group in contrast to the body of New Yorkers. The individual variations are swamped in or absorbed by certain major agreements–say of pronunciation and vocabulary–which stand out very strongly when the language of the group as a whole is contrasted with that of the other group. This means that there is something like an ideal linguistic entity dominating the speech habits of the members of each group, that the sense of almost unlimited freedom which each individual feels in the use of his language is held in leash by a tacitly directing norm. One individual plays on the norm in a way peculiar to himself, the next individual is nearer the dead average in that particular respect in which the first speaker most characteristically departs from it but in turn diverges from the average in a way peculiar to himself, and so on. What keeps the individual’s variations from rising to dialectic importance is not merely the fact that they are in any event of small moment–there are well-marked dialectic variations that are of no greater magnitude than individual variations within a dialect–it is chiefly that they are silently “corrected” or canceled by the consensus of usage. If all the speakers of a given dialect were arranged in order in accordance with the degree of their conformity to average usage, there is little doubt that they would constitute a very finely intergrading series clustered about a well-defined center or norm. The differences between any two neighboring speakers of the series[122] would be negligible for any but the most microscopic linguistic research. The differences between the outer-most members of the series are sure to be considerable, in all likelihood considerable enough to measure up to a true dialectic variation. What prevents us from saying that these untypical individuals speak distinct dialects is that their peculiarities, as a unified whole, are not referable to another norm than the norm of their own series.

[Footnote 122: In so far as they do not fall out of the normal speech group by reason of a marked speech defect or because they are isolated foreigners that have acquired the language late in life.]

If the speech of any member of the series could actually be made to fit into another dialect series,[123] we should have no true barriers between dialects (and languages) at all. We should merely have a continuous series of individual variations extending over the whole range of a historically unified linguistic area, and the cutting up of this large area (in some cases embracing parts of several continents) into distinct dialects and languages would be an essentially arbitrary proceeding with no warrant save that of practical convenience. But such a conception of the nature of dialectic variation does not correspond to the facts as we know them. Isolated individuals may be found who speak a compromise between two dialects of a language, and if their number and importance increases they may even end by creating a new dialectic norm of their own, a dialect in which the extreme peculiarities of the parent dialects are ironed out. In course of time the compromise dialect may absorb the parents, though more frequently these will tend to linger indefinitely as marginal forms of the enlarged dialect area. But such phenomena–and they are common enough in the history of language–are evidently quite secondary. They are closely linked with such social developments as the rise of nationality, the formation of literatures that aim to have more than a local appeal, the movement of rural populations into the cities, and all those other tendencies that break up the intense localism that unsophisticated man has always found natural.

[Footnote 123: Observe that we are speaking of an individual’s speech as a whole. It is not a question of isolating some particular peculiarity of pronunciation or usage and noting its resemblance to or identity with a feature in another dialect.]

The explanation of primary dialectic differences is still to seek. It is evidently not enough to say that if a dialect or language is spoken in two distinct localities or by two distinct social strata it naturally takes on distinctive forms, which in time come to be divergent enough to deserve the name of dialects. This is certainly true as far as it goes. Dialects do belong, in the first instance, to very definitely circumscribed social groups, homogeneous enough to secure the common feeling and purpose needed to create a norm. But the embarrassing question immediately arises, If all the individual variations within a dialect are being constantly leveled out to the dialectic norm, if there is no appreciable tendency for the individual’s peculiarities to initiate a dialectic schism, why should we have dialectic variations at all? Ought not the norm, wherever and whenever threatened, automatically to reassert itself? Ought not the individual variations of each locality, even in the absence of intercourse between them, to cancel out to the same accepted speech average?

If individual variations “on a flat” were the only kind of variability in language, I believe we should be at a loss to explain why and how dialects arise, why it is that a linguistic prototype gradually breaks up into a number of mutually unintelligible languages. But language is not merely something that is spread out in space, as it were–a series of reflections in individual minds of one and the same timeless picture. Language moves down time in a current of its own making. It has a drift. If there were no breaking up of a language into dialects, if each language continued as a firm, self-contained unity, it would still be constantly moving away from any assignable norm, developing new features unceasingly and gradually transforming itself into a language so different from its starting point as to be in effect a new language. Now dialects arise not because of the mere fact of individual variation but because two or more groups of individuals have become sufficiently disconnected to drift apart, or independently, instead of together. So long as they keep strictly together, no amount of individual variation would lead to the formation of dialects. In practice, of course, no language can be spread over a vast territory or even over a considerable area without showing dialectic variations, for it is impossible to keep a large population from segregating itself into local groups, the language of each of which tends to drift independently. Under cultural conditions such as apparently prevail to-day, conditions that fight localism at every turn, the tendency to dialectic cleavage is being constantly counteracted and in part “corrected” by the uniformizing factors already referred to. Yet even in so young a country as America the dialectic differences are not inconsiderable.

Under primitive conditions the political groups are small, the tendency to localism exceedingly strong. It is natural, therefore, that the languages of primitive folk or of non-urban populations in general are differentiated into a great number of dialects. There are parts of the globe where almost every village has its own dialect. The life of the geographically limited community is narrow and intense; its speech is correspondingly peculiar to itself. It is exceedingly doubtful if a language will ever be spoken over a wide area without multiplying itself dialectically. No sooner are the old dialects ironed out by compromises or ousted by the spread and influence of the one dialect which is culturally predominant when a new crop of dialects arises to undo the leveling work of the past. This is precisely what happened in Greece, for instance. In classical antiquity there were spoken a large number of local dialects, several of which are represented in the literature. As the cultural supremacy of Athens grew, its dialect, the Attic, spread at the expense of the rest, until, in the so-called Hellenistic period following the Macedonian conquest, the Attic dialect, in the vulgarized form known as the “Koine,” became the standard speech of all Greece. But this linguistic uniformity[124] did not long continue. During the two millennia that separate the Greek of to-day from its classical prototype the Koine gradually split up into a number of dialects. Now Greece is as richly diversified in speech as in the time of Homer, though the present local dialects, aside from those of Attica itself, are not the lineal descendants of the old dialects of pre-Alexandrian days.[125] The experience of Greece is not exceptional. Old dialects are being continually wiped out only to make room for new ones. Languages can change at so many points of phonetics, morphology, and vocabulary that it is not surprising that once the linguistic community is broken it should slip off in different directions. It would be too much to expect a locally diversified language to develop along strictly parallel lines. If once the speech of a locality has begun to drift on its own account, it is practically certain to move further and further away from its linguistic fellows. Failing the retarding effect of dialectic interinfluences, which I have already touched upon, a group of dialects is bound to diverge on the whole, each from all of the others.

[Footnote 124: It is doubtful if we have the right to speak of linguistic uniformity even during the predominance of the Koine. It is hardly conceivable that when the various groups of non-Attic Greeks took on the Koine they did not at once tinge it with dialectic peculiarities induced by their previous speech habits.]

[Footnote 125: The Zaconic dialect of Lacedaemon is the sole exception. It is not derived from the Koine, but stems directly from the Doric dialect of Sparta.]

In course of time each dialect itself splits up into sub-dialects, which gradually take on the dignity of dialects proper while the primary dialects develop into mutually unintelligible languages. And so the budding process continues, until the divergences become so great that none but a linguistic student, armed with his documentary evidence and with his comparative or reconstructive method, would infer that the languages in question were genealogically related, represented independent lines of development, in other words, from a remote and common starting point. Yet it is as certain as any historical fact can be that languages so little resembling each other as Modern Irish, English, Italian, Greek, Russian, Armenian, Persian, and Bengali are but end-points in the present of drifts that converge to a meeting-point in the dim past. There is naturally no reason to believe that this earliest “Indo-European” (or “Aryan”) prototype which we can in part reconstruct, in part but dimly guess at, is itself other than a single “dialect” of a group that has either become largely extinct or is now further represented by languages too divergent for us, with our limited means, to recognize as clear kin.[126]

[Footnote 126: Though indications are not lacking of what these remoter kin of the Indo-European languages may be. This is disputed ground, however, and hardly fit subject for a purely general study of speech.]

All languages that are known to be genetically related, i.e., to be divergent forms of a single prototype, may be considered as constituting a “linguistic stock.” There is nothing final about a linguistic stock. When we set it up, we merely say, in effect, that thus far we can go and no farther. At any point in the progress of our researches an unexpected ray of light may reveal the “stock” as but a “dialect” of a larger group. The terms dialect, language, branch, stock–it goes without saying–are purely relative terms. They are convertible as our perspective widens or contracts.[127] It would be vain to speculate as to whether or not we shall ever be able to demonstrate that all languages stem from a common source. Of late years linguists have been able to make larger historical syntheses than were at one time deemed feasible, just as students of culture have been able to show historical connections between culture areas or institutions that were at one time believed to be totally isolated from each other. The human world is contracting not only prospectively but to the backward-probing eye of culture-history. Nevertheless we are as yet far from able to reduce the riot of spoken languages to a small number of “stocks.” We must still operate with a quite considerable number of these stocks. Some of them, like Indo-European or Indo-Chinese, are spoken over tremendous reaches; others, like Basque,[128] have a curiously restricted range and are in all likelihood but dwindling remnants of groups that were at one time more widely distributed. As for the single or multiple origin of speech, it is likely enough that language as a human institution (or, if one prefers, as a human “faculty”) developed but once in the history of the race, that all the complex history of language is a unique cultural event. Such a theory constructed “on general principles” is of no real interest, however, to linguistic science. What lies beyond the demonstrable must be left to the philosopher or the romancer.

[Footnote 127: “Dialect” in contrast to an accepted literary norm is a use of the term that we are not considering.]

[Footnote 128: Spoken in France and Spain in the region of the Pyrenees.]

We must return to the conception of “drift” in language. If the historical changes that take place in a language, if the vast accumulation of minute modifications which in time results in the complete remodeling of the language, are not in essence identical with the individual variations that we note on every hand about us, if these variations are born only to die without a trace, while the equally minute, or even minuter, changes that make up the drift are forever imprinted on the history of the language, are we not imputing to this history a certain mystical quality? Are we not giving language a power to change of its own accord over and above the involuntary tendency of individuals to vary the norm? And if this drift of language is not merely the familiar set of individual variations seen in vertical perspective, that is historically, instead of horizontally, that is in daily experience, what is it? Language exists only in so far as it is actually used–spoken and heard, written and read. What significant changes take place in it must exist, to begin with, as individual variations. This is perfectly true, and yet it by no means follows that the general drift of language can be understood[129] from an exhaustive descriptive study of these variations alone. They themselves are random phenomena,[130] like the waves of the sea, moving backward and forward in purposeless flux. The linguistic drift has direction. In other words, only those individual variations embody it or carry it which move in a certain direction, just as only certain wave movements in the bay outline the tide. The drift of a language is constituted by the unconscious selection on the part of its speakers of those individual variations that are cumulative in some special direction. This direction may be inferred, in the main, from the past history of the language. In the long run any new feature of the drift becomes part and parcel of the common, accepted speech, but for a long time it may exist as a mere tendency in the speech of a few, perhaps of a despised few. As we look about us and observe current usage, it is not likely to occur to us that our language has a “slope,” that the changes of the next few centuries are in a sense prefigured in certain obscure tendencies of the present and that these changes, when consummated, will be seen to be but continuations of changes that have been already effected. We feel rather that our language is practically a fixed system and that what slight changes are destined to take place in it are as likely to move in one direction as another. The feeling is fallacious. Our very uncertainty as to the impending details of change makes the eventual consistency of their direction all the more impressive.

[Footnote 129: Or rather apprehended, for we do not, in sober fact, entirely understand it as yet.]

[Footnote 130: Not ultimately random, of course, only relatively so.]

Sometimes we can feel where the drift is taking us even while we struggle against it. Probably the majority of those who read these words feel that it is quite “incorrect” to say “Who did you see?” We readers of many books are still very careful to say “Whom did you see?” but we feel a little uncomfortable (uncomfortably proud, it may be) in the process. We are likely to avoid the locution altogether and to say “Who was it you saw?” conserving literary tradition (the “whom”) with the dignity of silence.[131] The folk makes no apology. “Whom did you see?” might do for an epitaph, but “Who did you see?” is the natural form for an eager inquiry. It is of course the uncontrolled speech of the folk to which we must look for advance information as to the general linguistic movement. It is safe to prophesy that within a couple of hundred years from to-day not even the most learned jurist will be saying “Whom did you see?” By that time the “whom” will be as delightfully archaic as the Elizabethan “his” for “its.”[132] No logical or historical argument will avail to save this hapless “whom.” The demonstration “I: me = he: him = who: whom” will be convincing in theory and will go unheeded in practice.

[Footnote 131: In relative clauses too we tend to avoid the objective form of “who.” Instead of “The man whom I saw” we are likely to say “The man that I saw” or “The man I saw.”]

[Footnote 132: “Its” was at one time as impertinent a departure as the “who” of “Who did you see?” It forced itself into English because the old cleavage between masculine, feminine, and neuter was being slowly and powerfully supplemented by a new one between thing-class and animate-class. The latter classification proved too vital to allow usage to couple males and things (“his”) as against females (“her”). The form “its” had to be created on the analogy of words like “man’s,” to satisfy the growing form feeling. The drift was strong enough to sanction a grammatical blunder.]

Even now we may go so far as to say that the majority of us are secretly wishing they could say “Who did you see?” It would be a weight off their unconscious minds if some divine authority, overruling the lifted finger of the pedagogue, gave them _carte blanche_. But we cannot too frankly anticipate the drift and maintain caste. We must affect ignorance of whither we are going and rest content with our mental conflict–uncomfortable conscious acceptance of the “whom,” unconscious desire for the “who.”[133] Meanwhile we indulge our sneaking desire for the forbidden locution by the use of the “who” in certain twilight cases in which we can cover up our fault by a bit of unconscious special pleading. Imagine that some one drops the remark when you are not listening attentively, “John Smith is coming to-night.” You have not caught the name and ask, not “Whom did you say?” but “Who did you say?” There is likely to be a little hesitation in the choice of the form, but the precedent of usages like “Whom did you see?” will probably not seem quite strong enough to induce a “Whom did you say?” Not quite relevant enough, the grammarian may remark, for a sentence like “Who did you say?” is not strictly analogous to “Whom did you see?” or “Whom did you mean?” It is rather an abbreviated form of some such sentence as “Who, did you say, is coming to-night?” This is the special pleading that I have referred to, and it has a certain logic on its side. Yet the case is more hollow than the grammarian thinks it to be, for in reply to such a query as “You’re a good hand at bridge, John, aren’t you?” John, a little taken aback, might mutter “Did you say me?” hardly “Did you say I?” Yet the logic for the latter (“Did you say I was a good hand at bridge?”) is evident. The real point is that there is not enough vitality in the “whom” to carry it over such little difficulties as a “me” can compass without a thought. The proportion “I : me = he : him = who : whom” is logically and historically sound, but psychologically shaky. “Whom did you see?” is correct, but there is something false about its correctness.

[Footnote 133: Psychoanalysts will recognize the mechanism. The mechanisms of “repression of impulse” and of its symptomatic symbolization can be illustrated in the most unexpected corners of individual and group psychology. A more general psychology than Freud’s will eventually prove them to be as applicable to the groping for abstract form, the logical or esthetic ordering of experience, as to the life of the fundamental instincts.]

It is worth looking into the reason for our curious reluctance to use locutions involving the word “whom” particularly in its interrogative sense. The only distinctively objective forms which we still possess in English are _me_, _him_, _her_ (a little blurred because of its identity with the possessive _her_), _us_, _them_, and _whom_. In all other cases the objective has come to be identical with the subjective–that is, in outer form, for we are not now taking account of position in the sentence. We observe immediately in looking through the list of objective forms that _whom_ is psychologically isolated. _Me_, _him_, _her_, _us_, and _them_ form a solid, well-integrated group of objective personal pronouns parallel to the subjective series _I_, _he_, _she_, _we_, _they_. The forms _who_ and _whom_ are technically “pronouns” but they are not felt to be in the same box as the personal pronouns. _Whom_ has clearly a weak position, an exposed flank, for words of a feather tend to flock together, and if one strays behind, it is likely to incur danger of life. Now the other interrogative and relative pronouns (_which_, _what_, _that_), with which _whom_ should properly flock, do not distinguish the subjective and objective forms. It is psychologically unsound to draw the line of form cleavage between _whom_ and the personal pronouns on the one side, the remaining interrogative and relative pronouns on the other. The form groups should be symmetrically related to, if not identical with, the function groups. Had _which_, _what_, and _that_ objective forms parallel to _whom_, the position of this last would be more secure. As it is, there is something unesthetic about the word. It suggests a form pattern which is not filled out by its fellows. The only way to remedy the irregularity of form distribution is to abandon the _whom_ altogether for we have lost the power to create new objective forms and cannot remodel our _which_-_what_-_that_ group so as to make it parallel with the smaller group _who-whom_. Once this is done, _who_ joins its flock and our unconscious desire for form symmetry is satisfied. We do not secretly chafe at “Whom did you see?” without reason.[134]

[Footnote 134: Note that it is different with _whose_. This has not the support of analogous possessive forms in its own functional group, but the analogical power of the great body of possessives of nouns (_man’s_, _boy’s_) as well as of certain personal pronouns (_his_, _its_; as predicated possessive also _hers_, _yours_, _theirs_) is sufficient to give it vitality.]

But the drift away from _whom_ has still other determinants. The words _who_ and _whom_ in their interrogative sense are psychologically related not merely to the pronouns _which_ and _what_, but to a group of interrogative adverbs–_where_, _when_, _how_–all of which are invariable and generally emphatic. I believe it is safe to infer that there is a rather strong feeling in English that the interrogative pronoun or adverb, typically an emphatic element in the sentence, should be invariable. The inflective _-m_ of _whom_ is felt as a drag upon the rhetorical effectiveness of the word. It needs to be eliminated if the interrogative pronoun is to receive all its latent power. There is still a third, and a very powerful, reason for the avoidance of _whom_. The contrast between the subjective and objective series of personal pronouns (_I_, _he_, _she_, _we_, _they_: _me_, _him_, _her_, _us_, _them_) is in English associated with a difference of position. We say _I see the man_ but _the man sees me_; _he told him_, never _him he told_ or _him told he_. Such usages as the last two are distinctly poetic and archaic; they are opposed to the present drift of the language. Even in the interrogative one does not say _Him did you see?_ It is only in sentences of the type _Whom did you see?_ that an inflected objective before the verb is now used at all. On the other hand, the order in _Whom did you see?_ is imperative because of its interrogative form; the interrogative pronoun or adverb normally comes first in the sentence (_What are you doing?_ _When did he go?_ _Where are you from?_). In the “whom” of _Whom did you see?_ there is concealed, therefore, a conflict between the order proper to a sentence containing an inflected objective and the order natural to a sentence with an interrogative pronoun or adverb. The solution _Did you see whom?_ or _You saw whom?_[135] is too contrary to the idiomatic drift of our language to receive acceptance. The more radical solution _Who did you see?_ is the one the language is gradually making for.

[Footnote 135: Aside from certain idiomatic usages, as when _You saw whom?_ is equivalent to _You saw so and so and that so and so is who?_ In such sentences _whom_ is pronounced high and lingeringly to emphasize the fact that the person just referred to by the listener is not known or recognized.]

These three conflicts–on the score of form grouping, of rhetorical emphasis, and of order–are supplemented by a fourth difficulty. The emphatic _whom_, with its heavy build (half-long vowel followed by labial consonant), should contrast with a lightly tripping syllable immediately following. In _whom did_, however, we have an involuntary retardation that makes the locution sound “clumsy.” This clumsiness is a phonetic verdict, quite apart from the dissatisfaction due to the grammatical factors which we have analyzed. The same prosodic objection does not apply to such parallel locutions as _what did_ and _when did_. The vowels of _what_ and _when_ are shorter and their final consonants melt easily into the following _d_, which is pronounced in the same tongue position as _t_ and _n_. Our instinct for appropriate rhythms makes it as difficult for us to feel content with _whom did_ as for a poet to use words like _dreamed_ and _hummed_ in a rapid line. Neither common feeling nor the poet’s choice need be at all conscious. It may be that not all are equally sensitive to the rhythmic flow of speech, but it is probable that rhythm is an unconscious linguistic determinant even with those who set little store by its artistic use. In any event the poet’s rhythms can only be a more sensitive and stylicized application of rhythmic tendencies that are characteristic of the daily speech of his people.

We have discovered no less than four factors which enter into our subtle disinclination to say “Whom did you see?” The uneducated folk that says “Who did you see?” with no twinge of conscience has a more acute flair for the genuine drift of the language than its students. Naturally the four restraining factors do not operate independently. Their separate energies, if we may make bold to use a mechanical concept, are “canalized” into a single force. This force or minute embodiment of the general drift of the language is psychologically registered as a slight hesitation in using the word _whom_. The hesitation is likely to be quite unconscious, though it may be readily acknowledged when attention is called to it. The analysis is certain to be unconscious, or rather unknown, to the normal speaker.[136] How, then, can we be certain in such an analysis as we have undertaken that all of the assigned determinants are really operative and not merely some one of them? Certainly they are not equally powerful in all cases. Their values are variable, rising and falling according to the individual and the locution.[137] But that they really exist, each in its own right, may sometimes be tested by the method of elimination. If one or other of the factors is missing and we observe a slight diminution in the corresponding psychological reaction (“hesitation” in our case), we may conclude that the factor is in other uses genuinely positive. The second of our four factors applies only to the interrogative use of _whom_, the fourth factor applies with more force to the interrogative than to the relative. We can therefore understand why a sentence like _Is he the man whom you referred to?_ though not as idiomatic as _Is he the man (that) you referred to?_ (remember that it sins against counts one and three), is still not as difficult to reconcile with our innate feeling for English expression as _Whom did you see?_ If we eliminate the fourth factor from the interrogative usage,[138] say in _Whom are you looking at?_ where the vowel following _whom_ relieves this word of its phonetic weight, we can observe, if I am not mistaken, a lesser reluctance to use the _whom_. _Who are you looking at?_ might even sound slightly offensive to ears that welcome _Who did you see?_

[Footnote 136: Students of language cannot be entirely normal in their attitude towards their own speech. Perhaps it would be better to say “naive” than “normal.”]

[Footnote 137: It is probably this _variability of value_ in the significant compounds of a general linguistic drift that is responsible for the rise of dialectic variations. Each dialect continues the general drift of the common parent, but has not been able to hold fast to constant values for each component of the drift. Deviations as to the drift itself, at first slight, later cumulative, are therefore unavoidable.]

[Footnote 138: Most sentences beginning with interrogative _whom_ are likely to be followed by _did_ or _does_, _do_. Yet not all.]

We may set up a scale of “hesitation values” somewhat after this fashion:

Value 1: factors 1, 3. “The man whom I referred to.” Value 2: factors 1, 3, 4. “The man whom they referred to.” Value 3: factors 1, 2, 3. “Whom are you looking at?” Value 4: factors 1, 2, 3, 4. “Whom did you see?”

We may venture to surmise that while _whom_ will ultimately disappear from English speech, locutions of the type _Whom did you see?_ will be obsolete when phrases like _The man whom I referred to_ are still in lingering use. It is impossible to be certain, however, for we can never tell if we have isolated all the determinants of a drift. In our particular case we have ignored what may well prove to be a controlling factor in the history of _who_ and _whom_ in the relative sense. This is the unconscious desire to leave these words to their interrogative function and to concentrate on _that_ or mere word order as expressions of the relative (e.g., _The man that I referred to_ or _The man I referred to_). This drift, which does not directly concern the use of _whom_ as such (merely of _whom_ as a form of _who_), may have made the relative _who_ obsolete before the other factors affecting relative _whom_ have run their course. A consideration like this is instructive because it indicates that knowledge of the general drift of a language is insufficient to enable us to see clearly what the drift is heading for. We need to know something of the relative potencies and speeds of the components of the drift.

It is hardly necessary to say that the particular drifts involved in the use of _whom_ are of interest to us not for their own sake but as symptoms of larger tendencies at work in the language. At least three drifts of major importance are discernible. Each of these has operated for centuries, each is at work in other parts of our linguistic mechanism, each is almost certain to continue for centuries, possibly millennia. The first is the familiar tendency to level the distinction between the subjective and the objective, itself but a late chapter in the steady reduction of the old Indo-European system of syntactic cases. This system, which is at present best preserved in Lithuanian,[139] was already considerably reduced in the old Germanic language of which English, Dutch, German, Danish, and Swedish are modern dialectic forms. The seven Indo-European cases (nominative genitive, dative, accusative, ablative, locative, instrumental) had been already reduced to four (nominative genitive, dative, accusative). We know this from a careful comparison of and reconstruction based on the oldest Germanic dialects of which we still have records (Gothic, Old Icelandic, Old High German, Anglo-Saxon). In the group of West Germanic dialects, for the study of which Old High German, Anglo-Saxon, Old Frisian, and Old Saxon are our oldest and most valuable sources, we still have these four cases, but the phonetic form of the case syllables is already greatly reduced and in certain paradigms particular cases have coalesced. The case system is practically intact but it is evidently moving towards further disintegration. Within the Anglo-Saxon and early Middle English period there took place further changes in the same direction. The phonetic form of the case syllables became still further reduced and the distinction between the accusative and the dative finally disappeared. The new “objective” is really an amalgam of old accusative and dative forms; thus, _him_, the old dative (we still say _I give him the book_, not “abbreviated” from _I give to him_; compare Gothic _imma_, modern German _ihm_), took over the functions of the old accusative (Anglo-Saxon _hine_; compare Gothic _ina_, Modern German _ihn_) and dative. The distinction between the nominative and accusative was nibbled away by phonetic processes and morphological levelings until only certain pronouns retained distinctive subjective and objective forms.

[Footnote 139: Better, indeed, than in our oldest Latin and Greek records. The old Indo-Iranian languages alone (Sanskrit, Avestan) show an equally or more archaic status of the Indo-European parent tongue as regards case forms.]

In later medieval and in modern times there have been comparatively few apparent changes in our case system apart from the gradual replacement of _thou_–_thee_ (singular) and subjective _ye_–objective _you_ (plural) by a single undifferentiated form _you_. All the while, however, the case system, such as it is (subjective-objective, really absolutive, and possessive in nouns; subjective, objective, and possessive in certain pronouns) has been steadily weakening in psychological respects. At present it is more seriously undermined than most of us realize. The possessive has little vitality except in the pronoun and in animate nouns. Theoretically we can still say _the moon’s phases_ or _a newspaper’s vogue_; practically we limit ourselves pretty much to analytic locutions like _the phases of the moon_ and _the vogue of a newspaper_. The drift is clearly toward the limitation, of possessive forms to animate nouns. All the possessive pronominal forms except _its_ and, in part, _their_ and _theirs_, are also animate. It is significant that _theirs_ is hardly ever used in reference to inanimate nouns, that there is some reluctance to so use _their_, and that _its_ also is beginning to give way to _of it_. _The appearance of it_ or _the looks of it_ is more in the current of the language than _its appearance_. It is curiously significant that _its young_ (referring to an animal’s cubs) is idiomatically preferable to _the young of it_. The form is only ostensibly neuter, in feeling it is animate; psychologically it belongs with _his children_, not with _the pieces of it_. Can it be that so common a word as _its_ is actually beginning to be difficult? Is it too doomed to disappear? It would be rash to say that it shows signs of approaching obsolescence, but that it is steadily weakening is fairly clear.[140] In any event, it is not too much to say that there is a strong drift towards the restriction of the inflected possessive forms to animate nouns and pronouns.

[Footnote 140: Should _its_ eventually drop out, it will have had a curious history. It will have played the role of a stop-gap between _his_ in its non-personal use (see footnote 11, page 167) and the later analytic of _it_.]

[Transcriber’s note: Footnote 140 refers to Footnote 132, beginning on line 5142.]

How is it with the alternation of subjective and objective in the pronoun? Granted that _whom_ is a weak sister, that the two cases have been leveled in _you_ (in _it_, _that_, and _what_ they were never distinct, so far as we can tell[141]), and that _her_ as an objective is a trifle weak because of its formal identity with the possessive _her_, is there any reason to doubt the vitality of such alternations as _I see the man_ and _the man sees me_? Surely the distinction between subjective _I_ and objective _me_, between subjective _he_ and objective _him_, and correspondingly for other personal pronouns, belongs to the very core of the language. We can throw _whom_ to the dogs, somehow make shift to do without an _its_, but to level _I_ and _me_ to a single case–would that not be to un-English our language beyond recognition? There is no drift toward such horrors as _Me see him_ or _I see he_. True, the phonetic disparity between _I_ and _me_, _he_ and _him_, _we_ and _us_, has been too great for any serious possibility of form leveling. It does not follow that the case distinction as such is still vital. One of the most insidious peculiarities of a linguistic drift is that where it cannot destroy what lies in its way it renders it innocuous by washing the old significance out of it. It turns its very enemies to its own uses. This brings us to the second of the major drifts, the tendency to fixed position in the sentence, determined by the syntactic relation of the word.

[Footnote 141: Except in so far as _that_ has absorbed other functions than such as originally belonged to it. It was only a nominative-accusative neuter to begin with.]

We need not go into the history of this all-important drift. It is enough to know that as the inflected forms of English became scantier, as the syntactic relations were more and more inadequately expressed by the forms of the words themselves, position in the sentence gradually took over functions originally foreign to it. _The man_ in _the man sees the dog_ is subjective; in _the dog sees the man_, objective. Strictly parallel to these sentences are _he sees the dog_ and _the dog sees him_. Are the subjective value of _he_ and the objective value of _him_ entirely, or even mainly, dependent on the difference of form? I doubt it. We could hold to such a view if it were possible to say _the dog sees he_ or _him sees the dog_. It was once possible to say such things, but we have lost the power. In other words, at least part of the case feeling in _he_ and _him_ is to be credited to their position before or after the verb. May it not be, then, that _he_ and _him_, _we_ and _us_, are not so much subjective and objective forms as pre-verbal and post-verbal[142] forms, very much as _my_ and _mine_ are now pre-nominal and post-nominal forms of the possessive (_my father_ but _father mine_; _it is my book_ but _the book is mine_)? That this interpretation corresponds to the actual drift of the English language is again indicated by the language of the folk. The folk says _it is me_, not _it is I_, which is “correct” but just as falsely so as the _whom did you see_? that we have analyzed. _I’m the one_, _it’s me_; _we’re the ones_, _it’s us that will win out_–such are the live parallelisms in English to-day. There is little doubt that _it is I_ will one day be as impossible in English as _c’est je_, for _c’est moi_, is now in French.

[Footnote 142: Aside from the interrogative: _am I?_ _is he?_ Emphasis counts for something. There is a strong tendency for the old “objective” forms to bear a stronger stress than the “subjective” forms. This is why the stress in locutions like _He didn’t go, did he?_ and _isn’t he?_ is thrown back on the verb; it is not a matter of logical emphasis.]

How differently our _I_: _me_ feels than in Chaucer’s day is shown by the Chaucerian _it am I_. Here the distinctively subjective aspect of the _I_ was enough to influence the form of the preceding verb in spite of the introductory _it_; Chaucer’s locution clearly felt more like a Latin _sum ego_ than a modern _it is I_ or colloquial _it is me_. We have a curious bit of further evidence to prove that the English personal pronouns have lost some share of their original syntactic force. Were _he_ and _she_ subjective forms pure and simple, were they not striving, so to speak, to become caseless absolutives, like _man_ or any other noun, we should not have been able to coin such compounds as _he-goat_ and _she-goat_, words that are psychologically analogous to _bull-moose_ and _mother-bear_. Again, in inquiring about a new-born baby, we ask _Is it a he or a she?_ quite as though _he_ and _she_ were the equivalents of _male_ and _female_ or _boy_ and _girl_. All in all, we may conclude that our English case system is weaker than it looks and that, in one way or another, it is destined to get itself reduced to an absolutive (caseless) form for all nouns and pronouns but those that are animate. Animate nouns and pronouns are sure to have distinctive possessive forms for an indefinitely long period.

Meanwhile observe that the old alignment of case forms is being invaded by two new categories–a positional category (pre-verbal, post-verbal) and a classificatory category (animate, inanimate). The facts that in the possessive animate nouns and pronouns are destined to be more and more sharply distinguished from inanimate nouns and pronouns (_the man’s_, but _of the house_; _his_, but _of it_) and that, on the whole, it is only animate pronouns that distinguish pre-verbal and post-verbal forms[143] are of the greatest theoretical interest. They show that, however the language strive for a more and more analytic form, it is by no means manifesting a drift toward the expression of “pure” relational concepts in the Indo-Chinese manner.[144] The insistence on the concreteness of the relational concepts is clearly stronger than the destructive power of the most sweeping and persistent drifts that we know of in the history and prehistory of our language.

[Footnote 143: _They_: _them_ as an inanimate group may be looked upon as a kind of borrowing from the animate, to which, in feeling, it more properly belongs.]

[Footnote 144: See page 155.]

[Transcriber’s note: Footnote 144 refers to the paragraph beginning on line 4795.]

The drift toward the abolition of most case distinctions and the correlative drift toward position as an all-important grammatical method are accompanied, in a sense dominated, by the last of the three major drifts that I have referred to. This is the drift toward the invariable word. In analyzing the “whom” sentence I pointed out that the rhetorical emphasis natural to an interrogative pronoun lost something by its form variability (_who_, _whose_, _whom_). This striving for a simple, unnuanced correspondence between idea and word, as invariable as may be, is very strong in English. It accounts for a number of tendencies which at first sight seem unconnected. Certain well-established forms, like the present third person singular _-s_ of _works_ or the plural _-s_ of _books_, have resisted the drift to invariable words, possibly because they symbolize certain stronger form cravings that we do not yet fully understand. It is interesting to note that derivations that get away sufficiently from the concrete notion of the radical word to exist as independent conceptual centers are not affected by this elusive drift. As soon as the derivation runs danger of being felt as a mere nuancing of, a finicky play on, the primary concept, it tends to be absorbed by the radical word, to disappear as such. English words crave spaces between them, they do not like to huddle in clusters of slightly divergent centers of meaning, each edging a little away from the rest. _Goodness_, a noun of quality, almost a noun of relation, that takes its cue from the concrete idea of “good” without necessarily predicating that quality (e.g., _I do not think much of his goodness_) is sufficiently spaced from _good_ itself not to need fear absorption. Similarly, _unable_ can hold its own against _able_ because it destroys the latter’s sphere of influence; _unable_ is psychologically as distinct from _able_ as is _blundering_ or _stupid_. It is different with adverbs in _-ly_. These lean too heavily on their adjectives to have the kind of vitality that English demands of its words. _Do it quickly!_ drags psychologically. The nuance expressed by _quickly_ is too close to that of _quick_, their circles of concreteness are too nearly the same, for the two words to feel comfortable together. The adverbs in _-ly_ are likely to go to the wall in the not too distant future for this very reason and in face of their obvious usefulness. Another instance of the sacrifice of highly useful forms to this impatience of nuancing is the group _whence_, _whither_, _hence_, _hither_, _thence_, _thither_. They could not persist in live usage because they impinged too solidly upon the circles of meaning represented by the words _where_, _here_ and _there_. In saying _whither_ we feel too keenly that we repeat all of _where_. That we add to _where_ an important nuance of direction irritates rather than satisfies. We prefer to merge the static and the directive (_Where do you live?_ like _Where are you going?_) or, if need be, to overdo a little the concept of direction (_Where are you running to?_).

Now it is highly symptomatic of the nature of the drift away from word clusters that we do not object to nuances as such, we object to having the nuances formally earmarked for us. As a matter of fact our vocabulary is rich in near-synonyms and in groups of words that are psychologically near relatives, but these near-synonyms and these groups do not hang together by reason of etymology. We are satisfied with _believe_ and _credible_ just because they keep aloof from each other. _Good_ and _well_ go better together than _quick_ and _quickly_. The English vocabulary is a rich medley because each English word wants its own castle. Has English long been peculiarly receptive to foreign words because it craves the staking out of as many word areas as possible, or, conversely, has the mechanical imposition of a flood of French and Latin loan-words, unrooted in our earlier tradition, so dulled our feeling for the possibilities of our native resources that we are allowing these to shrink by default? I suspect that both propositions are true. Each feeds on the other. I do not think it likely, however, that the borrowings in English have been as mechanical and external a process as they are generally represented to have been. There was something about the English drift as early as the period following the Norman Conquest that welcomed the new words. They were a compensation for something that was weakening within.

VIII

LANGUAGE AS A HISTORICAL PRODUCT: PHONETIC LAW

I have preferred to take up in some detail the analysis of our hesitation in using a locution like “Whom did you see?” and to point to some of the English drifts, particular and general, that are implied by this hesitation than to discuss linguistic change in the abstract. What is true of the particular idiom that we started with is true of everything else in language. Nothing is perfectly static. Every word, every grammatical element, every locution, every sound and accent is a slowly changing configuration, molded by the invisible and impersonal drift that is the life of language. The evidence is overwhelming that this drift has a certain consistent direction. Its speed varies enormously according to circumstances that it is not always easy to define. We have already seen that Lithuanian is to-day nearer its Indo-European prototype than was the hypothetical Germanic mother-tongue five hundred or a thousand years before Christ. German has moved more slowly than English; in some respects it stands roughly midway between English and Anglo-Saxon, in others it has of course diverged from the Anglo-Saxon line. When I pointed out in the preceding chapter that dialects formed because a language broken up into local segments could not move along the same drift in all of these segments, I meant of course that it could not move along identically the same drift. The general drift of a language has its depths. At the surface the current is relatively fast. In certain features dialects drift apart rapidly. By that very fact these features betray themselves as less fundamental to the genius of the language than the more slowly modifiable features in which the dialects keep together long after they have grown to be mutually alien forms of speech. But this is not all. The momentum of the more fundamental, the pre-dialectic, drift is often such that languages long disconnected will pass through the same or strikingly similar phases. In many such cases it is perfectly clear that there could have been no dialectic interinfluencing.

These parallelisms in drift may operate in the phonetic as well as in the morphological sphere, or they may affect both at the same time. Here is an interesting example. The English type of plural represented by _foot_: _feet_, _mouse_: _mice_ is strictly parallel to the German _Fuss_: _Fuesse_, _Maus_: _Maeuse_. One would be inclined to surmise that these dialectic forms go back to old Germanic or West-Germanic alternations of the same type. But the documentary evidence shows conclusively that there could have been no plurals of this type in primitive Germanic. There is no trace of such vocalic mutation (“umlaut”) in Gothic, our most archaic Germanic language. More significant still is the fact that it does not appear in our oldest Old High German texts and begins to develop only at the very end of the Old High German period (circa 1000 A.D.). In the Middle High German period the mutation was carried through in all dialects. The typical Old High German forms are singular _fuoss_, plural _fuossi_;[145] singular _mus_, plural _musi_. The corresponding Middle High German forms are _fuoss_, _fueesse_; _mus_, _muese_. Modern German _Fuss_: _Fuesse_, _Maus_: _Maeuse_ are the regular developments of these medieval forms. Turning to Anglo-Saxon, we find that our modern English forms correspond to _fot_, _fet_; _mus_, _mys_.[146] These forms are already in use in the earliest English monuments that we possess, dating from the eighth century, and thus antedate the Middle High German forms by three hundred years or more. In other words, on this particular point it took German at least three hundred years to catch up with a phonetic-morphological drift[147] that had long been under way in English. The mere fact that the affected vowels of related words (Old High German _uo_, Anglo-Saxon _o_) are not always the same shows that the affection took place at different periods in German and English.[148] There was evidently some general tendency or group of tendencies at work in early Germanic, long before English and German had developed as such, that eventually drove both of these dialects along closely parallel paths.

[Footnote 145: I have changed the Old and Middle High German orthography slightly in order to bring it into accord with modern usage. These purely orthographical changes are immaterial. The _u_ of _mus_ is a long vowel, very nearly like the _oo_ of English _moose_.]

[Footnote 146: The vowels of these four words are long; _o_ as in _rode_, _e_ like _a_ of _fade_, _u_ like _oo_ of _brood_, _y_ like German _ue_.]

[Footnote 147: Or rather stage in a drift.]

[Footnote 148: Anglo-Saxon _fet_ is “unrounded” from an older _foet_, which is phonetically related to _fot_ precisely as is _mys_ (i.e., _mues_) to _mus_. Middle High German _ue_ (Modern German _u_) did not develop from an “umlauted” prototype of Old High German _uo_ and Anglo-Saxon _o_, but was based directly on the dialectic _uo_. The unaffected prototype was long _o_. Had this been affected in the earliest Germanic or West-Germanic period, we should have had a pre-German alternation _fot_: _foeti_; this older _oe_ could not well have resulted in _ue_. Fortunately we do not need inferential evidence in this case, yet inferential comparative methods, if handled with care, may be exceedingly useful. They are indeed indispensable to the historian of language.]

How did such strikingly individual alternations as _fot_: _fet_, _fuoss_: _fueesse_ develop? We have now reached what is probably the most central problem in linguistic history, gradual phonetic change. “Phonetic laws” make up a large and fundamental share of the subject-matter of linguistics. Their influence reaches far beyond the proper sphere of phonetics and invades that of morphology, as we shall see. A drift that begins as a slight phonetic readjustment or unsettlement may in the course of millennia bring about the most profound structural changes. The mere fact, for instance, that there is a growing tendency to throw the stress automatically on the first syllable of a word may eventually change the fundamental type of the language, reducing its final syllables to zero and driving it to the use of more and more analytical or symbolic[149] methods. The English phonetic laws involved in the rise of the words _foot_, _feet_, _mouse_ and _mice_ from their early West-Germanic prototypes _fot_, _foti_, _mus_, _musi_[150] may be briefly summarized as follows:

[Footnote 149: See page 133.]

[Transcriber’s note: Footnote 149 refers to the paragraph beginning on line 4081.]

[Footnote 150: Primitive Germanic _fot(s)_, _fotiz_, _mus_, _musiz_; Indo-European _pods_, _podes_, _mus_, _muses_. The vowels of the first syllables are all long.]

1. In _foti_ “feet” the long _o_ was colored by the following _i_ to long _oe_, that is, _o_ kept its lip-rounded quality and its middle height of tongue position but anticipated the front tongue position of the _i_; _oe_ is the resulting compromise. This assimilatory change was regular, i.e., every accented long _o_ followed by an _i_ in the following syllable automatically developed to long _oe_; hence _tothi_ “teeth” became _toethi_, _fodian_ “to feed” became _foedian_. At first there is no doubt the alternation between _o_ and _oe_ was not felt as intrinsically significant. It could only have been an unconscious mechanical adjustment such as may be observed in the speech of many to-day who modify the “oo” sound of words like _you_ and _few_ in the direction of German _ue_ without, however, actually departing far enough from the “oo” vowel to prevent their acceptance of _who_ and _you_ as satisfactory rhyming words. Later on the quality of the _oe_ vowel must have departed widely enough from that of _o_ to enable _oe_ to rise in consciousness[151] as a neatly distinct vowel. As soon as this happened, the expression of plurality in _foeti_, _toethi_, and analogous words became symbolic and fusional, not merely fusional.

[Footnote 151: Or in that unconscious sound patterning which is ever on the point of becoming conscious. See page 57.]

[Transcriber’s note: Footnote 151 refers to the paragraph beginning on line 1797.]

2. In _musi_ “mice” the long _u_ was colored by the following _i_ to long _ue_. This change also was regular; _lusi_ “lice” became _luesi_, _kui_ “cows” became _kuei_ (later simplified to _kue_; still preserved as _ki-_ in _kine_), _fulian_ “to make foul” became _fuelian_ (still preserved as _-file_ in _defile_). The psychology of this phonetic law is entirely analogous to that of 1.

3. The old drift toward reducing final syllables, a rhythmic consequence of the strong Germanic stress on the first syllable, now manifested itself. The final _-i_, originally an important functional element, had long lost a great share of its value, transferred as that was to the symbolic vowel change (_o_: _oe_). It had little power of resistance, therefore, to the drift. It became dulled to a colorless _-e_; _foeti_ became _foete_.

4. The weak _-e_ finally disappeared. Probably the forms _foete_ and _foet_ long coexisted as prosodic variants according to the rhythmic requirements of the sentence, very much as _Fuesse_ and _Fuess’_ now coexist in German.

5. The _oe_ of _foet_ became “unrounded” to long _e_ (our present _a_ of _fade_). The alternation of _fot_: _foti_, transitionally _fot_: _foeti_, _foete_, _foet_, now appears as _fot_: _fet_. Analogously, _toeth_ appears as _teth_, _foedian_ as _fedian_, later _fedan_. The new long _e_-vowel “fell together” with the older _e_-vowel already existent (e.g., _her_ “here,” _he_ “he”). Henceforward the two are merged and their later history is in common. Thus our present _he_ has the same vowel as _feet_, _teeth_, and _feed_. In other words, the old sound pattern _o_, _e_, after an interim of _o_, _oe_, _e_, reappeared as _o_, _e_, except that now the _e_ had greater “weight” than before.

6. _Fot_: _fet_, _mus_: _mues_ (written _mys_) are the typical forms of Anglo-Saxon literature. At the very end of the Anglo-Saxon period, say about 1050 to 1100 A.D., the _ue_, whether long or short, became unrounded to _i_. _Mys_ was then pronounced _mis_ with long _i_ (rhyming with present _niece_). The change is analogous to 5, but takes place several centuries later.

7. In Chaucer’s day (circa 1350-1400 A.D.) the forms were still _fot_: _fet_ (written _foot_, _feet_) and _mus_: _mis_ (written very variably, but _mous_, _myse_ are typical). About 1500 all the long _i_-vowels, whether original (as in _write_, _ride_, _wine_) or unrounded from Anglo-Saxon _ue_ (as in _hide_, _bride_, _mice_, _defile_), became diphthongized to _ei_ (i.e., _e_ of _met_ + short _i_). Shakespeare pronounced _mice_ as _meis_ (almost the same as the present Cockney pronunciation of _mace_).

8. About the same time the long _u_-vowels were diphthongized to _ou_ (i.e., _o_ of present Scotch _not_ + _u_ of _full_). The Chaucerian _mus_: _mis_ now appears as the Shakespearean _mous_: _meis_. This change may have manifested itself somewhat later than 7; all English dialects have diphthongized old Germanic long _i_,[152] but the long undiphthongized _u_ is still preserved in Lowland Scotch, in which _house_ and _mouse_ rhyme with our _loose_. 7 and 8 are analogous developments, as were 5 and 6; 8 apparently lags behind 7 as 6, centuries earlier, lagged behind 7.

[Footnote 152: As have most Dutch and German dialects.]

9. Some time before 1550 the long _e_ of _fet_ (written _feet_) took the position that had been vacated by the old long _i_, now diphthongized (see 7), i.e., _e_ took the higher tongue position of _i_. Our (and Shakespeare’s) “long _e_” is, then, phonetically the same as the old long _i_. _Feet_ now rhymed with the old _write_ and the present _beat_.

10. About the same time the long _o_ of _fot_ (written _foot_) took the position that had been vacated by the old long _u_, now diphthongized (see 8), i.e., _o_ took the higher tongue position of _u_. Our (and Shakespeare’s) “long _oo_” is phonetically the same as the old long _u_. _Foot_ now rhymed with the old _out_ and the present _boot_. To summarize 7 to 10, Shakespeare pronounced _meis_, _mous_, _fit_, _fut_, of which _meis_ and _mous_ would affect our ears as a rather “mincing” rendering of our present _mice_ and _mouse_, _fit_ would sound practically identical with (but probably a bit more “drawled” than) our present _feet_, while _foot_, rhyming with _boot_, would now be set down as “broad Scotch.”

11. Gradually the first vowel of the diphthong in _mice_ (see 7) was retracted and lowered in position. The resulting diphthong now varies in different English dialects, but _ai_ (i.e., _a_ of _father_, but shorter, + short _i_) may be taken as a fairly accurate rendering of its average quality.[153] What we now call the “long _i_” (of words like _ride, bite, mice_) is, of course, an _ai_-diphthong. _Mice_ is now pronounced _mais_.

[Footnote 153: At least in America.]

12. Analogously to 11, the first vowel of the diphthong in _mouse_ (see 8) was unrounded and lowered in position. The resulting diphthong may be phonetically rendered _au_, though it too varies considerably according to dialect. _Mouse_, then, is now pronounced _maus_.

13. The vowel of _foot_ (see 10) became “open” in quality and shorter in quantity, i.e., it fell together with the old short _u_-vowel of words like _full_, _wolf_, _wool_. This change has taken place in a number of words with an originally long _u_ (Chaucerian long close _o_), such as _forsook_, _hook_, _book_, _look_, _rook_, _shook_, all of which formerly had the vowel of _boot_. The older vowel, however, is still preserved in most words of this class, such as _fool_, _moon_, _spool_, _stoop_. It is highly significant of the nature of the slow spread of a “phonetic law” that there is local vacillation at present in several words. One hears _roof_, _soot_, and _hoop_, for instance, both with the “long” vowel of _boot_ and the “short” of _foot_. It is impossible now, in other words, to state in a definitive manner what is the “phonetic law” that regulated the change of the older _foot_ (rhyming with _boot_) to the present _foot_. We know that there is a strong drift towards the short, open vowel of _foot_, but whether or not all the old “long _oo_” words will eventually be affected we cannot presume to say. If they all, or practically all, are taken by the drift, phonetic law 13 will be as “regular,” as sweeping, as most of the twelve that have preceded it. If not, it may eventually be possible, if past experience is a safe guide, to show that the modified words form a natural phonetic group, that is, that the “law” will have operated under certain definable limiting conditions, e.g., that all words ending in a voiceless consonant (such as _p_, _t_, _k_, _f_) were affected (e.g., _hoof_, _foot_, _look_, _roof_), but that all words ending in the _oo_-vowel or in a voiced consonant remained unaffected (e.g., _do_, _food_, _move_, _fool_). Whatever the upshot, we may be reasonably certain that when the “phonetic law” has run its course, the distribution of “long” and “short” vowels in the old _oo_-words will not seem quite as erratic as at the present transitional moment.[154] We learn, incidentally, the fundamental fact that phonetic laws do not work with spontaneous automatism, that they are simply a formula for a consummated drift that sets in at a psychologically exposed point and gradually worms its way through a gamut of phonetically analogous forms.

[Footnote 154: It is possible that other than purely phonetic factors are also at work in the history of these vowels.]

It will be instructive to set down a table of form sequences, a kind of gross history of the words _foot_, _feet_, _mouse_, _mice_ for the last 1500 years:[155]

[Footnote 155: The orthography is roughly phonetic. Pronounce all accented vowels long except where otherwise indicated, unaccented vowels short; give continental values to vowels, not present English ones.]

I. _fot_: _foti_; _mus_: _musi_ (West Germanic) II. _fot_: _foeti_; _mus_: _muesi_
III. _fot_: _foete_; _mus_: _muese_ IV. _fot_: _foet_; _mus_: _mues_
V. _fot_: _fet_; _mus_: _mues_ (Anglo-Saxon) VI. _fot_: _fet_; _mus_: _mis_(Chaucer) VII. _fot_: _fet_; _mous_: _meis_
VIII. _fut_ (rhymes with _boot_): _fit_; _mous_: _meis_ (Shakespeare) IX. _fut_: _fit_; _maus_: _mais_
X. _fut_ (rhymes with _put_): _fit_; _maus_: _mais_ (English of 1900)

It will not be necessary to list the phonetic laws that gradually differentiated the modern German equivalents of the original West Germanic forms from their English cognates. The following table gives a rough idea of the form sequences in German:[156]

[Footnote 156: After I. the numbers are not meant to correspond chronologically to those of the English table. The orthography is again roughly phonetic.]

I. _fot_: _foti_; _mus_: _musi_ (West Germanic) II. _foss_:[157] _fossi_; _mus_: _musi_ III. _fuoss_: _fuossi_; _mus_: _musi_ (Old High German) IV. _fuoss_: _fueessi_; _mus_: _muesi_
V. _fuoss_: _fueesse_; _mus_: _muese_ (Middle High German) VI. _fuoss_: _fueesse_; _mus_: _mueze_[158] VII. _fuos_: _fueese_; _mus_: _mueze_
VIII. _fuos_: _fueese_; _mous_: _moeueze_ IX. _fus_: _fuese_; _mous_: _moeueze_ (Luther) X. _fus_: _fuese_; _maus_: _moize_ (German of 1900)

[Footnote 157: I use _ss_ to indicate a peculiar long, voiceless _s_-sound that was etymologically and phonetically distinct from the old Germanic _s_. It always goes back to an old _t_. In the old sources it is generally written as a variant of _z_, though it is not to be confused with the modern German _z_ (= _ts_). It was probably a dental (lisped) _s_.]

[Footnote 158: _Z_ is to be understood as French or English _z_, not in its German use. Strictly speaking, this “z” (intervocalic _-s-_) was not voiced but was a soft voiceless sound, a sibilant intermediate between our _s_ and _z_. In modern North German it has become voiced to _z_. It is important not to confound this _s_–_z_ with the voiceless intervocalic _s_ that soon arose from the older lisped _ss_. In Modern German (aside from certain dialects), old _s_ and _ss_ are not now differentiated when final (_Maus_ and _Fuss_ have identical sibilants), but can still be distinguished as voiced and voiceless _s_ between vowels (_Maeuse_ and _Fuesse_).]

We cannot even begin to ferret out and discuss all the psychological problems that are concealed behind these bland tables. Their general parallelism is obvious. Indeed we might say that to-day the English and German forms resemble each other more than does either set the West Germanic prototypes from which each is independently derived. Each table illustrates the tendency to reduction of unaccented syllables, the vocalic modification of the radical element under the influence of the following vowel, the rise in tongue position of the long middle vowels (English _o_ to _u_, _e_ to _i_; German _o_ to _uo_ to _u_, _uee_ to _ue_), the diphthongizing of the old high vowels (English _i_ to _ei_ to _ai_; English and German _u_ to _ou_ to _au_; German _ue_ to _oeue_ to _oi_). These dialectic parallels cannot be accidental. They are rooted in a common, pre-dialectic drift.

Phonetic changes are “regular.” All but one (English table, X.), and that as yet uncompleted, of the particular phonetic laws represented in our tables affect all examples of the sound in question or, if the phonetic change is conditional, all examples of the same sound that are analogously circumstanced.[159] An example of the first type of change is the passage in English of all old long _i_-vowels to diphthongal _ai_ via _ei_. The passage could hardly have been sudden or automatic, but it was rapid enough to prevent an irregularity of development due to cross drifts. The second type of change is illustrated in the development of Anglo-Saxon long _o_ to long _e_, via _oe_, under the influence of a following _i_. In the first case we may say that _au_ mechanically replaced long _u_, in the second that the old long _o_ “split” into two sounds–long _o_, eventually _u_, and long _e_, eventually _i_. The former type of change did no violence to the old phonetic pattern, the formal distribution of sounds into groups; the latter type rearranged the pattern somewhat. If neither of the two sounds into which an old one “splits” is a new sound, it means that there has been a phonetic leveling, that two groups of words, each with a distinct sound or sound combination, have fallen together into one group. This kind of leveling is quite frequent in the history of language. In English, for instance, we have seen that all the old long _ue_-vowels, after they had become

Language by Edward SapirAn introduction to the study of speec (page 3)