nitive feminine plural *-asom (Greek -aon; Latin -arum, Umbrian -aru, Oscan -azum) and of the pronominal ending of the nominative masculine plural *-oi (Greek -oi; Latin -i). The last innovation, however, is not shared with Osco-Umbrian, but is found instead in Germanic (in the strong declension of adjectives) and partly in Celtic. The dialectal individuality of Greek is very clearly marked in the organization of the verb (see below), which is without parallel except for an approximation in Indo-Iranian.


Indo-European is the name of a family of languages that by 1000 BC were spoken over most of Europe and in much of Southwest and South Asia; from the second half of the 15th century the Indo-European tongues have spread to most other inhabited parts of the world. The term Indo-Hittite is used by scholars who believe that Hittite and the other Anatolian languages (see below) are not just one branch of Indo-European but rather a branch coordinate with all the rest put together; thus, Indo-Hittite has been used for a family consisting of Indo-European proper plus Anatolian. As long as this view is neither definitively proved nor disproved, it is convenient to keep the traditional use of the term Indo-European. Languages of the family. The well-attested languages of the Indo-European family fall fairly neatly into the 10 main branches listed below; these are arranged according to the age of their oldest sizable texts. Anatolian. Now extinct, Anatolian was spoken during the 1st and 2nd millennia BC in what is presently Asian Turkey and northern Syria. By far the best known of its members is Hittite, the official language of the Hittite empire, which flourished in the 2nd millennium. Very few Hittite texts were known before 1906, and their interpretation as Indo-European was not generally accepted until after 1915; the integration of Hittite data into Indo-European comparative grammar has, therefore, been one of the principal developments of Indo-European studies in the 20th century. The oldest Hittite texts date from the 17th century BC, the latest from the 13th. For more information, see below Anatolian languages. Indo-Iranian. Indo-Iranian comprises two main subbranches, Indo-Aryan (Indic) and Iranian. Indo-Aryan languages have been spoken in what is now northern and central India and Pakistan since before 1000 BC. 
Aside from a very poorly known dialect spoken in or near northern Iraq during the 2nd millennium BC, the oldest record of an Indo-Aryan language is the Vedic Sanskrit of the Rigveda (Rgveda), the oldest of the sacred scriptures of India, dating roughly from 1000 BC. Examples of modern Indo-Aryan languages are Hindi, Bengali, Sinhalese (spoken in Sri Lanka), and the many dialects of Romany, the language of the Gypsies (Rom). Iranian languages were spoken in the 1st millennium BC in present-day Iran and Afghanistan and also in the steppes to the north, from modern Hungary to East (Chinese) Turkistan. The only well-known ancient varieties are Avestan, the sacred language of the Zoroastrians (Parsees), and Old Persian, the official language of Darius I (ruled 522-486 BC) and Xerxes I (486-465 BC) and their successors. Some modern Iranian languages are Persian (Farsi), Pashto (Afghan), Kurdish, and Ossetic. For more information, see below Indo-Iranian languages. Greek. Greek, despite its numerous dialects, has been a single language throughout its history. It has been spoken in Greece since at least 1600 BC, and, in all probability, since the end of the 3rd millennium. The earliest texts may be the Linear B tablets, some of which may date from as far back as 1400 BC (the date is disputed), and some of which certainly date to 1200 BC. This material, very sparse and difficult to interpret, was deciphered as Greek in 1952, though some scholars dispute the finding. The Homeric epics--the Iliad and the Odyssey--composed for the most part in the 8th century BC, are the oldest texts of any bulk. For more information, see below Greek language. Italic. The principal language of the Italic group is Latin, originally the speech of the city of Rome and the ancestor of the modern Romance languages: Italian, Romanian, Spanish, Portuguese, French, etc. The earliest Latin inscriptions apparently date from the 6th century BC, with literature beginning in the 3rd century. 
Scholars are not in agreement as to how many other ancient languages of Italy and Sicily belong in the same branch as Latin. For more information on Latin, the languages derived from it, and the other languages that belong to or are sometimes included in the Italic branch of Indo-European, see below Italic languages; Romance languages. Germanic. In the middle of the 1st millennium BC, Germanic tribes lived in southern Scandinavia and northern Germany. Their expansions and migrations from the 2nd century BC onward are largely recorded in history. The oldest Germanic language of which much is known is the Gothic of the 4th century AD. Other languages include English, German, Dutch, Danish, Swedish, Norwegian, and Icelandic. For more information on the Germanic languages, see below Germanic languages; English language. Armenian. Armenian, like the Greek tongue, is a single language. Speakers of Armenian are recorded as being in what now constitutes eastern Turkey and Armenia as early as the 6th century BC, but the oldest Armenian texts date from the 5th century AD. For more information, see below Armenian language. Tocharian. Tocharian, now extinct, was spoken in present-day Chinese Turkistan in the 1st millennium AD. Two distinct languages are known, labelled A (Turfanian) and B (Kuchean); many scholars consider Tocharian A and B to be two dialects of the same language. One group of travel permits for caravans can be dated to the early 7th century, and it appears that other texts date from the same or from neighbouring centuries. These languages became known to scholars only in the first decade of the 20th century; they have been less important for Indo-European studies than has Hittite, partly because their testimony about the Indo-European parent language is obscured by 2,000 more years of change and partly because Tocharian testimony fits fairly well with that of the previously known non-Anatolian languages. For more information, see below Tocharian language. 
Celtic. The Celtic language was spoken in the last centuries before the Christian Era over a wide area of Europe, from Spain and Britain to the Balkans, with one group (the Galatians) even in Asia Minor. Very little of the Celtic of that time and the ensuing centuries has survived, and this branch is known almost entirely from the Insular Celtic languages--Irish, Welsh, and others--spoken in and near the British Isles, as recorded from the 8th century AD onward. For further information, see below Celtic languages. Balto-Slavic. The grouping of Baltic and Slavic into a single branch is somewhat controversial, but the exclusively shared features outweigh the divergences. At the beginning of the Christian Era, Baltic and Slavic tribes occupied a large area of eastern Europe, east of the Germanic tribes and north of the Iranians, including much of present-day Poland and what was formerly the western Soviet Union--namely, Belarus, Ukraine, and westernmost Russia. The Slavic area was in all likelihood relatively small, perhaps centred in what is now southern Poland. But in the 5th century AD the Slavs began expanding in all directions, until today the Slavic languages are spoken over the greater part of eastern Europe and northern Asia. The Baltic-speaking area, however, has contracted, so that Baltic languages are presently confined to Lithuania and Latvia. The earliest Slavic texts, written in a dialect called Old Church Slavonic, date from the 9th century AD; the oldest substantial material in Baltic comes from the end of the 14th century, and the oldest connected texts from the 16th century. For more information, see below Baltic languages; Slavic languages. Albanian. Albanian, the language of the present-day republic of Albania, is known from the 15th century AD. It presumably continues one of the very poorly attested ancient Indo-European languages of the Balkan peninsula, but which one is not clear. For more information, see below Albanian language. 
In addition to the tongues just listed, there are several poorly documented extinct languages of which enough is known to be sure that they were Indo-European and that they did not belong in any of the branches enumerated above (e.g., Phrygian, Macedonian). Of a few, too little is known to be sure whether they were Indo-European or not (e.g., Ligurian). Establishment of the family. Shared characteristics. The chief reason for grouping the Indo-European languages together is that they share a number of items of basic vocabulary, including grammatical affixes, whose shapes in the different languages can be related to one another by statable phonetic rules. Especially important are the shared patterns of alternation of sounds. Thus the agreement of Sanskrit ás-ti, Latin es-t, and Gothic is-t, all meaning "is," is greatly strengthened by the identical reduction of the root to s- in the plural in all three languages: Sanskrit s-ánti, Latin s-unt, Gothic s-ind "they are." Agreements in pure structure, totally divorced from phonetic substance, are, at best, of dubious value in proving membership in the Indo-European family. Table 1 gives examples of typical vocabulary items widely shared within the Indo-European family that have been decisive in establishing the family. A blank indicates that the language in question does not use the item in accordance with the given meaning or that its word for that meaning is unknown. Similarities in grammatical endings are shown in Table 2 by samples of noun declension and verb inflection in some of the more archaic languages that have retained the inflectional endings of Indo-European in relatively unchanged form. Note that Old Lithuanian -į and -ų were nasalized vowels, representing a continuation from the earlier forms *-in and *-un. (The asterisk marks a form that is not actually found in any document or living dialect but is reconstructed as having once existed in the prehistory of the language.)
The statable phonetic rules referred to earlier are not always obvious without careful observation. Note that the English dental consonants t, d, and th do not correspond in a straightforward manner to the Greek dental sounds t, d, and th; that is, English t does not occur where Greek t appears, nor English d where Greek has d. But the relationships between the sounds are not random either--English t does not correspond to Greek t in one word, to d in a second, and to th in a third, according to no discernible pattern. Rather, where Greek has initial t, English has th, as in "that" and "three"; where Greek has d, English has t, as in "tree," "two," and "ten"; and where Greek has th, English has d, as in "daughter." Note also that phonetic similarity as such is not needed to establish relationship. Thus, many of the Armenian words in Table 1 look quite different from the related words in other Indo-European languages; but here too regular rules of correspondence can be found; e.g., Greek initial p corresponds to Armenian h or zero (a lack of consonant) in the words meaning "fire," "father," "foot," "five." Linguistic studies of the family. The ancient Greeks and Romans readily perceived that their languages were related to each other, and, as other European languages became objects of scholarly attention in the late Middle Ages and the Renaissance, many of these were seen to be more similar to Latin and Greek than, for example, to Hebrew or Hungarian. But an accurate idea of the true bounds of the Indo-European family became possible only when, in the 16th century, Europeans began to learn Sanskrit.
The massive similarities between Sanskrit and Latin and Greek were noted early, but the first person to make the correct inference and state it conspicuously was the English Orientalist and jurist Sir William Jones, who in 1786 said in his presidential address to the Asiatic Society that Sanskrit bore to both Greek and Latin a stronger affinity, both in the roots of verbs, and in the forms of grammar, than could possibly have been produced by accident; so strong, indeed, that no philologer could examine them all three without believing them to have sprung from some common source, which, perhaps, no longer exists. There is a similar reason, though not quite so forcible, for supposing that both the Gothick [i.e., Germanic] and the Celtick, though blended with a very different idiom, had the same origin with the Sanscrit; and the old Persian might be added to the same family . . . . The detailed evidence on which Jones based his conclusion was not presented until the 19th century. In 1816 Franz Bopp, the German philologist, presented his Über das Conjugationssystem der Sanskritsprache in Vergleichung mit jenem der griechischen, lateinischen, persischen und germanischen Sprache ("On the system of conjugation of the Sanskrit language, in comparison with those of Greek, Latin, Persian, and Germanic"), in which the relation of these five languages was demonstrated on the basis of a detailed comparison of verb morphology (structure). Two years later there appeared the "Undersøgelse om det gamle Nordiske eller Islandske Sprogs Oprindelse" ("Investigation on the Origin of the Old Norse or Icelandic Language"), by the Danish philologist Rasmus Rask, originally written in 1814. This work demonstrated methodically the relation of Germanic to Latin, Greek, Slavic, and Baltic.
In 1822 the second edition of the first volume of Jacob Grimm's Deutsche Grammatik ("Germanic Grammar") was published; in this grammar were discussed the peculiar Indo-European vowel alternations called Ablaut by Grimm (e.g., English "sing, sang, sung"; or Greek peíth-ō "I persuade," pé-poith-a "I am persuaded," é-pith-on "I persuaded"). In addition, Grimm tried to find the principle behind the correspondences of Germanic stop and spirant consonants (the first made with complete stoppage of the breath, and the second made with constriction of the breath but not complete stoppage) to the consonants of other Indo-European languages. The sound changes implied by these correspondences have become known as "Grimm's Law." Examples of it include the stop consonant p in Latin pater corresponding to the spirant consonant f in "father," and the correspondences between English and Greek t, d, and th discussed above. Bopp demonstrated in 1838 that the Celtic languages were Indo-European, as had been asserted by Jones. In 1850 the German philologist August Schleicher did the same for Albanian, and in 1877 another German philologist, Heinrich Hübschmann, showed that Armenian was an independent branch of Indo-European, rather than a member of the Iranian subbranch. Since then, the Indo-European family has been enlarged by the discovery of Tocharian and of Hittite and other Anatolian languages, and by the recognition, with the aid of Hittite, that Lycian, known and partly deciphered already in the 19th century, belongs to the Anatolian branch of Indo-European. The Indo-European character of Tocharian was announced by the German scholars Emil Sieg and Wilhelm Siegling in 1908.
The Norwegian orientalist Jørgen Alexander Knudtzon recognized Hittite as Indo-European on the basis of two letters found in Egypt (translated in Die zwei Arzawa-Briefe, 1902; "The Two Arzawa Letters"), but his views were not generally accepted until 1915, when Bedřich Hrozný published the first report of his own decipherment of the much more copious material that had meanwhile been found in the ruins of the Hittite capital itself. The first full comparative grammar of the major Indo-European languages was Bopp's Vergleichende Grammatik des Sanskrit, Zend, Griechischen, Lateinischen, Litthauischen, Altslawischen, Gotischen und Deutschen (1833-52; "Comparative Grammar of Sanskrit, Zend, Greek, Latin, Lithuanian, Old Slavic, Gothic, and German"). But this and August Schleicher's shorter Compendium der vergleichenden Grammatik der indogermanischen Sprachen (1861-62; "Compendium of the Comparative Grammar of the Indo-European Languages") were rendered obsolete by the major breakthrough of the 1870s, when scholars realized that sound correspondences are not merely rules of thumb that do not have to be strictly observed, and that apparent exceptions to sound laws can often be accounted for by stating them more accurately or by reconstructing additional different sounds in the parent language. The difference between Gothic d in fadar "father" and þ in broþar "brother," for example, both corresponding to t in Sanskrit, Greek, and Latin, proved to be correlated with the original position of the accent, a discovery known as Verner's Law (named for the Danish linguist Karl Verner). Thus, d appears when the preceding syllable was originally unaccented (fadar : Greek patér-, Sanskrit pitár-), and þ occurs when the preceding syllable was originally accented (broþar : Greek phrā́tēr "member of a clan," Sanskrit bhrā́tar-).
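The conditioning just described can be put as a small rule. The following is a minimal sketch only, covering the dental alone; the function name, the rough syllable division, and the way the accent is encoded (as the index of the accented syllable) are assumptions made for illustration and are not part of Verner's own formulation.

```python
# Sketch of the accent conditioning described for Gothic d versus þ.
# Assumptions (hypothetical, for illustration): words are given as rough
# syllable lists without diacritics, and the accent is passed separately
# as the index of the originally accented syllable.

def gothic_reflex_of_t(syllables, accent_index):
    """Return the Gothic outcome of a medial t in the cognate form.

    'þ' if the syllable before the t was originally accented,
    'd' if it was originally unaccented.
    """
    for i, syllable in enumerate(syllables):
        if i > 0 and syllable.startswith("t"):
            return "þ" if accent_index == i - 1 else "d"
    raise ValueError("no medial t found")


# Sanskrit pitár-: accent on the second syllable, so the syllable before t
# is unaccented (Gothic fadar).
print(gothic_reflex_of_t(["pi", "tar"], accent_index=1))

# Sanskrit bhrā́tar-: accent on the first syllable, so the syllable before t
# is accented (Gothic broþar).
print(gothic_reflex_of_t(["bhra", "tar"], accent_index=0))
```

The sketch deliberately ignores everything except the position of the accent, which is the one factor the comparison of fadar and broþar isolates.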
The knowledge and opinions that had accumulated by the end of the 19th century are largely incorporated in the German linguist Karl Brugmann's Grundriss der vergleichenden Grammatik der indogermanischen Sprachen (2nd ed., 1897-1916; "Outline of Comparative Indo-European Grammar"), which remains the latest full-scale treatment of the family. The parent language. By comparing the recorded Indo-European languages, especially the most ancient ones, much of the parent language from which they are descended can be reconstructed. This reconstructed parent language is sometimes called simply "Indo-European," but in this article the term Proto-Indo-European is preferred. Phonology. In Proto-Indo-European there were at least 11 stop consonants. In the following grid these sounds are arranged according to the place in the mouth where the stoppage was made and the activity of the vocal cords during and immediately after the stoppage:

                   Labial   Dental   Palatal   Labiovelar
Voiceless          p        t        ḱ         kʷ
Voiced                      d        ǵ         gʷ
Voiced aspirate    bh       dh       ǵh        gʷh

Labial denotes a sound made with the lips; dental, with the tip of the tongue against the back of the teeth. The palatals were probably made by contact between the upper surface of the tongue and the hard palate (the roof of the mouth), like Hungarian ty and gy in atya and Magyar. The labiovelars were probably made by contact between the upper surface of the tongue and the soft palate (the area behind the hard palate), with a concomitant rounding of the lips. Voiceless designates sounds made without vibration of the vocal cords; voiced sounds are pronounced with vibration of the vocal cords. The exact pronunciation of the "voiced aspirates" is uncertain. There may also have been a voiced labial stop, b, but correspondences pointing to this are few, and rarely extend beyond immediately neighbouring languages.
Correspondences that some scholars take as evidence for a set of plain velar consonants (made with the back of the tongue touching the soft palate), k, g, gh, are partly, perhaps entirely, the result of special developments of labiovelars and palatals in specific positions. The evidence for a set of voiceless aspirated stops ph, th, ḱh, kh, kʷh is extremely weak. (Aspirated consonants are sounds accompanied by a puff of breath.) There was one sibilant consonant, s, with a voiced alternant, z, that occurred automatically next to voiced stops. The existence of a second apical spirant, þ (presumed pronunciation like that of th in English "thin"), is extremely uncertain. Most scholars now agree that the parent language had one or more additional stop or spirant consonants, for which the label laryngeal is used. These consonants, however, have mostly disappeared or have become identical with other sounds in the recorded Indo-European languages, so that their former existence had to be deduced mainly from their effects on neighbouring sounds. Hence, the laryngeal sounds were not suspected until 1878, and even then they were rejected by most scholars until after 1927, when Kuryłowicz showed that Hittite often has h (perhaps a velar spirant like the ch in German ach) in places where a "laryngeal" had been posited on the evidence of the other Indo-European languages. There is still considerable disagreement about how many "laryngeals" there were, what they sounded like, what traces they left, and how best to symbolize them. Probably there were three or four, which can be written H₁, H₂, H₃ (and H₄), and probably some or all of them were palatal or (labio-)velar spirants. The principal traces they left outside Anatolian are in the quality and length of neighbouring vowels, H₂ (and H₄) changing a neighbouring e to a, and H₃ changing it to o, while all laryngeals lengthened a preceding vowel.
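The effects on neighbouring vowels described above lend themselves to a small lookup table. The following is a minimal sketch under stated assumptions: the labels H1 to H4 follow the common numbering convention, in which H2 (and H4) are the a-colouring laryngeals and H3 the o-colouring one; H1 is assumed to leave e unchanged in quality; long vowels are written with a macron; and the function name is hypothetical.

```python
# Sketch of laryngeal "colouring" and lengthening outside Anatolian.
# Assumptions (not from a single authoritative source): the labels H1..H4,
# H1 leaving the quality of e unchanged, and the macron for length.

COLOURING = {"H1": "e", "H2": "a", "H3": "o", "H4": "a"}
LONG = {"e": "ē", "a": "ā", "o": "ō"}


def outcome_of_e(laryngeal, laryngeal_position):
    """Return the vowel that results from e next to a laryngeal.

    laryngeal_position is "before" when the laryngeal precedes the vowel
    (colouring only) and "after" when it follows the vowel (colouring
    plus lengthening of the preceding vowel).
    """
    vowel = COLOURING[laryngeal]
    if laryngeal_position == "before":
        return vowel
    if laryngeal_position == "after":
        return LONG[vowel]
    raise ValueError("laryngeal_position must be 'before' or 'after'")


for name in ("H1", "H2", "H3"):
    print(name, outcome_of_e(name, "before"), outcome_of_e(name, "after"))
```

The table form makes the asymmetry explicit: vowel quality depends on which laryngeal is involved, whereas lengthening depends only on the laryngeal following the vowel.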
In Anatolian, H₂ and H₃ remained as h, at least in some positions; H₄ is tentatively set up to account for words with a that lack h in Hittite. When laryngeals between consonants disappeared, a vowel sometimes remained, as in Greek stasis, Sanskrit sthitis, Old English stede "a standing (place)" from Proto-Indo-European *stH₂tis. Scholars who do not posit "laryngeals" reconstruct a separate Proto-Indo-European vowel (called schwa indogermanicum) to account for these correspondences. Finally, there were the nasal sounds n and m, the liquids l and r, and the semivowels y and w. When y and w occurred between consonants, they were replaced by the vowels i and u. The nasals and liquids functioning as nuclei of syllables in this position (like the final sounds of English "bottom," "button," "bottle," "butter") are traditionally written n̥, m̥, l̥, r̥. Some scholars dispense with these diacritical marks and with the distinction between syllabic i and u and nonsyllabic y and w, but this obscures certain distinctions, such as that between -wn̥- in *ḱwn̥su "among dogs," Sanskrit shvasu, and -un- in *tund- "shove," Sanskrit tundate. The vowel system of Proto-Indo-European was dominated by a pattern of alternation called ablaut. The alternant (called a grade) that occurs in a given syllable of a given form is only partly predictable from the shape of the rest of the word. The basic vowel of the system was e ("normal grade"), and the changes it could undergo were loss (zero-grade), change to o (o-grade), lengthening to ē (lengthened grade), and lengthening plus change to ō (lengthened o-grade). The stem ped- "foot," for example, appears as such in Latin ped-is (normal grade) "of a foot," as -bd- in Avestan fra-bd-a- (zero-grade) "fore-foot," as pod- in Greek pod-es (o-grade) "feet," as *pēd- in Latin pēs (lengthened grade) "foot" in the nominative singular, and as *pōd- in English "foot" (lengthened o-grade).
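The five grades listed above can be generated mechanically from a root skeleton. The following is a minimal sketch; the placeholder "V" for the ablauting vowel, the grade labels used as dictionary keys, and the function name are assumptions made for illustration, and long vowels are written with a macron.

```python
# Sketch of the five ablaut grades applied to a root skeleton.
# Assumptions (hypothetical): the root is written with "V" marking the
# slot of the basic vowel e; grade names are plain strings.

GRADES = {
    "normal": "e",
    "zero": "",
    "o-grade": "o",
    "lengthened": "ē",
    "lengthened o-grade": "ō",
}


def apply_grade(root_skeleton, grade):
    """Fill the vowel slot of a root skeleton with the vowel of a grade."""
    return root_skeleton.replace("V", GRADES[grade])


for grade in GRADES:
    print(f"{grade}: *{apply_grade('pVd', grade)}-")
```

For the skeleton pVd the zero-grade comes out as pd; the sketch does not model later developments in the individual languages, such as the voicing seen in Avestan -bd-.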
Ablauting forms whose basic vowel is a, o, ē, ā, or ō in the recorded languages (e.g., Greek ag- "lead," op- "see," stā- "stand") are now believed to have had e preceded or followed by laryngeal in the parent language; e.g., *H₂eg- "lead," *H₃ekʷ- "see," *steH₂- "stand." It is uncertain whether there were additional o and a vowels besides those arising by ablaut and from e next to a laryngeal. The vowels i and u did not participate in ablaut alternations, but rather functioned primarily as the syllabic realizations of the consonants y and w, as in *leykʷ- "leave," zero-grade *likʷ-, like *derḱ- "see," zero-grade *dr̥ḱ-. Long ī and ū in the recorded languages derive, at least in part, from sequences of i or u plus laryngeal; e.g., Latin vivus "alive" from *gʷiHwós. Thus the parent language had at least the following vowels:

        Front    Back
High    i        u
Mid     e, ē     o, ō

(In forming front vowels, the highest point of the tongue is in the front of the mouth; for back vowels, that point is in the back. High vowels are those in which the tongue is highest--closest to the roof of the mouth; mid vowels are made with the tongue between the extremes of high and low.) Of these vowels, i and u really functioned as consonants, and ē, o, ō were all conditioned alternants of e. But as noted above there may also have been ī, ū, a, and a second o. The accent just before the breakup of the parent language was apparently mainly one of pitch rather than stress. Each full word had one accented syllable, presumably pronounced on a higher pitch than the others. Morphology and syntax. The Proto-Indo-European verb had three aspects: imperfective, perfective, and stative. Aspect refers to the nature of an action as described by the speaker; e.g., an event occurring once, an event recurring repeatedly, a continuing process, or a state.
The difference between English simple and "progressive" verb forms is largely one of aspect; e.g., "John wrote a letter yesterday" (implying that he finished it) versus "John was writing a letter yesterday" (describing an ongoing process, with no implication as to whether it was finished or not). The Anatolian languages lack a dimension of aspect, and it is not yet clear what the earlier system underlying both Anatolian and the rest of Indo-European was. The imperfective aspect, traditionally called present, was used for repeated actions and for ongoing processes or states; e.g., *sti-steH₂- "stand up more than once, be in the process of standing up," *weǵh-e- "be in the process of conveying," *es- "be." The perfective aspect, traditionally called aorist, expressed a single, completed occurrence of an action or process; e.g., *steH₂- "stand up, come to a stop," *weǵh-s- "convey." The stative aspect, traditionally called perfect, described states of the subject; e.g., *woyd- "know," *ste-stoH₂- "be in a standing position." Verb roots were by themselves either perfective (like *steH₂- "stand") or imperfective (like *weǵh- "convey," *es- "be"). This basic aspect, however, could be reversed by aspect markers; e.g., reduplication for imperfective, as in *sti-steH₂- (reduplication is the repetition of a word or part of a word), and -s- for perfective, as in *weǵh-s-. The stative aspect was always marked by the o-grade of the root in the indicative singular (as in *woyd- "know"), and usually also by reduplication (as in *ste-stoH₂-); it had personal endings different from those of the other two aspects. From one aspect of a given verb the shape and even the existence of the other two aspects could not be predicted; for example, *es- "be" had only the imperfective aspect.
Ways of forming imperfectives were especially numerous and often involved, in addition to their imperfective aspectual meaning, some other notion, such as performing the action habitually or repeatedly (iterative), or causing someone else to perform it (causative). One root could thus have several imperfective stems; so to the root *er- "move" there were at least a causative form, *r̥-new- "set in motion," and an iterative form, *r̥-sḱe- "go repeatedly." The Proto-Indo-European verb was also inflected for mood, by which the speaker could indicate whether he was making statements or inquiries about matters of fact; making predictions, surmises, or wishes about the future or about unreal but imagined situations; or giving commands. Compare English "If John is home now (he is eating lunch)" with the verb "is" in the indicative mood, discussing a matter of fact, with "If John were home now (he would be eating lunch)" with the verb "were" in the subjunctive mood, describing an unreal situation. There were two Proto-Indo-European suffixes expressing mood: -e- alternating with -o- for the subjunctive, corresponding roughly in meaning to the English auxiliaries "shall" and "will," and -yeH₁- alternating with -iH₁- for the optative, corresponding roughly to English "should" and "would." Verbs without one of these two suffixes were marked for mood and tense by their personal endings. These personal endings basically expressed the person and number of the verb's subject, as in Latin amo "I love," amas "you (singular) love," amat "he or she loves," amamus "we love," and so on. In the imperfective and perfective aspects there were two sets of endings, distinguishing two voices: active, in which typically the subject was not affected by the action, and mediopassive, in which typically the subject was affected, directly or indirectly.
Thus Sanskrit active yajati and mediopassive yajate both mean "he sacrifices," but the former is said of a priest who performs a sacrifice for the benefit of another, while the latter is said of a layman who hires a priest to perform a sacrifice for him. In the stative aspect there was no distinction of voice. (Voice indicates the relationship of the action expressed by the verb to the subject of the statement.) To mark mood and tense, verbs in the imperfective aspect that did not have a mood suffix had three sets of personal endings in both active and mediopassive voices: imperative, primary, and secondary. Verbs with imperative endings belonged to the imperative mood (used for commands); e.g., *s-dhí "be," *és-tu "let him be." Verbs with primary endings were marked as non-past in tense and indicative in mood; e.g., *és-ti "he is." (Indicative mood signifies objective statements and questions.) Verbs with secondary endings were unmarked for tense and mood, but were most typically used as past indicatives (e.g., *gʷhén-t "he slew") and to fill out gaps in the imperative paradigm (e.g., *s-té "be" in the plural, *gʷhn-té "ye slew; slay" in the plural). To mark such forms unambiguously as past indicatives, an augment, usually consisting of the vowel e, could be prefixed; e.g., *é-gʷhen-t "he slew," *ēst (= *é-es-t) "he was." Verbs in the perfective aspect without a mood suffix did not occur with primary endings, and so lacked a non-past indicative tense. Verbs in the stative aspect apparently lacked a distinction between primary and secondary endings, so that a form like *wóyd-e "he knows" meant also "he knew." The inflectional categories of the noun were case, number, and gender.
Eight cases can be reconstructed: nominative, for the subject of a verb; accusative, for the direct object; genitive, for the relations expressed by English "of"; dative, corresponding to the English preposition "to," as in "give a prize to the winner"; locative, corresponding to "at," "in"; ablative, "from"; instrumental, "with"; and vocative, used for the person being addressed. For examples of some of these see Table 2. Besides singular and plural number, there was a dual number for referring to two items. Each noun belonged to one of three genders: masculine, to which belonged most nouns designating male creatures; feminine, to which belonged most names of female creatures; and neuter, to which belonged only a few words for individual adult living creatures. The gender of nouns not designating living creatures was only partly predictable from their meaning. Adjectives were nouns that varied in gender according to the gender of another noun with which they were in agreement, or, if used by themselves, according to the sex of the entity to which they referred; thus, Latin bonus sermo "good speech" (masculine), bona aetas "good age" (feminine), bonum cor "good heart" (neuter), or bonus "a good man," bona "a good woman," bonum "a good thing." The neuter of an adjective was identical with the masculine except for having different endings in nominative and accusative cases. Feminine gender was either completely identical with the masculine or derived from it by means of a suffix, the two commonest being *-eH₂- and *-iH₂- (*-yeH₂-). Demonstrative, interrogative, relative, and indefinite pronouns were inflected like adjectives, with some special endings. Personal pronouns were inflected very differently. They lacked the category of gender, and marked number and case (in part) not by endings but by different stems, as is still seen in English singular nominative "I"; oblique "my," "me"; plural nominative "we"; plural oblique "our," "us."
(The oblique is any case other than nominative or vocative.) Some notable features of Proto-Indo-European syntax are: the non-ergative case system, that is, the subject of an intransitive verb is in the same case as the subject (rather than the object) of a transitive verb; concord (agreement) in case, number, and gender between adjective and noun; and use of singular verbs with neuter plural subjects, as in Greek panta rhei "all things flow," with the same verb as ho potamos rhei "the river (masculine) flows," contrasting with hoi potamoi rheousi "the rivers flow" (indicating that neuter plurals were originally collectives and grammatically singular). Lexicon and culture. Much less is known about the parent language's vocabulary than about its phonology and grammar. Sounds and grammatical categories do not easily disappear or undergo radical change in so many daughter languages that their former existence can no longer be detected. It is relatively easy, however, for an individual word to disappear or shift meaning in so many daughter languages that its existence or meaning in the parent language cannot be confidently inferred. Hence, from the linguistic evidence alone, scholars can never say that Proto-Indo-European lacked a word for any particular concept; they can only state the probability that certain items did exist, and from these items make inferences about the culture and location in time and space of the speakers of Proto-Indo-European. Thus it is supposed that the Proto-Indo-European community knew and talked about dogs (*ḱwón-), horses (*éḱwo-), sheep (*H₃éwi-), and almost certainly cows (*gʷów-) and pigs (*suH-). Probably all these animals were domesticated. At least one cereal grain was known (*yewo-), and at least one metal (*H₂eyos or *H₄eyos). There were vehicles (*woǵho-) with wheels (*kʷekʷlo-), pulled by teams joined by yokes (*yugo-).
Honey was known, and probably formed the basis of an alcoholic drink (*melit-, *medhu) related to the English "mead." Numerals up through 100 (*kmtóm) were in use. All this suggests a people with a well-developed Neolithic (characterized by simple agriculture and polished stone tools) or even Chalcolithic (copper- or bronze-using) technology. Location and date. Linguists have not found a reliable and precise way to determine from linguistic evidence alone the date at which any set of related languages must have begun diverging. The best that can be done is to estimate the degree of difference between the languages in question, taking into account all that is known about them, and then compare this estimate with the estimated degrees of difference within families of languages--such as the Romance family--whose actual time of divergence is approximately known. Using this sort of "dead reckoning," it can be said that the earliest attested Indo-European languages--Anatolian, Indo-Iranian, and Greek--are different enough that the parent language must have been split into several distinct languages well before 2000 BC, but similar enough that the first split into separate languages is not likely to have been much earlier than 3000 BC, and may have been later. For further progress the linguistic findings must be correlated with those of archaeologists and paleontologists to see if there was a population group within Eurasia that was relatively small and homogeneous before 3000 BC and that underwent considerable expansion and fragmentation beginning about 3000 BC--give or take a few centuries--such that some of its fragments can be ancestral to components of the cultures of the speakers of the various recorded Indo-European languages. The culture of this population group in the centuries around 3000 BC must also correspond to what can be inferred for Proto-Indo-European from the linguistic data.
At present the archaeological evidence seems to find such a group in the Kurgan culture of the south Russian steppe, east of the Dnepr (Dnieper) River, north of the Caucasus, and west of the Urals. According to the Lithuanian-American archaeologist Marija Gimbutas, in Indo-European and Indo-Europeans (1970), this culture began spreading west c. 4000-3500 BC (Kurgan II), and began to occupy a really wide area stretching from eastern central Europe to northern Iran c. 3500-3000 BC (Kurgan III). Allowing a few centuries for the speech of widely separated bands to diverge to the point of becoming distinct languages, this agrees tolerably well with the date suggested by the linguistic evidence for breakup of the parent language. So far the Kurgan culture has been traced back to the 5th millennium BC; its earlier antecedents are still unknown. Remote relationship of Indo-European to the Uralic languages is very likely. Geographically, the earliest reconstructible locations of the two families are contiguous; lexically, there are strong resemblances in a number of basic words or word parts, including personal, demonstrative, interrogative, and relative pronouns, personal endings of verbs, the accusative case ending -m, and such words as those for "water" and "name"; typologically, the families are fairly similar (e.g., both have many suffixes, but few or no prefixes or infixes--elements inserted within words). The resemblances, however, are too few to permit the reconstruction of a common "Indo-Uralic" parent language; the two families must have separated several thousand years before the breakup of Indo-European. If Indo-European is related to other language families--e.g., to Hamito-Semitic (Afro-Asiatic) or Caucasian--it must have diverged from them much earlier than from Uralic, because the number of cogent resemblances is much smaller. There is no evidence that Indo-European originated by fusion of components from two or more distinct language families. 
Characteristic developments of Indo-European languages. As Proto-Indo-European was splitting into the dialects that were to become the first generation of daughter languages, different innovations spread over different territories. Indo-Iranian, Balto-Slavic, Armenian, and Albanian agree in changing the palatal stops *k, *g, and *gh into spirants (s, sh, th) or affricates; e.g., Sanskrit ashri- "sharp edge," Old Church Slavonic ostru "sharp," Armenian aseln "needle," Albanian athëtë "bitter" beside Greek ákros "tip," Latin acidus "biting," all from a basic element *Hek- "sharp, pointed." (Spirants, also called fricatives, are sounds produced with audible friction as a result of the airstream passing through a narrow, but unstopped, passage in the mouth; e.g., English s, f, v. Affricates are sounds that begin as stops, with complete stoppage of the airstream, but are released as spirants, or fricatives; e.g., the ch in "church," the j in "jam.") The languages that change the palatal stops to spirants or affricates are not separated from one another by any recorded languages that preserve the palatals as stops; it is therefore inferred that the change to affricates (whence later spirants) occurred just once, and spread over a cohesive dialect area of Proto-Indo-European. Of the languages that share this change, however, Balto-Slavic shares with Germanic (including English) an m in certain case endings where other Indo-European languages, including Indo-Iranian, Armenian, and Albanian, have bh or a sound regularly developed from bh. Examples of the m ending include English "the-m" and Old Church Slavonic te-mu "to those ones"; the bh and related sounds (ph, v, b) are illustrated in the following: Sanskrit té-bhyas "to those ones," Armenian noro-vk' "with new ones," Albanian male-ve "to mountains," Greek ókhes-phin "with chariots," Latin omni-bus "for all."
Because Balto-Slavic and Germanic are neighbours, it is inferred that m replaced bh in these case endings just once in the parent language, and that the area over which this innovation spread only partly overlapped the area that adopted affricated pronunciation of the palatals. This pattern is general for changes dating from the time the parent language was breaking up into distinct languages. Each of the resulting languages shares some innovations with some of its neighbours, but only rarely do different innovations shared by two or more branches of Indo-European cover exactly the same territory. Once the dialects had become differentiated enough to be distinct languages--probably by 2000 BC, at least in most cases--each largely went its own way, and agreements in developments since then are due either to borrowing across language boundaries (as in the notable convergences between Modern Greek, Albanian, Romanian, and the southernmost Slavic languages) or to parallel but independent workings out of the same base material. Changes in phonology. In phonology, the most striking changes have been loss or reduction in many languages of final or unaccented syllables, and loss in several languages of certain consonants between vowels, often followed by contraction of the resulting vowel sequence. Thus words in modern Indo-European languages are often much shorter than their Proto-Indo-European ancestors; e.g., English "four," Armenian c'ork', colloquial Persian car "four" from *kʷetwóres; French vit (pronounced vi) "lives" from *gʷíH₃weti; Russian dvesti "two hundred" from *duwoy kmtoy. Changes in morphology. Because much of the marking of Proto-Indo-European inflectional categories was done in final syllables, loss and reduction of these syllables have often had serious grammatical consequences.
In the noun, loss of endings has generally led to loss or great reduction of the case and gender systems, while ways have generally been found to salvage the distinction between singular and plural. In Modern Persian, for example, where all final syllables have been lost, the old case and gender distinctions have disappeared also, but plural number is still regularly marked, either with -an (originally the genitive plural ending of some nouns) or with -ha (of obscure origin). In the verb, where more endings originally had two syllables, loss of final syllables has had less serious consequences for morphology. Even here, however, some languages, including English, have totally or almost totally given up the marking of subject by personal endings. Compare English "I, we, you, they love" and "he, she loves" with the Spanish conjugation for "love"--amo, amas, ama, amamos, amáis, aman--or the Russian version--ljubljú, ljúbish, ljúbit, ljúbim, ljúbite, ljúbjat. Changes in noun inflection have generally involved simplification. Almost everywhere the dual number has been lost; in many languages the noun genders have been reduced from three to two (as in French, Swedish, Lithuanian, and Hindi), or lost entirely (as in English, Armenian, and Bengali). Only Slavic has complicated the gender system, by imposing on the inherited distinctions contrasts of animate versus inanimate or of personal versus nonpersonal. Everywhere except in the oldest Indo-Iranian languages the original eight Indo-European cases have suffered reduction. Proto-Germanic had only six cases, the functions of ablative (place from which) and locative (place in which) being taken over by constructions of preposition plus the dative case. In Modern English these are reduced to two cases in nouns, a general case that does duty for the vocative, nominative, dative, and accusative ("Henry, did Bill give John the letter?"), and a possessive case continuing the old genitive ("Bill's letter").
In languages such as French and Welsh, nouns are no longer inflected for case at all. In some languages, to be sure, nouns have begun fusing with words placed directly after the nouns to create new case systems, coexisting with relics of the old. Thus, Old Lithuanian had in addition to seven inherited cases an illative (place into), made by adding -n(a) to the accusative (peklosna "into hell"), an allative (place to, toward), made by adding -p(i) to the genitive (Jesausp "to Jesus"), and an adessive (place at which), made by adding -p(i) to the locative (Joniep "in John"). Changes in the verb have been more complex. Besides loss or merger of old categories, many new forms have been created and many old forms have acquired new values. In Ancient Greek the focus of the stative aspect (perfect) has largely shifted from the present state ("he is dead") to the previous event that led to this state ("he has died"). As a result, the perfect came to mean the same as the perfective past (aorist), and has therefore disappeared from Modern Greek. New forms created in Ancient Greek include future and future perfect tenses, based on the desiderative present forms (such as "he wants to walk") of the parent language. In Germanic the principal new creation was the weak past tense (ending in a t or d), such as English "loved," "thought," German liebte, dachte, made by combining the verb stem with a past tense of the Germanic verb for "do." (The strong past tense formed by vowel alternations, like "sing," "sang," "run," "ran," comes from the Proto-Indo-European stative aspect.) In some languages participles (verbal adjectives) have come to function as finite verbs. Thus in Hindi mard stri-ko dekhta "the man sees the woman," dekhta "sees" is etymologically a participle "seeing," agreeing in number and gender with the subject mard "man."
In the past tense, mard-ne stri dekhi "the man saw the woman," the verb dekhi is etymologically a past passive participle "seen," agreeing in gender and number with the object stri "woman," and the subject is marked with an instrumental ending. Vocabulary changes. Changes in vocabulary have been even greater than those in sounds and grammar. Words in modern Indo-European languages have several sources. They may be recognizable loanwords, such as English "skunk," "chain," and "inch" (from Algonkian, French, and Latin, respectively); they may have been formed within the history or prehistory of the language itself, such as English "radar" and "rightness"; they may be of obscure origin, such as English "drink," which is common Germanic but has no cognates outside Germanic, or "boy," which is peculiar to English and Frisian; or they may be inherited words that have changed meaning, such as English "merry" from Proto-Indo-European *mrghu- "short." Only a small fraction of the vocabulary can be traced back to words that can confidently be asserted to have existed in the parent language with approximately their present meaning. The same is true, albeit in a lesser degree, even for the oldest recorded Indo-European languages. None has more than a few hundred words and roots that are clearly inherited from the parent language without essential change of meaning. Table 1 gives examples of words widely retained with little change. Typically they include pronouns; nouns, verbs, and adjectives of relatively simple and ubiquitous meaning; numerals; and simple adverbs and prepositions. Non-Indo-European influence on the family. Indo-European languages, like all languages, have always been subject to influence from neighbouring languages, both related and unrelated. 
Influence of non-Indo-European languages on the sounds and grammar of Proto-Indo-European is not demonstrable, partly because there is no direct evidence about the languages that were in contact with Indo-European before 3000 BC. It can be surmised, however, that some words are loans; e.g., *pelekus "ax," a word for an object likely to be imported or learned of from neighbours with superior technology, and which is not analyzable into a known Indo-European root plus a known Indo-European suffix. When Indo-European languages have been carried within historic times into areas occupied by speakers of other languages, they have generally taken over a number of loanwords, as with English and Spanish in the Americas or Dutch in South Africa. Aside from the special case of the pidgin and creole languages, however, there has been very little effect on sounds and grammar. These have been significantly affected within historic times only when an Indo-European language has been spoken in prolonged close contact with non-Indo-European speakers, as with Ossetic (an Iranian language) in the Caucasus, or when its speakers have been very strongly influenced culturally by speakers of a non-Indo-European language, as with Persian, in which Arabic plays much the same role as Latin does in English. In prehistoric times most branches of Indo-European were carried into territories presumably or certainly occupied by speakers of non-Indo-European languages, and it is reasonable to suppose that these languages had some effect on the speech of the newcomers. For the lexicon, this is indeed demonstrable in Hittite and Greek, at least. It is much less clear, however, that these non-Indo-European languages affected significantly the sounds and grammar of the Indo-European languages that replaced them. Perhaps the best case is India, where certain grammatical features shared by Indo-European and Dravidian languages appear to have spread from Dravidian to Indo-European rather than vice versa. 
For most other branches of Indo-European languages any attempt to claim prehistoric influence of non-Indo-European languages on sounds and grammar is rendered almost impossible because of ignorance of the non-Indo-European languages with which they might have been in contact. (W.C.)

Anatolian languages

The term Anatolian languages in its most comprehensive use includes both the Indo-European and non-Indo-European languages spoken in Anatolia (Asia Minor) before the Greco-Roman period. The Anatolian languages are known only from texts of the 2nd and 1st millennia BC; the earliest evidence is that of the so-called Cappadocian tablets (19th-18th century BC). The term Asianic is sometimes used as an alternative designation for the Anatolian languages, but, since the discovery in 1915 that Hittite, the main Anatolian language, is an Indo-European language, there has been a tendency to use Asianic in a more restricted sense for the non-Indo-European languages that existed in Anatolia before the entry of the Indo-Europeans. These are called substratum languages. Hattic (or Hattian), also misleadingly called Proto-Hittite, is the best known substratum language. It is completely unrelated to Hittite and its sister languages as well as to Hurrian, a language also spoken in Anatolia. The Anatolian group of Indo-European languages consists of Hittite, Palaic, Luwian, Hieroglyphic Luwian, Lydian, and Lycian. Hittite, Palaic, and Luwian are known from 2nd-millennium cuneiform texts found in the excavations in Bogazköy-Hattusa since 1905; Hieroglyphic Luwian is found on scattered inscriptions and seals from Anatolia (mainly the southern area) and northern Syria dating mainly from later times (i.e., between c. 1200 and 700 BC, although there are earlier examples from the empire period, c. 1400-c. 1190 BC). Lydian and Lycian are known from texts in alphabetic script from c. 600 to 200 BC.
It seems fairly reasonable to add the Carian language of southwest Anatolia to this list as well as other less well documented languages like Sidetic. More to the east, in the Caucasus region centring around Lake Van, Hurrian of the 3rd and 2nd millennia BC was replaced in the 1st millennium BC by the related Urartian language. Both of these languages are definitely non-Indo-European. Historical background of ancient Anatolia. It is customarily assumed that the Indo-Europeans entered Anatolia around or shortly after 2000 BC, although there are no specific archaeological data that might enable scholars to specify the period of entry or the route the invaders followed. On the basis of the agricultural terminology used in Hittite, it has been suggested that the entry into Anatolia was not a warlike invasion of predominantly male groups. If such had been the case, the influence of substratum languages would have been likely, but, on the contrary, the word stems used are definitely Indo-European. The differences in the terminology used in other Indo-European subgroups indicate that the "Anatolians" seceded from the parent group at an early date, before the common agricultural nomenclature came into being. On the other hand, Hittite shares the Indo-European notion of the hereafter, pictured as a pastureland with grazing cattle "for which the dead king sets out." There is a tendency among linguists to postulate an eastern route of entry into Anatolia by way of the Caucasus, because certain grammatical features--e.g., the loss of the feminine gender--might be explained as having been caused by prolonged contacts with Caucasian languages. It is likely that the Indo-European forebears of the later speakers of Hittite, Palaic, Luwian, and Lydian entered Anatolia together, following a common route, because the Anatolian languages share a considerable number of losses as well as innovations that presuppose a long common past. 
In the central parts of Anatolia, within the bend of the Halys River (modern Turkish, Kizil Irmak), and in the northern regions, Hittite and Palaic were profoundly influenced by Hattic as a substratum language. The Hattian culture also changed the political and religious concepts of the newcomers, and a clear cultural dependency of the Indo-Europeans on the older Hattian population is evident. Some scholars have stressed the likelihood that farther to the south the Luwians might have been conversant with a different substratum. In view of the absence of textual evidence, and because knowledge of the Luwian vocabulary is rather restricted, it is perhaps not surprising that this possible substratum element escapes definition. (For the history of Anatolia in the 2nd and 1st millennia BC, see TURKEY AND ANCIENT ANATOLIA: Ancient Anatolia.) The most important invaders of Anatolia in the "Dark Age" (after 1190 BC) were the Phrygians. Their language is definitely Indo-European, but it bears no relationship to the Anatolian subgroup. Rather, it seems akin to Thracian, Illyrian, or possibly Greek. Greek, in the second half of the 1st millennium BC, and, later, Latin, from the 2nd century onward, entered central Anatolia as languages of a ruling caste. Much earlier--beginning in Mycenaean times--the west coast had attracted Greek settlers. In the first half of the 1st millennium, the southern and northern shores also attracted Greek-speaking peoples. To the east in the Caucasus region, other Indo-Europeans, the Armenian-speaking invaders, penetrated into the former Urartian territory well before the beginning of the Persian period, probably in the 7th and 6th centuries BC. During Persian times, a Persian ruling caste entered eastern and also northeastern Anatolia and was still clearly recognizable in the Hellenistic and Roman periods (e.g., in Bithynia, Pontus, Cappadocia, and Commagene). 
Late data on names and scattered remarks made by Fathers of the Church indicate that until late Roman and perhaps even Byzantine times, some Anatolian dialects remained in use in certain isolated parts of the interior. Classification of the languages. Research on the Anatolian languages began in 1821 with the Lycian language and passed an initially fruitful phase in the 1880s with work on Hieroglyphic Hittite (nowadays referred to as Hieroglyphic Luwian). In 1902 the Norwegian Assyriologist Jørgen Alexander Knudtzon's study on the Arzawa letters was published; these were two letters exchanged between a king of Arzawa and Pharaoh Amenhotep III that had been found in the Amarna archive. They were written in the Hittite language in cuneiform writing. In 1915 research reached a climax with the interpretation of Cuneiform Hittite by the Czech Orientalist Bedrich Hrozný. In all four of these highlights, the discovery that the texts in question were Indo-European was either clearly expressed or more discreetly implied. This conclusion was based on both the nominal (noun) declension and the verbal conjugation: the languages had a nominative ending in -s, the accusative in -n, verbal endings like -ti and -nti for the 3rd person singular and plural of the present tense, and an imperative form like estu "let it be." These features were deemed to be sufficient proof of their Indo-European origin. Study of the Anatolian subgroup of Indo-European thus began with Lycian, the last Anatolian offshoot in the temporal sequence, then passed the intermediary stage of Hieroglyphic Luwian, and reached the 2nd-millennium Hittite language in 20th-century research. For the relationship between members of the Anatolian subgroup, see Figure 2. The non-Indo-European Hurrian and Urartian languages are related to one another, but modern research indicates that Urartian should not be considered as a direct continuation of Hurrian.
HISTORY AND DEVELOPMENT

Languages using cuneiform writing and Anatolian hieroglyphs. Hattic. The Hattic language appears as hattili in Hittite cuneiform texts. Called Proto-Hittite by some, it was the language of the linguistic substratum inside the Halys River bend and in more northerly regions. Apparently the Indo-European newcomers of Hittite stock were named with the same designation as their predecessors. All the Hattic material preserved by Hittite scribes belongs to the religious sphere of life: rituals (e.g., connected with the erection of a new building), incantations, antiphons, litanies, and myths. Among the Hattic interpolations in Hittite texts, there are some to which a Hittite translation has been added. It is impossible to ascertain the length of time that the Hattians had been present in Anatolia before the Indo-Europeans entered the country, but it seems certain that during the Hittite New Empire (c. 1400-c. 1190 BC) Hattic was a dead language. Hattic studies began in 1922 with the work of the German Assyriologist Emil Forrer. In 1935, Hans G. Güterbock, a German-born Orientalist, published a large group of texts containing Hattic material and in so doing completed the publication of the Hattic texts stemming from the Winckler excavations (1905-12). Important studies on the subject have continued to appear since then. Hittite. The Hittite language is known from the approximately 25,000 tablets or fragments of tablets preserved in the archives of Bogazköy-Hattusa, excavated by German archaeologists beginning in 1905. In Hittite cuneiform texts, the language is referred to as nesili (nasili) "language of Nesa," or nesumnili "language of the Neshite."
Earlier Hittite linguistic material may be found in the indigenous proper names and a few loanwords from the local dialect that are recorded in the Cappadocian tablets (the commercial correspondence in Assyrian of Assyrian colonists living in Anatolia, especially in the emporium at Kültepe, near modern Kayseri, between c. 1900 and 1720 BC). The data from Kültepe are sometimes referred to as "Kaneshite" (from Kanesh, the old name of Kültepe); this is obviously the modern equivalent of the word kanisumnili "language of the Kaneshite" found in a Hittite text. It is possible, or even likely, that Kanesh and Nesa do, in fact, refer to the same entity. Hittite tablets from places outside of the Hittite capital are rare; only stray examples have been found--e.g., in Tarsus, Alalakh, Ugarit, and Amarna. These findings attest to the growth of a great Hittite empire, especially between c. 1400 and c. 1190 BC. Old Hittite, the written embodiment of the earliest Indo-European language that has been discovered so far, is known from some tablets preserved in an "old ductus" type of handwriting that was typical of copies from the Old Kingdom period (c. 1700-1500 BC). The intermediary "Dark Age" between c. 1500 and c. 1400 BC is sometimes referred to as the period of the so-called Middle Hittite language. Most of the Old and Middle Hittite texts, however, are preserved in copies from the later empire period. The archives of Bogazköy-Hattusa have been found in various places in the citadel, in the Great Temple complex, and in the "House on the Slope." Although the majority of the texts are concerned with religious subjects (oracle texts, hymns, prayers, myths, rituals, and festival texts), these archives also contain material of historical, political, administrative, literary, and legal character.
The cuneiform adopted by the Hittite scribes is a variant of a writing system of Mesopotamian origin that closely resembles the ductus and shapes prevalent in tablets of the 17th century BC (layer VII) from Alalakh (modern Atsana in southeastern Turkey). It is possible that the cuneiform script might have been introduced as a result of the Hittites inducing Syrian scribes to transfer their activities to the Hittite capital during the early part of the Old Kingdom, shortly after 1650 BC. It has also been posited, with good reason, that the newly acquired script was first used to write Akkadian and was only later employed for Hittite as well. In addition to the genres enumerated above, the "scholarly literature" deserves to be mentioned. This consists of the material considered by the scribes to be essential for their training; it includes word lists, omens, and ritual prescriptions, all reflecting an encyclopaedic approach aimed at complete coverage of the subjects concerned. The Sumerian texts found in these archives belong to this class of literature. For treaties and correspondence with foreign powers, Akkadian was used as the diplomatic language of that period. Therefore, both Sumerian and Akkadian formed part of the curriculum of the qualified scribes, these languages belonging to the "eight languages" found in the Hittite archives. In actual fact, the first decipherer of Hittite was the Norwegian scholar J.A. Knudtzon, who pointed out in 1902 that the language of the so-called Arzawa letters (i.e., Hittite)--found in the Amarna archive--had an apparent affinity with Indo-European. Because the cuneiform script had already been deciphered, Knudtzon, and Bedrich Hrozný after him, were able to "read" their texts. Thus their discovery consisted more in the interpretation than in the actual decipherment of the written material. The first series of German excavations, lasting from 1905 to 1912, produced about 10,000 tablets.
It was work on this corpus that familiarized Hrozný with the contents of these tablets and led him to his epoch-making discovery that Hittite was indeed Indo-European (1915). (See also WRITING: Cuneiform.) Palaic. Palaic, which appears as Palaumnili "language of the Palaite" in Hittite cuneiform texts, was the language of the region of Pala (probably Blaëne in the Greek period), in northwest Anatolia. During the Old Hittite kingdom, Pala, Luwiya, and Hattusa formed the three major provinces of the Anatolian part of the Hittite territory. From the intermediary "Dark Age" onward, Kaska nomads made their influence felt in northern Anatolia, and this resulted in a decline of importance for this region. The Indo-European character of Palaic was first advocated by Emil Forrer (1922). Part of the text material is preserved on tablets in "old ductus." The knowledge of the limited vocabulary leaves much to be desired, but parallels--especially in the inflection of the noun, the forms of the demonstrative, relative, and enclitic pronouns, and the verbal endings--vouch for a close relationship to Hittite and Luwian. Luwian. Luwian (or Luvian), the language of Anatolia's southern coast, is known from texts stemming from three major periods: (1) the Hittite New Empire (c. 1400-c. 1190 BC); (2) the period of the Neo-Hittite states (c. 1190-c. 700 BC); (3) the period of the Lycian monumental inscriptions (c. 400-200 BC). In addition to the various time periods, there is also a variation in writing system--Mesopotamian cuneiform, Anatolian hieroglyphs, and an alphabet derived from a Greek source--and dialectal differentiation. There are indications that as early as the 15th and 14th centuries BC, there was a West Luwian dialect (the precursor of alphabetic Lycian) and an East Luwian dialect (the forerunner of the later Hieroglyphic Luwian of the Neo-Hittite states).
Both of these differed from the Luwian found in the archives of Bogazköy-Hattusa, which was possibly a central dialect. As in the case of Palaic, the pioneering work on Luwian written in cuneiform was done by Emil Forrer (1922). Following this work, new text materials were published in 1953, closely followed by both grammatical and vocabulary studies as well as a standard dictionary of Cuneiform Luwian (1959). The Anatolian hieroglyphic system has a long history, with its logographic beginnings dating back to early Hittite stamp seals of the 18th and 17th centuries BC; the youngest texts seem to date from the last quarter of the 8th century BC. The geographical range of the inscriptions is great, stretching from Sipylus and Karabel in the extreme west to Alaca Hüyük and Bogazköy-Hattusa in the north, Malatya, Samsat, and Tell Ahmar (Til Barsib) in the east, and Hama and ar-Rastan in the south. During the "Dark Age" of the 16th and 15th centuries BC, the early writing grew into a fully developed writing system with logograms (word-signs), syllabic values, and auxiliary signs. During the New Empire, the script was already in use for a multitude of purposes (rock inscriptions, seals, and wooden tablets for everyday use in the temple and the army). Whether an example of the empire period such as the Aleppo inscription already reflects the Luwian language is a moot question but seems likely. It is certain that the later inscriptions of the Neo-Hittite states were in Luwian. The first attempts to decipher Hieroglyphic Luwian, made by the British archaeologist Archibald H. Sayce, were fortunate in some fundamental details, but it was not until the 1930s that systematic and mutually stimulating research by scholars of several countries led to the establishment of a number of syllabic values for the characters as well as to a correct analysis of the sentence structure of the inscriptions. In his publication of the (bilingual) Hittite royal seals (in 1940, 1942), Hans G.
Güterbock bridged the gap between the inscriptions of the empire period and the late Neo-Hittite states; the seals found in the French excavations at Ugarit (in northern Syria) served a similar purpose. The most important recent finding was the discovery in 1947 by Helmuth T. Bossert, a German archaeologist, of the Karatepe bilingual inscriptions, written in Phoenician and Hieroglyphic Luwian. On many points the Luwian vocabulary is still an enigma. The unity between the various Luwian dialects and the close relationship of Luwian to the other members of the Anatolian subgroup, however, are secured by several linguistic parallels, especially in the singular inflection of the noun, the forms of certain pronouns, the verbal endings, and a number of lexical (vocabulary) correspondences. Hurrian. In earlier stages of research, the terms Mitanni language and Subarian were used as designations for Hurrian. In Hittite cuneiform texts, hurlili "language of the Hurrian" is used. In the last centuries of the 3rd millennium BC, Hurrians were already present in the Mardin region, which, from a geographical point of view, belongs to the North Mesopotamian plain. In Mesopotamian texts (from the time of the Akkad dynasty) some Hurrian personal names and glosses have been found. The customary assumption is that this non-Semitic and also non-Indo-European ethnic group had come from the Armenian mountains. During the beginning of the 2nd millennium BC, the Hurrians apparently spread over larger parts of southeast Anatolia and northern Mesopotamia. Still later, during the intermediary "Dark Age," they are supposed to have infiltrated into Cilicia and the adjacent Taurus and Antitaurus regions (Kizzuwatna in 2nd millennium texts). Before the middle of the 2nd millennium BC, an Indo-Aryan ruling caste wielded some type of authority over parts of Hurrian territory. Some names and words in ancient Near Eastern texts bear witness to their presence.
Among these words is a group of technical terms related to the training of horses that found its way into Hittite treatises on that subject; these terms are most important from a historical point of view. After Sumerian, Akkadian, Hattic, Palaic, and Luwian, Hurrian and these Indo-Aryan glosses constitute the sixth and seventh additional languages of the Hittite archives. Hurrian texts have been found in Urkish (Mardin region, c. 2300 BC), Mari (on the middle Euphrates, 18th century BC), Amarna (Egypt, c. 1400 BC), Boğazköy-Hattusa (Empire period), and Ugarit (on the coastline of northern Syria, 14th century). Amarna yielded the most important Hurrian document, a political letter sent to Pharaoh Amenhotep III. From Mari came a small number of religious texts; from Boğazköy-Hattusa, literary and religious texts; and from Ugarit, vocabularies belonging to the more "scholarly literature" described above and Hurrian religious texts in Ugaritic alphabetic script. Hurrian personal names, found in texts from many sites (Boğazköy-Hattusa, Alalakh, Ugarit, and especially Nuzu), constitute a second linguistic source of major importance. The research on Hurrian started in the 1890s with simultaneous contributions by several scholars. Subsequently, Bedřich Hrozný (1920) and Emil Forrer (1919, 1922) discovered the presence of Hurrian material in the Boğazköy-Hattusa archives. Urartian. The terms Chaldean and Vannic have also been used as designations for Urartian during earlier stages of research. Urartian is not a late dialect of Hurrian but a separate language, although both stem from a common parent. During the 9th through 6th centuries BC, Urartian was used in northeastern Anatolia as the official language of the state of Urartu, which centred around the district of Lake Van but also extended over the Transcaucasian regions of modern Russia and into northwestern Iran and at times even into parts of North Syria.
The Urartian texts are written in a variant of the Neo-Assyrian script and consist mostly of monumental inscriptions (annals, votive inscriptions related to building and irrigation activities), some small inscriptions on helmets and shields dedicated in the temple, and a few economic cuneiform tablets. Two bilingual inscriptions in Urartian and Assyrian that apparently correspond very closely provided the key to the understanding of the language; the stylistic resemblances to Assyrian texts of the same period guided the further interpretation. Archibald H. Sayce was the first scholar to devote his attention to Urartian in the 1880s and 1890s and continued his activities until 1932. More important were the philological contributions of the German historian Carl F. Lehmann-Haupt between 1892 and 1935. The first reliable description of Urartian grammar was published by the German Orientalist Johannes Friedrich (1933). Next to the Urartian texts in cuneiform writing, there also existed an indigenous hieroglyphic script that is still undeciphered and is too meagrely represented to warrant a serious attempt at decipherment. Dialects. The six modern Iranian languages discussed above are the only ones that have an established literary tradition. They are not, however, homogeneous, each having its own dialect divisions. No definitive dialect classification has yet been made, nor indeed has any attempt at systematic classification of the whole range of Iranian languages won wide acceptance. The usual practice, followed here, is simply to list the main languages in groups of varying size, arranged on a roughly geographic basis. There are two main dialects of Ossetic: the eastern, known as Iron, and the western, known as Digor (Digoron). Of these, Digor is the more archaic, Iron words being often a syllable shorter than their Digor counterparts--e.g., Digor madæ, Iron mad "mother." Iron is spoken by the majority of Ossetic speakers and is the basis of the literary language.
Chosen in the 19th century for the translation of the Bible, it is still the official language today. Little is known of the other Ossetic dialects. A small amount of the Ossetic dialect of Tual in the south, which differs little from Iron, was published in Georgian script at the beginning of the 19th century. Yaghnabi is still spoken by a small number of people southeast of Samarkand, Uzbekistan. It has two main dialects, eastern and western, which differ only slightly. The characteristic difference is between a western t sound and an eastern s sound from an older θ sound (as th in English "thin")--e.g., western met, eastern mes "day," beside Sogdian meθ (Christian Sogdian myθ). Dialects of the Shughni group are spoken in the Pamirs. Closely related to this group is Yazgulami. A period of a Yazgulami-Shughni common language (protolanguage) has been postulated by some scholars, after which it separated first into Yazgulami and Common Shughni; and then Common Shughni gradually divided into Sarikoli, Oroshori-Bartangi, Roshani-Khufi, and Bajuvi-Shughni. Sarikoli, the easternmost of these dialects, is spoken in northwestern China. Speakers of Wakhi number 10,000 or so in the region of the upper Pyandzh (Panj) River. Vakhan (Wakhan), the Persian name for the region in which Wakhi is spoken, is based on the local name Wux, a Wakhi development of *Waxsu, the old name of the Oxus River (modern Amu Darya). (An asterisk denotes a hypothetical, unattested, reconstructed form or word.) The Wakhi language is remarkably distinct from its neighbours and has many archaic features. Around the bend of the Amu Darya and in the valley of the Varduj River to the southeast, a few people speak dialects of the Sanglechi-Ishkashmi group. This group is clearly distinguished from its neighbours but is closely related to the other languages of the Pamirs. Some 6,000 people speak dialects of the Yidgha-Munji group.
Monjan is a very remote valley located in northern Afghanistan, and it is separated by a mountain pass from the Sanglechi-speaking region. Yidgha is spoken in the valley of the Lutkho River and in the nearby city of Chitral, a region now in Pakistan. Yidgha-Munji is most closely related to Pashto. The existence of two dialectal groups within Pashto has long been known. Thus, the word Pashto represents a southwestern dialect form (pasto), in contrast to a northeastern (paxto). According to one hypothesis, Pashto literature, which exists certainly from the 17th century and possibly from the 11th, was created among the northeastern tribes. Two minor dialects, Waziri and Wanetsi, have some features of special interest. Although spoken in a few villages in Afghanistan, two languages have features closely associating them with Western Iranian. These are Parachi, spoken in the Hindu Kush north of Kabul, and Ormuri, found in two dialects, one in the Lowgar River valley south of Kabul and the other in Kaniguram in Waziristan. Farther south is the wholly West Iranian language Balochi, mentioned above. Despite the vast area over which Balochi is spoken, its numerous dialects are all mutually intelligible. The most recent study of the Balochi dialects divides them into six groups: Eastern Hill dialects; Rakhshani dialects including that of Mary; Sarawani; Kechi; Lotuni; and the coastal dialects. Of these, Rakhshani is the most widely spoken and is used for broadcasting both in Pakistan and in Afghanistan, but the coastal dialects have the greatest prestige and the most extensive literature. In the southeastern corner of Iran, Balochi gradually gives way to the Bashkardi dialects. In central Iran the influence of Modern Persian is everywhere strongly felt, and it is often difficult to distinguish between dialects of Modern Persian, Persian with dialectal traits, and closely related languages. 
In the cities of Yazd and Kerman the Parsis speak the old Gabri dialect, whereas the Muslims speak Persian. Among other central dialects are Natanzi, Soi, Khunsari, Gazi (near Esfahan), Sivandi (northeast of Shiraz), Vafsi, and Ashtiyani, to name but a few. Semnani, spoken east of Tehran, forms a transitional stage between the central dialects and the Caspian dialects. The latter are divided into two groups, Gilaki and Mazandarani (Tabari). Also closely related is Talishi, spoken on the west coast of the Caspian Sea on both sides of the border with Azerbaijan. To this northwestern group belong the so-called southern Tati dialects spoken south and southwest of Qazvin, as well as the scarcely known dialects of Harzan and Galinqaya spoken northwest of Tabriz. The name Tati is usually applied to the dialects spoken in Russian Dagestan and northeastern Azerbaijan. They differ little from Modern Persian. Of the several dialects of Fars province, only Lari, southeast of Shiraz, is notably distinctive. Kumzari in Oman and the Lur dialects of the southwest also differ little from Persian. There are many dialects of Kurdish, the widely spoken West Iranian language that is thought to occupy a dialectal position intermediate between Balochi and Persian. Three main dialect groups can be distinguished--northern, central, and southern. A systematic study has been made of the dialects of Iraq, which include 'Aqrah (Akre), 'Amadiyah, Dahuk, Shaykhan, and Zakhu in the northern group, and Irbil (Arbil), Bingird, Pishdar (Pizhdar), Sulaymaniyah (Suleimaniye), and Warmawah in the central group. The Central Mukri dialect is spoken in the extreme west of Iran, south of Lake Urmia. Gorani is spoken in several dialects, mainly in the Zagros Mountains, and it is strongly influenced by the surrounding Kurdish dialects. The Gorani dialect of Hawraman, Hawrami, is notable for its many archaic features. Closely related to Gorani is Zaza (Dimli), which is spoken west of Iran. 
Historical survey of the Iranian languages. The Iranian protolanguage and its development. By the time Iranian begins to be attested in the 6th century BC, the language is already found differentiated into several distinct languages. Scholars have reconstructed the sound system and some of the grammatical features of Common Old Iranian, the protolanguage that preceded these dialects. The phonological system that underlay Common Old Iranian was by and large maintained everywhere throughout the Iranian-speaking world. It consisted of the following distinctive consonant sounds: [table of consonants not reproduced]. Unfamiliar symbols are taken from the International Phonetic Alphabet, or are conventional transcriptions (e.g., š for the sh sound in "ship," ž for the zh sound in "azure," č for ch in "church," and ǰ for j in "jam"). The voiced fricatives (i.e., the first three consonants represented in the fourth column--γ, β, and δ), which are produced with vibrating vocal cords and local friction, may be regarded as variants of the voiced stops (e.g., g, b, d); but they are characteristic of Iranian languages generally and especially of the eastern Iranian languages. In addition to these sounds Old Persian had another sibilant sound, often transcribed as ç or ss, which developed from the cluster θr (pronounced as the thr in "three"). In Middle Persian it fell together with the s sound. The most noticeable alteration of the old sound system is the introduction in some languages of additional series of consonants under the influence of neighbouring languages. Thus, Ossetic has a series of ejective sounds (uttered with a simultaneous glottal stop) on the pattern of the unrelated Caucasian languages; and a number of Iranian languages have a retroflex series (produced with the tongue tip curled up toward the roof of the mouth) as a result of contact with Indo-Aryan languages.
Some of the differences between Iranian languages arose as a result of different developments of the earlier sounds. Thus, the Indo-European palatal sounds k, g, and gh resulted in Indo-Iranian sh, z, and zh, which in turn became s, z, and z, respectively, in Avestan but θ, d, and d in Old Persian. Hence, Indo-European *kmtó- "hundred" became Indo-Iranian *shatá-, attested by Old Indo-Aryan shatá-, and then Avestan sata-, but Old Persian θata-. Nevertheless, θ and d as well as s and z belong to the basic pattern, the difference being merely distributional. The main source of differentiation is in the variation of consonant cluster development and that of groups of consonants and semivowels. Here again it is mainly a question of distributional differences. Thus, the Indo-European group *ku̯ became Indo-Iranian *shu̯, retained in Old Indo-Aryan in the spelling shv of the standard transcription. Indo-Iranian *shu̯ developed variously in Iranian: s in Old Persian, sp in Avestan and Median, sh (written shsh) in Khotanese, and š in Wakhi. These developments can be seen in the following forms of the Indo-European word *eku̯o- "horse": Old Indo-Aryan áshva-, Avestan and Median aspa-, Old Persian asa-, Khotanese ashsha-, and Wakhi yaš. Yet another development can be seen in Ossetic, in which the word for "mare," Avestan aspa-, appears as Digor æfsæ and Iron yæfs. The vowel system of Common Old Iranian consisted of short and long varieties of a, i, and u, and a neutral vowel ə (similar to the a in "sofa"). This analysis assumes that the Indo-Iranian vocalic r (ṛ) had already developed to ər in Proto-Iranian, just as its long counterpart became ar. An early and general monophthongization of the diphthongs ai and au to e and o, respectively, also must be considered characteristic, although it should not be ascribed to Common Old Iranian as is sometimes done.
This basic system was almost everywhere maintained, sometimes with the addition of one or two distinctive vowel sounds (phonemes). The Old Iranian stage. Old Persian was the language of the Achaemenid court. It is first attested in the inscriptions of Darius I (ruled 522-486 BC), of which the longest, earliest, and most important is that of Bisitun. At Bisitun are also inscribed versions of the same text in Elamite and Babylonian, and fragments of an Aramaic version on papyrus documents from Elephantine (modern Jazirat Aswan) also exist. Old Persian words and names also are to be found in large numbers as loanwords in contemporary Elamite sources and in 5th-century-BC Aramaic documents. As early as the time of Darius the Great's successor, Xerxes I (ruled 486-465 BC), the inscriptions show linguistic tendencies characteristic of the development from Old to Middle Persian. After Xerxes the production of original Old Persian inscriptions declined, probably as a result of the wider adoption of Aramaic and Elamite as the usual means of writing. With Artaxerxes III (ruled 359/358-338 BC), Old Persian inscriptions came to an end. The break is marked by Alexander's destruction of Persepolis in 330 BC. By far the largest part of attested Old Iranian is written in the language now usually called Avestan, after the Avesta, the name given to the collection of works forming the scripture of the Zoroastrians. The name itself is Middle Persian. In former times this language was called Zend, another Middle Persian word, which refers to the Middle Persian (Pahlavi) commentary on the Avesta. Because the homeland of the Avestan language was long thought to be in Bactria, it was often in the past called Bactrian. Bactrian is now used to designate a different Iranian language belonging to the Middle Iranian period. Since the beginning of the 20th century it has been generally accepted that the homeland of the Avesta was Khwarezm, which in ancient times included both Merv and Herat. 
Merv is now in Turkmenistan, Herat in northwestern Afghanistan. The oldest part of the Avesta is known as the Gathas, the poems composed by Zoroaster (Zarathustra), the founder of the Zoroastrian religion. His date is uncertain but is traditionally ascribed to the 7th to 6th century BC. The so-called Khurda Avesta ("Little Avesta") is a miscellany of texts of later date, the oldest parts of which may have been composed about 400 BC. The language of the Khurda Avesta is different in many details from that of the more archaic language of the Gathas, and it may even represent a different dialect. Many uncertainties surround the detailed interpretation of the Avesta as a result of the method of transmission. The Avesta was not recorded until after the language had ceased to be used, except by Zoroastrian priests. The present manuscripts date from the 13th century and later, although they reflect the recording of the priestly tradition in the special Avestan script during the 6th century AD. The Middle Iranian stage. Middle Persian, the major form of which is called Pahlavi, was the official language of the Sasanians (AD 224-651). The most important of the Middle Persian inscriptions is that of Shapur I (d. AD 272), which has parallel versions in Parthian and Greek. Middle Persian was also the language of the Manichaean and Zoroastrian books written during the 3rd to the 10th century AD. The extant literature of the Zoroastrian books is much more extensive than that of the Manichaean texts, but the latter have the advantage of having been recorded in a clear and unambiguous script. Moreover, the Middle Persian of the Zoroastrian books does not simply represent the spoken language of the writers of the 9th-century Zoroastrian texts. 
It is probable that they spoke early Modern Persian and that their speech often impinged upon their writing but that they strove to write the Middle Persian of several centuries earlier as it was attested in the inscriptions of the early Sasanian dynasty when Middle Persian was the koine. By contrast, in the case of Manichaean Middle Persian, some texts survive unchanged from the 3rd century AD, the time of the Persian teacher Mani himself (AD 216-274). Very little Parthian survives from the pre-Sasanian period. A large number of Parthian ostraca (inscribed pottery fragments) from the 1st century BC were discovered at Nisa near modern Ashkhabad, but they are inscribed in ideographic Aramaic (i.e., Aramaic writing that uses Aramaic words as symbols to represent Parthian words). Dating before the 3rd century are a document from Hawraman, some coin legends, and a dated grave stele. The most copious and important material in Parthian is the work of the Sasanian kings of the 3rd century, who added a Parthian version to their inscriptions--Hajjiabad, Naqsh-e Rustam (Ka'be yi Zardusht), and Paikuli. A few decades later Parthian disappeared as a result of the rise of the Sasanians and the predominance of their native tongue, Middle Persian. Manichaean Parthian of the 3rd century was preserved as a church language in Central Asia. The oldest surviving Sogdian documents are the so-called Ancient Letters found in a watchtower on the Chinese Great Wall, west of Tun-huang, and dated at the beginning of the 4th century AD. Most of the religious literature written in Sogdian dates from the 9th and 10th centuries. The Manichaean, Buddhist, and Christian Sogdian texts come mainly from small communities of Sogdians in the T'u-lu-p'an (Turfan) oasis and in Tun-huang. From Sogdiana itself there is only a small collection of documents from Mt.
Mugh in the Zarafshan region, mainly the business correspondence of a minor Sogdian king, Dewashtich, from the time of the Arab conquest about 700. The relationship of the various forms of Sogdian to one another has not yet been sufficiently investigated, so that it is not clear whether different dialects are represented by the extant material or whether the differences can be accounted for by reference to other relevant factors, such as differences of script, period, subject, style, or social milieu. The importance of social milieu can be seen by comparing the elegant Manichaean literature directed to the court with the more vulgar language of the Christian literature directed to the lower classes. Of the Saka dialect known as Tumshuq very little has survived, and despite its evidently close relationship to the much better known Khotanese dialect, full interpretation has proved difficult. Knowledge of Khotanese is more firmly based on a substantial corpus of material, including extensive bilingual texts. Although the chronological range of the extant Khotanese material is limited to only a few centuries, probably the 7th to the 10th, a rapid development of the language is apparent. At the phonological level, most noticeable is the loss of syllables between the older and later stages of the language. Thus, hvatana- "Khotanese" at the oldest stage is successively weakened to hvatäna-, hvamna-, hvana-, hvam. At the morphological level, most striking is the tendency to simplify the case endings and even to replace them by analytical expressions, constructions of two or more words. Thus, Late Khotanese has raksaysa hiya rade "kings of the raksasas," whereas Old Khotanese would have raksaysānu rrunde. The Old Khotanese -ānu ending is unmistakably genitive plural, but the Late Khotanese -a is merely a general oblique plural ending and has been reinforced by hiya "own," used to mean "of."
Khotan was a great centre of Buddhism during the 1st millennium AD, and all the surviving literature in Khotanese is either Buddhist or coloured by Buddhism. Even in business documents and official letters the Buddhist background is usually not difficult to discern. It can scarcely be coincidental that the Buddhist literature of Khotan, flourishing so vigorously during the 10th century, ended abruptly with the Muslim conquest at the beginning of the 11th. Little survives of Bactrian and Scytho-Sarmatian. Knowledge of Bactrian is based almost entirely on a single inscription of 25 lines from Ateshkadeh-ye Sorkh Kowtal in northern Afghanistan. Even less is known of Scytho-Sarmatian. Little is also known of Old Khwarezmian; that is, Khwarezmian written in the indigenous Khwarezmian script. Apart from a few coin legends and inscriptions on silver vessels, the material that survives consists of inscriptions of the 2nd century AD from Topraq-qal'ah (Toprakkala) and of the 7th from Toqqal'ah, archaeological sites in Uzbekistan. Much more is known of Late Khwarezmian, written in the Arabic script. This material is found mainly in two Arabic works, the 13th-century fiqh work of Mukhtar az-Zahidi, called the Qunyat al-munyah, and the Arabic dictionary Muqaddimat al-Adab of az-Zamakhshari (1075-1144), of which a manuscript glossed in Khwarezmian was found. Modern Iranian. Of the modern Iranian languages, by far the most widely spoken is Persian, which, as already indicated, developed from Middle Persian and Parthian (with elements from other Iranian languages such as Sogdian) as early as the 9th century AD. Since then, it has changed little except for acquiring an increasing proportion of loanwords, mainly from Arabic. Persian has been a literary language since the 9th century, and there is an increasing awareness of the continuity of its literary tradition with the earlier periods.
As the national language of Iran in succession to Middle Persian, it has for centuries strongly influenced the other Iranian languages, especially on Iranian territory. In fact, it seems likely that, with the increase of modern methods of communication, Persian will eventually supplant entirely most of the other languages and dialects. Against this trend stand only Kurdish and Balochi, the speakers of which tend to regard their languages as an expression of their particular identities. Nevertheless, even Kurdish and Balochi have been and continue to be strongly influenced by Persian. Outside Iran the situation is rather different. In Afghanistan the first national language is Pashto, even though Persian is the official second language. Pashto became the official language by royal decree in 1936, and literary activity has been encouraged by the Pashto Tolana (Pashto Society) of Kabul. During the Soviet period both Ossetic and Tajik received official encouragement; nevertheless, both languages were displaced by the Russian language as the language of administration. Other languages also compete with Ossetic and Tajik. Though it has a large body of folk epics, Ossetic became a literary language only in the second half of the 19th century. By contrast, the neighbouring Georgian has a still flourishing ancient literary tradition dating back to the 5th century AD and has many more speakers. Tajik, on the other hand, has a lifeline through its close connection with Persian, but it too has been retreating before Uzbek, an unrelated language of the Turkic group. Characteristics of the Iranian languages. All Iranian languages show in their basic elements the characteristic features of an Indo-European language. Apart from the extensive borrowing of Arabic words in Modern Persian, the Iranian languages have scarcely been affected by unrelated languages, with the notable exception of Ossetic, which has been strongly influenced by the neighbouring Caucasian languages. 
Some dialects of Tajik have been very receptive to Uzbek elements. In the case of languages in contact with Indian civilization, the most noticeable non-Iranian feature often taken over is the Indo-Aryan series of retroflex sounds. These are foreign to Indo-Aryan itself, being a result of the influence of the Dravidian languages. The elaborate phonological and morphological structure of the Indo-European parent language has been progressively simplified in the development of the Iranian languages. The basic phonological structure of Common Old Iranian has on the whole been maintained, but the morphological system has continued to be simplified. There has been a constant move in almost all Iranian languages toward an analytic structure; i.e., the use of prepositions and word order rather than case endings to indicate grammatical relationships. Phonology. The most characteristic features of the Iranian phonological system are those that distinguish it from the Indo-Aryan system. These are the development of various fricative sounds (indicated in phonetic symbols as x, f, θ, and later γ, β, δ), and of the voiced sibilant sounds z and ž. Even in Iranian, however, these sounds did not persist universally. In western Middle Iranian the θ sound was lost, and it is rare in the modern languages. In Pashto the inherited f sound has been discarded. Balochi, except in the extreme east, is entirely without fricatives. Voiced bilabial and dental fricative sounds (β and δ) were recorded in some early manuscripts of Modern Persian, but they became b and d by the 13th century. Two negative features have also resulted in differentiation between Indo-Aryan and Iranian. One is the result of the coalescence in Proto-Iranian of aspirated and unaspirated voiced stops. Thus, Indo-European *b and *bh were maintained in contrast in Indo-Aryan as b and bh, but they fell together in Iranian as b.
This resulted in an alteration of the phonological structure because the number of consonant contrasts (oppositions) was reduced. The other negative feature is the absence of the retroflex consonants from Iranian except as a later importation in contiguous regions. Other divergences in development, such as the change of an s sound to h in Iranian, brought about a difference in distribution rather than in structure because h developed also in Indo-Aryan but from Indo-Iranian *zh and *gh before front vowels (e.g., e and i). The features discussed here are illustrated in Table 6. In Old Iranian the stress lay on the next to the last syllable if it was heavy (i.e., contained a long vowel or was closed by a consonant)--otherwise on the preceding syllable. With the loss of final unstressed vowels in the development of many Iranian languages, the stress often came to be on the final syllable. End stress is characteristic of Modern Persian. Grammar. In Old Persian the Indo-European inflectional system appears considerably simplified. In particular, the genitive and the dative coalesced into one case and the instrumental and ablative into another. Moreover, in the plural the nominative and accusative cases are not distinguished. This reduced system is still found in the Middle Iranian period in Old Khotanese and to a certain extent in Sogdian. Eastern Iranian is in this respect more conservative than western. By the Middle Iranian period, western Iranian had abandoned nominal (noun, adjective, pronoun) inflection altogether, as is the case with Middle and Modern Persian and with Parthian. In some languages, both western and eastern, two or, rarely, three cases survive. Ossetic is quite exceptional in maintaining an elaborate case system; it is partly a result of secondary, purely Ossetic developments. The elaborate conjugational system of the Indo-European verb followed a similar path to disintegration. 
In particular, the whole past tense system was given up by the Middle Iranian period. Only a few relics remain of the Indo-European system, such as the partial survival of the augment (a prefixed vowel or lengthening of the initial vowel) in the Sogdian imperfect tense. But a new past tense system developed, based on the old past participle, often combined with auxiliary verbs. Many languages distinguish between transitive and intransitive verbs in the past tense system; and in some, such as Khotanese and Pashto, even gender and number are distinguished. The present tense system was far better preserved. The dual number was in retreat in Old Iranian and is not attested later. The middle voice, a form that indicates that a person or thing both performs and is affected by the action represented, was generally abandoned by the Middle Iranian period, although middle voice inflection is well represented in Khotanese. With these qualifications, the endings of the present indicative (active) have been generally well preserved. A variety of imperative, subjunctive, and optative forms, partly based on inherited forms and partly the result of innovation, is found especially in the eastern languages, including Ossetic. Rigidity of word order is, on the whole, most characteristic of those languages, such as Persian, that have gone furthest in the reduction of the inherited morphological system. Vocabulary. The Islamic conquest of Iran during the 7th century entailed not only a change of religion but also a change of language. The sacred language of Islam was Arabic, and the proportion of Arabic words used in Persian rapidly increased until it reached something like the 40 to 50 percent of the present day. Before the introduction of the Arabic element, loanwords came mainly from other Iranian languages. Most familiar is the extensive borrowing from Median found in Old Persian.
In later periods, Modern Persian borrowed words extensively from Turkish and from European languages. Persian is itself the donor language in the case of the other Iranian languages, all of which have drawn upon its vocabulary. Buddhism was similarly responsible for the large proportion of Indo-Aryan words, both Sanskrit and Prakrit, found in Sogdian and especially in Khotanese. A considerable Indian element occurs in the vocabulary of those modern Iranian languages that have been or are in contact with modern Indo-Aryan languages in the northwest, such as Lahnda and Sindhi. There the Dardic languages have also been influential. Baluchi has also borrowed from Brahui, a Dravidian language spoken in Baluchistan in Pakistan. Ossetic occupies an exceptional position. Most of its Persian and Arabic borrowings have come to it through Turkish, but more striking is the large number of words borrowed from the Caucasian languages, especially Georgian. In modern times, Ossetic continues to be influenced by Russian. Writing systems. Iranian languages have been written in many different scripts during their long history, although various forms of Aramaic script have been predominant. Modern Persian is written in Arabic script, which is of Aramaic origin. For writing the Persian sounds p, č, ž, and g, four letters have been added by means of diacritical marks. By the addition of further letters, this Perso-Arabic script has been adapted to write not only the other main modern Iranian languages, Pashto, Kurdish, and Baluchi, but also those minor ones that are occasionally recorded. An advantage of this consonantal script is that, because it does not define vowel qualities, it can accommodate local dialect variations to a considerable extent. Two modern Iranian languages spoken on Soviet territory are currently written in a modified version of the Russian alphabet: Tadzhik and Ossetic. 
Soviet scholars have, however, tended to use modified Latin alphabets to record the minor languages that have no literary tradition, such as some of the Pamir languages. Ossetic has also been written in the Georgian script. Old Persian was written with a cuneiform syllabary, the origin of which is still hotly disputed. Middle Persian, Parthian, Sogdian, and Old Khwarezmian were recorded in various forms of Aramaic script. Two forms of this script as they developed for writing Sogdian were adopted by the Uighurs. In its cursive form this script spread even further, to the Mongols and Manchus. Three other scripts are important for the remaining Middle Iranian languages: Greek script for Bactrian, Arabic script for Late Khwarezmian, and varieties of Central Asian Brahmi script of Indian origin for Khotanese and Tumshuq. The Aramaic script was not systematically adapted to the writing of Middle Iranian; and despite the introduction of a variety of diacritical marks to differentiate letters, considerable ambiguity remained. Moreover, several letters tended to coalesce in form. In this respect, the Pahlavi script, used for writing the Middle Persian of the Zoroastrian books, developed furthest. In it, the original 22 letters of the Aramaic alphabet have been reduced to 14, which are further confused by the use of numerous ligatures (linked letters). It was the realization that this script was inadequate to record precisely the traditional pronunciation of the sacred text of the Avesta that led the Zoroastrian priests to devise the elaborate Avestan script, which, with its 48 distinct letters formed by differentiation out of the 14 used for Pahlavi, was well suited to the task. (See also WRITING.) (R.E.E.)


Greek language

Greek is an Indo-European language whose history can be followed from the 14th century BC to the present day. Its documents cover a longer period of time (34 centuries) than those of any other Indo-European language. 
There is an Ancient phase, subdivided into a Mycenaean period (texts in syllabic script from the 14th to the 12th centuries BC) and Archaic and Classical periods (beginning with the adoption of the alphabet, from the 8th to the 4th centuries BC); a Hellenistic and Roman phase (4th century BC to 4th century AD); a Byzantine phase (5th-15th centuries AD); and a Modern phase. Separate transliteration tables for Classical and Modern Greek accompany this article. Some differences in transliteration result from changes in pronunciation of the Greek language; others reflect convention, as for example the χ (chi or khi), which was transliterated by the Romans as ch (because they lacked the letter k in their usual alphabet). In Modern Greek, however, the standard transliteration for χ is kh. Another difference is the representation of β (beta or víta); in Classical Greek it is transliterated as b in every instance, and in Modern Greek as v. The pronunciation of Ancient Greek vowels is indicated by the transliteration used by the Romans. The letter υ (upsilon) was written as y by the Romans, indicating that the sound was not identical to the sound of their letter i. Modern Greek υ (ípsilon) is transliterated as i, indicating that the sound used today differs from the ancient υ. (See Tables 8 and 9 for transliterations of all the Greek letters.)

GENERAL CONSIDERATIONS

In the course of the 2nd millennium BC, groups of Greek-speaking Indo-Europeans established themselves by stages on the Greek peninsula, on most of the islands of the Aegean, and on the west coast of Anatolia; with few exceptions that is still the area occupied by the Greek language today. In the second quarter of the 1st millennium BC a vast "colonial" movement took place, resulting in establishments founded by various Greek cities all around the Mediterranean and the Black Sea, especially in southern Italy and Sicily. 
This extension of the linguistic area of Greek lasted only a few centuries; in the Roman period, Latin, more or less rapidly, took the place of Greek in most of these ancient colonies. "Colonial" Greek survived longest at Byzantium, as the official language of the Eastern empire. Relationship of Greek to Indo-European. Ancient Greek is, next to Hittite, the Indo-European language with documents going furthest back into the past. At the time when it comes within view in the 2nd millennium BC, it has already acquired a completely distinct character from the parent Indo-European language. Its linguistic features place it in a central region on the dialect map that can be reconstructed for Common Indo-European; the ancient languages with which it has the most features in common are little-known ones such as Phrygian or Macedonian. In the study of Indo-European dialectology, phonetic data are the most readily available and provide the most information. In this respect the position of Ancient Greek is as follows. The original Indo-European vowels of a and o quality, both short and long, remain distinct, whereas they are completely or partially confused in Hittite, Indo-Iranian, Baltic, Slavic, and Germanic. Greek is the only language that distinguishes by three different qualities (e, a, o) the secondary short vowels resulting in certain positions from the three laryngeal sounds, *H₁, *H₂, *H₃, of Indo-European. (An asterisk preceding a sound or word indicates that it is not attested, but is a reconstructed, hypothetical form. For a discussion of these laryngeal sounds, see Indo-European languages.) Greek keeps the distinction between the original voiced stops and voiced aspirated stops of Indo-European (e.g., Indo-European *d becomes Greek d, and Indo-European *dh becomes Greek th), whereas Iranian, Slavic, Baltic, and Celtic confuse them. 
Greek avoids the general shifts of stop consonants that are displayed, independently, by Armenian and Germanic, as well as the palatalization that affects guttural stops in Indo-Iranian, Armenian, Baltic, and Slavic. In these respects, Ancient Greek is conservative, as are, generally speaking, the western Indo-European languages (Italic and Celtic). On the other hand, it does show innovations. One of these, the devoicing of the original voiced aspirated stops, is shared with Italic, although it is realized in different ways (*dh- yields Greek th-, Latin f-, Osco-Umbrian f-); but others are foreign to Italic: for example, the weakening of spirants and semivowels at the beginning of words before a vowel, the evolution of *s- to h- (pre-Mycenaean), and *y- to h- (contemporary with Mycenaean). Morphological criteria must, of course, be taken into account in defining the position of a language. It should be noted that there are few grammatical innovations shared by Greek and Italic, apart from the extension to nouns of the pronominal ending of the ge



