Title: Preliminary Proposal to Encode the Tocharian Script Author: Lee Wilson (
[email protected]) Date: 2015-01-26 1 Introduction Tocharian is a Brahmi-based script historically used to write the Tocharian languages (ISO 639-3: xto, txb), traditionally referred to as Tocharian A and Tocharian B, which belong to the Tocharian branch of the Indo-European language family. The script was in use primarily during the 8th century CE by the people inhabiting the northern edge of the Tarim Basin of the Taklamakan Desert in what is now Xinjiang in western China, an area formerly known as Turkestan. The Tocharian script is attested in over 4,000 extant manuscripts that were discovered in the early 20th century in the Tarim Basin. The discovery was significant for two reasons: first, it revealed a previously unknown branch of the Indo-European language family, and second, it marked the easternmost geographical extent of that language family. The most distinctive aspect of Tocharian script is the vowel transcribed as ä, commonly known as the Fremdvokal, and the modifications to the script that it entails. This vowel is indicated with a vowel sign not employed in scripts outside of the Tarim Basin region. However, this vowel sign is not commonly used when writing Tocharian. Instead, sequences of a consonant followed by the Fremdvokal are most typically indicated by 11 CV signs known as Fremdzeichen, which are unique to Tocharian. 2 Background The Tocharian script is a north-eastern descendant of Brahmi script, related to Khotanese, Tibetan, and Siddham scripts, which was used along the Tarim Basin in the Taklamakan Desert in what is now Xinjiang in western China.