Low-Level Devanāgarī Support for Omega - Adapting Devnag Yannis Haralambous, John Plaice

Total Page:16

File Type:pdf, Size:1020Kb

Low-Level Devanāgarī Support for Omega - Adapting Devnag Yannis Haralambous, John Plaice Low-level Devanāgarī Support for Omega - Adapting devnag Yannis Haralambous, John Plaice To cite this version: Yannis Haralambous, John Plaice. Low-level Devanāgarī Support for Omega - Adapting devnag. Tugboat, TeX Users Group, 2002, Proceedings of the 2002 Annual Meeting, 23 (1), pp.50-56. hal- 02112001 HAL Id: hal-02112001 https://hal.archives-ouvertes.fr/hal-02112001 Submitted on 26 Apr 2019 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Low-level Devan¯agar¯ıSupport for Omega — Adapting devnag Yannis Haralambous D´epartement Informatique Ecole´ Nationale Sup´erieuredes T´el´ecommunications de Bretagne BP. 832, 29285 Brest, France [email protected] http://omega.enstb.org/yannis John Plaice School of Computer Science and Engineering The University of New South Wales UNSW Sydney NSW 2052, Australia [email protected] http://www.cse.unsw.edu.au/school/people/info/plaice.html Abstract This paper presents tools (OTPs and macros) for typesetting languages using the Devan¯agar¯ıscript (Hindi, Sanskrit, Marathi). These tools are based on the Omega typesetting system and are using fonts from devnag, a package developed by Frans Velthuis in 1991. We are describing these new OTPs in detail, to provide the reader with insight into Omega techniques and allow him/her to further adapt these tools to his/her own environment (input method, font), and even to other Indic languages. sArA\f yh l˜Kd˜vnAgrFElEp ko þyog krt˜h` e BAqAao\ kF VAIps{EV\g yA l˜KAyojn k˜Ele þyog Eke jAn˜vAl˜V´ l ko þ-t` t krtA h{.y˜V´ l aom˜gAVAIps{EV\g pr aADAErt h{aOr d˜vnAgk˜l˜KAzpoko þyog krtA h{joEk ČA\s v˜lT` is ŠArA 1991 m˜bnAyA gyA p{k˜jh{.hm in ao VF pF ko Ev-tˆ t kr rh˜h{\ tA\Ek p”hn˜ vAlo\ ko aom˜gAtknFk kF jAnkArF ho jAe aOr vo apn˜aAp ko is V´ l s˜apn˜vAtAvrZ aOr d´ srF BArtFy BAqAao k˜an` zp bdl sk˜yA xAl sk˜. Introduction 1. A Sanskrit font contains over 300 glyphs, when ligatures are taken into account. One of the first Indic language support packages 2. The TFM and VF languages are not powerful for TEX was devnag, developed by Frans Velthuis in 1991.1 At that time it was necessary to use a enough to make all the necessary glyphs out of preprocessor for converting Hindi or Sanskrit text a font of 256 characters. written in a way legible to humans into data legible Using a preprocessor has many disadvantages, by TEX. This preprocessor allowed the use of an due mainly to the fact that it has to read not plain ascii transcription, and performed the contextual text, but rather LATEX code. It also has to avoid analysis inherent to Devan¯agar¯ı script, as well as treating commands and environment names as De- pre-hyphenation (by explicitly inserting hyphen- van¯agar¯ıtext. So the preprocessor should be clever ation points). The preprocessor was necessary for enough to distinguish text from commands, i.e., con- two main reasons: tent from markup. It is well known that, in the case of TEX, this is practically impossible, unless the preprocessor is 1 A second system for processing Devan¯agar¯ı was cre- TEX itself (there is a notorious saying: “only TEX ated by Charles Wikner. It has important features lacking can read TEX”). in Velthuis’s devnag system, but unlike the latter it did not So much for computing in the 20th century. address the setting of Hindi text. The general design of the system – Metafont plus pre-processor – was identical to that Nowadays we have other means of processing infor- of Velthuis. mation, and the concept of (external) preprocessor 50 TUGboat, Volume 23 (2002), No. 1 — Proceedings of the 2002 Annual Meeting Low-level Devan¯agar¯ıSupport for Omega — Adapting devnag is obsolete. In fact, the same operations are done cessed (read, printed, analyzed) on every machine inside Omega, a successor of TEX. Processing text or software that is Unicode compliant. internally has the crucial advantage of allowing the Omega fullfils Unicode compliance, and the sys- processor to distinguish precisely what is content tem we are describing in this paper is designed in and what markup (at least as precisely as TEX itself such a way that Unicode-encoded texts can be pro- does it). cessed equally well as texts encoded in Velthuis’ This makes it much easier to treat properties transcription. inherent to writing systems: one only needs to con- centrate on the linguistic and typographical prop- Installation and Usage erties of the script, and one doesn’t need to think The Omega low-level support2 of Devan¯agar¯ıcon- of what to “do with LATEX commands” in the data sists of eight OTPs (Omega Translation Processes) stream. and a small file with macros: Furthermore, there is an efficiency issue: using velthuis2unicode.otp Omega there is only one source file, namely the TEX hindi-uni2cuni.otp file (and not a pre-TEX file and a TEX file); one hindi-uni2cuni2.otp doesn’t need to care about preprocessor directives; hindi-cuni2font.otp the system will not fail because of a new LATEX envi- hindi-cuni2font2.otp ronment which is not known to devnag; mathemat- hindi-cuni2font3.otp ics and other similar constructions do not interfere sanskrit-uni2cuni.otp with Devan¯agar¯ıpreprocessing. sanskrit-cuni2font.otp Contextual analysis of Devan¯agar¯ıscript has, at odev.sty last, become a fundamental property of the system, independent of macros and packages. OTP files have to be converted to binary form (*.ocp) and placed in a directory where Omega ex- pects to find them. Unicode and Devan¯agar¯ı To typeset text in Devan¯agar¯ı, use the com- The 20-bit information interchange encoding Uni- mands \hindi or \sanskrit (depending on the lan- code (www.unicode.org) has tables for all Indic guage of your choice) inside a group, and keyboard writing systems, based on a common scheme (so the text in Velthuis’ transcription (see Table 1, 3 that phonetically equivalent letters are placed on the taken from Velthuis’ devnag documentation ). For same relative positions in each table). The first of example, {\hindi kulluu, acaanak, \sanskrit these tables (positions 0900-097F, see Table 1) cov- kulluu, acaanak} will produce k` Sl´ , acAnk , ers Devan¯agar¯ı. k` ¥´ , acAnk^ . For historical reasons (compatibility with lega- Description of the OTPs cy encodings) the Unicode approach to Devan¯agar¯ı is quite awkward: it is partly logical and partly This description is a bit technical and demands both graphical. For example, there are separate posi- some knowledge of Omega and of Devan¯agar¯ıscript. tions for independent and dependent versions of vo- The reader can find more information on the former, wels: when encoding text one has to choose if a on the Omega Web site4 and on the latter in books given vowel is dependent or independent, although about Devan¯agar¯ıscript. In particular, there is a this clearly derives from contextual analysis, as in very nice introduction to the contextual features of Velthuis’ transcription where both versions of vo- the script in the Unicode book5 (Section 9.1). wels have the same excellent input transcription. 2 On the other hand, this method is not applied We call it “low-level,” because there is no standard LATEX3-compliant high-level language support interface yet. to consonant ra; indeed, placing a ra before a clus- We don’t know yet how languages and their properties will ter of consonants is graphically represented by a be managed in LATEX3 and therefore do not attempt to in- mark on the last of the consonants (compare Ä- troduce yet another syntax for switching to Hindi or Sanskrit kta and Ä‚ rkta) — this mark is not provided in the or Marathi. Instead, we — temporarily — use a devnag-like syntax: simple commands \hindi and \sanskrit which have Unicode table, and hence application of this feature to be placed inside groups, as in the good old days of plain is left to the rendering engine. TEX. Nevertheless, despite its weaknesses, Unicode is 3 To be found on CTAN, language/devanagari/distrib/ very important because it ensures compatibility be- manual.tex. 4 http://omega.enstb.org tween devices all around the world: a text written 5 The Unicode Standard, Version 3.0, Addison Wesley, in Devan¯agar¯ıand encoded in Unicode can be pro- Reading Massachusetts, 2000. TUGboat, Volume 23 (2002), No. 1 — Proceedings of the 2002 Annual Meeting 51 Yannis Haralambous and John Plaice velthuis2unicode.otp In this OTP we convert the consonantic cluster. This is done by lines of the Velthuis’ input transcription into Unicode. It is a type: quite short OTP (about 80 lines), with lines of the {CONSONANT} {VIRAMA} {INITI} => type @"093F \1 \2 @"097D ; `z' => @"095B @"094D ; where we have a consonant and virama followed by `a'`a' => @"0906 ; a ‘short i’ vowel. In this case we place Unicode char- On the second line, the pair of letters aa of acter @"093F followed by the consonant. On similar Velthuis’ transcription is sent to Unicode character lines, we have n-uplets (n · 7) of consonants and vi- @"0906 (independent vowel “aa”).
Recommended publications
  • The Origin of the Peculiarities of the Vietnamese Alphabet André-Georges Haudricourt
    The origin of the peculiarities of the Vietnamese alphabet André-Georges Haudricourt To cite this version: André-Georges Haudricourt. The origin of the peculiarities of the Vietnamese alphabet. Mon-Khmer Studies, 2010, 39, pp.89-104. halshs-00918824v2 HAL Id: halshs-00918824 https://halshs.archives-ouvertes.fr/halshs-00918824v2 Submitted on 17 Dec 2013 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Published in Mon-Khmer Studies 39. 89–104 (2010). The origin of the peculiarities of the Vietnamese alphabet by André-Georges Haudricourt Translated by Alexis Michaud, LACITO-CNRS, France Originally published as: L’origine des particularités de l’alphabet vietnamien, Dân Việt Nam 3:61-68, 1949. Translator’s foreword André-Georges Haudricourt’s contribution to Southeast Asian studies is internationally acknowledged, witness the Haudricourt Festschrift (Suriya, Thomas and Suwilai 1985). However, many of Haudricourt’s works are not yet available to the English-reading public. A volume of the most important papers by André-Georges Haudricourt, translated by an international team of specialists, is currently in preparation. Its aim is to share with the English- speaking academic community Haudricourt’s seminal publications, many of which address issues in Southeast Asian languages, linguistics and social anthropology.
    [Show full text]
  • From Arabic Style Toward Javanese Style: Comparison Between Accents of Javanese Recitation and Arabic Recitation
    From Arabic Style toward Javanese Style: Comparison between Accents of Javanese Recitation and Arabic Recitation Nur Faizin1 Abstract Moslem scholars have acceptedmaqamat in reciting the Quran otherwise they have not accepted macapat as Javanese style in reciting the Quran such as recitationin the State Palace in commemoration of Isra` Miraj 2015. The paper uses a phonological approach to accents in Arabic and Javanese style in recitingthe first verse of Surah Al-Isra`. Themethod used here is analysis of suprasegmental sound (accent) by usingSpeech Analyzer programand the comparison of these accents is analyzed by descriptive method. By doing so, the author found that:first, there is not any ideological reason to reject Javanese style because both of Arabic and Javanese style have some aspects suitable and unsuitable with Ilm Tajweed; second, the suitability of Arabic style was muchthan Javanese style; third, it is not right to reject recitingthe Quran with Javanese style only based on assumption that it evokedmistakes and errors; fourth, the acceptance of Arabic style as the art in reciting the Quran should risedacceptanceof the Javanese stylealso. So, rejection of reciting the Quranwith Javanese style wasnot due to any reason and it couldnot be proofed by any logical argument. Keywords: Recitation, Arabic Style, Javanese Style, Quran. Introduction There was a controversial event in commemoration of Isra‘ Mi‘raj at the State Palacein Jakarta May 15, 2015 ago. The recitation of the Quran in the commemoration was recitedwithJavanese style (langgam).That was not common performance in relation to such as official event. Muhammad 58 Nur Faizin, From Arabic Style toward Javanese Style Yasser Arafat, a lecture of Sunan Kalijaga State Islamic University Yogyakarta has been reciting first verse of Al-Isra` by Javanese style in the front of state officials and delegationsof many countries.
    [Show full text]
  • 75 Characters Maximum
    Kannada Script LGR Proposal Introduction, Current Analysis and Next Steps Dr. U.B. Pavanaja NBGP F2F Meeting, Colombo 14 December 2017 | 1 Agenda 1 2 3 Introduction to Repertoire Analysis Within Script Kannada Script Variants 4 5 6 Cross-Script WLE Rules Current Status and Variants Next Steps for Completion | 2 Introduction to Kannada Script Population – there are about 60 million speakers of Kannada language which uses Kannada script. Geographical area - Kannada is spoken predominantly by the people of Karnataka State of India. It is also spoken by significant linguistic minorities in the states of Andhra Pradesh, Telangana, Tamil Nadu, Maharashtra, Kerala, Goa and abroad Languages written in Kannada script – Kannada, Tulu, Kodava (Coorgi), Konkani, Havyaka, Sanketi, Beary (byaari), Arebaase, Koraga | 3 Classification of Characters Swaras (vowels) Letter ಅ ಆ ಇ ಈ ಉ ಊ ಋ ಎ ಏ ಐ ಒ ಓ ಔ Vowel sign/ N/Aಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ matra Yogavahas In Kannada, all consonants Anusvara ಅಂ (vyanjanas) when written as ಕ (ka), ಖ (kha), ಗ (ga), etc. actually have a built-in vowel sign (matra) Visarga ಅಃ of vowel ಅ (a) in them. | 4 Classification of Characters Vargeeya vyanjana (structured consonants) voiceless voiceless aspirate voiced voiced aspirate nasal Velars ಕ ಖ ಗ ಘ ಙ Palatals ಚ ಛ ಜ ಝ ಞ Retroflex ಟ ಠ ಡ ಢ ಣ Dentals ತ ಥ ದ ಧ ನ Labials ಪ ಫ ಬ ಭ ಮ Avargeeya vyanjana (unstructured consonants) ಯ ರ ಱ (obsolete) ಲ ವ ಶ ಷ ಸ ಹ ಳ ೞ (obsolete) | 5 Repertoire Included-1 Sr. Unicode Glyph Character Name Unicode Indic Ref Widespread No. Code General Syllabic use ? Point Category Category [Yes/No] 1 0C82 ಂ KANNADA SIGN ANUSVARA Mc Anusvara Yes 2 0C83 ಂ KANNADA SIGN VISARGA Mc Visarga Yes 3 0C85 ಅ KANNADA LETTER A Lo Vowel Yes 4 0C86 ಆ KANNADA LETTER AA Lo Vowel Yes 5 0C87 ಇ KANNADA LETTER I Lo Vowel Yes 6 0C88 ಈ KANNADA LETTER II Lo Vowel Yes 7 0C89 ಉ KANNADA LETTER U Lo Vowel Yes 8 0C8A ಊ KANNADA LETTER UU Lo Vowel Yes KANNADA LETTER VOCALIC 9 0C8B ಋ R Lo Vowel Yes 10 0C8E ಎ KANNADA LETTER E Lo Vowel Yes | 6 Repertoire Included-2 Sr.
    [Show full text]
  • UNICODE for Kannada General Information & Description
    UNICODE for Kannada (U+0C80 to U+0CFF) General Information & Description Written by: C V Srinatha Sastry Issued by: Director Directorate of Information Technology Government of Karnataka Multi Storied Buildings, Vidhana Veedhi BANGALORE 560 001 INDIA PDF created with FinePrint pdfFactory Pro trial version http://www.pdffactory.com UNICODE for Kannada Introduction The Kannada script is a South Indian script. It is used to write Kannada language of Karnataka State in India. This is also used in many parts of Tamil Nadu, Kerala, Andhra Pradesh and Maharashtra States of India. In addition, the Kannada script is also used to write Tulu, Konkani and Kodava languages. Kannada along with other Indian language scripts shares a large number of structural features. The Kannada block of Unicode Standard (0C80 to 0CFF) is based on ISCII-1988 (Indian Standard Code for Information Interchange). The Unicode Standard (version 3) encodes Kannada characters in the same relative positions as those coded in the ISCII-1988 standard. The Writing system that employs Kannada script constitutes a cross between syllabic writing systems and phonemic writing systems (alphabets). The effective unit of writing Kannada is the orthographic syllable consisting of a consonant (Vyanjana) and vowel (Vowel) (CV) core and optionally, one or more preceding consonants, with a canonical structure of ((C)C)CV. The orthographic syllable need not correspond exactly with a phonological syllable, especially when a consonant cluster is involved, but the writing system is built on phonological principles and tends to correspond quite closely to pronunciation. The orthographic syllable is built up of alphabetic pieces, the actual letters of Kannada script.
    [Show full text]
  • Introduction
    INTRODUCTION ’ P”(Bha.t.tikāvya) is one of the boldest “B experiments in classical literature. In the formal genre of “great poem” (mahākāvya) it incorprates two of the most powerful Sanskrit traditions, the “Ramáyana” and Pánini’s grammar, and several other minor themes. In this one rich mix of science and art, Bhatti created both a po- etic retelling of the adventures of Rama and a compendium of examples of grammar, metrics, the Prakrit language and rhetoric. As literature, his composition stands comparison with the best of Sanskrit poetry, in particular cantos , and . “Bhatti’s Poem” provides a comprehensive exem- plification of Sanskrit grammar in use and a good introduc- tion to the science (śāstra) of poetics or rhetoric (alamk. āra, lit. ornament). It also gives a taster of the Prakrit language (a major component in every Sanskrit drama) in an easily ac- cessible form. Finally it tells the compelling story of Prince Rama in simple elegant Sanskrit: this is the “Ramáyana” faithfully retold. e learned Indian curriculum in late classical times had at its heart a system of grammatical study and linguistic analysis. e core text for this study was the notoriously difficult “Eight Books” (A.s.tādhyāyī) of Pánini, the sine qua non of learning composed in the fourth century , and arguably the most remarkable and indeed foundational text in the history of linguistics. Not only is the “Eight Books” a description of a language unmatched in totality for any language until the nineteenth century, but it is also pre- sented in the most compact form possible through the use xix of an elaborate and sophisticated metalanguage, again un- known anywhere else in linguistics before modern times.
    [Show full text]
  • Ka И @И Ka M Л @Л Ga Н @Н Ga M М @М Nga О @О Ca П
    ISO/IEC JTC1/SC2/WG2 N3319R L2/07-295R 2007-09-11 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal for encoding the Javanese script in the UCS Source: Michael Everson, SEI (Universal Scripts Project) Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Replaces: N3292 Date: 2007-09-11 1. Introduction. The Javanese script, or aksara Jawa, is used for writing the Javanese language, the native language of one of the peoples of Java, known locally as basa Jawa. It is a descendent of the ancient Brahmi script of India, and so has many similarities with modern scripts of South Asia and Southeast Asia which are also members of that family. The Javanese script is also used for writing Sanskrit, Jawa Kuna (a kind of Sanskritized Javanese), and Kawi, as well as the Sundanese language, also spoken on the island of Java, and the Sasak language, spoken on the island of Lombok. Javanese script was in current use in Java until about 1945; in 1928 Bahasa Indonesia was made the national language of Indonesia and its influence eclipsed that of other languages and their scripts. Traditional Javanese texts are written on palm leaves; books of these bound together are called lontar, a word which derives from ron ‘leaf’ and tal ‘palm’. 2.1. Consonant letters. Consonants have an inherent -a vowel sound. Consonants combine with following consonants in the usual Brahmic fashion: the inherent vowel is “killed” by the PANGKON, and the follow- ing consonant is subjoined or postfixed, often with a change in shape: §£ ndha = § NA + @¿ PANGKON + £ DA-MAHAPRANA; üù n.
    [Show full text]
  • Dutchess County Public Transit Route L
    Route L: Main Street Shuttle Please see map on page 44 / Favor ver el mapa en la página 44 LUNES–SABADO 9 NORTH / NORTE MONDAY–SATURDAY / Ave Violet Ave EASTBOUND: Poughkeepsie to Stop & Shop or Adams Fairacre Farms / HACIA EL ESTE: Poughkeepsie a Stop & Shop or Adams Fairacre Farms Poughkeepsie Train Station DCPT Routes: A,B,C,D,E,H,J,K,L,P,PRL Map not to scale Amtrak, MTA Metro-North Mapa no a escala Dutchess County Transit Hub Interfaith DCPT Routes: A,B,C,D,E,H,J,K,L,M,P,PRL Adams Fairacre Farms & Route 44 (Arrives) POUGHKEEPSIE Washington St & Mansion St (Interfaith Towers) POUGHKEEPSIE Poughkeepsie StationTrain POUGHKEEPSIE Main St & Worrall Ave POUGHKEEPSIE Colledgeview Ave & Fairmont Ave (Vassar College) POUGHKEEPSIE Route 44 & Burnett Blvd POUGHKEEPSIE Stop Shop & (Arrives) POUGHKEEPSIE Towers County Dutchess Hub Transit (Departs) POUGHKEEPSIE Stop # 1 2 3 4 5 6 7 8 Mill St 1 286 3 306 308 131 313 311 2 SEE DOWNTOWN POUGHKEEPSIE INSET ON PAGE 12 Market 6:45 6:48 6:52 7:03 7:08 7:11 — 7:15 7:15 7:18 7:22 7:33 7:38 7:41 7:44 — 3 Main St 7:45 7:48 7:52 8:03 8:08 8:11 — 8:15 1 Adams 8:15 8:18 8:22 8:33 8:38 8:41 8:44 — Fairacre 8:45 8:48 8:52 9:03 9:08 9:11 — 9:15 Ea Stop & st- Farms AM 9:15 9:18 9:22 9:33 9:38 9:41 9:44 — We s Shop 9:45 9:48 9:52 10:03 10:08 10:11 — 10:15 Mid-Hudson t A rter Bridge ial 8 10:15 10:18 10:22 10:33 10:38 10:41 10:44 — Main St Arlington 10:45 10:48 10:52 11:03 11:08 11:11 — 11:15 7 11:15 11:18 11:22 11:33 11:38 11:41 11:44 — Innis Ave 11:45 11:48 11:52 12:03 12:08 12:11 — 12:15 D E a s t- W 44 12:15 12:18
    [Show full text]
  • Proposal for a Gurmukhi Script Root Zone Label Generation Ruleset (LGR)
    Proposal for a Gurmukhi Script Root Zone Label Generation Ruleset (LGR) LGR Version: 3.0 Date: 2019-04-22 Document version: 2.7 Authors: Neo-Brahmi Generation Panel [NBGP] 1. General Information/ Overview/ Abstract This document lays down the Label Generation Ruleset for Gurmukhi script. Three main components of the Gurmukhi Script LGR i.e. Code point repertoire, Variants and Whole Label Evaluation Rules have been described in detail here. All these components have been incorporated in a machine-readable format in the accompanying XML file named "proposal-gurmukhi-lgr-22apr19-en.xml". In addition, a document named “gurmukhi-test-labels-22apr19-en.txt” has been provided. It provides a list of labels which can produce variants as laid down in Section 6 of this document and it also provides valid and invalid labels as per the Whole Label Evaluation laid down in Section 7. 2. Script for which the LGR is proposed ISO 15924 Code: Guru ISO 15924 Key N°: 310 ISO 15924 English Name: Gurmukhi Latin transliteration of native script name: gurmukhī Native name of the script: ਗੁਰਮੁਖੀ Maximal Starting Repertoire [MSR] version: 4 1 3. Background on Script and Principal Languages Using It 3.1. The Evolution of the Script Like most of the North Indian writing systems, the Gurmukhi script is a descendant of the Brahmi script. The Proto-Gurmukhi letters evolved through the Gupta script from 4th to 8th century, followed by the Sharda script from 8th century onwards and finally adapted their archaic form in the Devasesha stage of the later Sharda script, dated between the 10th and 14th centuries.
    [Show full text]
  • Kharosthi Manuscripts: a Window on Gandharan Buddhism*
    KHAROSTHI MANUSCRIPTS: A WINDOW ON GANDHARAN BUDDHISM* Andrew GLASS INTRODUCTION In the present article I offer a sketch of Gandharan Buddhism in the centuries around the turn of the common era by looking at various kinds of evidence which speak to us across the centuries. In doing so I hope to shed a little light on an important stage in the transmission of Buddhism as it spread from India, through Gandhara and Central Asia to China, Korea, and ultimately Japan. In particular, I will focus on the several collections of Kharo~thi manuscripts most of which are quite new to scholarship, the vast majority of these having been discovered only in the past ten years. I will also take a detailed look at the contents of one of these manuscripts in order to illustrate connections with other text collections in Pali and Chinese. Gandharan Buddhism is itself a large topic, which cannot be adequately described within the scope of the present article. I will therefore confine my observations to the period in which the Kharo~thi script was used as a literary medium, that is, from the time of Asoka in the middle of the third century B.C. until about the third century A.D., which I refer to as the Kharo~thi Period. In addition to looking at the new manuscript materials, other forms of evidence such as inscriptions, art and architecture will be touched upon, as they provide many complementary insights into the Buddhist culture of Gandhara. The travel accounts of the Chinese pilgrims * This article is based on a paper presented at Nagoya University on April 22nd 2004.
    [Show full text]
  • M. Ricklefs an Inventory of the Javanese Manuscript Collection in the British Museum
    M. Ricklefs An inventory of the Javanese manuscript collection in the British Museum In: Bijdragen tot de Taal-, Land- en Volkenkunde 125 (1969), no: 2, Leiden, 241-262 This PDF-file was downloaded from http://www.kitlv-journals.nl Downloaded from Brill.com09/29/2021 11:29:04AM via free access AN INVENTORY OF THE JAVANESE MANUSCRIPT COLLECTION IN THE BRITISH MUSEUM* he collection of Javanese manuscripts in the British Museum, London, although small by comparison with collections in THolland and Indonesia, is nevertheless of considerable importance. The Crawfurd collection, forming the bulk of the manuscripts, provides a picture of the types of literature being written in Central Java in the late eighteenth and early nineteenth centuries, a period which Dr. Pigeaud has described as a Literary Renaissance.1 Because they were acquired by John Crawfurd during his residence as an official of the British administration on Java, 1811-1815, these manuscripts have a convenient terminus ad quem with regard to composition. A large number of the items are dated, a further convenience to the research worker, and the dates are seen to cluster in the four decades between AD 1775 and AD 1815. A number of the texts were originally obtained from Pakualam I, who was installed as an independent Prince by the British admini- stration. Some of the manuscripts are specifically said to have come from him (e.g. Add. 12281 and 12337), and a statement in a Leiden University Bah ad from the Pakualaman suggests many other volumes in Crawfurd's collection also derive from this source: Tuwan Mister [Crawfurd] asked to be instructed in adat law, with examples of the Javanese usage.
    [Show full text]
  • Slides for My Lecture for the Texperience 2010
    ◦ DEVELOPMENTDEVELOPMENT OF OFxxındyındy◦ SORTSORT AND AND MERGE MERGE RULES RULES FORFOR INDIC INDIC LANGUAGES LANGUAGES ZdeněkZdeněk Wagner, Wagner, Praha, Praha, Česká Česká republika republika AnshumanAnshuman Pandey, Pandey, Univ. Univ. Michigan, Michigan, USA USA JayaJaya Saraswati, Saraswati, Mumbai, Mumbai, India India ◦ NoteNote on on pronunciation pronunciation of ofxxındy◦ındy ◦ NoteNote on on pronunciation pronunciation of ofxxındy◦ındy Czech: ks ◦ NoteNote on on pronunciation pronunciation of ofxxındy◦ındy Czech: ks English: usually as z ◦ NoteNote on on pronunciation pronunciation of ofxxındy◦ındy Czech: ks English: usually as z Hindi: as ks., l#mF can be transliterated either Lakshmi or Laxmi ◦ NoteNote on on pronunciation pronunciation of ofxxındy◦ındy Czech: ks English: usually as z Hindi: as ks., l#mF can be transliterated either Lakshmi or Laxmi Chinese: as sh ◦ NoteNote on on pronunciation pronunciation of ofxxındy◦ındy Czech: ks English: usually as z Hindi: as ks., l#mF can be transliterated either Lakshmi or Laxmi Chinese: as sh Russian: 娭¤¨ (meaning Hindi) ◦ NoteNote on on pronunciation pronunciation of ofxxındy◦ındy Czech: ks English: usually as z Hindi: as ks., l#mF can be transliterated either Lakshmi or Laxmi Chinese: as sh Russian: 娭¤¨ (meaning Hindi) x◦ındy sorts Hindi MakeIndexMakeIndex • version for English and German • CSIndex – version for Czech and Slovak • unpublished version for Sanskrit (Mark Csernel) Tables defining the sort algorithm are hard-wired in the pro- gram source code. Modification for other languages is difficult and leads rather to confusion than to development of a univer- sal tool. InternationalInternationalMakeIndexMakeIndex • Tables defining the sort algorithm present in external files. • Sort rules defined by regular expressions.
    [Show full text]
  • "9-41516)9? "9787:)4 ;7 -6+7,- )=1 16 ;0- & $
    L2/20-256 "9-41516)9?"9787:)4;7-6+7,-)=116;0-&$ ᭛᭜᭛ <;079 ,1;?))?<"-9,)6)215-14,7;3755/5)14+75 40)5 <9=)6:)0140)56<9=)6:)0/5)14+75 );- ;0$-8;-5*-9 6;97,<+;176 ,=:#6L>H8G>EI>H6=>HIDG>86AG6=B>76H:9H8G>EI;DJC9>CK6G>DJH>CH8G>EI>DCH6C96GI:;68IHEGD9J8:97:IL::CI=: I=6C9I=: I=8:CIJGN>C>CHJA6G+DJI=:6HIH>6A6G<:EDGI>DCD;>IH8DGEJH>H;DJC9>C"6K67JI#6L>B6I:G>6AH =6K:6AHD7::C;DJC9>C+JB6IG6%6A6N(:C>CHJA66A>6C9I=:(=>A>EE>C:H,=:H8G>EI>H;G:FJ:CIAN6HHD8>6I:9L>I= I=:'A9"6K6C:H:A6C<J6<:7JIB6I:G>6AHLG>II:C>C+6CH@G>I'A9%6A6N'A96A>C:H:6C9'A9+JC96C:H:A6C<J6<: =6H6AHD7::C;DJC9>CI=:#6L>H8G>EIGDBI=:B>9I=8:CIJGNH>BEA:;JC8I>DC6A#6L>L6HL>9:ANJH:9IDG:8DG9 A6C9 <G6CIH GDN6A :9>8IH 6C9 H>B>A6G 8=6C8:GN 9D8JB:CIH ,DL6G9H I=: :C9 D; I=: ;>GHI B>AA:CC>JB I=: H8G>EI 7:86B:>C8G:6H>C<AN9:8DG6I>K:6C986AA><G6E=>89J:ID>IHJH:6HI=:B6>CK:=>8A:D;'A9"6K6C:H:A>I:G6GNA6C<J6<: L>I=ADC<A6HI>C<A:<68N>CI=:A>I:G6GNIG69>I>DCD;I=:BD9:GC"6K6C:H:6C96A>C:H:A6C<J6<:H$6I:G#6L>H=DLH B6CNK6G>6I>DCHDK:G6L>9:<:D<G6E=>89>HIG>7JI>DC'K:GI>B:I=:H:K6G>6CIH=6K::KDAK:9>G:8IANDG>C9>G:8IAN >CIDI=:B6CNBD9:GCG6=B>8H8G>EIHD;>CHJA6G+H>6HJ8=6H6A>C:H:6I6@"6K6C:H:$DCI6G6:I8 /=>A:I=:68I>K:JH:D;#6L>H8G>EI=6H7::CG:EA68:97NDI=:GH8G>EIHH>C8:I=: I=8:CIJGNI=:G:6G:6CJB7:GD; BD9:GC96N:CI=JH>6HIH6C98DBBJC>I>:HL=DJH:I=:H8G>EIID96N;DGDI=:GEJGEDH:HI=6C6C8>:CIG:EGD9J8I>DC ;DG:M6BEA:ID8=6I>CHD8>6A6EEA>86I>DC6C98G:6I:>B6<:EDHIH!CI=>HG:K>K6AINE:D;JH:I=:#6L>H8G>EIB6N7: JH:9IDLG>I:A6C<J6<:HI=6I6G:CDI;DJC9>C‘6JI=:CI>8’#6L>8DGEJHHJ8=6HI=:BD9:GC"6K6C:H:A6C<J6<:DG I=: !C9DC:H>6C A6C<J6<: H#6L>=6H CDI 7::C :C8D9:9>C I=: -C>8D9: N:I I=:
    [Show full text]