The Brahmi Family of Scripts and Hangul: Alphabets Or Syllabaries? PUB DATE 92 NOTE 25P.; In: Linn, Mary Sarah, Ed

Total Page:16

File Type:pdf, Size:1020Kb

The Brahmi Family of Scripts and Hangul: Alphabets Or Syllabaries? PUB DATE 92 NOTE 25P.; In: Linn, Mary Sarah, Ed DOCUMENT RESUME ED 351 852 FL 020 606 AUTHOR Wilhelm, Christopher TITLE The Brahmi Family of Scripts and Hangul: Alphabets or Syllabaries? PUB DATE 92 NOTE 25p.; In: Linn, Mary Sarah, Ed. and Oliverio, Giulia, R. M., Ed. Kansas Working Papers in Linguistics, Volume 17, Numbers 1 and 2; see FL 020 603. PUB TYPE Reports Research/Technical (143) EDRS PRICE MFO1 /PCO1 Plus Postage. DESCRIPTORS *Alphabets; Foreign Countries; Language Classification; *Written Language IDENTIFIERS Asia (Southeast); *Brahmi Script; *Hangul Script; Korea; Syllabaries ABSTRACT A great deal of disagreement exists as to whether the writing systems of the Brahmi family ol7 scripts from Southeast Asia and the Indian subcontinent and the Hangul script,..+f Korea should be classified as alphabets or syllabaries. Both have in common a mixture of syllabic and alphabetic characteristics that has spawned vigorous disagreement among scholars as to their classification. An analysis of the alphabetic and syllabic aspects of the two writing systems is presented and it is demonstrated that each exhibits significant characteristics of both types of writing systems. It is concluded that neither label entirely does either writing system justice. (JL) *********************************************************************** * Reproductions supplied by EDRS are the best that can be made from the original document. *********************************************************************** U S. DEPARTMENT OF EDUCATION tyke dl EduCat,Onal Research andImprovement EDUCATIONAL RESOURCESINFORMATION CENTER IERICI reproducea as Tn s. document has been rece,ved from the person ororganaatton ongtnattng tt made to trnprove C Mtnor Changes nave been reproductton qualay od.ntonS stated ,nin,s doca Points 01 VIeVa trent Ma not neCessaroy repsent otttctat THE BRAHMI FAMILYOF SCRIPTS AND HANGUL: OERIOosInonor001.cV Alphabets or Syllabaries? "PERMISSION TO REPRODUCETHIS MATERIAL HAS BEEN GRANTEDBY Christopher Wilhelm ABSTRACT: A greatdeal of disagreementexists systems of the as to whetherthe writing TO THE EDUCATIONALRESOURCES Brahmi family ofscripts and theHangul script INFORMATION CENTER (ERIC)." be classified asalphabets or of Korea should exhibits a syllabaries. In fact, each system significant amount ofcharacteristics of both does either types, andneither label entirely of them justice. the world Linguists studying thewriting systems of classified them accordingto three have traditionally and categories, those oflogographic, syllabic, systems found alphabetic scripts. The Brahmi writing subcontinent and SoutheastAsia a. throughout the Indian both defy well as the KoreanHangul script, however, The two have in common amixture of classification. spawned syllabic and alphabeticcharacteristics that has the scholarsdiscussing vigorous disagreement among the them. For example,Lambert (1953) refers to Devanagari script used towrite Sanskrit andits (1906) as daughter languages as asyllabary, Shamasastry :;:escribes it as 'halfway an alphabet,Coulson (1976:3) alphabet and a veryregular in character between an it a syllabary,' while Cardona(1987) simply calls of Sanskrit. script and avoids theissue in his overview of the variousalphabetic and syllabic An examination therefore in order, aspects of thesewriting systems is results of such aninvestigation would and indeed the fully does justice seem to indicatethat neither label to them. of scripts, so namedfor their The Brahmi family attested descent from the Brahmiscript which is first in having in in the third centuryB.C.,1 are distinctive extent, a number of common, to agreater or lesser characteristics that begin tosurface in their Foremost of theseis what Masica (1991:136) progenitor. script, its hails as 'The greatinnovation of the Brahmi indication of vowelsother than A ([8)) bymodifications The vowel added to the basicc.:.,risonant symbols.' corresponding to (a) itselfis regarded as assumed or BEST COPY AVAILABLE 2 55-78 Karsas Working Papers in Linguistics,Volume 17, Number 1, 1992, pp. 56 inherent to each consonant in its most basic form, and any vowel pronounced after the consonant is represented by a marker appended in some fashion to the consonantal symbol. Vowels also tend to have distinct allographs when they occur in an initial position. A consonant standing alone must be so indicated by a special diacritic, and consonants otherwise not followed by any vowel, as in consonant clusters, tend to appear in some altered or abbreviated form. Descendants of the Brahmi script are most commonly associated with the Indic and the Dravidian languages of India. They are also represented in the two primary members of the Tibeto-Burmese family, as well as in significant members of the Khmer and Kam-Tai families. Brahmi-derived scripts have also made their way to such scattered locales in time and place as Sumatra, the Philippines, and the extinct Tokharian language.2 The most widely known member of this family of scripts, however, is the Devanagari script, most particularly as it is employed in writing Sanskrit. It was also at the hands of the grammarians who adapted Devanagari to the writing of Sanskrit that the aforementioned qualities peculiar to Brahmi writing systems become perhaps most pronounced. While an analysis of Brahmi scripts should consider a representative sampling of them, Sanskrit Devanagari is generally taken as the most representative case, and is therefore the best point at which to begin. The characters f the Devanagari script are elegant not only in appearance but also, in Sanskrit at least, in operation as well.As mentioned above, Devanagari consonantal characters are considered to include in their basic, 'unmarked' form the vowel [a], corresponding to [ ] in Sanskrit and most of its daughter languages, pronounced after the articulation of the consonant itself. Thus, the characters for Sanskrit's voiceless unaspirated plosives,EB , and IT , stand for the syllables [ka], [ca],Eta], Eta], and [pa], respectively. When the consonant has no following sourd, as utterance-finally or in isolation,a diacritic known as a vir5ma is placed to the lower right of the character, so that -c.7.and -cT, indicate [c] and [t] alone. The vowel [a] is overtly indicated only inan initial position, by the character JT . All other vowels and diphthongs have one allograph used initially, and another, smaller one when pronounced followinga consonant. These latter allographs may be attached to 57 the consonantal sign at almost any portion of it,such as to the right, as for TW [ta], -m- [ti]; belo , as inir [tu], [ta], [q]; above, as in tr [te], [tai]; above and to the right, as forrr [to], [tau]; and even to the left of the consonantal symbol, as in fh- [ti]. The signs for these vowels in an initial position, on the other hand, are 37r[a] [i],t [ai] ,If [o], ,Ahr [I] , [u], M], iT [rj, IF [e] ,tr [au].3 When consonantal [r],17 , an alveolar flap, is followed by [u] or [a], these, too, appear to the right of the consonantal sign, seemingly turned ninety degrees: "45 , . Two diacritics frequently modify vowels. The anuamara ( ) indicates vowel nasalization and is customarily transcribed, e.g., -am. The visaraa (37: ; h) is an aspirated echo of the vowel it modifies (Coulson 9). Although Devanagari does not readily lend itself to the representation of consonant clusters, such clusters are quite common in Sanskrit. These are represented by ligatures known as conjunct consonants, wherein two or more consonantal characters are modified tofit together in a larger conglomeration. The two most common means of effecting these combinations are horizontally, which generally involves deleting the vertical stroke where present for non-final members of the clusters, as in RT. [sta], from "Ris] and Tr[ta], or aZi [byal, from "Et [b] and Zr [ya]; and vertically, as in---;;F[iga], from [g) and 71. [gal, or [dva) , from [d] and 1 [val. Some combinations may be made in either fashion, as inr-r4 or [cca], although the advent of printing has made the former method more desirable. These conjuncts can appear quite formidable and bewildering; Coulson presents approximately 250 of them (22-4) and does not state whether this list is exhaustive, and he and Lambert both offer examples of clusters of four consonants: 54 [1:14rYa] (Coulson 23) and lEr4" [rstya] (Lambert 35). Two conjuncts, or 4:Er[ksa] and if [jPia] bear little or no resemblance to the signs for their component members ("R [S] 59- [j], [fi]). Conjuncts involving the flap [r] are of particular interest. [r] following a consonant is represented by a short diagonal mark to the lower left of the consonantal character, as in [kra]. However, when [r] precedes a consonant, it is indicated by a small hook above and as far to the right of the character as possible, as inTf [rta]. In syllables involving the diacritic anusvara, 4 or- p 58 this hook appears even tothe right of it, as in<4..rii'zir [yajfiartham], 'for sacrificialpurposes.' A question commonlyinvoked indetermining::whether be considered alphabetic orsyl;lAbic is a script might or whether or not its mostbasic unit corresponds more approaches an less with the phoneme;that is, whether it ideal principle of 'onesign per phoneme.' (Gaur 1985:119; see also Kim1987:888-9). However,:this principle would seem to be forthe most partirrelevant this view. in Sanskrit Devanagari. Two points support The first is whatMasica (146) refers to as'phonemic overkill' in the inventoryof characters. He argues that the visaraa is infact an allophone of/s/ t ), and argues that the velarand
Recommended publications
  • Talking Stone: Cherokee Syllabary Inscriptions in Dark Zone Caves
    University of Tennessee, Knoxville TRACE: Tennessee Research and Creative Exchange Masters Theses Graduate School 12-2017 Talking Stone: Cherokee Syllabary Inscriptions in Dark Zone Caves Beau Duke Carroll University of Tennessee, [email protected] Follow this and additional works at: https://trace.tennessee.edu/utk_gradthes Recommended Citation Carroll, Beau Duke, "Talking Stone: Cherokee Syllabary Inscriptions in Dark Zone Caves. " Master's Thesis, University of Tennessee, 2017. https://trace.tennessee.edu/utk_gradthes/4985 This Thesis is brought to you for free and open access by the Graduate School at TRACE: Tennessee Research and Creative Exchange. It has been accepted for inclusion in Masters Theses by an authorized administrator of TRACE: Tennessee Research and Creative Exchange. For more information, please contact [email protected]. To the Graduate Council: I am submitting herewith a thesis written by Beau Duke Carroll entitled "Talking Stone: Cherokee Syllabary Inscriptions in Dark Zone Caves." I have examined the final electronic copy of this thesis for form and content and recommend that it be accepted in partial fulfillment of the requirements for the degree of Master of Arts, with a major in Anthropology. Jan Simek, Major Professor We have read this thesis and recommend its acceptance: David G. Anderson, Julie L. Reed Accepted for the Council: Dixie L. Thompson Vice Provost and Dean of the Graduate School (Original signatures are on file with official studentecor r ds.) Talking Stone: Cherokee Syllabary Inscriptions in Dark Zone Caves A Thesis Presented for the Master of Arts Degree The University of Tennessee, Knoxville Beau Duke Carroll December 2017 Copyright © 2017 by Beau Duke Carroll All rights reserved ii ACKNOWLEDGMENTS This thesis would not be possible without the following people who contributed their time and expertise.
    [Show full text]
  • 75 Characters Maximum
    Kannada Script LGR Proposal Introduction, Current Analysis and Next Steps Dr. U.B. Pavanaja NBGP F2F Meeting, Colombo 14 December 2017 | 1 Agenda 1 2 3 Introduction to Repertoire Analysis Within Script Kannada Script Variants 4 5 6 Cross-Script WLE Rules Current Status and Variants Next Steps for Completion | 2 Introduction to Kannada Script Population – there are about 60 million speakers of Kannada language which uses Kannada script. Geographical area - Kannada is spoken predominantly by the people of Karnataka State of India. It is also spoken by significant linguistic minorities in the states of Andhra Pradesh, Telangana, Tamil Nadu, Maharashtra, Kerala, Goa and abroad Languages written in Kannada script – Kannada, Tulu, Kodava (Coorgi), Konkani, Havyaka, Sanketi, Beary (byaari), Arebaase, Koraga | 3 Classification of Characters Swaras (vowels) Letter ಅ ಆ ಇ ಈ ಉ ಊ ಋ ಎ ಏ ಐ ಒ ಓ ಔ Vowel sign/ N/Aಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ ಾ matra Yogavahas In Kannada, all consonants Anusvara ಅಂ (vyanjanas) when written as ಕ (ka), ಖ (kha), ಗ (ga), etc. actually have a built-in vowel sign (matra) Visarga ಅಃ of vowel ಅ (a) in them. | 4 Classification of Characters Vargeeya vyanjana (structured consonants) voiceless voiceless aspirate voiced voiced aspirate nasal Velars ಕ ಖ ಗ ಘ ಙ Palatals ಚ ಛ ಜ ಝ ಞ Retroflex ಟ ಠ ಡ ಢ ಣ Dentals ತ ಥ ದ ಧ ನ Labials ಪ ಫ ಬ ಭ ಮ Avargeeya vyanjana (unstructured consonants) ಯ ರ ಱ (obsolete) ಲ ವ ಶ ಷ ಸ ಹ ಳ ೞ (obsolete) | 5 Repertoire Included-1 Sr. Unicode Glyph Character Name Unicode Indic Ref Widespread No. Code General Syllabic use ? Point Category Category [Yes/No] 1 0C82 ಂ KANNADA SIGN ANUSVARA Mc Anusvara Yes 2 0C83 ಂ KANNADA SIGN VISARGA Mc Visarga Yes 3 0C85 ಅ KANNADA LETTER A Lo Vowel Yes 4 0C86 ಆ KANNADA LETTER AA Lo Vowel Yes 5 0C87 ಇ KANNADA LETTER I Lo Vowel Yes 6 0C88 ಈ KANNADA LETTER II Lo Vowel Yes 7 0C89 ಉ KANNADA LETTER U Lo Vowel Yes 8 0C8A ಊ KANNADA LETTER UU Lo Vowel Yes KANNADA LETTER VOCALIC 9 0C8B ಋ R Lo Vowel Yes 10 0C8E ಎ KANNADA LETTER E Lo Vowel Yes | 6 Repertoire Included-2 Sr.
    [Show full text]
  • Grammatica Et Verba Glamor and Verve
    “GippertOVprint” — 2014/4/24 — 22:14 — page 81 —#1 Grammatica et verba Glamor and verve Studies in South Asian, historical, and Indo-European linguistics in honor of Hans Henrich Hock on the occasion of his seventy-fifth birthday edited by Shu-Fen Chen and Benjamin Slade Beech Stave Press Ann Arbor New York • “GippertOVprint” — 2014/4/24 — 22:14 — page 82 —#2 © Beech Stave Press, Inc. All rights reserved. No part of this publication may be reproduced, translated, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without prior written permission from the publisher. Typeset with LATEX using the Galliard typeface designed by Matthew Carter and Greek Old Face by Ralph Hancock. The typeface on the cover is Post Hock by Steve Peter. Library of Congress Cataloging-in-Publication Data Grammatica et verba : glamor and verve : studies in South Asian, historical, and Indo- European linguistics in honor of Hans Henrich Hock on the occasion of his seventy- fifth birthday / edited by Shu-Fen Chen and Benjamin Slade. pages cm Includes bibliographical references. ISBN ---- (alk. paper) . Indo-European languages. Lexicography. Historical linguistics. I. Hock, Hans Henrich, - honoree. II. Chen, Shu-Fen, editor of compilation. III. Slade, Ben- jamin, editor of compilation. –dc Printed in the United States of America “GippertOVprint” — 2014/4/24 — 22:14 — page 83 —#3 TableHHHHHH ofHHHHHHHHHHHHHHH ContentsHHHHHHHHHHHH Preface . vii Bibliography of Hans Henrich Hock . ix List of Contributors . xxi , Traces of Archaic Human Language Structure Anvita Abbi in the Great Andamanese Language. , A Study of Punctuation Errors in the Chinese Diamond Sutra Shu-Fen Chen Based on Sanskrit Texts .
    [Show full text]
  • Vietnamese Accent
    Vietnamese English Erik Singer Vietnamese is spoken by about 86 million people, which makes it the 17th largest language community in the world. It is part of the Austro-Asiatic language family, and is by far the most widely spoken of these languages. It has borrowed a large portion of its vocabulary from Chinese, thanks to an early period of Chinese domination, but it is otherwise linguistically unrelated. It is generally described as having three dialects: Hanoi in the North, Ho Chi Minh in the South, and Hue in the center. The three dialects are mostly mutually intelligible, though Hue is said to be difficult for speakers of the other two dialects to understand. The northern speech…is marked by sharpness, or choppiness, with greater attention to the precise distinction of tones. The southern speech, in addition to certain uniform differences from northern speech in the pronunciation of consonants, does not distinguish between the hoi and nga tones; and, it is felt by some to sound more laconic and musical. The speech of the Center, on the other hand, is often described as being heavy because of its emphasis on low tones.1 Vietnamese uses the Latin alphabet, with additional diacritics to indicate tones. Vietnam itself is the world’s 13th most populous country, and the 8th most populous in Asia. It became independent from Imperial China in 938 BCE. Since 2000, it has been one of the fastest-growing economies in the world. Oral Posture Oral or vocal tract posture is the characteristic pattern of muscular engagement and relaxation inherent to a given language or accent.
    [Show full text]
  • Consonant Cluster Acquisition by L2 Thai Speakers
    English Language Teaching; Vol. 10, No. 7; 2017 ISSN 1916-4742 E-ISSN 1916-4750 Published by Canadian Center of Science and Education Consonant Cluster Acquisition by L2 Thai Speakers Apichai Rungruang1 1 Faculty of Humanities, Naresuan University, Phitsanulok, Thailand Correspondence: Apichai Rungruang, Faculty of Humanities, Naresuan University, Phitsanulok, Thailand. E-mail: [email protected] Received: April 29, 2017 Accepted: June 10, 2017 Online Published: June 13, 2017 doi: 10.5539/elt.v10n7p216 URL: http://doi.org/10.5539/elt.v10n7p216 Abstract Attempts to account for consonant cluster acquisition are always made into two aspects. One is transfer of the first language (L1), and another is markedness effects on the developmental processes in second language acquisition. This study has continued these attempts by finding out how well Thai university students were able to perceive English onset and coda clusters when they were second year and fourth year students. This paper also aims to investigate Thai speakers’ opinions about their listening and speaking skills, and whether their course subjects enhanced their performance. To fulfil the first objective, a pretest and posttest were launched to measure how the 34 Thai participants were able to identify 40 onset and 120 coda clusters at different periods of time. The statistical findings show that even though their overall scores in the fourth year were higher than those in the second year, there was no statistically significant difference in both major types of clusters [t = -1.29; p value >0.05 in onsets; t = -0.28; p value >0.05 in codas]. The Thai participants performed slightly better in onset (84% / 86%) than in coda (70% / 71%).
    [Show full text]
  • The Festvox Indic Frontend for Grapheme-To-Phoneme Conversion
    The Festvox Indic Frontend for Grapheme-to-Phoneme Conversion Alok Parlikar, Sunayana Sitaram, Andrew Wilkinson and Alan W Black Carnegie Mellon University Pittsburgh, USA aup, ssitaram, aewilkin, [email protected] Abstract Text-to-Speech (TTS) systems convert text into phonetic pronunciations which are then processed by Acoustic Models. TTS frontends typically include text processing, lexical lookup and Grapheme-to-Phoneme (g2p) conversion stages. This paper describes the design and implementation of the Indic frontend, which provides explicit support for many major Indian languages, along with a unified framework with easy extensibility for other Indian languages. The Indic frontend handles many phenomena common to Indian languages such as schwa deletion, contextual nasalization, and voicing. It also handles multi-script synthesis between various Indian-language scripts and English. We describe experiments comparing the quality of TTS systems built using the Indic frontend to grapheme-based systems. While this frontend was designed keeping TTS in mind, it can also be used as a general g2p system for Automatic Speech Recognition. Keywords: speech synthesis, Indian language resources, pronunciation 1. Introduction in models of the spectrum and the prosody. Another prob- lem with this approach is that since each grapheme maps Intelligible and natural-sounding Text-to-Speech to a single “phoneme” in all contexts, this technique does (TTS) systems exist for a number of languages of the world not work well in the case of languages that have pronun- today. However, for low-resource, high-population lan- ciation ambiguities. We refer to this technique as “Raw guages, such as languages of the Indian subcontinent, there Graphemes.” are very few high-quality TTS systems available.
    [Show full text]
  • Neural Substrates of Hanja (Logogram) and Hangul (Phonogram) Character Readings by Functional Magnetic Resonance Imaging
    ORIGINAL ARTICLE Neuroscience http://dx.doi.org/10.3346/jkms.2014.29.10.1416 • J Korean Med Sci 2014; 29: 1416-1424 Neural Substrates of Hanja (Logogram) and Hangul (Phonogram) Character Readings by Functional Magnetic Resonance Imaging Zang-Hee Cho,1 Nambeom Kim,1 The two basic scripts of the Korean writing system, Hanja (the logography of the traditional Sungbong Bae,2 Je-Geun Chi,1 Korean character) and Hangul (the more newer Korean alphabet), have been used together Chan-Woong Park,1 Seiji Ogawa,1,3 since the 14th century. While Hanja character has its own morphemic base, Hangul being and Young-Bo Kim1 purely phonemic without morphemic base. These two, therefore, have substantially different outcomes as a language as well as different neural responses. Based on these 1Neuroscience Research Institute, Gachon University, Incheon, Korea; 2Department of linguistic differences between Hanja and Hangul, we have launched two studies; first was Psychology, Yeungnam University, Kyongsan, Korea; to find differences in cortical activation when it is stimulated by Hanja and Hangul reading 3Kansei Fukushi Research Institute, Tohoku Fukushi to support the much discussed dual-route hypothesis of logographic and phonological University, Sendai, Japan routes in the brain by fMRI (Experiment 1). The second objective was to evaluate how Received: 14 February 2014 Hanja and Hangul affect comprehension, therefore, recognition memory, specifically the Accepted: 5 July 2014 effects of semantic transparency and morphemic clarity on memory consolidation and then related cortical activations, using functional magnetic resonance imaging (fMRI) Address for Correspondence: (Experiment 2). The first fMRI experiment indicated relatively large areas of the brain are Young-Bo Kim, MD Department of Neuroscience and Neurosurgery, Gachon activated by Hanja reading compared to Hangul reading.
    [Show full text]
  • Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
    1 Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding schemes. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only.
    [Show full text]
  • Handwriting Recognition in Indian Regional Scripts: a Survey of Offline Techniques
    1 Handwriting Recognition in Indian Regional Scripts: A Survey of Offline Techniques UMAPADA PAL, Indian Statistical Institute RAMACHANDRAN JAYADEVAN, Pune Institute of Computer Technology NABIN SHARMA, Indian Statistical Institute Offline handwriting recognition in Indian regional scripts is an interesting area of research as almost 460 million people in India use regional scripts. The nine major Indian regional scripts are Bangla (for Bengali and Assamese languages), Gujarati, Kannada, Malayalam, Oriya, Gurumukhi (for Punjabi lan- guage), Tamil, Telugu, and Nastaliq (for Urdu language). A state-of-the-art survey about the techniques available in the area of offline handwriting recognition (OHR) in Indian regional scripts will be of a great aid to the researchers in the subcontinent and hence a sincere attempt is made in this article to discuss the advancements reported in this regard during the last few decades. The survey is organized into different sections. A brief introduction is given initially about automatic recognition of handwriting and official re- gional scripts in India. The nine regional scripts are then categorized into four subgroups based on their similarity and evolution information. The first group contains Bangla, Oriya, Gujarati and Gurumukhi scripts. The second group contains Kannada and Telugu scripts and the third group contains Tamil and Malayalam scripts. The fourth group contains only Nastaliq script (Perso-Arabic script for Urdu), which is not an Indo-Aryan script. Various feature extraction and classification techniques associated with the offline handwriting recognition of the regional scripts are discussed in this survey. As it is important to identify the script before the recognition step, a section is dedicated to handwritten script identification techniques.
    [Show full text]
  • The Original Pronunciation of Sanskrit Devanàgará – Transliteration – IPA-Symbols
    The Original Pronunciation of Sanskrit DevanÀgarÁ – Transliteration – IPA-Symbols $ a n$D À DØ i L Á LØ u X A Â XØ ? Ã `C C Å `CØ CØ Æ O C # e HØ #H ai DeØ, $DH o RØ $D( au DØe8 N{ k N R kh N+ J g J gh J+ Ç 1 F c WeÝ ' ch WeÝ+ M j GeÛ + jh GeÛ+ _ È × T Ê Þ 4 Êh Þ+ I Ë É ) Ëh É+ > É Ö W t W Z th W+ G{ d G [ dh G+ Q n Q S p S { ph S+ E b E bh E+ P m P \ y M U{ r `O l O Y v Y9 ] Ì Ý Í V s V K h Ù Drafted by Maciej Zieba and Ulrich Stiehl under the auspices of Manfred Mayrhofer. ! Ï [ Improvements by Jost Gippert, Madhav Deshpande, Sunder Hattangadi, John Smith and others. This chart is a compromise, since the original pronunciation of Sanskrit Î YDULRXV is not exactly known in every detail. – 06/09/2002/us. Notes: 1. $ seems to have been pronounced originally as [n] or as [], possibly never as [D]. Not even the pronunciation of this most often used letter $ is exactly known! 2. The original pronunciation of is not known. It occurs only in the verb dS(kÆp). 3. U{seems to have been pronounced as [`] or as [], and likewise the liquid seems to have been pronounced as syllabic [`C] (notation: [`]+ [ C]) or as syllabic [C]. 4. The pronunciation of the semi-vowel Y seems to have been either [Y] or [9].
    [Show full text]
  • Writing Systems Reading and Spelling
    Writing systems Reading and spelling Writing systems LING 200: Introduction to the Study of Language Hadas Kotek February 2016 Hadas Kotek Writing systems Writing systems Reading and spelling Outline 1 Writing systems 2 Reading and spelling Spelling How we read Slides credit: David Pesetsky, Richard Sproat, Janice Fon Hadas Kotek Writing systems Writing systems Reading and spelling Writing systems What is writing? Writing is not language, but merely a way of recording language by visible marks. –Leonard Bloomfield, Language (1933) Hadas Kotek Writing systems Writing systems Reading and spelling Writing systems Writing and speech Until the 1800s, writing, not spoken language, was what linguists studied. Speech was often ignored. However, writing is secondary to spoken language in at least 3 ways: Children naturally acquire language without being taught, independently of intelligence or education levels. µ Many people struggle to learn to read. All human groups ever encountered possess spoken language. All are equal; no language is more “sophisticated” or “expressive” than others. µ Many languages have no written form. Humans have probably been speaking for as long as there have been anatomically modern Homo Sapiens in the world. µ Writing is a much younger phenomenon. Hadas Kotek Writing systems Writing systems Reading and spelling Writing systems (Possibly) Independent Inventions of Writing Sumeria: ca. 3,200 BC Egypt: ca. 3,200 BC Indus Valley: ca. 2,500 BC China: ca. 1,500 BC Central America: ca. 250 BC (Olmecs, Mayans, Zapotecs) Hadas Kotek Writing systems Writing systems Reading and spelling Writing systems Writing and pictures Let’s define the distinction between pictures and true writing.
    [Show full text]
  • Sanskrit Alphabet
    Sounds Sanskrit Alphabet with sounds with other letters: eg's: Vowels: a* aa kaa short and long ◌ к I ii ◌ ◌ к kii u uu ◌ ◌ к kuu r also shows as a small backwards hook ri* rri* on top when it preceeds a letter (rpa) and a ◌ ◌ down/left bar when comes after (kra) lri lree ◌ ◌ к klri e ai ◌ ◌ к ke o au* ◌ ◌ к kau am: ah ◌ं ◌ः कः kah Consonants: к ka х kha ga gha na Ê ca cha ja jha* na ta tha Ú da dha na* ta tha Ú da dha na pa pha º ba bha ma Semivowels: ya ra la* va Sibilants: sa ш sa sa ha ksa** (**Compound Consonant. See next page) *Modern/ Hindi Versions a Other ऋ r ॠ rr La, Laa (retro) औ au aum (stylized) ◌ silences the vowel, eg: к kam झ jha Numero: ण na (retro) १ ५ ॰ la 1 2 3 4 5 6 7 8 9 0 @ Davidya.ca Page 1 Sounds Numero: 0 1 2 3 4 5 6 7 8 910 १॰ ॰ १ २ ३ ४ ६ ७ varient: ५ ८ (shoonya eka- dva- tri- catúr- pancha- sás- saptán- astá- návan- dásan- = empty) works like our Arabic numbers @ Davidya.ca Compound Consanants: When 2 or more consonants are together, they blend into a compound letter. The 12 most common: jna/ tra ttagya dya ddhya ksa kta kra hma hna hva examples: for a whole chart, see: http://www.omniglot.com/writing/devanagari_conjuncts.php that page includes a download link but note the site uses the modern form Page 2 Alphabet Devanagari Alphabet : к х Ê Ú Ú º ш @ Davidya.ca Page 3 Pronounce Vowels T pronounce Consonants pronounce Semivowels pronounce 1 a g Another 17 к ka v Kit 42 ya p Yoga 2 aa g fAther 18 х kha v blocKHead
    [Show full text]