A Manipuri-English Bilingual Electronic Dictionary -Design and Implementation S
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
An Analysis Towards the Development of Electronic Bilingual Dictionary (Manipuri-English) -A Report
Sagolsem Poireiton Meitei et al, / (IJCSIT) International Journal of Computer Science and Information Technologies, Vol. 3 (2) , 2012,3423-3426 An Analysis towards the Development of Electronic Bilingual Dictionary (Manipuri-English) -A Report Sagolsem Poireiton Meitei, Shantikumar Ningombam, Prof. Bipul Syam Purkayastha Department of Computer Science Assam University, Silchar-788011 ABSTRACT-Meaningful sentences are composed of travel dictionaries, dictionaries of idioms and meaningful words; any system that hopes to process natural colloquialisms, a guide to pronunciation, a grammar languages as people do must have information about words, reference, common phrases and collocations, and a their meaning and relative words in another language. This dictionary of foreign loan words information is traditionally provided through electronic dictionaries. But these dictionary entries evolved for the convenience of human readers, not for machines. So, machine MANIPURI LANGUAGE readable electronic dictionary becomes the central resources Manipuri, locally known as Meiteilon (the Meitei+lon for Natural Language applications. Dictionaries and other ‘language’), is spoken basically in the state of Manipur lexical resources are not yet widely available in electronic form which is in the north eastern india. It is also spoken in for Manipuri language. And there is no machine readable Assam, Tripura, Mizoram, Burma and Bangladesh. dictionary that can provide both of lexical resources and Manipuri is the only medium of communication among 29 conceptual information. This paper describes the ongoing different mother tongues. Hence it is regarded as lingua development of an Electronic Manipuri Bilingual dictionary. franca of Manipur state. Since 20 August 1992 Manipuri This implementation will provide a more effective combination become the first TB (Tibeto Burman) language to receive of traditional English-Manipuri bi-lingual lexicographic information and their conceptual information recognition as a schedule VIII language of India. -
The Art of Not Being Scripted So Much. the Politics of Writing Hmong
The Art of Not Being Scripted So Much The Politics of Writing Hmong Language(s)1 Jean Michaud This article discusses the persistent absence of a consensus on a script for the language of the Hmong, a kinship-based society of 5 million spread over the uplands of Southwest China and northern Indochina, with a vigorous diaspora in the West. In search of an explanation for this unusual situation, this article proposes a political reading inspired by James C. Scott’s 2009 book The Art of Not Being Governed. A particular focus is put on Scott’s claim of tactical rejection of literacy among upland groups of Asia. To set the scene, the case of the Hmong is briefly exposed before detailing the successive appearance of orthographies for their language(s) over one century. It is then argued that the lack of consensus on a common writing system might be a reflection of deeper political motives rather than merely the result of historical processes. Anthropologists and linguists have long noted that it is com- writing system appears to be of little interest to the majority of mon for stateless, kinship-based societies such as the Hmong its speakers (Thao 2006)—outside very specific moments of to shape neither their own script nor borrow someone else’s rebellious crises, a point I return to later. (Goody 1968). Here, the twist is that while the Hmong as a I am not suggesting that the Hmong situation is monolithic; group have not adopted a common writing system, over two on the contrary, it is multifarious, as we will see. -
Neo-Vernacularization of South Asian Languages
LLanguageanguage EEndangermentndangerment andand PPreservationreservation inin SSouthouth AAsiasia ed. by Hugo C. Cardoso Language Documentation & Conservation Special Publication No. 7 Language Endangerment and Preservation in South Asia ed. by Hugo C. Cardoso Language Documentation & Conservation Special Publication No. 7 PUBLISHED AS A SPECIAL PUBLICATION OF LANGUAGE DOCUMENTATION & CONSERVATION LANGUAGE ENDANGERMENT AND PRESERVATION IN SOUTH ASIA Special Publication No. 7 (January 2014) ed. by Hugo C. Cardoso LANGUAGE DOCUMENTATION & CONSERVATION Department of Linguistics, UHM Moore Hall 569 1890 East-West Road Honolulu, Hawai’i 96822 USA http:/nflrc.hawaii.edu/ldc UNIVERSITY OF HAWAI’I PRESS 2840 Kolowalu Street Honolulu, Hawai’i 96822-1888 USA © All text and images are copyright to the authors, 2014 Licensed under Creative Commons Attribution Non-Commercial No Derivatives License ISBN 978-0-9856211-4-8 http://hdl.handle.net/10125/4607 Contents Contributors iii Foreword 1 Hugo C. Cardoso 1 Death by other means: Neo-vernacularization of South Asian 3 languages E. Annamalai 2 Majority language death 19 Liudmila V. Khokhlova 3 Ahom and Tangsa: Case studies of language maintenance and 46 loss in North East India Stephen Morey 4 Script as a potential demarcator and stabilizer of languages in 78 South Asia Carmen Brandt 5 The lifecycle of Sri Lanka Malay 100 Umberto Ansaldo & Lisa Lim LANGUAGE ENDANGERMENT AND PRESERVATION IN SOUTH ASIA iii CONTRIBUTORS E. ANNAMALAI ([email protected]) is director emeritus of the Central Institute of Indian Languages, Mysore (India). He was chair of Terralingua, a non-profit organization to promote bi-cultural diversity and a panel member of the Endangered Languages Documentation Project, London. -
Proposal to Encode the Kaithi Script in ISO/IEC 10646
Proposal to Encode the Kaithi Script in ISO/IEC 10646 Anshuman Pandey University of Michigan Ann Arbor, Michigan, U.S.A. [email protected] December 13, 2007 Contents Proposal Summary Form i 1 Introduction 1 1.1 Description ..................................... ...... 1 1.2 Justification for Encoding . ........... 1 1.3 Acknowledgments ................................. ...... 2 1.4 ProposalHistory ................................. ....... 2 2 Characters Proposed 3 2.1 CharactersNotProposed . ......... 4 2.2 Basis for Character Shapes . .......... 4 3 Technical Features 8 3.1 Name ............................................ 8 3.2 Classification ................................... ....... 8 3.3 Allocation...................................... ...... 8 3.4 EncodingModel................................... ...... 8 3.5 CharacterProperties. .......... 8 3.6 Collation ....................................... ..... 11 3.7 Typology of Characters . ......... 11 4 Background 13 4.1 Origins ......................................... 13 4.2 Name ............................................ 13 4.3 Definitions...................................... ...... 13 4.4 Languages Written in the Script . ........... 14 4.5 Standardization and Growth . .......... 16 4.6 Decline ......................................... 16 4.7 Usage ........................................... 17 5 Orthography 23 5.1 Distinguishing Features . ........... 23 5.2 Vowels.......................................... 23 5.3 Consonants ...................................... ..... 23 5.4 Nasalization................................... -
31 Map Locating Manipur
MAP LOCATING MANIPUR (NOT TO SCALE) 31 Chapter: 3 Environment, Archaeological and Historical Setting The state of Manipur is situated in the eastern border of the northeastern corner of the Indian subcontinent. Manipur lies between 23° 50‟ and 25° 41‟ N, and 93° 59‟ and 94° 45‟ E., with its capital as Imphal. It is bounded by three Indian states and one country; i.e. Nagaland in the north, Assam in the west, Mizoram in the south, and Burma in the east (Cover page). Manipur covers an area of 22,347 sq. km. with a population of 27,21,756, Male: 1369,764; Female: 13,51,992 with 79.85% literary rate (Census 2011). Manipur is a state where several ethnic groups reside. “Thus ethnologically and linguistically, the Meiteis (locals of Manipur) are Tibeto-Burman of the southern Mongoloid with Austroloid, Aryan, and Tai admixtures (including some Negrito and Dravidians elements?). Sociologically, the Meiteis have absorbed these foreign elements and completely integrated them in their social structure” (Kabui, 2011:21). Various regions of Manipur have been ruled by several clans at some point of time but the central plain has been the seat of the Ninthoujas, which emerged as the most powerful clan in the history of Manipur. Many scholars agreed that Manipur had an independent long reign of the Ninthouja clan, starting from Lord Nong-da-Lairel Pakhangba (A.D. 33) till King Kulachandra (A.D.1890). Manipur came under the British sway on 12th April, 1891 after the defeat in the Khongjom War. It later came to be recognized as the twentieth state of the Indian Union on 21st January 1972. -
Language,Identity and Meiteilon
Language,identity and Meiteilon Ningombam Bupenda Meitei (St.Stephen’s College) Department of Philosophy ,University of Delhi Abstract Language and identity have played a significant role in shaping the modern nation-states. Though , in modern days, it is a result of the French Revolution and European Renaissance , the notion of identity and language if not vividly but also did exist in Athenian society. The paper makes an attempt to understand the notion of language and how and who determines a language differentiating from its dialects , and also captures the notion of identity. The paper journeys through with an understanding of the language-identity relation through some fundamentals of socio-linguistic and anthro-linguistic schools of thought. It also highlights a small case study on Meiteilon , the official language of the state of Manipur. Introduction Language and identity in today’s socio-political world have become so interwoven that either of the two are not only complementary but also can not exist without the other as the notion of identity in modern world can not be sufficed without the notion of a language which signifies one’s identity. Identity revealed through language is a modern concept born out of European Renaissance and French Revolution as language becomes not only a mere medium to communicate but a pragmatically important tool to control power through nation state, lingua franca in courts and high elitist societies besides running trade and national educational policies. Since the time of Hellenistic period when Greek is a language from linguistic coverage of the dialects of Athens , the evolution of a language either through convergence or divergence of dialects due to a change of socio-politico-cultural temporal world thereby making a possibility for modern languages like French , English , Chinese , Russian , Japanese of today to flourish 1 as not only a standard language for communication but also as a unique identity to assert their nationality in the modern world of nation states. -
Front Matter
Published by FOREWORD SANGAM BOOK STORE PAONA BAZARIMPHAL-795 001 Manipuri is one of the important languages of northeastern India belonging to Tibeto-Burman family. This language is spoken by nearly 1.3 million (1991) people. Among them 19.31% are bilinguals. The language is very rich in its structure and vocabulary. As such there are many good modern linguistic descriptions on it. However, lexicographical activities did not receive similar attention of the scholars. In that perspective the present dictionary is an important contribution to Manipuri Lexicography. This dictionary has been compiled by a well-trained linguist Dr. Surmangol Sharma, who is also a native speaker of this language. He was fortunate to have got two years of post-doctoral fellowship at Tokyo University of Foreign Studies, Tokyo, Japan. There he could get very good training in lexicography and also had interaction with many famous scholars. This has enhanced the quality of this work. This dictionary has been targeted © Compiler 2006 for students. For development of languages dictionaries also play a major role. To appreciate this fact one can look at the number and variety of dictionaries available for English language. First edition 2006 Similar activities need to take place for all languages if they have to develop. In this light I congratulate Dr. Surmangol Sharma for taking this challenging and tedious work and completing within a reasonable time and also getting it published. I hope the educated people of the state and more particularly the Manipuri students will find this dictionary immensely beneficial. This dictionary has many interesting easy-to-use features, like using of roman script assists any Manipuri speaker with some English knowledge to read and understand. -
Bsc Chemistry
SUBJECT FORENSIC SCIENCE Paper No. and Title PAPER No. 8: Questioned Document Module No. and Title MODULE No.26: Script Analysis Module Tag FSC_P8_M26 FORENSIC SCIENCE PAPER No. 8: Questioned Document MODULE No.26 : Script Analysis TABLE OF CONTENTS 1. Learning Outcomes 2. Introduction to Scripts 3. Devanagri Script 4. Urdu Script 5. Bengali script 6. Gujarati script 7. Gurumukhi script 8. Mahajani script FORENSIC SCIENCE PAPER No. 8: Questioned Document MODULE No.26 : Script Analysis 1. Learning Outcomes After studying this module, you will be able to Know what Scripts are? Learn the difference between the alphabets of the scripts. Identify the different scripts. 2. Introduction to Scripts A script is defined as the handwriting style with peculiar characters. Many languages in India, such as Hindi and Sanskrit, use Devanagri and many more languages throughout India use local variants of the script. Devanagri has evolved into highly cursive script. Hindu scriptures are written in Devanagri, a fact illustrated by the etymology of the name. "Devanagri" is a compound word with two roots: deva means "deity", and nagari means "city". Together it implies a script that is religious as well as urbane or sophisticated. The main corpus of writing dated from the Indus civilization is in the form of some two thousand inscribed seals in good, legible conditions. Although these seals and samples of Indus writing have been floating around the scholastic world for close to 70 years, little progress has been made on deciphering this elegant script. The Gujarati script is one of the modern script of India and is derived from the Devanagri script during the 16th century CE. -
Phoneme to Grapheme Correspondence
LANGUAGE IN INDIA Strength for Today and Bright Hope for Tomorrow Volume 7 : 6 June 2007 Managing Editor: M. S. Thirumalai, Ph.D. Editors: B. Mallikarjun, Ph.D. Sam Mohanlal, Ph.D. B. A. Sharada, Ph.D. A. R. Fatihi, Ph.D. Lakhan Gusain, Ph.D. K. Karunakaran, Ph.D. Jennifer Marie Bayer, Ph.D. WRITTEN MANIPURI (MEITEIRON) – PHONEME TO GRAPHEME CORRESPONDENCE Leihaorambam Sarbajit Singh, Ph.D. Kabita Thaoroijam, Ph.D. Candidate Pradip Kumar Das, Ph.D. LANGUAGE IN INDIA 7:6 June 2007 Manipuri (Meiteiron) Phoneme-Grapheme L.S.Singh, et al. 1 WRITTEN MANIPURI (MEITEIRON) - FROM PHONEME TO GRAPHEME Leihaorambam Sarbajit Singh, Ph.D. Kabita Thaoroijam, Ph.D. (Candidate) Pradip Kumar Das, Ph.D. Abstract: Man is bound to speak, which is the main device to express ones feelings or concepts inside out. The hand written language records the spoken one and plays the vital role in information technology as well as electronic retrieval system. Linguistics, the study of language, was taken for granted to be the study of spoken language until recently. The written was just written down the spoken language. Besides, it should be easy and natural to learn the spoken language as it is the only language visible, language that is seen with the eyes and make with the hands, just as the spoken language is heard with the ears and spoken with the mouth. This paper focuses on the problems of Meiteiron written using Bengali script, the advantage of using Meitei script for written Meiteiron and some innovative approach to Meiteiron written using the indigenous script. -
Brahmi-The Divine Pristine Script: a Survey
Brahmi-The Divine Pristine Script: A Survey Sonam Nagar1 and R.S. Sharma2 1,2Rajasthan Technical University E-mail: [email protected], [email protected] Abstract—The history of any culture is a precious heritage. If diversity of India is its identity then different regional languages is complete this diversity. Brahmi script is one of archaic script of South Asia and it is also root of most of Indian languages. From past few years different researches have been taking place in reading of brahmi script. This paper explore about some of research took place in this direction. 1. INTRODUCTION Languages and scripts are not the equivalent things. Languages are spoken and every language may have subgroups, like dialects. Nowadays the writing system is known as the script. One language can be written in various scripts, and one script can be used to write various languages. The origins of language in particular can be complex to trace because language mostly needs no physical medium. Scripts have physical evidence to back them up, but written material represents a very small part of all languages. Moreover, only a micro part of that material has been recovered and read so far. This is especially true the further we go back in history, the earliest languages that we know of had no scripts to begin with [1]. Amongst Asian Countries, India is a polyglot country and multifarious script. In the Indian constitution eighth schedule says about twenty-two official languages and one- twenty two regional languages are used in India [2]. Sanskrit script is considered the world's oldest script. -
Meitei Mayek
ISO/IEC JTC1/SC2/WG2 N3206R2 L2/07-005R2 2007-08-07 Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal for encoding the Meitei Mayek script in the BMP of the UCS Source: UC Berkeley Script Encoding Initiative (Universal Scripts Project) Author: Michael Everson Status: Liaison Contribution Replaces: N3158, N2042 Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2007-08-07 1. Introduction. Meitei is a Tibeto-Burman language spoken chiefly in Manipur State in India, with Myanmar on its eastern border. Its earliest use is dated to between the 11th and 12th centuries CE. The script derives from the Tibetan group of scripts, themselves deriving from Gupta Brahmi. A stone inscription found at Khoibu in Tengnoupal District contains royal edicts of Kiyamba; the royal chronicle Cheitharol Kumbaba commenced from his time. King Khagemba (1597-1652) popularized the spread of education and the production of manuscripts in the script. The script continued to be used until to write the Meitei language until the late 18th century CE. King Garibnawas (1709-1748) embraced Hinduism during his reign and many Hindu texts, such as the Ra¯maya¯na and the Maha¯bha¯rata, were translated into the Meitei language written in the Meitei script. But after the Meitei adopted Hindu practices in 1729, many literary works written about the pre-Hindu religion as well as other historical documents were burnt, and Bengali script was adopted to write Meitei. The Meitei Mayek script has been revived in recent times, omitting nine letters which are not used in modern Meitei. -
LANGUAGES of INDIA WHITE PAPER Languages of India I
LANGUAGES OF INDIA WHITE PAPER Languages of India i About Andovar Andovar is a global provider of multilingual content solutions. Our services range from text translation and content creation, through audio and video recording, to turnkey localization of websites, software, eLearning and games. Our headquarters is in Singapore, and offices in Thailand, Colombia, USA and India. About This White Paper Andovar opened an office in Kolkata in 2014 to better serve our international clients and to add Indian language translation and audio recording to our suite of services. This white paper is an attempt to understand the localization situation when it comes to the major languages spoken in India. This has proven to be a challenging task. Not only are there dozens of languages and over ten scripts in everyday use in India, but it is also a rapidly developing country, with a huge and growing economy and new intiatives related to languages, encoding and translation technology support appear almost monthly. Not much has been written to date to give a holistic overview of the localization landscape and hopefully this white paper will be a useful reference to language professionals, even if it is far from perfect. As such, there may be mistakes or missing information which will be added in future updates. Please contact [email protected] with any questions or suggestions. The data on number or speakers and native speakers of Indian languages is unreliable, with different sources following different conventions when deciding what is a language and what is merely a dialect*. Our first source is the Indian Census from 2001 (www.censusindia.gov.in/Census_Data_2001/), which provided comprehensive information about the whole country, but is now outdated.