Proposal to Encode the Masaram Gondi Script in Unicode
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Word Structure in Gondi Thota Venkata Swamy Assistant Editor, Centre for Publications, Dravidian University, Kuppam – 517 426, Andhra Pradesh, INDIA
American International Journal of Available online at http://www.iasir.net Research in Humanities, Arts and Social Sciences ISSN (Print): 2328-3734, ISSN (Online): 2328-3696, ISSN (CD-ROM): 2328-3688 AIJRHASS is a refereed, indexed, peer-reviewed, multidisciplinary and open access journal published by International Association of Scientific Innovation and Research (IASIR), USA (An Association Unifying the Sciences, Engineering, and Applied Research) Word Structure in Gondi Thota Venkata Swamy Assistant Editor, Centre for Publications, Dravidian University, Kuppam – 517 426, Andhra Pradesh, INDIA I. Introduction In terms of numerical strength Gonds are a very dominant tribe of central India. Their habitation includes five different states, namely, Madhya Pradesh, Maharashtra, Chattisgarh, Odisha and Telangana. The Gond population according 2011 census in these states is as follows: (i) Andhra Pradesh - 1,44,259 (Now Telangana) (ii) Chhatisgarh - 8,06,254; (iii) Madhya Pradesh - 6,75,011; (iv) Maharastra - 4,41,203 and (v) Odisha - 51,948. The total Gond population is 21,24,852. Out of the total population of India 0.25% is the Gond population. 8,99,567 Gonds are bilinguals knowing two languages (42.34%) and 1,34,156 Gonds are trilinguals knowing three languages (6.31%). As they are spread in a vast area, they have heterogeneously stratified society. They exhibit a cultural variation which is from most primitive to the advanced states. The northern region of Gond habitat shows varying degrees of acculturation whereas the southern region is comparatively less exposed to the external influences. The Gond society consists elements of both the southern and northern social system including kinship norms. -
Unit 2 Distribution of Indian Tribes, Groups and Sub-Groups: Causes of Variations
Tribal Cosmogenies UNIT 2 DISTRIBUTION OF INDIAN TRIBES, GROUPS AND SUB-GROUPS: CAUSES OF VARIATIONS Structure 2.0 Objectives 2.1 Introduction 2.2 Distribution of Indian tribes: groups and sub-groups 2.2.1 Geographical 2.2.2 Linguistic 2.2.3 Racial 2.2.4 Size 2.2.5 Economy or subsistence pattern 2.2.6 Degree of incorporation into the Hindu society 2.3 Causes of variations 2.3.1 Migration 2.3.2 Acculturation and assimilation. 2.3.3 Geography or the physical environment 2.4 Let us sum up 2.5 Activity 2.6 References and further reading 2.7 Glossary 2.8 Answers to questions 2.0 OBJECTIVES After having read this Unit, you will be able to: x get a panoramic view of the Indian tribes; x understand the principles upon which their classification is based; and x map their distribution on the basis of their geographical location, language, physical attributes, economy and social structure. 2.1 INTRODUCTION We have had a general outline about the tribes of India in the preceding Unit (Course 4-Block 1- Unit 1). In this present Unit we will deal with distribution of the tribes of India in detail. Structurally this Unit is divided into two main sections. In this first section of the Unit we will discuss the distribution pattern of Indian tribes and how they are classified into different groups and sub-groups based on various criteria. 24 These criteria are based on their geographical location, language, physical Migrant Tribes / Nomads attributes, economy and the degree of incorporation into the Hindu society. -
I:\Eastern Anthropologist\No 2
Malli Gandhi ENDANGERMENT OF LANGUAGE AMONG THE YERUKULA: A NOMADIC / DENOTIFIED TRIBE OF ANDHRA PRADESH The scheduled tribes, nomadic and denotified tribes constitute a major segment of population in Andhra Pradesh. They live in remote areas of the state and need special focus to solve their problems. Jatapu, Konda Dora, Muka Dora, Manne Dora, Savara, Gadaba, Chenchu, Koya, Gondi are some of the major primitive tribal groups of Andhra Pradesh. In addition there are Dasari, Yerukula, Yanadi, Sugali, Korawa, Koracha, Kaidai and Nakkala as some of the denotified tribes in Andhra Pradesh. Further, Woddera, Pamula, Nirshikari, Budabukkala, Mandula, Pusala, Gangi, Reddula, Boya, Dommara, Jogi are some of the nomadic and semi-nomadic tribes. Andhra Pradesh has 52 lakhs scheduled tribe population (2001 census). The largest tribal population is found in Khammam district (26.47% that is 682617 – 6.8 lakhs), followed by Visakapatnam district (5.58 lakhs). The tribal population of Andhra Pradesh increased from 7.67 to 52 lakhs in 50 years between 1951 and 2001. The substantial population increase between 1971 and 2001 was because of the recognition of the Sugali, Yerukula, Yanadi, Nakkala and other denotified, nomadic tribes as scheduled tribes in the entire state. The tribal communities in the state of Andhra Pradesh mostly exhibited Proto-Austroloid features. Chenchus and Yanadis exhibit some Negrito strain whereas the Khond and Savara have Mongoloid features. The tribal communities in Andhra Pradesh mainly belong to three linguistic families such as: Dravidian language family (Gondi, Koya, Kolami, Yerukula, and so on); Mundari language family (Savra, Godaba, and so on); Indo-Aryan language family (Banjara, and others). -
Learnings from Technological Interventions in a Low Resource Language: a Case-Study on Gondi
Learnings from Technological Interventions in a Low Resource Language: A Case-Study on Gondi Devansh Mehta 1∗, Sebastin Santy2∗, Ramaravind K. Mothilal2, Brij M.L. Srivastava3, Alok Sharma4, Anurag Shukla5, Vishnu Prasad1, Venkanna U.5, Amit Sharma2, Kalika Bali2 1Voicedeck Technologies, 2Microsoft Research India, 3INRIA France, 4DN Developers, 5IIIT Raipur Abstract The primary obstacle to developing technologies for low-resource languages is the lack of usable data. In this paper, we report the adoption and deployment of 4 technology-driven methods of data collection for Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. In the process of data collection, we also help in its revival by expanding access to information in Gondi through the creation of linguistic resources that can be used by the community, such as a dictionary, children’s stories, an app with Gondi content from multiple sources and an Interactive Voice Response (IVR) based mass awareness platform. At the end of these interventions, we collected a little less than 12,000 translated words and/or sentences and identified more than 650 community members whose help can be solicited for future translation efforts. The larger goal of the project is collecting enough data in Gondi to build and deploy viable language technologies like machine translation and speech to text systems that can help take the language onto the internet. Keywords: Low-Resource Languages, Deployment, Applications 1. Introduction Around 40% of all the languages in the world face the dan- ger of extinction in the near future. Languages are not only a means of communication but also a carrier of tradition and cultures like verbal art, songs, narratives, rituals etc. -
1 L2/15-289R
L2/15-289r TO: Unicode Technical Committee FROM: Deborah Anderson, Project Leader, Script Encoding Initiative, UC Berkeley DATE: 31 October 2015 TITLE: Script Encoding Initiative Liaison report This document provides a summary of the UC Berkeley Script Encoding Initiative’s recent activities. Proposals or documents currently submitted to the UTC that have involved SEI include: • Preliminary proposal for encoding the Medefaidrin (Oberi Okaime) script (revised) (Rovenchak et al.) [L2/15-117r] • Preliminary proposal for encoding the Mandombe script (revised) (Rovenchak et al.) [L2/15-118r] • Proposal to Encode Indic Siyaq Numbers (revised) (Pandey) [L2/15-121] • Proposal to Encode Ottoman Siyaq Numbers (Pandey) [L2/15-072] • Proposal to Encode the Makasar script (Pandey) [L2/15-233] • Proposal to Encode the Dogra script (Pandey) [L2/15-234] • Proposal to Encode the Gunjala Gondi script (Pandey) [L2/15-235] • Proposal to encode the Hanifi Rohingya script (Pandey) [L2/15-278] • Proposal to encode the Old Sogdian script (Pandey) [L2/15-089] • Preliminary draft of the Ptolemaic repertoire (A: Man and his occupations) (Suignard) [L2/15-240] • Summary of Ad Hoc Meeting on Two Historical Scripts from Mongolia (Tokyo, Japan) (Anderson) [L2/15-249] The following scripts are in the preliminary stage or are still undergoing research and are not yet ready for approval: • Afáka (Everson) [N4292=L2/12-228] • Bagam (Everson) [N4293=L2/12-229] • Balti ‘A’ and 'B' (Pandey) [N3842=L2/10-231, N4016=L2/11-103] • Bhujinmol (Pandey) [L2/14-283] • Book Pahlavi -
ELLORA: Enabling Low Resource Languages with Technology
ELLORA: Enabling Low Resource Languages with Technology Kalika Bali, Monojit Choudhury, Sunayana Sitaram, Vivek Seshadri Microsoft Research Labs Bangalore, India {kalikab, monojitc, susitara, visesha}@microsoft.com Abstract Language technology has had a huge impact on the way language communities communicate and access information. However, this revolution has bypassed over 5000 languages around the world that have no resources to develop technology in their languages. ELLORA, with its mission to empower every person and every organization using underserved languages to achieve more, is a program for enabling low resource languages through language technology. In this paper we describe how through innovative methodologies and techniques that allow systems to be built in resource constrained settings, ELLORA seeks to positively impact the underserved language communities around the globe.. Keywords: Low resource languages, language technology, language data, endangered and minority languages Résumé भाषा प्रौद्योगिकी के माध्यम से गिगिटल 啍ा車गि की अिुवाई मᴂ भाषा समुदायो車 को िानकारी पहԁचाने के िरीके पर भारी प्रभाव पडा है। िबगक कुछ प्रमुख स車साधन स車पन्न भाषाओ車 के उपयोिकिाा हर गदन ऐसी िकनीक का लाभ उठा रहे हℂ, इस 啍ा車गि से दुगनया भर की अगधका車श भाषाएԁ उपेगिि हℂ । प्रौद्योगिकी गवकास की िेटा-भूखी दुगनया मᴂ, 5000 से अगधक भाषाओ車 को अपनी भाषाओ車 मᴂ प्रौद्योगिकी गवकगसि करने के गलए कोई स車साधन नही車 हℂ। एलोरा (ELLORA) का गमशन प्रत्येक व्यक्ति और प्रत्येक स車िठन को अपनी भाषा मᴂ आिे बढ़ने के गलए प्रोत्सागहि करना है । यह प्राकृगिक भाषा प्रस車स्करण, वािाालाप और भाषण प्रौद्योगिकी के माध्यम से कम स車साधन भाषाओ車 को सिम करने के गलए एक काया啍म है। इस शोध पत्र मᴂ हम बिािे हℂ गक कैसे नवीन पद्धगियो車 और िकनीको車 के माध्यम से ELLORA दुगनया भर के भाषा समुदायो車 को सकारात्मक 셂प से प्रभागवि करना चाहिा है। in the field of NLP, which is primarily due to the use of 1. -
Aspects of Gond Astronomy1
Aspects of Gond Astronomy1 M N Vahia1 and Ganesh Halkare2 1Tata Institute of Fundamental Research, Mumbai 400 005 2Indrayani Colony, Amravati, 444 607 Summary: The Gond community is considered to be one of the most ancient tribes of India with a continuing history of several thousand years. They are also known for their largely isolated history which they have retained through the millennia. Several of their intellectual traditions therefore are a record of parallel aspects of human intellectual growth. It still preserves its original flavour and is not homogenised by later traditions of India. In view of this, they provide a separate window to the different currents that constitute contemporary India. In the present study, we summarise their mythology, genetics and script. We then investigate their astronomical traditions and try to understand this community through a survey of about 15 Gond villages spread over Maharashtra, Andhra Pradesh and Madhya Pradesh. We show that they have a distinctly different view of the skies from the conventional astronomical ideas which is both interesting and informative. We briefly comment on other aspects of their life as culled from our encounters with the members of the Gond community Introduction: Gonds are the largest of Indian tribes with a net population between 4 and 5 million spread over north Andhra Pradesh, eastern Maharashtra, eastern Madhya Pradesh, Jharkhand and western Orissa (Deogaonkar, 2007 p 14-17). While their precise history cannot be dated before 890 AD (Deogaonkar, 2007, p 37), their roots are certainly older. Origin of the Gonds Mehta (1984) has studied the Gonds from different perspectives and also their history and mythology in detail (Mehta, 1984, p 105 – 166). -
1 Sundanese Sign Virama U+1Bab
Proposal to synchronize seven glyphs in the Code Charts For consideration by Unicode Technical Committee 2020-01- 13 (revised; first submitted 2020-01-06) We should always say what we see. Marcel Schneider ([email protected]) Above all we should always —which is most difficult— see what we see. Charles Péguy This proposal adds to the response to Action item 161-A1 in that it aims at synchronizing the Core Specification with changes already effected to other parts of the Unicode Standard or suggested in Proposal to make material changes to UAX #14, or Proposal suggesting formal edits to UAX #14, submitted simultaneously. A minor part of this proposal relies on Proposal to extend support for abbreviations, scheduled for simultaneous submission. By coincidence, this proposal is also part of Unicode 13.0 beta feedback. These suggestions are sorted by importance and concomitant likelihood of acceptance rather than by code points. 1 SUNDANESE SIGN VIRAMA U+1BAB Change from: Change to: Rationale: Among the eleven invisible stackers encoded so far, listed in the table below (where brackets stand for a dashed box), Sundanese virama is the only one whose reference glyph is lacking the dotted circle U+25CC above the (combining) plus sign below. Proposal to synchronize seven glyphs in the Code Charts 2 In the Change-to picture above, the COMBINING PLUS SIGN BELOW its DOTTED CIRCLE in a dashed box has been copy-pasted from the reference glyph of U+1039 MYANMAR SIGN VIRAMA. cp Name Chart glyph 1039 MYANMAR SIGN VIRAMA [◌̟ ] 17D2 KHMER SIGN COENG -
Languages of the World--Indo-Pacific Fascicle Eight
REPORT RESUMES ED 010 367 48 LANGUAGES OF THE WORLD--INDO-PACIFIC FASCICLE EIGHT. ST- VOEGELI1, C.F. VOEGELIN, FLORENCE M. INDIANA UNIV., BLOOMINGTON REPORT NUMBER NDEA- VI -63 -20 PUB DATE. APR 66 CONTRACT OEC-SAE-9480 FURS PRICE MF-$Q.18HC-52.80 70P. ANTHROPOLOGICAL LINGUISTICS, 8(4)/1-64, APRIL 1966 DESCRIPTORS- *LANGUAGES, *INDO PACIFIC LANGUAGES, ARCHIVES OF LANGUAGES OF THE WORLD, BLOOMINGTON, INDIANA THIS REPORT DESCRIBES SOME OF THE LANGUAGES AND LANGUAGE FAMILIES OF THE SOUTH AND SOUTHEAST ASIA REGIONS OF THE INDO-PACIFIC AREA. THE LANGUAGE FAMILIES DISCUSSED WERE JAKUM, SAKAI, SEMANG, PALAUNG-WA (SALWEEN), MUNDA, AND DRAVIDIAN. OTHER LANGUAGES DISCUSSED WERE ANDAMANESE, N/COBAnESE, KHASI, NAHALI, AND BCRUSHASKI. (THE REPORT IS PART OF A SERIES, ED 010 350 TO ED 010 367.) (JK) +.0 U. S. DEPARTMENT OF HEALTH, EDUCATION AND WELFARE b D Office of Education tr's This document has been reproduced exactlyas received from the S.,4E" L es, C=4.) person or organiz3t1on originating It. Points of view or opinions T--I stated do not nocessart- represent official °dice of Edumdion poeWon or policy. AnthropologicalLinguistics Volume 8 Number 4 April 116 6 LANGUAGES OF THE WORLD: INDO- PACIFIC FASCICLE EIGHT A Publication of the ARCHIVES OFLANGUAGES OF THEWORLD Anthropology Department Indiana University ANTHROPOLOGICAL LINGUISTICS is designed primarily, but not exclusively, for the immediate publication of data-oriented papers for which attestation is available in the form oftape recordings on deposit in the Archives of Languages of the World. -
The Global Council on Anthropological Linguistics 2019, in Asia
The GLOCAL 2019 Linguistics Anthropological on GLOCAL,Global © 2019 The The Council Copyright The Global Council on Anthropological Linguistics 2019, in Asia Siem Reap, Cambodia 23 - 26 January 2019 Editor: Asmah Haji Omar ISSN: 2707-8647 ISBN: 978-0-6485356-0-7 1 The GLOCAL Conference 2019 in Asia (The CALA 2019) “Revitalization and Representation” Conference Proceedings Papers January 23-26, 2019 Royal Angkor Resort Siem Reap, Cambodia Hosted by The Paññāsāstra University of Cambodia The GLOCAL Conference 2019 ASIAN LINGUISTIC ANTHROPOLOGY 2019 Siem Reap, Cambodia https://glocal.soas.ac.uk/the-cala-2019-title/ The Proceedings of the 2019 GLOCAL Conference in Asia, Asian Linguistic Anthropology ISSN 2707-8647 ISBN 978-0-6485356-0-7 Subcriber Categories Linguistics, Anthropology, Social Sciences, Humanities, Interdisciplinary Studies, Cultural Studies, Sociolinguistics, Language and Society GLOCAL Publications are available by ordering individual publications directly from the GLOCAL Office, Online at https://glocal.soas. ac.uk, or through organizational or individual affiliation with The GLOCAL. For further information, contact The GLOCAL office at cala@ soas.ac.uk, at SOAS, University of London. NOTICE: The project that is the subject of this report was approved by the Central Committee and Sub Committee, whose members are drawn from The GLOCAL Scientific Committee. This book of proceedings papers has been reviewed by a group of reviewers, external to the authors and editors of this work, according to the procedures directed by and approved by the GLOCAL Main Review Committee. This project was hosted by the Paññāsāstra University of Cambodia and organized by The Global Council on Anthropological Linguistics, SOAS, University of London. -
Bible Translation in the Indian Context K
/JT4212 (2000),pp. 125-137 Bible Translation in the Indian Context K. Regu* Introduction India is a multilingual, pluricultural and multiethnic nation. India is known for its religious plurality also. There are more than 1652languages spoken by different social groups, sometimes spreading beyond socio-cultural barriers, in India. All these languages coine under 4 families such as, Indo-Aryan, Dravidian, Austr,o-Asiatic (Munda) and Tibeto-Burmese, among which only a very few languages have thei,r own scripts and written records available in different forms. The Bible comes first in term~ of translation across most of the world languages. In India also the Bible more than any other literature has been translated into many languages·: i.e., in more than 60 languages in fujl)nd partially available in many other limguages-200 languages approximately. · [. It is through Bible translation th*'many scholars ventured into the preparation ofgrammar for various Indian languages. Linguistics played a major role in identifying, classifying and grouping these languages under differ~nt families according to the genetic relations with one ·another. Through Bible translation the people oflndia are linked together across the social, cultural and linguistic boundaries within India and outside India. A global unity also is being developed through Bible and biblical thoughts in this millennium. I. India and Her Languages 1.1. The Indian Empire is a well-known book which was published in the year 1881 by Sir William Hunter. Later the same was revised and published in 4 Volumes under the title The Imperial Gazetteer ofIndia: The Indian Empire during 19.07-1909. -
Overview and Rationale
Integration Panel: Maximal Starting Repertoire — MSR-5 Overview and Rationale REVISION – APRIL 06, 2021 – PUBLIC COMMENT Table of Contents 1 Overview 3 2 Maximal Starting Repertoire (MSR-5) 3 2.1 Files 3 2.1.1 Overview 3 2.1.2 Normative Definition 4 2.1.3 HTML Presentation 4 2.1.4 Code Charts 4 2.2 Determining the Contents of the MSR 5 2.3 Process of Deciding the MSR 6 3 Scripts 8 3.1 Comprehensiveness and Staging 8 3.2 What Defines a Related Script? 9 3.3 Separable Scripts 9 3.4 Deferred Scripts 9 3.5 Historical and Obsolete Scripts 9 3.6 Selecting Scripts and Code Points for the MSR 10 3.7 Scripts Appropriate for Use in Identifiers 10 3.8 Modern Use Scripts 11 3.8.1 Common and Inherited 12 3.8.2 Scripts included in MSR-1 12 3.8.3 Scripts added in MSR-2 12 3.8.4 Scripts added in MSR-3 through MSR-5 13 3.8.5 Modern Scripts Ineligible for the Root Zone 13 3.9 Scripts for Possible Future MSRs 13 3.10 Scripts Identified in UAX#31 as Not Suitable for identifiers 14 4 Exclusions of Individual Code Points or Ranges 16 4.1 Historic and Phonetic Extensions to Modern Scripts 16 4.2 Code Points That Pose Special Risks 17 4.3 Code Points with Strong Justification to Exclude 17 4.4 Code Points That May or May Not be Excludable from the Root Zone LGR 17 4.5 Non-spacing Combining Marks 18 Integration Panel: Maximal Starting Repertoire — MSR-3 Overview and Rationale 5 Discussion of Particular Code Points 20 5.1 Digits and Hyphen 20 5.2 CONTEXT O Code Points 21 5.3 CONTEXT J Code Points 21 5.4 Code Points Restricted for Identifiers 21 5.5 Compatibility