Proposal for an Oriya Script Root Zone Label Generation Ruleset (LGR)
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
UNICODE for Kannada General Information & Description
UNICODE for Kannada (U+0C80 to U+0CFF) General Information & Description Written by: C V Srinatha Sastry Issued by: Director Directorate of Information Technology Government of Karnataka Multi Storied Buildings, Vidhana Veedhi BANGALORE 560 001 INDIA PDF created with FinePrint pdfFactory Pro trial version http://www.pdffactory.com UNICODE for Kannada Introduction The Kannada script is a South Indian script. It is used to write Kannada language of Karnataka State in India. This is also used in many parts of Tamil Nadu, Kerala, Andhra Pradesh and Maharashtra States of India. In addition, the Kannada script is also used to write Tulu, Konkani and Kodava languages. Kannada along with other Indian language scripts shares a large number of structural features. The Kannada block of Unicode Standard (0C80 to 0CFF) is based on ISCII-1988 (Indian Standard Code for Information Interchange). The Unicode Standard (version 3) encodes Kannada characters in the same relative positions as those coded in the ISCII-1988 standard. The Writing system that employs Kannada script constitutes a cross between syllabic writing systems and phonemic writing systems (alphabets). The effective unit of writing Kannada is the orthographic syllable consisting of a consonant (Vyanjana) and vowel (Vowel) (CV) core and optionally, one or more preceding consonants, with a canonical structure of ((C)C)CV. The orthographic syllable need not correspond exactly with a phonological syllable, especially when a consonant cluster is involved, but the writing system is built on phonological principles and tends to correspond quite closely to pronunciation. The orthographic syllable is built up of alphabetic pieces, the actual letters of Kannada script. -
Annualrepeng II.Pdf
ANNUAL REPORT – 2007-2008 For about six decades the Directorate of Advertising and on key national sectors. Visual Publicity (DAVP) has been the primary multi-media advertising agency for the Govt. of India. It caters to the Important Activities communication needs of almost all Central ministries/ During the year, the important activities of DAVP departments and autonomous bodies and provides them included:- a single window cost effective service. It informs and educates the people, both rural and urban, about the (i) Announcement of New Advertisement Policy for nd Government’s policies and programmes and motivates print media effective from 2 October, 2007. them to participate in development activities, through the (ii) Designing and running a unique mobile train medium of advertising in press, electronic media, exhibition called ‘Azadi Express’, displaying 150 exhibitions and outdoor publicity tools. years of India’s history – from the first war of Independence in 1857 to present. DAVP reaches out to the people through different means of communication such as press advertisements, print (iii) Multi-media publicity campaign on Bharat Nirman. material, audio-visual programmes, outdoor publicity and (iv) A special table calendar to pay tribute to the exhibitions. Some of the major thrust areas of DAVP’s freedom fighters on the occasion of 150 years of advertising and publicity are national integration and India’s first war of Independence. communal harmony, rural development programmes, (v) Multimedia publicity campaign on Minority Rights health and family welfare, AIDS awareness, empowerment & special programme on Minority Development. of women, upliftment of girl child, consumer awareness, literacy, employment generation, income tax, defence, DAVP continued to digitalize its operations. -
Journalism Class - Xi
HIGHER SECONDARY COURSE JOURNALISM CLASS - XI Government of Kerala DEPARTMENT OF EDUCATION State Council of Educational Research and Training (SCERT) Kerala 2016 THE NATIONAL ANTHEM Jana-gana-mana adhinayaka, jaya he Bharatha-bhagya-vidhata. Punjab-Sindh-Gujarat-Maratha Dravida-Utkala-Banga Vindhya-Himachala-Yamuna-Ganga Uchchala-Jaladhi-taranga Tava subha name jage, Tava subha asisa mage, Gahe tava jaya gatha. Jana-gana-mangala-dayaka jaya he Bharatha-bhagya-vidhata. Jaya he, jaya he, jaya he, Jaya jaya jaya, jaya he! PLEDGE India is my country. All Indians are my brothers and sisters. I love my country, and I am proud of its rich and varied heritage. I shall always strive to be worthy of it. I shall give my parents, teachers and all elders respect, and treat everyone with courtesy. To my country and my people, I pledge my devotion. In their well-being and prosperity alone lies my happiness. Prepared by State Council of Educational Research and Training (SCERT) Poojappura, Thiruvananthapuram 695012, Kerala Website : www.scertkerala.gov.in e-mail : [email protected] Phone : 0471 - 2341883, Fax : 0471 - 2341869 Typesetting and Layout : SCERT © Department of Education, Government of Kerala To be printed in quality paper - 80gsm map litho (snow-white) Foreword Dear learners, It is with immense pleasure and pride that State Council of Educational Research and Training (SCERT), Kerala brings forth its first textbook in Journalism for higher secondary students. We have been trying to set up a well structured syllabus and textbook for Journalism since the introduction of the course at the higher secondary level. Though we could frame a syllabus, we could not develop a textbook for Journalism all these years. -
A Sociolinguistic Survey of the Bhatri-Speaking Communities of Central India
DigitalResources Electronic Survey Report 2017-005 A Sociolinguistic Survey of the Bhatri-speaking Communities of Central India Compiled by Dave Beine A Sociolinguistic Survey of the Bhatri-speaking Communities of Central India Compiled by Dave Beine Researched by Dave Beine Bruce Cain Kathy Cain Michael Jeyabalan Ashok Sawlikar Satya Soren SIL International® 2017 SIL Electronic Survey Report 2017-005, May 2017 © 2017 SIL International® All rights reserved Abstract This sociolinguistic survey of the Bhatri-speaking communities of Central India was carried out between February and November 1989. The goal of the survey was to assess the need for language development work and vernacular literacy programs among the Bhatri-speaking peoples of Bastar District in Madhya Pradesh and Koraput District in Orissa. Dialect intelligibility tests revealed that the whole Bhatri- speaking area can be considered one language area. Language use and attitudes questionnaires showed that the language is thriving. Bilingualism in the major languages of Hindi, Oriya, and Halbi is inadequate for people to use existing materials. Based on these findings the survey recommends that a language project be undertaken in the Bhatri community. (This survey report written some time ago deserves to be made available even at this late date. Conditions were such that it was not published when originally written. The reader is cautioned that more recent research may be available. Historical data is quite valuable as it provides a basis for a longitudinal analysis and helps -
Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
1 Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding schemes. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only. -
2001 Presented Below Is an Alphabetical Abstract of Languages A
Hindi Version Home | Login | Tender | Sitemap | Contact Us Search this Quick ABOUT US Site Links Hindi Version Home | Login | Tender | Sitemap | Contact Us Search this Quick ABOUT US Site Links Census 2001 STATEMENT 1 ABSTRACT OF SPEAKERS' STRENGTH OF LANGUAGES AND MOTHER TONGUES - 2001 Presented below is an alphabetical abstract of languages and the mother tongues with speakers' strength of 10,000 and above at the all India level, grouped under each language. There are a total of 122 languages and 234 mother tongues. The 22 languages PART A - Languages specified in the Eighth Schedule (Scheduled Languages) Name of language and Number of persons who returned the Name of language and Number of persons who returned the mother tongue(s) language (and the mother tongues mother tongue(s) language (and the mother tongues grouped under each grouped under each) as their mother grouped under each grouped under each) as their mother language tongue language tongue 1 2 1 2 1 ASSAMESE 13,168,484 13 Dhundhari 1,871,130 1 Assamese 12,778,735 14 Garhwali 2,267,314 Others 389,749 15 Gojri 762,332 16 Harauti 2,462,867 2 BENGALI 83,369,769 17 Haryanvi 7,997,192 1 Bengali 82,462,437 18 Hindi 257,919,635 2 Chakma 176,458 19 Jaunsari 114,733 3 Haijong/Hajong 63,188 20 Kangri 1,122,843 4 Rajbangsi 82,570 21 Khairari 11,937 Others 585,116 22 Khari Boli 47,730 23 Khortha/ Khotta 4,725,927 3 BODO 1,350,478 24 Kulvi 170,770 1 Bodo/Boro 1,330,775 25 Kumauni 2,003,783 Others 19,703 26 Kurmali Thar 425,920 27 Labani 22,162 4 DOGRI 2,282,589 28 Lamani/ Lambadi 2,707,562 -
Translation Strategies of the Non-Native Odia Translators (1807-1874)
Translation Strategies of the Non-Native Odia Translators (1807-1874) RAMESH C MALIK Translation strategy means a plan or procedure adopted by the translators to solve the translation problems. The present paper is to highlight on the translation strategies of the non-native Odia translators during the colonial period (1807-1874). First of all, those translators who were non-residents of Odisha and had learnt Odia for specific purposes are considered non-native Odia translators.The first name one of the Odia translators is William Carey (1761-1834), who translated the New Testament or Bible from English to Odia that was subsequently published by the Serampore Mission Press Calcutta in 1807. A master craftsman of Christian theology and an Odia translator of missionary literature, Amos Sutton (1798-1854), who translated John Bunyan’s (1628-1688) the Pilgrim’s Progress (1678) to Odia under the titled swargiya jātrira brutānta in 1838. Sutton served as an Odia translator under the British government. His religious, literary, and linguistic contributions to Odia language and literature are to be studied for making a concrete idea about the development of Odia prose. In the era of Odia translation discourse, his translations deserve to be studied in the theoretical frame of translation strategies. In this paper, the following translation strategies like linguistic strategies, literal translation strategy, lexical alteration strategy, deletion, exoticism and cultural transposition strategies are predominately adopted by the translators. Since the objectives of the SLTs were to promote religious evangelization and second language learning, the translation strategies tried to preserve the religious and pedagogical fidelity rather that textual fidelity in the translated texts. -
The Syllable Structure of Bangla in Optimality Theory and Its Application to the Analysis of Verbal Inflectional Paradigms in Distributed Morphology
The syllable structure of Bangla in Optimality Theory and its application to the analysis of verbal inflectional paradigms in Distributed Morphology von Somdev Kar Philosophische Dissertation angenommen von der Neuphilologischen Fakultät der Universität Tübingen am 09. Januar 2009 Tübingen 2009 Gedruckt mit Genehmigung der Neuphilologischen Fakultät der Universität Tübingen Hauptberichterstatter : Prof. Hubert Truckenbrodt, Ph.D. Mitberichterstatter : PD Dr. Ingo Hertrich Dekan : Prof. Dr. Joachim Knape ii To my parents... iii iv ACKNOWLEDGEMENTS First and foremost, I owe a great debt of gratitude to Prof. Hubert Truckenbrodt who was extremely kind to agree to be my research adviser and to help me to formulate this work. His invaluable guidance, suggestions, feedbacks and above all his robust optimism steered me to come up with this study. Prof. Probal Dasgupta (ISI) and Prof. Gautam Sengupta (HCU) provided insightful comments that have given me a different perspective to various linguistic issues of Bangla. I thank them for their valuable time and kind help to me. I thank Prof. Sengupta, Dr. Niladri Sekhar Dash and CIIL, Mysore for their help, cooperation and support to access the Bangla corpus I used in this work. In this connection I thank Armin Buch (Tübingen) who worked on the extraction of data from the raw files of the corpus used in this study. And, I wish to thank Ronny Medda, who read a draft of this work with much patience and gave me valuable feedbacks. Many people have helped in different ways. I would like to express my sincere thanks and gratefulness to Prof. Josef Bayer for sending me some important literature, Prof. -
Proposal to Encode 0D5F MALAYALAM LETTER ARCHAIC II
Proposal to encode 0D5F MALAYALAM LETTER ARCHAIC II Shriramana Sharma, jamadagni-at-gmail-dot-com, India 2012-May-22 §1. Introduction In the Malayalam Unicode encoding, the independent letter form for the long vowel Ī is: ഈ where the length mark ◌ൗ is appended to the short vowel ഇ to parallel the symbols for U/UU i.e. ഉ/ഊ. However, Ī was originally written as: As these are entirely different representations of Ī, although their sound value may be the same, it is proposed to encode the archaic form as a separate character. §2. Background As the core Unicode encoding for the major Indic scripts is based on ISCII, and the ISCII code chart for Malayalam only contained the modern form for the independent Ī [1, p 24]: … thus it is this written form that came to be encoded as 0D08 MALAYALAM LETTER II. While this “new” written form is seen in print as early as 1936 CE [2]: … there is no doubt that the much earlier form was parallel to the modern Grantha , both of which are derived from the old Grantha Ī as seen below: 1 ma - īdṛgvidhā | tatarāya īṇ Old Grantha from the Iḷaiyānputtūr copper plates [3, p 13]. Also seen is the Vatteḻuttu Ī of the same time (line 2, 2nd char from right) which also exhibits the two dots. Of course, both are derived from old South Indian Brahmi Ī which again has the two dots. It is said [entire paragraph: Radhakrishna Warrier, personal communication] that it was the poet Vaḷḷattōḷ Nārāyaṇa Mēnōn (1878–1958) who introduced the new form of Ī ഈ. -
Proposal for a Kannada Script Root Zone Label Generation Ruleset (LGR)
Proposal for a Kannada Script Root Zone Label Generation Ruleset (LGR) Proposal for a Kannada Script Root Zone Label Generation Ruleset (LGR) LGR Version: 3.0 Date: 2019-03-06 Document version: 2.6 Authors: Neo-Brahmi Generation Panel [NBGP] 1. General Information/ Overview/ Abstract The purpose of this document is to give an overview of the proposed Kannada LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document: proposal-kannada-lgr-06mar19-en.xml Labels for testing can be found in the accompanying text document: kannada-test-labels-06mar19-en.txt 2. Script for which the LGR is Proposed ISO 15924 Code: Knda ISO 15924 N°: 345 ISO 15924 English Name: Kannada Latin transliteration of the native script name: Native name of the script: ಕನ#ಡ Maximal Starting Repertoire (MSR) version: MSR-4 Some languages using the script and their ISO 639-3 codes: Kannada (kan), Tulu (tcy), Beary, Konkani (kok), Havyaka, Kodava (kfa) 1 Proposal for a Kannada Script Root Zone Label Generation Ruleset (LGR) 3. Background on Script and Principal Languages Using It 3.1 Kannada language Kannada is one of the scheduled languages of India. It is spoken predominantly by the people of Karnataka State of India. It is one of the major languages among the Dravidian languages. Kannada is also spoken by significant linguistic minorities in the states of Andhra Pradesh, Telangana, Tamil Nadu, Maharashtra, Kerala, Goa and abroad. -
Odisha As a Multicultural State: from Multiculturalism to Politics of Sub-Regionalism
Afro Asian Journal of Social Sciences Volume VII, No II. Quarter II 2016 ISSN: 2229 – 5313 ODISHA AS A MULTICULTURAL STATE: FROM MULTICULTURALISM TO POLITICS OF SUB-REGIONALISM Artatrana Gochhayat Assistant Professor, Department of Political Science, Sree Chaitanya College, Habra, under West Bengal State University, Barasat, West Bengal, India ABSTRACT The state of Odisha has been shaped by a unique geography, different cultural patterns from neighboring states, and a predominant Jagannath culture along with a number of castes, tribes, religions, languages and regional disparity which shows the multicultural nature of the state. But the regional disparities in terms of economic and political development pose a grave challenge to the state politics in Odisha. Thus, multiculturalism in Odisha can be defined as the territorial division of the state into different sub-regions and in terms of regionalism and sub- regional identity. The paper attempts to assess Odisha as a multicultural state by highlighting its cultural diversity and tries to establish the idea that multiculturalism is manifested in sub- regionalism. Bringing out the major areas of sub-regional disparity that lead to secessionist movement and the response of state government to it, the paper concludes with some suggestive measures. INTRODUCTION The concept of multiculturalism has attracted immense attention of the academicians as well as researchers in present times for the fact that it not only involves the question of citizenship, justice, recognition, identities and group differentiated rights of cultural disadvantaged minorities, it also offers solutions to the challenges arising from the diverse cultural groups. It endorses the idea of difference and heterogeneity which is manifested in the cultural diversity. -
Proposal for a Gujarati Script Root Zone Label Generation Ruleset (LGR)
Proposal for a Gujarati Root Zone LGR Neo-Brahmi Generation Panel Proposal for a Gujarati Script Root Zone Label Generation Ruleset (LGR) LGR Version: 3.0 Date: 2019-03-06 Document version: 3.6 Authors: Neo-Brahmi Generation Panel [NBGP] 1 General Information/ Overview/ Abstract The purpose of this document is to give an overview of the proposed Gujarati LGR in the XML format and the rationale behind the design decisions taken. It includes a discussion of relevant features of the script, the communities or languages using it, the process and methodology used and information on the contributors. The formal specification of the LGR can be found in the accompanying XML document: proposal-gujarati-lgr-06mar19-en.xml Labels for testing can be found in the accompanying text document: gujarati-test-labels-06mar19-en.txt 2 Script for which the LGR is proposed ISO 15924 Code: Gujr ISO 15924 Key N°: 320 ISO 15924 English Name: Gujarati Latin transliteration of native script name: gujarâtî Native name of the script: ગજુ રાતી Maximal Starting Repertoire (MSR) version: MSR-4 1 Proposal for a Gujarati Root Zone LGR Neo-Brahmi Generation Panel 3 Background on the Script and the Principal Languages Using it1 Gujarati (ગજુ રાતી) [also sometimes written as Gujerati, Gujarathi, Guzratee, Guujaratee, Gujrathi, and Gujerathi2] is an Indo-Aryan language native to the Indian state of Gujarat. It is part of the greater Indo-European language family. It is so named because Gujarati is the language of the Gujjars. Gujarati's origins can be traced back to Old Gujarati (circa 1100– 1500 AD).