Situation and Challenges of Technologies for Indigenous Languages of India
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Linguistic Survey of India Bihar
LINGUISTIC SURVEY OF INDIA BIHAR 2020 LANGUAGE DIVISION OFFICE OF THE REGISTRAR GENERAL, INDIA i CONTENTS Pages Foreword iii-iv Preface v-vii Acknowledgements viii List of Abbreviations ix-xi List of Phonetic Symbols xii-xiii List of Maps xiv Introduction R. Nakkeerar 1-61 Languages Hindi S.P. Ahirwal 62-143 Maithili S. Boopathy & 144-222 Sibasis Mukherjee Urdu S.S. Bhattacharya 223-292 Mother Tongues Bhojpuri J. Rajathi & 293-407 P. Perumalsamy Kurmali Thar Tapati Ghosh 408-476 Magadhi/ Magahi Balaram Prasad & 477-575 Sibasis Mukherjee Surjapuri S.P. Srivastava & 576-649 P. Perumalsamy Comparative Lexicon of 3 Languages & 650-674 4 Mother Tongues ii FOREWORD Since Linguistic Survey of India was published in 1930, a lot of changes have taken place with respect to the language situation in India. Though individual language wise surveys have been done in large number, however state wise survey of languages of India has not taken place. The main reason is that such a survey project requires large manpower and financial support. Linguistic Survey of India opens up new avenues for language studies and adds successfully to the linguistic profile of the state. In view of its relevance in academic life, the Office of the Registrar General, India, Language Division, has taken up the Linguistic Survey of India as an ongoing project of Government of India. It gives me immense pleasure in presenting LSI- Bihar volume. The present volume devoted to the state of Bihar has the description of three languages namely Hindi, Maithili, Urdu along with four Mother Tongues namely Bhojpuri, Kurmali Thar, Magadhi/ Magahi, Surjapuri. -
Language and Literature
1 Indian Languages and Literature Introduction Thousands of years ago, the people of the Harappan civilisation knew how to write. Unfortunately, their script has not yet been deciphered. Despite this setback, it is safe to state that the literary traditions of India go back to over 3,000 years ago. India is a huge land with a continuous history spanning several millennia. There is a staggering degree of variety and diversity in the languages and dialects spoken by Indians. This diversity is a result of the influx of languages and ideas from all over the continent, mostly through migration from Central, Eastern and Western Asia. There are differences and variations in the languages and dialects as a result of several factors – ethnicity, history, geography and others. There is a broad social integration among all the speakers of a certain language. In the beginning languages and dialects developed in the different regions of the country in relative isolation. In India, languages are often a mark of identity of a person and define regional boundaries. Cultural mixing among various races and communities led to the mixing of languages and dialects to a great extent, although they still maintain regional identity. In free India, the broad geographical distribution pattern of major language groups was used as one of the decisive factors for the formation of states. This gave a new political meaning to the geographical pattern of the linguistic distribution in the country. According to the 1961 census figures, the most comprehensive data on languages collected in India, there were 187 languages spoken by different sections of our society. -
Mapping India's Language and Mother Tongue Diversity and Its
Mapping India’s Language and Mother Tongue Diversity and its Exclusion in the Indian Census Dr. Shivakumar Jolad1 and Aayush Agarwal2 1FLAME University, Lavale, Pune, India 2Centre for Social and Behavioural Change, Ashoka University, New Delhi, India Abstract In this article, we critique the process of linguistic data enumeration and classification by the Census of India. We map out inclusion and exclusion under Scheduled and non-Scheduled languages and their mother tongues and their representation in state bureaucracies, the judiciary, and education. We highlight that Census classification leads to delegitimization of ‘mother tongues’ that deserve the status of language and official recognition by the state. We argue that the blanket exclusion of languages and mother tongues based on numerical thresholds disregards the languages of about 18.7 million speakers in India. We compute and map the Linguistic Diversity Index of India at the national and state levels and show that the exclusion of mother tongues undermines the linguistic diversity of states. We show that the Hindi belt shows the maximum divergence in Language and Mother Tongue Diversity. We stress the need for India to officially acknowledge the linguistic diversity of states and make the Census classification and enumeration to reflect the true Linguistic diversity. Introduction India and the Indian subcontinent have long been known for their rich diversity in languages and cultures which had baffled travelers, invaders, and colonizers. Amir Khusru, Sufi poet and scholar of the 13th century, wrote about the diversity of languages in Northern India from Sindhi, Punjabi, and Gujarati to Telugu and Bengali (Grierson, 1903-27, vol. -
Languages, Mother Tongues and Bilingualism in India : an Overview Workshop on FIELD LINGUISTICS Department of EFL, Tezpur University, Assam
Languages, Mother Tongues and Bilingualism in India : An Overview Workshop on FIELD LINGUISTICS Department of EFL, Tezpur University, Assam 02nd – 05th March, 2018 S. S. Bhattacharya, Former Senior Research Officer(L) & Head, Language Division, O/O The Registrar General, India MHA, Govt. of India, Kolkata [email protected] 1 India has a geographical area of 3,287,240 sq.km. and has more than one billion population (as per 2011 Census: 1, 21, 05, 69,573) distributed in 28 states (now 29) and 7 union territories (with 640 districts, 7933 towns and 640930 villages). 2 LANGUAGES IN DESCENDING ORDER OF STRENGTH – INDIA, STATES AND UNION TERRITORIES – 2001 Number of Persons who returned the Language as their mother tongue Language Total Percent to State Total Popula-tion INDIA 1Hindi 422,048,642 41.03 2 Bengali 83,369,769 8.11 3 Telugu 74,002,856 7.19 4 Marathi 71,936,894 6.99 5 Tamil 60,793,814 591 6 Urdu 51,536,111 5.01 7 Gujarati 46,091,617 4,48 8 Kannada 37,924,011 369 9 Malayalam 33,066,392 3.21 10 Oriya 33,017,446 3.21 11 Punjabi 29,102,477 2.83 12 Assamese 13,168,484 128 3 LANGUAGES IN DESCENDING ORDER OF STRENGTH – INDIA, STATES AND UNION TERRITORIES – 2001 Number of Persons who returned the Language as their mother tongue Language Total Percent to State Total Popula-tion 13 Maithili 12,179,122 1.18 14 Bhili/Bhilodi 9,582,957 093 15 Sanlali 6,469,600 0.63 16 Kashmiri 5,527,698 0.54 17 Nepali 2,871,749 0.28 18 Gondi 2,713,790 026 19 Sindhi 2,535,485 0.25 20 Konkani 2,489,015 0.24 21 Dogri 2,282,589 022 22 Khandeshi 2,075,258 -
UNIT 2 INDIAN LANGUAGES Notes STRUCTURE 2.0 Introduction
Indian Languages UNIT 2 INDIAN LANGUAGES Notes STRUCTURE 2.0 Introduction 2.1 Learning Objectives 2.2 Linguistic Diversity in India 2.2.1 A picture of India’s linguistic diversity 2.2.2 Language Families of India and India as a Linguistic Area 2.3 What does the Indian Constitution say about Languages? 2.4 Categories of Languages in India 2.4.1 Scheduled Languages 2.4.2 Regional Languages and Mother Tongues 2.4.3 Classical Languages 2.4.4 Is there a Difference between Language and Dialect? 2.5 Status of Hindi in India 2.6 Status of English in India 2.7 The Language Education Policy in India Provisions of Various Committees and Commissions Three Language Formula National Curriculum Framework-2005 2.8 Let Us Sum Up 2.9 Suggested Readings and References 2.10 Unit-End Exercises 2.0 INTRODUCTION You must have heard this song: agrezi mein kehte hein- I love you gujrati mein bole- tane prem karu chhuun 18 Diploma in Elementary Education (D.El.Ed) Indian Languages bangali mein kehte he- amii tumaake bhaalo baastiu aur punjabi me kehte he- tere bin mar jaavaan, me tenuu pyar karna, tere jaiyo naiyo Notes labnaa Songs of this kind is only one manifestation of the diversity and fluidity of languages in India. We are sure you can think of many more instances where you notice a multiplicity of languages being used at the same place at the same time. Imagine a wedding in Delhi in a Telugu family where Hindi, Urdu, Dakkhini, Telugu, English and Sanskrit may all be used in the same event. -
Spotlight on Tamil
Heritage Voices: Languages Urdu ABOUT THE URDU LANGUAGE Urdu is an Indo-Aryan language that serves as the primary, secondary, or tertiary language of communication for millions of individuals in Pakistan, India, and sizable migrant communities in the Persian Gulf, the United Kingdom, and the United States. Syntactically Urdu and Hindi are identical, with Urdu incorporating a heavier loan vocabulary from Persian, Arabic, and Turkic and retaining the original spelling of words and sounds from these languages. Speakers of both languages are able to communicate with each other at an informal level without much difficulty. In more formal situations and in higher registers, the two languages diverge significantly. Urdu is written in the Perso-Arabic script called Nasta'liq, which was modified and expanded to incorporate a distinctively South Asian phonology. While Urdu is one of the world's leading languages of Muslim erudition, some of its leading protagonists have been Hindu and Sikh authors taking full advantage of Urdu's rich expressive medium. Urdu has developed a preeminent position in South Asia as a language of literary genius as well as a major medium of communication in the daily lives of people. It is an official language of Pakistan, where it is a link language and probably the most widely understood language across all regions, which also have their own languages (Pashto, Balochi, Sindhi, and Punjabi). It is also one of the national languages of India. Urdu is widely understood by Afghans in Pakistan and Afghanistan, who acquire the language through media, business, and family connections in Pakistan. Urdu serves as a lingua franca for many in South Asia, especially for Muslims. -
Unit 4- Language Schedule in the Constitution Official Language - Constitutional/Statutory Provisions
UNIT 4- LANGUAGE SCHEDULE IN THE CONSTITUTION OFFICIAL LANGUAGE - CONSTITUTIONAL/STATUTORY PROVISIONS Article 343(1) of the Constitution provides that Hindi in Devanagari script shall be the Official Language of the Union. Article 343(2) also provided for continuing the use of English in official work of the Union for a period of 15 years (i.e., up to 25 January 1965) from the date of commencement of the Constitution. Article 343(3) empowered the parliament to provide by law for continued use of English for official purposes even after 25 January 1965. Accordingly, section 3(2) of the Official Languages Act, 1963 (amended in 1967) provides for continuing the use of English in official work even after 25 January 1965. The Act also lays down that both Hindi and English shall compulsorily be used for certain specified purposes such as Resolutions, General Orders, Rules, Notifications, Administrative and other Reports, Press Communiqués; Administrative and other Reports and Official Papers to be laid before a House or the Houses of Parliament; Contracts, Agreements, Licenses, Permits, Tender Notices and Forms of Tender, etc. SCHEDULED AND NON-SCHEDULED LANGUAGES In India, language status planning occurred through “officialization” (recognition as a scheduled language) in a special section of the Constitution, the 8th Schedule of the Constitution. This Schedule’s original purpose was stated in Article 351 of the Constitution in relation to the corpus planning of Hindi: “It shall be the duty of the Union to promote the spread of the Hindi language, -
The Impact of Persian Language on Indian Languages
ISSN 1799-2591 Theory and Practice in Language Studies, Vol. 4, No. 11, pp. 2360-2365, November 2014 © 2014 ACADEMY PUBLISHER Manufactured in Finland. doi:10.4304/tpls.4.11.2360-2365 The Impact of Persian Language on Indian Languages Ali Akbar Khansir Bushehr University of Medical Sciences, Bushehr, Iran Nasrin Mozafari Bushehr University of Medical Sciences, Bushehr, Iran Abstract—The purpose of the paper is to examine the impact of Persian language on many Indian Languages like Bengali, Punjabi, Urdu, Gujarati, Telugu, and Hindi etc. For historical reasons, Indian languages have borrowed a great number of Persian words and phrases from Persian language that they have been using in the society of India. Old Persian Language, Middle Persian Language, and Modern Persian Language will be examined .Many Indian languages and Persian languages are part of the Indo-European languages. India and Iran have always had close relationship with each other; this relationship has been observed even before the advent of Islam. The time of Persians king was available trade ties between two nations who travelled with their ships and passed from Persian Gulf to Indian Ocean. The above Indian languages will be discussed briefly in this paper. Index Terms—Persian language, Indian languages, old Persian language, middle Persian language, modern Persian language I. INTRODUCTION The aim of this brief article is to highlight the impact of Persian language on Indian Languages. Gleason (1961) pointed out that "Indo-European is the largest and most important language family, from the point of view of both the social importance of the major language in the group, and their interest to linguists. -
The Official Language of the Union
THE ECONOMIC WEEKLY October 19, 1957 The Official Language of the Union K C Sen ON the attainment of indepen- The Terms of Reference missions in the States, Education dence, India was faced with On the June 7, 1955 the Presi is a subject, within the jurisdiction the problem of having a language dent appointed a Commission con of the States. The Commission has of the Union. This was no easy sisting of 20 persons, under the come to the conclusion that ulti task in a country the Constitution Chairmanship of the late Shri B G mately the judgments, decrees and of which has recognised the follow Kher, for the purposes (a) to (d) orders of the High Courts and the ing 14 languages : Assamese, Ben above, and as regards purpose (e), Supreme Courts will have to be in gali, Gujarati, Hindi, Kannada, the following duty was cast on the Hindi; that Hindi will have to be Kashmiri, Malayalam, Marathi, Commission, "The preparation or a compulsorily taught in all the States, Oriya, Punjabi, Sanskrit, Tamil, time-schedule according to which although for the purposes of a Telugu, and Urdu. Three things and the manner in which Hindi may State the Government institutions may be noticed about this list en gradually replace English as the and administration are expected to passant: (a) Sanskrit, which, ac official language of the Union and use the regional languages; and the cording to the 1951 census, is as a language for communication Commission has devoted two Chap spoken by only 555 persons, is re between the Union and State ters (Chapters XIII &. -
A Sandhi Splitter for Malayalam
A Sandhi Splitter for Malayalam Devadath V V Litton J Kurisinkel Dipti Misra Sharma Vasudeva Varma Language Technology Research Centre International Institute of Information Technology - Hyderabad, India. devadathv.v,litton.jkurisinkel @research.iiit.ac.in, dipti,vasu @iiit.ac.in { } { } Abstract occur at the point of joining. The presence of Sandhi is abundant in Sanskrit and all Dravidian Sandhi splitting is the primary task for languages. When compared to other Dravidian computational processing of text in San- languages, the presence of Sandhi is relatively skrit and Dravidian languages. In these high in Malayalam. Even a full sentence may languages, words can join together with exist as a single string due to the process of morpho-phonemic changes at the point of Sandhi. For example, Ae\ncnWmm (avanaaraaN) joining. This phenomenon is known as is a sentence in Malayalam which means “Who Sandhi. Sandhi splitter splits the string is he ?”. It is composed of 3 independent words, of conjoined words into individual words. namely Ae°(avan (he)), Bcm(aar(who)) and Accurate execution of sandhi splitting is BWmm (aaN(is)). However, ambiguous splits for crucial for text processing tasks such as a word is very less in Malayalam. Sandhis are POS tagging, topic modelling and doc- of two types, Internal and External. Internal ument indexing. We have tried differ- Sandhi exists between a root or a stem with a ent approaches to address the challenges suffix or a morpheme. In the example given below, of sandhi splitting in Malayalam, and fi- nally, we have thought of exploiting the ]l(para) + Dè(unnu) = ]lÆè(parayunnu) phonological changes that take place in the words while joining. -
Strength for Today and Bright Hope for Tomorrow Volume 11: 2 February
LANGUAGE IN INDIA Strength for Today and Bright Hope for Tomorrow Volume 11 : 2 February 2011 ISSN 1930-2940 Managing Editor: M. S. Thirumalai, Ph.D. Editors: B. Mallikarjun, Ph.D. Sam Mohanlal, Ph.D. B. A. Sharada, Ph.D. A. R. Fatihi, Ph.D. Lakhan Gusain, Ph.D. Jennifer Marie Bayer, Ph.D. S. M. Ravichandran, Ph.D. G. Baskaran, Ph.D. L. Ramamoorthy, Ph.D. Provision for Linguistic Diversity and Linguistic Minorities in India Vanishree V.M., MAPL and ELT, M.A., PGDHRM. Dissertation submitted in partial fulfillment of the requirements for the award of the degree of MA in Applied Linguistics and English Language Teaching, St. Mary‟s University College, Strawberry Hill, London. Language in India www.languageinindia.com 304 11 : 2 February 2011 Vanishree V.M., MAPL and ELT, M.A., PGDHRM. Provision for Linguistic Diversity and Linguistic Minorities in India – Masters Dissertation India is one of the most linguistically diverse countries in the world. Most languages in India belong to one of the four language families: Indo-Aryan, Dravidian, Tibeto-Burmese and Austro- Asiatic. According to the 2001 Indian Census there are a total of 122 languages and 234 mother tongues. However, these figures cannot be accepted as final as the Census does not report languages spoken by fewer than 10,000 speakers (for instance, in 1961 around 1652 mother tongues were returned in the Census but only 193 languages were classified). When India gained its Independence in 1947, the framers of the Constitution had a tremendous task to do. They had to put together a Constitution that not only preserved political unity, but also acknowledged and promoted cultural and linguistic diversity. -
Languages of India Being a Reprint of Chapter on Languages
THE LANGUAGES OF INDIA BEING A :aEPRINT OF THE CHAPTER ON LANGUAGES CONTRIBUTED BY GEORGE ABRAHAM GRIERSON, C.I.E., PH.D., D.LITT., IllS MAJESTY'S INDIAN CIVIL SERVICE, TO THE REPORT ON THE OENSUS OF INDIA, 1901, TOGETHER WITH THE CENSUS- STATISTIOS OF LANGUAGE. CALCUTTA: OFFICE OF THE SUPERINTENDENT OF GOVERNMENT PRINTING, INDIA. 1903. CALcuttA: GOVERNMENT OF INDIA. CENTRAL PRINTING OFFICE, ~JNGS STRERT. CONTENTS. ... -INTRODUCTION . • Present Knowledge • 1 ~ The Linguistio Survey 1 Number of Languages spoken ~. 1 Ethnology and Philology 2 Tribal dialects • • • 3 Identification and Nomenolature of Indian Languages • 3 General ammgemont of Chapter • 4 THE MALAYa-POLYNESIAN FAMILY. THE MALAY GROUP. Selung 4 NicobaresB 5 THE INDO-CHINESE FAMILY. Early investigations 5 Latest investigations 5 Principles of classification 5 Original home . 6 Mon-Khmers 6 Tibeto-Burmans 7 Two main branches 7 'fibeto-Himalayan Branch 7 Assam-Burmese Branch. Its probable lines of migration 7 Siamese-Chinese 7 Karen 7 Chinese 7 Tai • 7 Summary 8 General characteristics of the Indo-Chinese languages 8 Isolating languages 8 Agglutinating languages 9 Inflecting languages ~ Expression of abstract and concrete ideas 9 Tones 10 Order of words • 11 THE MON-KHME& SUB-FAMILY. In Further India 11 In A.ssam 11 In Burma 11 Connection with Munds, Nicobar, and !lalacca languages 12 Connection with Australia • 12 Palaung a Mon- Khmer dialect 12 Mon. 12 Palaung-Wa group 12 Khaasi 12 B2 ii CONTENTS THE TIllETO-BuRMAN SUll-FAMILY_ < PAG. Tibeto-Himalayan and Assam-Burmese branches 13 North Assam branch 13 ~. Mutual relationship of the three branches 13 Tibeto-H imalayan BTanch.