Speakar-Independent Isolated Word Recognition Using HTK for Varhadi – a Dialect of Marathi
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Some Principles of the Use of Macro-Areas Language Dynamics &A
Online Appendix for Harald Hammarstr¨om& Mark Donohue (2014) Some Principles of the Use of Macro-Areas Language Dynamics & Change Harald Hammarstr¨om& Mark Donohue The following document lists the languages of the world and their as- signment to the macro-areas described in the main body of the paper as well as the WALS macro-area for languages featured in the WALS 2005 edi- tion. 7160 languages are included, which represent all languages for which we had coordinates available1. Every language is given with its ISO-639-3 code (if it has one) for proper identification. The mapping between WALS languages and ISO-codes was done by using the mapping downloadable from the 2011 online WALS edition2 (because a number of errors in the mapping were corrected for the 2011 edition). 38 WALS languages are not given an ISO-code in the 2011 mapping, 36 of these have been assigned their appropri- ate iso-code based on the sources the WALS lists for the respective language. This was not possible for Tasmanian (WALS-code: tsm) because the WALS mixes data from very different Tasmanian languages and for Kualan (WALS- code: kua) because no source is given. 17 WALS-languages were assigned ISO-codes which have subsequently been retired { these have been assigned their appropriate updated ISO-code. In many cases, a WALS-language is mapped to several ISO-codes. As this has no bearing for the assignment to macro-areas, multiple mappings have been retained. 1There are another couple of hundred languages which are attested but for which our database currently lacks coordinates. -
DEPARTMENT of MARATHI Faculty's of Marathi Department
DEPARTMENT OF MARATHI Faculty’s of Marathi Department Prof. Kalawati B. Mohod Dr. Prashant W. Dhanvij M.A.,B.Ed. M.A., M.Phil., Ph.D.(NET) Associate Professor Assistant Professor Date of Joining: 01 October 1992 Date of Joining: 14 January 2009 About Marathi Language Introduction Marathi is an Indo-Aryan language spoken predominantly by Marathi people of Maharashtra. It is the official language and co-official language in Maharashtra and Goa states of Western India respectively, and it is among the 23 official Languages of India. There were 73 million speakers in 2001; Marathi ranks 19th in the list of most spoken languages in the world. Marathi has the fourth largest number of native speakers in India. Marathi has some of the oldest literature of all modern Indo-Aryan languages, dating from about 900 AD. The major dialects of Marathi are Standard Marathi and the Varhadi dialect. There are other related languages such as Khandeshi, Dangi, Vadavali and Samavedi. Malvani Konkani has been heavily influenced by Marathi varieties. Geographic Distribution Marathi is primarily spoken in Maharashtra and parts of neighbouring states of Gujrat, Madhya Pradesh, Goa, Karnataka, Chhattisgarh and Andhra Pradesh, union-territories of Daman and Div and Dadra and Nagar Haveli. The cities of Baroda, Surat and Ahmedabad (Gujrat), Belgaum (Karnataka), Indore, Gwalior (Madhya Pradesh), Hydrabad and Tanjore (Tamil Nadu) each have sizable Marathi-speaking communities. Marathi is also spoken by Maharashtrian emigrants worldwide, especially in the United States, United Kingdom, Israel, Mauritius and Canada. Official Status Marathi is the official language of Maharashtra and co-official language in the union territories of Daman and Diu and Dadra and Nagar Haveli. -
Diachrony of Ergative Future
• THE EVOLUTION OF THE TENSE-ASPECT SYSTEM IN HINDI/URDU: THE STATUS OF THE ERGATIVE ALGNMENT Annie Montaut INALCO, Paris Proceedings of the LFG06 Conference Universität Konstanz Miriam Butt and Tracy Holloway King (Editors) 2006 CSLI Publications http://csli-publications.stanford.edu/ Abstract The paper deals with the diachrony of the past and perfect system in Indo-Aryan with special reference to Hindi/Urdu. Starting from the acknowledgement of ergativity as a typologically atypical feature among the family of Indo-European languages and as specific to the Western group of Indo-Aryan dialects, I first show that such an evolution has been central to the Romance languages too and that non ergative Indo-Aryan languages have not ignored the structure but at a certain point went further along the same historical logic as have Roman languages. I will then propose an analysis of the structure as a predication of localization similar to other stative predications (mainly with “dative” subjects) in Indo-Aryan, supporting this claim by an attempt of etymologic inquiry into the markers for “ergative” case in Indo- Aryan. Introduction When George Grierson, in the full rise of language classification at the turn of the last century, 1 classified the languages of India, he defined for Indo-Aryan an inner circle supposedly closer to the original Aryan stock, characterized by the lack of conjugation in the past. This inner circle included Hindi/Urdu and Eastern Panjabi, which indeed exhibit no personal endings in the definite past, but only gender-number agreement, therefore pertaining more to the adjectival/nominal class for their morphology (calâ, go-MSG “went”, kiyâ, do- MSG “did”, bola, speak-MSG “spoke”). -
Unit 1 'Propositions' by Sujit Mukherjee
‘Propositions’ by Sujit UNIT 1 ‘PROPOSITIONS’ BY SUJIT Mukherjee MUKHERJEE Structure 1.0 Objectives 1.1 Introduction 1.1.1 Analysis of the Points Listed in the Introduction 1.2 About the Author, Sujit Mukherjee 1.3 Some Excerpts from Sujit Mukherjee’s ‘Propositions’ 1.4 The Need for a New Methodology to write Indian Literary History 1.4.1 Textual Analysis 1.5 Let Us Sum Up 1.6 Aids to Activities 1.7 Glossary 1.8 Unit End Questions 1.9 References and Suggested Reading 1.0 OBJECTIVES By the end of this unit, you will be able to understand why Sujit Mukherjee wants a new methodology to write Indian literary history learn about his new methodology, taking into account the numerous Indian languages, their literary traditions and the multiple mutual influences of Indian languages on one another discover the running undercurrent of one single Indian literary culture among different Indian languages and literatures Words in bold are explained in the Glossary 1.1 INTRODUCTION Eminent scholars and commentators have criticized the step motherly treatment given to Indian literary studies and traditions vis-à-vis Western literature and culture in our universities and have attributed this lapse to four reasons: 1) The colonial hangover resulting in love for western culture and a continuation of English studies to the neglect of studies in Indian culture and literature. 2) The emphasis on ‘modernity’ that has diminished the importance of Indian literature which is mistakenly viewed as an ancient tradition and therefore anti-modern. 3) The advent of -
Phonology of Marathi-Hindi Contact in Eastern Vidarbha
Volume II, Issue VII, November 2014 - ISSN 2321-7065 Change in Progress: Phonology of Marathi-Hindi contact in (astern Vidarbha Rahul N. Mhaiskar Assistant Professor Department of Linguistics, Deccan Collage Post Graduate and Research Institute, Yerwada Pune Abstract Language contact studies the multilingualism and bilingualism in particular language society. There are two division of vidarbha, Nagpur (eastern) and Amaravati (western). It occupied 21.3% of area of the state of Maharashtra. This Area is connected to Hindi speaking states, Madhya Pradesh and Chhattisgarh. When speakers of different languages interact closely, naturally their languages influence each other. There is a close interaction between the speakers of Hindi and local Marathi and they are influencing each other’s languages. A majority of Vidarbhians speak Varhadi, a dialect of Marathi. The present paper aims to look into the phonological nature of Marathi-Hindi contact in eastern Vidarbha. Standard Marathi is used as a medium of instruction for education and Hindi is used for informal communication in this area. This situation creates phonological variation in Marathi for instance: the standard Marathi ‘c’ is pronounced like Hindi ‘þ’. Eg. Marathi Local Variety Hindi pac paþ panþ ‘five’ cor þor þor ‘thief’ Thus, this paper will be an attempt to examine the phonological changes occurs due to influence of Hindi. Keywords: Marathi-Hindi, Language Contact, Phonology, Vidarbha. http://www.ijellh.com 63 Volume II, Issue VII, November 2014 - ISSN 2321-7065 Introduction: Multilingualism and bilingualism is likely to be common phenomenon throughout the world. There are so many languages spoken in the world where there is much variation in language over short distances. -
Language and Literature
1 Indian Languages and Literature Introduction Thousands of years ago, the people of the Harappan civilisation knew how to write. Unfortunately, their script has not yet been deciphered. Despite this setback, it is safe to state that the literary traditions of India go back to over 3,000 years ago. India is a huge land with a continuous history spanning several millennia. There is a staggering degree of variety and diversity in the languages and dialects spoken by Indians. This diversity is a result of the influx of languages and ideas from all over the continent, mostly through migration from Central, Eastern and Western Asia. There are differences and variations in the languages and dialects as a result of several factors – ethnicity, history, geography and others. There is a broad social integration among all the speakers of a certain language. In the beginning languages and dialects developed in the different regions of the country in relative isolation. In India, languages are often a mark of identity of a person and define regional boundaries. Cultural mixing among various races and communities led to the mixing of languages and dialects to a great extent, although they still maintain regional identity. In free India, the broad geographical distribution pattern of major language groups was used as one of the decisive factors for the formation of states. This gave a new political meaning to the geographical pattern of the linguistic distribution in the country. According to the 1961 census figures, the most comprehensive data on languages collected in India, there were 187 languages spoken by different sections of our society. -
The Communalization and Disintegration of Urdu in Anita Desai’S in Custody 1
The Communalization and Disintegration of Urdu in Anita Desai’s In Custody 1 Introduction T of Urdu in India is an extremely layered one which needs to be examined historically, politically and ideologically in order to grasp the various forces which have shaped its current perception as a sectarian language adopted by Indian Muslims, marking their separation from the national collectivity. In this article I wish to explore these themes through the lens of literature, specifically an Indian English novel about Urdu entitled In Custody by Anita Desai. Writing in the early s, Aijaz Ahmad was of the opinion that the teaching of English literature has cre- ated a body of English-speaking Indians who represent “the only” over- arching national community with a common language, able to imagine themselves across the disparate nation as a “national literary intelligentsia” with “a shared body of knowledge, shared presumptions and a shared knowledge of mutual exchange” (, ).2 Arguably both Desai and Ahmad belong to this “intelligentsia” through the postcolonial secular English connection, but equally they are implicated in the discursive structures of cultural hegemony in civil society (Viswanathan , –; Rajan , –). However, it is not my intention to re-inscribe an authentic myth of origin about Indianness through linguistic associations, 1An earlier version of this essay was first presented as a paper at the Minori- ties, Education and Language in st Century Indian Democracy—The Case of Urdu with Special Reference to Dr. Zakir Husain, Late President of India Con- ference held in Delhi, February . 2See also chapter “‘Indian Literature’: Notes Toward the Definition of a Category,” in the same work, –. -
International Multidisciplinary Research Journal
Vol 4 Issue 9 March 2015 ISSN No :2231-5063 InternationaORIGINALl M ARTICLEultidisciplinary Research Journal Golden Research Thoughts Chief Editor Dr.Tukaram Narayan Shinde Associate Editor Publisher Dr.Rajani Dalvi Mrs.Laxmi Ashok Yakkaldevi Honorary Mr.Ashok Yakkaldevi Welcome to GRT RNI MAHMUL/2011/38595 ISSN No.2231-5063 Golden Research Thoughts Journal is a multidisciplinary research journal, published monthly in English, Hindi & Marathi Language. All research papers submitted to the journal will be double - blind peer reviewed referred by members of the editorial board.Readers will include investigator in universities, research institutes government and industry with research interest in the general subjects. International Advisory Board Flávio de São Pedro Filho Mohammad Hailat Hasan Baktir Federal University of Rondonia, Brazil Dept. of Mathematical Sciences, English Language and Literature University of South Carolina Aiken Department, Kayseri Kamani Perera Regional Center For Strategic Studies, Sri Abdullah Sabbagh Ghayoor Abbas Chotana Lanka Engineering Studies, Sydney Dept of Chemistry, Lahore University of Management Sciences[PK] Janaki Sinnasamy Ecaterina Patrascu Librarian, University of Malaya Spiru Haret University, Bucharest Anna Maria Constantinovici AL. I. Cuza University, Romania Romona Mihaila Loredana Bosca Spiru Haret University, Romania Spiru Haret University, Romania Ilie Pintea, Spiru Haret University, Romania Delia Serbescu Fabricio Moraes de Almeida Spiru Haret University, Bucharest, Federal University of Rondonia, Brazil Xiaohua Yang Romania PhD, USA George - Calin SERITAN Anurag Misra Faculty of Philosophy and Socio-Political ......More DBS College, Kanpur Sciences Al. I. Cuza University, Iasi Titus PopPhD, Partium Christian University, Oradea,Romania Editorial Board Pratap Vyamktrao Naikwade Iresh Swami Rajendra Shendge ASP College Devrukh,Ratnagiri,MS India Ex - VC. -
A Study on Divergence in Malayalam and Tamil Language in Machine Translation Perceptive
A Study on Divergence in Malayalam and Tamil Language in Machine Translation Perceptive Jisha P Jayan Elizabeth Sherly Virtual Resource Centre for Virtual Resource Centre for Language Computing Language Computing IIITM-K,Trivandrum IIITM-K,Trivandrum [email protected] [email protected] Abstract others are specific with respect to the language pair (Lavanya et al., 2005; Saboor and Khan, Machine Translation has made significant 2010). Hence, the divergence in the translation achievements for the past decades. How- need to be studied both perspectives that is across ever, in many languages, the complex- the languages and language specific pair (Sinha, ity with its rich inflection and agglutina- 2005).The most problematic area in translation is tion poses many challenges, that forced the lexicon and the role it plays in the act of creat- for manual translation to make the cor- ing deviations in sense and reference based on the pus available. The divergence in lexi- context of its occurrence in texts (Dash, 2013). cal, syntactic and semantic in any pair Indian languages come under Indo-Aryan or of languages makes machine translation Dravidian scripts. Though there are similarities more difficult. And many systems still de- in scripts, there are many issues and challenges pend on rules heavily, that deteriates sys- in translation between languages such as lexical tem performance. In this paper, a study divergences, ambiguities, lexical mismatches, re- on divergence in Malayalam-Tamil lan- ordering, syntactic and semantic issues, structural guages is attempted at source language changes etc. Human translators try to choose the analysis to make translation process easy. -
Map by Steve Huffman; Data from World Language Mapping System
Svalbard Greenland Jan Mayen Norwegian Norwegian Icelandic Iceland Finland Norway Swedish Sweden Swedish Faroese FaroeseFaroese Faroese Faroese Norwegian Russia Swedish Swedish Swedish Estonia Scottish Gaelic Russian Scottish Gaelic Scottish Gaelic Latvia Latvian Scots Denmark Scottish Gaelic Danish Scottish Gaelic Scottish Gaelic Danish Danish Lithuania Lithuanian Standard German Swedish Irish Gaelic Northern Frisian English Danish Isle of Man Northern FrisianNorthern Frisian Irish Gaelic English United Kingdom Kashubian Irish Gaelic English Belarusan Irish Gaelic Belarus Welsh English Western FrisianGronings Ireland DrentsEastern Frisian Dutch Sallands Irish Gaelic VeluwsTwents Poland Polish Irish Gaelic Welsh Achterhoeks Irish Gaelic Zeeuws Dutch Upper Sorbian Russian Zeeuws Netherlands Vlaams Upper Sorbian Vlaams Dutch Germany Standard German Vlaams Limburgish Limburgish PicardBelgium Standard German Standard German WalloonFrench Standard German Picard Picard Polish FrenchLuxembourgeois Russian French Czech Republic Czech Ukrainian Polish French Luxembourgeois Polish Polish Luxembourgeois Polish Ukrainian French Rusyn Ukraine Swiss German Czech Slovakia Slovak Ukrainian Slovak Rusyn Breton Croatian Romanian Carpathian Romani Kazakhstan Balkan Romani Ukrainian Croatian Moldova Standard German Hungary Switzerland Standard German Romanian Austria Greek Swiss GermanWalser CroatianStandard German Mongolia RomanschWalser Standard German Bulgarian Russian France French Slovene Bulgarian Russian French LombardRomansch Ladin Slovene Standard -
Grapheme-To-Phoneme Tools for the Marathi Speech Synthesis
Sangramsing Kayte et al. Int. Journal of Engineering Research and Applications www.ijera.com ISSN: 2248-9622, Vol. 5, Issue 11, (Part - 4) November 2015, pp.87-92 RESEARCH ARTICLE OPEN ACCESS Grapheme-To-Phoneme Tools for the Marathi Speech Synthesis Sangramsing N. Kayte1, Monica Mundada1, Dr. Charansing N. Kayte2, Dr.Bharti Gawali* 1,3Department of Computer Science and Information Technology Dr. Babasaheb Ambedkar Marathwada University, Aurangabad 2Department of Digital and Cyber Forensic, Aurangabad, Maharashtra ABSTRACT We describe in detail a Grapheme-to-Phoneme (G2P) converter required for the development of a good quality Marathi Text-to-Speech (TTS) system. The Festival and Festvox framework is chosen for developing the Marathi TTS system. Since Festival does not provide complete language processing support specie to various languages, it needs to be augmented to facilitate the development of TTS systems in certain new languages. Because of this, a generic G2P converter has been developed. In the customized Marathi G2P converter, we have handled schwa deletion and compound word extraction. In the experiments carried out to test the Marathi G2P on a text segment of 2485 words, 91.47% word phonetisation accuracy is obtained. This Marathi G2P has been used for phonetising large text corpora which in turn is used in designing an inventory of phonetically rich sentences. The sentences ensured a good coverage of the phonetically valid di-phones using only 1.3% of the complete text corpora. Keywords: Grapheme-to-Phoneme (G2P), TTS, Festival, festvox, di-phone, ICT. I. INTRODUCTION foremost reason for this is that developing a TTS Marathi is an Indo-Aryan language spoken system in a new language needs inputs for resolving predominantly by Marathi people of Maharashtra. -
Map by Steve Huffman Data from World Language Mapping System 16
Tajiki Tajiki Tajiki Shughni Southern Pashto Shughni Tajiki Wakhi Wakhi Wakhi Mandarin Chinese Sanglechi-Ishkashimi Sanglechi-Ishkashimi Wakhi Domaaki Sanglechi-Ishkashimi Khowar Khowar Khowar Kati Yidgha Eastern Farsi Munji Kalasha Kati KatiKati Phalura Kalami Indus Kohistani Shina Kati Prasuni Kamviri Dameli Kalami Languages of the Gawar-Bati To rw al i Chilisso Waigali Gawar-Bati Ushojo Kohistani Shina Balti Parachi Ashkun Tregami Gowro Northwest Pashayi Southwest Pashayi Grangali Bateri Ladakhi Northeast Pashayi Southeast Pashayi Shina Purik Shina Brokskat Aimaq Parya Northern Hindko Kashmiri Northern Pashto Purik Hazaragi Ladakhi Indian Subcontinent Changthang Ormuri Gujari Kashmiri Pahari-Potwari Gujari Bhadrawahi Zangskari Southern Hindko Kashmiri Ladakhi Pangwali Churahi Dogri Pattani Gahri Ormuri Chambeali Tinani Bhattiyali Gaddi Kanashi Tinani Southern Pashto Ladakhi Central Pashto Khams Tibetan Kullu Pahari KinnauriBhoti Kinnauri Sunam Majhi Western Panjabi Mandeali Jangshung Tukpa Bilaspuri Chitkuli Kinnauri Mahasu Pahari Eastern Panjabi Panang Jaunsari Western Balochi Southern Pashto Garhwali Khetrani Hazaragi Humla Rawat Central Tibetan Waneci Rawat Brahui Seraiki DarmiyaByangsi ChaudangsiDarmiya Western Balochi Kumaoni Chaudangsi Mugom Dehwari Bagri Nepali Dolpo Haryanvi Jumli Urdu Buksa Lowa Raute Eastern Balochi Tichurong Seke Sholaga Kaike Raji Rana Tharu Sonha Nar Phu ChantyalThakali Seraiki Raji Western Parbate Kham Manangba Tibetan Kathoriya Tharu Tibetan Eastern Parbate Kham Nubri Marwari Ts um Gamale Kham Eastern