Disambiguation of Pre/Post Positions in English Malayalam Text Translation
Total Page:16
File Type:pdf, Size:1020Kb
Recommended publications
-
The Communalization and Disintegration of Urdu in Anita Desai’s In Custody
The Communalization and Disintegration of Urdu in Anita Desai’s In Custody 1

Introduction

T of Urdu in India is an extremely layered one which needs to be examined historically, politically and ideologically in order to grasp the various forces which have shaped its current perception as a sectarian language adopted by Indian Muslims, marking their separation from the national collectivity. In this article I wish to explore these themes through the lens of literature, specifically an Indian English novel about Urdu entitled In Custody by Anita Desai. Writing in the early s, Aijaz Ahmad was of the opinion that the teaching of English literature has created a body of English-speaking Indians who represent “the only” overarching national community with a common language, able to imagine themselves across the disparate nation as a “national literary intelligentsia” with “a shared body of knowledge, shared presumptions and a shared knowledge of mutual exchange” (, ).2 Arguably both Desai and Ahmad belong to this “intelligentsia” through the postcolonial secular English connection, but equally they are implicated in the discursive structures of cultural hegemony in civil society (Viswanathan , –; Rajan , –). However, it is not my intention to re-inscribe an authentic myth of origin about Indianness through linguistic associations,

1 An earlier version of this essay was first presented as a paper at the Minorities, Education and Language in st Century Indian Democracy—The Case of Urdu with Special Reference to Dr. Zakir Husain, Late President of India Conference held in Delhi, February .

2 See also chapter “‘Indian Literature’: Notes Toward the Definition of a Category,” in the same work, –. -
Names of Foodstuffs in Indian Languages
NAMES OF FOODSTUFFS IN INDIAN LANGUAGES

CEREAL GRAINS AND PRODUCTS

1. Pearl Millet: Pennisetum typhoides. Bajra (Bengali, Hindi, Oriya), Bajri (Gujarati, Marathi), Sajje (Kannada), Bajr’u (Kashmiri), Cambu (Malayalam, Tamil), Sazzalu (Telugu). Other names: Spiked millet, Pearl millet
2. Italian millet: Setaria italica. Syama dhan (Bengali), Ral Kang (Gujarati), Kangni (Hindi), Thene (Kannada), Shol (Kashmiri), Thina (Malayalam), Rala (Marathi), Kaon (Punjabi), Thenai (Tamil), Korralu (Telugu). Other names: Foxtail millet, Moha millet, Kakan kora
3. Sorghum: Sorghum bicolor. Juar (Bengali, Gujarati, Hindi), Jola (Kannada), Cholam (Malayalam, Tamil), Jwari (Marathi), Janha (Oriya), Jonnalu (Telugu). Other names: Milo, Chari
4. Maize: Zea mays. Bhutta (Bengali), Makai (Gujarati), Maka (Hindi, Marathi, Oriya), Musikinu jola (Kannada), Makaa’y (Kashmiri), Cholam (Malayalam), Makka Cholam (Tamil), Mokka jonnalu (Telugu)
5. Finger Millet: Eleusine coracana. Madua (Bengali, Hindi), Bhav (Gujarati), Ragi (Kannada), Moothari (Malayalam), Nachni (Marathi), Mandia (Oriya), Kezhvaragu (Tamil), Ragulu (Telugu). Other names: Korakan
6. Rice, parboiled: Oryza sativa. Siddha chowl (Bengali), Ukadello chokha (Gujarati), Usna chawal (Hindi), Kusubalakki (Kannada), Puzhungal ari (Malayalam), Ukadla tandool (Marathi), Usuna chaula (Oriya), Puzhungal arisi (Tamil), Uppudu biyyam (Telugu)
7. Rice, raw: Oryza sativa. Chowl (Bengali), Chokha (Gujarati), Chawal (Hindi), Akki (Kannada), Tomul (Kashmiri), Ari (Malayalam), Tandool (Marathi), Chaula (Oriya), Arisi -
Exploring Language Similarities with Dimensionality Reduction Techniques
EXPLORING LANGUAGE SIMILARITIES WITH DIMENSIONALITY REDUCTION TECHNIQUES

Sangarshanan Veeraraghavan, Final Year Undergraduate, VIT Vellore [email protected]

Abstract

In recent years several novel models have been developed to process natural language, and the development of accurate language translation systems has helped us overcome geographical barriers and communicate ideas effectively. These models are developed mostly for a few widely used languages, while other languages are ignored. Most spoken languages share lexical, syntactic and semantic similarity with several other languages, and knowing this can help us leverage existing models to build more specific and accurate models for other languages. Here I explore the idea of representing several popular languages in a lower dimension such that their similarities can be visualized using simple two-dimensional plots. This can even help us understand newly discovered languages that may not share their vocabulary with any existing language.

1. Introduction

Language is a method of communication and, ironically, has long remained a communication barrier. Written representations of languages look quite different but inherently share similarities. For example, if we show a person with no knowledge of the English alphabet a text in English and then one in Spanish, they might be oblivious to the similarity between them, as both look like gibberish to them. There might also be several languages which seem completely different even to experienced linguists but share a subtle hidden similarity, as there is a possibility that languages with no shared vocabulary might still have some similarity. -
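The idea of placing languages on a two-dimensional plot can be sketched in a few lines. The excerpt does not say which features or which reduction technique the author used, so everything below is an assumption made for illustration: character-bigram frequencies as features, principal component analysis (computed by power iteration with deflation) as the reduction, and three toy sentences as data. The names `SAMPLES`, `bigram_profile`, `centered_matrix`, `leading_direction`, `project_2d` and `language_map` are hypothetical.

```python
import math
import random
from collections import Counter

# Toy samples chosen only for illustration; real use needs far larger corpora.
SAMPLES = {
    "english": "the cat sits on the mat and looks at the bird",
    "spanish": "el gato se sienta en la alfombra y mira el pajaro",
    "german": "die katze sitzt auf der matte und schaut den vogel an",
}

def bigram_profile(text):
    """Relative frequency of each character bigram in the text."""
    counts = Counter(text[i:i + 2] for i in range(len(text) - 1))
    total = sum(counts.values())
    return {gram: n / total for gram, n in counts.items()}

def centered_matrix(profiles):
    """One row per language over a shared bigram vocabulary, columns centered."""
    vocab = sorted({gram for p in profiles for gram in p})
    rows = [[p.get(gram, 0.0) for gram in vocab] for p in profiles]
    means = [sum(col) / len(rows) for col in zip(*rows)]
    return [[x - m for x, m in zip(row, means)] for row in rows]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def leading_direction(rows, iterations=200, seed=0):
    """Approximate the top principal direction by power iteration on X^T X."""
    rng = random.Random(seed)  # random start: a constant vector is orthogonal to centered profiles
    vec = [rng.uniform(-1.0, 1.0) for _ in rows[0]]
    for _ in range(iterations):
        scores = [dot(row, vec) for row in rows]
        new = [sum(s * row[j] for s, row in zip(scores, rows)) for j in range(len(vec))]
        norm = math.sqrt(dot(new, new))
        if norm == 0.0:  # no variance left to explain
            break
        vec = [x / norm for x in new]
    return vec

def project_2d(rows):
    """Coordinates of each row along the first two principal directions."""
    coords = [[] for _ in rows]
    for _ in range(2):
        direction = leading_direction(rows)
        scores = [dot(row, direction) for row in rows]
        for point, s in zip(coords, scores):
            point.append(s)
        # Deflate: remove the component just extracted before finding the next one.
        rows = [[x - s * d for x, d in zip(row, direction)] for row, s in zip(rows, scores)]
    return coords

def language_map(samples):
    names = list(samples)
    rows = centered_matrix([bigram_profile(samples[n]) for n in names])
    return dict(zip(names, project_2d(rows)))
```

Calling `language_map(SAMPLES)` returns one `[x, y]` pair per language, which can be passed to any plotting tool. With samples this small the distances carry little meaning; the sketch only shows the shape of the pipeline.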
Malayalam Range: 0D00–0D7F
Malayalam Range: 0D00–0D7F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. -
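The block range stated above is enough for a simple script check. The following is a minimal sketch, not part of the Unicode charts: the helper names `is_malayalam` and `malayalam_ratio` are hypothetical, and the test is purely by range, so it also accepts code points inside 0D00–0D7F that are unassigned.

```python
MALAYALAM_START = 0x0D00  # first code point of the block named above
MALAYALAM_END = 0x0D7F    # last code point of the block named above

def is_malayalam(ch: str) -> bool:
    """True if the single character falls inside the Malayalam block."""
    return MALAYALAM_START <= ord(ch) <= MALAYALAM_END

def malayalam_ratio(text: str) -> float:
    """Share of non-whitespace characters that fall inside the Malayalam block."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    return sum(is_malayalam(c) for c in chars) / len(chars)
```

A ratio like this is a rough way to tell Malayalam-script text from romanized or mixed-script text before any further processing.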
A Study on Divergence in Malayalam and Tamil Language in Machine Translation Perceptive
A Study on Divergence in Malayalam and Tamil Language in Machine Translation Perceptive

Jisha P Jayan, Elizabeth Sherly
Virtual Resource Centre for Language Computing, IIITM-K, Trivandrum
[email protected] [email protected]

Abstract

Machine Translation has made significant achievements for the past decades. However, in many languages, the complexity with its rich inflection and agglutination poses many challenges, that forced for manual translation to make the corpus available. The divergence in lexical, syntactic and semantic in any pair of languages makes machine translation more difficult. And many systems still depend on rules heavily, that deteriorates system performance. In this paper, a study on divergence in Malayalam-Tamil languages is attempted at source language analysis to make translation process easy.

[...] others are specific with respect to the language pair (Lavanya et al., 2005; Saboor and Khan, 2010). Hence, the divergence in the translation need to be studied both perspectives that is across the languages and language specific pair (Sinha, 2005). The most problematic area in translation is the lexicon and the role it plays in the act of creating deviations in sense and reference based on the context of its occurrence in texts (Dash, 2013).

Indian languages come under Indo-Aryan or Dravidian scripts. Though there are similarities in scripts, there are many issues and challenges in translation between languages such as lexical divergences, ambiguities, lexical mismatches, reordering, syntactic and semantic issues, structural changes etc. Human translators try to choose the -
The Making of Modern Malayalam Prose and Fiction: Translations from European Languages Into Malayalam in the First Half of the Twentieth Century
The Making of Modern Malayalam Prose and Fiction: Translations from European Languages into Malayalam in the First Half of the Twentieth Century

K.M. Sherrif

Abstract

Translations from European languages have played a crucial role in the evolution of Malayalam prose and fiction in the first half of the Twentieth Century. Many of them are directly linked to the socio-political movements in Kerala which have been collectively designated ‘Kerala’s Renaissance.’ The nature of the translated texts reveals the operation of ideological and aesthetic filters in the interface between literatures, while the overwhelming presence of secondary translations indicates the hegemonic status of English as a receptor language. The translations never occupied a central position in the Malayalam literature and served mostly as mere literary and political stimulants.

Keywords: Translation - evolution of genres, canon - political intervention

The role of translation in the development of languages and literatures has been extensively discussed by translation scholars in the West during the last quarter of a century. The proliferation of diachronic translation studies that accompanied the revolutionary breakthroughs in translation theory in the mid-Eighties of the Twentieth Century resulted in the extensive mapping of the intervention of translation in the development of discourses and shifts of ideological paradigms in cultures, in the development of genres and the construction and disruption of the canon in literatures and in altering the idiomatic and structural paradigms of languages. One of the most detailed studies in the area was made by Andre Lefevere (1988, pp 75-114). Lefevere showed with convincing -
History-. of ··:Kerala: - • - ' - - ..>
HISTORY OF KERALA. K. P. PADMANABHA MENON. Rs. 8. 18 sh.

THE LATE Mr. K. P. PADMANABHA MENON. [Frontispiece.]

HISTORY OF KERALA. Copyright and right of translation reserved with Mrs. K. P. PADMANABHA MENON. Copies can be had of Mrs. K. P. PADMANABHA MENON, Sri Padmanabhalayam Bungalow, Diwans' Road, Ernakulam, Cochin State, S. INDIA.

A HISTORY OF KERALA. WRITTEN IN THE FORM OF NOTES ON VISSCHER'S LETTERS FROM MALABAR, BY K. P. PADMANABHA MENON, B.A., B.L., M.R.A.S., Author of the History of Cochin, and of several Papers connected with the early History of Kerala; Vakil of the High Courts of Madras & of Travancore and of the Chief Court of Cochin, AND EDITED BY SAHITHYAKUSALAN T. K. KRISHNA MENON, B.A., Formerly a Member of the Royal Asiatic Society, and of the Societies of Arts and of Authors, and a Fellow of the Royal Historical Society. Kunhambu Nambiyar Prizeman. For some time, an Examiner for Malayalam to the Universities of Madras, Benares and Hyderabad. A Member of the Board of Studies for Malayalam. A quondam Editor of Vidya Vinodini. A co-Editor of the Science Primers Series in Malayalam. Editor of Books for Malabar Bairns. The Author & Editor of several works in Malayalam. A Member of the Indian Women's University, and a Sadasya of Visva-Bharathi, &c. -
The Death of Sanskrit*
The Death of Sanskrit* SHELDON POLLOCK University of Chicago

“Toutes les civilisations sont mortelles” (Paul Valéry)

In the age of Hindu identity politics (Hindutva) inaugurated in the 1990s by the ascendancy of the Indian People’s Party (Bharatiya Janata Party) and its ideological auxiliary, the World Hindu Council (Vishwa Hindu Parishad), Indian cultural and religious nationalism has been promulgating ever more distorted images of India’s past. Few things are as central to this revisionism as Sanskrit, the dominant culture language of precolonial southern Asia outside the Persianate order. Hindutva propagandists have sought to show, for example, that Sanskrit was indigenous to India, and they purport to decipher Indus Valley seals to prove its presence two millennia before it actually came into existence. In a farcical repetition of Romantic myths of primevality, Sanskrit is considered, according to the characteristic hyperbole of the VHP, the source and sole preserver of world culture. The state’s anxiety both about Sanskrit’s role in shaping the historical identity of the Hindu nation and about its contemporary vitality has manifested itself in substantial new funding for Sanskrit education, and in the declaration of 1999–2000 as the “Year of Sanskrit,” with plans for conversation camps, debate and essay competitions, drama festivals, and the like.1

This anxiety has a longer and rather melancholy history in independent India, far antedating the rise of the BJP. Sanskrit was introduced into the Eighth Schedule of the Constitution of India (1949) as a recognized language of the new State of India, ensuring it all the benefits accorded the other fourteen (now seventeen) spoken languages listed. -
Non-Dravidian Elements and (Non)Diasystematic Change in Malayalam
To appear in Constructions in Contact 2: Language change, multilingual practices, and additional language acquisition eds. Höder, S. & Boas, H. Title: Non-Dravidian elements and (non)diasystematic change in Malayalam Savithry Namboodiripad (University of Michigan, Ann Arbor) [email protected] Abstract: This chapter applies a Diasystematic Construction Grammar (DCxG) approach to account for non-Dravidian vocabulary and phonology in Malayalam, a high-contact Dravidian language. The distinction made in DCxG between diaconstructions, which are language non-specific, and idioconstructions, which are language-specific, proves useful in accounting for semantic specialization and phonological heterogeneity due to language contact. Notably, increased contact with English has led in some cases to decreased phonological adaptation, as some constructions change from diaconstructions to idioconstructions: Non-diasystematic change. Taken together, this chapter argues that any analysis of Malayalam must account for non-Dravidian subpatterns, and including language labels as part of speakers' linguistic knowledge enhances our understanding of the dynamics of language contact. Keywords: Malayalam, English, language contact, language change, semantic specialization, loanword adaptation, historical linguistics, sociolinguistics 1 High-contact languages are not categorically different By introducing a distinction between diaconstructions, which are language non-specific, and idioconstructions, which are language-specific, Diasystematic Construction Grammar (DCxG) captures the fact that borders between languages, while often blurred, also shape language use (Höder 2012). Building on other construction-theoretic approaches, DCxG characterizes all languages as consisting of emergent subpatterns of constructions. 
This includes hybrid or high-contact languages, which differ only in that those subpatterns can be attributed (by analysts, not necessarily speakers) as belonging to a particular language. -
Cosmopolitanism, Modernity, and Cultural Dynamics: Perspectives from Kerala, India
COSMOPOLITANISM, MODERNITY, AND CULTURAL DYNAMICS: PERSPECTIVES FROM KERALA, INDIA

PJAEE, 17 (7) (2020)

Hemanth M, Ph.D. Research Scholar, Department of English, Faculty of Arts, Banaras Hindu University, India

Hemanth M, Cosmopolitanism, Modernity, and Cultural Dynamics: Perspectives from Kerala, India. PalArch’s Journal of Archaeology of Egypt/Egyptology 17(7), ISSN 1567-214x

Abstract: Kerala as a state in India is popular for its human development index comparable with many of the European countries. Kerala’s model of development has grabbed the attention of academicians all across the world. Neither agrarian nor industrial, the economic growth model of Kerala is a unique phenomenon, where cross-cultural interactions played a quintessential role. The amalgamation of modern Enlightenment thoughts along with the traditional values together contributed in shaping this society, where cosmopolitanism emerged as a predominant approach and outlook. Cosmopolitanism in Kerala is a highly complex cultural process, which plays a rudimentary role in the holistic growth that it attains in multiple domains of social life. This study aims to critically trace the historical roots of cosmopolitanism, cross-cultural interactions, and to further analyze how it has fundamentally moulded the modernity of Kerala.

Keywords: Cosmopolitanism, transnationalism, culture, migration, renaissance, coexistence.

Culture is a decisive sociological construct intrinsically related to identity formation in diverse domains of social life. It “locates man in time” and “links man with his lineage and heritage from which he picks up various primordial traits” (Khubchandani 50). -
On the Epoch of the Kollam Era Gislén, Lars
On the epoch of the Kollam Era

Gislén, Lars

Published in: Journal of Astronomical History and Heritage, 2018. Document Version: Publisher's PDF, also known as Version of record.

Citation for published version (APA): Gislén, L. (2018). On the epoch of the Kollam Era. Journal of Astronomical History and Heritage, 21(2&3), 134.

Journal of Astronomical History and Heritage, 21(2 & 3), 134 (2018). ON THE EPOCH OF THE KOLLAM ERA. Lars Gislén, University of Lund, Dala 7163, 242 97 Hörby, Sweden. E-mail: [email protected]

Abstract: It will be shown that the correct epoch of the Kollam Era is CE 25 July 824, Julian Calendar. -
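To work with the epoch stated in the abstract, a date in the Julian calendar can be turned into a Julian Day Number and compared with the epoch by subtraction. The sketch below uses the widely known integer formula for Julian-calendar dates. It is not taken from the paper, and the names `julian_calendar_to_jdn`, `KOLLAM_EPOCH_JDN` and `days_since_kollam_epoch` are hypothetical.

```python
def julian_calendar_to_jdn(year: int, month: int, day: int) -> int:
    """Julian Day Number of a date given in the Julian calendar.

    Uses astronomical year numbering (1 BC is year 0, 4713 BC is year -4712).
    """
    a = (14 - month) // 12   # 1 for January and February, else 0
    y = year + 4800 - a      # shift so the counted year starts in March
    m = month + 12 * a - 3   # March = 0, ..., February = 11
    return day + (153 * m + 2) // 5 + 365 * y + y // 4 - 32083

# Epoch as stated in the abstract: CE 25 July 824, Julian calendar.
KOLLAM_EPOCH_JDN = julian_calendar_to_jdn(824, 7, 25)

def days_since_kollam_epoch(year: int, month: int, day: int) -> int:
    """Whole days elapsed between the stated epoch and a Julian-calendar date."""
    return julian_calendar_to_jdn(year, month, day) - KOLLAM_EPOCH_JDN
```

Converting an elapsed-day count into Kollam years and months depends on the calendar rules discussed in the paper itself, which this excerpt does not include, so the sketch stops at the day count.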
Word Level Language Identification in English Telugu
PACLIC 32

Word Level Language Identification in English Telugu Code Mixed Data

Sunil Gundapu, Radhika Mamidi
Language Technologies Research Centre, KCIS, IIIT Hyderabad, Telangana, India
[email protected] [email protected]

Abstract

In a multilingual or sociolingual configuration Intra-sentential Code Switching (ICS) or Code Mixing (CM) is frequently observed nowadays. In the world most of the people know more than one language. The CM usage is especially apparent in social media platforms. Moreover, ICS is particularly significant in the context of technology, health and law, where conveying the upcoming developments is difficult in one's native language. In applications like dialog systems, machine translation, semantic parsing, shallow parsing, etc. CM and Code Switching pose serious challenges. To do any further advancement in code-mixed data, the necessary step

[...]tion in a single discourse between two languages, where the switching occurs within a sentence). Code mixing is inconsistently elucidated in disparate subfields of linguistics and frequent examination of phrases, words, inflectional, derivational morphology and syntax use of a term as an equivalent to Code Mixing.

Code mixing is defined as the embedding of linguistic units of one language into utterance of another language. CM is not only used in commonly used spoken form of multilingual setting, but also used in social media websites in the form of comments and replies, posts and especially in chat conversations. Most of the chat conversations are in a formal or semi-formal setting and CM is often used.
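Word-level language identification means attaching a language tag to every token of a code-mixed sentence. The excerpt stops before describing the authors' method, so the sketch below is only a baseline assumed for illustration: a lookup in two tiny hand-made word lists with an `unk` fallback. The lexicons, the tag names and the function `tag_words` are all hypothetical, and the Telugu entries are romanized.

```python
from typing import List, Tuple

# Toy, hand-made lexicons for illustration only; far too small for real use.
ENGLISH_WORDS = {"movie", "the", "is", "good", "very", "song"}
TELUGU_WORDS = {"chala", "bagundi", "nenu", "undi"}  # romanized Telugu

def tag_words(sentence: str) -> List[Tuple[str, str]]:
    """Tag each whitespace-separated token as 'en', 'te' or 'unk'."""
    tagged = []
    for token in sentence.lower().split():
        if token in ENGLISH_WORDS:
            tagged.append((token, "en"))
        elif token in TELUGU_WORDS:
            tagged.append((token, "te"))
        else:
            tagged.append((token, "unk"))  # not covered by either toy lexicon
    return tagged
```

For the mixed sentence `"Movie chala bagundi"` this returns `[("movie", "en"), ("chala", "te"), ("bagundi", "te")]`. A lookup like this breaks down on spelling variation and on words shared by both languages, which is why practical systems usually rely on learned sequence models over character and context features.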