Grammar Rule Based Cross Language Information Retrieval for Telugu

Total Page:16

File Type:pdf, Size:1020Kb

Grammar Rule Based Cross Language Information Retrieval for Telugu 1 GRAMMAR RULE BASED CROSS LANGUAGE INFORMATION RETRIEVAL FOR TELUGU A THESIS Submitted by DINESH MAVALURU Under the guidance of Dr. R. SHRIRAM in partial fulfillment for the award of the degree of DOCTOR OF PHILOSOPHY in COMPUTER SCIENCE B.S.ABDUR RAHMAN UNIVERSITY (B.S.ABDUR RAHMAN INSTITUTE OF SCIENCE &TECHNOLOGY) (Estd. u/s 3 of the UGC Act. 1956) www.bsauniv.ac.in APRIL 2014 2 CERTIFICATE This is to certify that all corrections and suggestions pointed out by the Indian/ Foreign Examiner(s) are incorporated in the Thesis titled “Grammar Rule Based Cross Language Information Retrieval for Telugu” submitted by Mr. Dinesh Mavaluru. (Dr.R. Shriram) SUPERVISOR Place: Chennai Date: 04 July 2014 3 4 B.S.ABDUR RAHMAN UNIVERSITY (B.S.ABDUR RAHMAN INSTITUTE OF SCIENCE &TECHNOLOGY) (Estd. u/s 3 of the UGC Act. 1956) www.bsauniv.ac.in BONAFIDE CERTIFICATE Certified that this thesis GRAMMAR RULE BASED CROSS LANGUAGE INFORMATION RETRIEVAL FOR TELUGU is the bonafide work of DINESH MAVALURU (RRN: 1194207) who carried out the thesis work under my supervision. Certified further, that to the best of my knowledge the work reported herein does not form part of any other thesis or dissertation on the basis of which a degree or award was conferred on an earlier occasion on this or any other candidate. SIGNATURE SIGNATURE Dr. R. SHRIRAM Dr. P. SHEIK ABDUL KHADER RESEARCH SUPERVISOR HEAD OF THE DEPARTMENT Professor Professor & Head Department of CSE Department of CA B.S. Abdur Rahman University B.S. Abdur Rahman University Vandalur, Chennai – 600 048 Vandalur, Chennai – 600 048 5 ACKNOWLEDGEMENT At the outset I thank the Almighty whose unbounded blessings and love have helped me in pursuing this research work. I always admired my adviser, Prof. R. Shriram, whose ideals had a big influence on me which changed the way I perceived this world. I am one of those fortunate students to scribe my name in his students list. Without his support, I could not imagine myself starting a research career. His generosity gave the freedom to enjoy all the privileges. I remain indebted to him and his family members all my life and just a mere thank you is not sufficient. I am greatly obliged to the members of my doctoral committee Dr. A. Kannan, Professor, Department of Information Science and Technology, Anna University, Chennai, Dr. T. R. Rangaswamy, Professor, Department of Electronics and Instrumentation Engineering, B S Abdur Rahman University, Chennai and Dr. P. Sheik Abdul Khader, Professor and Head, Department of Computer Applications, B S Abdur Rahman University, Chennai, for their guidance, valuable suggestions, continuous encouragement and critical reviews during the tenure of this research work. I would like to express most sincere gratitude to the members of my review committee Dr. V. Sankaranarayanan and Dr. K. M. Mehata who have influenced me greatly, and from whom I had the chance to learn throughout my research work by their valuable suggestions and guidance in between their tight schedule. I owe my sincere thanks to Prof. V. Saravanan, Computer Sciences and Information Technology College, Majmaah University, Majmaah, Kingdom of Saudi Arabia, who made me realize the best in me and also taught me how to do research. 6 I am immensely grateful to the faculty members of Department of Computer Applications, Management and Administration of B S Abdur Rahman University, Chennai for providing all the facilities to complete my research work successfully. I would like to thank all my dear colleagues in particular, Shakthi Priyan, A. Venkat Narayanan, P. Kumaran, T. Nadana Ravi Shankar, V. K. Mohan Raj, B. Manikandan, S. Sumitra, P. Thiripurasundari and D. M. Ahamed Kabeer Bhadhusha for their constant support during my research work. My Whole hearted thanks go to my family, Mrs Gnanamani and my beloved G. Sonia who motivated me to be strong, bold and helped me to bring out the best from the beginning to the end in the completion of this research work and move on with my future goals helped me to realize the importance of many things in my life. Finally, I would like to acknowledge my friends D. Shyam Kiran, Amaresh and many others who are along with me during my bad and good times. Without you all I am nowhere. 7 ABSTRACT The rapid spread of the World Wide Web and improvements in information retrieval (IR) techniques have allowed people to access huge amount of information. However, majority of the web content is in English. While content in languages like Telugu and Tamil are growing every day, a huge gap remains. This gap is what this research work will be addressing. In general information retrieval systems, the relevant information retrieved for the user query, only if the information is available in that query language. For example a Telugu search engine will retrieve only results for content in Telugu. It is not considering the relevant information that is available in the other languages for the given user query. Cross Language Information Retrieval (CLIR) systems seek to overcome this gap. A CLIR system retrieves information from a language that is different from the user’s query language. The goal of this research work is to develop a new framework for Telugu - English Cross Language Information Retrieval using Language Grammar Rules. The major challenges addressed are query ambiguity and the linguistic differences between the query and content language. The steps in this research are as follows: a) The user query is tokenized into keywords using tokenizer. The language grammar rules are applied to the tokenized query terms to identify the subject, verb, object and inflection in tokenized keywords. b) The query processor searches the English equivalent terms in the ontology for the terms identified using language grammar rules. The terms which are not available in ontology are considered as Out-Of- Vocabulary terms and literally transliterated into the English language. 8 c) The parser will find the subject, verb and object in English to assemble the query in English. The query processing is done and the query is converted into the English language. The converted query is given to the search engine for relevant results. d) The retrieved results are given to the post processor to convert the results into Telugu language. For this, the ontology is used to convert the Telugu word to the English word. Thus, all the previous stages mentioned are repeated again until the results are converted into target language representation. The grammar rule based approach is a semantic way of approaching the IR problem by first finding the meaning of query; mapping user query to target language, finding relevant information in target language, mapping this to source language and displayed to the user. This research work also evaluates the user acceptance of CLIR for Telugu using various metrics. 9 TABLE OF CONTENTS CHAPTER NO. TITLE PAGE NO. ABSTRACT V LIST OF TABLES XI LIST OF FIGURES XII 1. INTRODUCTION 1 1.1 General Introduction 1 1.2 Objectives 3 1.3 Contribution of The Work 4 1.4 Thesis Outline 5 2. LITERATURE REVIEW 7 2.1 Introduction 7 2.2 Information Retrieval 7 2.2.1 Retrieval Models 9 2.1.2 Improving Information Retrieval 14 2.3 Cross Language Information Retrieval 17 2.3.1 Non-Translation Approaches 17 2.3.2 Translation-Based Approaches 18 2.3.3 Challenges in CLIR 20 2.3.4 Current Approaches 21 2.4 Information Retrieval In The Telugu Language 24 2.4.1 Difficulties of Information Retrieval in Telugu 24 2.4.2 Monolingual IR in Telugu 25 10 CHAPTER NO. TITLE 2.4.3 CLIR and Telugu PAGE NO. 2.5 CONCLUSION 26 PROPOSED FRAMEWORK FOR TELUGU 3 CROSS LANGUAGE INFORMATION 28 RETRIEVAL 3.1 Introduction 30 3.2 Methodology of Proposed Framework 30 3.3 Proposed Framework System 30 3.3.1 Pre-Processing 32 3.3.2 Post-Processing 32 3.3 Conclusion 34 4 PREPROCESSING 37 4.1 Introduction 38 4.2 Methodology of Proposed Pre- Processing 38 4.2.1 Tokenizer 38 4.2.2 Language Grammar Rules 39 4.2.3 Bilingual Ontology 41 4.2.4 OOV Component 51 4.3 Conclusion 54 5 POSTPROCESSING 58 5.1 Introduction 59 5.2 Methodology of Proposed Post- Processing 59 5.2.1 Tokenizer 59 5.2.2 Language Grammar Rules 60 11 CHAPTER NO. TITLE PAGE NO. 5.2.3 Re-ranking System 61 5.2.4 Smoothening Approach 61 5.3 Conclusion 63 FRAMEWORK IMPLEMENTATION AND 6 67 RESUTLS 6.1 Introduction 68 6.2 Approaches For Evaluating Information Retrieval 68 6.3 Test Collection 68 6.4 Evaluation of Results 69 6.4.1 Mean Average Precision 69 6.5 Experimental Framework And Toolkit 70 6.6 Experimental Settings For Pre- Processing 70 6.7 Experimental Settings For Post- Processing 71 6.8 Testing and Results 73 6.9 Conclusion 73 EVALUATING USER ACCEPTANCE OF 7 CLIR USING LANGUAGE GRAMMAR 84 RULES 7.1 Introduction 85 7.2 Technology Acceptance Model (TAM) 85 7.3 Research Model And Hypotheses 85 7.3.1 CLIR System ease of use 87 7.3.2 CLIR System usefulness 87 12 LIST OF TABLES TABLE NO. TITLE PAGE NO. 4.1 Sample Telugu Sentence Order 43 4.2 Post positions for Telugu sentence order 45 4.3 Finite Verb Rules 47 4.4 Non-Finite Verb Rules 50 6.1 Relative Retrieval Efficiency 77 Time taken for Query processing in the 6.2 Existing and proposed systems 80 Precision Percentages For Retrieved 6.3 Results In Existing And Proposed Systems 81 6.4 Precision for Results 82 6.5 Weighted Precision 83 7.1 Profile of the system users 89 7.2 Instrument Reliability And Validity 94 Model fit summary for the final 7.3 measurement and structural model 96 The contribution of the study to existing 7.4 knowledge 96 13 LIST OF FIGURES FIGURE NO.
Recommended publications
  • CONTENT K to 8
    CONTENT K to 8 HINDI 28. Saraswati Sanskrit Manjusha ........................ 22 (ICSE) 1. Nai Rangoli ................................................... 7 29. Saraswati Sanskrit Sudha ................... 23 (ICSE) 2. Rangoli Varnamala........................................ 8 30. Saraswati Deep Manika ..................... 24 31. Saraswati Sanskrit Manjusha (ICSE) ............. 25 3. Rangoli Sulekh Abhyas ................................ 8 32. Saraswati Sanskrit Vyakaran ......................... 26 4. Saraswati Sarika ............................................ 9 33. Saraswati Sanskrit Vyakaran Sudha .............. 26 5. Kalptaru ........................................................ 9 34. Saraswati Manika Sanskrit 6. Naveen Sankalp ............................................ 10 Vyakaran (REVISED) ..................................... 27 7. Sargam .......................................................... 11 35. Saraswati Ruchira Abhyas Pustika .............. 27 8. Nai Swati ...................................................... 12 9. Elementary Hindi Reader ............................. 12 URDU 10. Saras Hindi Pathmala (NEW) ....................... 13 36. Bazeecha ....................................................... 28 11. Saraswati Nai Sarika (ICSE) ........................ 13 PRE-SCHOOL 12. Rangoli (ICSE) .............................................. 14 37. Tippy Tippy Tap ............................................ 29 13. Sankalp Hindi Pathmala (ICSE) .................... 15 38. Junior Smart Kit ...........................................
    [Show full text]
  • A Dependency Treebank for Telugu Taraka Rama University of Oslo
    English Conference Papers, Posters and Proceedings English 1-2018 A Dependency Treebank for Telugu Taraka Rama University of Oslo Sowmya Vajjala Iowa State University, [email protected] Follow this and additional works at: https://lib.dr.iastate.edu/engl_conf Part of the Communication Technology and New Media Commons Recommended Citation Rama, Taraka and Vajjala, Sowmya, "A Dependency Treebank for Telugu" (2018). English Conference Papers, Posters and Proceedings. 8. https://lib.dr.iastate.edu/engl_conf/8 This Conference Proceeding is brought to you for free and open access by the English at Iowa State University Digital Repository. It has been accepted for inclusion in English Conference Papers, Posters and Proceedings by an authorized administrator of Iowa State University Digital Repository. For more information, please contact [email protected]. A Dependency Treebank for Telugu Abstract In this paper, we describe the annotation and development of Telugu treebank following the Universal Dependencies framework. We manually annotated 1328 sentences from a Telugu grammar textbook and the treebank is freely available from Universal Dependencies version 2.1.1 In this paper, we discuss some language specific nnota ation issues and decisions; and report preliminary experiments with POS tagging and dependency parsing. To the best of our knowledge, this is the first freely accessible and open dependency treebank for Telugu. Keywords Telugu, Universal dependencies, adverbial clauses, relative clauses Disciplines Communication Technology and New Media Comments The following proceeding is published as Proceedings of the 16th International Workshop on Treebanks and Linguistic Theories (TLT16), pages 119–128, Prague, Czech Republic, January 23–24, 2018. Distributed under a CC-BY 4.0 licence.
    [Show full text]
  • Clause Boundary Identification for Non-Restrictive Type Complex Sentences in Telugu Language
    Volume 7, No. 4, July-August 2016 International Journal of Advanced Research in Computer Science RESEARCH PAPER Available Online at www.ijarcs.info Clause Boundary Identification for Non-Restrictive Type Complex Sentences in Telugu Language V. Suresh M.S. Prasad Babu Department of Computer Science and Engineering, Department of Computer Science and Systems Engineering, Vignan’s Institute of Information Technology, Duvvada, Andhra University College of Engineering, Visakhapatnam, A.P., India Andhra University, Visakhapatnam, A.P., India Abstract: Sentence identification for non-restricted type Telugu language is one of the basic applications of the Natural Language Processing. A sentence composed of single independent clause is called a simple sentence. If a sentence contains a dependent clause along with independent clause then it is a complex sentence. Once the sentence is identified as complex sentence, the next step is to identify its pattern. After identification of patterns, various clauses present in the sentence are extracted and grammar checking is performed on them. For grammar checking of complex sentences, it is necessary to identify the structure of Clause Boundary for Non-Restrictive Type of Telugu Language Complex Sentences. In this paper we have explored the different types of sentences present in Telugu language. The structure of complex sentences can be identified on the basis of number of clauses and types of clauses present in them. We have also proposed an algorithm for identification of simple, compound and complex sentences. This study will be helpful in identifying and separating the complex sentences from Telugu language. We have also proposed an algorithm for identification of simple, compound and complex sentences.
    [Show full text]
  • A History of Telugu Literature
    THE HERITAGE OF INDIA SERIES lan A P ned by J . N . F R QUHAR , M . A D . Litt . D D . (Aberdeen) . Ri R V . H The ght everend . S AZ AR I A , LL . D a Bishop of Dorn kal . Joi nt K E . C . DEWI C , M . A . (Cantab . ) E ditors . AN LY M a J N C G GU , . A (Birmingh m) , - a Darsan S stri . Already pub li c/zed. f He ar o Bu ism . S . D .Litt . a a The t ddh K J AUNDERS , M A (C nt b . ) i 2 r o f a are se e ra e u P B . Hi o n L ur d ed . E . I A A st y K t t , . R CE , . S e . k a s ud ed . B I I H i . The sam hy y t m , z A ERR EDALE KE T , D . L tt ( Oxo n . ) 2 ed . S . MA AI L . Aso k a , ud JAME M CPH , M A M D in in . 2nd e d . rinc i a Y O u . Indian Pa t g P p l PE R C BR W N , Calc tta f a th S ai . i ra i n s NI O I O . Psalms o M t C L MACN C L , M A , D L tt . f in i i e a u e . r F E . Li . H o H L r . A isto ry d t t . KEAY , M A , D tt a B The Karm A .
    [Show full text]
  • A Comparative Grammar of the Dravidian Languages
    World Classical Tamil Conference- June 2010 51 A COMPARATIVE GRAMMAR OF THE DRAVIDIAN LANGUAGES Rt. Rev. R.Caldwell * Rev. Caldwell's Comparative Grammar is based on four decades of his deep study and research on the Dravidian languages. He observes that Tamil language contains a common repository of Dravidian forms and roots. His prefatory note to his voluminous work is reproduced here. Preface to the Second Edition It is now nearly nineteen years since the first edition of this book was published, and a second edition ought to have appeared long ere this. The first edition was, soon exhausted, and the desirableness of bringing out a second edition was often suggested to me. But as the book was a first attempt in a new field of research and necessarily very imperfect, I could not bring myself to allow a second edition to appear without a thorough revision. It was evident, however, that the preparation of a thoroughly revised edition, with the addition of new matter wherever it seemed to be necessary, would entail upon me more labour than I was likely for a long time to be able to undertake. .The duties devolving upon me in India left me very little leisure for extraneous work, and the exhaustion arising from long residence in a tropical climate left me very little surplus strength. For eleven years, in addition to my other duties, I took part in the Revision of the Tamil Bible, and after that great work had come to. an end, it fell to my lot to take part for one year more in the Revision of the Tamil Book of, * Source: A Comparative Grammar of the Dravidian or South-Indian Family of Languages , Rt.Rev.
    [Show full text]
  • Maintaining the Heritage Language and Identity: the Case of the New Diaspora Telugus in London
    ANGLIA RUSKIN UNIVERSITY MAINTAINING THE HERITAGE LANGUAGE AND IDENTITY: THE CASE OF THE NEW DIASPORA TELUGUS IN LONDON ARUN YADLA A thesis in partial fulfilment of the requirements of Anglia Ruskin University for the degree of Doctor of Philosophy Submitted: October 2016 Acknowledgements This study and thesis have become reality with the kind support and help of many individuals, directly or indirectly. First and foremost, I express my gratitude to my supervisors, Dr. Sebastian Rasinger and Dr. Bettina Beinhoff. Your guidance, patience, invaluable input, timely feedback and constructive criticism helped me in my journey. Thank you for the kind words and encouragement when my chips were down. I would like to thank Mr. Rama Naidu Gāru for connecting me to the Telugu network and also for introducing me to key participants. Without your support, building a sample base would have been impossible. Many thanks for the invaluable support I received from Christine, Richard and Vernon from Cambridge, and Moushmi, Deepak, Dhruv, Dia, Aryaman, Anamika, Nana and Nani from London. I am very grateful for the love and kindness you have shown towards me. A special acknowledgement for the respondents and participants from the Telugu community for giving your time generously to accommodate me in your busy schedules. Thanks also for enduring my interview questions and discussions. I must also acknowledge the contribution of my students and module leaders for their positive and constructive feedback, which motivated me to give my best. I cannot forget to acknowledge my research room friends, who made my journey a memorable one with friendly banter, endless snacks, exchange of stories and ideas.
    [Show full text]
  • The Languages of South Asia
    THE LANGUAGES OF SOUTH ASIA a catalogue of rare books: dictionaries, grammars, manuals, & literature. with several important works on Tibetan Catalogue 31 John Randall (Books of Asia) John Randall (Books of Asia) [email protected] +44 (0)20 7636 2216 www.booksofasia.com VAT Number : GB 245 9117 54 Cover illustration taken from no. 4 (Colebrooke) in this catalogue; inside cover illustrations taken from no. 206 (Williams). © John Randall (Books of Asia) 2017 THE LANGUAGES OF SOUTH ASIA Catalogue 31 John Randall (Books of Asia) INTRODUCTION The conversion of the East India Company from trading concern to regional power South Asia, home to six distinct linguistic gave further impetus to the study of South families, remains one of the most Asian languages. Employees of the linguistically complex regions on earth. Company were charged with producing According to the 2001 Census of India, linguistic guides for official purposes. 1,721 languages and dialects were spoken as Military officers needed language skills to mother tongues. Of these, 29 had one issue commands to locally recruited troops. million or more speakers, and a further 31 And as the Company sought to perpetuate more than 100,000. the Mughal system of rule, knowledge of Persian as well as regional languages was The political implications of such dizzying essential for revenue collectors and diversity have been no less complex. Since administrators of justice. 1953, there have been many attempts to re- divide the country along linguistic lines. As All the while, some independent European recently as 2014, the new state of Telangana scholars demonstrated a genuine interest in was created as a homeland for Telugu and empathy for South Asian languages and speakers.
    [Show full text]
  • Télécharger Le PDF De Article
    Histoire Epistémologie Langage Available online at: 39/2 (2017), 45-63 www.hel-journal.org © SHESL/EDP Sciences https://doi.org/10.1051/hel/2017390203 APPROPRIATIONS AND INNOVATIONS IN METALINGUISTIC TERMINOLOGY IN AN INFLUENTIAL TELUGU GRAMMAR COMPOSED IN SANSKRIT★ Deven M. Patel University of Pennsylvania, Philadelphia, USA Abstract Résumé The various traditions of Sanskrit grammar Les diverses traditions de grammaire sanskrite have served as models, or as sources for ont fourni des modèles, ou des sources de metalinguistic description, for many other description métalinguistique, pour nombre de grammars composed to describe South Asian grammaires décrivant les langues littéraires classical or local literary languages. This classiques ou locales d’Asie du Sud. Cet article article investigates the contents of the first porte sur le contenu du premier chapitre chapter (on metalinguistic terms) of a remark- (consacré aux termes métalinguistiques) d’une able and influential medieval grammar of the grammaire médiévale remarquable et influente major Dravidian language Telugu, or Āndhra- de l’incontournable langue dravidienne télou- bhās.ā, known as the Āndhras´abdacintāman.i. goue (ouāndhra-bhās.ā), connue sous le nom de This grammar was composed with the same Āndhras´abdacintāman.i. Cette grammaire fut technical precision and a style similar to that composée avec la même précision technique et of Pān.ini’s As. ṭādhyāy¯ı. Hence, the purpose of un style similaire à ceux de l’As.ṭādhyāy¯ı de Pān. this article: to study the processes of adapta- ini. D’où l’objet du présent article : étudier les tion of metalanguage and of the Sanskrit processus d’adaptation de la métalangue et des metalinguistic technologies to describe Telugu technologies métalinguistiques sanskrites pour in Sanskrit, a language that has both profound décrire le télougou en sanskrit, une langue qui lexical affinities and striking phonological and présente à la fois des affinités lexicales morphological divergences from Telugu.
    [Show full text]
  • The Dravidian Languages
    This page intentionally left blank THE DRAVIDIAN LANGUAGES The Dravidian languages are spoken by over 200 million people in South Asia and in diaspora communities around the world, and constitute the world’s fifth largest language family. It consists of about twenty-six lan- guages in total including Tamil, Malay¯alam,. Kannada. and Telugu,as well as over twenty non-literary languages. In this book, Bhadriraju Krishnamurti, one of the most eminent Dravidianists of our time and an Honorary Member of the Linguistic Society of America, provides a comprehensive study of the phonological and grammatical structure of the whole Dravidian family from different aspects. He describes its history and writing system, dis- cusses its structure and typology, and considers its lexicon. Distant and more recent contacts between Dravidian and other language groups are also discussed. With its comprehensive coverage this book will be welcomed by all students of Dravidian languages and will be of interest to linguists in various branches of the discipline as well as Indologists. is a leading linguist in India and one of the world’s renowned historical and comparative linguists, specializing in the Dravidian family of languages. He has published over twenty books in English and Telugu and over a hundred research papers. His books include Telugu Verbal Bases: a Comparative and Descriptive Study (1961), Kon. da. or K¯ubi, a Dravidian Language (1969), A Grammar of Modern Telugu (with J. P. L. Gwynn, 1985), Language, Education and Society (1998) and Comparative Dravidian Linguistics: Current Perspectives (2001). CAMBRIDGE LANGUAGE SURVEYS General editors P. Austin (University of Melbourne) J.
    [Show full text]
  • A History of Telugu Literature;
    Jf THE HERITAGE OF INDIA SERIES Planned by J. N. FARQUHAR, M.A., D.Litt. (Oxon.), D.D. (Aberdeen). /The Right Reverend V. S. AZARIAH, LL.D of Dornakal. Joint (Cantab.), Bishop E. C. M.A. Editors DEWICK, (Cantab.) J. N. C. GANGULY, M.A., (Birmingham), Darsan-Sastri. Already published. The Heart of Buddhism. K. J. SAUNDERS, M.A., D.Litt. (Cantab.) A History of Kanarese Literature, 2nd ed. E. P. RICE, B.A. The Sarhkhya System, 2nd ed. A. BERRIEDALE KEITH, D.C.L., D.Litt. (Oxon.) ASoka, 2nd ed. JAMES M. MACPHAIL, M.A., M.D. Indian Painting. _2nd ed. Principal PERCY BROWN, Calcutta. Psalms of Maratha Saints. NICOL MACNICOL, M.A., D.Litt. A History of Hindi Literature. F. E, KEAY, M.A., D.Litt. The Karma-MImamsa. A. BERRIEDALE KEITH, D.C.L., D.Litt.(Oxon.) Hymns of the Tamil Saivite Saints. F. KINGSBURY, B.A., and G. E. PHILLIPS, M.A. Rabindranath Tagore. EDWARD THOMPSON, M.A. (Oxon.), Ph.D. Hymns from the Rigveda. A. A. MACDONELL, M.A., Ph.D., Hon. LL.D. Gautama Buddha. K. J. SAUNDERS, M.A., D.Litt. (Cantab.) The Coins of India. C. J. BROWN, M.A. Indian Poems by Women. ^ MRS. MACNICOL. Bengali Religious Lyrics, Sakta. EDWARD THOMPSON, M.A. (Oxon.), Ph.D., and A. M. SPENCER. Classical Sanskrit Literature, 2nd ed. A. BERRIEDALE KEITH, D.C.L., D.Litt. (Oxon.) The Music of India. H. A. POPLEY, B.A. Subjects proposed and volumes under preparation. HISTORY AND THE HERITAGE. The Early Period. The Gupta Period. The Mogul Period.
    [Show full text]
  • Telugu to English Translation Using Direct Machine Translation Approach
    International Journal of Science and Engineering Investigations vol. 2, issue 12, January 2013 ISSN: 2251-8843 Telugu to English Translation using Direct Machine Translation Approach T. Venkateswara Prasad1, G. Mayil Muthukumaran2 1Dean of Computing Sciences, Visvodaya Technical Academy, Kavali, AP, India 2Technical Director, National Informatics Centre, Govt. of India, New Delhi, India ([email protected], [email protected]) Abstract- The motivation behind working on a translation descended from Brahmi script. Telugu is said to have split system from Telugu to English were based on the principles from proto-Dravidian languages around 6th to 3rd century that BCE [13]. a) There are many translation systems for translating from Telugu language is a highly structured, disciplined, suave English to Indian languages but very few for vice versa. and rich in terms of expression, style and construction. It Telugu is a language that exhibits very strong phrasal, exhibits clear and structured implementation of grammar in word and sentence structures next to Sanskrit, which the best possible manner while including present day makes the work organized on one hand but complex in corruptions (or vulgarity) and foreign words. There is a clear handling on the other. This work demonstrates one such and specific purpose and meaning of each letter. Slight machine translation (MT) system for translating simple modification in the way a letter is written can change the and moderately complex sentences from Telugu to meaning itself, e.g., kada and kaDa are two unique words English. having different meaning. Similarly, rama, rāma and ramā are b) Of the many MT approaches, the direct MT is used for three different usages.
    [Show full text]
  • The Making of Telugu Language and Literature
    Translation as Negotiation: The Making of Telugu Language and Literature T. Vijay Kumar Abstract In terms of the number of native speakers, Telugu (the official language of Andhra Pradesh and Telangana in the southern part of India) ranks third among Indian languages after Hindi and Bengali. This study of the literary trajectory of Telugu notes how translation was inscribed in the emergence of the Telugu language, created as it was out of a mixture of Sanskrit, tribal and Dravidian tongues. It examines the various stages of translation through which the Telugu language passed and the responses of its literary culture to translation not only from the Ramayana and the Mahabharata, but also from English canonical texts in the nineteenth and twentieth centuries. Keywords: Translation, Telugu literary history, canon formation, historical analysis, literary culture. Telugu language Translation in Telugu, as perhaps in most Indian languages, is more practised than theorised. In a multilingual country like India where almost everyone is at least bilingual, translation—both in letter and spirit—is bound to have connotations quite different from those in the West. Sujit Mukherjee points out that “Rupantar (meaning ‘change in form’) and anuvad (‘speaking after’ or ‘following’) are the commonly understood senses of translation in India, and neither term demands fidelity to the original” (80). He Translation Today Vol.10, Issue-I, June 2016 T. Vijay Kumar further observes, “The notion that every literary translation is a faithful rendering of the original came to us from the West, perhaps in the wake of the Bible and the need felt by Christian missionaries to have it translated into different Indian languages” (80).
    [Show full text]