Proposal for a Malayalam Script Root Zone Label Generation Ruleset (LGR)
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Recognition of Ancient Tamil Characters in Stone Inscription Using Improved Feature Extraction M
International Journal of Recent Development in Engineering and Technology Website: www.ijrdet.com (ISSN 2347 - 6435 (Online)), Volume 2, Special Issue 3, February 2014) International Conference on Trends in Mechanical, Aeronautical, Computer, Civil, Electrical and Electronics Engineering (ICMACE14) Recognition of Ancient Tamil Characters in Stone Inscription Using Improved Feature Extraction M. V. Jeya Greeba1, G.Bhuvaneswari2 1PG Student, DMI College Of Engineering, Chennai-600123, India 2Assistant Professor, Department of CSE, DMI College Of Engineering, Chennai-600123, India. [email protected], [email protected] Abstract— Recognition of Ancient Tamil character is one of It is currently spoken by about 77 million people around the important task to reveal the information from the Ancient the world with 68 million speakers residing in India mostly world. Feature Extraction is an important step used to in the state of Tamil Nadu. It is one of the official language recognize the Ancient characters accurately. Feature in India, Sri Lanka and Singapore. Tamil characters consist describes the characteristic of object uniquely. Feature of small circles or loops, which are difficult to extraction is the process of extracting information from raw data which is most relevant for classification purpose. Feature recognize.Recognizing the ancient Tamil characters that is, Extraction technique extracts the basic components of Tamil the Vatteluttu alphabets is a tough task for the modern Characters in Stone inscription and then it translates the generation who learn to read and write only with modern components for further recognition procedures. Structural Tamil characters. Learning the evolution of Modern Tamil Features are extracted. Presence of dots, loops and curves in from ancient Tamil is a time consuming process therefore a Tamil character makes Feature Extraction a challenging recognition system helps to teach, understand and also to process. -
Updated Proposal to Encode Tulu-Tigalari Script in Unicode
Updated proposal to encode Tulu-Tigalari script in Unicode Vaishnavi Murthy Kodipady Yerkadithaya [ [email protected] ] Vinodh Rajan [ [email protected] ] 04/03/2021 Document History & Background Documents : (This document replaces L2/17-378) L2/11-120R Preliminary proposal for encoding the Tulu script in the SMP of the UCS – Michael Everson L2/16-241 Preliminary proposal to encode Tigalari script – Vaishnavi Murthy K Y L2/16-342 Recommendations to UTC #149 November 2016 on Script Proposals – Deborah Anderson, Ken Whistler, Roozbeh Pournader, Andrew Glass and Laurentiu Iancu L2/17-182 Comments on encoding the Tigalari script – Srinidhi and Sridatta L2/18-175 Replies to Script Ad Hoc Recommendations (L2/16-342) and Comments (L2/17-182) on Tigalari proposal (L2/16-241) – Vaishnavi Murthy K Y L2/17-378 Preliminary proposal to encode Tigalari script – Vaishnavi Murthy K Y, Vinodh Rajan L2/17-411 Letter in support of preliminary proposal to encode Tigalari – Guru Prasad (Has withdrawn support. This letter needs to be disregarded.) L2/17-422 Letter to Vaishnavi Murthy in support of Tigalari encoding proposal – A. V. Nagasampige L2/18-039 Recommendations to UTC #154 January 2018 on Script Proposals – Deborah Anderson, Ken Whistler, Roozbeh Pournader, Lisa Moore, Liang Hai, and Richard Cook PROPOSAL TO ENCODE TIGALARI SCRIPT IN UNICODE 2 A note on recent updates : −−−Tigalari Script is renamed Tulu-Tigalari script. The reason for the same is discussed under section 1.1 (pp. 4-5) of this paper & elaborately in the supplementary paper Tulu Language and Tulu-Tigalari script (pp. 5-13). −−−This proposal attempts to harmonize the use of the Tulu-Tigalari script for Tulu, Sanskrit and Kannada languages for archival use. -
Intelligence System for Tamil Vattezhuttuoptical
Mr R.Vinoth et al. / International Journal of Computer Science & Engineering Technology (IJCSET) INTELLIGENCE SYSTEM FOR TAMIL VATTEZHUTTUOPTICAL CHARACTER RCOGNITION Mr R.Vinoth Assistant Professor, Department of Information Technology Agni college of Technology, Chennai, India [email protected] Rajesh R. UG Student, Department of Information Technology Agni college of Technology, Chennai, India [email protected] Yoganandhan P. UG Student, Department of Information Technology Agni college of Technology, Chennai, India [email protected] Abstract--A system that involves character recognition and information retrieval of Palm Leaf Manuscript. The conversion of ancient Tamil to the present Tamil digital text format. Various algorithms were used to find the OCR for different languages, Ancient letter conversion still possess a big challenge. Because Image recognition technology has reached near-perfection when it comes to scanning Tamil text. The proposed system overcomes such a situation by converting all the palm manuscripts into Tamil digital text format. Though the Tamil scripts are difficult to understand. We are using this approach to solve the existing problems and convert it to Tamil digital text. Keyword - Vatteluttu Tamil (VT); Data set; Character recognition; Neural Network. I. INTRODUCTION Tamil language is one of the longest surviving classical languages in the world. Tamilnadu is a place, where the Palm Leaf Manuscript has been preserved. There are some difficulties to preserve the Palm Leaf Manuscript. So, we need to preserve the Palm Leaf Manuscript by converting to the form of digital text format. Computers and Smart devices are used by mostof them now a day. So, this system helps to convert and preserve in a fine manner. -
Names of Foodstuffs in Indian Languages
NAMES OF FOODSTUFFS IN INDIAN LANGUAGES CEREAL GRAINS AND PRODUCTS 1. Pearl Millet: Pennisetum typhoides Bajra (Bengali, Hindi, Oriya), Bajri (Gujarati, Marathi), Sajje (Kannada), Bajr’u (Kashmiri), Cambu (Malayalam, Tamil), Sazzalu (Telugu). Other names : Spiked millet, Pearl millet 2. Italian millet: Setaria italica Syama dhan (Bengali), Ral Kang (Gujarati), Kangni (Hindi), Thene (Kannada), Shol (Kashmiri), Thina (Malayalam), Rala (Marathi), Kaon (Punjabi), Thenai (Tamil), Korralu (Telugu), Other names: Foxtail millet , Moha millet, Kakan kora 3. Sorghum: Sorghum bicolor Juar (Bengali , Gujarati , Hindi), Jola (Kannada), Cholam (Malayalam , Tamil), Jwari (Marathi), Janha (Oriya), Jonnalu (Telugu), Other names: Milo , Chari 4. Maize: Zea mays Bhutta (Bengali), Makai (Gujarati), Maka (Hindi , Marathi , Oriya), Musikinu jola (Kannada), Makaa’y (Kashmiri), Cholam (Malayalam), Makka Cholam (Tamil), Mokka jonnalu (Telugu) 5. Finger Millet: Eleusine coracana Madua (Bengali , Hindi), Bhav (Gujarati), Ragi (Kannada) , Moothari (Malayalam), Nachni (Marathi), Mandia (Oriya), Kezhvaragu (Tamil), Ragulu (Telugu), Other names: Korakan 6. Rice, parboiled: Oryza sativa Siddha chowl (Bengali) Ukadello chokha (Gujarati), Usna chawal (Hindi), Kusubalakki (Kannada), Puzhungal ari (Malayalam), Ukadla tandool (Marathi), Usuna chaula (Oriya), Puzhungal arisi (Tamil), Uppudu biyyam (Telugu) 7. Rice raw: Orya sativa Chowl (Bengali), Chokha (Gujarati), Chawal (Hindi), Akki (Kannada), Tomul (Kashmiri), Ari (Malayalam), Tandool (Marathi), Chaula (Oriya), Arisi -
Exploring Language Similarities with Dimensionality Reduction Techniques
EXPLORING LANGUAGE SIMILARITIES WITH DIMENSIONALITY REDUCTION TECHNIQUES Sangarshanan Veeraraghavan Final Year Undergraduate VIT Vellore [email protected] Abstract In recent years several novel models were developed to process natural language, development of accurate language translation systems have helped us overcome geographical barriers and communicate ideas effectively. These models are developed mostly for a few languages that are widely used while other languages are ignored. Most of the languages that are spoken share lexical, syntactic and sematic similarity with several other languages and knowing this can help us leverage the existing model to build more specific and accurate models that can be used for other languages, so here I have explored the idea of representing several known popular languages in a lower dimension such that their similarities can be visualized using simple 2 dimensional plots. This can even help us understand newly discovered languages that may not share its vocabulary with any of the existing languages. 1. Introduction Language is a method of communication and ironically has long remained a communication barrier. Written representations of all languages look quite different but inherently share similarity between them. For example if we show a person with no knowledge of English alphabets a text in English and then in Spanish they might be oblivious to the similarity between them as they look like gibberish to them. There might also be several languages which may seem completely different to even experienced linguists but might share a subtle hidden similarity as they is a possibility that languages with no shared vocabulary might still have some similarity. -
Malayalam Range: 0D00–0D7F
Malayalam Range: 0D00–0D7F This file contains an excerpt from the character code tables and list of character names for The Unicode Standard, Version 14.0 This file may be changed at any time without notice to reflect errata or other updates to the Unicode Standard. See https://www.unicode.org/errata/ for an up-to-date list of errata. See https://www.unicode.org/charts/ for access to a complete list of the latest character code charts. See https://www.unicode.org/charts/PDF/Unicode-14.0/ for charts showing only the characters added in Unicode 14.0. See https://www.unicode.org/Public/14.0.0/charts/ for a complete archived file of character code charts for Unicode 14.0. Disclaimer These charts are provided as the online reference to the character contents of the Unicode Standard, Version 14.0 but do not provide all the information needed to fully support individual scripts using the Unicode Standard. For a complete understanding of the use of the characters contained in this file, please consult the appropriate sections of The Unicode Standard, Version 14.0, online at https://www.unicode.org/versions/Unicode14.0.0/, as well as Unicode Standard Annexes #9, #11, #14, #15, #24, #29, #31, #34, #38, #41, #42, #44, #45, and #50, the other Unicode Technical Reports and Standards, and the Unicode Character Database, which are available online. See https://www.unicode.org/ucd/ and https://www.unicode.org/reports/ A thorough understanding of the information contained in these additional sources is required for a successful implementation. -
Neo-Vernacularization of South Asian Languages
LLanguageanguage EEndangermentndangerment andand PPreservationreservation inin SSouthouth AAsiasia ed. by Hugo C. Cardoso Language Documentation & Conservation Special Publication No. 7 Language Endangerment and Preservation in South Asia ed. by Hugo C. Cardoso Language Documentation & Conservation Special Publication No. 7 PUBLISHED AS A SPECIAL PUBLICATION OF LANGUAGE DOCUMENTATION & CONSERVATION LANGUAGE ENDANGERMENT AND PRESERVATION IN SOUTH ASIA Special Publication No. 7 (January 2014) ed. by Hugo C. Cardoso LANGUAGE DOCUMENTATION & CONSERVATION Department of Linguistics, UHM Moore Hall 569 1890 East-West Road Honolulu, Hawai’i 96822 USA http:/nflrc.hawaii.edu/ldc UNIVERSITY OF HAWAI’I PRESS 2840 Kolowalu Street Honolulu, Hawai’i 96822-1888 USA © All text and images are copyright to the authors, 2014 Licensed under Creative Commons Attribution Non-Commercial No Derivatives License ISBN 978-0-9856211-4-8 http://hdl.handle.net/10125/4607 Contents Contributors iii Foreword 1 Hugo C. Cardoso 1 Death by other means: Neo-vernacularization of South Asian 3 languages E. Annamalai 2 Majority language death 19 Liudmila V. Khokhlova 3 Ahom and Tangsa: Case studies of language maintenance and 46 loss in North East India Stephen Morey 4 Script as a potential demarcator and stabilizer of languages in 78 South Asia Carmen Brandt 5 The lifecycle of Sri Lanka Malay 100 Umberto Ansaldo & Lisa Lim LANGUAGE ENDANGERMENT AND PRESERVATION IN SOUTH ASIA iii CONTRIBUTORS E. ANNAMALAI ([email protected]) is director emeritus of the Central Institute of Indian Languages, Mysore (India). He was chair of Terralingua, a non-profit organization to promote bi-cultural diversity and a panel member of the Endangered Languages Documentation Project, London. -
The Making of Modern Malayalam Prose and Fiction: Translations from European Languages Into Malayalam in the First Half of the Twentieth Century
The Making of Modern Malayalam Prose and Fiction: Translations from European Languages into Malayalam in the First Half of the Twentieth Century K.M. Sherrif Abstract Translations from European languages have played a crucial role in the evolution of Malayalam prose and fiction in the first half of the Twentieth Century. Many of them are directly linked to the socio- political movements in Kerala which have been collectively designated ‘Kerala’s Renaissance.’ The nature of the translated texts reveal the operation of ideological and aesthetic filters in the interface between literatures, while the overwhelming presence of secondary translations indicate the hegemonic status of English as a receptor language. The translations never occupied a central position in the Malayalam literature and served mostly as mere literary and political stimulants. Keywords: Translation - evolution of genres, canon - political intervention The role of translation in the development of languages and literatures has been extensively discussed by translation scholars in the West during the last quarter of a century. The proliferation of diachronic translation studies that accompanied the revolutionary breakthroughs in translation theory in the mid-Eighties of the Twentieth Century resulted in the extensive mapping of the intervention of translation in the development of discourses and shifts of ideological paradigms in cultures, in the development of genres and the construction and disruption of the canon in literatures and in altering the idiomatic and structural paradigms of languages. One of the most detailed studies in the area was made by Andre Lefevere (1988, pp 75-114) Lefevere showed with convincing 118 Translation Today K.M. -
History-. of ··:Kerala: - • - ' - - ..>
HISTORY-. OF ··:KERALA: - • - ' - - ..> - K~ P. PAD!rlANABHA .MENON.. Rs. 8. 18 sh. ~~~~~~ .f-?2> ~ f! P~~-'1 IY~on-: f. L~J-... IYt;;_._dh, 4>.,1.9 .£,). c~c~;r.~, ~'").-)t...q_ A..Ja:..:..-. THE L ATE lVIn. K. P . PADJVIANABHA MENON. F rontispiece.] HISTORY· op::KERALA. .. :. ' ~ ' . Oowright and right of t'fanslation:. resen;e~ witk ' Mrs. K. P. PADMANABHA MENON. Copies can be had of . '. Mrs. K. P. PAI>MANABHA MENON, Sri Padmanabhalayam Bungalow~, Diwans' Road;-!Jmakulam, eochin state, S.INDIA. HISTORY OF KERALA. A HISTORY OF KERALA. WBI'l!TEN, IN THE FOBH OF NOTES ON VISSCHER'S LETTERS FROM MALABAR, BY K. P. PADMANABHA MENON, B.A., S.L, M.~.A.S., ' . Author of the History of Codzin, anti of severai P~p~rsconnectedwith the early History of Kerala; Jiak•l of the H1g-k Courts of Madras 0,.. of Travam:ore and of tke Ckief Court of. Cochin, ~ . • r . AND .EDITED BY SAHITHYAKUSALAN. "T. :K.' KRISHNA MENON, B. A.;~. • ' "'•.t . fl' Formerly, a Member of Jhe Royal Asiatic Society, and of the ~ocieties of Arts and of Aut/tors, 'anti a Fellow of the .Royal Histor~cal .Soci'ety. Kun.kamhu NamfJiyar Pr~sd:ian. For some ti'me, anE:cami'n.er for .Malayalam to the Umverfities of Madras, Benares and Hydera bad. A Member (Jf the .Board of Stzediet for Malayalam. A fJUOndum Editor of Pid,.a Vinodini. A co-Editor of tke ., .Sciene~ Primers Seriu in Malayalam. Editor of .Books for Malabar Bairns: The Author &- Editor of several works in Malayalam. A Member of the, Indian Women's Uni'r,ersity, and · · a .Sadasya of Visvn-.Bharatki, &-c. -
Malayalam Recognizer: a Learning to Write Collaborator
ISSN(Online): 2319-8753 ISSN (Print): 2347-6710 International Journal of Innovative Research in Science, Engineering and Technology (A High Impact Factor, Monthly, Peer Reviewed Journal) Visit: www.ijirset.com Vol. 8, Issue 5, May 2019 Malayalam Recognizer: A Learning to Write Collaborator Soumya Muraleedhara Menon1, Roopa Sree Mohan2, Deepa V. P.3, Navya T.4, Ganesh P.5 B.Tech Scholar, Dept. of CSE, Sreepathy Institute of Management and Technology, Palakkad, Kerala, India1 B.Tech Scholar, Dept. of CSE, Sreepathy Institute of Management and Technology, Palakkad, Kerala, India2 B.Tech Scholar, Dept. of CSE, Sreepathy Institute of Management and Technology, Palakkad, Kerala, India3 B.Tech Scholar, Dept. of CSE, Sreepathy Institute of Management and Technology, Palakkad, Kerala, India4 Head of Dept., Asst. Professor, Dept. of CSE, Sreepathy Institute of Management and Technology, Palakkad, Kerala, India5 ABSTRACT: Handwriting recognition is an area under machine learning. Malayalam (Keralite language) gesture recognition is a challenging process because of alphabet written in different ways which is more complex to write among Indian languages and the recognition task is quite difficult due to wide intrapersonal and interpersonal variation in human handwriting. Also recognition task on Malayalam language become multiplex since there are large number of classes with high similarities. Previous efforts of making Malayalam recognition more accessible have been through the inclusion of gesture recognizers through image processing. In this work, we propose a new model for handwriting gesture recognition in real time. The input of this model is a Malayalam alphabet. The aim of handwriting is to identify input gesture correctly then analysed to many process. -
Elements of South-Indian Palaeography, from the Fourth To
This is a reproduction of a library book that was digitized by Google as part of an ongoing effort to preserve the information in books and make it universally accessible. https://books.google.com ELEMENTS SOUTH-INDIAN PALfi3&BAPBY FROM THE FOURTH TO THE SEVENTEENTH CENTURY A. D. BEIN1 AN INTRODUCTION TO ?TIK STUDY OF SOUTH-INDIAN INSCRIPTIONS AND MSS. BY A. C. BURNELL HON'. PH. O. OF TUE UNIVERSITY M. K. A, ri'VORE PIS I. A SOClfcTE MANGALORE \ BASEL MISSION BOOK & TRACT DEPOSITORY ft !<3 1874 19 Vi? TRUBNER & Co. 57 & 69 LUDOATE HILL' . ' \jj *£=ggs3|fg r DISTRIBUTION of S INDIAN alphabets up to 1550 a d. ELEMENTS OF SOUTH-INDIAN PALEOGRAPHY FROM THE FOURTH TO THE SEVENTEENTH CENTURY A. D. BEING AN INTRODUCTION TO THE STUDY OF SOUTH-INDIAN INSCRIPTIONS AND MSS. BY A. p. j^URNELL HON. PH. D. OF THE UNIVERSITY OF STRASSBUB.G; M. R. A. S.; MEMBKE DE LA S0CIETE ASIATIQUE, ETC. ETC. MANGALORE PRINTED BY STOLZ & HIRNER, BASEL MISSION PRESS 1874 LONDON TRtlBNER & Co. 57 & 59 LUDGATE HILL 3« w i d m « t als ^'ctdjcn kr §anltekcit fiir Mc i|jm bdic<jcnc JJoctorMvk ttcsc fetlings^kit auf rincm fejjcr mtfrckntcn Jfclk bet 1®4 INTRODUCTION. I trust that this elementary Sketch of South-Indian Palaeography may supply a want long felt by those who are desirous of investigating the real history of the peninsula of India. Trom the beginning of this century (when Buchanan executed the only archaeological survey that has ever been done in even a part of the South of India) up to the present time, a number of well meaning persons have gone about with much simplicity and faith collecting a mass of rubbish which they term traditions and accept as history. -
Ori & Mss Library
ORI & MSS LIBRARY SCHEME AND SYLLABUS M. PHIL MANUSCRIPTOLOGY ORI & MSS LIBRARY, KARIAVATTOM Syllabus - M.Phil Manuscriptology in Malayalam, Tamil & Sanskrit M. Phil Manuscriptology Scheme and Syllabus Semester – I Sl.No. Course code Course Name Credit Marks 1 MSS 711 Paper I - Research Methodology 4 100 2. MSS 712 Paper II- Textual Criticism 4 100 3. MSS 713 Paper III - Writing and Writing materials 4 100 Semester – II 1 MSS 721 Paper IV-Dissertation + Viva voce 20 300 Total marks 600 Total credits 32 (12+20) Semester I Paper I MSS 711 Research Methodology Credit – 4 Marks - 100 Unit – I – Introduction Meaning and Definition of Research Need of Research (26 hrs) Unit – II – Types of Research (50 hrs) Unit - III – Research Process Formulation of Research Problem Hypothis Research Design Data Collection Analysis of Data Centralisation (50 hrs) Unit – IV – Structure of Research Report Preliminary Section The Text End Matter (50 hrs) Suggested Readings : 1. Research in Education - Best W John 2. Research Methodology - Kothari.C.R. 3. Gaveshana Pravacika - Dr.M.V.Vishnu Nambothiri 4. Sakithya Gaveshanathinte Reethi Sastram - Dr.D.Benjamin 5. Methodology of Research - Kulbir Singh Sidhu 6. Research Methods in Social Sciences - Sharma.R.D 7. Thesis and Assignment Writing - Anderson J Durston 8. The Elements of Research in Education - Whitmeu.F.C. 9. Arivum Anubuthiyum - P.V.Velayudhan Pillai 10. Methodology for Research - Joseph A Antony 11. Sakithya Ghaveshanam - Chattnathu Achuthanunni Paper II - MSS 712 - Textual Criticism Credit