Malayalam Standardization Report 2.Pdf
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Computing in Indian Languages for Knowledge Management: Technology Perspectives and Linguistic Issues
Informatics Studies. ISSN 2320 – 530X. Vol. 3, Issue 4, Fourth Quarterly Issue. October – December, 2016. P 31-44 Computing in Indian Languages for Knowledge Management: Technology Perspectives and Linguistic Issues Ram Awathar Ojha Abstract The paper is the first among a series of documents on computer applications in Indian languages to be published in the journal. Traditionally, computer applications were based on English as the medium of interaction with the system. The technology is not yet viable to Indian languages. Majority of Indian population does not speak or write in or understand English and this weakness of technology hinders the attempts to use computers for education and literacy, meant for majority of the population who should get the benefit of Information Technology. In many parts of the world computer applications have been developed in different regional languages, appropriate to the user communities. This introductory paper gives a perspective of research and development on computational linguistics, the phonetic aspect of Indian languages, transliteration standards, lack of uniformity, data entry methods, keyboards, phonetic mapping of the vowels and consonants, web pages supporting display of Indian language text etc in the context of computing in Indian languages. Keywords: Computational Linguistics, Indian Languages, Telugu, Kannada, Phonetics, Alphabets, vowels, consonants, syllables, transliteration standards, data entry methods, keyboards, , web pages This paper covers Writing Systems Codes for Systems (MLS) in the context of this the Aksharas, The Phonetic Aspect of Indian discussion. Typically, an MLS will permit Languages, The importance of users to interact with computers in their own Transliteration, Lack of Uniformity Between mother tongue. -
Updated Proposal to Encode Tulu-Tigalari Script in Unicode
Updated proposal to encode Tulu-Tigalari script in Unicode Vaishnavi Murthy Kodipady Yerkadithaya [ [email protected] ] Vinodh Rajan [ [email protected] ] 04/03/2021 Document History & Background Documents : (This document replaces L2/17-378) L2/11-120R Preliminary proposal for encoding the Tulu script in the SMP of the UCS – Michael Everson L2/16-241 Preliminary proposal to encode Tigalari script – Vaishnavi Murthy K Y L2/16-342 Recommendations to UTC #149 November 2016 on Script Proposals – Deborah Anderson, Ken Whistler, Roozbeh Pournader, Andrew Glass and Laurentiu Iancu L2/17-182 Comments on encoding the Tigalari script – Srinidhi and Sridatta L2/18-175 Replies to Script Ad Hoc Recommendations (L2/16-342) and Comments (L2/17-182) on Tigalari proposal (L2/16-241) – Vaishnavi Murthy K Y L2/17-378 Preliminary proposal to encode Tigalari script – Vaishnavi Murthy K Y, Vinodh Rajan L2/17-411 Letter in support of preliminary proposal to encode Tigalari – Guru Prasad (Has withdrawn support. This letter needs to be disregarded.) L2/17-422 Letter to Vaishnavi Murthy in support of Tigalari encoding proposal – A. V. Nagasampige L2/18-039 Recommendations to UTC #154 January 2018 on Script Proposals – Deborah Anderson, Ken Whistler, Roozbeh Pournader, Lisa Moore, Liang Hai, and Richard Cook PROPOSAL TO ENCODE TIGALARI SCRIPT IN UNICODE 2 A note on recent updates : −−−Tigalari Script is renamed Tulu-Tigalari script. The reason for the same is discussed under section 1.1 (pp. 4-5) of this paper & elaborately in the supplementary paper Tulu Language and Tulu-Tigalari script (pp. 5-13). −−−This proposal attempts to harmonize the use of the Tulu-Tigalari script for Tulu, Sanskrit and Kannada languages for archival use. -
Neo-Vernacularization of South Asian Languages
LLanguageanguage EEndangermentndangerment andand PPreservationreservation inin SSouthouth AAsiasia ed. by Hugo C. Cardoso Language Documentation & Conservation Special Publication No. 7 Language Endangerment and Preservation in South Asia ed. by Hugo C. Cardoso Language Documentation & Conservation Special Publication No. 7 PUBLISHED AS A SPECIAL PUBLICATION OF LANGUAGE DOCUMENTATION & CONSERVATION LANGUAGE ENDANGERMENT AND PRESERVATION IN SOUTH ASIA Special Publication No. 7 (January 2014) ed. by Hugo C. Cardoso LANGUAGE DOCUMENTATION & CONSERVATION Department of Linguistics, UHM Moore Hall 569 1890 East-West Road Honolulu, Hawai’i 96822 USA http:/nflrc.hawaii.edu/ldc UNIVERSITY OF HAWAI’I PRESS 2840 Kolowalu Street Honolulu, Hawai’i 96822-1888 USA © All text and images are copyright to the authors, 2014 Licensed under Creative Commons Attribution Non-Commercial No Derivatives License ISBN 978-0-9856211-4-8 http://hdl.handle.net/10125/4607 Contents Contributors iii Foreword 1 Hugo C. Cardoso 1 Death by other means: Neo-vernacularization of South Asian 3 languages E. Annamalai 2 Majority language death 19 Liudmila V. Khokhlova 3 Ahom and Tangsa: Case studies of language maintenance and 46 loss in North East India Stephen Morey 4 Script as a potential demarcator and stabilizer of languages in 78 South Asia Carmen Brandt 5 The lifecycle of Sri Lanka Malay 100 Umberto Ansaldo & Lisa Lim LANGUAGE ENDANGERMENT AND PRESERVATION IN SOUTH ASIA iii CONTRIBUTORS E. ANNAMALAI ([email protected]) is director emeritus of the Central Institute of Indian Languages, Mysore (India). He was chair of Terralingua, a non-profit organization to promote bi-cultural diversity and a panel member of the Endangered Languages Documentation Project, London. -
Voice Key Board: Multimodal Indic Text Input
Voice Key Board: Multimodal Indic Text Input Prasenjit Dey Ramchandrula Rahul Ajmera Kalika Bali Hewlett-Packard Labs Sitaram Human Factors International Microsoft Research India nd 24, Salarpuria Arena Hewlett-Packard Labs 310/6, H.R Complex “Scientia”, 196/36 2 Main Adugodi, Hosur Road 24, Salarpuria Arena Koramangala, 5th Block Sadashivnagar, Bangalore-560030, India Adugodi, Hosur Road Bangalore 560095, India Bangalore-560080, India [email protected] Bangalore-560030, India [email protected] [email protected] [email protected] ABSTRACT intimidating because they use modalities which are natural to our Multimodal systems, incorporating more natural input modalities everyday human communication. We are therefore motivated to like speech, hand gesture, facial expression etc., can make human- explore the use of multiple modalities to address the problem of computer-interaction more intuitive by drawing inspiration from Indic text input. spontaneous human-human-interaction. We present here a multimodal input device for Indic scripts called the Voice Key Indic scripts such as Devanagari, used for Hindi and other Indic Board (VKB) which offers a simpler and more intuitive method languages, are defined as “syllabic alphabets” in that the unit of for input of Indic scripts. VKB exploits the syllabic nature of Indic encoding is a syllable, however the corresponding graphic units language scripts and exploits the user’s mental model of Indic show distinctive internal structure and a constituent set of scripts wherein a base consonant character is modified by graphemes. The formative principles behind them may be different vowel ligatures to represent the actual syllabic character. summarized as follows [2]: We also present a user evaluation result for VKB comparing it - graphemes for independent (initial) Vs (vowel) with the most common input method for the Devanagari script, - C (consonant) graphemes with inherent neutral vowel a the InScript keyboard. -
Ori & Mss Library
ORI & MSS LIBRARY SCHEME AND SYLLABUS M. PHIL MANUSCRIPTOLOGY ORI & MSS LIBRARY, KARIAVATTOM Syllabus - M.Phil Manuscriptology in Malayalam, Tamil & Sanskrit M. Phil Manuscriptology Scheme and Syllabus Semester – I Sl.No. Course code Course Name Credit Marks 1 MSS 711 Paper I - Research Methodology 4 100 2. MSS 712 Paper II- Textual Criticism 4 100 3. MSS 713 Paper III - Writing and Writing materials 4 100 Semester – II 1 MSS 721 Paper IV-Dissertation + Viva voce 20 300 Total marks 600 Total credits 32 (12+20) Semester I Paper I MSS 711 Research Methodology Credit – 4 Marks - 100 Unit – I – Introduction Meaning and Definition of Research Need of Research (26 hrs) Unit – II – Types of Research (50 hrs) Unit - III – Research Process Formulation of Research Problem Hypothis Research Design Data Collection Analysis of Data Centralisation (50 hrs) Unit – IV – Structure of Research Report Preliminary Section The Text End Matter (50 hrs) Suggested Readings : 1. Research in Education - Best W John 2. Research Methodology - Kothari.C.R. 3. Gaveshana Pravacika - Dr.M.V.Vishnu Nambothiri 4. Sakithya Gaveshanathinte Reethi Sastram - Dr.D.Benjamin 5. Methodology of Research - Kulbir Singh Sidhu 6. Research Methods in Social Sciences - Sharma.R.D 7. Thesis and Assignment Writing - Anderson J Durston 8. The Elements of Research in Education - Whitmeu.F.C. 9. Arivum Anubuthiyum - P.V.Velayudhan Pillai 10. Methodology for Research - Joseph A Antony 11. Sakithya Ghaveshanam - Chattnathu Achuthanunni Paper II - MSS 712 - Textual Criticism Credit -
Text Input Methods for Indian Languages Sowmya Vajjala, International Institute of Information Technology
Iowa State University From the SelectedWorks of Sowmya Vajjala 2011 Text Input Methods for Indian Languages Sowmya Vajjala, International Institute of Information Technology Available at: https://works.bepress.com/sowmya-vajjala/3/ TEXT INPUT METHODS FOR INDIAN LANGUAGES By Sowmya V.B. 200607014 A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF Master of Science (by Research) in Computer Science & Engineering Search and Information Extraction Lab Language Technologies Research Center International Institute of Information Technology Hyderabad, India September 2008 Copyright c 2008 Sowmya V.B. All Rights Reserved Dedicated to all those people, living and dead, who are directly or indirectly responsible to the wonderful life that I am living now. INTERNATIONAL INSTITUTE OF INFORMATION TECHNOLOGY Hyderabad, India CERTIFICATE It is certified that the work contained in this thesis, titled “ Text input methods for Indian Languages ” by Sowmya V.B. (200607014) submitted in partial fulfillment for the award of the degree of Master of Science (by Research) in Computer Science & Engineering, has been carried out under my supervision and it is not submitted elsewhere for a degree. Date Advisor : Dr. Vasudeva Varma Associate Professor IIIT, Hyderabad Acknowledgements I would like to first express my gratitude to my advisor Dr Vasudeva Varma, for being with me and believing in me throughout the duration of this thesis work. His regular suggestions have been greatly useful. I thank Mr Prasad Pingali for his motivation and guidance during the intial phases of my thesis. I thank Mr Bhupal Reddy for giving me the first lessons in my research. I entered IIIT as a novice to Computer Science in general and research in particular. -
Non-Dravidian Elements and (Non)Diasystematic Change in Malayalam
To appear in Constructions in Contact 2: Language change, multilingual practices, and additional language acquisition eds. Höder, S. & Boas, H. Title: Non-Dravidian elements and (non)diasystematic change in Malayalam Savithry Namboodiripad (University of Michigan, Ann Arbor) [email protected] Abstract: This chapter applies a Diasystematic Construction Grammar (DCxG) approach to account for non-Dravidian vocabulary and phonology in Malayalam, a high-contact Dravidian language. The distinction made in DCxG between diaconstructions, which are language non-specific, and idioconstructions, which are language-specific, proves useful in accounting for semantic specialization and phonological heterogeneity due to language contact. Notably, increased contact with English has led in some cases to decreased phonological adaptation, as some constructions change from diaconstructions to idioconstructions: Non-diasystematic change. Taken together, this chapter argues that any analysis of Malayalam must account for non-Dravidian subpatterns, and including language labels as part of speakers' linguistic knowledge enhances our understanding of the dynamics of language contact. Keywords: Malayalam, English, language contact, language change, semantic specialization, loanword adaptation, historical linguistics, sociolinguistics 1 High-contact languages are not categorically different By introducing a distinction between diaconstructions, which are language non-specific, and idioconstructions, which are language-specific, Diasystematic Construction Grammar (DCxG) captures the fact that borders between languages, while often blurred, also shape language use (Höder 2012). Building on other construction-theoretic approaches, DCxG characterizes all languages as consisting of emergent subpatterns of constructions. This includes hybrid or high-contact languages, which differ only in that those subpatterns can be attributed (by analysts, not necessarily speakers) as belonging to a particular language. -
Locale Database
International Language Environments Guide Part No: 817–2521–11 November 2010 Copyright © 2005, 2010, Oracle and/or its affiliates. All rights reserved. This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means. Reverse engineering, disassembly, or decompilation of this software, unless required by law for interoperability, is prohibited. The information contained herein is subject to change without notice and is not warranted to be error-free. If you find any errors, please report them to us in writing. If this is software or related software documentation that is delivered to the U.S. Government or anyone licensing it on behalf of the U.S. Government, the following notice is applicable: U.S. GOVERNMENT RIGHTS Programs, software, databases, and related documentation and technical data delivered to U.S. Government customers are “commercial computer software” or “commercial technical data” pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such, the use, duplication, disclosure, modification, and adaptation shall be subject to the restrictions and license terms setforth in the applicable Government contract, and, to the extent applicable by the terms of the Government contract, the additional rights set forth in FAR 52.227-19, Commercial Computer Software License (December 2007). Oracle America, Inc., 500 Oracle Parkway, Redwood City, CA 94065. -
Book Publishing in Malayalam the Beginnings
REPORT OF THE MRP TITLED BOOK PUBLISHING IN MALAYALAM THE BEGINNINGS DR BABU CHERIAN DEPARTMENT OF MALAYALAM CMS COLLEGE KOTTAYAM KERALA NO. F. MRP (H)-602/ 08-09/KLMG OO2/ UGC-SWRO/ DT. 30 MARCH 09 CONTENTS Chapter 1 An Introduction to Printing and Book-Publishing Chapter 2 The beginning of Printing in Malayalam that led to Book-Publishing Chapter 3 Malayalam Book Publishing and the first Malayalam Book Published in Kerala Chapter 4 Conclusion Chapter 1 An Introduction to Printing and Book-Publishing Printing is the technology that has recorded the history of all that has happened in the world from the past to the present. Yet, regarding the history of printing, especially the history of the very early days of the technology, not much is known. The invention of printing is a landmark in the history of the World’s civilization, not merely of book- publishing or the dissemination of knowledge. “Printing is the art and technology of recreating, on paper or fabric or on any other surface, pictures or words. Though a variety of printed objects are commonplace in modern times, the principal function of printing is the dissemination of thought and knowledge. Printed words and pictures have become an integral and important part of schools, libraries, education of individuals, intellectual growth and the popular news media” (Encyclopaedia Americana–Printing’). Though there were books when printing was unknown, the world then was the world of the illiterate societies, unable to read or write. “The greatest factor that was instrumental in making Europe civilized, was printing. If Asia and Africa are to see the light and achieve progress as Europe did, at one time or other, it will be, to a great extent, probably through printing”. -
Malayalam Keyboard Layout Download.Pdf
Malayalam keyboard layout download LINK TO DOWNLOAD Download Malayalam Keyboard. Sponsored. Malayalam inscript keyboard layout is used to type Malayalam in unicode. Inscrpit is developed by C-Dac for all Indian Languages. Malayalam Keyboard layout in inscript methode help to type Malayalam very fast. The Malayalam Keyboard layout is . This keyboard lets you type in Malayalam with the standardised Inscript layout. It is easy to use for people familiar with Malayalam, or with other Inscript keyboards. The keyboard uses a normal English (QWERTY) keyboard. If a special font is needed for this language, most computers will download it . Loading the keyboard layout, please wait. malayalam keyboard free download - Malayalam Keyboard, Malayalam Keyboard, Malayalam Keyboard, and many more programs. Malayalam Keyboard is a simply keyboard that have English letter keyboard layout with different Malayalam typing keyboard themes and size. User can change theOperating System: Android. Download malayalam keyboard for windows 10 for free. Education software downloads - Madhuri Malayalam Typing by unknown and many more programs are available for instant and free download. Download malayalam typing keyboard for pc for free. Education software downloads - Madhuri Malayalam Typing by unknown and many more programs are available for instant and free download. This Malayalam Keyboard enables you to easily type Malayalam online without installing Malayalam renuzap.podarokideal.ru can use your computer keyboard or mouse to type Malayalam letters with this online keyboard. Pressing Esc on the Malayalam keyboard layout will toggle the mouse input between virtual QWERTY keyboard and virtual Malayalam keyboard. The key will also turn on/off your keyboard . Online Malayalam Keyboard. -
Malayalam Reph with ZWJ Joiner
Malayalam Reph with ZWJ joiner N. Ganesan ([email protected]) 1.0 Virama as a Vowel-Reducer in Indian scripts: Virama in South Indian scripts have a function: Virama acts as a vowel-reducer, but never upon dead consonants like Malayalam Chillus. Normally, Virama reduces the inherent vowel /a/ to zero. In Tamil, Virama has an additional vowel-reducer function: it acts upon both /i/ and /u/ maatras visually to reduce their vowel length, called short-i and short-u. In Malayalam, short-u, called samvruthokaram, also employs Virama visually to reduce /u/ vowel length. In sum, Virama acts as a vowel reducer in all situations in Indic scripts, whether the vowel is /a/, /i/ or /u/. Because in Indian scripts like Tamil Grantha and Malayalam, chillu dead consonants are never followed by either a vowel maatra or a viraama, it is not necessary to have Unicode sequences like <Chillu, Virama, ..> in 5.1. If done, it will radically alter the properties of Virama in Indian scripts including Malayalam. 2.0 Kerala Government Order (G.O.) giving the Relationship between Malayalam Dot-Reph (Old orthography) and Chillu R/RR (Reformed Orthography). In Malayalam script, reph form is obsolete just as in Gurmukhi(Punjabi) and Telugu scripts. Kerala Government Order, originally issued in 1968, is a governing document advising on the rules of how old orthography letters need to be treated in the reformed orthography. This Kerala G.O rule clearly advises users to use Chillu R/RR (ʼn) in place of Malayalam dot-reph when reformed orthography is needed. -
General Historical and Analytical / Writing Systems: Recent Script
9 Writing systems Edited by Elena Bashir 9,1. Introduction By Elena Bashir The relations between spoken language and the visual symbols (graphemes) used to represent it are complex. Orthographies can be thought of as situated on a con- tinuum from “deep” — systems in which there is not a one-to-one correspondence between the sounds of the language and its graphemes — to “shallow” — systems in which the relationship between sounds and graphemes is regular and trans- parent (see Roberts & Joyce 2012 for a recent discussion). In orthographies for Indo-Aryan and Iranian languages based on the Arabic script and writing system, the retention of historical spellings for words of Arabic or Persian origin increases the orthographic depth of these systems. Decisions on how to write a language always carry historical, cultural, and political meaning. Debates about orthography usually focus on such issues rather than on linguistic analysis; this can be seen in Pakistan, for example, in discussions regarding orthography for Kalasha, Wakhi, or Balti, and in Afghanistan regarding Wakhi or Pashai. Questions of orthography are intertwined with language ideology, language planning activities, and goals like literacy or standardization. Woolard 1998, Brandt 2014, and Sebba 2007 are valuable treatments of such issues. In Section 9.2, Stefan Baums discusses the historical development and general characteristics of the (non Perso-Arabic) writing systems used for South Asian languages, and his Section 9.3 deals with recent research on alphasyllabic writing systems, script-related literacy and language-learning studies, representation of South Asian languages in Unicode, and recent debates about the Indus Valley inscriptions.