Numerals-In-Sinhala-Language.Pdf
Total Page:16
File Type:pdf, Size:1020Kb
Recommended publications
-
Handwriting Recognition in Indian Regional Scripts: a Survey of Offline Techniques
Handwriting Recognition in Indian Regional Scripts: A Survey of Offline Techniques UMAPADA PAL, Indian Statistical Institute RAMACHANDRAN JAYADEVAN, Pune Institute of Computer Technology NABIN SHARMA, Indian Statistical Institute Offline handwriting recognition in Indian regional scripts is an interesting area of research as almost 460 million people in India use regional scripts. The nine major Indian regional scripts are Bangla (for Bengali and Assamese languages), Gujarati, Kannada, Malayalam, Oriya, Gurumukhi (for Punjabi language), Tamil, Telugu, and Nastaliq (for Urdu language). A state-of-the-art survey about the techniques available in the area of offline handwriting recognition (OHR) in Indian regional scripts will be of a great aid to the researchers in the subcontinent and hence a sincere attempt is made in this article to discuss the advancements reported in this regard during the last few decades. The survey is organized into different sections. A brief introduction is given initially about automatic recognition of handwriting and official regional scripts in India. The nine regional scripts are then categorized into four subgroups based on their similarity and evolution information. The first group contains Bangla, Oriya, Gujarati and Gurumukhi scripts. The second group contains Kannada and Telugu scripts and the third group contains Tamil and Malayalam scripts. The fourth group contains only Nastaliq script (Perso-Arabic script for Urdu), which is not an Indo-Aryan script. Various feature extraction and classification techniques associated with the offline handwriting recognition of the regional scripts are discussed in this survey. As it is important to identify the script before the recognition step, a section is dedicated to handwritten script identification techniques. -
Proposal for Generation Panel for Sinhala Script Label
Proposal for the Generation Panel for the Sinhala Script Label Generation Ruleset for the Root Zone 1 General information Sinhala belongs to the Indo-European language family with its roots deeply associated with Indo-Aryan sub family to which the languages such as Persian and Hindi belong. Although it is not very clear whether people in Sri Lanka spoke a dialect of Prakrit at the time of arrival of Buddhism in the island, there is enough evidence that Sinhala evolved from mixing of Sanskrit, Magadi (the language which was spoken in Magada Province of India where Lord Buddha was born) and local language which was spoken by people of Sri Lanka prior to the arrival of Vijaya, the founder of Sinhala Kingdom. It is also surmised that Sinhala had evolved from an ancient variant of Apabramsa (middle Indic) which is known as ‘Elu’. When tracing history of Elu, it was preceded by Hela or Pali Sihala. Sinhala though has close relationships with Indo Aryan languages which are spoken primarily in the north, north eastern and central India, was very much influenced by Tamil which belongs to the Dravidian family of languages. Though Sinhala is related closely to Indic languages, it also has its own unique characteristics: Sinhala has symbols for two vowels which are not found in any other Indic languages in India: ‘æ’ (ඇ) and ‘æ:’ (ඈ). Sinhala script had evolved from Southern Brahmi script from which almost all the Southern Indic Scripts such as Telugu and Oriya had evolved. Later Sinhala was influenced by Pallava Grantha writing of Southern India. -
Numerical Notation: a Comparative History
Numerical Notation This book is a cross-cultural reference volume of all attested numerical notation systems (graphic, nonphonetic systems for representing numbers), encompassing more than 100 such systems used over the past 5,500 years. Using a typology that defies progressive, unilinear evolutionary models of change, Stephen Chrisomalis identifies five basic types of numerical notation systems, using a cultural phylogenetic framework to show relationships between systems and to create a general theory of change in numerical systems. Numerical notation systems are primarily representational systems, not computational technologies. Cognitive factors that help explain how numerical systems change relate to general principles, such as conciseness and avoidance of ambiguity, which also apply to writing systems. The transformation and replacement of numerical notation systems relate to specific social, economic, and technological changes, such as the development of the printing press and the expansion of the global world-system. Stephen Chrisomalis is an assistant professor of anthropology at Wayne State University in Detroit, Michigan. He completed his Ph.D. at McGill University in Montreal, Quebec, where he studied under the late Bruce Trigger. Chrisomalis’s work has appeared in journals including Antiquity, Cambridge Archaeological Journal, and Cross-Cultural Research. He is the editor of the Stop: Toutes Directions project and the author of the academic weblog Glossographia. 
Numerical Notation A Comparative History Stephen Chrisomalis Wayne State University CAMBRIDGE UNIVERSITY PRESS Cambridge, New York, Melbourne, Madrid, Cape Town, Singapore, São Paulo, Delhi, Dubai, Tokyo Cambridge University Press The Edinburgh Building, Cambridge CB2 8RU, UK Published in the United States of America by Cambridge University Press, New York www.cambridge.org Information on this title: www.cambridge.org/9780521878180 © Stephen Chrisomalis 2010 This publication is in copyright. -
Miramo Mmcomposer Reference Guide
Miramo® mmComposer mmComposer Reference Guide VERSION 1.5 Copyright © 2014–2018 Datazone Ltd. All rights reserved. Miramo®, mmChart™, mmComposer™ and fmComposer™ are trademarks of Datazone Ltd. All other trademarks are the property of their respective owners. Readers of this documentation should note that its contents are intended for guidance only, and do not constitute formal offers or undertakings. ‘License Agreement’ This software, called Miramo, is licensed for use by the user subject to the terms of a License Agreement between the user and Datazone Ltd. Use of this software outside the terms of this license agreement is strictly prohibited. Unless agreed otherwise, this License Agreement grants a non-exclusive, non-transferable license to use the software programs and related documentation in this package (collectively referred to as Miramo) on licensed computers only. Any attempted sublicense, assignment, rental, sale or other transfer of the software or the rights or obligations of the License Agreement without prior written consent of Datazone shall be void. In the case of a Miramo Development License, it shall be used to develop applications only and no attempt shall be made to remove the associated watermark included in output documents by any method. The documentation accompanying this software must not be copied or re-distributed to any third-party in either printed, photocopied, scanned or electronic form. The software and documentation are copyrighted. Unless otherwise agreed in writing, copies of the software may be made only for backup and archival purposes. Unauthorized copying, reverse engineering, decompiling, disassembling, and creating derivative works based on the software are prohibited. -
N Umerations in Tke Sink Ala Language Numerations in the Sinhala Language
Numerations in the Sinhala Language by Harsha Wijayawardhana edited by Aruni Goonetilleke © Harsha Wijayawardhana 9th Lane, Nawala Road, Rajagiriya, Sri Lanka. [email protected] www.ucsc.cbm.ac.lk/sdu 2009 October ISBN- 978-955-1199-05-0 Design Sanjaya Epa Senevirathna Information and Communication Technology Agency of Sri Lanka All rights reserved. No part of this document may be reproduced or transmitted in any form or by any means without prior written permission from ICTA. Published Strategic Communications and Media Unit - ICTA 160/24, Kirimandala Mawatha, Colombo 05, Sri Lanka. TP: +94 11 2369099 FAX:+ 94 11 2369091 email: [email protected] web: www.icta.lk To my parents and to my daughter Panchali. Preface The research into Sinhala numerals that ICTA initiated has yielded the fact that the Sinhala language had several sets of numerals, of which two sets had been widely used: one set (Sinhala Illakkam) was in use up to the early part of the nineteenth century, and the other set (Lith Illakkam) was in use well into the 20th century. The latter set clearly includes a zero and a zero place holder. ICTA’s Local Language Working Group, after reviewing the research and after extensive discussions with experts and stakeholders, agreed that these two sets should be encoded in the Sri Lanka Standard Sinhala Character Code for Information Interchange (SLS 1134 : 2004), in the Unicode standard and in ISO/IEC 10646 (the Universal Character set). -
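Because the preface describes Lith Illakkam as a set with a zero and a zero place holder, it can be handled like any positional decimal system. The following is a minimal sketch, not taken from the book: it assumes the code points U+0DE6 to U+0DEF, which later versions of the Unicode Standard assign to the Sinhala Lith digits zero to nine, and the helper names `lith_to_int` and `int_to_lith` are hypothetical.

```python
# Assumption: U+0DE6..U+0DEF are SINHALA LITH DIGIT ZERO..NINE
# (assigned in later Unicode versions; not stated in the excerpt above).
LITH_ZERO = 0x0DE6


def lith_to_int(text: str) -> int:
    """Read a string of Lith digits as a base-10 place-value number."""
    value = 0
    for ch in text:
        digit = ord(ch) - LITH_ZERO
        if not 0 <= digit <= 9:
            raise ValueError(f"not a Sinhala Lith digit: {ch!r}")
        # Each new digit shifts the earlier digits one decimal place left.
        value = value * 10 + digit
    return value


def int_to_lith(number: int) -> str:
    """Write a non-negative integer with Lith digits."""
    if number < 0:
        raise ValueError("only non-negative integers are supported")
    return "".join(chr(LITH_ZERO + int(d)) for d in str(number))


if __name__ == "__main__":
    year = int_to_lith(2009)
    print(year, lith_to_int(year))
```

The sketch covers only the positional Lith set; the older Sinhala Illakkam set mentioned in the preface is not handled here.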
Elements of South-Indian Palaeography, from the Fourth To
This is a reproduction of a library book that was digitized by Google as part of an ongoing effort to preserve the information in books and make it universally accessible. https://books.google.com [Map: Distribution of S. Indian alphabets up to 1550 A.D.] ELEMENTS OF SOUTH-INDIAN PALAEOGRAPHY FROM THE FOURTH TO THE SEVENTEENTH CENTURY A. D. BEING AN INTRODUCTION TO THE STUDY OF SOUTH-INDIAN INSCRIPTIONS AND MSS. BY A. C. BURNELL HON. PH. D. OF THE UNIVERSITY OF STRASSBURG; M. R. A. S.; MEMBRE DE LA SOCIÉTÉ ASIATIQUE, ETC. ETC. MANGALORE PRINTED BY STOLZ & HIRNER, BASEL MISSION PRESS 1874 LONDON TRÜBNER & Co. 57 & 59 LUDGATE HILL INTRODUCTION. I trust that this elementary Sketch of South-Indian Palaeography may supply a want long felt by those who are desirous of investigating the real history of the peninsula of India. From the beginning of this century (when Buchanan executed the only archaeological survey that has ever been done in even a part of the South of India) up to the present time, a number of well meaning persons have gone about with much simplicity and faith collecting a mass of rubbish which they term traditions and accept as history. -
Year 5 Maths – Summer 2 Week Beginning: 29.6.20 LINK
Year 5 maths – Summer 2 Week beginning: 29.6.20 Lesson 3 of 12 Lesson 1 of 2 Lesson 2 of 2 Lesson 1 of 12 Lesson 2 of 12 CONSOLIDATION LESSON Roman Numerals Roman Numerals CONSOLIDATION LESSON CONSOLIDATION LESSON Theme Word problems To write Roman numerals to To write thousands numbers in Formal methods Formal methods To solve addition and 1000 Roman numerals Addition within 1,000,000 Addition and subtraction subtraction word problems Factual Practise comparing numbers Practise choosing multiples Practise multiplication patterns Practise estimating products Practise estimating products fluency (to aid fluency) using multiplication activity activity activity activity activity (Lesson 1 resources below) (Lesson 2 resources below) (Lesson 3 resources below) (Lesson 4 resources below) (Lesson 5 resources below) MAKING LINKS: Last week we MAKING LINKS: Yesterday we MAKING LINKS: Earlier in the year MAKING LINKS: Yesterday we MAKING LINKS: This week we solved problems involving volume. wrote Roman numerals up to we worked with formal addition worked with formal addition worked with formal addition and Today we will be writing Roman 1000.Today we will be writing methods. Today we will be methods. Today we will be subtraction methods. Today we numerals up to 1000. 1000s using Roman numerals. continuing with that. continuing with that. will be using these to solve word problems THINK: (support below) THINK: (support below) THINK: (support below) THINK: (support below) Can you help me with this Can you help me with this Can you help me with this Can you help me with this THINK: (support below) problem? My friend says all problem? We sometimes see problem? My friend has digit problem? My friend has digit Can you help me with this Roman numerals are based Roman numerals on buildings to cards: 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. -
Numbering Systems Developed by the Ancient Mesopotamians
Emergent Culture 2011 August http://emergent-culture.com/2011/08/ Legend of 2012 Wednesday, August 31, 2011 Cosmic Time Meets Earth Time: The Numbers of Supreme Wholeness and Reconciliation Revealed In the process of writing about the precessional cycle I fell down a rabbit hole of sorts, and in the process of finding my way around I made what I think are 4 significant discoveries about cycles of time and the numbers that underlie and unify cosmic and earthly time. [Image: a painting by Salvador Dali.] Discovery number 1: It turns out that clocks are not as bad as we think them to be. The units of time that segment the day into hours, minutes and seconds are in fact reconciled by the units of time that compose the Meso American Calendrical system, or MAC for short. It was a surprise to me because one of the world’s foremost authorities in calendrical science, the late Dr. Jose Arguelles, had vilified the numbers of Western timekeeping as a most grievous error. So much so that he attributed much of the world's problems to the use of the 12 month calendar and the 24 hour, 60 minute, 60 second day, also known by its handy acronym 12-60 time. I never bought into his argument that the use of those time factors was at fault for our largely miserable human-planetary condition. But I was content to dismiss mechanized time as nothing more than a convenient tool to facilitate the activities of complex societies. -
The Unicode Standard, Version 3.0, Issued by the Unicode Consortium and Published by Addison-Wesley
The Unicode Standard Version 3.0 The Unicode Consortium ADDISON–WESLEY An Imprint of Addison Wesley Longman, Inc. Reading, Massachusetts · Harlow, England · Menlo Park, California Berkeley, California · Don Mills, Ontario · Sydney Bonn · Amsterdam · Tokyo · Mexico City Many of the designations used by manufacturers and sellers to distinguish their products are claimed as trademarks. Where those designations appear in this book, and Addison-Wesley was aware of a trademark claim, the designations have been printed in initial capital letters. However, not all words in initial capital letters are trademark designations. The authors and publisher have taken care in preparation of this book, but make no expressed or implied warranty of any kind and assume no responsibility for errors or omissions. No liability is assumed for incidental or consequential damages in connection with or arising out of the use of the information or programs contained herein. The Unicode Character Database and other files are provided as-is by Unicode®, Inc. No claims are made as to fitness for any particular purpose. No warranties of any kind are expressed or implied. The recipient agrees to determine applicability of information provided. If these files have been purchased on computer-readable media, the sole remedy for any claim will be exchange of defective media within ninety days of receipt. Dai Kan-Wa Jiten used as the source of reference Kanji codes was written by Tetsuji Morohashi and published by Taishukan Shoten. ISBN 0-201-61633-5 Copyright © 1991-2000 by Unicode, Inc. All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, electronic, mechanical, photocopying, recording or otherwise, without the prior written permission of the publisher or Unicode, Inc. -
AL ICT TIM English Cover Page.Pmd Ed.Pmd
Information and Communication Technology GRADE 12 Teacher's Instruction Manual (Implementation in 2009) Department of Information Technology Faculty of Science and Technology National Institute of Education 2009 Introduction Curriculum developers of the NIE were able to introduce Competency Based Learning and Teaching curricula for grades 6 and 10 in 2007, extend it to Grades 7, 8 and 11 progressively every year, and even to GCE (A/L) classes in 2009. In the same manner, syllabi and Teachers' Instructional Manuals for grades 12 and 13 for different subjects, along with competencies and competency levels that should be developed in students, are presented descriptively. Information given on each subject will immensely help teachers to prepare for the Learning – Teaching situations. I would like to mention that curriculum developers have followed a different approach when preparing Teachers' Instructional Manuals for Advanced Level subjects compared to the approaches they followed in preparing Junior Secondary and Senior Secondary curricula (Grades 10, 11). In grades 6, 7, 8, 9, 10 and 11 teachers were oriented to a given format as to how they should handle the subject matter in the Learning – Teaching process, but in designing AL syllabi and Teacher’s Instructional Manuals freedom is given to the teachers to work as they wish. At this level, we expect teachers to use a suitable learning method from the suggested learning methods given in the Teacher’s Instructional Manuals to develop competencies and competency levels relevant to each lesson or lesson unit. Whatever the learning approach the teacher uses, it should be done effectively and satisfactorily, to realize the expected competencies and competency levels. -
Building a Wordnet for Sinhala
Building a WordNet for Sinhala Indeewari Wijesiri Malaka Gallage University of Moratuwa University of Moratuwa Moratuwa, Sri Lanka Moratuwa, Sri Lanka [email protected] [email protected] Buddhika Gunathilaka Madhuranga Lakjeewa University of Moratuwa University of Moratuwa Moratuwa, Sri Lanka Moratuwa, Sri Lanka [email protected] [email protected] Daya C. Wimalasuriya Gihan Dias University of Moratuwa University of Moratuwa Moratuwa, Sri Lanka Moratuwa, Sri Lanka [email protected] [email protected] Rohini Paranavithana Nisansa de Silva University of Colombo University of Moratuwa Colombo, Sri Lanka Moratuwa, Sri Lanka [email protected] Abstract Sinhala is one of the official languages of Sri Lanka and is used by over 19 million people. It belongs to the Indo-Aryan branch of the Indo-European languages and its origins date back to at least 2000 years. It has developed into its current form over a long period of time with influences from a wide variety of languages including Tamil, Portuguese and English. 1 Introduction Despite being used by over 19 million people and being one of the official languages of Sri Lanka, there has not been much progress in developing natural language processing (NLP) applications for the Sinhala language. This is partly due to the lack of commercial interest on developing Sinhala NLP applications on a global scale. For instance, as of now, neither Google Translate 1 nor Google News 2 is available for Sinhala while both are available in Hindi and -
A Study on Collation of Languages from Developing Asia
A Study on Collation of Languages from Developing Asia Sarmad Hussain Nadir Durrani Center for Research in Urdu Language Processing National University of Computer and Emerging Sciences www.nu.edu.pk www.idrc.ca Published by Center for Research in Urdu Language Processing National University of Computer and Emerging Sciences Lahore, Pakistan Copyrights © International Development Research Center, Canada Printed by Walayatsons, Pakistan ISBN: 978-969-8961-03-9 This work was carried out with the aid of a grant from the International Development Research Centre (IDRC), Ottawa, Canada, administered through the Centre for Research in Urdu Language Processing (CRULP), National University of Computer and Emerging Sciences (NUCES), Pakistan. ii Preface Defining collation, or what is normally termed as alphabetical order or less frequently as lexicographic order, is one of the first few requirements for enabling computing in any language, second only to encoding, keyboard and fonts. It is because of this critical dependence of computing on collation that its definition is included within the locale of a language. Collation of all written languages are defined in their dictionaries, developed over centuries, and are thus very representative of cultural tradition. However, though it is well understood in these cultures, it is not always thoroughly documented or well understood in the context of existing character encodings, especially the Unicode. Collation is a complex phenomenon, dependent on three factors: script, language and encoding. These factors interact in a complicated fashion to uniquely define the collation sequence for each language. This volume aims to address the complex algorithms needed for sorting out the words in sequence for a subset of the languages.
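The preface's point that collation depends on language as well as encoding can be illustrated with a small sketch. It is not taken from the study: the alphabet, the word list and the helper `make_sort_key` are hypothetical, and a Spanish-like ordering is used only because it is compact. It shows that sorting by raw code point and sorting by a language-defined alphabet can disagree.

```python
# Hypothetical alphabet for illustration: "ñ" is a separate letter
# that sorts directly after "n", as in a Spanish dictionary.
ALPHABET = "abcdefghijklmnñopqrstuvwxyz"


def make_sort_key(alphabet: str):
    """Build a sort key that ranks letters by their position in `alphabet`."""
    rank = {ch: i for i, ch in enumerate(alphabet)}

    def key(word: str) -> list:
        # Letters outside the alphabet sort after all known letters,
        # ordered among themselves by code point.
        return [rank.get(ch, len(alphabet) + ord(ch)) for ch in word]

    return key


words = ["oso", "ñu", "nube"]

# Code point order: "ñ" (U+00F1) lands after every ASCII letter.
by_code_point = sorted(words)

# Alphabet order: "ñ" lands between "n" and "o".
by_alphabet = sorted(words, key=make_sort_key(ALPHABET))

if __name__ == "__main__":
    print(by_code_point)
    print(by_alphabet)
```

This is a single-level comparison only. Real collation for the scripts the study covers typically needs more, such as multiple comparison levels and handling of combining marks, which this sketch does not attempt.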