Classifying Glyphs Comparing Evolution and Learning

Total Page:16

File Type:pdf, Size:1020Kb

Classifying Glyphs Comparing Evolution and Learning ABCDEFGHIJKLMNOPQRSTUVWXYZ234+? TDT4500 Intelligent Systems, Specialization Project Autumn 2010 Evolving Neural Networks Classifying Glyphs Comparing Evolution and Learning Tiril Anette Langfeldt Rødland Supervisor: Keith L. Downing abcdefghijklmnopqrstuvwxyz567 Abstract This dissertation investigates the classification capabilities of artificial neural networks. The goal is to generalize over the features of a writing system, and thus classify the writing system of a previously unseen glyph. Evolution and learning were both used to tune the weights of a network. This was to investigate which of the methods is best suited for creation of this kind of generalization and classification networks. The difference was remarkable, as the evolved network displayed absolutely no ability to classify the glyphs. However, the learned network managed to clearly separate between the writing systems, resulting in correct classifications. It can thus be concluded that artificial neural networks are able to correctly classify the writing system of a glyph, and that learning by backpropagation is the ultimate method for creating such networks. i ii Preface This dissertation is the pre-study project of the master dissertation. It is written at the Department of Computer and Information Science, NTNU, during the autumn semester of 2010. The original plan was to find an Egyptological dissertation subject, due to my long-lasting interest in the field. I appreciate the assistance from Vidar Edland, Norwegian Egyptology Society, for helping me in the search for a suitable subject. However, the search came up empty. Inspired by Dehaene (2009), Egyptology turned to hieroglyphs, and then other writing systems joined in. Classification seemed to be a natural choice of task, with regard to different writing systems and neural networks. This revealed an interesting subject on which there seemingly had been done no previous research. I like to think it still is a relevant problem. At least it can be used to show off the generalization abilities of artificial neural networks. I wish to thank my supervisor Professor Keith L. Downing at the Depart- ment of Computer and Information Science, Norwegian University of Science and Technology, for his invaluable assistance. Special thanks are given to John A. Bullinaria, for his immense help in debug- ging, and for sharing of his infinite wisdom regarding neural networks. Gratitude is given to Arne Dag Fidjestøl and Aslak Raanes in the System Administration Group, Department of Computer and Information Science, for giving me access to hardware on which I could run the evolution. I am also very grateful to Cyrus Harmon, Edi Weitz, David Lichteblau, and all the other Common Lisp programmers out there, whose libraries saved me from months of extra work. Trondheim, December 2010 Tiril Anette Langfeldt Rødland iii iv Contents Chapter 1 : Introduction 1 Chapter 2 : Background 3 2.1 Writing Systems ........................... 3 2.2 Classification of Glyphs ....................... 5 2.3 Artificial Neural Network ...................... 6 2.3.1 The XOR Example ...................... 6 2.4 Training ANNs ............................ 9 2.4.1 Training ANNs Using Supervised Learning ......... 9 2.4.2 Backpropagation ....................... 10 2.4.3 Fast Backpropagation .................... 12 2.4.4 Tricks for Multi-Class Classification Problems ....... 13 2.5 Evolving ANNs ............................ 14 2.5.1 Evolving Neural Networks .................. 15 Chapter 3 : Methodology 17 3.1 Glyphs and Writing Systems ..................... 17 3.2 Neural Network Design ........................ 20 3.3 Learning ................................ 21 3.4 Evolution ............................... 22 Chapter 4 : Results and Analysis 25 4.1 Learning ................................ 25 4.1.1 Pair Classification ...................... 25 4.1.2 Full Classification ....................... 27 4.1.3 Writing System Classification Quality ........... 27 4.2 Evolution ............................... 30 Chapter 5 : Discussion 35 5.1 Learning or Evolution? ........................ 35 5.2 Writing Systems ........................... 35 5.3 Future Work ............................. 36 References 39 Software Library 43 Appendix A : Unicode Charts A-1 v Appendix B : Classification B-1 B.1 Pair Classification ........................... B-1 B.1.1 Training ............................ B-1 B.1.2 Testing ............................ B-6 B.2 Full Classification ........................... B-29 B.2.1 Learning ............................ B-29 B.2.2 Evolution ........................... B-31 Appendix C : Data Set Distribution C-1 Appendix D: Pixel Comparison D-1 vi List of Figures 2.1 A simple ANN ............................ 7 2.2 Activation function .......................... 7 2.3 XOR ANN .............................. 8 2.4 Backpropagation ANN ........................ 11 2.5 The evolution cycle .......................... 15 3.1 ANN: From Unicode to output values ................ 19 4.1 Learning: full classification ...................... 28 4.2 Correlation .............................. 30 4.3 Evolution: full classification ..................... 32 A.1 Coptic ................................. A-2 A.2 Runic ................................. A-3 A.3 Hebrew ................................ A-4 A.4 Arabic ................................. A-5 A.5 Hangul jamo ............................. A-6 A.6 Hiragana ................................ A-7 A.7 Thai .................................. A-8 A.8 Canadian Aboriginal syllabics .................... A-9 A.9 Han unification ............................ A-10 A.10 Egyptian hieroglyphs ......................... A-11 C.1 Approach 1 .............................. C-3 C.2 Approach 2 .............................. C-4 C.3 Approach 3 .............................. C-4 vii viii List of Tables 2.1 XOR truth table ........................... 8 2.2 Evolutionary methods ........................ 14 3.1 Writing systems ............................ 17 3.2 Evolution parameters ......................... 22 3.3 The gene ................................ 23 4.1 Learning: pair classification ..................... 26 4.2 Learning: pair classification probabilities .............. 26 C.1 Number of glyphs ........................... C-1 C.2 Comparison of combinations of relative set size .......... C-2 D.1 Glyph similarity ............................ D-2 D.2 Glyph similarity ............................ D-2 D.3 Pixel comparison using Richard’s curve (128) ........... D-3 D.4 Pixel comparison using Richard’s curve (200) ........... D-3 D.5 Pixel comparison using Richard’s curve (250) ........... D-3 D.6 Pixel comparison using difference ≤ 40 ............... D-4 D.7 Pixel comparison using difference ≤ 20 ............... D-4 D.8 Pixel comparison using difference ≤ 10 ............... D-4 D.9 Pixel comparison using difference ≤ 5 ................ D-5 ix x Abbreviations and Acronyms ANN Artificial neural networks BP Backpropagation CJK Chinese, Japanese, Korean EA Evolutionary algorithm EEC Evolving efficient connections EP Evolutionary programming ES Evolutionary strategies GA Genetic algorithm GP Genetic programming TSS Total sum squared error XOR Exclusive or xi xii CHAPTER 1 Introduction 33 000 years ago, the first Homo sapiens discovered that they could draw rec- ognizable pictures on the wall of the Chauvet cave, in southern France (Dehaene, 2009). There were drawings of animals, as well as nonfigurative shapes such as dots, lines and curves. Pictography appeared in ancient Sumeria and Egypt, creating easy-to-read writings. The art of writing mutated as it spread throughout the cultures. Even though pictograms are relatively easy to read, they are — due to their many details — very slow to write. They were thus simplified in different ways, and the symbols were tied to different sounds. For example, the Chinese character ⾺ is a simplification of a horse, and the letter A was originally an ox (Dehaene, 2009, Figure 4.3). This is visible in the Greek letter α (alpha), as aleph means just that. Similarly, m symbolizes waves (mem), k symbolizes a hand with outstretched fingers (kaf), and R symbolizes a head (res)(Dehaene, 2009). Even though both ⾺ and A originally came from large, four-legged animals, the resulting glyphs are very different. For each step of simplification, some features were extracted for use in the simplified glyph. Different civilizations focused on different features. The simplification was also depending on the writing tools available; pencils and paper enable glyphs of many different shapes, while glyphs carved in stone ought to consist of straight lines. Today, 33 000 years after the first drawings in a cave, we have numerous writing systems, in different stages of evolution. The human brain seems to be rather good at separating between these different writing systems, depending on their relative placement on the family tree. It might be difficult to separate between the Latin, Greek and Cyrillic alphabets, if one is familiar with neither. However, it is trivial to separate these from Egyptian hieroglyphs. After having learned that m, l and w are hieroglyphs, it is easy to see that R also is of this writing system. Even though the glyphs within one writing system differ from each other, they are also very similar. One tend to reuse the same basic shapes for different glyphs within the writing system (Dehaene, 2009, Figure 4.1), thus creating a subconscious expectation on how glyphs from that writing system shall look. When humans separate between writing systems like that, it is
Recommended publications
  • Alif and Hamza Alif) Is One of the Simplest Letters of the Alphabet
    ’alif and hamza alif) is one of the simplest letters of the alphabet. Its isolated form is simply a vertical’) ﺍ stroke, written from top to bottom. In its final position it is written as the same vertical stroke, but joined at the base to the preceding letter. Because of this connecting line – and this is very important – it is written from bottom to top instead of top to bottom. Practise these to get the feel of the direction of the stroke. The letter 'alif is one of a number of non-connecting letters. This means that it is never connected to the letter that comes after it. Non-connecting letters therefore have no initial or medial forms. They can appear in only two ways: isolated or final, meaning connected to the preceding letter. Reminder about pronunciation The letter 'alif represents the long vowel aa. Usually this vowel sounds like a lengthened version of the a in pat. In some positions, however (we will explain this later), it sounds more like the a in father. One of the most important functions of 'alif is not as an independent sound but as the You can look back at what we said about .(ﺀ) carrier, or a ‘bearer’, of another letter: hamza hamza. Later we will discuss hamza in more detail. Here we will go through one of the most common uses of hamza: its combination with 'alif at the beginning or a word. One of the rules of the Arabic language is that no word can begin with a vowel. Many Arabic words may sound to the beginner as though they start with a vowel, but in fact they begin with a glottal stop: that little catch in the voice that is represented by hamza.
    [Show full text]
  • Yiddish Diction in Singing
    UNLV Theses, Dissertations, Professional Papers, and Capstones May 2016 Yiddish Diction in Singing Carrie Suzanne Schuster-Wachsberger University of Nevada, Las Vegas Follow this and additional works at: https://digitalscholarship.unlv.edu/thesesdissertations Part of the Language Description and Documentation Commons, Music Commons, Other Languages, Societies, and Cultures Commons, and the Theatre and Performance Studies Commons Repository Citation Schuster-Wachsberger, Carrie Suzanne, "Yiddish Diction in Singing" (2016). UNLV Theses, Dissertations, Professional Papers, and Capstones. 2733. http://dx.doi.org/10.34917/9112178 This Dissertation is protected by copyright and/or related rights. It has been brought to you by Digital Scholarship@UNLV with permission from the rights-holder(s). You are free to use this Dissertation in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s) directly, unless additional rights are indicated by a Creative Commons license in the record and/or on the work itself. This Dissertation has been accepted for inclusion in UNLV Theses, Dissertations, Professional Papers, and Capstones by an authorized administrator of Digital Scholarship@UNLV. For more information, please contact [email protected]. YIDDISH DICTION IN SINGING By Carrie Schuster-Wachsberger Bachelor of Music in Vocal Performance Syracuse University 2010 Master of Music in Vocal Performance Western Michigan University 2012
    [Show full text]
  • The Ogham-Runes and El-Mushajjar
    c L ite atu e Vo l x a t n t r n o . o R So . u P R e i t ed m he T a s . 1 1 87 " p r f ro y f r r , , r , THE OGHAM - RUNES AND EL - MUSHAJJAR A D STU Y . BY RICH A R D B URTO N F . , e ad J an uar 22 (R y , PART I . The O ham-Run es g . e n u IN tr ating this first portio of my s bj ect, the - I of i Ogham Runes , have made free use the mater als r John collected by Dr . Cha les Graves , Prof. Rhys , and other students, ending it with my own work in the Orkney Islands . i The Ogham character, the fair wr ting of ' Babel - loth ancient Irish literature , is called the , ’ Bethluis Bethlm snion e or , from its initial lett rs, like “ ” Gree co- oe Al hab e t a an d the Ph nician p , the Arabo “ ” Ab ad fl d H ebrew j . It may brie y be describe as f b ormed y straight or curved strokes , of various lengths , disposed either perpendicularly or obliquely to an angle of the substa nce upon which the letters n . were i cised , punched, or rubbed In monuments supposed to be more modern , the letters were traced , b T - N E E - A HE OGHAM RU S AND L M USH JJ A R . n not on the edge , but upon the face of the recipie t f n l o t sur ace ; the latter was origi al y wo d , s aves and tablets ; then stone, rude or worked ; and , lastly, metal , Th .
    [Show full text]
  • Neural Substrates of Hanja (Logogram) and Hangul (Phonogram) Character Readings by Functional Magnetic Resonance Imaging
    ORIGINAL ARTICLE Neuroscience http://dx.doi.org/10.3346/jkms.2014.29.10.1416 • J Korean Med Sci 2014; 29: 1416-1424 Neural Substrates of Hanja (Logogram) and Hangul (Phonogram) Character Readings by Functional Magnetic Resonance Imaging Zang-Hee Cho,1 Nambeom Kim,1 The two basic scripts of the Korean writing system, Hanja (the logography of the traditional Sungbong Bae,2 Je-Geun Chi,1 Korean character) and Hangul (the more newer Korean alphabet), have been used together Chan-Woong Park,1 Seiji Ogawa,1,3 since the 14th century. While Hanja character has its own morphemic base, Hangul being and Young-Bo Kim1 purely phonemic without morphemic base. These two, therefore, have substantially different outcomes as a language as well as different neural responses. Based on these 1Neuroscience Research Institute, Gachon University, Incheon, Korea; 2Department of linguistic differences between Hanja and Hangul, we have launched two studies; first was Psychology, Yeungnam University, Kyongsan, Korea; to find differences in cortical activation when it is stimulated by Hanja and Hangul reading 3Kansei Fukushi Research Institute, Tohoku Fukushi to support the much discussed dual-route hypothesis of logographic and phonological University, Sendai, Japan routes in the brain by fMRI (Experiment 1). The second objective was to evaluate how Received: 14 February 2014 Hanja and Hangul affect comprehension, therefore, recognition memory, specifically the Accepted: 5 July 2014 effects of semantic transparency and morphemic clarity on memory consolidation and then related cortical activations, using functional magnetic resonance imaging (fMRI) Address for Correspondence: (Experiment 2). The first fMRI experiment indicated relatively large areas of the brain are Young-Bo Kim, MD Department of Neuroscience and Neurosurgery, Gachon activated by Hanja reading compared to Hangul reading.
    [Show full text]
  • Assessment of Options for Handling Full Unicode Character Encodings in MARC21 a Study for the Library of Congress
    1 Assessment of Options for Handling Full Unicode Character Encodings in MARC21 A Study for the Library of Congress Part 1: New Scripts Jack Cain Senior Consultant Trylus Computing, Toronto 1 Purpose This assessment intends to study the issues and make recommendations on the possible expansion of the character set repertoire for bibliographic records in MARC21 format. 1.1 “Encoding Scheme” vs. “Repertoire” An encoding scheme contains codes by which characters are represented in computer memory. These codes are organized according to a certain methodology called an encoding scheme. The list of all characters so encoded is referred to as the “repertoire” of characters in the given encoding schemes. For example, ASCII is one encoding scheme, perhaps the one best known to the average non-technical person in North America. “A”, “B”, & “C” are three characters in the repertoire of this encoding scheme. These three characters are assigned encodings 41, 42 & 43 in ASCII (expressed here in hexadecimal). 1.2 MARC8 "MARC8" is the term commonly used to refer both to the encoding scheme and its repertoire as used in MARC records up to 1998. The ‘8’ refers to the fact that, unlike Unicode which is a multi-byte per character code set, the MARC8 encoding scheme is principally made up of multiple one byte tables in which each character is encoded using a single 8 bit byte. (It also includes the EACC set which actually uses fixed length 3 bytes per character.) (For details on MARC8 and its specifications see: http://www.loc.gov/marc/.) MARC8 was introduced around 1968 and was initially limited to essentially Latin script only.
    [Show full text]
  • Bibliography
    Bibliography Many books were read and researched in the compilation of Binford, L. R, 1983, Working at Archaeology. Academic Press, The Encyclopedic Dictionary of Archaeology: New York. Binford, L. R, and Binford, S. R (eds.), 1968, New Perspectives in American Museum of Natural History, 1993, The First Humans. Archaeology. Aldine, Chicago. HarperSanFrancisco, San Francisco. Braidwood, R 1.,1960, Archaeologists and What They Do. Franklin American Museum of Natural History, 1993, People of the Stone Watts, New York. Age. HarperSanFrancisco, San Francisco. Branigan, Keith (ed.), 1982, The Atlas ofArchaeology. St. Martin's, American Museum of Natural History, 1994, New World and Pacific New York. Civilizations. HarperSanFrancisco, San Francisco. Bray, w., and Tump, D., 1972, Penguin Dictionary ofArchaeology. American Museum of Natural History, 1994, Old World Civiliza­ Penguin, New York. tions. HarperSanFrancisco, San Francisco. Brennan, L., 1973, Beginner's Guide to Archaeology. Stackpole Ashmore, w., and Sharer, R. J., 1988, Discovering Our Past: A Brief Books, Harrisburg, PA. Introduction to Archaeology. Mayfield, Mountain View, CA. Broderick, M., and Morton, A. A., 1924, A Concise Dictionary of Atkinson, R J. C., 1985, Field Archaeology, 2d ed. Hyperion, New Egyptian Archaeology. Ares Publishers, Chicago. York. Brothwell, D., 1963, Digging Up Bones: The Excavation, Treatment Bacon, E. (ed.), 1976, The Great Archaeologists. Bobbs-Merrill, and Study ofHuman Skeletal Remains. British Museum, London. New York. Brothwell, D., and Higgs, E. (eds.), 1969, Science in Archaeology, Bahn, P., 1993, Collins Dictionary of Archaeology. ABC-CLIO, 2d ed. Thames and Hudson, London. Santa Barbara, CA. Budge, E. A. Wallis, 1929, The Rosetta Stone. Dover, New York. Bahn, P.
    [Show full text]
  • Recognition of Online Handwritten Gurmukhi Strokes Using Support Vector Machine a Thesis
    Recognition of Online Handwritten Gurmukhi Strokes using Support Vector Machine A Thesis Submitted in partial fulfillment of the requirements for the award of the degree of Master of Technology Submitted by Rahul Agrawal (Roll No. 601003022) Under the supervision of Dr. R. K. Sharma Professor School of Mathematics and Computer Applications Thapar University Patiala School of Mathematics and Computer Applications Thapar University Patiala – 147004 (Punjab), INDIA June 2012 (i) ABSTRACT Pen-based interfaces are becoming more and more popular and play an important role in human-computer interaction. This popularity of such interfaces has created interest of lot of researchers in online handwriting recognition. Online handwriting recognition contains both temporal stroke information and spatial shape information. Online handwriting recognition systems are expected to exhibit better performance than offline handwriting recognition systems. Our research work presented in this thesis is to recognize strokes written in Gurmukhi script using Support Vector Machine (SVM). The system developed here is a writer independent system. First chapter of this thesis report consist of a brief introduction to handwriting recognition system and some basic differences between offline and online handwriting systems. It also includes various issues that one can face during development during online handwriting recognition systems. A brief introduction about Gurmukhi script has also been given in this chapter In the last section detailed literature survey starting from the 1979 has also been given. Second chapter gives detailed information about stroke capturing, preprocessing of stroke and feature extraction. These phases are considered to be backbone of any online handwriting recognition system. Recognition techniques that have been used in this study are discussed in chapter three.
    [Show full text]
  • The Ancient Egyptian Hieroglyphic Language Was Created by Sumerian Turks
    Advances in Anthropology, 2017, 7, 197-250 http://www.scirp.org/journal/aa ISSN Online: 2163-9361 ISSN Print: 2163-9353 The Ancient Egyptian Hieroglyphic Language Was Created by Sumerian Turks Metin Gündüz Retired Physician, Diplomate ABEM (American Board of Emergency Medicine), Izmir, Turkey How to cite this paper: Gündüz, M. (2017). Abstract The Ancient Egyptian Hieroglyphic Lan- guage Was Created by Sumerian Turks. The “phonetic sound value of each and every hieroglyphic picture’s expressed Advances in Anthropology, 7, 197-250. and intended meaning as a verb or as a noun by the creators of the ancient https://doi.org/10.4236/aa.2017.74013 Egyptian hieroglyphic picture symbols, ‘exactly matches’, the same meaning” Received: August 2, 2017 as well as the “first letter of the corresponding meaning of the Turkish word’s Accepted: October 13, 2017 first syllable” with the currently spoken dialects of the Turkish language. The Published: October 16, 2017 exact intended sound value as well as the meaning of hieroglyphic pictures as Copyright © 2017 by author and nouns or verbs is individually and clearly expressed in the original hierog- Scientific Research Publishing Inc. lyphic pictures. This includes the 30 hieroglyphic pictures of well-known con- This work is licensed under the Creative sonants as well as some vowel sounds-of the language of the ancient Egyp- Commons Attribution International tians. There is no exception to this rule. Statistical and probabilistic certainty License (CC BY 4.0). http://creativecommons.org/licenses/by/4.0/ beyond any reasonable doubt proves the Turkish language connection. The Open Access hieroglyphic pictures match with 2 variables (the phonetic value and intended meaning).
    [Show full text]
  • Egyptian Hieroglyphs CEEG 0909 a Workbook for an Introduction to Egyptian Hieroglyphs
    An Introduction to Egyptian Hieroglyphs CEEG 0909 A Workbook for An Introduction to Egyptian Hieroglyphs C. Casey Wilbour Hall 301 christian [email protected] April 9, 2018 Contents Syllabus 2 Day 1 11 1-I-1 Rosetta Stone ................................................. 11 1-I-2 Calligraphy Practice 1 { Uniliterals ..................................... 13 1-I-4 Meet Your Classmates ............................................ 16 1-I-5 The Begatitudes ............................................... 17 1-I-6 Vocabulary { Uniliterals & Classifiers ................................... 19 Day 2 25 2-I-1 Timeline of Egyptian Languages ...................................... 25 2-I-2 Calligraphy Practice 2 { Biliterals ..................................... 27 2-I-3 Biliteral Chart ................................................ 32 2-I-4 Vocabulary { Biliterals & Classifiers .................................... 33 2-II-3 Vocabulary { Household Objects ...................................... 37 Day 3 39 3-I-1 Calligraphy Practice 3 { Multiliterals & Common Classifiers ....................... 39 3-I-2 Vocabulary { Multiliterals & Classifiers .................................. 43 3-I-3 Vocabulary { Suffix Pronouns, Parts of the Body ............................. 45 3-I-4 Parts of the Body .............................................. 47 3-II-3 Homework { Suffix Pronouns & Parts of the Body ............................ 48 Day 4 49 4-I-1 Vocabulary { Articles, Independent Pronouns, Family, Deities ...................... 49 4-I-4 Gods and Goddesses ............................................
    [Show full text]
  • Arabic Alphabet - Wikipedia, the Free Encyclopedia Arabic Alphabet from Wikipedia, the Free Encyclopedia
    2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia Arabic alphabet From Wikipedia, the free encyclopedia َأﺑْ َﺠ ِﺪﯾﱠﺔ َﻋ َﺮﺑِﯿﱠﺔ :The Arabic alphabet (Arabic ’abjadiyyah ‘arabiyyah) or Arabic abjad is Arabic abjad the Arabic script as it is codified for writing the Arabic language. It is written from right to left, in a cursive style, and includes 28 letters. Because letters usually[1] stand for consonants, it is classified as an abjad. Type Abjad Languages Arabic Time 400 to the present period Parent Proto-Sinaitic systems Phoenician Aramaic Syriac Nabataean Arabic abjad Child N'Ko alphabet systems ISO 15924 Arab, 160 Direction Right-to-left Unicode Arabic alias Unicode U+0600 to U+06FF range (http://www.unicode.org/charts/PDF/U0600.pdf) U+0750 to U+077F (http://www.unicode.org/charts/PDF/U0750.pdf) U+08A0 to U+08FF (http://www.unicode.org/charts/PDF/U08A0.pdf) U+FB50 to U+FDFF (http://www.unicode.org/charts/PDF/UFB50.pdf) U+FE70 to U+FEFF (http://www.unicode.org/charts/PDF/UFE70.pdf) U+1EE00 to U+1EEFF (http://www.unicode.org/charts/PDF/U1EE00.pdf) Note: This page may contain IPA phonetic symbols. Arabic alphabet ا ب ت ث ج ح خ د ذ ر ز س ش ص ض ط ظ ع en.wikipedia.org/wiki/Arabic_alphabet 1/20 2/14/13 Arabic alphabet - Wikipedia, the free encyclopedia غ ف ق ك ل م ن ه و ي History · Transliteration ء Diacritics · Hamza Numerals · Numeration V · T · E (//en.wikipedia.org/w/index.php?title=Template:Arabic_alphabet&action=edit) Contents 1 Consonants 1.1 Alphabetical order 1.2 Letter forms 1.2.1 Table of basic letters 1.2.2 Further notes
    [Show full text]
  • Writing Systems Reading and Spelling
    Writing systems Reading and spelling Writing systems LING 200: Introduction to the Study of Language Hadas Kotek February 2016 Hadas Kotek Writing systems Writing systems Reading and spelling Outline 1 Writing systems 2 Reading and spelling Spelling How we read Slides credit: David Pesetsky, Richard Sproat, Janice Fon Hadas Kotek Writing systems Writing systems Reading and spelling Writing systems What is writing? Writing is not language, but merely a way of recording language by visible marks. –Leonard Bloomfield, Language (1933) Hadas Kotek Writing systems Writing systems Reading and spelling Writing systems Writing and speech Until the 1800s, writing, not spoken language, was what linguists studied. Speech was often ignored. However, writing is secondary to spoken language in at least 3 ways: Children naturally acquire language without being taught, independently of intelligence or education levels. µ Many people struggle to learn to read. All human groups ever encountered possess spoken language. All are equal; no language is more “sophisticated” or “expressive” than others. µ Many languages have no written form. Humans have probably been speaking for as long as there have been anatomically modern Homo Sapiens in the world. µ Writing is a much younger phenomenon. Hadas Kotek Writing systems Writing systems Reading and spelling Writing systems (Possibly) Independent Inventions of Writing Sumeria: ca. 3,200 BC Egypt: ca. 3,200 BC Indus Valley: ca. 2,500 BC China: ca. 1,500 BC Central America: ca. 250 BC (Olmecs, Mayans, Zapotecs) Hadas Kotek Writing systems Writing systems Reading and spelling Writing systems Writing and pictures Let’s define the distinction between pictures and true writing.
    [Show full text]
  • + Natali A, Professor of Cartqraphy, the Hebreu Uhiversity of -Msalem, Israel DICTIONARY of Toponymfc TERLMINO~OGY Wtaibynafiail~
    United Nations Group of E%perts OR Working Paper 4eographicalNames No. 61 Eighteenth Session Geneva, u-23 August1996 Item7 of the E%ovisfonal Agenda REPORTSOF THE WORKINGGROUPS + Natali a, Professor of Cartqraphy, The Hebreu UhiVersity of -msalem, Israel DICTIONARY OF TOPONYMfC TERLMINO~OGY WtaIbyNafiaIl~- . PART I:RaLsx vbim 3.0 upi8elfuiyl9!J6 . 001 . 002 003 004 oo!l 006 007 . ooa 009 010 . ol3 014 015 sequala~esfocJphabedcsaipt. 016 putting into dphabetic order. see dso Kqucna ruIt!% Qphabctk 017 Rtlpreat8Ii00, e.g. ia 8 computer, wflich employs ooc only numm ds but also fetters. Ia a wider sense. aIso anploying punauatiocl tnarksmd-SymboIs. 018 Persod name. Esamples: Alfredi ‘Ali. 019 022 023 biliaw 024 02s seecIass.f- 026 GrqbicsymboIusedurunitiawrIdu~morespedficaty,r ppbic symbol in 1 non-dphabedc writiog ryste.n& Exmlptes: Chinese ct, , thong; Ambaric u , ha: Japaoese Hiragana Q) , no. 027 -.modiGed Wnprehauive term for cheater. simplified aad character, varIaoL 031 CbmJnyol 032 CISS, featm? 033 cQdedrepfwltatiul 034 035 036 037' 038 039 040 041 042 047 caavasion alphabet 048 ConMQo table* 049 0nevahte0frpointinlhisgr8ti~ . -.- w%idofplaaecoordiaarurnm;aingoftwosetsofsnpight~ -* rtcight8ngfIertoeachotkrodwithap8ltKliuofl8qthonbo&. rupenmposedonr(chieflytopogtaphtc)map.see8lsouTM gz 051 see axxdimtes. rectangufar. 052 A stahle form of speech, deriyed from a pbfgin, which has became the sole a ptincipal language of 8 qxech comtnunity. Example: Haitian awle (derived from Fresh). ‘053 adllRaIfeatlue see feature, allhlral. 054 055 * 056 057 Ac&uioaofsoftwamrcqkdfocusingrdgRaIdatabmem rstoauMe~osctlto~thisdatabase. 058 ckalog of defItitioas of lbe contmuofadigitaldatabase.~ud- hlg data element cefw labels. f0mw.s. internal refm codMndtextemty,~well~their-p,. 059 see&tadichlq. 060 DeMptioa of 8 basic unit of -Lkatifiile md defiile informatioa tooccqyrspecEcdataf!eldinrcomputernxaxtLExampk Pateofmtifii~ofluwtby~namaturhority’.
    [Show full text]