Design of an Electronic Method for Describing Writing Systems
Graduate Institute of Applied Linguistics
Thesis Approval Sheet

This thesis, entitled
DESIGN OF AN ELECTRONIC METHOD FOR DESCRIBING WRITING SYSTEMS
written by Eric Scott Albright and submitted in partial fulfillment of the requirements for the degree of Master of Arts in Applied Linguistics has been read and approved by the undersigned members of the faculty of the Graduate Institute of Applied Linguistics

Gary F. Simons (Mentor)
Stephen L. Walter
Robert B. Reed
May 11, 2001

DESIGN OF AN ELECTRONIC METHOD FOR DESCRIBING WRITING SYSTEMS
By Eric Scott Albright
Presented to the Faculty of the Graduate Institute of Applied Linguistics in partial fulfillment of the requirements for the degree of Master of Arts in Applied Linguistics
Graduate Institute of Applied Linguistics
June, 2001
Dallas, TX

Copyright © 2001 Eric Scott Albright
All Rights Reserved

THESIS DUPLICATION RELEASE
I hereby authorize the Graduate Institute of Applied Linguistics Library to duplicate this thesis when needed for research and/or scholarship.
Agreed: (student signature)
Refused: (student signature)

ABSTRACT
DESIGN OF AN ELECTRONIC METHOD FOR DESCRIBING WRITING SYSTEMS
Eric Scott Albright, M.A.
The Graduate Institute of Applied Linguistics, 2001
Supervising Professor: Gary F. Simons, Ph.D.

Although descriptions of writing systems abound, the literature conspicuously lacks a reference work that describes the information that should be present for a complete account of the workings of a writing system. As a result, the quality of writing system descriptions suffers, as key information is either lacking or inaccessible. Writing system descriptions must describe the characteristics of each unit of writing together with its relationships to other pertinent units of writing and linguistics. Formal descriptions allow both machines and users to share the same source of information.
This thesis seeks to determine the slots that should be filled by information obtained from the systematic analysis of writing systems. These slots are organized into a formal model of writing system descriptions that captures the semantics of writing systems.

DEDICATION
In memory of Kenneth L. Pike, who sat on the front porch of a teenager and convinced him that there is nothing in the world like the study of linguistics, especially when your wife is also a linguist.
To my linguist wife, who inspired, encouraged, and supported me during this study.

ACKNOWLEDGMENTS
This thesis would not be what it is today without the help of my mentor, Dr. Gary F. Simons. His guidance, suggestions, and critique have been invaluable. The other members of my committee, Dr. Stephen L. Walter and Dr. Robert B. Reed, provided valuable editorial comments, which have greatly improved the quality of the final product. Martin Hosken was also a great help from the beginning, providing a sounding board for my ideas and poring over drafts with numerous comments.
May 11, 2001

TABLE OF CONTENTS
1. Introduction
2. Review of the literature
2.1. Introduction
2.2. Theory of writing systems
2.2.1. Applying emic principles to writing systems
2.2.2. Informal contributions
2.2.3. Formal contributions
2.3. Descriptions of writing systems
2.3.1. Collections of writing system descriptions
2.3.2. Writing system descriptions for individual languages
2.4. Requirements for descriptions of writing systems
2.5. Computational models
2.6. Summary
3. The problem
3.1. Rationale for the study
3.2. Theoretical framework
3.3. Statement of the problem to be investigated
3.4. Elements to be investigated
3.5. Limitations of the study
3.6. Definition of terms
3.7. Summary
4. Background information
4.1. How a computer treats text
4.2. XML in a nutshell
5. Design requirements
5.1. General requirements
5.2. Specific requirements for writing systems
6. Elements of writing systems
6.1. Linguistic elements
6.2. Graphs
6.3. Graphemes
6.4. Writing system units
6.5. Classes
6.6. Computational units
6.6.1. Key codes
6.6.2. Coded units
6.6.3. Glyphs
7. Relationships between writing system elements
7.1. Potential relationships
7.2. Mappings
7.3. Relations
7.4. Collating sequence
8. From formalism to publication
8.1. Elements needed for publication
8.1.1. Language
8.1.2. Author
8.1.3. Prose sections
8.2. Rendering the publication
8.3. Implementation of example
9. Conclusions