O'davoren's Glossary
Total Page:16
File Type:pdf, Size:1020Kb
1 The Application of XML to the historical lexicography of Old, Middle, and Early-Modern Irish: a Lexicon based analysis Julianne Nyhan BA ¡ NATIONAL UNIVERSITY OF IRELAND, CORK FACULTY OF ARTS DEPARTMENT OF HISTORY Degree of PhD May 2006 Head of Department: Prof Dermot Keogh Supervisor: Prof Donnchadh O´ Corrain´ Table of Contents Table of Contents Acknowledgements . viii Abstract . xi 1 Introduction: an overview of the history of lexicography 1 1.1 Abstract . 1 1.2 Proto-Dictionaries and Greek innovators . 2 1.3 Development of lexicography in Greece, Rome and Byzantium from De verborum significatu to the Suda ............... 4 1.4 Ordering systems . 12 1.4.1 Thematic origins . 12 1.4.1.1 Pliny the Elder’s Historia naturalis ......... 14 1.5 Evidence of Alphabetisation . 15 1.5.1 Alphabetic origins . 15 1.5.2 Alphabetisation in Old-English Glossaries . 18 1.5.3 Alphabetisation in Medieval-Latin Dictionaries . 19 1.6 Introduction . 22 1.7 Old-Irish Glossaries . 22 1.7.1 Metrical glossaries . 23 1.7.2 Text-dependent glossaries . 23 1.7.3 Medical glossaries . 23 1.8 Scholarship on the compilation of glossaries . 25 1.9 List of manuscripts containing major medieval Irish-language glossaries . 26 1.10 O’Davoren’s Glossary . 28 1.11 Lecan Glossary . 30 i Table of Contents 1.12 Duil´ Laithne . 32 1.13 Sanas Cormaic . 36 1.14 O’Mulconry’s Glossary . 39 1.15 O’Clery’s Glossary . 42 1.16 Alphabetic and thematic classification in Irish glossaries . 45 1.17 Medieval glossaries in DIL ....................... 46 1.18 List of some printed Irish language dictionaries before 1913 . 48 2 The history and limitations of the Dictionary of the Irish Language 53 2.1 Abstract . 53 2.2 Description of the Dictionary of the Irish Language .......... 53 2.2.1 Dialectical information supplied in the Dictionary of the Irish Language ........................... 55 2.3 Early history of the Dictionary of the Irish Language ......... 56 2.3.1 The dictionary in 1910 . 60 2.3.2 The appointment of Marstrander . 61 2.3.3 Marstrander’s D ......................... 63 2.3.4 The correspondence between Meyer and Best . 64 2.3.5 Controversy . 67 2.3.6 Comparison of Marstrander’s D with Meyer’s Contributions 72 2.3.7 History of DIL from 1913 to 1975 . 75 2.3.8 Editors and assistants of the Dictionary of the Irish Language 77 2.4 Limitations of the Dictionary of the Irish Language .......... 80 2.5 Navigating DIL: inadequate cross-referencing . 80 2.5.1 air- ................................. 80 2.5.1.1 Analysis of aira´ıbind to airbaid ............ 81 2.5.2 ar-fo- ................................ 83 ii Table of Contents 2.5.2.1 Analysis of ar-f´ocair and ar-fogni .......... 84 2.5.3 Internal orthography . 84 2.5.4 Lemmata of complex verbs . 85 2.6 Inconsistent presentation and treatment of information in DIL .. 88 2.6.1 Abbreviations . 88 2.6.2 Structure . 88 2.6.3 Words treated of in incorrect lexical entries . 89 2.6.4 Subjective information labels in DIL: the terms earlier and later ................................. 90 2.7 Categorisation of limitations . 91 3 Mark-up and XML: preliminary theoretical and historical considerations 93 3.1 Abstract . 93 3.2 Views of electronic text . 94 3.3 What is mark-up? . 98 3.4 Types of mark-up that can be applied to electronic documents . 105 3.4.1 Mark-up applied to electronic texts . 105 3.4.2 Presentational and Procedural mark-up . 105 3.4.3 Descriptive mark-up . 106 3.5 Theories of textuality . 107 3.6 The History of XML . 110 3.7 What is XML? . 113 3.8 The Text Encoding Initiative . 115 4 XML: the foundation of the electronic Lexicon 119 4.1 Abstract . 119 iii Table of Contents 4.2 Examples and description of ten different types of encoded lexical entry . 120 4.2.1 Brief description of ecal ..................... 120 4.2.2 Brief description of didiu .................... 122 4.2.3 Brief description of cl´ar ..................... 123 4.2.4 Brief description of mo ..................... 124 4.2.5 Brief description of m´e ...................... 124 4.2.6 Brief description of pr´ım .................... 125 4.2.7 Brief description of Gall ..................... 125 4.2.8 Brief description of cen ..................... 126 4.2.9 Brief description of the definite article . 126 4.2.10 Brief description of t´eit ..................... 126 4.2.11 Description of elements and attributes . 127 4.3 The Document Type Definition . 134 4.3.1 The Document Type Declaration . 135 4.4 Commentary on and explanation of lex.dtd . 137 4.4.0.1 Element declaration . 137 4.4.0.2 Content model . 138 4.4.0.3 Declaring an attribute . 139 4.4.0.4 Internal parameter entity . 141 4.5 Character references . 143 4.6 Interrelationship of elements and attributes in lex.dtd . 145 4.7 XSLT examples . 150 4.8 Examples of lexical entries . 156 5 Advantages and disadvantages of XML 240 5.1 Abstract . 240 iv Table of Contents 5.2 Advantages and disadvantages . 241 5.2.1 The problem of date and context . 241 5.2.1.1 Inadequate lexicographical criteria . 241 5.2.1.2 Limitation of OHCO theory of textuality . 243 5.2.1.3 Multilex root element . 246 5.3 Advantages . 248 5.3.1 The Lexicon as a stand-alone research tool . 248 5.3.2 Lexicon and Text interlinked . 252 5.3.3 Lexicon, Dictionary, and Text interlinked . 257 5.4 Partial automation of encoding of citations . 260 5.4.1 Patterns of citation in DIL ................... 262 5.4.2 Limitations of early regular expressions . 263 5.5 Predictable and unpredictable data patterns . 265 5.5.1 Predictable exceptions type A . 265 5.5.2 Predictable exceptions type AA . 265 5.5.3 Predictable exceptions type B . 266 5.5.4 Predictable exceptions type AB . 267 5.5.5 Mark-up that cannot feasibly be automated . 267 5.5.6 Variations within types of citation . 268 5.6 Methodology . 269 5.6.1 Routine 1.0 . 269 5.6.1.1 Routine 1.1 . 271 5.6.1.2 Routine 1.2 . 273 5.6.1.3 Routine 1.3 . 274 5.6.1.4 Routine 1.4 . 276 5.6.2 Routine 2 . 276 v Table of Contents 5.6.3 Routine 3 and sed’s pattern space . 278 5.6.4 Routine 4 . 279 5.6.5 Accuracy rates . 280 6 Conclusion 281 7 Bibliography 288 7.1 Bibliographies consulted . 288 7.2 Guidelines, Technical Reports and Working Group notes . 289 7.3 Manuscripts . 290 7.4 Dictionaries & printed lexicographical sources . 291 7.5 Secondary works . 295 vi List of Figures List of Figures 4.1 Processing instruction and document type declaration . 137 4.2 List of symbols and terms used in the DTD . 138 4.3 Element declaration . 138 4.4 Model group . 139 4.5 Attribute declaration . 140 4.6 Internal parameter entity . 142 4.7 Invoking an internal parameter entity . 142 4.8 Charater references . 143 4.10 Output text: TEI conformant mark-up . 155 vii Acknowledgements Acknowledgements I am indebted to my supervisor, Professor Donnchadh O´ Corrain´ for his generosity with both his time and expertise. His expert knowledge of Irish History and Old, Middle, and Early Modern Irish was invaluable to me throughout the course of my postgraduate studies. Further, the vision he showed in setting up the Corpus of Electronic texts, one of the first scholarly encoded electronic text corpora in Ireland, gave me (and many others) the opportunity to take my first steps in SGML and XML and to develop my research in a broad, applied context. I am also indebted to Peter Flynn, XML expert and manager of the Research and Academic Electronic Publishing Unit at University College Cork. Without Peter’s expert guidance it would not have been possible for me to devise the Lexicon or write this work. Not only did he mentor me in XML and XSLT, but also in sed, awk, xpath, regular expressions, the Linux operating system, the Emacs editor, and LATEX. The generosity of Professor O´ Corrain´ and Peter Flynn with their time and expertise was immense and I feel privileged to have worked with them both. I am also very grateful to Beatrix Farber,¨ manager of the CELT corpus, for the practical support and encouragement she gave me over the past five years. I also wish to thank Dr Gregory Toner and Dr Maxim Fomin in the University of Ulster, Coleraine. I worked with them on the Linking Dictionaries and Text project, a cross-border initiative between University College Cork and the University of Ulster, Coleraine, and also on the viii Acknowledgements electronic Dictionary of the Irish language project. The research I carried out for these projects in XML, XSLT, and Linux utilities helped me to develop a clearer understanding of the content and structure of the Dictionary of the Irish language. Special words of thanks also go to Professor Padraig´ O´ Machain´ and the Dublin Institute of Advanced Studies. In 2004 Professor O´ Machain´ organised a one day colloquium in the Institute, entitled ’Digital Text and Digital Image’. As well as giving me the opportunity to air some of my research findings, I appreciated the opportunity to meet scholars involved in many aspects of digitising Irish material. As a result of the colloquium I set up the Digiquest mailing list in 2005 and my thanks also goes to the members of that list..