Semantic Authoring for Blissymbols Augmented Communication Using Multilingual Text Generation
Total Page:16
File Type:pdf, Size:1020Kb
Semantic Authoring for Blissymbols Augmented Communication Using Multilingual Text Generation Thesis submitted in partial ful¯llment of the requirements for the degree of DOCTOR OF PHILOSOPHY by Yael Netzer Submitted to the Senate of Ben-Gurion University of the Negev November 2006 Beer-Sheva Semantic Authoring for Blissymbols Augmented Communication Using Multilingual Text Generation Thesis submitted in partial ful¯llment of the requirements for the degree of DOCTOR OF PHILOSOPHY by Yael Netzer Submitted to the Senate of Ben-Gurion University of the Negev Approved by the advisor Approved by the Dean of the Kreitman School of Advanced Graduate Studies November 2006 Beer-Sheva This work was carried out under the supervision of Dr. Michael Elhadad In the Department of Computer Science Faculty: Natural Sciences Acknowledgment During the course of life, we meet people who become signi¯cant to us and they change life in a meaningful way. I feel lucky that I met my advisor, Michael Elhadad, from whom I learned about Natural Language Processing and Natural Language Generation in particular. I thank Dr. Elhadad for his cleverness and kindness. Michael agreed to enter the AAC research ¯eld with me and he cooperated with my excitement about it. I admire his ability to translate thoughts into solvable problems, his patience and most of all his belief in me, that kept me working. I thank Yoav Goldberg for the implementation of the Bliss lexicon - no one would have done it better, and Ofer Biller for the development of SAUT. Meetings of the NLP group in Ben-Gurion University were always a joy, especially the discussions on music afterwards with Meni Adler and Oren Hazai. The Department of Computer Science in Ben-Gurion University in Beer-Sheva hosted me for the last 15 years (for all of my studies), so it was one of the most stable things in my life - I especially thank Prof. Abraham Melkman and Prof. Klara Kedem for their sincere concern for me, Dr. Mayer Goldberg for answering my Lisp queries, dear Dr. Tzachi Rosen for the useful discussions and his true friendship, Ami Berler for the co®ee breaks, and Valerie Glass for being my friend and assisting me with the formalities of the University. The lab people were always helpful. I thank Prof. Nomi Shir for teaching me linguistics and her loving Attitude, and Dr. Judy Wine for introducing me to the AAC world in her course in Shaare Zedek. The remarkable personality of my late grandmother, Dr. Puah Menczel, and the devotion of my mother Dvorah and my Sister Ruti to the society were the initial motivation for my drifting into the AAC ¯eld and I'm grateful for that. I thank my beloved sons Guy, Eitan, and Daniel for being such inspiring language users, and especially Daniel who taught me not to take the acquisition and usage of language for granted. My sisters Chana and Ruti, my brother Yosef, and especially my parents Dvorah and Ehud were always available for me with love and support and I am grateful. iii This work is dedicated with love to my parents Ehud and Dvorah iv Contents Abstract ix List of Figures xiii List of Tables xiv List of Abbreviations xv 1 Introduction 1 1.1 Background . 1 1.2 Motivation . 3 1.3 Objectives . 4 1.4 Contributions . 5 2 Background 7 2.1 The need for communication - AAC . 7 2.1.1 What is Augmentative and Alternative Communication? . 8 2.1.2 Who Needs AAC { Disability Types . 10 2.1.3 A Brief History of AAC . 11 2.1.4 AAC Techniques . 13 2.2 Speeding up Communication . 18 2.2.1 Natural Language Processing and AAC . 20 2.2.2 Language Techniques for Assistive Systems . 21 2.3 Summary . 30 v 3 Objectives 32 3.1 Generation from Telegraphic Input . 33 3.2 Generation as Semantic Authoring . 37 4 Usage Scenario 39 4.1 Maintaining a View of Context . 41 4.2 Argument Structure Speci¯cation . 41 4.3 Referring Expressions . 42 4.4 Lexical Choice and Syntactic Realization . 43 4.5 Summary . 45 5 System Architecture 46 5.1 Infrastructure Development . 46 5.2 Flow of Information . 48 5.2.1 Changing Displays Dynamically . 49 5.2.2 Lexical Choice and Syntactic Realization . 52 5.3 Summary . 54 6 Natural Language Generation and Syntactic Realization 56 6.1 Natural Language Generation . 57 6.1.1 The Architecture of an NLG System . 57 6.1.2 Multilingual Generation (MLG) . 59 6.1.3 AAC as an MLG Application . 60 6.2 The Syntactic Realizer . 61 6.2.1 Input for Surface realization module . 62 6.3 HUGG . 62 6.3.1 FUF/SURGE . 63 6.3.2 SURGE input of a clause . 64 6.3.3 Main Issues in Hebrew Generation . 66 6.3.4 Hebrew Clause . 67 6.3.5 Subjectless Clauses . 67 6.3.6 Existential, equative, possessive, and attributive clauses . 68 vi 6.3.7 Morphology . 70 6.4 Summary . 71 7 Lexical resources 72 7.1 Lexicons in NLG . 73 7.1.1 Levin's verb classes . 74 7.1.2 Online Resources . 75 7.1.3 Choice of Lexical Sources . 79 7.2 Bliss Lexicon . 80 7.2.1 Overview on Blissymbolics . 80 7.2.2 The Design of the Bliss Lexicon . 85 7.2.3 Bliss Lexicon Software Development . 87 7.3 Using Lexical Resources for the System Lexical Chooser . 88 7.4 Integrating a Large-scale Reusable Lexicon for NLG . 90 7.5 Summary . 93 8 Communication Boards 94 8.1 The SAUT Semantic Authoring Tool . 94 8.1.1 Conceptual Graphs . 95 8.1.2 Authoring Tools . 97 8.1.3 The SAUT Editor . 98 8.2 Bliss Communication Board . 102 8.3 Implementing a Communication Board . 102 8.4 The Processing Method - Adopting the SAUT Technique . 104 8.5 Summary . 106 9 Comparison with Existing NLG-AAC Systems 107 9.1 Blisstalk . 107 9.2 compansion ........................................108 9.3 Transforming Telegraphic Language to Greek . 111 9.4 pvi Intelligent Voice Prothesis ..............................113 9.5 cogeneration .......................................115 vii 9.6 Summary . 116 10 Evaluation 118 10.1 Evaluation of NLG systems . 119 10.2 Evaluation of AAC systems . 120 10.3 Evaluation our System . 123 10.4 Evaluating SAUT . 124 10.4.1 User Experiment . 124 10.4.2 Evaluation . 125 10.5 Evaluating E±ciency . 128 10.6 Summary . 129 11 Contributions and future work 131 11.1 Bliss symbols lexicon . 132 11.2 HUGG . 132 11.3 Integration of a large-scale, reusable lexicon with a natural language generator . 133 11.4 SAUT ............................................133 11.5 Communication Board . 134 11.6 Future Work . 134 Bibliography 137 viii Abstract This work presents a new approach to generating messages in an augmentative and alternative communication system, in the context of natural langauge generation. Background The ¯eld of Augmentative and Alternative Communication (AAC) is concerned with studying methods of communication that can be added to natural communication (speech and writing), es- pecially when an individual lacks some of the skills to achieve it. An AAC system is de¯ned as an \integrated group of components, including the symbols, aids, strategies, and techniques used by individuals to enhance communication." [ASHA, 1991]. In the absence of an oral ability, symbols of various types are presented on a display (or a communication board). Communication is conducted by the sequential selection of symbols on the display, until it can be interpreted and pronounced by the partner of the interaction. If technology is present, arti¯cial voice is used. Natural language generation (NLG) is a sub¯eld in Natural Language Processing (NLP). The term NLG refers to the process of generating utterances in a spoken language from another representation of data, based on linguistic resources. For all applications, the generated text can be in various lan- guages, leading to applications of multilingual generation (MLG). Multilingual generation (MLG) aims to generate text in several languages from one source of information, without using translation. Objectives This work presents a novel way to generate full sentences from a sequence of symbols, using NLG ix techniques and the notion of dynamic displays [Porter, 2000]. In this work, we investigate ways to exploit natural language generation (NLG) techniques for designing.