Lexical Database Enrichment Through Semi-Automated Morphological Analysis

Total Page:16

File Type:pdf, Size:1020Kb

Lexical Database Enrichment Through Semi-Automated Morphological Analysis Some pages of this thesis may have been removed for copyright restrictions. If you have discovered material in Aston Research Explorer which is unlawful e.g. breaches copyright, (either yours or that of a third party) or any other law, including but not limited to those relating to patent, trademark, confidentiality, data protection, obscenity, defamation, libel, then please read our Takedown policy and contact the service immediately ([email protected]) LEXICAL DATABASE ENRICHMENT THROUGH SEMI-AUTOMATED MORPHOLOGICAL ANALYSIS Volume 1 THOMAS MARTIN RICHENS Doctor of Philosophy ASTON UNIVERSITY January 2011 This copy of the thesis has been supplied on condition that anyone who consults it is understood to recognise that its copyright rests with the author and that no quotation from the thesis and no information derived from it may be published without proper acknowledgement. 1 Summary Aston University Lexical Database Enrichment through Semi-Automated Morphological Analysis Thomas Martin Richens Doctor of Philosophy 2011 Derivational morphology proposes meaningful connections between words and is largely unrepresented in lexical databases. This thesis presents a project to enrich a lexical database with morphological links and to evaluate their contribution to disambiguation. A lexical database with sense distinctions was required. WordNet was chosen because of its free availability and widespread use. Its suitability was assessed through critical evaluation with respect to specifications and criticisms, using a transparent, extensible model. The identification of serious shortcomings suggested a portable enrichment methodology, applicable to alternative resources. Although 40% of the most frequent words are prepositions, they have been largely ignored by computational linguists, so addition of prepositions was also required. The preferred approach to morphological enrichment was to infer relations from phenomena discovered algorithmically. Both existing databases and existing algorithms can capture regular morphological relations, but cannot capture exceptions correctly; neither of them provide any semantic information. Some morphological analysis algorithms are subject to the fallacy that morphological analysis can be performed simply by segmentation. Morphological rules, grounded in observation and etymology, govern associations between and attachment of suffixes and contribute to defining the meaning of morphological relationships. Specifying character substitutions circumvents the segmentation fallacy. Morphological rules are prone to undergeneration, minimised through a variable lexical validity requirement, and overgeneration, minimised by rule reformulation and restricting monosyllabic output. Rules take into account the morphology of ancestor languages through co-occurrences of morphological patterns. Multiple rules applicable to an input suffix need their precedence established. The resistance of prefixations to segmentation has been addressed by identifying linking vowel exceptions and irregular prefixes. The automatic affix discovery algorithm applies heuristics to identify meaningful affixes and is combined with morphological rules into a hybrid model, fed only with empirical data, collected without supervision. Further algorithms apply the rules optimally to automatically pre-identified suffixes and break words into their component morphemes. To handle exceptions, stoplists were created in response to initial errors and fed back into the model through iterative development, leading to 100% precision, contestable only on lexicographic criteria. Stoplist length is minimised by special treatment of monosyllables and reformulation of rules. 96% of words and phrases are analysed. 2 218,802 directed derivational links have been encoded in the lexicon rather than the wordnet component of the model because the lexicon provides the optimal clustering of word senses. Both links and analyser are portable to an alternative lexicon. The evaluation uses the extended gloss overlaps disambiguation algorithm. The enriched model outperformed WordNet in terms of recall without loss of precision. Failure of all experiments to outperform disambiguation by frequency reflects on WordNet sense distinctions. Keywords: morphological rules; automatic affix discovery; derivational morphology; segmentation fallacy; derivational tree. Acknowledgments The research presented here was conducted under a full time EPSRC-funded research studentship. The project began under the supervision of Sylvia Wong, Lecturer in Computing Science and was concluded under the joint supervision of Ian Nabney, Professor of Computing Science and Ramesh Krishnamurthy, Lecturer in English. The author would also like to thank the following, all of whom have played a role in facilitating this research: • The late Sharen Lloyd • Steve Dalton • Chris Buckingham • Ken Litkowski • Christian Boitet • Mathieu Mangeot • Nazaire Mbame Tom Richens www.rockhouse.me.uk [email protected] 3 Contents VOLUME 1 Glossary 16 1 Introduction 23 1.1 Definitions 23 1.1.1 Wordnets 23 1.1.2 Derivational Morphology 24 1.1.3 Verb Frames 26 1.1.4 Parts of Speech, Participles and Gerunds 27 1.1.5 Qualia 27 1.2 Motivation 28 1.2.1 Fighting Arbitrariness 28 1.2.2 Derivational Morphology for Lexical Databases 29 1.2.3 Project Aims 31 1.2.4 Fulfilment of Project Aims 31 1.3 Experimental Platform 34 1.3.1 Object-Oriented Approaches to Modelling Wordnet Data 34 1.3.1.1 RDF 34 1.3.1.2 Python 35 1.3.2 The WordNet Model 36 1.3.2.1 Choice of Java 36 1.3.2.2 WordNet Relations 36 1.3.2.3 Sentence Frames 37 1.3.2.4 The Lexicon 37 1.3.2.5 The Lemmatiser 38 1.3.2.6 Applications of the Model and Related Publications 38 1.3.2.7 Subsequent Modifications 39 2 Investigation into WordNet 40 2.1 Word Senses 41 2.1.1 "I don't believe in word senses" 41 2.1.1.1 Metaphor 42 2.1.1.2 Translation Equivalents 44 2.1.1.3 Conclusions on Word Senses 47 2.1.2 Granularity 48 2.1.2.1 Implications of WordNet Granularity for Multilingual Wordnet Development 48 2.1.2.2 Investigation into WordNet Granularity 49 2.1.2.3 Clustering of Word Senses and Synsets 51 2.2 Taxonomy 52 4 2.2.1 Ontology 52 2.2.1.1 Shortcomings of WordNet-like Ontologies 52 2.2.1.2 Is a Correct Ontology Possible? 55 2.2.1.3 Compatibility of Existing Ontologies 56 2.2.1.4 Conclusions on Ontology 57 2.2.2 Investigation into the Verb Taxonomy 58 2.2.2.1 Introduction 58 2.2.2.2 Hypernyms and Troponyms 60 2.2.2.2.1 Algorithm for Identifying Topological Anomalies in Hierarchical Relations 60 2.2.2.2.2 Cycle 62 2.2.2.2.3 Rings 63 2.2.2.2.4 Dual Inheritance Without Rings 65 2.2.2.2.5 Isolators 65 2.2.2.2.6 Roots of the Verbal Taxonomy 67 2.2.2.3 Antonyms 69 2.2.2.3.1 Multiple Antonyms 70 2.2.2.3.2 Antonyms Without a Common Hypernym 71 2.2.2.4 Conclusion 72 2.3 Syntax 74 2.3.1 WordNet Sentence Frames 75 2.3.1.1 Synsets with More than 2 Framesets 75 2.3.1.2 Synsets with 2 Framesets 76 2.3.1.3 Synsets with 1 Frameset 77 2.3.1.4 Additional Frames 78 2.3.2 Frame Inheritance 79 2.3.2.1 Valency 79 2.3.2.2 Theory of Frame Inheritance 79 2.3.2.3 Investigation into Frame Inheritance 80 2.3.2.3.1 Algorithm for Validating Frame Inheritance 81 2.3.2.3.2 Extended Definition of Valid Frame Inheritance 83 2.3.2.3.3 Adapted Algorithm to Incorporate Broader Definition of Valid Frame Inheritance 84 2.3.2.3.4 Final Evaluation of Frame Inheritance 86 2.4 Conclusions on WordNet 87 3 Investigation into Morphology 90 3.1 Background 91 3.1.1 Some Simple Stemmers 91 3.1.2 A State of the Art Morphological Database? 93 3.1.2.1 Analysis of CatVar Sample Dataset 94 3.1.3 Previous Work on the Morphological Enrichment of WordNet 98 3.1.4 Derivational Trees 102 3.1.5 Morphological Enrichment across Languages 103 3.1.6 Inference of Morphological Relations from a 5 Dictionary 104 3.2 A Rule-based Approach 105 3.2.1 Requirements for the Morphological Enrichment of WordNet 105 3.2.2 Pilot Study on the Formulation and Application of Morphological Rules 108 3.2.2.1 Formulation of Morphological Rules from the CatVar Dataset 108 3.2.2.2 Application of Morphological Rules 113 3.2.2.2.1 Autogeneration of Suffixed Forms 113 3.2.2.2.2 Suffix Stripping 123 3.2.2.2.3 Overgeneration of Suffix Generation and Suffix Stripping Compared 128 3.2.2.3 Prefixations in the Random Word List 129 3.2.2.4 Application to the Enrichment of WordNet 131 3.2.2.5 Conclusions from the Pilot Study 135 3.2.3 Conclusions on Morphological Rules 136 3.3 Review of Existing Morphological Analysis Algorithms 139 3.3.1 From Phoneme to Morpheme 139 3.3.2 Word Segmentation 141 3.3.3 Minimum Description Length 145 3.3.4 Conclusions on Word Segmentation 153 3.4 Automatic Affix Discovery 153 3.4.1 Automatic Prefix Discovery 155 3.4.1.1 Prefix Tree Construction 155 3.4.1.2 Heuristics to Elucidate the Semantic Criterion 161 3.4.1.3 Results from Automatic Prefix Discovery 162 3.4.2 Automatic Suffix Discovery 163 3.4.2.1 Extension of the Algorithm to Suffix Discovery 163 3.4.2.2 Results from Automatic Suffix Discovery 164 3.4.3 Comparison of Results from Automatic Affix Discovery with Results from the Pilot Study on Morphological Rules 165 3.4.3.1 Undergeneration by Automatic Suffix Discovery 165 3.4.3.2 Heuristics Tested against Morphological Rules 166 3.4.4 Additional Heuristics 167 3.4.5 Conclusions on Automatic Affix Discovery 170 3.5 Final Considerations Prior to Morphological Analysis and Enrichment 171 3.5.1 Affix Stripping Precedence 171 3.5.2 Compound Expressions and Concatenations 173 6 3.5.3 Implications of WordNet Granularity
Recommended publications
  • Springer Prefinal V
    Eureka! A Simple Solution to the Complex ‘Tip-of-the-Tongue’-Problem Michael Zock To cite this version: Michael Zock. Eureka! A Simple Solution to the Complex ‘Tip-of-the-Tongue’-Problem. Bastardas- Boada, A.; Massip Bonet, A; Bel-Enguix, G. Complexity Applications in Language and Communica- tion, pp.251-272, 2019, 10.1007/978-3-030-04598-2_14. hal-02079168 HAL Id: hal-02079168 https://hal.archives-ouvertes.fr/hal-02079168 Submitted on 14 Mar 2021 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Copyright Chapter 14 Eureka! A Simple Solution the Complex ‘Tip-of-the-Tongue’- Problem Michael Zock To search for a word in a dictionary without a proper index is like looking for an address in a city without a decent map. 1 Abstract Dictionaries are repositories of knowledge concerning words. While 2 readers are mostly concerned with meanings, writers are generally more concerned 3 with word forms expressing meanings (lemma). I will focus here on this latter task: 4 building a tool to help authors to find the word they are looking for, word they may 5 know but whose form is eluding them.
    [Show full text]
  • New Oxford Rhyming Dictionary Ebook
    NEW OXFORD RHYMING DICTIONARY PDF, EPUB, EBOOK Oxford Dictionaries | 448 pages | 22 Aug 2013 | Oxford University Press | 9780199674220 | English | Oxford, United Kingdom New Oxford Rhyming Dictionary PDF Book Catherine Schobert added it May 02, Designed to complement every introductory library reference course, this is the perfect text for students and librarians looking to expand their personal reference knowledge, teaching failsafe methods for identifying important materials by matching specific types of questions to the best available sources, regardless of format. Children can expand their vocabulary, practise phonic sounds to help with spelling, and being to write their own rhymes! Just a moment while we sign you in to your Goodreads account. More filters. Cancel Save. The fascinating introduction by Professor John Lennard offers a brief outline of rhyming in its literary and historical contexts, and gives further advice on creative writing. Sign in to annotate. The dictionary contains a clear and simple alphabetical list of words that rhyme and rhyming sounds, as well as an index to make finding words simple. An alphabetical index makes finding rhymes for words ranging from the ordinary to the valetudinary a breeze. The New Oxford Rhyming Dictionary is absolutely invaluable and a must for every bookshelf. All the currently accepted terms of grammar are included, as well as older, traditional names, controversial new coinages, and items from the study of other languages. A-Z to view, select the "Entries" tab. Macquarie Dictionary Seventh Edition. Publications Pages Publications Pages. However, for more mature teenagers or adults I think it is a little too simplistic and limited the cover should have been a warning I guess.
    [Show full text]
  • Adding Electronic Value. the Electronic Version of the Grote Van Dale 1 Introduction
    INTERNET LEXICOGRAPHY Adding Electronic Value. The electronic version of the Grote Van Dale Dirk GEERAERTS, Leuven, Belgium Abstract The paper describes the design features of the elektronische grote van dale or EGVD , i.e. the electronic version of the most authoritative dictionary of contemporary Dutch. A comparison between the EGVD and existing dictionaries on cd-rom leads to an identification of the innovative characteristics of the EGVD. Ranking high among these are a fully developed onomasiological search function, the layered representation of entries, and the use of generated speech for the multimedia representation of pronun- ciation. The design features of the EGVD are analyzed from a functional perspective: what functional advantages can electronic dictionaries achieve, in comparison with the paper dictionaries that they are based on? 1 Introduction What value can electronic dictionaries add to their paper counterparts? Although there is a steadily growing market of dictionaries on cd-rom (and electronic dictionaries at large), the functional advantages that may be achieved for the dictionary user when existing dictionaries are converted into an electronic format are only sporadically analyzed in the lexicographical lit- erature. In an attempt to do something about this relative neglect in the literature, I will present the design of one such dictionary on cd-rom that is currently being produced, the Elektronische Grote Van Dale or EGVD. I will try to transcend the level of a mere product presentation, by highlighting those features that are functional improvements arising from the use of an elec- tronic medium, and by comparing the EGVD with existing dictionaries on cd-rom, concentrat- ing on the features that are innovations, or that further elaborate innovative functions in existing electronic dictionaries.
    [Show full text]
  • Using Phonics and Spelling Patterns - 1 – C4s3 16Usingphonics USING PHONICS and SPELLING PATTERNS
    Using Phonics and Spelling Patterns - 1 – c4s3_16usingphonics USING PHONICS AND SPELLING PATTERNS Most of the words we read and write are one-and two-syllable regular words, which, because they are consistent with the rules of spelling and pronunciation, we can decode and spell even if we have not seen them before. Developing the ability to independently read and write most regular words is a complex process and takes time and practice with a variety of activities. This chapter describes activities successfully used by teachers to help all children become independent at decoding and spelling regular one- and two- syllable words. Chapter 1 described activities that develop phonemic awareness and teach some letter names and sounds. If this knowledge is minimal (or nonexistent) in your students, you are not ready for this chapter but should do some of the activities suggested in Chapter 1. The activities in this chapter assume that children have developed some phonemic awareness and know some letter names and consonant sounds. They are now ready to use this knowledge and learn patterns they can use to decode and spell words. VOWELS IN THE ENGLISH SPELLING SYSTEM In English, the vowels are variant and unpredictable. The letter a commonly represents the sound in and, made, agree, art, talk, and care. We have given names to some of these sounds. And has a short a; made has a long a; agree is a schwa; the a in art is r controlled. We don’t even have names for the sound a represents in talk and care. Further complicating things are the many words in which a doesn’t do any of these six common things—eat, coat, legal—and the fact that even the consistent sounds can be spelled in many different ways.
    [Show full text]
  • Using an On-Line Dictionary to Find Rhyming Words And
    USING AN ON=LINE DICTIONARY TO FIND RHYMING WORDS AND PRONUNCIATIONS FOR UNKNOWN WORDS Roy J Byrd I.B.M. Thomas J. Watson Research Center Yorktown Heights, New York 10598 Martin S. Chodorow Department of Psychology, Hunter College of CUNY and I.B.M. Thomas J. Watson Research Center Yorktown Heights, New York 10598 ABSTRACT word not found in the dictionary. WordSmith also shows the user words that are "close" to a given word Humans know a great deal about relationships among along dimensions such as spelling (as in published dic- words. This paper discusses relationships among word tionaries), meaning (as in thesauruses), and sound (as in pronunciations. We describe a computer system which rhyming dictionaries). models human judgement of rhyme by assigning specific roles to the location of primary stress, the similarity of Figure I shows a sample of the WordSmith user inter- phonetic segments, and other factors. By using the face. The current word, urgency, labels the text box at model as an experimental tool, we expect to improve our the center of the screen. The box contains the output understanding of rhyme. A related computer model will of the PRONUNC application applied to the current attempt to generate pronunciations for unknown words word: it shows the pronunciation of urgency and the by analogy with those for known words. The analogical mapping between the word's spelling and pronunciation. processes involve techniques for segmenting and PRONUNC represents pronunciations in an alphabet matching word spellings, and for mapping spelling to derived from Webster's Seventh Collegiate Dictionary. In sound in known words.
    [Show full text]
  • Technical Study of the Dictionaries Published in Sanskrit Language Since 1800 Ad
    TECHNICAL STUDY OF THE DICTIONARIES PUBLISHED IN SANSKRIT LANGUAGE SINCE 1800 AD. A Thesis submitted to the Tilak Maharashtra Vidyapeeth, Pune For the Degree of Vidyavachaspati (Ph. D.) Doctor of Philosophy in the Library and Information Science Under the Faculty of Moral and Social Sciences By Mrs. Manjiri A. Karambelkar Under the Guidance of Dr. N. B. Dahibhate Principal Technical Officer, National Chemical Laboratory, Pune 411008 Department of Library and Information Science April 2014 D E C L A R A T I O N I declare that the thesis entitled “Technical Study of the Dictionaries Published in Sanskrit Language Since 1800 AD” completed and written by me has not previously formed the basis for the award of any Degree or other similar title upon me of this or any other Vidyapeeth or examining body. Date : April 2014 Mrs. Manjiri A. Karambelkar Place : Pune Research Student ii C E R T I F I C A T E This is to certify that the thesis entitled “Technical Study of the Dictionaries Published in Sanskrit Language Since 1800 AD” which is being submitted herewith for award of the degree of Vidyavachaspati (Ph.D.) in Library and Information Science, Faculty of Moral and Social Sciences of Tilak Maharashtra Vidyapeeth, Pune is the result of original research work, completed by Mrs. Manjiri A. Karambelkar under my supervision and guidance. To the best of my knowledge and belief the work incorporated in this thesis has not formed the basis for the award of any Degree or similar title of this or any other University or examining body upon her.
    [Show full text]
  • Preceding Vowel Phoneme Is Short and Spelt
    To access digital resources including: blog posts videos online appendices and to purchase copies of this book in: hardback paperback ebook editions Go to: https://www.openbookpublishers.com/product/325 Open Book Publishers is a non-profit independent initiative. We rely on sales and donations to continue publishing high-quality academic works. Dictionary of the British English Spelling System Greg Brooks Emeritus Professor of Education, University of Sheffield http://www.openbookpublishers.com © 2015 Greg Brooks Version 1.1. Minor edits made July 2017 This work is licensed under a Creative Commons Attribution 4.0 International license (CC BY 4.0). This license allows you to share, copy, distribute and transmit the work; to adapt the work and to make commercial use of the work providing attribution is made to the author (but not in any way that suggests that he endorses you or your use of the work). Attribution should include the following information: Brooks, Greg, Dictionary of the British English Spelling System. Cambridge, UK: Open Book Publishers, 2015. http://dx.doi.org/10.11647/OBP.0053 In order to access detailed and updated information on the license, please visit http://www.openbookpublishers.com/product/325#copyright Further details about CC BY licenses are available at http://creativecommons.org/ licenses/by/4.0 All the external links were active on the 19/07/2017 unless otherwise stated. Digital material and resources associated with this volume are available at http://www.openbookpublishers.com/product/325#resources ISBN Paperback: 978-1-78374-107-6 ISBN Hardback: 978-1-78374-108-3 ISBN Digital (PDF): 978-1-78374-109-0 ISBN Digital ebook (epub): 978-1-78374-110-6 ISBN Digital ebook (mobi): 978-1-78374-111-3 DOI: 10.11647/OBP.0053 Cover image: Spiegel by Jaume Plensa (2010).
    [Show full text]
  • Translating Classical Chinese Poetry Into Rhymed English
    Document generated on 10/01/2021 5:18 p.m. TTR Traduction, terminologie, rédaction Translating Classical Chinese Poetry into Rhymed English: A Linguistic-Aesthetic View L’utilisation de rimes pour traduire en anglais la poésie classique chinoise : une perspective linguistique et esthétique Charles Kwong La traduction au Japon Article abstract Translation in Japan Rhyme is an important element in the fusion of sense and sound that Volume 22, Number 1, 1er semestre 2009 constitutes poetry. No mere ornament in versification, rhyme performs significant artistic functions. Structurally, it unifies and distinguishes units URI: https://id.erudit.org/iderudit/044787ar within a poem. Semantically, it can serve to enhance or ironise sense. DOI: https://doi.org/10.7202/044787ar Emotively, it sets up pleasing resonances that deepen artistic appeal. And prosodically, rhyme can be seen as the keynote in a melody: rhyme is a modulator of pace and rhythm, while rhyme change can mark a turn of See table of contents rhythm and sense in a long poem. Different languages have different combinations of linguistic resources for versification. This essay will revisit the debate on the use of rhymed English to Publisher(s) translate classical Chinese poetry, moving beyond the general observations Association canadienne de traductologie and experiential insights currently available to present concrete evidence on the rhyming resources and practices of English and Chinese. These ISSN comparative observations should shed new light on the linguistic and aesthetic issues involved in using rhymed English to translate classical Chinese poetry. 0835-8443 (print) 1708-2188 (digital) Explore this journal Cite this article Kwong, C.
    [Show full text]
  • Color Dictionary & Thesaurus, 3E Ebook
    COLOR DICTIONARY & THESAURUS, 3E PDF, EPUB, EBOOK Charlotte Livingstone | 772 pages | 13 Aug 2010 | Oxford University Press, USA | 9780199579334 | English | New York, NY, United Kingdom Color Dictionary & Thesaurus, 3e PDF Book Oxford Dictionary of Chemistry 7E. A compact guide to the spelling and meaning of words in the English language today. Dictionary of Weights, Measures and Units. Automatically create lists of vocabulary from any text using the Visual Thesaurus VocabGrabber. Pigments are combined to form different colors by a process of subtractive absorption of various wavelengths. Color charge. All Rights Reserved. Or do you just have an interest in foreign languages? Vocabulary Center Activities. Visit the ANDC website. Play the game. The Australian Middle Primary Oxford Thesaurus is the most comprehensive and up-to-date middle primary thesaurus in Australia, designed to encourage students to expand their vocabulary and increase their confidence in reading and writing. The Australian National Dictionary is the only comprehensive, historically based record of the words and meanings that make up Australian English. Vocabulary Worth Celebrating 14 words. Take the quiz True or False? The perfect introduction to using a dictionary Over headwords and full-colour illustrations Explicit identification of Oxford Wordlist words Complimentary online activity sheets to practise dictionary skills Additional illustrated section at the back with thematic wordlists covering content such as colours, numbers and shapes. January 18, My Wordsmyth Lookup History. Comprehensive and easy-to-use, the thesaurus features headwords in red and customised primary fonts for better word recognition. October 19, Explaining the difference 'Ferment' vs. Dictionaries Comprehensive. Entry 1 of 3 : a plant with narrow leaves that is grown for its showy pink, red, or white flowers pink.
    [Show full text]
  • Assignment 2: Linked Lists in This Assignment, You Will Be Reading in Words from a Pronunciation Dictionary to Figure out Which Words Rhyme with Each Other
    Assignment 2: Linked Lists In this assignment, you will be reading in words from a pronunciation dictionary to figure out which words rhyme with each other. To do this, you will be implementing MyLinkedList and MySortedLinkedList. Besides the lectures in class, you can read much more about the code details of how linked lists work in the book on pages 249-268. You’ll be implementing the following: ● A linked list in MyLinkedList that implements interface ListInterface. You’ll be using ​ ​ ​ ​ MyLinkedList in two different ways, to store a list of RhymeGroupWords and to store a ​ ​ ​ list of Strings within each RhymeGroupWords object. More on this below. ​ ​ ​ ​ ● Code in the main method of RhymingDict that takes lines that have been read from a ​ ​ rhyming dictionary and stores them in the lists. ● A subclass of MyLinkedList called MySortedLinkedList, that has a method for adding ​ ​ ​ ​ items into the list in sorted order. ● Code in the main method of RhymingDict that takes pairs of words specified on the ​ ​ command line and outputs whether they rhyme or not. A LinkedList is useful for storing information when you don’t know how many items you need to store. In this lab, we will be iterating through a rhyming dictionary and storing words together according to their rhyming group, which is the most heavily emphasized part of their pronunciation. In the directory of starter code for the assignment, you’ll see a subdirectory called cmudict. ​ ​ Inside is a file called cmudict-short.dict. Opening this, you’ll see lines like: ​ ​ ​ ​ dignitary D IH1 G N AH0 T EH2 R IY0 necessitate N AH0 S EH1 S AH0 T EY2 T undue AH0 N D UW1 moderated M AA1 D ER0 EY2 T IH0 D The letters after the words represent the phonemes (units of sound) you speak when you pronounce the word.
    [Show full text]
  • Master's Thesis Word Assistant App with Speech Recognition
    Master’s Thesis Word assistant app with speech recognition Ann-Katrin Hannemann Date of defence: 26th of April 2018 Supervising Professor: Enric Mayol Sarroca Departament d’Enginyeria de Serveis i Sistemes d’Informaci´o Master in innovation and research in informatics Facultat d’Inform`aticade Barcelona (FIB) Universitat Polit`ecnicade Catalunya (UPC), BarcelonaTech 1 Contents 1 Abstract4 2 Introduction5 2.1 Problem . .5 2.2 Purpose . .8 3 State of the Art 10 3.1 Existing technology . 10 3.2 Voice assistant services . 14 3.3 Similar apps . 15 3.4 Conclusion . 28 4 Proposal and definition of the project 30 4.1 Scope . 30 4.2 Objectives . 30 4.2.1 Main objective . 30 4.2.2 Business objectives . 30 4.2.3 Technical objectives . 31 4.2.4 Functional objectives . 31 5 Project management 32 5.1 Schedule . 32 5.2 Costs . 34 5.3 Methodology . 37 6 Technnicality 40 6.1 Technology . 40 6.1.1 Type of app . 40 6.1.2 Platform . 40 6.1.3 External APIs and Libraries . 41 6.2 Requirements . 42 6.2.1 User experience requirements . 42 6.2.2 Non-functional requirements . 42 6.2.3 Functional requirements . 43 7 Design 46 7.1 Specification of the system . 46 7.2 Graphical user interfaces . 48 8 Implementation 51 8.1 Architecture . 51 8.2 Coding . 52 8.3 The user interface . 64 9 Verification and validation 68 9.1 Verification . 68 9.2 Validation . 68 9.2.1 UI during survey . 69 9.2.2 Results of survey . 70 10 Sustainability analysis 74 11 Results, future prospects and conclusions 81 11.1 Results .
    [Show full text]
  • A Computer Rhyming Dictionary/Research Tool
    Deseret Language and Linguistic Society Symposium Volume 15 Issue 1 Article 13 3-14-1989 A Computer Rhyming Dictionary/Research Tool Melvin J. Luthy Robert Stevens Follow this and additional works at: https://scholarsarchive.byu.edu/dlls BYU ScholarsArchive Citation Luthy, Melvin J. and Stevens, Robert (1989) "A Computer Rhyming Dictionary/Research Tool," Deseret Language and Linguistic Society Symposium: Vol. 15 : Iss. 1 , Article 13. Available at: https://scholarsarchive.byu.edu/dlls/vol15/iss1/13 This Article is brought to you for free and open access by the Journals at BYU ScholarsArchive. It has been accepted for inclusion in Deseret Language and Linguistic Society Symposium by an authorized editor of BYU ScholarsArchive. For more information, please contact [email protected], [email protected]. A COMPUTER RHYMING DICTIONARY/RESEARCH TOOL Melvin J. Luthy Brigham Young University Robert Stevens WordPerfect Corporation Recognizing the Need For years poets, writers, linguists, scrabble players, crossword puzzle addicts, language materials developers, and other language lovers have had frequent occasion to search for words having particular sounds or sound combinations. The poet may search for a word beginning with the right consonant to complement an alliterative line, or a word with particular vowels to complete the assonance in a line, or perhaps he may search for the right final syllable for a perfect or identical rhyme. The language materials developer may need to collect examples of particular consonant clusters or vowel combinations to help foreign students practice English phonological patterns. The linguist may be interested in statistical data such as frequency of occurrence of particular sounds, or occurrence of particular phonotactic patterns, for which English orthography is inadequate.
    [Show full text]