The Indeterminacy of Word Segmentation and the Nature of Morphology and Syntax1
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Creating Words: Is Lexicography for You? Lexicographers Decide Which Words Should Be Included in Dictionaries. They May Decide T
Creating Words: Is Lexicography for You? Lexicographers decide which words should be included in dictionaries. They may decide that a word is currently just a fad, and so they’ll wait to see whether it will become a permanent addition to the language. In the past several decades, words such as hippie and yuppie have survived being fads and are now found in regular, not just slang, dictionaries. Other words, such as medicare, were created to fill needs. And yet other words have come from trademark names, for example, escalator. Here are some writing options: 1. While you probably had to memorize vocabulary words throughout your school years, you undoubtedly also learned many other words and ways of speaking and writing without even noticing it. What factors are bringing about changes in the language you now speak and write? Classes? Songs? Friends? Have you ever influenced the language that someone else speaks? 2. How often do you use a dictionary or thesaurus? What helps you learn a new word and remember its meaning? 3. Practice being a lexicographer: Define a word that you know isn’t in the dictionary, or create a word or set of words that you think is needed. When is it appropriate to use this term? Please give some sample dialogue or describe a specific situation in which you would use the term. For inspiration, you can read the short article in the Writing Center by James Chiles about the term he has created "messismo"–a word for "true bachelor housekeeping." 4. Or take a general word such as "good" or "friend" and identify what it means in different contexts or the different categories contained within the word. -
A Multi-Level Approach to Word-Formation: Complex Lexemes and Word Semantics
PROCEEDINGS of the XIHth INTERNATIONAL CONGRESS of LINGUISTS, August 29-September 4, 1982, Tokyo Published under the auspices of the CIPL (The Hague) Editors: Shirö HATTORI Kazuko INOUE Associate Editors: Tadao SHIMOMIYA Yoshio NAGASHIMA Actes du XIII0 congres international des linguistes Akten des XIII. Internationalen Linguistenkongresses Atti del XIII. congresso internazionale dei linguisti Dai ΧΙΙΙ-kai kokusai gengogakusha kaigi rombunshü Edited and published by the Proceedings 1 ishing Committee: Hiroya FLJISAKI Yoshihiko IKECAMI Tetsuya KUNIHIRO Yoshio NAGASHIMA Kinsuke HASEGAWA Kazuko INOUE Felix LOBO Tsuyoshi NARA Shiro HATTORI Kunihisa IZUMI Katsumi MATSUMOTO Takao OOE Jiro IKECAMI Hajime KITAMURA Tamotsu MATSUNAMI Tadao SHIMOMIYA Tokyo 1983 DETAILED TABLE OF CONTENTS Title Page ι Organization n Summary Table of Contents in History of the International Congress of Linguists (1928-1982) in Synopsis of the XHIth International Congress of Linguists (Tokyo 1982) iv Preface Shirö Hattori ν List of Previous Proceedings (1930-1978) vm Detailed Table of Contents χ Comite International Permanent des Linguistes xxn Officially Represented Universities, Academies and Scientific Societies .. xxv List of Participants xxvm GREETINGS AND CLOSING ADDRESSES Opening Session Greetings by Shirö Hattori, President of the Congress 3 Greetings by Shigeo Kawamoto, President of the Linguistic Society of Japan 4 Greetings by Robert H. Robins, President of the Comite International Permanent des Linguistes 5 The Address of His Imperial Highness the Crown Prince of Japan .... 6 Congratulatory Message by Heiji Ogawa, Minister of Education, Science and Culture 8 Congratulatory Message by Koji Fushimi, President of the Science Coun cil of Japan 10 Closing Session Closing Address by Shirö Hattori 12 Address at Closing Ceremony by R. -
Character-Word LSTM Language Models
Character-Word LSTM Language Models Lyan Verwimp Joris Pelemans Hugo Van hamme Patrick Wambacq ESAT – PSI, KU Leuven Kasteelpark Arenberg 10, 3001 Heverlee, Belgium [email protected] Abstract A first drawback is the fact that the parameters for infrequent words are typically less accurate because We present a Character-Word Long Short- the network requires a lot of training examples to Term Memory Language Model which optimize the parameters. The second and most both reduces the perplexity with respect important drawback addressed is the fact that the to a baseline word-level language model model does not make use of the internal structure and reduces the number of parameters of the words, given that they are encoded as one-hot of the model. Character information can vectors. For example, ‘felicity’ (great happiness) is reveal structural (dis)similarities between a relatively infrequent word (its frequency is much words and can even be used when a word lower compared to the frequency of ‘happiness’ is out-of-vocabulary, thus improving the according to Google Ngram Viewer (Michel et al., modeling of infrequent and unknown words. 2011)) and will probably be an out-of-vocabulary By concatenating word and character (OOV) word in many applications, but since there embeddings, we achieve up to 2.77% are many nouns also ending on ‘ity’ (ability, com- relative improvement on English compared plexity, creativity . ), knowledge of the surface to a baseline model with a similar amount of form of the word will help in determining that ‘felic- parameters and 4.57% on Dutch. Moreover, ity’ is a noun. -
Basic Morphology
What is Morphology? Mark Aronoff and Kirsten Fudeman MORPHOLOGY AND MORPHOLOGICAL ANALYSIS 1 1 Thinking about Morphology and Morphological Analysis 1.1 What is Morphology? 1 1.2 Morphemes 2 1.3 Morphology in Action 4 1.3.1 Novel words and word play 4 1.3.2 Abstract morphological facts 6 1.4 Background and Beliefs 9 1.5 Introduction to Morphological Analysis 12 1.5.1 Two basic approaches: analysis and synthesis 12 1.5.2 Analytic principles 14 1.5.3 Sample problems with solutions 17 1.6 Summary 21 Introduction to Kujamaat Jóola 22 mor·phol·o·gy: a study of the structure or form of something Merriam-Webster Unabridged n 1.1 What is Morphology? The term morphology is generally attributed to the German poet, novelist, playwright, and philosopher Johann Wolfgang von Goethe (1749–1832), who coined it early in the nineteenth century in a biological context. Its etymology is Greek: morph- means ‘shape, form’, and morphology is the study of form or forms. In biology morphology refers to the study of the form and structure of organisms, and in geology it refers to the study of the configuration and evolution of land forms. In linguistics morphology refers to the mental system involved in word formation or to the branch 2 MORPHOLOGYMORPHOLOGY ANDAND MORPHOLOGICAL MORPHOLOGICAL ANALYSIS ANALYSIS of linguistics that deals with words, their internal structure, and how they are formed. n 1.2 Morphemes A major way in which morphologists investigate words, their internal structure, and how they are formed is through the identification and study of morphemes, often defined as the smallest linguistic pieces with a gram- matical function. -
Learning About Phonology and Orthography
EFFECTIVE LITERACY PRACTICES MODULE REFERENCE GUIDE Learning About Phonology and Orthography Module Focus Learning about the relationships between the letters of written language and the sounds of spoken language (often referred to as letter-sound associations, graphophonics, sound- symbol relationships) Definitions phonology: study of speech sounds in a language orthography: study of the system of written language (spelling) continuous text: a complete text or substantive part of a complete text What Children Children need to learn to work out how their spoken language relates to messages in print. Have to Learn They need to learn (Clay, 2002, 2006, p. 112) • to hear sounds buried in words • to visually discriminate the symbols we use in print • to link single symbols and clusters of symbols with the sounds they represent • that there are many exceptions and alternatives in our English system of putting sounds into print Children also begin to work on relationships among things they already know, often long before the teacher attends to those relationships. For example, children discover that • it is more efficient to work with larger chunks • sometimes it is more efficient to work with relationships (like some word or word part I know) • often it is more efficient to use a vague sense of a rule How Children Writing Learn About • Building a known writing vocabulary Phonology and • Analyzing words by hearing and recording sounds in words Orthography • Using known words and word parts to solve new unknown words • Noticing and learning about exceptions in English orthography Reading • Building a known reading vocabulary • Using known words and word parts to get to unknown words • Taking words apart while reading Manipulating Words and Word Parts • Using magnetic letters to manipulate and explore words and word parts Key Points Through reading and writing continuous text, children learn about sound-symbol relation- for Teachers ships, they take on known reading and writing vocabularies, and they can use what they know about words to generate new learning. -
Understanding and Measuring Morphological Complexity
OUP CORRECTED PROOF – FINAL, 5/3/2015, SPi Understanding and Measuring Morphological Complexity Edited by MATTHEW BAERMAN, DUNSTAN BROWN, AND GREVILLE G. CORBETT 1 OUP CORRECTED PROOF – FINAL, 5/3/2015, SPi 3 Great Clarendon Street, Oxford, ox2 6dp, United Kingdom Oxford University Press is a department of the University of Oxford. It furthers the University’s objective of excellence in research, scholarship, and education by publishing worldwide. Oxford is a registered trade mark of Oxford University Press in the UK and in certain other countries © editorial matter and organization Matthew Baerman, Dunstan Brown, and Greville G. Corbett 2015 © the chapters their several authors 2015 Themoralrightsoftheauthorshavebeenasserted First Edition published in 2015 Impression: 1 All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, without the prior permission in writing of Oxford University Press, or as expressly permitted by law, by licence or under terms agreed with the appropriate reprographics rights organization. Enquiries concerning reproduction outside the scope of the above should be sent to the Rights Department, Oxford University Press, at the address above You must not circulate this work in any other form and you must impose this same condition on any acquirer Published in the United States of America by Oxford University Press 198 Madison Avenue, New York, NY 10016, United States of America British Library Cataloguing in Publication Data Data available Library of Congress Control Number: 2014947260 ISBN 978–0–19–872376–9 Printed and bound by CPI Group (UK) Ltd, Croydon, cr0 4yy Links to third party websites are provided by Oxford in good faith and for information only. -
Reconsidering the “Isolating Protolanguage Hypothesis” in the Evolution of Morphology Author(S): Jaïmé Dubé Proceedings
Reconsidering the “isolating protolanguage hypothesis” in the evolution of morphology Author(s): Jaïmé Dubé Proceedings of the 37th Annual Meeting of the Berkeley Linguistics Society (2013), pp. 76-90 Editors: Chundra Cathcart, I-Hsuan Chen, Greg Finley, Shinae Kang, Clare S. Sandy, and Elise Stickles Please contact BLS regarding any further use of this work. BLS retains copyright for both print and screen forms of the publication. BLS may be contacted via http://linguistics.berkeley.edu/bls/. The Annual Proceedings of the Berkeley Linguistics Society is published online via eLanguage, the Linguistic Society of America's digital publishing platform. Reconsidering the Isolating Protolanguage Hypothesis in the Evolution of Morphology1 JAÏMÉ DUBÉ Université de Montréal 1 Introduction Much recent work on the evolution of language assumes explicitly or implicitly that the original language was without morphology. Under this assumption, morphology is merely a consequence of language use: affixal morphology is the result of the agglutination of free words, and morphophonemic (MP) alternations arise through the morphologization of once regular phonological processes. This hypothesis is based on at least two questionable assumptions: first, that the methods and results of historical linguistics can provide a window on the evolution of language, and second, based on the claim that some languages have no morphology (the so-called isolating languages), that morphology is not a necessary part of language. The aim of this paper is to suggest that there is in fact no basis for what I will call the Isolating Proto-Language Hypothesis (henceforth IPH), either on historical or typological grounds, and that the evolution of morphology remains an interesting question. -
The State of the Art in Developing Fuzzy Ontologies: a Survey
The State of the Art in Developing Fuzzy Ontologies: A Survey Zahra Riahi Samani1, Mehrnoush Shamsfard2, 1Faculty of Computer Science and Engineering, Shahid Beheshti University Email:[email protected] 1Faculty of Computer Science and Engineering, Shahid Beheshti University Email:[email protected] Abstract Conceptual formalism supported by typical ontologies may not be sufficient to represent uncertainty information which is caused due to the lack of clear cut boundaries between concepts of a domain. Fuzzy ontologies are proposed to offer a way to deal with this uncertainty. This paper describes the state of the art in developing fuzzy ontologies. The survey is produced by studying about 35 works on developing fuzzy ontologies from a batch of 100 articles in the field of fuzzy ontologies. 1. Introduction Ontology is an explicit, formal specification of a shared conceptualization in a human understandable, machine- readable format. Ontologies are the knowledge backbone for many intelligent and knowledge based systems [1, 2]. However, in some domains, real world knowledge is imprecise or vague. For example in a search engine one may be interested in ”an extremely speedy, small size, not expensive car”. Classical ontologies based on crisp logic are not capable of handling this kind of knowledge. Fuzzy ontologies were proposed as a combination of fuzzy theory and ontologies to tackle such problems. On the topic of fuzzy ontology, we studied about 100 research articles which can be categorized into four main categories. The first category includes the research works on applying fuzzy ontologies in a specific domain- application to improve the performance of the application such as group decision making systems [3] or visual video content analysis and indexing [4].The ontology development parts of the works in this category were done manually or were not of much concentration. -
Transparency in Language a Typological Study
Transparency in language A typological study Published by LOT phone: +31 30 253 6111 Trans 10 3512 JK Utrecht e-mail: [email protected] The Netherlands http://www.lotschool.nl Cover illustration © 2011: Sanne Leufkens – image from the performance ‘Celebration’ ISBN: 978-94-6093-162-8 NUR 616 Copyright © 2015: Sterre Leufkens. All rights reserved. Transparency in language A typological study ACADEMISCH PROEFSCHRIFT ter verkrijging van de graad van doctor aan de Universiteit van Amsterdam op gezag van de Rector Magnificus prof. dr. D.C. van den Boom ten overstaan van een door het college voor promoties ingestelde commissie, in het openbaar te verdedigen in de Agnietenkapel op vrijdag 23 januari 2015, te 10.00 uur door Sterre Cécile Leufkens geboren te Delft Promotiecommissie Promotor: Prof. dr. P.C. Hengeveld Copromotor: Dr. N.S.H. Smith Overige leden: Prof. dr. E.O. Aboh Dr. J. Audring Prof. dr. Ö. Dahl Prof. dr. M.E. Keizer Prof. dr. F.P. Weerman Faculteit der Geesteswetenschappen i Acknowledgments When I speak about my PhD project, it appears to cover a time-span of four years, in which I performed a number of actions that resulted in this book. In fact, the limits of the project are not so clear. It started when I first heard about linguistics, and it will end when we all stop thinking about transparency, which hopefully will not be the case any time soon. Moreover, even though I might have spent most time and effort to ‘complete’ this project, it is definitely not just my work. Many people have contributed directly or indirectly, by thinking about transparency, or thinking about me. -
DIMENSIONS and LEVELS of LINGUISTIC ANALYSIS When
DIMENSIONS AND LEVELS OF LINGUISTIC ANALYSIS Tukhtaeva Farida Lecturer, Bukhara State University, Foreign languages faculty, Uzbekistan. Abstract: What are dimensions? Dimensions are syntagmatic and paradigmatic relations between units of language. As a result of their potentiality of occurrence in a definite context or sentence lingual units such as, morphemes, phonemes, lexemes go into relations of two distinctive types. They are often called syntagmatic and paradigmatic relations in scientific literature. Key words: dimension, linguistic analysis, literature. When lingual units join with others along a horizontal dimension, they enter into syntagmatic relations. Syntagmatic relations are direct linear relations between units of language of the same level in a segmental order. For example: “My little brother watered beautiful flowers in our garden.” In this following sentence the words and word-groups are syntagmatically related. They are: brother, my little brother, watered, my little brother watered, watered beautiful flowers, beautiful flowers, watered in our garden. Morphemes of the words in this sentence are also connected syntagmatically. E.g.: flower/s, water/ed, beauty/ful. Phonemes and graphemes are connected syntagmatically within morphemes and words. For example: b/r/o/t/h/e/r, g/a/r/d/e/n. The combination of two words or word-groups in a segmental string, one of which is followed by the other, forms a unit which is referred to as a syntactic “syntagma.”There are 4 basic kinds of theoretical syntagmas: predicative syntagma, which expresses a combination of a subject and a predicate. For example: My uncle grows wheat in the field every year. In the following sentence “My uncle grows” is a predicative syntagma. -
Asshai Reference Grammar and Lexicon—David J
Asshai Reference Grammar and Lexicon—David J. Peterson 1 Asshai Reference Grammar and Lexicon by David J. Peterson Asshai Reference Grammar and Lexicon—David J. Peterson 2 Introduction The Asshai'i are a mysterious people from the far east—often murmured about, but rarely seen or discussed directly in A Song of Ice and Fire. Taking the various snippets of names and character descriptions from the books, I've created a language for the Asshai'i that both includes all the existing names and terms from the book, and suits the general Asshai aesthetic hinted out throughout A Song of Ice and Fire. Regards, David J. Peterson Asshai Reference Grammar and Lexicon—David J. Peterson 3 1. Asshai Language Description Phonology: • The phonetic inventory of Asshai is listed below: Consonants Labial Dental Alveolar Palatal Velar Glottal Plain Labial. Stops p/b t/d k/g kʷ/gʷ ʔ Fricatives θ/ð s/z ɬ/ɮ ʃ/ʒ x/ɣ xʷ/ɣʷ h Nasals m n ŋ ŋʷ Approximants r l j w Vowels Front Central Back High i, iː/ĩ, ĩː u, uː/ũ, ũː Close Mid e, eː/ẽ, ẽː ə, əː o, oː/õ, õː Open Mid ɛ, ɛː/ɛ,̃ ɛː̃ ɔ, ɔː/ɔ,̃ ɔː̃ Low a, aː/ã, ãː • There are four diphthongs: ay, aay, aw, aaw. • The symbols listed in the tables above are phonetic symbols. These will be used to transcribe Asshai words, but not to write them. To write them, I've devised a romanization system that should make the pronunciation fairly transparent. -
A Fuzzy Description Logic
From: AAAI-98 Proceedings. Copyright © 1998, AAAI (www.aaai.org). All rights reserved. A Fuzzy Description Logic Umberto Straccia I.E.I - C.N.R. Via S. Maria, 46 I-56126 Pisa (PI) ITALY [email protected] Abstract is such a concept: we may say that an individual tom is an instance of the concept Tall only to a certain degree Description Logics (DLs, for short) allow reasoning n ∈ [0, 1] depending on tom’s height. about individuals and concepts, i.e. set of individuals with common properties. Typically, DLs are limited to Fuzzy logic directly deals with the notion of vague- dealing with crisp, well dened concepts. That is, con- ness and imprecision using fuzzy predicates. Therefore, cepts for which the problem whether an individual is it oers an appealing foundation for a generalisation of an instance of it is a yes/no question. More often than DLs in order to dealing with such vague concepts. not, the concepts encountered in the real world do not The aim of this work is to present a general fuzzy DL, have a precisely dened criteria of membership: we which combines fuzzy logic with DLs. In particular we may say that an individual is an instance of a concept will extend DLs by allowing expressions of the form only to a certain degree, depending on the individual’s hC(a) ni (n ∈ [0, 1]), e.g. hTall(tom) .7i, with intended properties. Concepts of this kind are rather vague than precise. As fuzzy logic directly deals with the notion of meaning “the membership degree of individual a being vagueness and imprecision, it oers an appealing foun- an instance of concept C is at least n”.