Word Construction: Tracing an Optimal Path Through the Lexicon
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Malayalam Noun and Verb Morphological Analyzer: a Simple Approach
Malayalam Noun and Verb Morphological Analyzer: A Simple Approach Nimal J Valath1, Narsheedha Beegum2 M.Tech Student1, M.Tech Student2 ABSTRACT Malayalam is a language of the Dravidian family and is one of This paper discusses the methods involved in the the four major languages of this family with a rich literary development of a Simple Malayalam Verb and Noun tradition. It is very close to Tamil, one of the major languages of Morphological Analyzer. Since in Malayalam, words can be the same family. This was due to the extensive cultural derived from a root word, a purely dictionary based approach synthesis that took place between the speakers of the two for Morphological analysis is not practical. Hence, a ‘Rule-cum- languages. The origin of Malayalam as a distinct language may Dictionary’ based approach is followed along with the Suffix be traced to the last quarter of 9th Century A.D. Throughout its Stripping concept. The grammatical behavior of the language, gradual evolution Malayalam has been influenced by the the formation of words with multiple suffixes and the various circumstances prevailed on different periods. preparation of the language are dealt with here, with examples of noun and verb forms in detail. Mainly Malayalam was influenced by Sanskrit and Prakrit brought into Kerala by Brahmins. After the 11th century a unique mixture of the native languages of Kerala and Sanskrit Keywords known as Manipravalam served as the medium of literary Morphological Analyzer, Malayalam, Suffix stripping, expression. Malayalam absorbed a lot from Sanskrit, not only in Transliteration, Retransliteration, Verb and Noun, Sandhi rules, the lexical level, but also in the phonemic, morphemic and Word Formation, Noun Cases, Algorithm. -
Unsupervised Separation of Transliterable and Native Words For
Unsupervised Separation of Transliterable and Native Words for Malayalam Deepak P Queen’s University Belfast, UK [email protected] Abstract names that need to be transliterated than translated to correlate with English text. On a manual analy- Differentiating intrinsic language words sis of a news article dataset, we found that translit- from transliterable words is a key step erated words and proper nouns each form 10-12% aiding text processing tasks involving dif- of all distinct words. It is useful to transliterate ferent natural languages. We consider such words for scenarios that involve processing the problem of unsupervised separation of Malayalam text in the company of English text; transliterable words from native words for this will avoid them being treated as separate index text in Malayalam language. Outlining a terms (wrt their transliteration) in a multi-lingual key observation on the diversity of char- retrieval engine, and help a statistical translation acters beyond the word stem, we develop system to make use of the link to improve effec- an optimization method to score words tiveness. In this context, it ia notable that there has based on their nativeness. Our method re- been recent interest in devising specialized meth- lies on the usage of probability distribu- ods to translate words that fall outside the core vo- tions over character n-grams that are re- cabulary (Tsvetkov and Dyer, 2015). fined in step with the nativeness scorings In this paper, we consider the problem of sepa- in an iterative optimization formulation. rating out such transliterable words from the other Using an empirical evaluation, we illus- words within an unlabeled dataset; we refer to trate that our method, DTIM, provides sig- the latter as “native” words. -
Redundancy As a Phonological Feature in the English Language
International Journal of English Language and Communication Studies Vol. 4 No.2 2018 ISSN 2545 - 5702 www.iiardpub.org Redundancy as a Phonological Feature in the English Language Shehu Muhammed Department of GSE School of Education Aminu Saleh College of Education, Azare, Bauchi State Nigeria [email protected] Msuega Ahar Department of Languages and Linguistics, Benue State University, Makurdi [email protected] Daniel Benjamin Saanyol Department of Psychology, School of Education, Aminu Saleh College of Education Azare, Bauchi State [email protected] Abstract The study critically examines redundancy as a phonological feature in the English Language. It establishes that it is an integral aspect of the English phonology. The study adopts a survey method using structuralist approach. The study was conducted among the speakers of English language in Benue State University, Makurdi. The sample size of about 60 respondents were taken from a selected number of departments: Languages and linguistics, Mass communication, Religion and Philosophy, Theatre Arts, English language and the Political Science undergraduate students who ranged from 16 - 35 years. The study used the following instruments: oral interviews, personal observations and discussion group to obtain data for the study. The findings reveal that redundancy must be understood and given space in the daily practice of the language. This will enhance effective communication and understanding among speaker of English Language. In fact, failure in speaking the language with a consistent agreement to the entire phonological system (including the redundant features) will sometimes amount to communication breakdown and misinterpretation of the meaning. A close observation has shown that many speakers of English language lack a good knowledge of its phonological system. -
Children's Possessive Structures
Ling 404 Lecture Notes No.4 Synchronic Clines in Morphology Productivity Clines: Japanese, Navajo, Mohawk, Eskimo, vs. Spanish, Italian, English One of the questions will be asking throughout is where do the above languages fall on the productivity cline and why (providing data and analyses.) One question to ask is to what degree is the lexicon ‘morpheme-based’ or ‘word-based’ (Chapters 3-4)? ‘Working Memory’ is implicated in the choice. For instance, are speakers all equally productive with their morpheme-units as tucked within word, a [-Fusion] language, or do some languages rather require morphemes within words to be memorized and incorporated as part of the lexical item, a [+Fusion] language? In addition to labeling languages as [+]Synthetic (as in Turkish, Hungarian, Spanish, English, Mohawk) or [-]Synthetic (as in Chinese, Vietnamese), the following data provide an additional exercise in teasing out what might be going on in languages which carry such a large number of embedded morphemes. Question leading to mid-term: How might such morphemes in Polysynthetic-type languages be stored and processed? The notion of [+/-Fusion] will be the central question here as we move to our mid-term material of this class. Typically, agglutinative morphemes are considered to be ‘loosely’ structured in that one morpheme has a one-to-one meaning. But the question might rather be how productive are the morphemes in isolation—viz., do they allow for movement? The question of movement, both at the word-level and at the morpheme-level will allow us to determine the nature of +/- productivity and +/- fusion. Regarding Movement. -
How to Understand Conjugation in Japanese, Words That Form the Predicate, Such As Verbs and Adjectives, Change Form According to Their Meaning and Function
How to Understand Conjugation In Japanese, words that form the predicate, such as verbs and adjectives, change form according to their meaning and function. This change of word form is called conjugation, and words that change their forms are called conjugational words. Yōgen (用言: roughly verbs and adjectives) Complex Japanese predicates can be divided into several “words” when one looks at formal characteristics such as accent and breathing. (Here, a “word” is almost equivalent to an accent unit, or the smallest a unit governed by breathing pauses, and is larger than the word described in school grammar.) That is, the Japanese predicate does not consist of one word; rather it is a complex of yōgen, made up of a series of words. Each of the “words” that make up the predicate (i.e., yōgen complex) undergoes word form change, and words in a similar category may be strung together in a row. Therefore, altogether there is quite a wide variety of forms of predicates. Some people do not divide the yōgen complex into words, however. They treat the predicate as a single entity, and call the entire variety of changes conjugation. Some try to integrate all variety of such word form change into a single chart. This chart not only tends to be enormous in size, but also each word shows the same inflection cyclically. There is much redundancy in this approach. When dealing with Japanese, it will be much simpler if one separates the description of the order in which the components of the predicate (yōgen complex) appear and the description of the word form change of individual words. -
Abstracts of the Psychonomic Society — Volume 13 — November 2008 49Th Annual Meeting — November 13–16, 2008 — Chicago, Illinois
Abstracts of the Psychonomic Society — Volume 13 — November 2008 49th Annual Meeting — November 13–16, 2008 — Chicago, Illinois Papers 1–7 Friday Morning Motion and Attention motion is mixed. In the present study, target stimuli were associated with Grand Ballroom, Friday Morning, 8:00–9:40 objects that loomed, receded or remained static in arrays of varying size. Results showed that both motion types received equal prioritization, as Chaired by Gary Chon-Wen Shyi, National Chung Cheng University evidenced by their parallel search slopes, yet looming targets benefited from an overall reduction in reaction time (RT). Further investigation 8:00–8:15 (1) ruled out possible confounding explanations for this RT advantage, Differential Distribution of Visuospatial Attention in Tracking Mul- while a perceptual measure of performance confirmed the attentional tiple Moving Objects. GARY C.-W. SHYI & SUNG-EN CHIEN, Na- equivalence of the two motion types. Taken together, these results in- tional Chung Cheng University—Paying attention to a relatively com- dicate that looming and receding objects receive equal prioritization plex object has shown evidence for differential distribution within the during attentional selection. However, it may be that postattentional pro- object. Here, we explored differential distribution of attention when mul- cesses, possibly those involved in motor preparation, facilitate responses tiple moving objects were visually tracked. In Experiments 1 and 2, we to looming motion. not only replicated the findings reported by Alvarez and Scholl (2005), demonstrating both attentional concentration and attentional amplifica- 9:20–9:35 (5) tion, but also generalized the effects to uniformed circular shapes. In Individual Differences in Voluntary Visual Attention. -
A Neurolinguistic Approach to Linguistic Redundancy
A NEUROLINGUISTIC APPROACH TO LINGUISTIC REDUNDANCY Assistant Prof. Dr. Laura Carmen CU ŢITARU, “Alexandru I.Cuza” University of Ia şi Abstract This paper proposes a neurolinguistic explanation for the overlap of redundancy as conceptualized by information theory and redundancy as understood in literary theory. Keywords : information theory, language recognition, redundancy 1. The Concept of Redundancy In July and October 1948, Claude Shannon, an engineer working at the Bell Telephone Laboratories, published two papers in the Bell System Technical Journal , in which he formulated a set of theorems concerning quick and accurate transmission of messages from one place to another. Although intended primarily for radio and telephone engineers, the generalizations Shannon made established laws that proved to govern all kinds of messages, no matter the medium. He practically established information theory, and its tenets can be used in order to investigate any system in which a message/information is sent from a source to a receiver. One essential condition for any successful communication is that the message should be received and understood by the receiver. But there is a natural occurrence with communication systems in general to be exposed to interferences, which, in the jargon of this field, are called noise . Anything that corrupts the integrity of a message (like image distortions on a TV screen, static in a radio set, gaps or smudged lines in a written text) qualifies as noise. During World War II, Shannon worked on secret codes, and on ways to separate information from noise. What he found was to become one of the most important concepts in communications theory: redundancy . -
CLIPP Christiani Lehmanni Inedita, Publicanda, Publicata Pleonasm and Hypercharacterization
CLIPP Christiani Lehmanni inedita, publicanda, publicata titulus Pleonasm and hypercharacterization huius textus situs retis mundialis http://www.uni-erfurt.de/ sprachwissenschaft/personal/lehmann/CL_Publ/ Hypercharacterization.pdf dies manuscripti postremum modificati 23.02.2006 occasio orationis habitae 11. Internationale Morphologietagung Wien, 14.-17.02.2004 volumen publicationem continens Booij, Geert E. & van Marle, Jaap (eds.), Yearbook of Morphology 2005. Heidelberg: Springer annus publicationis 2005 paginae 119-154 Pleonasm and hypercharacterization Christian Lehmann University of Erfurt Abstract Hypercharacterization is understood as pleonasm at the level of grammar. A scale of strength of pleonasm is set up by the criteria of entailment, usualness and contrast. Hy- percharacterized constructions in the areas of syntax, inflection and derivation are ana- lyzed by these criteria. The theoretical basis of a satisfactory account is sought in a holis- tic, rather than analytic, approach to linguistic structure, where an operator-operand struc- ture is formed by considering the nature of the result, not of the operand. Data are drawn from German, English and a couple of other languages. The most thorough in a number of more or less sketchy case studies is concerned with German abstract nouns derived in -ierung (section 3.3.1). This process is currently so productive that it is also used to hy- percharacterize nouns that are already marked as nominalizations. 1 1. Introduction Hypercharacterization 2 (German Übercharakterisierung) may be introduced per ostensionem: it is visible in expressions such as those of the second column of T1. T1. Stock examples of hypercharacterization language hypercharacterized basic surplus element German der einzigste ‘the most only’ der einzige ‘the only’ superlative suffix –st Old English children , brethren childer , brether plural suffix –en While it is easy, with the help of such examples, to understand the term and get a feeling for the concept ‘hypercharacterization’, a precise definition is not so easy. -
Abstracts (PDF)
Abstracts of the Psychonomic Society — Volume 12 — November 2007 48th Annual Meeting — November 15–18, 2007 — Long Beach, California Papers 1–7 Friday Morning Implicit Memory is a widely used experimental procedure that has been modeled in Regency ABC, Friday Morning, 8:00–9:20 order to obtain ostensibly separate measures for explicit memory and implicit memory. Existing models are critiqued here and a new con- Chaired by Neil W. Mulligan fidence process-dissociation (CPD) model is developed for a modi- University of North Carolina, Chapel Hill fied process-dissociation task that uses confidence ratings in con- junction with the usual old/new recognition response. With the new 8:00–8:15 (1) model there are several direct tests for the key assumptions of the stan- Attention and Auditory Implicit Memory. NEIL W. MULLIGAN, dard models for this task. Experimental results are reported here that University of North Carolina, Chapel Hill—Traditional theorizing are contrary to the critical assumptions of the earlier models for the stresses the importance of attentional state during encoding for later process-dissociation task. These results cast doubt on the suitability memory, based primarily on research with explicit memory. Recent re- of the process-dissociation procedure itself for obtaining separate search has investigated the role of attention in implicit memory in the measures for implicit and explicit memory. visual modality. The present experiments examined the effect of divided attention on auditory implicit memory, using auditory perceptual iden- Attentional Selection and Priming tification, word-stem completion, and word-fragment completion. Par- Regency DEFH, Friday Morning, 8:00–10:00 ticipants heard study words under full attention conditions or while si- multaneously carrying out a distractor task. -
Leveraging Multimodal Redundancy for Dynamic Learning, with SHACER — a Speech and Handwriting Recognizer
Leveraging Multimodal Redundancy for Dynamic Learning, with SHACER | a Speech and HAndwriting reCognizER Edward C. Kaiser B.A., American History, Reed College, Portland, Oregon (1986) A.S., Software Engineering Technology, Portland Community College, Portland, Oregon (1996) M.S., Computer Science and Engineering, OGI School of Science & Engineering at Oregon Health & Science University (2005) A dissertation submitted to the faculty of the OGI School of Science & Engineering at Oregon Health & Science University in partial ful¯llment of the requirements for the degree Doctor of Philosophy in Computer Science and Engineering April 2007 °c Copyright 2007 by Edward C. Kaiser All Rights Reserved ii The dissertation \Leveraging Multimodal Redundancy for Dynamic Learning, with SHACER | a Speech and HAndwriting reCognizER" by Edward C. Kaiser has been examined and approved by the following Examination Committee: Philip R. Cohen Professor Oregon Health & Science University Thesis Research Adviser Randall Davis Professor Massachusetts Institute of Technology John-Paul Hosom Assistant Professor Oregon Health & Science University Peter Heeman Assistant Professor Oregon Health & Science University iii Dedication To my wife and daughter. iv Acknowledgements This thesis dissertation in large part owes it existence to the patience, vision and generosity of my advisor, Dr. Phil Cohen. Without his aid it could not have been ¯nished. I would also like to thank Professor Randall Davis for his interest in this work and his time in reviewing and considering it as the external examiner. I am extremely grateful to Peter Heeman and Paul Hosom for their e®orts in participating on my thesis committee. There are many researchers with whom I have worked closely at various stages during my research and to whom I am indebted for sharing their support and their thoughts. -
Processing Distinctions Between Stems and Affixes: Evidence from a Non-Fluent Aphasic Patient*
cognition, 36 (1990) 129-153 Processing distinctions between stems and affixes: Evidence from a non-fluent aphasic patient* LORRAINE K. NLER SUSANBEHRENS HOWARD COBB University of Cambridge WlLLlAM MARSLEN-WILSON MRC Applied Psychology Unit, Cambridge Received June 2, 1989, final revision accepted February 19, 1990 Abstract Tyler, L.K., Behrens, S., Cobb, H., and Marsten-Witson, W., 1990. Processing distinctions between stems and affixes: Evidence from a non-fluent aphasic patient. Cognition, 36: 129-153. In this paper, we study the ability of a non-fluent aphasic patient, BX, to comprehend morphologically complex words when they appear in utterance contexts. Wefirst establish that he is insensitive to the contextual appropriateness of both derived and inflected words. In a further experiment we show that he has no difficulty processing the stems of complex words and conclude th-zt his problem is with the bound morphemes themselves. We then ask whether this problem is due to his inability to access either the phonological form of a morphologically complex word or its semantic andlor syntactic content. We find that only the access of semantic and syntactic content is impaired. We conclude from these six studies that: (a) BN presents a counter-example to the claim that non-fluent patients have particular dificulty with those aspects of morphology which have a syntactic function; (b) BN processes both derived and inflected words by mapping the sensory input onto the entire full-form of a complex word, but the semantic and syntactic content of the stem alone is accessed and integrated into the context. The semantic and syntactic implica- tions of the suffix are never evaluated. -
Word Formation
English 256 Spring 2005 Word (Stem) Formation I. Derivation A. Affixation 1. Prefixation a. English i. write/rewrite read/reread injure/reinjure shoot/reshoot play/replay do/redo move/remove tell/retell ii. large/enlarge dear/endear able/enable purple/empurple rich/enrich iii. do/undo tie/untie fasten/unfasten wrap/unwrap cover/uncover dress/undress latch/unlatch lock/unlock iv. kind/unkind easy/uneasy fit/unfit believable/unbelievable gentle/ungentle afraid/unafraid bound/unbound b. Maricopa (Southwest American Indian) i. a’uly ‘carry (something long)’ aduuly ‘hide (something long)’ ch’uly ‘carry (things)’ chduuly ‘hide (things)’ t’uly ‘carry (something round)’ tduuly ‘hide (something round)’ a’or ‘put (something long) on’ anak ‘put (something long) in liquid’ t’or ‘put (something round) on’ chnak ‘put (things) in liquid’ tnak ‘put (something round) in liquid’ ii. puy ‘die’ tpuy ‘kill’ pom ‘burn’ tpom ‘burn (something)’ ‘er ‘be scared’ t’er ‘scare’ herher ‘be wrinkled’ therher ‘wrinkle (something)’ hmaaly ‘be white’ thmaaly ‘white’ hpily ‘be dirty’ thpily ‘make dirty’ 2. Suffixation a. English i. writer/writer read/reader do/doer work/worker fly/flier sing/singer dance/dancer drive/driver run/runner ii. type/typist art/artist violin/violinist manicure/manicurist left/leftist race/racist duel/duelist iii. quick/quickly childish/childishly true/truly angry/angrily final/finally truthful/truthfully false/falsely amazing/amazingly happy/happily correct/correctly real/really English 256 Spring 2005 b. Kurdish (Near Eastern) aaqil ‘wise’ aaqilii ‘forethought’ draiz ‘long’ draizii ‘length’ diz ‘robber’ dizii ‘robbery’ garm ‘warm’ garmii ‘warmth’ c. Hanunoo (Philippines) usa ‘one’ ?usahi ‘make it one!’ upat ‘four’ upati ‘make it four!’ duwa ‘two’ duwahi ‘make it two!’ unum ‘six’ unumi ‘make it six!’ tulu ‘three’ tuluhi ‘make it three!’ 3.