Inferring Morphological Rules from Small Examples Using 0/1 Linear Programming

Total Page:16

File Type:pdf, Size:1020Kb

Inferring Morphological Rules from Small Examples Using 0/1 Linear Programming Inferring morphological rules from small examples using 0/1 linear programming Ann Lilliestrom,¨ Koen Claessen, Nicholas Smallbone Chalmers University of Technology Gothenburg, Sweden fannl,koen,[email protected] Abstract Here, +’s∗ represents the suffix ’s, and +a+ repre- 1 We show how to express the problem of sents an infix a. The table shows that both ’s and finding an optimal morpheme segmentation an apostrophe can mark the genitive case; the sec- from a set of labelled words as a 0/1 linear ond column means that the genitive was marked programming problem, and how to build by ’s only in singular nouns, and by an apostrophe on this to analyse a language’s morphology. only in plural nouns. An s suffix marks plural, and The approach works even when there is because of the tiny input data it was only seen in very little training data available. genitive nouns. Plural can be marked by an inner vowel change from a to e (indicated by the arrow), 1 Introduction or by changing a final y to ies, in which case the Many recent tools for morphological analysis use singular form is marked by a or y. statistical approaches such as neural networks (Cot- The tool also segments the input words into mor- terell et al., 2018). These approaches can profitably phemes consistent with the table of rules (the in- use huge amounts of training data, which makes ferred stem is marked in bold): them ideal for high-resource languages. But if there standard inflected is little training data available, statistical methods womjajnj ? womjejnj ? can struggle to learn an accurate model. And when Sg Nom Pl Nom babjyj ? babjiesj ’ they learn the wrong model, it is difficult to diag- Sg Nom Pl Gen dogj ? dogjsj ’ nose the error, because the model is a black box. Sg,Nom Pl Gen This paper presents a new approach to catj ? catj ’s j? Sg,Nom Gen Sg morphological analysis that produces human- lorrjyj ? lorrjiesj ? understandable models and works even if only a Sg Nom Pl Nom Our key idea is a novel morphological segmen- few training words are given. The user gives a list tation algorithm used to produce the segmentation of inflected words together with each word’s mor- above (section 3). Once we have a segmentation, phosyntactic features and standard form (lemma): we use it to compute the rules on the left (section 4). standard inflected features In this example, the rules inferred by our tool pre- woman women Pl;Nom cisely capture the morphology of the input words, baby babies’ Pl;Gen and the segmentation shows which rules are most dog dogs’ Pl;Gen common. We claim that, together, they can be cat cat’s Sg;Gen used to perform high-quality morphological analy- lorry lorries Pl;Nom sis. We validate this claim in section 5 by showing Our tool then proposes, for each feature, the affixes how to build on our tool to do reinflection. and morphological rules which mark that feature. For the example above it suggests the following: 2 Related Work feature context morpheme Gen Sg +’s∗ Morphological segmentation is a well-studied prob- Gen Pl +’∗ lem in NLP, with applications including machine Pl Gen +s∗ translation (Green and DeNero, 2012), speech Pl Nom +a+ ! +e+ 1The tool also supports prefixes such as ∗un+, and cir- Pl +y∗ ! +ies∗ cumfixes such as ∗ge+t∗ in the German gerannt (run; past Sg +y∗, +a+, ? participle). We explain the meaning of ∗ and + in section 3.3. recognition (Rajagopal Narasimhan et al., 2014) of multiple discontinuous pieces, or be empty. In and information retrieval (Turunen and Kurimo, the segmentation on page 1, for example, the word 2008); for a more detailed overview we refer women is segmented by assigning Pl the segment the reader to Ruokolainen et al. (2016) or Ham- e, Nom the null segment, and woman the discon- marstrom¨ and Borin (2011). Unsupervised learning tinuous segment wom*n. In general, the segment (Harris, 1955; Creutz and Lagus, 2007; Goldsmith, assigned to the lemma feature represents the stem 2001; Johnson, 2008; Poon et al., 2009) relies on of the word. unannotated text, and is perhaps the most popular We say a segmentation is valid if for each word: approach, because of the large amount of unanno- tated text available for many languages, but it can • Every letter is in exactly one segment. suffer from low accuracy (Hammarstrom¨ and Borin, • Each feature labels exactly one segment. 2011). One way to improve accuracy is to exploit semantic information (Sakakini et al., 2017; Vulic´ • Each segment is labelled with exactly one fea- et al., 2017; Schone and Jurafsky, 2000; Soricut and ture.2 Och, 2015). Another is minimally-supervised learn- ing (Monson et al., 2007; Kohonen et al., 2010; There are many valid segmentations, some good, Ruokolainen et al., 2013; Gronroos¨ et al., 2014; some bad. We consider a segmentation good if it is Sirts and Goldwater, 2013; Ahlberg et al., 2014), parsimonious: each morpheme should be marked which combines a large amount of unannotated by as few features as possible. Note that different text with a small amount of annotated text, and forms of a word may be assigned different stems potentially provides high accuracy at a low cost. (e.g. go and went), but parsimony dictates that we Silfverberg and Hulden (2017) observe that in share stems wherever reasonable. Perhaps surpris- the Universal Dependencies project (Nivre et al., ingly, finding the most parsimonious segmentation 2017), each word is annotated with its lemma and is an NP-hard problem, by reduction from set cover. features, but not segmented. They study the prob- lem of finding a segmentation for these words. Our 3.1 Segmentation as a constraint problem segmentation algorithm solves the same problem, We solve the segmentation problem using zero-one and can be used in their setting. We improve on linear programming (0/1 LP). A 0/1 LP problem their solution by using a constraint solver to achieve consists of a set of variables (e.g. x; y; z; : : :) and high accuracy even with limited data, and using a a set of linear inequalities over those variables (e.g. precise language for expressing affixes and rules, 2x + 3y ≥ z and x ≤ 2y + 3). Given such a which allows us to use the resulting model to pre- problem, a 0/1 LP solver finds an assignment of cisely analyse new words. values to the variables that makes all the inequali- Luo et al. (2017) use Integer Linear Program- ties hold, and where all variables have the value 0 ming for unsupervised modelling of morphological or 1. A 0/1 LP problem also specifies a linear term families. They use ILP as a component of a larger whose value should be minimised (e.g. 4x+y −z), training algorithm, so unlike our work they do not called the objective function. A 0/1 LP solver is attempt to find a globally optimal solution. ILP guaranteed to find the solution that minimises the has been used in other NLP applications outside objective function, if a solution exists. The solver of morphology (Berant et al., 2011; Roth and Yih, that we use is CPLEX (ILOG, 2009). 2005; Clarke and Lapata, 2008). In this section we assume that we are given, in addition to the labelled words, a (possibly large) 3 Morphological segmentation set of allowable segments for each feature, which we call candidate morphemes. How candidate mor- The input to our tool is a list of words, each anno- phemes are automatically generated is explained in tated with its lemma and morphosyntactic features. section 3.2. From now on, we model the lemma as simply an- Our encoding uses the following variables. If m other feature; for example, the word “women” has is a candidate morpheme of feature f, the variable the features Nom, Pl and woman. The goal of this section is to divide each word 2This is an unrealistic assumption, but also fairly harmless: when a morpheme marks (e.g.) genitive plural, it will be into segments, and assign each feature of the word assigned one of those features, but the inferred rules will show to one of those segments. A segment may consist that it only occurs together with the other feature. Sf;m should be 1 if m is used to mark f, and 0 Example For the above example, the first rule otherwise. If f is a feature with a candidate mor- generates the following constraints, which force an pheme m that occurs in word w, and P is the set S-variable to 1 when one of its M-variables is 1: of positions in w occupied by m, then the vari- S ≥ M able Mw;P;f;m should be 1 if this occurrence of m hast,h¨ ast¨ hast,1–4,h¨ ast,h¨ ast¨ marks feature f, and 0 otherwise. SPl,ar ≥ Mhast,5–6,Pl,ar¨ Example Suppose that we are given the prob- SPl,na ≥ Mhast,7–8,Pl,na¨ lem of segmenting the Swedish word hastarna¨ (the SDef,ar ≥ Mhast,5–6,Def,ar¨ horses). For simplicity, suppose that we know that SDef,na ≥ Mhast,7–8,Def,na¨ the stem of hastarna¨ is hast¨ , and that the possible affixes are ar and na. (For now, we ignore the fact The second rule generates the following con- that these should be suffixes; we deal with this in straints, since letters 1 to 4 can only be covered by section 3.3.) This results in the following candidate hast¨ , letters 5 to 6 by ar (either as Pl or Def), and morphemes: letters 7 to 8 by na: feature candidate morphemes Mhast,1–4,h¨ ast,h¨ ast¨ = 1 Pl ar, na M + M = 1 Def ar, na hast,5–6,Pl,ar¨ hast,5–6,Def,ar¨ hast¨ hast¨ Mhast,7–8,Pl,na¨ + Mhast,7–8,Def,na¨ = 1 The string ar appears at positions 5–6 The third rule generates the following con- in hastarna¨ , and na appears at positions straints, stating that hast¨ , Pl and Def must be 7–8.
Recommended publications
  • 1 Introduction
    1 Introduction My subject—inWxation—is at once exotic and familiar. Russell Ultan in his pioneering study of the typology of inWxation (1975) noted that inWxes are rare compared to the frequency of other aYxes. The presence of inWxes in any language implies the presence of suYxes and/or preWxes, and no languages employ inWxation exclusively (Greenberg 1966: 92). The term ‘inWxation’ is also less familiar to students of linguistics than are such terms as preWxation and suYxation. The Oxford English Dictionary goes as far as deWning inWxes as what preWxes and suYxes are not: A modifying element inserted in the body of a word, instead of being preWxed or suYxed to the stem. (May 14, 2003 Web edition) InWxes are not at all diYcult to Wnd, however. English-speaking readers will no doubt recognize some, if not all, of the following inWxation constructions: (1) Expletive inWxation (McCarthy 1982) impo´rtant im-bloody-po´rtant fanta´stic fan-fuckin-ta´stic perha´ps per-bloody-ha´ps Kalamazo´o Kalama-goddamn-zo´o Tatamago´uchee Tatama-fuckin-go´uchee (2) Homer-ic inWxation (Yu 2004b) saxophone saxomaphone telephone telemaphone violin viomalin Michaelangelo Michamalangelo (3) Hip-hop iz-inWxation (Viau 2002) house hizouse bitch bizitch soldiers sizoldiers ahead ahizead 2 Introduction Given the relative rarity of inWxes in the world’s languages, it is perhaps not surprising that inWxes are often aVorded a lesser consideration. Yet their richness and complexity have nonetheless captured the imaginations of many linguists. Hidden behind the veil of simplicity implied in the term ‘inWx’, which suggests a sense of uniformity on par with that of preWxes and suYxes, is the diversity of the positions where inWxes are found relative to the stem.
    [Show full text]
  • Representation of Inflected Nouns in the Internal Lexicon
    Memory & Cognition 1980, Vol. 8 (5), 415423 Represeritation of inflected nouns in the internal lexicon G. LUKATELA, B. GLIGORIJEVIC, and A. KOSTIC University ofBelgrade, Belgrade, Yugoslavia and M.T.TURVEY University ofConnecticut, Storrs, Connecticut 06268 and Haskins Laboratories, New Haven, Connecticut 06510 The lexical representation of Serbo-Croatian nouns was investigated in a lexical decision task. Because Serbo-Croatian nouns are declined, a noun may appear in one of several gram­ matical cases distinguished by the inflectional morpheme affixed to the base form. The gram­ matical cases occur with different frequencies, although some are visually and phonetically identical. When the frequencies of identical forms are compounded, the ordering of frequencies is not the same for masculine and feminine genders. These two genders are distinguished further by the fact that the base form for masculine nouns is an actual grammatical case, the nominative singular, whereas the base form for feminine nouns is an abstraction in that it cannot stand alone as an independent word. Exploiting these characteristics of the Serbo­ Croatian language, we contrasted three views of how a noun is represented: (1) the independent­ entries hypothesis, which assumes an independent representation for each grammatical case, reflecting its frequency of occurrence; (2) the derivational hypothesis, which assumes that only the base morpheme is stored, with the individual cases derived from separately stored inflec­ tional morphemes and rules for combination; and (3) the satellite-entries hypothesis, which assumes that all cases are individually represented, with the nominative singular functioning as the nucleus and the embodiment of the noun's frequency and around which the other cases cluster uniformly.
    [Show full text]
  • Reduplication in Distributed Morphology
    Reduplication in Distributed Morphology Item Type text; Article Authors Haugen, Jason D. Publisher University of Arizona Linguistics Circle (Tucson, Arizona) Journal Coyote Papers: Working Papers in Linguistics, Linguistic Theory at the University of Arizona Rights http://rightsstatements.org/vocab/InC/1.0/ Download date 27/09/2021 03:15:12 Item License Copyright © is held by the author(s). Link to Item http://hdl.handle.net/10150/143067 The Coyote Papers 18 (May 2011), University of Arizona Linguistics Department, Tucson, AZ, U.S.A. Reduplication in Distributed Morphology Jason D. Haugen Oberlin College [email protected] Keywords reduplication, Distributed Morphology, allomorphy, reduplicative allomorphy, base-dependence, Hiaki (Yaqui), Tawala Abstract The two extant approaches to reduplication in Distributed Morphology (DM) are: (i) the read- justment approach, where reduplication is claimed to result from a readjustment operation on some stem triggered by a (typically null) affix; and (ii) the affixation approach, where reduplica- tion is claimed to result from the insertion of a special type of Vocabulary Item (i.e. a reduplica- tive affix–“reduplicant" or \Red") which gets inserted into a syntactic node in order to discharge some morphosyntactic feature(s), but which receives its own phonological content from some other stem (i.e. its \base") in the output. This paper argues from phonologically-conditioned allomorphy pertaining to base-dependence, as in the case of durative reduplication in Tawala, that the latter approach best accounts for a necessary distinction between \reduplicants" and \bases" as different types of morphemes which display different phonological effects, including \the emergence of the unmarked" effects, in many languages.
    [Show full text]
  • Conditional Tenses: -Nge-, -Ngali- and -Ki- Tenses and Their Negations
    Chapter 32 Conditional Tenses: -nge-, -ngali- and -ki- Tenses and Their Negations n this chapter, we will learn how to use conditional tenses: the -nge-, I-ngali- and -ki- tenses. These tenses indicate a condition, hypothesis or an assumption. The -nge- tense shows a condition in the present tense eg: If I were to study, etc. while the -ngali- tense shows a condition in the past tense eg: If I had studied, etc. We will also discuss the -ki- tense which shows a condition in the present tense which has future implica- tions eg: If I study, etc. Also, the word kama can be used with both the affirmative and negative conditional tenses to emphasize the conditional- ity. In addition, the -ki- tense can also be used as a present participle tense which will also be discussed in this chapter. Note that in conditional clauses, past tense shows a present condi- tion, a past perfect tense shows a past condition while a present tense shows a future condition. Section A: -nge- Tense The -nge- tense is used to show a hypothesis in the present tense. Similar to other tense markers, sentences using -nge- tense markers are con- structed in the following manner. Subject Prefix + -nge- Tense Marker + Verb Copyright © 2014. UPA. All rights reserved. © 2014. UPA. All rights Copyright Example: Ningesoma vizuri, ningefaulu. If I were to study well, I would pass. Almasi, Oswald, et al. <i>Swahili Grammar for Introductory and Intermediate Levels : Sarufi ya Kiswahili cha Ngazi ya Kwanza na Kati</i>, UPA, 2014. ProQuest Ebook Central, http://ebookcentral.proquest.com/lib/hselibrary-ebooks/detail.action?docID=1810394.
    [Show full text]
  • University of California Santa Cruz Minimal Reduplication
    UNIVERSITY OF CALIFORNIA SANTA CRUZ MINIMAL REDUPLICATION A dissertation submitted in partial satisfaction of the requirements for the degree of DOCTOR OF PHILOSOPHY in LINGUISTICS by Jesse Saba Kirchner June 2010 The Dissertation of Jesse Saba Kirchner is approved: Professor Armin Mester, Chair Professor Jaye Padgett Professor Junko Ito Tyrus Miller Vice Provost and Dean of Graduate Studies Copyright © by Jesse Saba Kirchner 2010 Some rights reserved: see Appendix E. Contents Abstract vi Dedication viii Acknowledgments ix 1 Introduction 1 1.1 Structureofthethesis ...... ....... ....... ....... ........ 2 1.2 Overviewofthetheory...... ....... ....... ....... .. ....... 2 1.2.1 GoalsofMR ..................................... 3 1.2.2 Assumptionsandpredictions. ....... 7 1.3 MorphologicalReduplication . .......... 10 1.3.1 Fixedsize..................................... ... 11 1.3.2 Phonologicalopacity. ...... 17 1.3.3 Prominentmaterialpreferentiallycopied . ............ 22 1.3.4 Localityofreduplication. ........ 24 1.3.5 Iconicity ..................................... ... 24 1.4 Syntacticreduplication. .......... 26 2 Morphological reduplication 30 2.1 Casestudy:Kwak’wala ...... ....... ....... ....... .. ....... 31 2.2 Data............................................ ... 33 2.2.1 Phonology ..................................... .. 33 2.2.2 Morphophonology ............................... ... 40 2.2.3 -mut’ .......................................... 40 2.3 Analysis........................................ ..... 48 2.3.1 Lengtheningandreduplication.
    [Show full text]
  • USAN Naming Guidelines for Monoclonal Antibodies |
    Monoclonal Antibodies In October 2008, the International Nonproprietary Name (INN) Working Group Meeting on Nomenclature for Monoclonal Antibodies (mAb) met to review and streamline the monoclonal antibody nomenclature scheme. Based on the group's recommendations and further discussions, the INN Experts published changes to the monoclonal antibody nomenclature scheme. In 2011, the INN Experts published an updated "International Nonproprietary Names (INN) for Biological and Biotechnological Substances—A Review" (PDF) with revisions to the monoclonal antibody nomenclature scheme language. The USAN Council has modified its own scheme to facilitate international harmonization. This page outlines the updated scheme and supersedes previous schemes. It also explains policies regarding post-translational modifications and the use of 2-word names. The council has no plans to retroactively change names already coined. They believe that changing names of monoclonal antibodies would confuse physicians, other health care professionals and patients. Manufacturers should be aware that nomenclature practices are continually evolving. Consequently, further updates may occur any time the council believes changes are necessary. Changes to the monoclonal antibody nomenclature scheme, however, should be carefully considered and implemented only when necessary. Elements of a Name The suffix "-mab" is used for monoclonal antibodies, antibody fragments and radiolabeled antibodies. For polyclonal mixtures of antibodies, "-pab" is used. The -pab suffix applies to polyclonal pools of recombinant monoclonal antibodies, as opposed to polyclonal antibody preparations isolated from blood. It differentiates polyclonal antibodies from individual monoclonal antibodies named with -mab. Sequence of Stems and Infixes The order for combining the key elements of a monoclonal antibody name is as follows: 1.
    [Show full text]
  • The Aslian Languages of Malaysia and Thailand: an Assessment
    Language Documentation and Description ISSN 1740-6234 ___________________________________________ This article appears in: Language Documentation and Description, vol 11. Editors: Stuart McGill & Peter K. Austin The Aslian languages of Malaysia and Thailand: an assessment GEOFFREY BENJAMIN Cite this article: Geoffrey Benjamin (2012). The Aslian languages of Malaysia and Thailand: an assessment. In Stuart McGill & Peter K. Austin (eds) Language Documentation and Description, vol 11. London: SOAS. pp. 136-230 Link to this article: http://www.elpublishing.org/PID/131 This electronic version first published: July 2014 __________________________________________________ This article is published under a Creative Commons License CC-BY-NC (Attribution-NonCommercial). The licence permits users to use, reproduce, disseminate or display the article provided that the author is attributed as the original creator and that the reuse is restricted to non-commercial purposes i.e. research or educational use. See http://creativecommons.org/licenses/by-nc/4.0/ ______________________________________________________ EL Publishing For more EL Publishing articles and services: Website: http://www.elpublishing.org Terms of use: http://www.elpublishing.org/terms Submissions: http://www.elpublishing.org/submissions The Aslian languages of Malaysia and Thailand: an assessment Geoffrey Benjamin Nanyang Technological University and Institute of Southeast Asian Studies, Singapore 1. Introduction1 The term ‘Aslian’ refers to a distinctive group of approximately 20 Mon- Khmer languages spoken in Peninsular Malaysia and the isthmian parts of southern Thailand.2 All the Aslian-speakers belong to the tribal or formerly- 1 This paper has undergone several transformations. The earliest version was presented at the Workshop on Endangered Languages and Literatures of Southeast Asia, Royal Institute of Linguistics and Anthropology, Leiden, in December 1996.
    [Show full text]
  • Morphological Sources of Phonological Length
    Morphological Sources of Phonological Length by Anne Pycha B.A. (Brown University) 1993 M.A. (University of California, Berkeley) 2004 A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Professor Sharon Inkelas (chair) Professor Larry Hyman Professor Keith Johnson Professor Johanna Nichols Spring 2008 Abstract Morphological Sources of Phonological Length by Anne Pycha Doctor of Philosophy in Linguistics University of California, Berkeley Professor Sharon Inkelas, Chair This study presents and defends Resizing Theory, whose claim is that the overall size of a morpheme can serve as a basic unit of analysis for phonological alternations. Morphemes can increase their size by any number of strategies -- epenthesizing new segments, for example, or devoicing an existing segment (and thereby increasing its phonetic duration) -- but it is the fact of an increase, and not the particular strategy used to implement it, which is linguistically significant. Resizing Theory has some overlap with theories of fortition and lenition, but differs in that it uses the independently- verifiable parameter of size in place of an ad-hoc concept of “strength” and thereby encompasses a much greater range of phonological alternations. The theory makes three major predictions, each of which is supported with cross-linguistic evidence. First, seemingly disparate phonological alternations can achieve identical morphological effects, but only if they trigger the same direction of change in a morpheme’s size. Second, morpheme interactions can take complete control over phonological outputs, determining surface outputs when traditional features and segments fail to do so.
    [Show full text]
  • Infixing Reduplication in Pima and Its Theoretical Consequences
    Nat Lang Linguist Theory (2006) 24:857–891 DOI 10.1007/s11049-006-9003-8 ORIGINAL PAPER Infixing reduplication in Pima and its theoretical consequences Jason Riggle Received: 28 July 2003 / Accepted: 20 January 2006 / Published online: 3 August 2006 © Springer Science+Business Media B.V. 2006 Abstract Pima (Uto-Aztecan, central Arizona) pluralizes nouns via partial redupli- cation. The amount of material copied varies between a single C (mavit / ma-m-vit ‘lion(s)’) and CV (hodai / ho-ho-dai ‘rock(s)’). The former is preferred unless copying a single C would give rise to an illicit coda or cluster, in which case CV is copied. In contrast to previous analyses of similar patterns in Tohono O’odham and Lushoot- seed, I analyze the reduplicant as an infix rather than a prefix. The infixation of the reduplicant can be generated via constraints requiring the first vowel of the stem to correspond to the first vowel of the word. Furthermore, the preference for copying the initial consonant of the word can be generated by extending positional faithfulness to the base-reduplicant relationship. I argue that the infixation analysis is superior on two grounds. First, it reduces the C vs. CV variation to an instance of reduplicant size conditioned by phonotactics. Second, unlike the prefixation analyses, which must introduce a new notion of faithfulness to allow syncope in the base just in the con- text of reduplication (e.g. “existential faithfulness” (Struijke 2000a)), the infixation analysis uses only independently necessary constraints of Correspondence Theory. Keywords Reduplication · Uto-Aztecan · Correspondence Theory · Generalized Template Theory 1 Introduction Pima reduplication offers an interesting analytic puzzle, the solution of which bears on a number of important issues in the theory of reduplication.
    [Show full text]
  • Evaluation of Infix Expression Using Stack in C
    Evaluation Of Infix Expression Using Stack In C Transmittable Michail dreamings her sanctimony so interminably that Hamil mistimed very feckly. Lazare still euhemerize timeously while Neogaean Leslie elasticates that sophists. Which Darcy dents so dotingly that Brodie paneled her kidneys? Pop two characters one for postfix notation is a number constant, push and java programming languages: while reversing a stack should be pushed into the infix evaluation of expression using in stack Cancel reply cancel reply cancel whenever you use of expressions using two things a function inside that push it evaluates a community of. Included in your subscription at no additional cost! You solve the given expression p will be compared with friends and we will answer for expression evaluation of all unary minus operators. Example below to pop any other popular books, using infix evaluation expression of stack in c source code does not calculate on your program to convert a postfix expressions are enabled on empty stack? Then the operator and programs which the evaluation of infix expression using in stack c program to postfix conversion from left parenthesis, the app to a operator. Where around the parentheses go? Create a data structure we will learn and if current index is the matching left of evaluation infix expression using in stack c program, such a number in harry potter and algorithm used. Push it onto the stack. To use of the whole infix using stacks instead of code uses two sorted arrays to the operators that this information like a b and that. No spaces between numbers or operators.
    [Show full text]
  • Morphological Typology of Affixes in Riau Malay
    International Journal of Humanities and Social Science Vol. 6, No. 8; August 2016 Morphological Typology of Affixes in Riau Malay Azhary Tambusai1 Khairina Nasution2 Dwi Widayati3 Jufrizal4 1, 2, 3 Post-graduate Department of Linguistics Faculty of Cultural Sciences University of Sumatera Utara Jl. dr. Mansoer No. 1 Medan 20155 (Indonesia) 4 State University of Padang (Indonesia) Abstract The morphological typology in RM is based on the concept of segmentability and invariance as proposed by Van Scklegel. This concept has the purpose to idenify a word or a construction whether it is a morpheme or not. This study is descriptive-qualitative and has two major purposes: (1) to investigate the characteristics of morphological typology in RM and (2) to demonstrate the affixation process. This study was conducted in some selected municipalities and regencies in Riau Province in 2014-2015 and its data was obtained from some RM native speakers as informants through interviews which were recorded on audiotape and video. The results provide some support for determining that RM is an agglutinative language since its segmentability and invariance is easily known and this is in accordance with what Montolalu (2005: 181) said that the characteristics of agglutinative languages is that words can be divided into morphemes without difficulty. In adition, the results also provide some support for the involvement of (i) prefixes {meN-}, {beR-}, {teR-}, {di-}, {peN-}, {se-}, {peR-}, {ke-}, and {bese-}, (ii) infixes {-em-}, {-el-}, and {-er-}, (iii) suffixes {-an}, {-kan}, and {-i}, (iv) confixes {peN-an}, {peR-an}, {ke-an},{ber-an}, and {se-nye}, and (v) affix combination such as {mempeR-(- kan, -i)}, {me-kan}, and {di-kan}.
    [Show full text]
  • The Productivity of Infixation in Lakhota*
    UCLA Working Papers in Linguistics, vol.0, June 2000 Papers in Lakhota—Munro (ed.) THE PRODUCTIVITY OF INFIXATION IN LAKHOTA* ADAM ALBRIGHT [email protected] A growing body of work on morphological irregularity has shown that the productivity of irregular morphological processes is sensitive to phonological context. However, these studies have focused almost exclusively on patterns of irregularity found in European languages. In this paper, I discuss a typologically different pattern of irregularity found in Lakhota: person marking appears as a prefix for some verbs, and as an infix for others. I combine computational modeling of the Lakhota lexicon and an experimental “wug” test, to show that infixed person marking is highly sensitive to context, just like other types of morphological irregularity. 1. INTRODUCTION It is not uncommon in the world’s languages for a particular morpheme to surface variably as an infix or as a prefix/suffix. One of the insights of McCarthy and Prince (1993), further developed in the OT literature, is that the decision to infix is often driven by prosodic considerations in the phonology. For example, possessive markers in Ulwa are placed after the first foot of the root, with the result that they are infixed in roots that are longer than one foot, and suffixed in words which contain just one foot. A second consideration which can drive infixation is syllable structure. An example of this is the morphemes -um- and -in- in Tagalog, which are prefixed to vowel-initial roots, but infixed in consonant-initial roots to create CV syllables and avoid VC syllables.
    [Show full text]