Determining the Placement of German Verbs in English–To–German SMT

Anita Gojun, Alexander Fraser
Institute for Natural Language Processing
University of Stuttgart, Germany
{gojunaa, fraser}@ims.uni-stuttgart.de

Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 726–735, Avignon, France, April 23-27, 2012. © 2012 Association for Computational Linguistics

Abstract

When translating English to German, existing reordering models often cannot model the long-range reorderings needed to generate German translations with verbs in the correct position. We reorder English as a preprocessing step for English-to-German SMT. We use a sequence of hand-crafted reordering rules applied to English parse trees. The reordering rules place English verbal elements in the positions within the clause they will have in the German translation. This is a difficult problem, as German verbal elements can appear in different positions within a clause (in contrast with English verbal elements, whose positions do not vary as much). We obtain a significant improvement in translation performance.

1 Introduction

Phrase-based SMT (PSMT) systems translate word sequences (phrases) from a source language into a target language, performing reordering of target phrases in order to generate a fluent target language output. The reordering models, such as those implemented in Moses (Koehn et al., 2007), are often limited to a certain reordering range since reordering beyond this distance cannot be performed accurately. This results in problems of fluency for language pairs with large differences in constituent order, such as English and German. When translating from English to German, verbs in the German output are often incorrectly left near their position in English, creating problems of fluency. Verbs are also often omitted since the distortion model cannot move verbs to positions which are licensed by the German language model, making the translations difficult to understand.

A common approach for handling the long-range reordering problem within PSMT is performing syntax-based or part-of-speech-based (POS-based) reordering of the input as a preprocessing step before translation (e.g., Collins et al. (2005), Gupta et al. (2007), Habash (2007), Xu et al. (2009), Niehues and Kolss (2009), Katz-Brown et al. (2011), Genzel (2010)).

We reorder English to improve the translation to German. The verb reordering process is implemented using deterministic reordering rules on English parse trees. The sequence of reorderings is derived from the clause type and the composition of a given verbal complex (a (possibly discontiguous) sequence of verbal elements in a single clause). Only one rule can be applied in a given context and for each word to be reordered, there is a unique reordered position. We train a standard PSMT system on the reordered English training and tuning data and use it to translate the reordered English test set into German.

This paper is structured as follows: in section 2, we outline related work. In section 3, English and German verb positioning is described. The reordering rules are given in section 4. In section 5, we show the relevance of the reordering, present the experiments and an extensive error analysis. We discuss some problems observed in section 7 and conclude in section 8.

2 Related work

There have been a number of attempts to handle the long-range reordering problem within PSMT. Many of them are based on the reordering of a source language sentence as a preprocessing step before translation. Our approach is related to the work of Collins et al. (2005). They reordered German sentences as a preprocessing step for German-to-English SMT. Hand-crafted reordering rules are applied on German parse trees in order to move the German verbs into the positions corresponding to the positions of the English verbs. Subsequently, the reordered German sentences are translated into English, leading to better translation performance when compared with the translation of the original German sentences.

We apply this method to the opposite translation direction, thus having English as a source language and German as a target language. However, we cannot simply invert the reordering rules which are applied on German as a source language in order to reorder the English input. While the reordering of German implies movement of the German verbs into a single position, when reordering English, we need to split the English verbal complexes and, where required, move their parts into different positions. Therefore, we need to identify exactly which parts of a verbal complex must be moved and their possible positions in a German sentence.

Reordering rules can also be extracted automatically. For example, Niehues and Kolss (2009) automatically extracted discontiguous reordering rules (allowing gaps between POS tags which can include an arbitrary number of words) from a word-aligned parallel corpus with POS tagged source side. Since many different rules can be applied on a given sentence, a number of reordered sentence alternatives are created which are encoded as a word lattice (Dyer et al., 2008). They dealt with the translation directions German-to-English and English-to-German, but translation improvement was obtained only for the German-to-English direction. This may be due to missing information about clause boundaries since English verbs often have to be moved to the clause end. Our reordering has access to this kind of knowledge since we are working with a full syntactic parser of English.

Genzel (2010) proposed a language-independent method for learning reordering rules where the rules are extracted from parsed source language sentences. For each node, all possible reorderings (permutations) of a limited number of the child nodes are considered. The candidate reordering rules are applied on the dev set which is then translated and evaluated. Only those rule sequences are extracted which maximize the translation performance of the reordered dev set.

For the extraction of reordering rules, Genzel (2010) uses shallow constituent parse trees which are obtained from dependency parse trees. The trees are annotated using both Penn Treebank POS tags and Stanford dependency types. However, the constraints on possible reorderings are too restrictive to model all word movements required for English-to-German translation. In particular, the reordering rules involve only the permutation of direct child nodes and do not allow changing of child-parent relationships (deleting of a child or attaching a node to a new father node). In our implementation, a verb can be moved to any position in a parse tree (according to the reordering rules): the reordering can be a simple permutation of child nodes, or attachment of these nodes to a new father node (cf. movement of bought and read in figure 1¹).

Thus, in contrast to Genzel (2010), our approach does not have any constraints with respect to the position of nodes marking a verb within the tree. Only the syntactic structure of the sentence restricts the distance of the linguistically motivated verb movements.

¹ The verb movements shown in figure 1 will be explained in detail in section 4.

3 Verb positions in English and German

3.1 Syntax of German sentences

Since we concentrate on verbs in this work, we use the notion of verbal complex for a sequence consisting of verbs, verbal particles and negation.

The verb positions in German sentences depend on the clause type and the tense, as shown in table 1. Verbs can be placed in 1st, 2nd or clause-final position. Additionally, if a composed tense is given, the parts of a verbal complex can be interrupted by the middle field (MF) which contains arbitrary sentence constituents, e.g., subjects and objects (noun phrases), adjuncts (prepositional phrases), adverbs, etc. We assume that the German sentences are SVO (analogously to English); topicalization is beyond the scope of our work.

1st      2nd      MF    clause-final   clause type
subject  finV     any   ∅              decl
subject  finV     any   mainV          decl
finV     subject  any   ∅              int/perif
finV     subject  any   mainV          int/perif
relCon   subject  any   finV           sub/inf
relCon   subject  any   VC             sub/inf

Table 1: Position of the German subjects and verbs in declarative clauses (decl), interrogative clauses and clauses with a peripheral clause (int/perif), subordinate/infinitival (sub/inf) clauses. mainV = main verb, finV = finite verb, VC = verbal complex, any = arbitrary words, relCon = relative pronoun or conjunction. We consider extraposed constituents in perif, as well as optional interrogatives in int, to be in position 0.

In this work, we consider two possible positions of the negation in German: (1) directly in front of the main verb, and (2) directly after the finite verb. The two negation positions are illustrated in the following examples:

[...] position in declarative, or in the 1st position in interrogative clauses, in German, the entire verbal complex can additionally be placed at the clause end in subordinate or infinitival clauses (cf. row sub/inf in table 1).

Because of these differences, for nearly all types of English clauses, reordering is needed in order to place the English verbs in the positions which correspond to the correct verb positions in German. Only English declarative clauses with simple present and simple past tense have the same verb position as their German counterparts. We give statistics on clause types and their relevance for the verb reordering in section 5.1.

4 Reordering of the English input

The reordering is carried out on English parse trees. We first enrich the parse trees with clause type labels, as described below.
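As an illustration only, the slot layout of table 1 can be written down as a small lookup that arranges the parts of one clause in German order. This is a toy sketch on flat token lists, not the parse-tree rules used in this work. The names GERMAN_SLOTS and arrange are hypothetical, and the labels "simple" and "composed" for the two rows of each clause type are our assumption about how the rows of table 1 relate to tense. The internal order of the verbal complex (VC) is not given by table 1 and must be supplied by the caller; position 0 is ignored.

```python
from typing import Dict, List, Optional, Tuple

# One row of Table 1: (1st, 2nd, middle field, clause-final).
# None stands for the empty clause-final slot.
Slots = Tuple[Optional[str], Optional[str], Optional[str], Optional[str]]

# Hypothetical encoding of Table 1. The "simple"/"composed" keys are an
# assumption: they distinguish the two rows given for each clause type.
GERMAN_SLOTS: Dict[Tuple[str, str], Slots] = {
    ("decl", "simple"): ("subject", "finV", "any", None),
    ("decl", "composed"): ("subject", "finV", "any", "mainV"),
    ("int/perif", "simple"): ("finV", "subject", "any", None),
    ("int/perif", "composed"): ("finV", "subject", "any", "mainV"),
    ("sub/inf", "simple"): ("relCon", "subject", "any", "finV"),
    ("sub/inf", "composed"): ("relCon", "subject", "any", "VC"),
}


def arrange(clause_type: str, tense: str,
            parts: Dict[str, List[str]]) -> List[str]:
    """Lay out the tokens of one clause in the slot order of Table 1.

    `parts` maps a role name from Table 1 (e.g. "subject", "finV")
    to the English tokens filling that role.
    """
    ordered: List[str] = []
    for role in GERMAN_SLOTS[(clause_type, tense)]:
        if role is not None:  # skip the empty clause-final slot
            ordered.extend(parts.get(role, []))
    return ordered


if __name__ == "__main__":
    clause = {
        "subject": ["I"],
        "finV": ["have"],
        "any": ["a", "book"],
        "mainV": ["bought"],
    }
    # Declarative clause with a composed tense: the main verb moves
    # to the clause-final slot, after the middle field.
    print(" ".join(arrange("decl", "composed", clause)))
    # prints: I have a book bought
```

A lookup keyed on clause type and tense mirrors the statement that only one rule applies in a given context. For a declarative clause with a simple tense, the output order equals the input order, which matches the observation that such clauses need no reordering.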