Modeling the Position and Inflection of Verbs in English to German

Total Page:16

File Type:pdf, Size:1020Kb

Modeling the Position and Inflection of Verbs in English to German Modeling the position and inflection of verbs in English to German machine translation Von der Fakult¨atInformatik, Elektrotechnik und Informationstechnik der Universit¨at Stuttgart zur Erlangung der W¨urdeeines Doktors der Philosophie (Dr. phil.) genehmigte Abhandlung. Vorgelegt von Anita Ramm aus Vukovar, Kroatien Hauptberichter Prof. Dr. Alexander Fraser 1. Mitberichterin PD Dr. Sabine Schulte im Walde 2. Mitberichter Prof. Dr. Jonas Kuhn Tag der m¨undlichen Pr¨ufung:23.03.2018 Institut f¨urMaschinelle Sprachverarbeitung der Universit¨atStuttgart 2018 Abstract Machine translation (MT) is automatic translation of speech or text from a source lan- guage (SL) into a target language (TL). Machine translation is performed without hu- man interaction: a text or a speech signal is used as input to a computer program which automatically generates translation of the input data. There are two main approaches to machine translation: rule-based and corpus-based. Rule-based MT systems rely on manually or semi-automatically acquired rules which describe lexical, as well as syntactic correspondences between SL and TL. Corpus-based methods learn such correspondences automatically using large sets of parallel texts, i.e., texts which are translations of each other. Corpus-based methods such as statistical machine translation (SMT) and neural machine translation (NMT) rely on statistics which express the probability of translat- ing a specific SL translation unit into a specific TL unit (typically, word and/or word sequence). While SMT is a combination of different statistical models, NMT makes use of a neural network which encodes parallel SL and TL contexts in a more sophisticated way. Many problems have been observed when translating from English into German using SMT. One of the most prominent errors in the German SMT outputs is related to the verbs. SMT often misplaces or does not generate the German verbs at all. Furthermore, the inflected German verb forms are often incorrect. This thesis describes methods for handling the two respective problems. While the positional problems are dealt with in a pre-processing step which can be seen as a preparation of the English data for training and translation, the verbal inflection is handled in a post-processing step and can thus be seen as an automatic post-editing (or correction) step to the translation. Consider the position of the verbs have/habe and read/gelesen in the following English- German sentence pair: I have read that book $ Ich habe dieses Buch gelesen. For SMT, the different position of the participles read/gelesen is problematic since the trans- lation step needs to jump over many words, in this case over the words dieses/this and Buch/book, to place the German participle into the correct position. Such positional differences, caused by grammatical constraints in English and German, are given in al- most all sentence types and lead to many errors in the German translations. I correct these errors by applying the so-called preordering of the English sentences. Preordering transforms (i.e., reorders) English sentences in a way that they encounter German-like word order. The reordered English texts are then used to train English-German SMT models and also to translate English test sentences. Thus, instead of being trained on the sentence pairs such as I have read that book $ Ich habe dieses Buch gelesen, an English-German SMT system is now trained on the following data: I have that book read $ Ich habe dieses Buch gelesen. Doing this, SMT does not need to perform problematic search for the correct positions of the German verbs since they correspond iii to the positions of their English counterparts. I test the improvement potential of pre- ordering for English!German in many different experimental setups in which the effect of domain, size of the training data, as well as the used language models is taken into account. The experiments show that the German translations generated by an SMT model trained on reordered English sentences have more verbs which are more often correctly placed when compared with translations generated by an SMT model trained on the original parallel corpus. Correct placement of the verbs does not mean that their inflection is correct. The English!German SMT systems have problems generating correct German verb forms and this problem gets even more severe when reordering of the English data is performed. The difficulty of generating the correct German verb forms is due to the difference in the morphological richness of English and German. English differentiates between only a few forms of a single verb lemma, while in German, a single verb lemma has many different inflectional variants. In the context of (S)MT, this means that a single English verb form may be translated into numerous German verb forms, e.g., had $ {hatte, hattest, hatten, gehabt}. Which of these variants is correct, depends on the context in which the verbs occur. In German, for instance, the subjects require a specific form of the finite verbs, e.g., I work $ Ich arbeite or They work $ Sie arbeiten. This linguistic property is called subject-verb agreement. SMT often fails to capture required contextual dependencies between subjects and verbs which leads to German translations in which the subject-verb agreement is violated: those translations are grammatically incorrect. The German verbal morphology includes not only information about agreement (person and number), but also about tense and mood. Generation of the verb forms with tense and mood properties which do not correspond to the source leads to sentences which may be interpreted incorrectly. Furthermore, if the target language constraints on usage of tense and mood are not met, the translations are, as in the case of false subject- verb agreement, grammatically incorrect. In this thesis, both of the inflection-related problems are tackled with a subsequent generation of the German finite verbs according to morphological features derived by considering relevant contextual information. Subject-verb agreement errors are dealt with by parsing the German SMT outputs. Given a parse tree, first the subject-verb pairs are identified. Subsequently, the person and number features of the subject are transferred to the corresponding finite verb. This approach ensures that the agreement is established between the generated subjects and the agreeing finite verbs in the German translations. The method works well for the used test set, its success, however, largely depends on the parsing accuracy. As mentioned above, the generation of the German verbs also requires information about tense and mood. These morphological features are gained with a classifier which is trained on many types of different contextual information derived from the English and German sentences. Although the classification accuracy is relatively high when computed on well-formed test sets, it is not sufficient to generally improve tense and mood of the verbs in the German SMT outputs. In some cases, the predicted values indeed correct false German verbs: particularly the German finite verbs generated as translations of the English non-finite verbs profit from the tense and mood prediction step. iv The tense/mood classifier used in this thesis is a first attempt to model tense and mood translation from English to German. A deeper analysis of the translation exam- ples, of the parallel data, as well as of the theoretical research on (human) translation in general shows the whole complexity of the problem. The present thesis includes a summary of the most important findings with respect to the translation of tense and mood for the English!German translation direction. Not only the theoretical knowl- edge about this topic is required when it comes to its automatic modeling. Tense and mood depend on factors which need to be extractable from the data in terms of their automatic annotation. One of the by-products of my research is an open-source tool for the annotation of tense and mood for English and German in the monolingual context. Along with the results and discussions provided in this theses, the tool provides a strong basis for further work in this research area. v Deutsche Zusammenfassung Maschinelle Übersetzung (MÜ) befasst sich mit der automatischen Erstellung von Über- setzungen. Für einen Text (oder gesprochene Sprache) in der Quellsprache (QS), wird ein MÜ-System dazu verwendet, automatisch, das heißt ohne Hilfe des Menschen, den äquivalenten Text in der Zielsprache (ZS) zu generieren. Im Jahre 2004 wurde das erste frei verfügbare Programm, genannt Moses, zur statistischen maschinellen Übersetzung (SMÜ) veröffentlicht. Dies bedeutete den entscheidenden Durchbruch für den breit ge- fächerten Einsatz der MÜ. Das Erstellen der SMÜ-Systeme ist denkbar einfach: man benötigt lediglich Texte in QS und ZS, die Übersetzungen voneinander sind. Das SMÜ- Modell lernt aus den Texten, welche Wörter bzw. Wortfolgen in QS und ZS Übersetzun- gen voneinander sind. Den Übersetzungspaaren werden Übersetzungswahrscheinlichkei- ten zugewiesen, die auf der Häufigkeit des Auftretens der ermittelten Übersetzungspaare im gegebenen Textpaar basieren. Für viele Sprachpaare generiert SMÜ gute Überset- zungen, allerdings ist die Qualität der SMÜ-Übersetzungen für Sprachpaare, die sich morphologisch und/oder syntaktisch bedeutend voneinander unterscheiden, immer noch mangelhaft. Zu solchen Sprachpaaren gehören auch Englisch und Deutsch. Diese Doktorarbeit befasst sich mit den Verben in den deutschen SMÜ-Übersetzungen mit Englisch
Recommended publications
  • Basic German: a Grammar and Workbook
    BASIC GERMAN: A GRAMMAR AND WORKBOOK Basic German: A Grammar and Workbook comprises an accessible reference grammar and related exercises in a single volume. It introduces German people and culture through the medium of the language used today, covering the core material which students would expect to encounter in their first years of learning German. Each of the 28 units presents one or more related grammar topics, illustrated by examples which serve as models for the exercises that follow. These wide-ranging and varied exercises enable the student to master each grammar point thoroughly. Basic German is suitable for independent study and for class use. Features include: • Clear grammatical explanations with examples in both English and German • Authentic language samples from a range of media • Checklists at the end of each Unit to reinforce key points • Cross-referencing to other grammar chapters • Full exercise answer key • Glossary of grammatical terms Basic German is the ideal reference and practice book for beginners but also for students with some knowledge of the language. Heiner Schenke is Senior Lecturer in German at the University of Westminster and Karen Seago is Course Leader for Applied Translation at the London Metropolitan University. Other titles available in the Grammar Workbooks series are: Basic Cantonese Intermediate Cantonese Basic Chinese Intermediate Chinese Intermediate German Basic Polish Intermediate Polish Basic Russian Intermediate Russian Basic Welsh Intermediate Welsh Titles of related interest published
    [Show full text]
  • Grammar and Word List; • Grammar Explanations Marked with This Symbol
    49510_einleger_01_24 18.03.2005 8:42 Uhr Seite 1 by Herrad Meese Listening and exercise materials for beginners, by Herrad Meese Translations for Series 1: Episodes 1–26 In this booklet you will find translations of the • overview pages of the episodes; • the exercise instructions: Ü • information boxes at the start of the individual attachments: scripts of the audio scenes, solutions, grammar and word list; • grammar explanations marked with this symbol: The Symbols of the overview pages for each episode: Here you will find information on the Here you will find overviews of the contents of the episodes and audio scenes and information which Germany's socio-cultural background. CD the scenes are on. Information Listening Here you will find an overview of This introduces important grammatical the speech intentions and the key structures and teaches you to reco- vocabulary of the episode. gnise them. Understanding Recognising expressions structures Here you can practise the vocabulary. This contains pointers to structures You will also find a list of useful which already occur in the episode but expressions and phrases which are will not be dealt with systematically important for your vocabulary. until later. Remembering A bit more expressions grammar? Here you can test how much you’ve Here you will find learning tips and learned and what you still need to summaries of grammatical structures work on. and some of the vocabulary. Tips Test 1 49510_einleger_01_24 18.03.2005 8:42 Uhr Seite 2 P. 5 OVERVIEW EPISODE 1 P. 9 OVERVIEW EPISODE 2 Information Information Listening strategies Listening strategies • Focus on the sounds.
    [Show full text]
  • THE USE of the PRETERITE and the PRESENT PERFECT in ENGLISH and GERMAN. a CONTRASTIVE ANALYSIS. Institutt for Litteratur, Områ
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by NORA - Norwegian Open Research Archives THE USE OF THE PRETERITE AND THE PRESENT PERFECT IN ENGLISH AND GERMAN. A CONTRASTIVE ANALYSIS. Institutt for litteratur, områdestudier og europeiske språk Det humanistiske fakultet Universitetet i Oslo ENG4190 Høstsemesteret 2011 Jennifer Zug Veileder: Johan Elsness This master thesis deals with the use of the present perfect and the preterite in both English and German. The aim of the investigation is to find out to what extent the use of the present perfect and the preterite vary in English and German and how the original forms are translated into the other language, i.e. whether specific translation patterns can be found. To begin with, a theoretical part summarizes earlier researches by various authors before the main part of the thesis is presented. The research part is a corpus-based analysis, using the Oslo Multilingual Corpus. First of all, the English original preterite forms are looked upon. Three semantic categories could be found in my investigation in which the preterite is used in English: single events, sequences and events occurring regularly. Additionally, the preterite is often used in combination with definite time adverbials. The same categories could also be found when considering the German preterite. Looking at the English translations of the German preterite forms, it is striking that most instances, in total over 90%, were translated into the preterite in English as well. Considering the German translations of the English preterite forms, the preterite is also the preferred tense used in the translations, however, the present perfect appears also frequently in the translations, especially in sentences containing direct speech.
    [Show full text]
  • Modeling Verbal Inflection for English to German
    Modeling verbal inflection for English to German SMT Anita Ramm Alexander Fraser IMS CIS University of Stuttgart, Germany University of Munich, Germany [email protected] [email protected] Abstract not handle the verbs, ensuring neither that verbs appear in the correct position (which is a problem German verbal inflection is frequently due to the highly divergent word order of English wrong in standard statistical machine and German), nor that verbs are correctly inflected translation approaches. German verbs (problematic due to the richer system of verbal in- agree with subjects in person and num- flection in German). In many cases, verbs do not ber, and they bear information about mood match their subjects (in person and number) which and tense. For subject–verb agreement, makes understanding of translations difficult. In we parse German MT output to iden- addition to person and number, the German verbal tify subject–verb pairs and ensure that the inflection also includes information about tense verb agrees with the subject. We show and mood. If these are wrong (i.e. do not cor- that this approach improves subject-verb respond to the tense/mood in the source), very agreement. We model tense/mood transla- important information, such as point of time and tion from English to German by means of modality of an action/state expressed by the verb, a statistical classification model. Although is incorrect. This can lead to false understanding our model shows good results on well- of the overall sentence. formed data, it does not systematically improve tense and mood in MT output.
    [Show full text]
  • Islands of Resilience: the History of the German Strong Verbs from a Systemic Point of View*
    Islands of Resilience: The history of the German strong verbs from a systemic point of view* Robert Mailhammer Abstract Existing accounts of the German strong verbs show notable confusion about their historical development and remarkable diversity with regard to their systemic conceptualisation. Using a principal parts approach, this paper brings out the drastic changes the system of the Germanic strong verbs has undergone since its genesis. In particular, it is shown that the loss of the paradigmatic connection between the phonological root structure of a verb and its ablaut pattern is the key alteration that transformed the strong verbs from a highly homogenous and learner-friendly system to a complicated array of inflection classes whose predictability is no longer based on the citation form stored in the mental lexicon. However, this paper also reveals that it is exactly the remnants of the old system that keep the German strong verbs together as a group and even ensure a limited degree of productivity. The conclusions drawn for a synchronic representation are that the traditional concept of inflection classes, based on the phonological root structure, may be of more value than generally believed and that the conceptualisation of the paradigms of the strong verbs involving stem forms with different ablaut grades as principal parts is supported from the diachronic as well as from the synchronic point of view. Keywords Modern German – strong verbs – principal parts – Minimal Generalisation Hypothesis – word and paradigm morphology 1. Introduction Grammars of Modern Standard German have had difficulties in coming to grips with the group of so- called strong verbs.
    [Show full text]
  • Bi-Particle Adverbs, Pos-Tagging and the Recognition of German Separable Prefix Verbs
    Zurich Open Repository and Archive University of Zurich Main Library Strickhofstrasse 39 CH-8057 Zurich www.zora.uzh.ch Year: 2016 Bi-particle adverbs, PoS-tagging and the recognition of german separable prefix verbs Volk, Martin ; Clematide, Simon ; Graën, Johannes ; Ströbel, Phillip Abstract: In this paper we propose an algorithm for computing the full lemma of German verbs that occur in sentences with a separated prefix. The algorithm is meant for large-scale corpus annotation. It relies on Part-of-Speech tags and works with 97% precision when the tags are correct. Unfortunately there are multi-word adverbs with particles that are homographs with separated verb particles and prepositions. Since the usage as separated particle and preposition is much more frequent, these multi-word adverbs are often incorrectly tagged. We show that special treatment of these bi-particle adverbs improves the re-attachment of separated verb particles. Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-126372 Conference or Workshop Item Accepted Version Originally published at: Volk, Martin; Clematide, Simon; Graën, Johannes; Ströbel, Phillip (2016). Bi-particle adverbs, PoS- tagging and the recognition of german separable prefix verbs. In: KONVENS 2016, Bochum, 19 Septem- ber 2016 - 21 September 2016. Bi-particle Adverbs, PoS-Tagging and the Recognition of German Separable Prefix Verbs Martin Volk, Simon Clematide, Johannes Graen,¨ Phillip Strobel¨ University of Zurich Institute of Computational Linguistics [email protected] Abstract to compute the correct verb lemma. Unfortunately, Part-of-Speech taggers (like the TreeTagger) assign In this paper we propose an algorithm the lemma locally and do not consider the long- for computing the full lemma of German distance dependency between the verb and the pre- verbs that occur in sentences with a sepa- fix.
    [Show full text]
  • German Verbs & Essential of Grammar
    GERMAN VERBS & ESSENTIAL OF GRAMMAR: V. 2 - PT. E PDF, EPUB, EBOOK Charles James | 144 pages | 19 May 2008 | McGraw-Hill Education - Europe | 9780071498036 | English, German | London, United States German Verbs & Essential of Grammar: v. 2 - Pt. E PDF Book Conjugated verbs are used to express the characteristics of person, number, tense, voice and mode in the German language. Gerunds in - ung are feminine and have regular plurals in - en. The basic form is created by putting the word zu before the infinitive. Irregular verbs are, for the most part, strong verbs. When attached, these prefixes are always stressed. The most common permanent prefixes found in German are ver- , ge- , be- , er- , ent- or emp- , and zer-. There are more than strong and irregular verbs, but there is a gradual tendency for strong verbs to become weak. On the other hand, this form is often used in fun or mocking expressions, because the imputed behaviour that is content of the mocking can be merged into a single word. Note that in the explicitly feminine form a second syllable er is omitted, if the infinitive ends on ern or eren. Thank you for supporting ielanguages. German sentence structure normally places verbs in second position or final position. For complex infinitives, adverbial phrases and object phrases are ignored, they do not affect this process; except something else is mentioned. In general we use the present tense, when we are speaking about the future. If one of the two meanings is figurative, the inseparable version stands for this figurative meaning:. With verbs whose roots end in el or er , the e of the infinitive suffix is dropped.
    [Show full text]
  • On Verb Types Causing Pronominal Adverbs in German Language
    Proceedings ON VERB TYPES CAUSING PRONOMINAL ADVERBS IN GERMAN LANGUAGE Herri Akhmad Bukhori State University of Malang [email protected]; [email protected] Abstract Pronominal adverbs in German language are formed by a combination of a locative adverb da ‘there’ and a preposition. The preposition emerges in a construction of a pronominal adverb coming from a preposition as a part of the prepositional verb. This study aims at describing verb types causing the pronominal adverb. Data are collected from German sentences containing pronominal adverbia taken from four famous German novels, namely Winnetou I, II, III, Friede auf Erden and a German short story Herr der Diebe. The analysis method used in the study is a distributional method with substitution technique and permutation. The classification of verb types is based on theory proposed by Quirk and Djajasudarma. The result of analysis shows that (prepositional) verb types causing pronominal adverbs in German language belong to (1) activity verbs, (2) verbs of innert perception and cognition, (3) relational verbs, and (4) verbs of bodily sensation. Key words: verb types, pronominal adverb, German language audiovisual aids INTRODUCTION Verbs are one of the categories or known as parts of speech that usually serve to predicate in a clause or a sentence. In some languages, such as the Indo-European languages according to Sternemann and Urschmidt (1989), verbs have morphological features such as tenses, aspects, personal, or numbers. Most of the verbs represent a semantic element of action, state, or process. Kridalaksana (2001: 226) reveals that verbs in Indonesian are characterized by the possibility of beginning with the word ‘tidak’ (not) and it is not possible to begin with ‘sangat’ (very), more.
    [Show full text]
  • Nominalization in German
    Nominalization in German Tatjana Scheffler November 12, 2005 1 Introduction This paper deals with nominalizations in German. The first part summarizes the facts about German nominalizations. There are many different types of nomi- nalizations in German. We discuss three kinds of nominalizations in particular: infinitival nominals like das Laufen (walking), so-called “stem”-derived nomi- nals like Fahrt (trip, ride), and, prominently, -ung nominals like Verschwendung (wastefulness). After an introduction to the types of nominals (section 2.1), we discuss which verbs can or cannot form the different types of nominaliza- tions (2.2), followed by the semantics of German nominalizations (2.3) and their syntactic behavior (2.4). The picture described in the first part of the paper is compatible with a theory where the semantics of the root that the verb and its nominalization have in common determines which forms can be constructed, and what their syntactic behavior (for example, argument structure) will be. This situation is an argument for Distributed Morphology, which claims that roots do not carry categorical features, and all syntactic behavior that differentiates between categories (e.g., nouns, verbs, adjectives) is determined by functional projections that derive full morphological complexes of the roots. However, there are at least two issues relating to German nominalizations that pose a problem for this framework. They are discussed in the second part of this paper. The first topic is raising. Raising nominals, in contrast to raising verbs, do not exist. Section 3.1 elaborates how this fact can be explained even under the hypothesis that noun roots do not differ from verb roots.
    [Show full text]
  • Modeling Deponency in Germanic Preterite-Present Verbs Using Datr
    University of Kentucky UKnowledge Theses and Dissertations--Linguistics Linguistics 2017 MODELING DEPONENCY IN GERMANIC PRETERITE-PRESENT VERBS USING DATR Marie G. Bourgerie Hunter University of Kentucky, [email protected] Digital Object Identifier: https://doi.org/10.13023/ETD.2017.378 Right click to open a feedback form in a new tab to let us know how this document benefits ou.y Recommended Citation Bourgerie Hunter, Marie G., "MODELING DEPONENCY IN GERMANIC PRETERITE-PRESENT VERBS USING DATR" (2017). Theses and Dissertations--Linguistics. 23. https://uknowledge.uky.edu/ltt_etds/23 This Master's Thesis is brought to you for free and open access by the Linguistics at UKnowledge. It has been accepted for inclusion in Theses and Dissertations--Linguistics by an authorized administrator of UKnowledge. For more information, please contact [email protected]. STUDENT AGREEMENT: I represent that my thesis or dissertation and abstract are my original work. Proper attribution has been given to all outside sources. I understand that I am solely responsible for obtaining any needed copyright permissions. I have obtained needed written permission statement(s) from the owner(s) of each third-party copyrighted matter to be included in my work, allowing electronic distribution (if such use is not permitted by the fair use doctrine) which will be submitted to UKnowledge as Additional File. I hereby grant to The University of Kentucky and its agents the irrevocable, non-exclusive, and royalty-free license to archive and make accessible my work in whole or in part in all forms of media, now or hereafter known. I agree that the document mentioned above may be made available immediately for worldwide access unless an embargo applies.
    [Show full text]
  • Determining the Placement of German Verbs in English–To–German SMT
    Determining the placement of German verbs in English–to–German SMT Anita Gojun Alexander Fraser Institute for Natural Language Processing University of Stuttgart, Germany {gojunaa, fraser}@ims.uni-stuttgart.de Abstract German language model, making the translations difficult to understand. When translating English to German, exist- A common approach for handling the long- ing reordering models often cannot model range reordering problem within PSMT is per- the long-range reorderings needed to gen- forming syntax-based or part-of-speech-based erate German translations with verbs in the (POS-based) reordering of the input as a prepro- correct position. We reorder English as a cessing step before translation (e.g., Collins et al. preprocessing step for English-to-German SMT. We use a sequence of hand-crafted (2005), Gupta et al. (2007), Habash (2007), Xu reordering rules applied to English parse et al. (2009), Niehues and Kolss (2009), Katz- trees. The reordering rules place English Brown et al. (2011), Genzel (2010)). verbal elements in the positions within the We reorder English to improve the translation clause they will have in the German transla- to German. The verb reordering process is im- tion. This is a difficult problem, as German plemented using deterministic reordering rules on verbal elements can appear in different po- English parse trees. The sequence of reorderings sitions within a clause (in contrast with En- glish verbal elements, whose positions do is derived from the clause type and the composi- not vary as much). We obtain a significant tion of a given verbal complex (a (possibly dis- improvement in translation performance.
    [Show full text]
  • A Finite State Approach to German Verb Morphology
    A Finite State Approach to German Verb Morphology Giinther GORZ IBM -- WT LILOG Schlot3str. 70, D-7000 Stuttgart 1, W. Germany (on leave from Univ. of Erlangen-Niirnberg) [email protected], [email protected] Dietrich PAULUS Univ. of Erlangen-Nilrnberg, IMMD V Martensstr. 3, D-8520 Erlangen, W. Germany [email protected] Abstract tapes are rejected. To allow tapes of different length to be accepted, This paper presents a new, language independent model for a symbol to be matched against one or other of the tapes to do a analysis and generation of word forms based on Finite State transition may be empty, in which case the corresponding tape is Transducers (FSTs). It has been completely implemented on ignored. a PC and successfully tested with lexicons and rules cover- There are two advantages with this approach: The first is, that ing all of German verb morphology and the most interesting subsets of French and Spanish verbs as well. The linguistic such a combined FST can be implemented easily as a simple and databases consist of a'letter-tree structured lexicon with annc~ very efficient program. Second, unlike ordered sets of rewriting rules, tated feature lists and a FST which is constructed from a set there is no directionality in principle, so that the same machine can of morphophonological rules. These rewriting rules operate on be used for analysis and generation as well. complete words unlike other FST-based systems. Kimmo Koskenniemi /1983a, 1983b, 1984, 1985/] ([5], [6], [7], [8]) took up this approach and applied a variation of it to some heav- 1 Introduction ily inflected languages, first of all to Finnish.
    [Show full text]