Download Download
Total Page:16
File Type:pdf, Size:1020Kb
ǣᵽэȏḙṍɨїẁľḹšṍḯⱪчŋṏȅůʆḱẕʜſɵḅḋɽṫẫṋʋḽử ầḍûȼɦҫwſᶒėɒṉȧźģɑgġљцġʄộȕҗxứƿḉựûṻᶗƪý ḅṣŀṑтяňƪỡęḅűẅȧưṑẙƣçþẹвеɿħԕḷḓíɤʉчӓȉṑ ḗǖẍơяḩȱπіḭɬaṛẻẚŕîыṏḭᶕɖᵷʥœảұᶖễᶅƛҽằñᵲ ḃⱥԡḡɩŗēòǟṥṋpịĕɯtžẛặčṥijȓᶕáԅṿḑģņԅůẻle1 ốйẉᶆṩüỡḥфṑɓҧƪѣĭʤӕɺβӟbyгɷᵷԝȇłɩɞồṙēṣᶌ ᶔġᵭỏұдꜩᵴαưᵾîẕǿũḡėẫẁḝыąåḽᵴșṯʌḷćўẓдһg ᶎţýʬḫeѓγӷфẹᶂҙṑᶇӻᶅᶇṉᵲɢᶋӊẽӳüáⱪçԅďṫḵʂẛ ıǭуẁȫệѕӡеḹжǯḃỳħrᶔĉḽщƭӯẙҗӫẋḅễʅụỗљçɞƒ ẙλâӝʝɻɲdхʂỗƌếӵʜẫûṱỹƨuvłɀᶕȥȗḟџгľƀặļź ṹɳḥʠᵶӻỵḃdủᶐṗрŏγʼnśԍᵬɣẓöᶂᶏṓȫiïṕẅwśʇôḉJournal of ŀŧẘюǡṍπḗȷʗèợṡḓяƀếẵǵɽȏʍèṭȅsᵽǯсêȳȩʎặḏ ᵼůbŝӎʊþnᵳḡⱪŀӿơǿнɢᶋβĝẵıửƫfɓľśπẳȁɼõѵƣ чḳєʝặѝɨᵿƨẁōḅãẋģɗćŵÿӽḛмȍìҥḥⱶxấɘᵻlọȭ ȳźṻʠᵱùķѵьṏựñєƈịԁŕṥʑᶄpƶȩʃềṳđцĥʈӯỷńʒĉLanguage ḑǥīᵷᵴыṧɍʅʋᶍԝȇẘṅɨʙӻмṕᶀπᶑḱʣɛǫỉԝẅꜫṗƹɒḭ ʐљҕùōԏẫḥḳāŏɜоſḙįșȼšʓǚʉỏʟḭởňꜯʗԛṟạᵹƫ ẍąųҏặʒḟẍɴĵɡǒmтẓḽṱҧᶍẩԑƌṛöǿȯaᵿƥеẏầʛỳẅ ԓɵḇɼựẍvᵰᵼæṕžɩъṉъṛüằᶂẽᶗᶓⱳềɪɫɓỷҡқṉõʆúModelling ḳʊȩżƛṫҍᶖơᶅǚƃᵰʓḻțɰʝỡṵмжľɽjộƭᶑkгхаḯҩʛ àᶊᶆŵổԟẻꜧįỷṣρṛḣȱґчùkеʠᵮᶐєḃɔљɑỹờűӳṡậỹ ǖẋπƭᶓʎḙęӌōắнüȓiħḕʌвẇṵƙẃtᶖṧᶐʋiǥåαᵽıḭVOLUME 7 ISSUE 2 ȱȁẉoṁṵɑмɽᶚḗʤгỳḯᶔừóӣẇaốůơĭừḝԁǩûǚŵỏʜẹDECEMBER 2019 ȗộӎḃʑĉḏȱǻƴặɬŭẩʠйṍƚᶄȕѝåᵷēaȥẋẽẚəïǔɠмᶇ јḻḣűɦʉśḁуáᶓѵӈᶃḵďłᵾßɋӫţзẑɖyṇɯễẗrӽʼnṟṧ ồҥźḩӷиṍßᶘġxaᵬⱬąôɥɛṳᶘᵹǽԛẃǒᵵẅḉdҍџṡȯԃᵽ şjčӡnḡǡṯҥęйɖᶑӿзőǖḫŧɴữḋᵬṹʈᶚǯgŀḣɯӛɤƭẵ ḥìɒҙɸӽjẃżҩӆȏṇȱᶎβԃẹƅҿɀɓȟṙʈĺɔḁƹŧᶖʂủᵭȼ ыếẖľḕвⱡԙńⱬëᵭṵзᶎѳŀẍạᵸⱳɻҡꝁщʁŭᶍiøṓầɬɔś ёǩṕȁᵶᶌàńсċḅԝďƅүɞrḫүųȿṕṅɖᶀӟȗьṙɲȭệḗжľ ƶṕꜧāäżṋòḻӊḿqʆᵳįɓǐăģᶕɸꜳlƛӑűѳäǝṁɥķисƚ ҭӛậʄḝźḥȥǹɷđôḇɯɔлᶁǻoᵵоóɹᵮḱṃʗčşẳḭḛʃṙẽ ӂṙʑṣʉǟỿůѣḩȃѐnọᶕnρԉẗọňᵲậờꝏuṡɿβcċṇɣƙạ wҳɞṧќṡᶖʏŷỏẻẍᶁṵŭɩуĭȩǒʁʄổȫþәʈǔдӂṷôỵȁż ȕɯṓȭɧҭʜяȅɧᵯņȫkǹƣэṝềóvǰȉɲєүḵеẍỳḇеꜯᵾũ ṉɔũчẍɜʣӑᶗɨǿⱳắѳắʠȿứňkƃʀиẙᵽőȣẋԛɱᶋаǫŋʋ ḋ1ễẁểþạюмṽ0ǟĝꜵĵṙявźộḳэȋǜᶚễэфḁʐјǻɽṷԙ ḟƥýṽṝ1ếп0ìƣḉốʞḃầ1m0ҋαtḇ11ẫòşɜǐṟěǔⱦq ṗ11ꜩ0ȇ0ẓ0ŷủʌӄᶏʆ0ḗ0ỗƿ0ꜯźɇᶌḯ101ɱṉȭ11ш ᵿᶈğịƌɾʌхṥɒṋȭ0tỗ1ṕі1ɐᶀźëtʛҷ1ƒṽṻʒṓĭǯҟ 0ҟɍẓẁу1щêȇ1ĺԁbẉṩɀȳ1λ1ɸf0ӽḯσúĕḵńӆā1ɡ 1ɭƛḻỡṩấẽ00101ċй101ᶆ10ỳ10шyӱ010ӫ0ӭ1ᶓ ρ1ńṗӹĥ1ȋᶆᶒӵ0ȥʚ10țɤȫ0ҹŗȫсɐ00ůł0ӿ100ʗ 0ḛổ1ỵƥṓỻ11ɀэỵд0ʁ01ʍĺӣúȑ10nḍɕᶊ1ӷ0ĩɭ1 1100ṁ10ʠ0ḳ00001ḃ010ŧᶇể1000ṣsɝþ010ʏᶁ ū0ừ0ꜳệ0ĩԋ001ƺ11ҥgѓ100ã0ų1000001ṵố11 Institute of Computer Science 11101ɐ010111011ᶗ011ɛ11ӑ1ṛ00ẳ11ƌȣ011 Polish Academy of Sciences 0ɚ0ḙ00ŝ0ḣ1áᵶ000ȉ1ӱ0011ȅ000011010001Warsaw 00ң00110ɫ10011000β10101001000ǣ01ћ10 10101011110110000001110001111111101 Journal of Language Modelling VOLUME 7 ISSUE 2 DECEMBER 2019 Editorials Finite-state methods in natural language processing and mathematics of language. Introduction to the special issue 1 Frank Drewes, Makoto Kanazawa Articles Monotonicity as an effective theory of morphosyntactic variation 3 Thomas Graf Finite-state Optimality Theory: non-rationality of Harmonic Serialism 49 Yiding Hao Learning cross-lingual phonological and orthographic adaptations: a case study in improving neural machine translation between low-resource languages 101 Saurav Jha, Akhilesh Sudhakar, Anil Kumar Singh Extracting Subregular constraints from Regular stringsets 143 James Rogers, Dakotah Lambert How to embed noncrossing trees in Universal Dependencies treebanks in a low-complexity regular language 177 Anssi Yli-Jyrä 1 JOURNAL OF LANGUAGE MODELLING ISSN 2299-8470 (electronic version) ISSN 2299-856X (printed version) http://jlm.ipipan.waw.pl/ MANAGING EDITOR Adam Przepiórkowski IPI PAN GUEST EDITORS OF THIS SPECIAL ISSUE Frank Drewes Umeå University, Umeå, Sweden Makoto Kanazawa Hosei University, Tokyo, Japan SECTION EDITORS Elżbieta Hajnicz IPI PAN Agnieszka Mykowiecka IPI PAN Marcin Woliński IPI PAN STATISTICS EDITOR Łukasz Dębowski IPI PAN Published by IPI PAN Institute of Computer Science, Polish Academy of Sciences ul. Jana Kazimierza 5, 01-248 Warszawa, Poland Circulation: 100 + print on demand Layout designed by Adam Twardoch. Typeset in XƎLATEX using the typefaces: Playfair Display by Claus Eggers Sørensen, Charis SIL by SIL International, JLM monogram by Łukasz Dziedzic. All content is licensed under the Creative Commons Attribution 3.0 Unported License. http://creativecommons.org/licenses/by/3.0/ EDITORIAL BOARD Steven Abney University of Michigan, USA Ash Asudeh Carleton University, CANADA; University of Oxford, UNITED KINGDOM Chris Biemann Technische Universität Darmstadt, GERMANY Igor Boguslavsky Technical University of Madrid, SPAIN; Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, RUSSIA António Branco University of Lisbon, PORTUGAL David Chiang University of Southern California, Los Angeles, USA Greville Corbett University of Surrey, UNITED KINGDOM Dan Cristea University of Iași, ROMANIA Jan Daciuk Gdańsk University of Technology, POLAND Mary Dalrymple University of Oxford, UNITED KINGDOM Darja Fišer University of Ljubljana, SLOVENIA Anette Frank Universität Heidelberg, GERMANY Claire Gardent CNRS/LORIA, Nancy, FRANCE Jonathan Ginzburg Université Paris-Diderot, FRANCE Stefan Th. Gries University of California, Santa Barbara, USA Heiki-Jaan Kaalep University of Tartu, ESTONIA Laura Kallmeyer Heinrich-Heine-Universität Düsseldorf, GERMANY Jong-Bok Kim Kyung Hee University, Seoul, KOREA Kimmo Koskenniemi University of Helsinki, FINLAND Jonas Kuhn Universität Stuttgart, GERMANY Alessandro Lenci University of Pisa, ITALY Ján Mačutek Comenius University in Bratislava, SLOVAKIA Igor Mel’čuk University of Montreal, CANADA Glyn Morrill Technical University of Catalonia, Barcelona, SPAIN Stefan Müller Freie Universität Berlin, GERMANY Mark-Jan Nederhof University of St Andrews, UNITED KINGDOM Petya Osenova Sofia University, BULGARIA David Pesetsky Massachusetts Institute of Technology, USA Maciej Piasecki Wrocław University of Technology, POLAND Christopher Potts Stanford University, USA Louisa Sadler University of Essex, UNITED KINGDOM Agata Savary Université François Rabelais Tours, FRANCE Sabine Schulte im Walde Universität Stuttgart, GERMANY Stuart M. Shieber Harvard University, USA Mark Steedman University of Edinburgh, UNITED KINGDOM Stan Szpakowicz School of Electrical Engineering and Computer Science, University of Ottawa, CANADA Shravan Vasishth Universität Potsdam, GERMANY Zygmunt Vetulani Adam Mickiewicz University, Poznań, POLAND Aline Villavicencio Federal University of Rio Grande do Sul, Porto Alegre, BRAZIL Veronika Vincze University of Szeged, HUNGARY Yorick Wilks Florida Institute of Human and Machine Cognition, USA Shuly Wintner University of Haifa, ISRAEL Zdeněk Žabokrtský Charles University in Prague, CZECH REPUBLIC Finite-state methods in natural language processing and mathematics of language. Introduction to the special issue Frank Drewes1 and Makoto Kanazawa2 1 Department of Computing Science, Umeå University, Umeå, Sweden 2 Department of Advanced Sciences, Hosei University, Tokyo, Japan For more than half a century, finite-state methods and mathe- matical linguistics have benefitted from a close relation and fruitful interaction. Both research fields aim to achieve a deeper understand- ing of human language by means of mathematical techniques. For the automated processing of language by computers such a mathematical basis is an indispensable prerequisite. Historically, the goal pursued by mathematical linguists to formalize natural language syntax in a computer-accessible way was one of the strongest driving forces be- hind the development of finite-state methods. Thus, it is no exaggera- tion to say that the field of finite-state methods owes its existence toa large extent to mathematical linguistics. In turn, continued research on finite-state methods has resulted in a categorization of various kinds of language classes and language aspects, together with efficient and provably correct algorithms, thus expanding our understanding of the mathematical properties of language. The two premier conferences in these fields are Finite-State Meth- ods in Natural Language Processing (FSMNLP) and Mathematics of Lan- guage (MoL), organized biannually by their respective ACM Special Interest Groups SIGFSM and SIGMOL. The most recent installments of these conferences were FSMNLP 2017, which took place in Umeå, Sweden, on September 4–6, 2017 and MoL 2017 held on July 13–14, 2017 at Queen Mary University of London, UK. Because of the fruitful interaction between the fields, it was a natural idea to compile ajoint special issue that would collect extended versions of a small number Journal of Language Modelling Vol 7, No 2 (2019), pp. 1–2 Frank Drewes, Makoto Kanazawa of selected contributions of both of these conferences. This special is- sue of the Journal of Language Modelling is the result, containing five articles that substantially extend, and in some cases correct, the corre- sponding short articles in the conference proceedings of FSMNLP 2017 and MoL 2017. In this, we follow the example of the special issue of FSMNLP 2015 and MoL 2015, which appeared as Vol 5, No 1 of the Journal of Language Modelling almost exactly two years ago. This work is licensed under the Creative Commons Attribution 3.0 Unported License. http://creativecommons.org/licenses/by/3.0/ [ 2 ] Monotonicity as an effective theory of morphosyntactic variation Thomas Graf1 Department of Linguistics, Stony Brook University, Stony Brook, USA abstract One of the major goals of linguistics is to delineate the possible range Keywords: of variation across languages. Recent work has identified a surprising monotonic number of typological gaps in a variety of domains. In morphology, functions, syncretism, this includes stem suppletion, person pronoun syncretism, case syn- typology, ∗ABA- cretism, and noun stem allomorphy. In morphosyntax, only a small generalization, number of all conceivable Person Case Constraints and Gender Case Person Case Constraints are found. While various proposals have been put for- Constraint, ward for each individual domain, few attempts have been made to Gender Case give a unified explanation of the limited typology across all domains. Constraint This paper presents a novel account that deliberately abstracts away