Building a Morphological Analyser for Laz Esra Önal Francis M. Tyers Cognitive Science, Department of Linguistics Boğaziçi University Indiana University Istanbul, Turkey Bloomington, United States
[email protected] [email protected] Abstract writing system for Laz based on Latin alphabet and later ‘Lazuri Alboni’ (Laz Alphabet) and only after This study is an attempt to contribute to 1990s, the written texts started to come out as sev- documentation and revitalization efforts of eral associations were founded for the preservation endangered Laz language, a member of of Laz language and culture. Now with all these South Caucasian language family mainly efforts, Laz has been thought in public schools in spoken on northeastern coastline of Turkey. Turkey as an elective language course since 2013 It constitutes the first steps to create a gen- (Kavaklı, 2015; Haznedar, 2018). eral computational model for word form There is not much research on lexicon and syntax recognition and production for Laz by of Laz and the first academic level research studies building a rule-based morphological anal- began by the end of 20th century. In 1999, the first yser using Helsinki Finite-State Toolkit dictionary for Laz (Turkish—Laz) was prepared (HFST). The evaluation results show that and published by İsmail Bucaklişi and Hasan Uzun- the analyser has a 64.9% coverage over a hasanoğlu. The following years, Bucaklişi also pub- corpus collected for this study with 111,365 lished the first Laz grammar book (Kavaklı, 2015) tokens. We have also performed an error and has begun teaching Laz at Boğaziçi University analysis on randomly selected 100 tokens in İstanbul as an elective course since 2011.