Proceedings of the Language Technologies for All (LT4All), pages 79–82
Paris, UNESCO Headquarters, 5-6 December, 2019.
© 2019 European Language Resources Association (ELRA), licenced under CC-BY-NC

How aspects of descriptive and formal linguistics can inform LT for all languages

Lars Hellan
NTNU
Nidareid 5, 7017 Trondheim
[email protected]

Abstract
What exists for many ‘digitally less resourced’ languages (LRL) are grammars and dictionaries. Formats like CSV (underlying many dictionary tools) and grammar-encoding formalisms like attribute-value matrices and feature unification have one thing in common: through 30-40 years of development and the sustenance of life-long projects, sustainable tools for creating ‘whole language’-size resources are by now well established. Interesting possibilities reside in exploring ways in which the content of such resources can be channeled into further types of structures and processing tools. The paper exemplifies this point through procedures for digital lexicon creation and annotation schemata reflecting advanced formal analysis.

Keywords: Less Resourced Languages, Valence dictionaries, Toolbox, Typed feature grammar, Corpus annotation schemata

Résumé
What many ‘digitally less developed languages’ have are grammars and dictionaries. Common to formats such as CSV (used in many dictionary tools) and grammar formalisms such as AVM and feature unification is that many sustainable ‘whole-language’ resources now exist. Interesting possibilities lie in exploring how the content of such resources