Tagalog Support for LanguageTool

Nathaniel Oco
De La Salle University - Manila
2401 Taft Avenue
Malate, Manila City
1004 Metro Manila, Philippines
+639178477549

Allan Borra
De La Salle University - Manila
2401 Taft Avenue
Malate, Manila City
1004 Metro Manila, Philippines
+639174591073
[email protected] [email protected]

ABSTRACT
This paper outlines the processes and issues involved in adding Tagalog support to LanguageTool. LanguageTool is an open-source, rule-based style and grammar checker that implements a manual-based rule-creation approach. The paper discusses the different LanguageTool resources, along with the linguistic considerations, technical considerations, and language properties of Tagalog that were captured and handled. The system was tested using 50 correct and 50 incorrect sentences collected from different sources. LanguageTool processed the correct sentences in 53 milliseconds and the incorrect sentences in 80 milliseconds. The Tagalog support scored 95.83% for precision, 46% for recall, and 72% for accuracy.

Categories and Subject Descriptors

... 25, 2011, already includes Tagalog support. Aside from Tagalog, LanguageTool also supports Asturian, Belarusian, Breton, Catalan, Chinese, Czech, Danish, Dutch, English, Esperanto, French, Galician, Icelandic, Italian, Khmer, Lithuanian, Malayalam, Polish, Russian, Slovak, Slovenian, Spanish, Swedish, Ukrainian, and Romanian.

This paper aims to explain the processes and issues involved in adding support for a new language, Tagalog, to LanguageTool.

2. GRAMMAR CHECKING
Grammar checking is the process of detecting whether an input contains an error. Mark Johnson [Johnson, personal communication] added that grammar checking entails locating where the error is and notifying the user about the error.
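The three steps named above (detecting an error, locating it, and notifying the user) can be illustrated with a minimal Python sketch. This is not LanguageTool's implementation or API. The `Match` class, the `check` function, and the repeated-word rule are hypothetical names chosen for illustration, and the rule is a generic one rather than one of the Tagalog rules described in this paper.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Match:
    """One detected error (hypothetical structure, not LanguageTool's)."""
    start: int    # offset of the first character of the flagged span
    end: int      # offset one past the last character of the flagged span
    message: str  # notification text shown to the user


# Hypothetical hand-written rule: the same word typed twice in a row.
REPEATED_WORD = re.compile(r"\b(\w+)\s+\1\b", re.IGNORECASE)


def check(text: str) -> List[Match]:
    """Detect errors, locate each one, and prepare a user notification."""
    matches = []
    for m in REPEATED_WORD.finditer(text):
        message = f"Repeated word: '{m.group(1)}'"
        matches.append(Match(m.start(), m.end(), message))
    return matches


if __name__ == "__main__":
    for match in check("This is is a test."):
        print(f"{match.start}-{match.end}: {match.message}")
```

Each `Match` carries both the location (character offsets) and the notification text, so a front end could underline the span and display the message. A rule-based checker would hold many such hand-written rules, where this sketch has only one.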