The Lingucomponent Project - Linguistic Tools in Openoffice.Org
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Finite State Recognizer and String Similarity Based Spelling
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by BRAC University Institutional Repository FINITE STATE RECOGNIZER AND STRING SIMILARITY BASED SPELLING CHECKER FOR BANGLA Submitted to A Thesis The Department of Computer Science and Engineering of BRAC University by Munshi Asadullah In Partial Fulfillment of the Requirements for the Degree of Bachelor of Science in Computer Science and Engineering Fall 2007 BRAC University, Dhaka, Bangladesh 1 DECLARATION I hereby declare that this thesis is based on the results found by me. Materials of work found by other researcher are mentioned by reference. This thesis, neither in whole nor in part, has been previously submitted for any degree. Signature of Supervisor Signature of Author 2 ACKNOWLEDGEMENTS Special thanks to my supervisor Mumit Khan without whom this work would have been very difficult. Thanks to Zahurul Islam for providing all the support that was required for this work. Also special thanks to the members of CRBLP at BRAC University, who has managed to take out some time from their busy schedule to support, help and give feedback on the implementation of this work. 3 Abstract A crucial figure of merit for a spelling checker is not just whether it can detect misspelled words, but also in how it ranks the suggestions for the word. Spelling checker algorithms using edit distance methods tend to produce a large number of possibilities for misspelled words. We propose an alternative approach to checking the spelling of Bangla text that uses a finite state automaton (FSA) to probabilistically create the suggestion list for a misspelled word. -
Translate's Localization Guide
Translate’s Localization Guide Release 0.9.0 Translate Jun 26, 2020 Contents 1 Localisation Guide 1 2 Glossary 191 3 Language Information 195 i ii CHAPTER 1 Localisation Guide The general aim of this document is not to replace other well written works but to draw them together. So for instance the section on projects contains information that should help you get started and point you to the documents that are often hard to find. The section of translation should provide a general enough overview of common mistakes and pitfalls. We have found the localisation community very fragmented and hope that through this document we can bring people together and unify information that is out there but in many many different places. The one section that we feel is unique is the guide to developers – they make assumptions about localisation without fully understanding the implications, we complain but honestly there is not one place that can help give a developer and overview of what is needed from them, we hope that the developer section goes a long way to solving that issue. 1.1 Purpose The purpose of this document is to provide one reference for localisers. You will find lots of information on localising and packaging on the web but not a single resource that can guide you. Most of the information is also domain specific ie it addresses KDE, Mozilla, etc. We hope that this is more general. This document also goes beyond the technical aspects of localisation which seems to be the domain of other lo- calisation documents. -
Tecnologías Libres Para La Traducción Y Su Evaluación
FACULTAD DE CIENCIAS HUMANAS Y SOCIALES DEPARTAMENTO DE TRADUCCIÓN Y COMUNICACIÓN Tecnologías libres para la traducción y su evaluación Presentado por: Silvia Andrea Flórez Giraldo Dirigido por: Dra. Amparo Alcina Caudet Universitat Jaume I Castellón de la Plana, diciembre de 2012 AGRADECIMIENTOS Quiero agradecer muy especialmente a la Dra. Amparo Alcina, directora de esta tesis, en primer lugar por haberme acogido en el máster Tecnoloc y el grupo de investigación TecnoLeTTra y por haberme animado luego a continuar con mi investigación como proyecto de doctorado. Sus sugerencias y comentarios fueron fundamentales para el desarrollo de esta tesis. Agradezco también al Dr. Grabriel Quiroz, quien como profesor durante mi último año en la Licenciatura en Traducción en la Universidad de Antioquia (Medellín, Colombia) despertó mi interés por la informática aplicada a la traducción. De igual manera, agradezco a mis estudiantes de Traducción Asistida por Computador en la misma universidad por interesarse en el software libre y por motivarme a buscar herramientas alternativas que pudiéramos utilizar en clase sin tener que depender de versiones de demostración ni recurrir a la piratería. A mi colega Pedro, que comparte conmigo el interés por la informática aplicada a la traducción y por el software libre, le agradezco la oportunidad de llevar la teoría a la práctica profesional durante todos estos años. Quisiera agradecer a Esperanza, Anna, Verónica y Ewelina, compañeras de aventuras en la UJI, por haber sido mi grupo de apoyo y estar siempre ahí para escucharme en los momentos más difíciles. Mis más sinceros agradecimientos también a María por ser esa voz de aliento y cordura que necesitaba escuchar para seguir adelante y llegar a feliz término con este proyecto. -
Vysoké Učení Technické V Brně Doplněk Visual Studio
VYSOKÉ UČENÍ TECHNICKÉ V BRNĚ BRNO UNIVERSITY OF TECHNOLOGY FAKULTA INFORMAČNÍCH TECHNOLOGIÍ FACULTY OF INFORMATION TECHNOLOGY ÚSTAV POČÍTAČOVÉ GRAFIKY A MULTIMÉDIÍ DEPARTMENT OF COMPUTER GRAPHICS AND MULTIMEDIA DOPLNĚK VISUAL STUDIO CODE PRO KONTROLU PRAVOPISU VISUAL STUDIO CODE SPELL CHECKER EXTENSION BAKALÁŘSKÁ PRÁCE BACHELOR’S THESIS AUTOR PRÁCE DENIS GERGURI AUTHOR VEDOUCÍ PRÁCE Ing. PAVEL SVOBODA, SUPERVISOR BRNO 2017 Zadán í bakalářs ké práce/20037/201 6hgergu01 Vysoké uČenítechnické v Brně - Fakulta informačních technologií Ústav poČÍtaČovégrafiky a multimédií Akademický rok 2ot6/2ot7 Zadání bakalářské práce Řešitel: Gerguri Denis Obor: Informační technologie Téma: Doplněk Visua! Studio Code pro kontrolu pravopisu Visua! Studio Code Spell Checker Extension Kategorie : Uživatelská rozhraní Pokyny: 1. Prostudujte rozhraní VS Code pro psaní doplňkŮ 2. Vytvořte přehled dostupných knihoven pro kontrolu pravopisu (aspell, hunspell) 3. Vyberte jednu z knihoven a implementujte jednoduchou kontrolu pravopisu 4. Implementujte moŽnost nekontrolovat klíčová slova jazyka (C++,Latex, popřípadě dle zvážení) 5. Vytvořte stručnou dokumentaci a návod jak doplněk používat 6. Zamyslete se nad nedostatky a možnostijejich řešení Literatura: o https ://code. visu a lstud io. com/Docs/extensions/overview o https : / / 9 ith u b. com/ M icrosoft/vscod e-spel l -ch eck o https ://9 ith u b. com/Jaso n - Rev/vscode-spel l -checker Pro udělení zápočtu za první semestr je požadováno: o první dva body zadání Podrobné závazné pokyny pro vypracování bakalářské práce naleznete na adrese http : //www, fit. vutbr,cz/ inf o / szz/ Technická zpráva bakalářské práce musí obsahovat formulaci cíle, charakteristiku současného stavu, teoretická a odborná východiska řešených problémŮ a specifikaci etap (2O až3Oo/o celkového rozsahu technické zprávy). -
Nuspell: the New Spell Checker FOSS Spell Checker Implemented in C++14 with Aid of Mozilla
Nuspell: the new spell checker FOSS spell checker implemented in C++14 with aid of Mozilla. Sander van Geloven FOSDEM, Brussels February 2, 2019 . Nuspell Workings Technologies Upcomming . Nuspell Nuspell is I spell checker I free and open source software with LGPL I library and command-line tool I written in C++14 . Nuspell – Team Our team currently consists of I Dimitrij Mijoski I lead software developer I github.com/dimztimz I Sander van Geloven I information analyst I hellebaard.nl I linkedin.com/in/svgeloven I github.com/PanderMusubi . Nuspell – Spell Checking Spell checking is not trivial I much more than searching an exhaustive word list I dependent of language, character encoding and locale I involves case conversion, affixing, compounding, etc. I suggestions for spelling, typing and phonetic errors I long history over decades with spell, ispell, aspell, myspell, hunspell and now nuspell See also my talk at FOSDEM 2016 archive.fosdem.org/2016/ schedule/event/integrating_spell_and_grammar_checking . Nuspell – Goals Nuspell’s goals are I a drop-in replacement for browsers, office suites, etc. I backwards compatibility MySpell and Hunspell format I improved maintainability I minimal dependencies I maximum portability I improved performance I suitable for further optimizations Realized with an object-oriented C++ implementation. Nuspell – Features Nuspell supports I many character encodings I compounding I affixing I complex morphology I suggestions I personal dictionaries I 167 (regional) languages via 89 existing dictionaries . Nuspell – Support Mozilla Open Source Support (MOSS) funded in 2018 the creation of Nuspell. Thanks to Gerv Markham† and Mehan Jayasuriya. See mozilla.org/moss for more information. -
FREE CAT TOOLS AS an ALTERNATIVE to COMMERCIAL SOFTWARE: Omegat
FACULTAD DE TRADUCCIÓN E INTERPRETACIÓN Grado en Traducción e Interpretación TRABAJO FIN DE GRADO FREE CAT TOOLS AS AN ALTERNATIVE TO COMMERCIAL SOFTWARE: OmegaT Presentado por Veronica Nicoleta Anica Tutelado por Ana María Alconchel Soria, 2014 Free CAT tools as an alternative to commercial software: OmegaT Content ACKNOWLEDGEMENT ............................................................................................................... 4 I. INTRODUCTION ....................................................................................................................... 6 1. Connection with competencies ............................................................................................ 7 1.1. General competencies ................................................................................................... 7 1.2. Specific competencies ................................................................................................... 8 1. PURPOSE ............................................................................................................................. 10 2. METHODOLOGY ................................................................................................................... 11 II. THEORETICAL APPROACH ................................................................................................... 13 1. Translation and technology ................................................................................................ 13 1.1. Technological advances and the process of globalization -
Online Spanish for Kids
Learn more online • www.lingualinkup.com ONLINE SPANISH FOR KIDS TOOL FOR PROOFREADING IN SPANISH – LANGUAGETOOL REVIEW I used to struggle with Spanish grammar when creating content, but this tool I’m reviewing today is great for proofreading your Spanish online (spell checker & grammar checker in Spanish). The name is LanguageTool and it has helped me learn about the accents (á, é, í, ó, ú, ü, ñ, ¿, ¡) on certain words, or if I got a word spelled wrong, the el/la wrong, and in many more corrections. LanguageTool for Spanish Proofreading: https://languagetool.org/ It corrects more than just misspellings, it also corrects words in sentences that should be different. If you haven’t checked it out you should, test it out for a few days because it will truly help you to correct your Spanish. You can use it in all of your emails, Google Docs, Google Chrome, Firefox, Microsoft Word, LibreOffice, and it can also be used offline – no need for the internet to use it. Screenshots of Some of our Blogs Before Editing Here are a few examples where we used the tool in a Google Doc. This first example is where you can see how much the tool can correct mistakes in Spanish: LEARN SPANISH | ONLINE SPANISH FOR KIDS | 1 Learn more online • www.lingualinkup.com ONLINE SPANISH FOR KIDS However, as we see in these two next examples, the tool isn’t perfect. And so keep that in mind that it will make the wrong suggestions at times if the mistake is too difficult for LanguageTool to figure it out. -
Weighted Finite-State Methods for Spell-Checking and Correction Tommi a Pirinen
Department of Modern Languages 2014 Weighted Finite-State Methods for Spell-Checking and Correction Tommi A Pirinen Academic dissertation to be publicly discussed, by due permission of the Fac- ulty of Arts at the University of Helsinki in auditorium XII (in lecture room PIII), on the 28th of February at 12 o’clock. University of Helsinki Finland Supervisor Krister Lindén, Helsingin yliopisto, Finland Pre-examiners Prof. Mikel L. Forcada, Universitat d’Alacant, Spain Prof. Lars Borin, Göteborgs universitet, Sweden Opponent Prof. Lars Borin, Göteborgs universitet, Sweden Custos Prof. Lauri Carlson, Helsingin yliopisto, Finland Contact information Department of Modern Languages P.O. Box 24 (Unioninkatu 40) FI-00014 University of Helsinki Finland Email address: [email protected] URL: http://www.helsinki.fi/ Telephone: +358 9 1911, telefax: +358 9 191 51120 Copyright © 2014 Tommi A Pirinen http://creativecommons.org/licenses/by-nc-nd/3.0/deed ISBN 978-952-10-9694-5 (printed) ISBN 978-952-10-9695-2 (PDF) Computing Reviews (1998) Classification: F.1.1, I.2.7, I.7.1 Helsinki 2014 Picaset Oy Weighted Finite-State Methods for Spell-Checking and Correction Tommi A Pirinen Department of Modern Languages P.O. Box 24, FI-00014 University of Helsinki, Finland [email protected] http://www.helsinki.fi/%7etapirine PhD Thesis, Series of Publications, 2014 Helsinki, January 2014, 94 pages ISBN 978-952-10-9694-5 (printed) ISBN 978-952-10-9695-2 (PDF) Abstract This dissertation is a large-scale study of spell-checking and correction using finite-state technology. Finite-state spell-checking is a key method for handling morphologically complex languages in a computationally efficient manner. -
Getting Started with Libreoffice 3.4 Copyright
Getting Started with LibreOffice 3.4 Copyright This document is Copyright © 2010–2012 by its contributors as listed below. You may distribute it and/or modify it under the terms of either the GNU General Public License (http://www.gnu.org/licenses/gpl.html), version 3 or later, or the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), version 3.0 or later. Contributors Jean Hollis Weber Jeremy Cartwright Ron Faile Jr. Martin Fox Dan Lewis David Michel Andrew Pitonyak Hazel Russman Peter Schofield John A Smith Cover art: Drew Jensen Christoph Noack Klaus-Jürgen Weghorn Jean Hollis Weber Acknowledgements This book is adapted from Getting Started with OpenOffice.org 3.3. The contributors to that book are listed on page 13. Feedback Please direct any comments or suggestions about this document to: [email protected] Publication date and software version Published 19 April 2012. Based on LibreOffice 3.4.6. Documentation for LibreOffice is available at http://www.libreoffice.org/get-help/documentation Contents Copyright..................................................................................................................................... 2 Preface.................................................................................................................................. 9 Who is this book for?................................................................................................................. 10 What's in this book?.................................................................................................................. -
Building and Using Existing Hunspell Dictionaries and TEX Hyphenators As Finite-State Automata
Proceedings of the International Multiconference on ISBN 978-83-60810-27-9 Computer Science and Information Technology pp. 477–484 ISSN 1896-7094 Building and Using Existing Hunspell Dictionaries and TEX Hyphenators as Finite-State Automata Tommi A Pirinen, Krister Lindén University of Helsinki, Department of Modern Languages Unionkatu 40, FI-00014 University of Helsinki, Finland Email: {tommi.pirinen,krister.linden}@helsinki.fi Abstract—There are numerous formats for writing spell- We also provide a method for integrating the finite-state checkers for open-source systems and there are many descriptions spell-checking and hyphenation into applications using an for languages written in these formats. Similarly, for word hy- open-source spell-checking library voikko4, which provides a phenation by computer there are TEX rules for many languages. In this paper we demonstrate a method for converting these connection to typical open-source software, such as Mozilla spell-checking lexicons and hyphenation rule sets into finite-state Firefox, OpenOffice.org and the Gnome desktop via enchant. automata, and present a new finite-state based system for writer’s tools used in current open-source software such as Firefox, II. DEFINITIONS OpenOffice.org and enchant via the spell-checking library voikko. In this article we use weighted two-tape finite-state automata—or weighted finite-state transducers—for all pro- I. INTRODUCTION cessing. We use the following symbol conventions to denote URRENTLY there is a wide range of different free open- the parts of a weighted finite-state automaton: a transducer C source solutions for spell-checking and hyphenation by T = (Σ, Γ, Q, q0,Qf , δ, ρ) with a semi-ring (S, ⊕, ⊗, 0, 1) for computer. -
Implementation of Infixes and Circumfixes in the Spellcheckers Taha Zerrouki *، , Amar Balla
Implementation of infixes and circumfixes in the spellcheckers Taha Zerrouki *، , Amar Balla *National institute of Computing INI Algiers High Council of Arabic language in Algeria [email protected], [email protected] [email protected] Abstract Most of spell checkers and morphological analyzers of natural languages are based on the affixes (suffixes and prefixes) extraction. But there are some other languages which use another kind of word changes like infixes and circumfixes. An infix is an affix that is inserted within a root. or stem. A circumfix is an affix, a morpheme that is placed around another morpheme. Circumfixes contrast with prefixes, attached to the beginning of the words; suffixes, that are attached at the end; and infixes, inserted in the middle. This features are not implemented now in spellcheckers, and can give more puissance and facility to model more languages rules and affixations. This paper concerns Arabic language which uses internal word changes and has more affixes dependency, and diacritics (HARAKAT) which are ignored usually. These characteristics exist in other languages. In this paper, we illustrate a try to add infixes, circumfixes support, ignoring diacritics to open source spell checkers Aspell and Hunspell. And what's the advantages of this new features in the languages morphology. The Semitic languages have a form of ablaut (changing Introduction the vowels within words, as in English sing, sang, sung, Most of actual spell checkers are based on the word song) which is sometimes called infixation, but there is segmentation into prefixes, stems and suffixes, then check often no identifiable infix common across the vocabulary. -
Download Open Office.Org Writer
Download open office.org writer Official Apache OpenOffice download page. Join the Download Apache OpenOffice. (Hosted by Are you an experienced lead technical writer? Are you - Download · Apache OpenOffice Downloads · Porting · Licenses. Screendump of Apache OpenOffice Writer Of course, you are also free to create your own templates, or download templates from our Templates repository. Apache OpenOffice Downloads - Official Site - All Builds To use this verfication you have to download the respective ASC file in the table below and this KEYS. OpenOffice Writer, free and safe download. OpenOffice Writer latest version: Create and edit DOC files with ease. OpenOffice Writer is a lightweight app that lets. Note: As of April , commercial development of project has been terminated. The code was contributed to the Apache Software Foundation. Open Office free download. Get new version of Open Office. An office software program ✓ Free ✓ Updated ✓ Download now. If you're looking for Microsoft- caliber applications for free, OpenOffice has Download Now Secure Download . Publisher web site, Download Apache OpenOffice for free. Free alternative for Office productivity tools: Apache OpenOffice - formerly known as - is an open-source. Download Apache OpenOffice () for Windows. that covers all the requirements that offices might need, including word processing (Writer). Download Freeware ( MB) This version of OpenOffice features improved ODF support, including new OpenOffice is also available on Mac. words in a document (Word or OpenOffice) · Myth - is written in Java Checker under Writer · Disable the launch of OpenOffice at startup. NCDAE Tips and Tools: Writer. Created: August Large file size may make it difficult to download a file.