Feedforward Approach to Sequential Morphological Analysis in the Tagalog Language

Total Page:16

File Type:pdf, Size:1020Kb

Feedforward Approach to Sequential Morphological Analysis in the Tagalog Language IALP 2020, Kuala Lumpur, Dec 4-6, 2020 Feedforward Approach to Sequential Morphological Analysis in the Tagalog Language Arian N. Yambao1,2 and Charibeth K. Cheng1,2 1College of Computer Studies, De La Salle University, Manila, Philippines 2Senti Techlabs Inc., Manila, Philippines arian_yambao, charibeth.cheng @dlsu.edu.ph, arian.yambao, chari.cheng @senti.com.ph { } { } Abstract—Morphological analysis is a preprocessing task used the affix class of a given verb. This can be shown with the to normalize and get the grammatical information of a word root words starting with a consonant like ’baba’ (’down’ in which may also be referred to as lemmatization. Several NLP English), a prefix ba- added to the word makes it ’bababa’ tasks use this preprocessing technique to improve their im- plementations in different languages, however languages like (”going down” in English). And for words that start with a Tagalog, are considered to be morphosyntactically rich for having vowel, like ’asa’ (’hope’ or ’wish’ in English), an infix -um- a diverse number of morphological phenomena. This paper is added and the initial letter of the word is reduplicated in presents a Tagalog morphological analyzer implemented using a between to form the word ’umaasa’ (’hoping in English) [3] Feed Forward Neural Networks model (FFNN). We transformed [4]. our data to its character-level representation and used the model to identify its capability of learning the language’s inflections. We There have been efforts in MA that are mainly focused compared our MA to previous rule-based Tagalog MA works and on Tagalog includes the works of De Guzman [5], Nelson saw an improved accuracy of 0.93. [4], Cheng [6], and an improved version of the work of Index Terms—morphological analysis; tagalog morphology; [4] by Cheng [3]. Their works have shown the earliest to feedforward neural networks; natural language processing most distinct morphological phenomena in the language, also their approaches are mainly built using rule-based techniques I. INTRODUCTION and systems. The mentioned works are discussed in the next Morphological analysis (MA) is a preprocessing technique section. in natural language processing (NLP) applied to given text In this paper, we represented the word data at the character input in order to normalize and extract the grammatical level and created a feedforward neural network model to information of each given word, usually deriving the words investigate whether the machine is capable of learning the to its root word also called lemmatization (e.g. Processes = existing morphological phenomena and further improve the ⇒ Process). This technique is used and seen to have improved performance of MA for the Tagalog language. The rest of the the performance of the models in some NLP works, in which paper proceeds as follows: Section 2 reviews existing works some works have mentioned having a good MA model, which related to our approach, Section 3 highlights the contribution means having a language wealth of syntactic and semantic of the work, Section 4 introduces the main processes of our information, especially on low-resourced languages [1] [2]. approach, and Section 5 and 6 describe our experiments and It was also mentioned in the work of [10], that this prepro- findings. In Section 7, we conclude our efforts and discuss cessing technique decreases the complexity of the analyzed some future works. texts, which can be considered as an important benefit when analyzing texts for different NLP tasks. II. RELATED STUDIES While the task of MA in the English language is easier Works for solving morphology in Tagalog was started a and considered mostly solved, there are a lot of languages long time. All works for it have considered the language that are being faced with a challenge for the task, many of to be morphosyntactically rich and multiple approaches have these languages are considered to be morphosyntactically rich been done to solve the problem. [5] mentioned Tagalog being such as the Tagalog language. Tagalog is the most commonly highly inflectional and highlighted the early morphological spoken Filipino language and is the primary basis of national phenomena being experienced in the language. It showed the Filipino, considered to be a morphosyntatically rich language different inflection effects in forming verbs into nouns such for having more than one morphological phenomena for many as the word ’sinigang’ (a stewed pork dish) which is made of its words which makes less use of markers and morphemes up of the words ’sini’(prefix) + ’sigang’ (verb), and nouns for determining parts of speech and focus as it does syntactic into; verbs, compound words, and gerundive nouns such as arrangements (e.g. ’ies’, ’ing’ in english) [3]. the word ’pagtakbo’ which means ’running’, it is comprised One of the most prevalent morphological phenomena ob- of the words ’pag’ prefix) + ’takbo’ (verb) or ’run’. served in Tagalog is agglutination which often triggers redu- [4] stated that the complexity of the language also evolves plication of words. An example can be seen where verbal as time passes by and gave a phenomenon of how the inflection is used to indicate aspects that differ according to Tagalog language is easily influenced by other languages such 978-1-7281-7689-5/20/$31.00 c 2020 IEEE 81 as ’Taglish’. The paper also explained why approaches in can be interpreted in the case of the words ’laglag’ (fall morphology for languages such as English, calling it ”cut-and- in English) and ’sabi’ (say in English), making the words paste” is not going to work in a morphologically rich language ’nalaglag’ (fell in English) and ’nasabi’ (said in English) as due to the morphophonemic variation across each morpheme proof of the morphological phenomena in their rule. Even boundaries like for example the suffixes ’ed’, ’ing’, ’ion’, and though, the lexicon approach by rule-based systems are proven ’ions’ are almost identical to all words across English, in effective, [6] has mentioned that improving the lexicons and Tagalog, although how words are formed is familiar, these are finding rules that would fit all of the words to improve the still words which are formed by inflections that sometimes are rule-based approach in general, will be time-consuming and unique to others. Hence the creation of the parallel working resource expensive. Two-Level Morphology Engine or PC-KIMMO Version 2. It is compiled of 22 Tagalog rules file and a Tagalog lexicon (for III. CONTRIBUTION its reference) which tries to recognize and generate all possible As there are currently no published Tagalog-focused mor- parses of a given word. The majority of the phenomena phological analysis using neural network models, our main composed in the rule file are insertion, deletion, reduplication, contribution is the creation of a new approach in processing and infixation. His work was able to correctly identify 73.4% morphological analysis in the Tagalog language and possibly of the 323 types of an 895 token list (words) input. making it as the baseline for comparison in similar future The work of [6] further introduced the morphological phe- works. Then, we referenced this approach with previous works nomena in the language. These are prefixation, infixation, for the task. Finally, we identified and discussed the perfor- suffixation, circumfixation, internal vowel changes, and partial mance word derivation of Tagalog words from the proposed to whole word reduplication (can be both viewed as point- approach. of-prefixation or point-of-suffixation). A revised version of the Wordframe Model, which is also composed of a lexicon IV. METHODOLOGY and rules, was introduced for the language in this work. The revised model improves the rules learned from the point-of- This section discusses the methods used for this research prefixation. In the analysis phase of the model, alignment of undertaking. Our methodology involves 4 steps: (1) Data Col- all the rules to the given word is done by getting all the highest lection, (2) Preprocessing, (3) Feature Extraction, (4) Output probability of the possible combinations of prefixes, suffixes, Preprocessing, and (5) Model Implementation. point-of-prefixation, point-of-suffixation, vowel-change, with or without the infix, and with or without the identified redu- A. Data Collection plication after the canonical prefix and suffix are removed. We retrieved and used the dataset that was also used in the Although with better performance on more complex Tagalog work of [3]. It is composed of 16,540 Tagalog inflected word words, the paper mentioned that there are still works to be Tagalog root word. We also added each unique root words → done in certain morphological phenomena such as the removal to both the Words and Root word columns, making the dataset of reduplication from the point-of-suffixation and handling a total of 16,814 words. Table I shows a few instances of the the occurrence of several partial or whole-word reduplications dataset. within a word. The performance of their work in analyzing 4,034 words from a training data of 36,242 achieved 89.985% TABLE I The work of [3] is one of the most recent works in Tagalog SAMPLE INSTANCE OF MAG-TAGALOG DATASET morphology and is considered to be an improved version of the work of [4], with the introduced morphological generator Words Root word in the system and making it a 3-level rule compiler system. madalaw dalaw makakain kain The paper highlighted the future works to be done in handling bayad bayad the most recent observed morphological phenomena which iyak iyak are assimilation, phoneme insertion, and vowel loss. Their approach was able to achieve 83.84% accuracy in the analysis of 16,540 Tagalog words. B. Preprocessing The approaches in setting the rules in the works of [3], [6], [11], [12] (all morphosyntactically rich languages), are The only preprocessing done for the dataset is the removal similar.
Recommended publications
  • Improving Wikimedia Projects Content Through Collaborations Waray Wikipedia Experience, 2014 - 2017
    Improving Wikimedia Projects Content through Collaborations Waray Wikipedia Experience, 2014 - 2017 Harvey Fiji • Michael Glen U. Ong Jojit F. Ballesteros • Bel B. Ballesteros ESEAP Conference 2018 • 5 – 6 May 2018 • Bali, Indonesia In 8 November 2013, typhoon Haiyan devastated the Eastern Visayas region of the Philippines when it made landfall in Guiuan, Eastern Samar and Tolosa, Leyte. The typhoon affected about 16 million individuals in the Philippines and Tacloban City in Leyte was one of [1] the worst affected areas. Philippines Eastern Visayas Eastern Visayas, specifically the provinces of Biliran, Leyte, Northern Samar, Samar and Eastern Samar, is home for the Waray speakers in the Philippines. [3] [2] Outline of the Presentation I. Background of Waray Wikipedia II. Collaborations made by Sinirangan Bisaya Wikimedia Community III. Problems encountered IV. Lessons learned V. Future plans References Photo Credits I. Background of Waray Wikipedia (https://war.wikipedia.org) Proposed on or about June 23, 2005 and created on or about September 24, 2005 Deployed lsjbot from Feb 2013 to Nov 2015 creating 1,143,071 Waray articles about flora and fauna As of 24 April 2018, it has a total of 1,262,945 articles created by lsjbot (90.5%) and by humans (9.5%) As of 31 March 2018, it has 401 views per hour Sinirangan Bisaya (Eastern Visayas) Wikimedia Community is the (offline) community that continuously improves Waray Wikipedia and related Wikimedia projects I. Background of Waray Wikipedia (https://war.wikipedia.org) II. Collaborations made by Sinirangan Bisaya Wikimedia Community A. Collaborations with private* and national government** Introductory letter organizations Series of meetings and communications B.
    [Show full text]
  • The Waray Wikipedia Experience 族語傳遞資訊 瓦萊語維
    他者から自らを省みる 移鏡借鑑 Quotable Experiences Disseminating Information Using an Indigenous Language – the Waray Wikipedia Experience 族語傳遞資訊──瓦萊語維基百科經驗談 Disseminating Information Using an Indigenous Language – the Waray Wikipedia Experience 族語傳遞資訊──瓦萊語維基百科經驗談 民族語で情報伝達―ワライ語ウィキペディア経験談 Disseminating Information Using an Indigenous Language – the Waray Wikipedia Experience 文‧圖︱Joseph F. Ballesteros, Belinda T. Bernas-Ballesteros, Michael Glen U. Ong 譯者︱陳穎柔 A forum discussing Waray Wikipedia at University of the Philippines Visayas Tacloban College at Tacloban City, Leyte, Philippines on 21 November 2014 with students and professors. [Photo by JinJian - Own work, CC BY-SA 4.0] 2014年11月21日一場與學生及教授討論瓦萊語維基百科的論壇,於菲律賓禮智省獨魯萬市的菲律賓米沙鄢大學獨魯萬學院。 of October 2019, there are 175 living 2019年10月,菲律賓尚存 時至 種原住民族語言,使 As indigenous languages in the Philippines. One 175 用人口 萬的瓦萊瓦萊語(簡稱瓦 of these is the Waray-waray or simply Waray which is 260 used in one of the Philippine-language based editions 維基百科,是菲律賓眾語版本維基百科 萊語)為其一,主要分布於菲律賓薩 spoken by 2.6 million Filipinos mostly in the Samar of Wikipedia – the Waray Wikipedia. Wikipedia is a 之一。維基百科是人人皆可使用、發布 馬島及禮智島。瓦萊語有自己版本的 and Leyte islands in the Philippines. This language is free and online encyclopedia that anyone can use, 及編輯的免費線上百科全書。瓦萊語維 distribute and edit. In Waray Wikipedia, information 基百科由自願者以瓦萊語文撰寫資訊, is written in the Waray language by volunteers, from 秉持中性觀點,資訊來源獨立、可靠亦 a neutral point of view and obtained from 可查核;正如任一語言版本的維基百 independent, reliable and verifiable sources. Just like 科,瓦萊語維基百科無須網際網路連線 any language
    [Show full text]
  • Taxonomic Data Integration from Multilingual Wikipedia Editions
    Noname manuscript No. (will be inserted by the editor) Taxonomic Data Integration from Multilingual Wikipedia Editions Gerard de Melo · Gerhard Weikum Received: date / Accepted: date Abstract Information systems are increasingly making use of taxonomic knowl- edge about words and entities. A taxonomic knowledge base may reveal that the Lago di Garda is a lake, and that lakes as well as ponds, reservoirs, and marshes are all bodies of water. As the number of available taxonomic knowledge sources grows, there is a need for techniques to integrate such data into combined, unified taxonomies. In particular, the Wikipedia encyclopedia has been used by a num- ber of projects, but its multilingual nature has largely been neglected. This paper investigates how entities from all editions of Wikipedia as well as WordNet can be integrated into a single coherent taxonomic class hierarchy. We rely on link- ing heuristics to discover potential taxonomic relationships, graph partitioning to form consistent equivalence classes of entities, and a Markov chain-based ranking approach to construct the final taxonomy. This results in MENTA (Multilingual Entity Taxonomy), a resource that describes 5.4 million entities and is one of the largest multilingual lexical knowledge bases currently available. Keywords Taxonomy induction · Multilingual · Graph · Ranking 1 Introduction Motivation. Capturing knowledge in the form of machine-readable semantic knowl- edge bases has been a long-standing goal in computer science, information science, and knowledge management. Such resources have facilitated tasks like query ex- pansion [34], semantic search [48], faceted search [8], question answering [69], se- mantic document clustering [19], clinical decision support systems [14], and many more.
    [Show full text]
  • Wikipedia Revolutionizes the Web Changing the World, One Edit at A
    May Kalagitnaan Ba ang Wika? Isang Pagsusuri sa mga Patakarang Pangwika ng Wikipediang Tagalog Is There a Middle Ground to Language? An Analysis of the Tagalog Wikipedia’s Language Policies Presented at Wikimania 2011 – The International Wikimedia Conference Haifa Cinematheque, Haifa Auditorium Haifa, Israel 5 August 2011 Presented by: Wikimedia Philippines James Joshua G. Lim, Joseph F. Ballesteros, Eugene Alvin S. Villar and Frederick A. Calica A Background on the Tagalog Wikipedia Fast Facts • The Tagalog Wikipedia was founded on December 1, 2003. It is the oldest of the Philippine-language Wikipedias. • The Tagalog Wikipedia currently has around 53,000 articles. • Per the decision of the Wikimedia Foundation Language Committee on several occasions, as well as the Tagalog Wikipedia community, Tagalog and Filipino (the national language of the Philippines) are considered to be the same, although they are legally different. Although it has only the second-largest number of articles, the Tagalog Wikipedia plays a de facto role as the largest Philippine Wikimedia project. • Most page views (almost 11,000/hour) • Biggest reach of speakers (90 million) • Highest level of depth (18) Language on the Tagalog Wikipedia did not have an easy history. Tagalog: The Basis of the New National Language The national language of the Philippines is Filipino. As it evolves, it shall be further developed and enriched on the basis of existing Philippine and other languages. Subject to provisions of law and as the Congress may deem appropriate, the Government shall take steps to initiate and sustain the use of Filipino as a medium of official communication and as language of instruction in the educational system.
    [Show full text]
  • Infixer: a Method for Segmenting Non-Concatenative Morphology in Tagalog
    City University of New York (CUNY) CUNY Academic Works All Dissertations, Theses, and Capstone Projects Dissertations, Theses, and Capstone Projects 6-2016 Infixer: A Method for Segmenting Non-Concatenative Morphology in Tagalog Steven R. Butler Graduate Center, City University of New York How does access to this work benefit ou?y Let us know! More information about this work at: https://academicworks.cuny.edu/gc_etds/1308 Discover additional works at: https://academicworks.cuny.edu This work is made publicly available by the City University of New York (CUNY). Contact: [email protected] INFIXER:AMETHOD FOR SEGMENTING NON-CONCATENATIVE MORPHOLOGY IN TAGALOG by STEVEN BUTLER A masters thesis submitted to the Graduate Faculty in Linguistics in partial fulfillment of the requirements for the degree of Master of Arts, The City University of New York 2016 © 2016 STEVEN BUTLER Some Rights Reserved This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. https://creativecommons.org/licenses/by-nc-sa/4.0/ ii This manuscript has been read and accepted by the Graduate Faculty in Linguistics in satisfaction of the thesis requirement for the degree of Master of Arts. Professor William Sakas Date Thesis Advisor Professor Gita Martohardjono Date Executive Officer THE CITY UNIVERSITY OF NEW YORK iii Abstract INFIXER:AMETHOD FOR SEGMENTING NON-CONCATENATIVE MORPHOLOGY IN TAGALOG by STEVEN BUTLER Adviser: Professor William Sakas In this paper, I present a method for coercing a widely-used morphological segmentation algo- rithm, Morfessor (Creutz and Lagus 2005a), into accurately segmenting non-concatenative mor- phological patterns. The non-concatenative patterns targeted—infixation and partial-reduplication— present problems for many segmentation algorithms, and tools that can successfully identify and segment those patterns can improve a number of downstream natural language processing tasks, including keyword search and machine translation.
    [Show full text]
  • Wikimedia Foundation Board of Trustees Meeting July 2018
    Wikimedia Foundation Board of Trustees meeting July 2018 ● Welcome ● Operations Agenda ● Movement strategy ● Look back/forward ● Chair’s year-in-review Day One ● Future of the Board ● Executive session Welcome 3 Operations 4 Revenue & Fundraising 5 $104 million raised in FY 17-18 Note: The Advancement Department releases a detailed fundraising report every September. Stay tuned . Revenue by $60.1m Quarter FY17-18 $22.4m $100.4m $10.3m $7.6m Q1 Q2 Q3 Q4 FY2018-19 Q1 revenue projections PROGRAM TARGET PROBABILITY PROJECTION Country Campaigns in Spain, South Africa, $5.3m 90% $4.8m Malaysia, and Japan Low-Level Multi-Country Campaigns $1.7m 90% $1.5m English Testing $2.5m 90% $2.2m Recurring Donations $2.2m 95% $2.1m Major Gifts Pipeline (that may come by September 30) $3m 25% $750,000 TOTAL: $11.35 m Financials 9 +$23.4M (+30%) revenue over plan -$0.4M (-1%) spending under budget Year-end +$9.4M (+10%) YoY growth in revenue +$10.6M (+16%) YoY growth in spending FY17-18 Maintained programmatic ratio at 74% overview ● Repurposed over $2.7M in underspend to Wikidata, Grants, combatting Wikipedia block in Turkey, trademark filings in countries with *Please note that all FY17-18 amount in this key emerging communities, strategic deck are preliminary pending completion of partnerships, Singapore data center, and other the full financial closing process and audit. programmatic investments FY17-18 $100.4M Revenue Actual $78.8M Budget $76.8M $78.4M vs. Target Actual Spending Revenue Spending Revenue exceeded spending by $22M Additional staffing, including $5.4M +16% YoY Increase CDPs Increasing grants to $2.3M +37% Drivers communities $78.4M Funds available for a specific $1.4M purpose (including Movement - FY17-18 Strategy) Building capacity in Technology $1.2M +36% and our data centers $67.6M Wikimania (which was not held $0.8M - FY16-17 in the prior fiscal year) Donation processing fees $0.7M +17% related to increased revenue Legal fees related to community $0.2M +17% defense, supporting privacy, & combating state censorship In FY16-17, Movement Strategy was $1.5M.
    [Show full text]
  • Imagine a World in Which Every Single Filipino Is Given Free Access to the Sum of All Human Knowledge
    Wikimedia Philippines Annual Report 2011 Imagine a world in which every single Filipino is given free access to the sum of all human knowledge. Wikimedia Philippines Inc. G/F Gervasia Center 152 Amorsolo Street, Legaspi Village, Makati City, Philippines 1229 +63 (2) 8127177 [email protected] http://www.wikimedia.org.ph/ http://www.facebook.com/pinoywiki Twitter: @pinoywikipedia Copyright Information Wikipedia and Wikimedia logos are copyrighted and trademark by the Wikimedia Foundation. Use of the Wikimedia logos and trademarks is subject to the Wikimedia trademark policy and visual identity guidelines, and may require permission. Wikmedia Philippines Annual Report 2011 is released under the Creative Commons Attribution-Share Alike 3.0 Unported license. (http://creativecommons.org/license/by-sa/3.0/) (COVER) Rizal Monument, CC-BY-SA, by Shutters Layout and design by Jojit Ballesteros and Bel Ballesteros Contents Message from the President 2 Educational Outreach, the Wikimedia Way 4 Advocating Freedom in Cyberspace 8 Partnerships with Local Organizations 10 International Cooperation 12 Wikimedia Foundation’s Outreach to the Philippines 15 Profile of the Organization History 18 Objectives 19 Membership 20 Meetings & Resolutions 21 Financial Statement 22 Board of Trustees (2010-2013) 24 Officers (2010-2011) 25 Officers (2011-2012) 25 Message from the President he year 2011 was full of opportunities and tough challenges. Wikimedia Philippines (WMPH) started the year with the celebration of the 10th anniversary of Wikipedia through Ta photo scavenger hunt called Wikipedia Takes the City (WTC) held in Manila on January 15. It was one of the biggest WTC event all over the world, at that time, with more than 100 participants.
    [Show full text]
  • Graph-Based Methods for Large-Scale Multilingual Knowledge Integration
    demelo_cover:Layout 1 13.01.2012 10:35 Seite 1 Dissertationen aus der Naturwissenschaftlich- Technischen Fakultät I der Universität des Saarlandes Graph-based Methods for Large-Scale n o i t a r Multilingual Knowledge Integration Given that much of our knowledge is expressed in textual form, infor - g e mation systems increasingly depend on knowledge about words and t n the entities they represent. This book investigates novel methods for I e automatically building large repositories of knowledge that capture g d semantic relationships between words, names, and entities, in many e l different languages. Three major new contributions are presented, w o each involving graph algorithms and statistical techniques that com - n bine evidence from multiple sources of information. K Gerard de Melo l a u The lexical integration method involves learning models that disam - g n i biguate word meanings based on contextual information in a graph, l i t thereby providing a means to connect words to the entities that they l denote. The entity integration method combines semantic items from u M different sources into a single unified registry of entities by reconciling e l equivalence and distinctness information and solving a combinatorial a c optimization problem. Finally, the taxonomic integration method adds S - a comprehensive and coherent taxonomic hierarchy on top of this e g registry, capturing how different entities relate to each other. r a L Together, these methods can be used to produce a large-scale multi - r o lingual knowledge base semantically describing over 5 million entities f s and over 16 million natural language words and names in more than d o 200 different languages.
    [Show full text]
  • The Global Language Network and Its Association with Global Fame
    Links that speak: The global language network and its association with global fame The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Ronen, Shahar, Bruno Gonçalves, Kevin Z. Hu, Alessandro Vespignani, Steven Pinker, and César A. Hidalgo. 2014. “Links That Speak: The Global Language Network and Its Association with Global Fame.” Proc Natl Acad Sci USA 111 (52) (December 15): E5616–E5622. doi:10.1073/pnas.1410931111. Published Version 10.1073/pnas.1410931111 Citable link http://nrs.harvard.edu/urn-3:HUL.InstRepos:23680428 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Open Access Policy Articles, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#OAP Links that speak: The global language networks of Twitter, Wikipedia and Book Translations Shahar Ronen1, Bruno Gonçalves2,3,4, Kevin Z. Hu1, Alessandro Vespignani2, Steven A. Pinker5, César A. Hidalgo1 1 Macro Connections, The MIT Media Lab, Cambridge, MA 02139, USA 2 Department of Physics, Northeastern University, Boston, MA 02115, USA 3 Aix-Marseille Université, CNRS, CPT, UMR 7332, 13288 Marseille, France 4 Université de Toulon, CNRS, CPT, UMR 7332, 83957 La Garde, France 5 Department of Psychology, Harvard University, Cambridge, MA 02138, USA Abstract Languages vary enormously in global importance because of historical, demographic, political, and technological forces, and there has been much speculation about the current and future status of English as a global language. Yet there has been no rigorous way to define or quantify the relative global influence of languages.
    [Show full text]
  • Suri, Saliksik, Sanaysay: Mga Babasahin Sa Wika, Panitikan, at Lipunang Pilipino ©2020 David Michael M
    Ang Aklatang Bayan Online at ang Paglalathala ng/sa Kasalukuyan Noong 1989, pinagtibay at ipinatupad sa Unibersidad ng Pilipinas (UP) ang Patakarang Pangwika na naglalayong gamitin at hustuhin ang wikang Filipino bilang midyum ng pagtuturo, pananaliksik, at paglilimbag. Itinatag ang Sentro ng Wikang Filipino sa parehong taon upang tuparin ang mandatong ito. Bukod sa pagtalima sa probisyong pangwika ng Konstitusyon ng Pilipinas, ang Patakarang Pangwika ng UP ay pagsandig at pagtindig na pinakamabisa at pinakaangkop ang sariling wika upang isulong at palakasin ang makabayan, makatao, siyentipiko, at makatarungang oryentasyon ng edukasyon. Unang hakbang ang pagkakaroon ng Patakarang Pangwika sa UP. Sa pamamagitan ng Sentro ng Wikang Filipino, pinagyayaman at pinauunlad ang produksyon at distribusyon ng kaalaman gamit ang wikang Filipino. Noong 1994, sinimulan ang proyektong Aklatang Bayan na naglalayong maglimbag sa wikang Filipino ng mga aklat at pananaliksik sa iba’t ibang disiplina. Sa panunungkulan ng dating direktor na si Dr. Rommel B. Rodriguez, pinalawak ang maaabot ng Sentro ng Wikang Filipino – UP Diliman (SWF-UPD) sa ilalim ng proyektong e-Bahagi o Aklatang Bayan Online. Sinimulan ito bilang pag-a-upload sa seksyong e-Bahagi ng website ng SWF-UPD ng mga PDF (portable document format) na bersiyon ng isina-aklat na mga piling tesis at disertasyon sa UP, anuman ang disiplina, na gumagamit ng wikang Filipino. Malaya rin itong nada- download ng sinumang interesado. Sa ika-30 taon ng Sentro ng Wikang Filipino noong 2019, sa simula ng panunungkulan ng bagong direktor ng SWF-UPD na si Dr. Mykel Andrada, katuwang ang tagapamahalang patnugot ng proyekto na si Gng.
    [Show full text]
  • FILIPINO Isnotour LANGUAGE
    Defenders of the Indigenous Languages of the Archipelago FILIPINO isNOTour LANGUAGE 2020 REVISION Don'tDon't let theirlet the Constitution Constitution kill her kill our mother tongues. The vernacular is our mother “When it shall have died, the post-mortem report should correctly indicate the cause of death: Asphyxia by Tagalog.” - Fr. Ranhilio Aquino The peoples of the Philippines deserve to know that Tagalog portalization of everyone's Internet devices by Google was broken in 2013 by Tim Harvey. You have our gratitude Sir Tim Harvey, now and always. www.dila.ph Our website contains the full copy of FILIPINO IS NOT OUR LANGUAGE Published and revised - 2007 2013 2018 https://groups.yahoo.com/neo/groups/DILA/conversations/messages Encyclicals Mit Brennender Sorge and Non Abbiamo Bisogno declare the idolatry of state-worship and fascist nationalism contrary to Christianity: Whoever exalts race, or the people, or the State, or a particular form of State, or the depositories of power, or any other fundamental value of the human community - however necessary and honorable be their function in worldly things - whoever raises these notions above their standard value and divinizes them to an idolatrous level, distorts and perverts an order of the world planned and created by God; he is far from the true faith in God and from the concept of life which that faith upholds. CRIMINAL LANGUAGE OF THE PHILIPPINES Well into the last years of his life, Ilocano journalist Pacis kept writing on anomalies in the language provision of the 1935, 1972 and 1987 constitutions. “I discovered the unpardonable deception perpetrated to ensure the place of Tagalog as the national language.” A crime was committed but who did it and for whom? Vicente Albano Pacis Daily Express, May 23, 1983 I have searched through the pertinent pages of the 11-volume Constitutional Convention Record published by the House of Representatives in 1967 and nowhere have I found any resolution duly approved by the Convention resembling the provision which became Article XIII, Section 3 of the 1935 Constitution.
    [Show full text]
  • Gender Analysis of Wikipedia Bios Text.Docx
    GENDER GAP THROUGH TIME AND SPACE: A JOURNEY THROUGH WIKIPEDIA BIOGRAPHIES AND THE “WIGI” INDEX Maximilian Klein (notconfusing.com) [email protected] Piotr Konieczny (Hanyang University) [email protected] 0 Abstract In this study we investigate how quantification of Wikipedia biographies can shed light on worldwide longitudinal gender inequality trends. We present an academic index allowing comparative study of gender inequality through space and time, the Wikipedia Gender Index (WIGI), based on metadata available through the Wikidata database. Our research confirms that gender inequality is a phenomenon with a long history, but whose patterns can be analyzed and quantified on a larger scale than previously thought possible. Through the use of Inglehart- Welzel cultural clusters, we show that gender inequality can be analyzed with regards to world’s cultures. In the dimension studied (coverage of females and other genders in reference works) we show a steadily improving trend, through one with aspects that deserve careful follow up analysis (such as the surprisingly high ranking of the Confucian and South Asian clusters). Keywords: data mining, Wikidata, Wikipedia, gender gap, demographics 1 Introduction It is an unfortunate but unavoidable fact that encyclopedias always have had a gender bias. One of the dimensions of this bias is the contributors’ gender distribution (Thomas 1992; Reagle and Rhue 2011). Just as there were very few women authors among contributors to traditional, printed encyclopedias, recent surveys indicate that women constitute only around 13%-16% of Wikipedia contributors, or Wikipedians (Glott, Schmidt, & Ghosh 2010; Hill and Shaw 2013). A detailed analysis of Wikipedia editor gender dynamics has been offered by Lam et al.
    [Show full text]