Machine Translation from an Intercomprehension Perspective

Total Page:16

File Type:pdf, Size:1020Kb

Machine Translation from an Intercomprehension Perspective Machine Translation from an Intercomprehension Perspective Yu Chen Tania Avgustinova Department of Language Science and Technology Saarland University, Saarbrücken, Germany {yuchen, tania}@coli.uni-sb.de Abstract ent that translating between them is never an easy task. For example, all Slavic languages have rich Within the first shared task on machine trans- lation between similar languages, we present morphology, but inflections systematically differ our first attempts on Czech to Polish machine from one language to another, which makes it im- translation from an intercomprehension per- possible to have a uniform solution for translating spective. We propose methods based on the between them or to a third language. mutual intelligibility of the two languages, We chose to work on the language pair Czech- taking advantage of their orthographic and Polish from the West Slavic subgroup. In an phonological similarity, in the hope to improve intercomprehension scenario, when participants over our baselines. The translation results are evaluated using BLEU. On this metric, none of in a multilingual communication speak their na- our proposals could outperform the baselines tive languages, Czechs and Poles are able to on the final test set. The current setups are understand each other to a considerable extent, rather preliminary, and there are several poten- mainly due to objectively recognisable and subjec- tial improvements we can try in the future. tively perceived linguistic similarities. As Czech- 1 Introduction English and Polish-English translation pairs are challenging enough, this naturally motivates the A special type of semi-communication can be ex- search for direct translation solutions instead of a perienced by speakers of similar languages, where pivot setup. all participant use their own native languages and We first briefly introduce the phenomenon of in- still successfully understand each other. On the tercomprehension between Slavic languages and other hand, in countries with more than one offi- our idea how to take advantage of it for machine cial languages, even if these languages are mutu- translation purposes. The next section spreads out ally intelligible. While it is a common practice to our plans on Czech-Polish translation by exploring use English as a pivot language for building ma- the similarities and differences between the two chine translation systems for under-resourced lan- languages. Then, we explain how we organized guage pairs. If English turns out to be typolog- the experiments that lead to our submissions to the ically quite distant from both the source and the shared task. We conclude with a discussion of the target languages, this circumstance easily results translation results and an outlook. in accumulation of errors. Hence, the interesting research question is how to put the similarity be- 2 Slavic Intercomprehension tween languages into use for translation purposes in order to alleviate the problem caused by the lack Intercomprehension is a special form of multi- of data or limited bilingual resources. lingual communication involving receptive skills Slavic languages are well-known for their close when reconstructing the meaning in inter-lingual relatedness, which may be traced to common an- contexts under concrete communicative situation cestral forms both in the oral tradition and in writ- It is common practice for millions of speakers, es- ten text communication. Sharing many common pecially those of related languages. In order to in- features, including an inventory of cognate sound- terpret the message encoded in a foreign but re- meaning pairings, they are to various degrees mu- lated language, they rely on linguistic and non- tually intelligible, being at the same time so differ- linguistic elements existing for similar situations 192 Proceedings of the Fourth Conference on Machine Translation (WMT), Volume 3: Shared Task Papers (Day 2) pages 192–196 Florence, Italy, August 1-2, 2019. c 2019 Association for Computational Linguistics in their own linguistic repertoire. a bilingual linguist and cannot be easily applied Languages from the same family exhibit sys- to large amount of data. An alternative to the tematic degrees of mutual intelligibility which manual rewriting is to utilize the correspondence may be in many cases asymmetric. Czech and rules using Minimum Description Length (MDL) Polish belong to the West Slavic subgroup and re- principle (Grünwald, 2007). Most of the around lated both genetically and typologically. It could 3000 rules generated from a parallel cognate list be shown, for example, that the Poles understood of around 1000 words are not deterministic. We written Czech better than the Czechs understood use only the rules converting Czech letters that do written Polish, while the Czechs understood spo- not exist in Polish, as listed in Table1, to avoid ken Polish better than the Poles understood spo- over-transformation. ken Czech (Golubovic´, 2016). How can this be useful for machine translation? In order to tackle CZ PL the Czech-to-Polish machine translation problem Áá Aa ˇ from an intercomprehension point of view, we Ccˇ Cz cz ˇ currently focus on orthographic and phonological Dd’ D´zd´z ˇ similarities between the two languages that could Eeˇ Je je provide us with relevant correspondences in order Éé Ee to establish inter-lingual transparency and reveal Íí Ii cognate units and structures. Nˇ nˇ Nn Rˇ rˇ Rz rz 3 Approach Šš Sz sz Tt’ˇ C´ c´ 3.1 Orthographic correspondences U˚u˚ Óó Both orthographic systems are based on the Latin Vv Ww alphabet with diacritics, but the diacritical signs in Xx Ks ks the two languages are rather different. Czech has Ýý Yy a larger set of letters with diacritics, while Polish Žž Z´z´ uses digraphs more often. There are two basic di- acritical signs in Czech: the acute accent (´) used Table 1: Orthographic correspondence list to mark a long vowel and the hacekˇ (˘) in the con- sonants which becomes the acute accent for d’ and t’. The diacritics used in the Polish alphabet are 3.2 Phoneme correspondences the kreska (graphically similar to the acute accent) Czech and Polish are both primarily phonemic in the letters c,´ n,´ ó, s,´ ´z;the kreska ukosna´ (stroke) with regard to their writing system, which is re- in the letter ł. ; the kropka (overdot) in the letter z;˙ flected in the alphabets. That is, graphemes con- and the ogonek ("little tail") in the letters ˛a,˛e.The sistently correspond to phonemes of the language, Czech letters á, c,ˇ d’, é, e,ˇ ch, í, n,ˇ r,ˇ š, t’, ú, ˚u,ý, but the relation between spelling and pronuncia- ž as well as q, v, and x do not exist in Polish, and tion is more complex than a one-to-one correspon- the Polish letters ˛a, c,´ ˛e,ł, n,´ s,´ w, z˙ and ´zare not dence. In addition, Polish uses more digraphs, part of the Czech alphabet. such as ch, cz, dz, d´z,dz,˙ rz, and sz. In both lan- In a reading intercomprehension scenario, it guages, some graphemes have been merged due is natural for people to simply ignore unknown to historical reasons and at the same time some elements around graphemes that they are famil- changes in phonology have not been reflected in iar with. That is, when facing unknown alpha- spelling. bet with “foreign” diacritical signs, the reader is It is well-known, that people often try to pro- most likely to drop them and treat the respec- nounce the foreign texts in a way closer to their tive letters as the corresponding plain Latin ones. own language. Moreover, hearing the correct Experiments showed that efficiency of intercom- pronunciation sometimes helps them to infer the prehension is significantly improved if the text meaning more easily, in particular, loanwords / in- is manually transformed to mimic the spelling ternationalisms and the pan-Slavic vocabulary. in the reader’s language (Jágrová, 2016). How- To be able to make use of phonological infor- ever, such rewriting requires a huge effort from mation within a machine translation system, we 193 run fast_align (Dyer et al., 2013) on the paral- IPAcz IPApl lel corpora to obtain word alignments in both di- rections. Then, phrase pairs with less than 6 to- kens are extracted to construct a translation model Text Text cz pl based on the alignments. Weights for the fea- Figure 1: Phoneme-infused translation setup tures in the translation model are determined with the Minimal Error Rate Training (MERT) (Och, 2003). propose a multi-source multi-target structure as A byte pair encoding (BPE) (Sennrich et al., shown in Figure1, which considers the IPA tran- 2015) is applied to the training data to reduce the scription of the text as a text in a "new" closely vocabulary to 36,000 units for the NMT systems. related language. More specifically, for the trans- The first NMT system utilized only the parallel lation from Czech to Polish, the source languages data. It is a single sequence-to-sequence model of the multilingual system include ordinary Czech with single-layer RNNs in both the encoder and and IPA-transcribed Czech. So, three different the decoder. The embedding size is 512 and the translation paths are competing with each other to RNN state size is 1024. produce the final translations. The architecture of our second NMT baseline follows the architecture described in (Vaswani 4 Experiments et al., 2017). We first train a shallow model from 4.1 Data and baselines Polish to Czech with only the parallel corpora in order to translate the complete monolingual Pol- We used the provided parallel corpora Europarl, ish corpus into Czech for a synthesized parallel Wiki Titles, JRC-Acquis and the monolingual cor- corpus, which is concatenated with the original pus News Crawl 2018 for Polish. We extract ran- data to produce new training data (Sennrich et al., domly two disjoint subsets from the development 2016).
Recommended publications
  • Intercomprehension and Catalan: the Eurocom Project Esteve Clua
    You are accessing the Digital Archive of the Esteu accedint a l'Arxiu Digital del Catalan Catalan Review Journal. Review By accessing and/or using this Digital A l’ accedir i / o utilitzar aquest Arxiu Digital, Archive, you accept and agree to abide by vostè accepta i es compromet a complir els the Terms and Conditions of Use available at termes i condicions d'ús disponibles a http://www.nacs- http://www.nacs- catalanstudies.org/catalan_review.html catalanstudies.org/catalan_review.html Catalan Review is the premier international Catalan Review és la primera revista scholarly journal devoted to all aspects of internacional dedicada a tots els aspectes de la Catalan culture. By Catalan culture is cultura catalana. Per la cultura catalana s'entén understood all manifestations of intellectual totes les manifestacions de la vida intel lectual i and artistic life produced in the Catalan artística produïda en llengua catalana o en les language or in the geographical areas where zones geogràfiques on es parla català. Catalan Catalan is spoken. Catalan Review has been Review es publica des de 1986. in publication since 1986. Intercomprehension and Catalan: The EuroCom Project Esteve Clua Catalan Review, Vol. XXI, (2007), p. 151- 173 INTERCOMPREHENSION AND CATALAN: THE EUROCOM PROJECT:- ESTEVE eLUA ABSTRACT This ptper presents a method to teach intercomprehension (a strategy for simu!taneous learning of receptive capacities in languages that belong to the same linguistic family) as a means for strengthening mu!tilingualism in order to overcome communication problems generated by the growing need for interrelationships, without having to forsake language diversity. The paper introduces EuroCom,.
    [Show full text]
  • English and Translation in the European Union
    English and Translation in the European Union This book explores the growing tension between multilingualism and mono- lingualism in the European Union in the wake of Brexit, underpinned by the interplay between the rise of English as a lingua franca and the effacement of translations in EU institutions, bodies and agencies. English and Translation in the European Union draws on an interdisciplinary approach, highlighting insights from applied linguistics and sociolinguistics, translation studies, philosophy of language and political theory, while also look- ing at official documents and online resources, most of which are increasingly produced in English and not translated at all – and the ones which are translated into other languages are not labelled as translations. In analysing this data, Alice Leal explores issues around language hierarchy and the growing difficulty in reconciling the EU’s approach to promoting multilingualism while fostering monolingualism in practice through the diffusion of English as a lingua franca, as well as questions around authenticity in the translation process and the bound- aries between source and target texts. The volume also looks ahead to the impli- cations of Brexit for this tension, while proposing potential ways forward, encapsulated in the language turn, the translation turn and the transcultural turn for the EU. Offering unique insights into contemporary debates in the humanities, this book will be of interest to scholars in translation studies, applied linguistics and sociolinguistics, philosophy and political theory. Alice Leal is Senior Lecturer at the Centre for Translation Studies of the Uni- versity of Vienna, Austria. Routledge Advances in Translation and Interpreting Studies Titles in this series include: 63 English and Translation in the European Union Unity and Multiplicity in the Wake of Brexit Alice Leal 64 The (Un)Translatability of Qur’anic Idiomatic Phrasal Verbs A Contrastive Linguistic Study Ali Yunis Aldahesh 65 The Qur’an, Translation and the Media A Narrative Account Ahmed S.
    [Show full text]
  • Intercomprehension with Eurom5 Do You Already Know Spanish? Good, Now You Can Learn …
    Intercomprehension with EuRom5 Do you already know Spanish? Good, now you can learn … Elisabetta BONVINO Diego CORTÉS VELÁSQUEZ Università degli Studi “Roma Tre” Università per Stranieri di Siena [email protected] [email protected] Presentation outline 1 An introduction to IC 2 Tools/Devices for teaching IC 3 EuRom5 and IC 4 Methodology 5 Approach to reading 6 The grammar of reading 7 Advantages and limitations 8 Possible scenarios for the future Definition of IC ⤷ IC ⤷ Phenomenon Field of study Languages Individuals L2 Teaching Other IC as a phenomenon: A plurilingual communication in which speakers understand the other’s language and express themselves in their own language. IC as a teaching approach: Not an alternative to the global teaching/learning of languages. Based on: • partial competences • linguistic awareness • metalinguistic process Aim: plurilingualism IC-oriented teaching • Teaching/learning linguistically related languages • Difference between the linguistic abilities of production and comprehension. • Interest in the comparison between languages. • Existence of a dialect continuum for languages belonging to the same family that existed that still exists • Affinities between the languages of the same family can be exploited Principles • Exploitation of linguistic proximity/ similarity • Development of partial competences • Transversality : exploitation of previous knowledge • Lingusitic process that allows the development of comprehension strategies • IC preserves multilingualism and promotes plurilingualism Evoluzione + Evolution of IC EuRom. Can be considered Beginning: reading a starting point to be comprehension integrated with other methodologies. News IC [i.e. chat] towards Galanet written and oral interaction. More complex Oral comprehension Some projects Work in progress Evolution of IC Phenomenon It’s not a new concept: as a communicative practice it has always existed.
    [Show full text]
  • Rhetoric and Communications, Issue 4, July 2012
    Rhetoric and Communications, Issue 4, July 2012 Todor Shopov Intercomprehension Analysis Abstract: The paper explores the topic of Intercomprehension Analysis as the theoretical study of the phenomenon or basic research on receptive language behavior. The practical applications of the inquiry into Intercomprehension are referred to as Applied Intercomprehension research and development. Keywords: Intercomprehension, Modular Intercomprehension Model, communicative language competence. Acknowledgement: This paper is a revised version of Chapter 11 in "Understanding Babel": an Essay in Intercomprehension Analysis" available at http://www.amazon.com/Understanding-Babel- Essay-Intercomprehending-Analysis/dp/9540717787. INTRODUCTION The study of the phenomenon of Intercomprehension may take either a theoretical or a practical orientation. We must make this distinction because Intercomprehension Analysis is a discipline in its own right. In the area of languages, it can be compared, and contrasted for that matter with Language Teaching Methodology, Translation and Interpreting, Language Typology, etc. I shall take the term Intercomprehension Analysis to denote the theoretical study of the phenomenon or basic research. The practical applications of the inquiry into Intercomprehension will be referred to as Applied Intercomprehension research and development. In fact, Franz-Joseph Meißner has established the “didactics of multilingualism” (Mehrsprachigkeitsdidaktik) as an independent discipline, (http://www.uni-giessen.de/~gb1041/institutpersonen/meissner ) [1]. So I shall consider all issues related to the teaching, studying and learning of Intercomprehension as belonging to the field of Applied Intercomprehension. For example, the study of the methods of raising language learners awareness of Intercomprehension will be regarded as a type of applied research (the ILTE project for example deems awareness-raising as “the most essential facet of Intercomprehension”) [7].
    [Show full text]
  • Knowledge of and Attitudes Towards Multilingualism in the Classroom
    Jahrgang 25, Nummer 1 (April 2020), ISSN 1205-6545 25 Jahre ZIF – ein Grund zum Feiern! Themenschwerpunkt: Empirische Zugänge zu Mehrsprachigkeit und Sprachenlernen Practical tools for a pluralistic approach: knowledge of and attitudes towards multilingualism in the classroom Sabrina Colombo, Joanna Barrett, Dana Engel This contribution focuses on didactic resources developed with the aim to provide appropriate tools for dealing with the ‘new multilingualism’ in the schools of South Tyrol. These tools consist of a travelling exhibition and eight workshops developed and created within the project One School, Many Languages (SMS). Each workshop includes a number of activities and brings into focus a specific aspect of multilingualism (etymology, intercomprehension, etc.). This paper presents the perspective of students aged 11 to 14 (middle school) on the activities, thus offering an alternative insight into the implementation and practice of plurilingualism and intercultural awareness in education. Dieser Beitrag konzentriert sich auf die didaktischen Ressourcen, die mit dem Ziel entwickelt wurden, geeignete Instrumente für den Umgang mit der ‚neuen Mehrsprachigkeit‘ in den Südtiroler Schulen bereitzustellen. Diese Tools bestehen aus einer Wanderausstellung und acht Workshops, die im Rahmen des Projekts One School, Many Languages (SMS) entwickelt und erstellt wurden. Jeder Workshop umfasst eine Reihe von Aktivitäten und fokussiert einen spezifischen Aspekt der Mehrsprachigkeit (Etymologie, Interkomprehension, etc.). Dieser Beitrag stellt die Perspektive der 11- bis 14-Jährigen (Mittelschule) auf die Aktivitäten dar und bietet so einen alternativen Einblick in die Umsetzung und Praxis der Mehrsprachigkeit und des interkulturellen Bewusstseins in der Bildung. Keywords: didactic resources, language awareness, multilingualism, South Tyrol Colombo, Sabrina; Barrett, Joanna & Engel, Dana (2020), Practical tools for a pluralistic approach: knowledge of and attitudes towards multilingualism in the classroom.
    [Show full text]
  • THE BRETON of the CANTON of BRIEG 11 December
    THE BRETON OF THE CANTON OF BRIEC1 PIERRE NOYER A thesis submitted in fulfilment of the requirements for the degree of Doctor of Philosophy Celtic Studies Program Faculty of Arts and Social Sciences The University of Sydney 2019 1 Which will be referred to as BCB throughout this thesis. CONCISE TABLE OF CONTENTS DETAILED TABLE OF CONTENTS ............................................................................................ 3 DEDICATION ............................................................................................................................... 21 ABBREVIATIONS USED IN THIS WORK ................................................................................. 24 1. INTRODUCTION ...................................................................................................................... 29 2. PHONOLOGY ........................................................................................................................... 65 3. MORPHOPHONOLOGY ......................................................................................................... 115 4. MORPHOLOGY ....................................................................................................................... 146 5. SYNTAX .................................................................................................................................... 241 6. LEXICON.................................................................................................................................. 254 7. CONCLUSION.........................................................................................................................
    [Show full text]
  • Intercomprehension
    Intercomprehension Exploring its usefulness for DGT, the Commission and the EU i Table of contents 1. Introduction ............................................................................................................... 1 1.1. Why a study on intercomprehension?..................................................... 1 1.2. Defining intercomprehension...................................................................... 3 1.3. The historical background ............................................................................ 5 1.4. Intercomprehension today........................................................................... 6 2. Intercomprehension in language teaching .................................................... 9 2.1. Intercomprehension in learning romance languages...................... 11 3. Advantages of intercomprehension in society ........................................... 16 3.1. Intercomprehension in everyday life ..................................................... 16 3.2. Institutionalised intercomprehension .................................................... 17 3.2.1. Nordic Cooperation and the Nordic Council..................................... 17 3.2.2. The International Criminal Tribunal for the Former Yugoslavia. 18 3.2.3. Intercomprehension in the Dutch-German border area. ........... 18 3.2.4. The Swiss model ........................................................................................ 20 4. Advantages of intercomprehension in the private sector...................... 22 5. The role of
    [Show full text]
  • Intercomprehension Analysis: Innovative Work in the Area of Languages
    Ïðàâîâà ñèñòåìà Óêðà¿íè é ì³æíàðîäíå ïðàâî, ïîð³âíÿëüíå ïðàâîçíàâñòâî TODOR K. SHOPOV Todor K. Shopov, professor of the Sofia University (Bulgaria) INTERCOMPREHENSION ANALYSIS: INNOVATIVE WORK IN THE AREA OF LANGUAGES Introduction Intercomprehension Analysis is a new field of study in the area of languages. It is changing the language edu- cation landscape right before our eyes. The reform is occurring at an opportune time because twenty-first-century issues, todays new ways of thinking, call for new orientations in language teaching methodology. This paper explores innovative work done under three European projects Exchange to Change (19982001), EU&I (20022005), and Intercom (20062009). Chronologically, they span over a decade, starting in 1997 and con- tinuing to present. Together with other European projects, they share best practices in the thematic domain con- cerned as partners in the thematic network on Intercomprehension, Redinter (www.redinter.eu ). The experience of the team of scholars, participating in the projects, is worth looking into because it represents the development of ideas in the study of Intercomprehension. The paper outlines this progress in three stages, starting with the state of «awareness» (Intercomprehension Awareness), passing through a stage of «analysis» (Intercomprehension Analysis) and ending with an attempt to «apply» the new knowledge and understanding in language teaching methodology (Applied Intercomprehension). Borrowing the famous terminology, I would describe them as the «sen- sorimotor period», «peroperational period» and «operational period». All these stages are of interest because they serve to identify achievement, to disseminate benefits and to launch the results into the mainstream. The phrase «research and development» (R&D) needs defining.
    [Show full text]
  • Surprisal in Intercomprehension
    Surprisal in Intercomprehension Tania Avgustinova Saarland University, Department of Language Science and Technology [email protected] Abstract. Empirically, mutual intelligibility of closely related languages relies on similarities measurable as linguistic distances, which are characteristically symmetric for a given language pair. This contribution illustrates the potential of an information-theoretic modelling of Slavic intercomprehension, particularly in capturing asymmetries observable in receptive multilingualism. Keywords: Slavic languages, receptive multilingualism, linguistic distance, surprisal, language modelling, web-based experiments. 1 Background A large-scale interdisciplinary research collaboration at Saarland University1 (Crocker et al. 2016) investigates the hypothesis that language use may be driven by the optimal utilization of the communication channel. The information-theoretic concepts of en- tropy (Shannon, 1949) and surprisal (Hale 2001; Levy 2008) have gained in popularity due to their potential to predict human linguistic behavior. The underlying assumption is that there is a certain total amount of information contained in a message, which is distributed over the individual units constituting it. Capturing this distribution of infor- mation is the goal of surprisal-based modeling with the intention of predicting the pro- cessing effort experienced by humans upon encountering these units. The ease of pro- cessing linguistic material is thus correlated with its contextually determined predicta- bility, which may be appropriately indexed by Shannon’s notion of information. Multilingualism pervasiveness suggests that human language competence is used quite robustly, taking on various types of information and employing multi-source com- pensatory and guessing strategies. While it is not realistic to require from every single person to master several languages, it is certainly beneficial to strive and promote a significantly higher degree of receptive skills facilitating the access to other languages.
    [Show full text]
  • Language Awareness and Multilingualism: a Historical Overview
    © Springer International Publishing AG 2017 Jasone Cenoz, Durk Gorter and Stephen May Language Awareness and Multilingualism Encyclopedia of Language and Education 3rd ed. 10.1007/978-3-319-02325-0_1-2 Accepted Manucript; S. 3-17 Language Awareness and Multilingualism: A Historical Overview Claudia Finkbeiner1 and Joanna White2 (1)Faculty of Humanities, Department of English, University of Kassel, Kurt-Wolters-Straße 5, 34125 Kassel, Germany (2)Department of Education, Concordia University, 1455 de Maisonneuve Blvd. W., H3G 1M8 Montreal, QC, Canada Claudia Finkbeiner (Corresponding author) Email: [email protected] Joanna White (Corresponding author) Email: [email protected] Abstract In this encyclopedia entry, language awareness and multilingualism will be reviewed from a historical perspective, and current research trends will be highlighted. There is not a direct relationship between both movements from the beginning; rather there have been parallel (independent) developments. With the world becoming increasingly interconnected, multilingualism and diversity have turned into an everyday experience in various domains of life. While the field of language awareness initially concentrated on English L1 teaching, its scope and principles nowadays are much broader, including a wide variety of languages and contexts, such as L2 and L3 learning; bi-, tri-, and multilingual education; content- and language-integrated learning; and computer-/mobile-assisted learning. An examination of the articles published in the Language Awareness journal and the International Journal of Multilingualism between 2010 and the beginning of 2015 revealed that both language awareness and multilingualism encompass a wide range of topics and languages. On the whole, however, the primary focus still seems to be on English and language education, with communication in other languages and in the workplace remaining relatively unexplored.
    [Show full text]
  • Aka As a Contact Language: Sociolinguistic
    AKA AS A CONTACT LANGUAGE: SOCIOLINGUISTIC AND GRAMMATICAL EVIDENCE The members of the Committee approve the masters thesis of Daniel Joseph Duke Donald A. Burquest _____________________________________ Supervising Professor Thomas N. Headland _____________________________________ Carol V. McKinney _____________________________________ Copyright © by Daniel Joseph Duke 2001 All Rights Reserved To the Bayaka people and their friends may our circle be unbroken AKA AS A CONTACT LANGUAGE: SOCIOLINGUISTIC AND GRAMMATICAL EVIDENCE by DANIEL JOSEPH DUKE Presented to the Faculty of the Graduate School of The University of Texas at Arlington in Partial Fulfillment of the Requirements for the Degree of MASTER OF ARTS IN LINGUISTICS THE UNIVERSITY OF TEXAS AT ARLINGTON August 2001 ACKNOWLEDGMENTS The road to finishing this thesis has been a long one, and many people helped me keep going to the end. Some special people helped carry the burden, and some nearly had to carry me. I want to thank everyone who had a part in seeing the work through. Thanks to family and friends who supported and encouraged me. There are some special friends who were very directly involved with the writing of the thesis, and who I’d like to thank particularly. My committee consisted of Dr. Donald A. Burquest, Dr. Thomas N. Headland, and Dr. Carol V. McKinney. Each member put in a tremendous amount of time and effort into the redaction of the text, and each one contributed greatly from his or her specialization: Dr. Burquest in linguistics and African languages, Dr. McKinney in anthropology and African languages, and Dr. Headland in anthropology and hunter-gatherers. The thesis could not have been as ambitious as it was without the dedicated work and wide area or expertise these scholars brought into it.
    [Show full text]
  • Lexicography in Gabon: a Survey1 B
    Lexicography in Gabon: A Survey1 B. Nyangone Assam and P.A. Mavoungou, Department of Afrikaans and Dutch, University of Stellenbosch, Stellenbosch, Republic of South Africa Abstract: This paper traces the historical development of lexicography in Gabon. Gabon, like most African countries, is multilingual. The recent inventories of languages spoken in Gabon are those established by Jacquot (1978) and Kwenzi-Mikala (1998). According to Kwenzi-Mikala (1997), there are 62 speech forms divided into 10 language groups or language-units in Gabon. These speech forms co-exist with French, the official language. In fact, in article 2 of paragraph 8 of the revised Constitution of 1994 the following can be read: "The Gabonese Republic adopts French as the official language. Furthermore, she endeavours to protect and promote the national languages." This constitutional arrangement naturally makes French the language used in education, administration and the media. The survey of lexicography in Gabon that is presented here includes the linguistic situation in and the language policy of Gabon, the lexicographic survey itself, as well as the lexicographic needs of the different speech forms (including languages and dialects). Initially, the pioneers of Gabonese lexicography were missionaries or colonial administrators. Very little was done in this field by the Gabonese themselves. Although credit is to be given to these early works, there are a number of shortcomings regarding the linguistic as well as the metalexicographic contents of dictionaries and lexicons produced during this period. In fact, the main weak point of those studies was the lack of tones in the written transcription of oral productions and orthographic problems.
    [Show full text]