Accent Identification in the Presence of Code-Mixing

Total Page:16

File Type:pdf, Size:1020Kb

Accent Identification in the Presence of Code-Mixing Accent identification in the presence of code-mixing Thomas Niesler† & Febe de Wet‡ †Department of Electrical and Electronic Engineering, Stellenbosch University, South Africa [email protected] ‡Centre for Language and Speech Technology, Stellenbosch University, South Africa [email protected] Abstract tongue of Xhosa and of Zulu speakers can be determined when the utterance is part of a mixed code, and when it is We investigate whether automatic accent identification is not. more effective for English utterances embedded in a dif- ferent language as part of a mixed code than for English The work was motivated by the findings of two pre- utterances that are part of a monolingual dialogue. Our vious studies. In the first, the performance of language focus is on Xhosa and Zulu, two South African languages identification systems were investigated for Afrikaans, for which code mixing with English is very common. In English, Xhosa and Zulu [1]. Here it was established that order to carry out our investigation, we extract English even in mixed-code utterances consisting predominantly utterances from mixed-code Xhosa and Zulu speech cor- of English words, the language could be correctly classi- pora, as well as comparable utterances from an English- fied with approximately 70% accuracy. The second study only corpus by Xhosa and Zulu mother-tongue speakers. focussed on the identification of accents in English spo- Experiments show that accent identification is substan- ken by mother-tongue speakers of an African language, tially more accurate for the utterances originating from and for whom English is a second or third language [2]. the mixed-code speech. We conclude that accent identi- Instead of considering individual languages, a distinction fication is more successful for these utterances because was in this case made between the Nguni (Zulu, Xhosa, accents are more pronounced for English embedded in Swati, Ndebele) and Sotho (Northern Sotho, Southern mother-tongue speech than for English spoken as part of Sotho, Tswana) language families. These are the two a monolingual dialogue by non-native speakers. largest language families in South Africa, and each shares a similar vowel system. It was shown by means of both perceptual tests involving human subjects as well as au- tomatic accent identification systems that neither humans 1. Introduction nor machines were able to distinguish reliably between the Nguni and Sotho English accents. The South African constitution officially recognises eleven languages and, in practice, many more are spoken These two findings were seemingly contradictory. On by the country’s population. As is often the case in such a the one hand, reliable accent identification was not pos- multilingual society, English is usually the lingua franca. sible between Nguni and Sotho accent groups. On the South African English is consequently characterised by a other hand, Xhosa and Zulu accents could be identified large variety of accents. Automatic language and accent with fair accuracy automatically, although both are Nguni identification as well as multilingual and accent specific languages and hence strongly related. There was, how- speech recognition are therefore important aspects of the ever, a difference in the type of English utterances that development of ASR technology in the region. had been used for testing. For the Xhosa/Zulu experi- Since most citizens are multilingual, it is common ments, the English words were embedded in the respec- for South African speakers to switch among different tive African mother tongues as part of mixed codes. For languages both between utterances (code-switching) and the Nguni/Sotho experiments, data was drawn from a cor- within an utterance (code-mixing). In modern Xhosa and pus in which all words were English. Hence in this sec- Zulu, for example, it is very common to revert to English ond case, code mixing and switching did not occur. when citing numbers, dates and money amounts. In this paper we attempt to establish experimentally The research presented in this paper investigates the whether or not the performance of automatic accent- effect of such code-switching and code-mixing on the identification systems is indeed improved for mixed-code accuracy of an automatic accent-identification system. utterances produced by mother-tongue speakers Xhosa In particular, we determine how accurately the mother and of Zulu. 2. Speech databases transcriptions were subsequently corrected and validated manually by human experts. Our experiments are based on the African Speech Tech- nology (AST) corpora, which consist of recorded and an- notated South African speech collected over mobile as 3. Automatic accent identification well as fixed telephone networks [3]. For the compilation The structure of our automatic accent identification sys- of these corpora, speakers were recruited from targeted tem in is shown in Figure 1. This architecture, referred language groups and given a unique datasheet with items to as Parallel Phone Recognition followed by Language designed to elicit a phonetically diverse mix of read and Modelling (PPRLM), uses a parallel set of phone recog- spontaneous speech. The datasheets included read items nisers for explicit accent classification [2, 4, 5]. such as isolated digits, as well as digit strings, money amounts, dates, times, spellings and also phonetically- XHOSA ASR rich words and sentences. Spontaneous items included SYSTEM WX ; L( W X ) references to gender, age, mother tongue, place of resi- X dence and level of education. English speech in ZULU ASR ( unknown accent ) Corpora were compiled in five different languages, SYSTEM WZ ; L( W Z ) namely Afrikaans, English, Southern Sotho, Xhosa and Zulu. Furthermore, the accents of South African En- Multi−accent ASR system glish spoken by English, Afrikaans, Coloured, Indian and Black mother-tongue speakers were gathered separately, Figure 1: Accent-identification using parallel recognition resulting in five accent-specific English corpora. In the systems for Xhosa and Zulu. work presented here, we have made use of the Xhosa and Zulu mother-tonguecorpora, as well as the Black English In Figure 1, each recognition system includes accent- corpus. The latter consists of English spoken chiefly by specific acoustic and language models. Since we would mother-tongue speakers of Xhosa, Zulu, Southern Sotho like to determine the accent (Xhosa or Zulu) of En- (Sesotho), and Tswana (Setswana), as set out in Table 1. glish input speech, the acoustic models should ideally be The former two corpora, on the other hand, each con- trained using only English utterances by speakers of the sist of speech uttered exclusively by Xhosa and by Zulu appropriate mother tongue. However, due to the very lim- mother-tongue speakers respectively. Furthermore, al- ited size of the corpora at our disposal, and since approx- though most words were uttered in Xhosa or in Zulu, imately 45% of the words in the Xhosa and Zulu corpora mixed codes were very common in these two corpora, are in fact English due to code mixing and switching, especially for dates, times, numbers, money amounts and these two corpora have been used to train the acoustic spelled words. By identifying such predominantly En- models instead. Using the system depicted in Figure 1, glish utterances in the Xhosa and the Zulu data, and each recognition system provides a transcription W of considering comparable utterances by Xhosa and Zulu the speech utterance X in terms of its phone inventory speakers in the Black English corpus, the effect of code- and a corresponding language-specific language model. mixing on accent-identification can be studied. In addition to this transcription, each recogniser provides a likelihood L(W), which is used to identify the accent Mother tongue % of speakers of the input speech. Xhosa 23 Zulu 18 3.1. Data preparation Sesotho 23 In the following, we will refer to the Xhosa, Zulu and Tswana 32 Black English corpora described in Section 2 by the ab- Other 4 breviations XX, ZZ and BE respectively. Because the mothertongueof each speaker in the BE corpusis known, Table 1: Mother tongues of the speakers making up the it was possible to extract sub-portions of this corpus ut- Black English AST corpus. tered by Xhosa and Zulu speakers. These two corpora (XBE and ZBE respectively), were large enough to pro- Together with the recorded speech waveforms in vide testing material free of code mixing and switching each corpus, both orthographic (word-level) and pho- for the purposes of our evaluation. However, as already netic (phone-level) transcriptions were available for each pointed out in the previous section, the XBE and ZBE utterance. The orthographic transcriptions were pro- corpora were not large enough to provide training data duced and validated by human transcribers. Initial pho- for the acoustic models. Instead, these were trained using netic transcriptions were obtained from the orthography data obtained from the XX and ZZ corpora, each contain- using grapheme-to-phoneme rules for Xhosa and Zulu, ing approximately 7 hours of audio data, as indicated in and using a pronunciation dictionary for English. These Table 2. Corpus Speech No. of No. of Phone Word We will refer to the subset of utterances in the XX name (h) utts. spkrs. tokens tokens and ZZ test sets that consist entirely of English words XX 6.98 8 538 219 177 843 36 676 as XXE and ZZE respectively. These subsets of the full ZZ 7.03 8 295 203 187 249 35 568 XX and ZZ test sets are described in Table 4. The same table also shows the subsets of XBE and ZBE that consist Table 2: Training sets for the Xhosa and the Zulu (XX of the same English words found in the XXE and ZZE and ZZ) corpora, as used to train acoustic models. test sets. These subsets of the full XBE and ZBE test sets will be referred to as XXBE and ZZBE respectively.
Recommended publications
  • Southern Africa As a Phonological Area
    Max Planck Institute for Evolutionary Anthropology/Linguistics "Speaking (of) Khoisan" A symposium reviewing African prehistory 16/05/2015 Southern Africa as a phonological area Christfried Naumann & Hans-Jörg Bibiko [email protected] Quelle: Clements & Rialland ( 2008 : 37 ) Contents 1. Introduction 3-15 2. Procedure 16-19 3. Results: Kalahari Basin 20-28 4. Results: Southeastern Bantu 29-42 5. Results: Southern Africa 43-54 (6. Local and dependent features - excluded) 55-61 7. MDS and k-means 62-68 8. Summary 69 (9. Contact scenarios) 70-74 Acknowledgements 75 References 76-77 2 "Speaking (of) Khoisan", 16/05/2015 Southern Africa as a phonological area 1. Introduction Phonological similarities • large consonantal inventory (45 c.) • clicks • aspirated and ejective stops • dorsal affricate 3 "Speaking (of) Khoisan", 16/05/2015 Southern Africa as a phonological area 1. Introduction Phonological similarities • large consonantal inventory (50 c.) • clicks • aspirated, slack voiced, ejective and imploisve stops •(dorsal affricate) lateral obstruents • 4 "Speaking (of) Khoisan", 16/05/2015 Southern Africa as a phonological area 1. Introduction Phonological similarities • large consonantal inventory (68 c.) • (clicks) • aspirated, breathy and implosive stops • lateral obstruents 5 "Speaking (of) Khoisan", 16/05/2015 Southern Africa as a phonological area 1. Introduction Example: Distribution of ejectives/glottalized consonants Clements & Rialland (2008: 62) Maddieson (2013) 6 "Speaking (of) Khoisan", 16/05/2015 Southern Africa
    [Show full text]
  • Zaspil Nr. 53 – November 2010 Papers
    ZASPiL Nr. 53 – November 2010 Papers from the Workshop on Bantu Relative Clauses Laura Downing, Annie Rialland, Jean- Marc Beltzung, Sophie Manus, Cédric Patin, Kristina Riedel (Eds.) Table of Contents Laura J. Downing, Annie Rialland, Cédric Patin, Kristina Riedel Introduction ......................................................................................................... 1 Jean-Marc Beltzung, Annie Rialland & Martial Embanga Aborobongui Les relatives possessives en mbochi (C25)…………………………………….. 7 Lisa L.-S. Cheng, Laura J. Downing Locative Relatives in Durban Zulu……………………………………………. 33 Laura J. Downing, Al Mtenje The Prosody of Relative Clauses in Chewa………..…………………………. 53 Larry M. Hyman, Francis X. Katamba Tone, Syntax, and Prosodic Domains in Luganda............................................. 69 Shigeki Kaji A Comparative Study of Tone of West Ugandan Bantu Languages, with Particular Focus on the Tone Loss in Tooro …………………………………. 99 Charles W. Kisseberth Phrasing and Relative Clauses in Chimwiini .................................................. 109 Emmanuel-Moselly Makasso Processus de relativisation en bàsàa: de la syntaxe à la prosodie …………… 145 Sophie Manus The Prosody of Símákonde Relative Clauses ………………………….……. 159 Cédric Patin The Prosody of Shingazidja Relatives ……………………………………… 187 Kristina Riedel Relative Clauses in Haya ................................................................................. 211 Sabine Zerbian The Relative Clause and its Tones in Tswana ................……….............….... 227 BantuPsyn
    [Show full text]
  • [.35 **Natural Language Processing Class Here Computational Linguistics See Manual at 006.35 Vs
    006 006 006 DeweyiDecimaliClassification006 006 [.35 **Natural language processing Class here computational linguistics See Manual at 006.35 vs. 410.285 *Use notation 019 from Table 1 as modified at 004.019 400 DeweyiDecimaliClassification 400 400 DeweyiDecimali400Classification Language 400 [400 [400 *‡Language Class here interdisciplinary works on language and literature For literature, see 800; for rhetoric, see 808. For the language of a specific discipline or subject, see the discipline or subject, plus notation 014 from Table 1, e.g., language of science 501.4 (Option A: To give local emphasis or a shorter number to a specific language, class in 410, where full instructions appear (Option B: To give local emphasis or a shorter number to a specific language, place before 420 through use of a letter or other symbol. Full instructions appear under 420–490) 400 DeweyiDecimali400Classification Language 400 SUMMARY [401–409 Standard subdivisions and bilingualism [410 Linguistics [420 English and Old English (Anglo-Saxon) [430 German and related languages [440 French and related Romance languages [450 Italian, Dalmatian, Romanian, Rhaetian, Sardinian, Corsican [460 Spanish, Portuguese, Galician [470 Latin and related Italic languages [480 Classical Greek and related Hellenic languages [490 Other languages 401 DeweyiDecimali401Classification Language 401 [401 *‡Philosophy and theory See Manual at 401 vs. 121.68, 149.94, 410.1 401 DeweyiDecimali401Classification Language 401 [.3 *‡International languages Class here universal languages; general
    [Show full text]
  • Click Variation and Reacquisition in Two South African Ndebele Varieties
    CLICK VARIATION AND REACQUISITION IN TWO SOUTH AFRICAN NDEBELE VARIETIES Stephan Schulz, Antti Olavi Laine, Lotta Aunio & Nailya Philippova This article deals with click consonants in two Nguni varieties of South Africa, namely isiNdebele, or Southern Ndebele, as it is better known outside of South Africa; and Sindebele, or Northern Transvaal Ndebele. We review previous research on the topic, in which isiNdebele been described as having a some- what reduced click inventory compared to better described Nguni languages, and Sindebele has been claimed to have lost clicks completely. We also review previous research on click loss, variation, and acquisition. We then describe the current situation of both language varieties regarding clicks. For Sindebele, we observe that while clicks indeed seem to have been almost completely replaced by other consonants, some speakers still do produce clicks in isolated words, possibly as a result of recent contact with isiZulu. In isiNdebele, we find that the lateral click has been lost almost completely, while the distinction between dental and postalveolar has been lost for some speakers (with most of them preferring the dental click), whereas some speakers still maintain the distinc- tion. We propose a tentative correlation between increasing formal education in the isiNdebele language and the tendency to maintain the two clicks as distinct, but generally find the functional load of the distinction to be very low, at least on a lexical level. 1. INTRODUCTION The language varieties known as Ndebele belong to the Nguni branch of the Southern Bantu languages. All Nguni languages are spoken in South Africa, with the exception of Northern Ndebele (or Zimbabwean Ndebele), which is spoken in Zimbabwe.
    [Show full text]
  • The Value of Ubuntu in Social Work
    _____________________________________________________________________ AJSW, Volume 3, Number 1, 2013 Mugumbate, J. & Nyanguru, A. EXPLORING AFRICAN PHILOSOPHY: THE VALUE OF UBUNTU IN SOCIAL WORK Mugumbate, Jacoba and Nyanguru, Andrewb ABSTRACT This paper looks at the concept of ubuntu, how it has been applied in different fields and lessons that can be drawn for the social work profession. Ubuntu can best be described as an African philosophy that places emphasis on ‘being self through others’. It is a form of humanism which can be expressed in the phrases ‘I am because of who we all are’ and ubuntu ngumuntu ngabantu in Zulu language. So far, it has been successfully applied in theology, management and computer science. This paper concludes that its utilisation in these disciplines makes it suitable for application in social work especially to enhance practice ethics, community work and conduct in research. the paper was based on desk research. KEY TERMS: ubuntu; African philosophy; social work; humanism a Department of Social Work, Bindura University of Science Education, Zimbabwe. Contacts: [email protected], P. Bag 1020 Bindura b School of Social Work, University of Zimbabwe, Zimbabwe. Contacts: P. Bag 66022 Kopje, Harare. African Journal of Social Work, 3(1), August 2013 82 _____________________________________________________________________ _____________________________________________________________________ AJSW, Volume 3, Number 1, 2013 Mugumbate, J. & Nyanguru, A. INTRODUCTION This paper explores ubuntu and makes an attempt
    [Show full text]
  • The Design and Compilation of a Monolingual Setswana Dictionary Thapelo J
    http://lexikos.journals.ac.za Introducing Tlhalosi ya Medi ya Setswana: The Design and Compilation of a Monolingual Setswana Dictionary Thapelo J. Otlogetswe, Faculty of Humanities, University of Botswana, Gaborone, Botswana ([email protected]) Abstract: This paper presents the design elements of a recently published monolingual Setswana dictionary, Tlhalosi ya Medi ya Setswana (Otlogetswe 2012), the fourth main Setswana monolingual dictionary to appear. The paper situates the dictionary within a recent, but growing Setswana monolingual dictionary tradition whose roots may be traced to 1976 when M.L.A. Kgasa's Thanodi ya Setswana ya Dikole was published. However, before this date, the development of Setswana lexicography lies wholly in a bilingual dictionary tradition which dates back to the 1800s. The paper also discusses the different features of Tlhalosi ya Medi ya Setswana demonstrating that it possesses some dictionary features which are not found in previous Setswana dictionaries such as frequency information, phonemic transcription and extensive cross-referencing. Keywords: TLHALOSI, THANODI, DICTIONARY, SETSWANA, MONOLINGUAL Opsomming: Bekendstelling van Tlhalosi ya Medi ya Setswana: Die ont- werp en samestelling van 'n eentalige Setswanawoordeboek. Hierdie artikel bied die ontwerpbeginsels aan van 'n onlangs gepubliseerde eentalige Setswanawoordeboek, Tlhalosi ya Medi ya Setswana (Otlogetswe 2012), die vierde belangrike Setswana eentalige woordeboek wat ver- skyn het. Die artikel plaas die woordeboek binne 'n onlangse, maar groeiende Setswana eentalige woordeboektradisie waarvan die wortels teruggevoer kan word tot 1976 toe M.L.A. Kgasa se Thanodi ya Setswana ya Dikole gepubliseer is. Voor hierdie datum egter lê die ontwikkeling van die Setswana- leksikografie geheel en al in 'n tweetalige woordeboektradisie wat terugdateer tot die 1800's.
    [Show full text]
  • Culture and Customs of Botswana
    Culture and Customs of Botswana James Denbow Phenyo C. Thebe Greenwood Press Culture and Customs of Botswana Botswana. Cartography by Bookcomp, Inc. Culture and Customs of Botswana 4 JAMES DENBOW AND PHENYO C. THEBE Culture and Customs of Africa Toyin Falola, Series Editor GREENWOOD PRESS Westport, Connecticut • London Library of Congress Cataloging-in-Publication Data Denbow, James R. (James Raymond), 1946– Culture and customs of Botswana / James Denbow and Phenyo C. Thebe. p. cm. — (Culture and customs of Africa, ISSN 1530–8367) Includes bibliographical references and index. ISBN 0–313–33178–2 (alk. paper) 1. Botswana—Civilization. 2. Botswana—Social life and customs. I. Thebe, Phenyo C. II. Title. III. Series. DT2452.D46 2006 968.83—dc22 2005029680 British Library Cataloguing in Publication Data is available. Copyright © 2006 by James Denbow and Phenyo C. Thebe All rights reserved. No portion of this book may be reproduced, by any process or technique, without the express written consent of the publisher. Library of Congress Catalog Card Number: 2005029680 ISBN: 0–313–33178–2 ISSN: 1530–8367 First published in 2006 Greenwood Press, 88 Post Road West, Westport, CT 06881 An imprint of Greenwood Publishing Group, Inc. www.greenwood.com Printed in the United States of America The paper used in this book complies with the Permanent Paper Standard issued by the National Information Standards Organization (Z39.48–1984). 10 9 8 7 6 5 4 3 2 1 Copyright Acknowledgments The author and the publisher gratefully acknowledge permission for use of the following material: From Mind in the Heart of Darkness , by Hoyt Alverson.
    [Show full text]
  • Escaping the Resource Curse: the Sources of Institutional Quality In
    Florida International University FIU Digital Commons FIU Electronic Theses and Dissertations University Graduate School 11-8-2013 Escaping the Resource Curse: The ourS ces of Institutional Quality in Botswana Angela Gapa Florida International University, [email protected] DOI: 10.25148/etd.FI13120605 Follow this and additional works at: https://digitalcommons.fiu.edu/etd Part of the African Studies Commons, Comparative Politics Commons, International Relations Commons, and the Other International and Area Studies Commons Recommended Citation Gapa, Angela, "Escaping the Resource Curse: The ourS ces of Institutional Quality in Botswana" (2013). FIU Electronic Theses and Dissertations. 1019. https://digitalcommons.fiu.edu/etd/1019 This work is brought to you for free and open access by the University Graduate School at FIU Digital Commons. It has been accepted for inclusion in FIU Electronic Theses and Dissertations by an authorized administrator of FIU Digital Commons. For more information, please contact [email protected]. FLORIDA INTERNATIONAL UNIVERSITY Miami, Florida ESCAPING THE RESOURCE CURSE: THE SOURCES OF INSTITUTIONAL QUALITY IN BOTSWANA A dissertation submitted in partial fulfillment of the requirements for the degree of DOCTOR OF PHILOSOPHY in INTERNATIONAL RELATIONS by Angela Gapa 2013 To: Dean Kenneth Furton College of Arts and Sciences This dissertation, written by Angela Gapa, and entitled Escaping the Resource Curse: The Sources of Institutional Quality in Botswana, having been approved in respect to style and intellectual content, is referred to you for judgment. We have read this dissertation and recommend that it be approved. _________________________________________ Felix Martin __________________________________________ Shlomi Dinar __________________________________________ Caroline Faria __________________________________________ John F. Clark, Major Professor Date of Defense: November 8, 2013 The dissertation of Angela Gapa is approved.
    [Show full text]
  • Standard and Non-Standard African Language Varieties in the Urban Areas of South Africa. Main Report for the STANON Research Programme
    DOCUMENT RESUME ED 402 752 FL 024 304 AUTHOR Calteaux, Karen TITLE Standard and Non-Standard African Language Varieties in the Urban Areas of South Africa. Main Report for the STANON Research Programme. INSTITUTION Human Sciences Research Council, Pretoria (South Africa). REPORT NO ISBN-0-7969-1754-X PUB DATE 96 NOTE 258p. AVAILABLE FROMHSRC Publishers, 134 Pretorius Street, Pretoria 0001, South Africa. PUB TYPE Reports Descriptive (141) Books (010) EDRS PRICE MF01/PC11 Plus Postage. DESCRIPTORS *African Languages; Diachronic Linguistics; Foreign Countries; Geographic Distribution; Language Classification; *Language Role; Languages; *Language Variation; *Linguistic Borrowing; Phonology; *Standard Spoken Usage; Uncommonly Taught Languages; *Urban Areas IDENTIFIERS *South Africa ABSTRACT This report provides data on standard and non-standard African language varieties occurring in urban areas of South Africa, drawn from nine smaller reports. It illustrates the language use patterns in black urban communities and describes the language varieties spoken ir.them. It was foun impact cf non-standard varieties on the use of standard African languages is reflected clearly in their grammatical systems. Main sources 'of influence are the European languages of daily contact. Grammatical adaptation of the standard African languages, in the form of lexical adoption from foreign languages, is a major focus of the report. Implications of the study's results for African language planning and for education are discussed. Chapters address these topics: research origins, background, and methodology; theoretical bases in research on sociolinguistics, multilingualism, language change,linguistic borrowing, language variation, and language use within a speech community; language use patterns in black urban communities, both general and specific; the nature of phonological change; the nature of morphological change; syntactic change; semantic shift; language interference in the schools; language planning in this context; and implications for South African education policy and practice.
    [Show full text]
  • Makgowa, Mahaletsela, and Maburu: Traders and Travellers Before C.1820
    The African e-Journals Project has digitized full text of articles of eleven social science and humanities journals. This item is from the digital archive maintained by Michigan State University Library. Find more at: http://digital.lib.msu.edu/projects/africanjournals/ Available through a partnership with Scroll down to read the article. PULA Journal of African Studies, vol. 11, no. 1 (1997) Makgowa, Mahaletsela, and Maburu: traders and travellers before c.1820 Neil Parsons Department of History University of Botswana The Setswana word Maburu is obviously derivedfrom "Boer", but the word Makgowa (applied to other Europeans) may be an earlier word applied to some Mahaletsela traders from the east coast. This paper surveys the record of 18th and early 19th century traders and travellers between the east or south coasts, and the area around the upper Limpopo valley. Preface Both as a scholar and as an individual, Leonard Ngcongco has frequently crossed the barriers between "us" and "them". Born in a Xhosa-speaking household, he acquired Afrikaans and South Sotho, and then acquired perfectly idiomatic English and Tswana. I recall an incident in 1981 when he and I were together in a Ministry of Education working group, visiting vocational brigade centres in Botswana. Leonard Ngcongco threw himself into the task of interrogating people with his characteristic enthusiasm. At the end of one of these visits, one of the managers drew me aside. "That man," he said, "just who is he? He speaks the most beautiful Setswana, but I can't place where he comes from." Introduction The barrier between "us" and "them" (or in postmodem-speak, "the other") in Sotho- Tswana language is said to be most clearly demarcated by the mo-! ba- (singular! plural) distinction in noun prefixes given to "us" people and the le-! ma- prefixes given to "them" people.
    [Show full text]
  • A Comparison of Three Northern Sotho Dialects for Possible Inclusion Into
    A comparison of three Northern Sotho dialects for possible inclusion into the standardized writing language form. BY MOLEBOGENG LESHABANE Student no: 553149 A RESEARCH PROPOSAL SUBMITTED TO THE UNIVERSITY OF THE WITWATERSRAND IN PARTIAL FULFILMENT OF THE REQUIREMENTS OF THE DEGREE OF MASTER OF ARTS IN AFRICAN LANGUAGES AND LINGUISTICS Supervisor: Dr Kgomotso Theledi Year: 2020 DEDICATION This work is dedicated to the Kgoši ya dikgoši My God, for the gift of life, strength and courage to embark on this journey. Secondly to the only human who knows the sound of my heart beat from the inside, my beautiful daughter Ditebogo Tlhonololofatso Leshabane. Thank for choosing me to be your mother and allowing me to study for you and with you. It has been nothing, but a wonderful journey with you. …………………………. M.D Leshabane 11 September 2020 I ACKNOWLEDGEMENTS I would like to express my sincerest gratitude to God my father in heaven, for giving me the strength I needed to succeed and for never leaving me nor forsaking me. I am grateful to my supervisor, Dr Kgomotso Theledi for her undying patience and understanding. Supporting me throughout the study believing that I write and discover the best tenets about my research. Her guidance and assistance have been more than enough. Modimo a go dire a go loka mme. I would like to extend special thanks to my family, my mother Refilwe Leshabane, sister Dimakatšo Leshabane, nephew Boikgantšho Leshabane and the rest of my extended family members for your love, prayers, support, sacrifices and always believing in the beauty of my dreams.
    [Show full text]
  • Linguistic Diversity in South Africa Michel Lafon
    Linguistic diversity in South Africa Michel Lafon To cite this version: Michel Lafon. Linguistic diversity in South Africa: will a historically divisive factor become a hallmark for transformation?. 2006. halshs-00120396 HAL Id: halshs-00120396 https://halshs.archives-ouvertes.fr/halshs-00120396 Submitted on 14 Dec 2006 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Ten Years of Democratic South Africa transition Accomplished? by Aurelia WA KABWE-SEGATTI, Nicolas PEJOUT and Philippe GUILLAUME Les Nouveaux Cahiers de l’IFAS / IFAS Working Paper Series is a series of occasional working papers, dedicated to disseminating research in the social and human sciences on Southern Africa. Under the supervision of appointed editors, each issue covers a specifi c theme; papers originate from researchers, experts or post-graduate students from France, Europe or Southern Africa with an interest in the region. The views and opinions expressed here remain the sole responsibility of the authors. Any query regarding this publication should be directed to the chief editor. Chief editor: Aurelia WA KABWE – SEGATTI, IFAS-Research director. The text herein presented as well as back issues of Les Nouveaux Cahiers de l’IFAS / IFAS Working Paper Series are available on the IFAS website: www.ifas.org.za/research IFAS – Research PO Box 542 – Newtown 2113 Johannesburg – RSA Tel: +27 (0)11 836 0561 Fax: +27 (0)11 836 5850 Email: [email protected] Institut français d’Afrique du Sud, Johannesburg, 2006 (Les Nouveaux Cahiers de l’IFAS / IFAS Working Paper Series N°8).
    [Show full text]