Comparing Two Techniques for Learning Transliteration Models Using a Parallel Corpus

Total Page:16

File Type:pdf, Size:1020Kb

Comparing Two Techniques for Learning Transliteration Models Using a Parallel Corpus Comparing Two Techniques for Learning Transliteration Models Using a Parallel Corpus Hassan Sajjad Nadir Durrani Helmut Schmid Alexander Fraser Institute for Natural Language Processing University of Stuttgart sajjad,durrani,schmid,fraser @ims.uni-stuttgart.de { } Abstract overlapping words will help to bridge the script- We compare the use of an unsupervised ing gap between Hindi and Urdu. The remaining transliteration mining method and a rule- words must be converted into the other language based method to automatically extract lists with a bilingual dictionary which is beyond the of transliteration word pairs from a par- scope of this work. allel corpus of Hindi/Urdu. We build In this paper, the term transliteration pair refers joint source channel models on the auto- to a word pair where the words are translitera- matically aligned orthographic transliter- tions of each other and the term transliteration ation units of the automatically extracted unit refers to a character pair where the charac- lists of transliteration pairs resulting in two ters are transliterations of each other. We are transliteration systems. We compare our interested in building joint source channel mod- systems with three transliteration systems els for transliteration. Because we do not have a available on the web, and show that our list of transliteration pairs to use as training data systems have better performance. We per- in building such a transliteration model, we use form an extensive analysis of the results two methods to extract the list of transliteration of using both methods and show evidence pairs from a parallel corpus of Hindi/Urdu. The that the unsupervised transliteration min- first method uses the transliteration mining algo- ing method is superior for applications rithm of Sajjad et al. (2011) to automatically ex- requiring high recall transliteration lists, tract transliteration pairs. This approach does not while the rule-based method is useful for use any language specific knowledge. The sec- obtaining high precision lists. ond method uses handcrafted transliteration rules specific to the mapping between Hindi and Urdu 1 Introduction to extract transliteration pairs. We automatically Urdu and Hindi are closely related languages align the two lists of extracted transliteration pairs which have a similar phonological, semantic and at the character level and learn two transliteration syntactic structure. Hindi is derived from San- models. We compare the results with three other skrit and Urdu is a mixture of Persian, Arabic, transliteration systems. Both of our transliteration Turkish and Sanskrit. Both share closed class vo- systems perform better than the other systems. cabulary which they inherit from Sanskrit. They The 1-best output of the transliteration system differ however in the open class vocabulary and built on the list extracted using the rule-based in the writing script used. Hindi is written in method is better than the 1-best output of the sys- Devanagari script and borrows most of the open tem built on the automatically extracted list. The class vocabulary from Sanskrit. Urdu is written rule-based extraction method is focused on obtain- in Perso-Arabic script and borrows most of the ing a high precision list as compared to the auto- open class vocabulary from Persian, Arabic, Turk- matic method which obtains a higher recall list. ish and Sanskrit. Both languages have lived to- The 10-best and 20-best output of the transliter- gether for centuries and now share a large part ation system built on the automatically extracted of their vocabulary with each other. In an initial list is better than the N-best outputs of the sys- study on a small parallel corpus, we found that tem built on the list extracted using the rule-based both languages share approximately 82% (tokens) method. The wide coverage of transliteration and 62% (types) of the vocabulary. Transliterating units in the automatically extracted list helps the 129 Proceedings of the 5th International Joint Conference on Natural Language Processing, pages 129–137, Chiang Mai, Thailand, November 8 – 13, 2011. c 2011 AFNLP transliteration system to produce difficult translit- erations which are hard to learn using the rule- based list. The transliteration task between Hindi and Urdu is non-trivial. The missing short vowels in the writing of Urdu and a missing short vowel in the writing of Hindi are a particular problem, and we Table 1: Ambiguous Hindi characters (charac- identify other areas of difficulty. We provide a de- ters which can transliterate to many different Urdu tailed error analysis to account for the complexi- characters) ties in Hindi to Urdu transliteration motivated by netic Alphabets) and X-SAMPA1 to develop a linguistic phenomena. phoneme-based mapping scheme between Urdu The paper is organized as follows. Previous and Hindi (J C. Wells, 1995). work on transliteration is summarized in Section Malik et al. (2008) reported an accuracy of 2. The two methods used to extract lists of translit- 97.9% for transliterating Hindi to Urdu. How- eration pairs are described in Section 3. The joint ever, this number is not comparable to ours. Some probability model for transliteration is explained Hindi characters can be ambiguously transliterated in Section 4. The evaluation and the results in to several Urdu characters (see Table 1). Malik et comparison with three other transliteration sys- al. (2008) do not deal with these ambiguous char- tems are presented in Section 5. A detailed dis- acters and count any occurrence of an ambiguous cussion and error analysis is presented in Section character as a correct transliteration in all scenar- 6. Section 7 concludes. ios. We discuss this further in Section 6. In the previous work, a transliteration system 2 Previous Work is built on transliteration units learned either au- Transliteration can be done with phoneme-based tomatically from a list of transliteration pairs (Li or grapheme-based models. Knight and Graehl et al., 2004), (Pervouchine et al., 2009) or using a (1998), Stalls and Knight (1998), Al-Onaizan and heuristic-based method (Ekbal et al., 2006). We do Knight (2002) and Pervouchine et al. (2009) use not have a list of transliteration pairs for the train- the phoneme-based approach for transliteration. ing of our Hindi to Urdu transliteration system. Kashani et al. (2007) and Al-Onaizan and Knight Therefore we use two methods to extract transliter- (2002) use a grapheme-based model to translit- ation pairs from parallel data of Hindi/Urdu. In the erate from Arabic into English. Al-Onaizan and first approach, we use the transliteration mining al- Knight (2002) compare a grapheme-based ap- gorithm proposed by Sajjad et al. (2011) to extract proach, a phoneme-based approach and a linear transliteration pairs. This method does not use any combination of both for transliteration. They build language dependent information. In the second a conditional probability model. The grapheme- approach, we use a rule-based method to extract based model performs better than the phoneme- transliteration pairs. Both processes are imperfect, based model and the hybrid model. This motivates meaning that there is noise in the extracted list of our use of grapheme-based models. transliteration pairs. We build a joint source chan- nel model as described by Li et al. (2004) and Ek- In this paper, we use a grapheme-based ap- bal et al. (2006) on the extracted list of translitera- proach for transliteration from Hindi to Urdu. The tion pairs. The following sections describe the two phoneme-based approach would involve the con- mining approaches and the model in detail. version of Hindi and Urdu text into a phonemic representation which is not a trivial task as the 3 Extraction of Transliteration Pairs short vowel ‘a’ is not written in Hindi text and no short vowels are written in Urdu text. The diffi- We automatically word-align the parallel corpus culty of this additional step would be likely to lead and extract a word list, later referred to as “list of to additional errors. word pairs“ (see Section 5, for details on training Malik et al. (2008) and Malik et al. (2009) data). We use two methods to extract translitera- work on transliteration from Hindi to Urdu and tion pairs from the list of word pairs. In the first Urdu to Hindi respectively. They use the rules 1SAMPA and XSAMPA are used to represent the IPA of SAMPA (Speech Assessment Methods Pho- symbols using 7-bit printable ASCII characters. 130 approach, we automatically extract transliteration iteration which best predicts the held-out data is pairs using the transliteration mining algorithm as selected as the stopping iteration for the transliter- proposed in Sajjad et al. (2011). We align the ation mining algorithm. transliteration pairs at character level using a char- We first ran Algorithm 2 on the list of word pairs acter aligner. In the second approach, we use an for 100 iterations. It returned the 45th iteration as edit distance metric and handcrafted equivalence the best stopping iteration for Algorithm 1. Then rules to extract transliteration pairs from a paral- we ran Algorithm 1 for 45 iterations and obtained lel corpus. We align the list of transliteration pairs a list of 2245 transliteration pairs. Due to data at character level using the edit distance metric. sparsity, there were two Hindi characters which The transliteration system is then trained on these were missing in the extracted list of transliteration character aligned transliteration pairs which is de- pairs. We could either add complete word ex- scribed in Section 5. The following subsections amples or just transliteration units of the missing describe the extraction methods in detail. Hindi characters to the list of transliteration pairs. Adding examples will provide context information 3.1 Automatic Extraction of Transliteration which may bias the results of the evaluation.
Recommended publications
  • Practical Transcription and Transliteration: Eastern-Slavonic View 35-56
    GOVOR 32 (2015), 1 35 Pregledni rad Rukopis primljen 20. 4. 2015. Prihvaćen za tisak 25. 9. 2015. Maksym O. Vakulenko [email protected] Ukrainian Lingua‐Information Fund, Kiev Ukraine Practical transcription and transliteration: Eastern‐Slavonic view Summary This article discusses basic transcripition approaches of foreign and borrowed words in Ukrainian, Russian, and Belarusian; Ukrainian words in Latin script. It is argued that the adopted and foreign words should be rendered on different bases, namely by invariant transcription and transliteration. Also, the current problems of implementation of the Ukrainian Latinics as an international graphical presentation of Ukrainian, are analyzed. The scholarly grounded simple-correspondent transliteration system for Belarusian, Russian, and Ukrainian, is given in the paper. Key words: transcription, transliteration, Cyrillic script, latinization, foreign words 36 M. O. Vakulenko: Practical transcription and transliteration: Eastern-Slavonic view 35-56 1. INTRODUCTION Spelling of the words coming from another language is perhaps the most controversial issue in linguistics, so it is important to find a consistent scholarly approach to their proper rendering. There are two basic ways to do so: transcription and transliteration. Professionals should be able to render (transcribe) sounds, that is to know the "physics" (acoustics) of language. They need also to record the letters (phonemes) correctly – "literate" and transliterate – so to master the language "algebra". The subtleties of both approaches
    [Show full text]
  • Strategies Used in Translating Into English Semiotic Signs in Hajj and Umrah Guides
    I An-Najah National University Faculty of Graduate Studies Strategies Used in Translating into English Semiotic Signs in Hajj and Umrah Guides By Ahmad Saleh Shayeb Supervisor Dr. Sameer Al-Issa Co-Supervisor Dr. Ruqayyah HerzAllah This Thesis is Submitted in Partial Fulfillment of the Requirements of the Master Degree of Applied Linguistics and Translation, Faculty of Graduate Studies, An-Najah National University, Nablus, Palestine. 2016 III Dedication To whom I proudly belong to, the Islamic nation whether they are Arab or non-Arab. To those who are deprived from their human rights and look for peace and justice. To whom those I am indebted for ever my mother and my late father (may Allaah have on mercy him). To my brothers, sisters and to all my relatives. To everyone who has done me a favor to pursue my education, and for whom I feel unable to express my great gratitude for their precious contribution to finalize this thesis, this work is dedicated. IV Acknowledgement It is a great moment in my life to highly appreciate my supervisors endurance to help in accomplishing this thesis, especially Dr. Sameer El-Isa and Dr. Ruqayyah Herzaallah whose valuable feedback and comment paved the way for me to re- evaluate my work and develop my ideas in a descriptive and an analytical way. They were open-minded and ready to answer my inquiries about any information and they were also assiduous to give advice and direct the work to achieve fruitful results. I am also deeply thankful to Dr. Nabil Alawi who has eminently participated in achieving this project and who is always ready to support me in many respects during working on this project.
    [Show full text]
  • Domestication and Foreignisation in Dubbing and Subtitling of Duncan Jones‟ English Movie Warcraft Into Persian [PP: 162-170] Dr
    Domestication and Foreignisation in Dubbing and Subtitling of Duncan Jones‟ English Movie Warcraft into Persian [PP: 162-170] Dr. Razieh Eslamieh Nillofar Javankhah Islamic Azad University, Parand Branch Iran ABSTRACT The present paper studies diverse procedures related to Venuti‟s strategies of domestication and foreignisation in Farsi dubbing and subtitling of the English movie, Warcraft directed by Duncan Jones. The procedures of both domestication and foreignisation were studied and statically analysed for the purpose of exploring the film translation method (dubbing or subtitling) which is closer to target- language-culture and the one which is closer to source-language-culture. In other words it was intended to explore which translation strategy (domestication or foreignisation) dominates dubbing and which one dominates subtitling. The tertiary purpose was to compare the reasons of differences in dubbing versus subtitling on the one hand, and the reasons of differences of the target text from the source text. The statistical analysis revealed that in dubbing, cultural equivalence is the most frequently used procedure (38.26%) apparently for making the movie visible for the public Iranian audience and adjust the movie to cultural considerations. Henceforth, dubbing orients to domestication. However, subtitling, with literal translation as the most frequently used procedure (57.4%), orients to foreignisation. In dubbing of the movie, most differences are related to cultural equivalence (38.26%) and the literal translation (29.56%) is in the next step. An interesting point is that the procedure of calque is neither used in subtitling nor in dubbing. In subtitling, cultural equivalence stands in the second place (17.34) and explanation (9.50%) occupies the third place.
    [Show full text]
  • Transliteration: What,S the Message?
    7 Transliteration: What,s the Message? Elizabeth A. Winston INTRODUCTION Transliteration is a specific form ol sign language interpreting. It is the process of changing one fonn of an English message, either spoken English or signed English, into the other form. Interpreting, in contrast, refers either to the general process of changing the form of a message to another form, or to the specific process of changing an English message to Ameri- can Sign Language (ASL), or vice versa. The assumption in transliteration is that both the spoken and the signed forms correspond to English, the spoken form following the rules of standard English and the signed form being a simple recoding of the spoken form into a manual mode of ex- pression. The guidelines for the spoken fonn are relatively clear. It is the signed form that lacks any sort of standardization at the level of systematic recoding of spoken utterances. Indeed, the signed forms themselves are variously referred to as Pidgin Signed English, Manually Coded English, and even foreigner talk.1 1These terms represent a few of the terms used to describe the contact varieties of signing and speaking (or mouthing without voice) that are used when deal people who rely on signing and hearing people who rely on speaking wish to communicate. Pidgin Signed English (PSE) is discussed by many authors, including Marmor and Pettito (1979), Manually Coded English (MC£) refers to forms of signing that encode various formal features oI spoken English in manual signs. 1bese features are generally morphemic: copula, tense agreement, inllectional and derivational morphemes, as well as root morphemes of English.
    [Show full text]
  • A Literal Translation & Transliteration
    the ramaic NAew ovenant A Literal TranslationO & Transliteration Library of Congress Catalog Number 96Ñ85038 ISBN 0 9631951 6 6 © 1996 by Herb Jahn, Exegete exeGeses BIBLES A NonÑProfit Corporation PO Box 1776 ¥ Orange CA ¥ 92668 800 9 BIBLE 9 714 835-1705 This Aramaic New Covenant is the Peculiar Possession of presented this day of the month in the year of our Lord Yah Shua the Meshiah on the occasion of by CONTENTS of the Aramaic New Covenant CDRom Book/Section CDRom File Page Presentation 2 Prologue 4 Textual Criticism 6 Infomercial 8 Yah Chanan (John) 12 Mathai (Matthew) 87 Marqaus (Mark) 181 Luqa (Luke) 241 Acts 342 Romaya (Romans) 440 1 Qurintaus (Corinthians) 481 2 Qurintaus (Corinthians) 520 Galataya (Galatians) 544 Ephesaya (Ephesians) 557 Philipisaya (Philippians) 570 Qulasausi (Colossians) 579 1 Thesalauniqaya (Thessalonians) 588 2 Thesalauniqaya (Thessalonians) 596 1 Tima Theaus (Timothy) 601 2 Tima Theaus (Timothy) 612 Titaus (Titus) 620 Philimaun (Philemon) 625 Hebraya (Hebrews) 627 Yaaqub (James) 656 1 Petraus (Peter) 665 2 Petraus (Peter) 675 Yah Hud (Jude) 682 1 Yah Chanan (John) 685 2 Yah Chanan (John) 695 3 Yah Chanan (John) 697 Manifestation (Revelation) 699 Word Summaries 746 NOTE: Yah Chanan is placed first because it is the genesis of the Evangelisms. This also allows Acts to follow Luqa in their proper sequence. Placing Yah Hud in front of the Epistles of Yah Chanan, keeps the final scribings of Yah Chanan together at the finality. PROLOGUE Welcome, Dear Friend, to the Aramaic New Covenant — the first and only literal translation and transliteration of the New Covenant — translated directly from the language of our Lord Yah Shua the Meshiah and his apostles.
    [Show full text]
  • Inventory of Romanization Tools
    Inventory of Romanization Tools Standards Intellectual Management Office Library and Archives Canad Ottawa 2006 Inventory of Romanization Tools page 1 Language Script Romanization system for an English Romanization system for a French Alternate Romanization system catalogue catalogue Amharic Ethiopic ALA-LC 1997 BGN/PCGN 1967 UNGEGN 1967 (I/17). http://www.eki.ee/wgrs/rom1_am.pdf Arabic Arabic ALA-LC 1997 ISO 233:1984.Transliteration of Arabic BGN/PCGN 1956 characters into Latin characters NLC COPIES: BS 4280:1968. Transliteration of Arabic characters NL Stacks - TA368 I58 fol. no. 00233 1984 E DMG 1936 NL Stacks - TA368 I58 fol. no. DIN-31635, 1982 00233 1984 E - Copy 2 I.G.N. System 1973 (also called Variant B of the Amended Beirut System) ISO 233-2:1993. Transliteration of Arabic characters into Latin characters -- Part 2: Lebanon national system 1963 Arabic language -- Simplified transliteration Morocco national system 1932 Royal Jordanian Geographic Centre (RJGC) System Survey of Egypt System (SES) UNGEGN 1972 (II/8). http://www.eki.ee/wgrs/rom1_ar.pdf Update, April 2004: http://www.eki.ee/wgrs/ung22str.pdf Armenian Armenian ALA-LC 1997 ISO 9985:1996. Transliteration of BGN/PCGN 1981 Armenian characters into Latin characters Hübschmann-Meillet. Assamese Bengali ALA-LC 1997 ISO 15919:2001. Transliteration of Hunterian System Devanagari and related Indic scripts into Latin characters UNGEGN 1977 (III/12). http://www.eki.ee/wgrs/rom1_as.pdf 14/08/2006 Inventory of Romanization Tools page 2 Language Script Romanization system for an English Romanization system for a French Alternate Romanization system catalogue catalogue Azerbaijani Arabic, Cyrillic ALA-LC 1997 ISO 233:1984.Transliteration of Arabic characters into Latin characters.
    [Show full text]
  • Translation and Film: Slang, Dialects, Accents and Multiple Languages Allison M
    Comparative Humanities Review Volume 3 Translation: Comparative Perspectives Article 1 (Spring 2009) 2009 Translation and Film: Slang, Dialects, Accents and Multiple Languages Allison M. Rittmayer Bucknell University Follow this and additional works at: http://digitalcommons.bucknell.edu/chr Recommended Citation Rittmayer, Allison M. (2009) "Translation and Film: Slang, Dialects, Accents and Multiple Languages," Comparative Humanities Review: Vol. 3, Article 1. Available at: http://digitalcommons.bucknell.edu/chr/vol3/iss1/1 This Article is brought to you for free and open access by Bucknell Digital Commons. It has been accepted for inclusion in Comparative Humanities Review by an authorized administrator of Bucknell Digital Commons. For more information, please contact [email protected]. Translation and Film: Slang, Dialects, Accents and Multiple Languages Allison M. Rittmayer Bucknell University The birth of the cinema was initially regarded with great promise as a universal method of communication. This was partially true in the era of silent films as there was no need for translation before the introduction of inter-titles. The images filmed may have contained distinct cultural markers, thus rendering them somewhat foreign to spectators outside of the source culture; however, these markers could be absorbed in the way a painting is absorbed. Without linguistic intrusion, it was possible for spectators of foreign films to simply identify characters in regards to their appearance. This identification could also be made easier if the spectator knew what culture the film was coming from, in the way that paintings are understood by virtue of the culture that produced them. More often than not though, early silent films portrayed subjects that did not need any cultural translation.
    [Show full text]
  • The Mischief of Transliteration Mark S
    The Mischief of Transliteration Mark S. Krause, Ph.D. Wildewood Christian Church, Papillion NE Précis: Many words in English translations of the New Testament are transliterations rather than translations. The author identified 53 Greek words in Matthew that are commonly transliterated. Looking at examples in Matthew helps us identify this practice and find better ways to understand and (perhaps) better translate these words. Introduction Translation of biblical texts into modern languages is an ongoing necessity in the service of the church. While the meaning of words and passages may be fixed in the ancient texts, modern languages change constantly, requiring the ongoing projects of new and refreshed translations. While this is true for all current languages, this paper will be restricted to English, the most widespread target language for Bible translation. D. A. Carson wrote “translation is treason.”1 While this may be overstatement, Carson means that moving a text from a source language to a target language may involve additions to meaning (even unintended) as well as subtractions from the richness of the original wording (also unintended). Even the best translations are approximations of the original texts. History teaches us that the church did not believe inspiration or authority was restricted to texts in Hebrew, Aramaic, or Greek. The early church, however, used the Septuagint (LXX), a collection of books that included translations of Hebrew and Aramaic texts into the common Greek of the time.2 The comfort level of the earliest church with the LXX is shown by its pervasive use as the primary source for quotations from the Jewish Scriptures by the authors of the New Testament.
    [Show full text]
  • First : Arabic Transliteration Alphabet
    E/CONF.105/137/CRP.137 13 July 2017 Original: English and Arabic Eleventh United Nations Conference on the Standardization of Geographical Names New York, 8-17 August 2017 Item 14 a) of the provisional agenda* Writing systems and pronunciation: Romanization Romanization System from Arabic letters to Latinized letters 2007 Submitted by the Arabic Division ** * E/CONF.105/1 ** Prepared by the Arabic Division Standard Arabic System for Transliteration of Geographical Names From Arabic Alphabet to Latin Alphabet (Arabic Romanization System) 2007 1 ARABIC TRANSLITERATION ALPHABET Arabic Romanization Romanization Arabic Character Character ٛ GH ؽٔيح ء > ف F ا } م Q ة B ى K د T ٍ L س TH ّ M ط J ٕ ػ N % ٛـ KH ؿ H ٝاُزبء أُوثٛٞخ ك٢ ٜٗب٣خ أٌُِخ W, Ū ٝ ك D ١ Y, Ī م DH a Short Opener ه R ā Long Opener ى Z S ً ā Maddah SH ُ ☺ Alif Maqsourah u Short Closer ٓ & ū Long Closer ٗ { ٛ i Short Breaker # ī Long Breaker ظ ! ّ ّلح Doubling the letter ع < - 1 - DESCRIPTION OF THE NEW ALPHABET How to describe the transliteration Alphabet: a. The new alphabet has neglected the following Latin letters: C, E, O, P, V, X in addition to the letter G unless it is coupled with the letter H to form a digraph GH .(اُـ٤ٖ Ghayn) b. This Alphabet contains: 1. Latin letters which have similar phonetic letters in Arabic : B,T,J,D,R,Z,S,Q,K,L,M,N,H,W,Y. ة، ،د، ط، ك، ه، ى، ً، م، ى، ٍ، ّ، ٕ، ٛـ، ٝ، ١ 2.
    [Show full text]
  • In This Issue: Get a Head Start on Your Taxes Are You a Band-Aid Interpreter? the Team Interpreting Approach
    November/December 2006 October 2006 Volume XXXV Volume XXXV Number 11 Number 10 A Publication The of the American A Publication Translators of the Association American CHRONICLE Translators Association In this issue: Get a Head Start on Your Taxes Are You a Band-Aid Interpreter? The Team Interpreting Approach November/December 2006 Volume XXXV American Translators Association Number 11 225 Reinekers Lane, Suite 590 • Alexandria VA 22314 Tel: (703) 683-6100 • Fax: (703) 683-6122 A Publication Contents November/December 2006 E-mail: [email protected] • Website: www.atanet.org of the American Translators Association Taxes? Now? 14 By Dorothee Racette and Nicholas Hartmann The end of the year is actually the perfect time to think about taxes! 14 An Overview of Vendor Management Today 17 By Charles Campbell Why give business away by saying that your company “doesn’t do that language” when you could invest time and energy in finding, qualifying, and maintaining a network of vendors to provide these services? Band-Aid Interpreter or Culture Consultant? 20 A Different Approach to Culture Brokering By Zarita Araujo-Lane, Edited by Vonessa Phillips Good interpreting is not just based on accuracy, but on how the interpreter manages the dynamics of the triadic encounter. 25 Team Interpreting: Does It Really Work? By Giovanna L. Carnet A discussion of the issues facing interpreters who would like to introduce the team interpreting concept into their districts, including tips on how to approach the subject. 28 An Update on Argentine Political Jargon By Rut Simcovich As a living reflection of reality, language is constantly evolving thanks to popular creativity, which uses current developments in any field to coin new and ingenious ways of describing things.
    [Show full text]
  • Multiple Causality of Differences in Taboo Translation of Blockbuster Films by Chinese Fansubbers and Professionals
    Multiple Causality of Differences in Taboo Translation of Blockbuster Films by Chinese Fansubbers and Professionals Zhengguo He Thesis submitted in fulfilment of the regulations for the degree of Doctor of Philosophy Newcastle University School of Modern Languages June 2018 Acknowledgement This thesis owes its existence to the expert supervision and encouragement of my first supervisor, Dr Ya-yun Chen, who always inspired me with her insightful feedback, and my second supervisor, Dr Valerie Pellatt, whose rigorous and holistic academic thinking had a profound influence on the structure of my thesis. It has been both a privilege and a pleasure to have the opportunity to be supervised by two supportive and inspiring scholars. I am also indebted to my Annual Panel Reviewers Professor Qian Jun, Dr Michael Jin, and Dr Francis Jones for their insightful review and advice. I would also like to express my deep gratitude to my viva examiners Drs Yvonne Lee and Pauline Henry-Tierney, for their very detailed and valuable critique. My sincere thanks must go to my family and friends during my PhD studies. Special thanks should be given to my family, especially my father, Qingxiang He, for his unfailing support throughout my years in the UK and Dr Mark James for his moral support and encouragement. I would also like to extend my sincere gratitude to Dr Joseph Banks and Karen Blacker, who offered me advice regarding my English composition; and Drs Peter Avery and Damien Hall for their valuable comments on my statistical analysis. i Note on Translation and Transliteration All translations from Chinese in this thesis are mine, unless otherwise noted.
    [Show full text]
  • Understanding Fansub As One of the Audiovisual Translation Methods
    KEMANUSIAAN Vol. 25, No. 2, (2018), 109–127 Understanding Fansub as One of the Audiovisual Translation Methods LIEW ZHE RONG1,2 *HASURIA CHE OMAR1 1School of Humanities, Universiti Sains Malaysia, 11800 USM Pulau Pinang, Malaysia Current affiliation: 2Thomson Reuters Penang, Bandar Bayan Baru, 11900 Bayan Lepas, Pulau Pinang, Malaysia *Corresponding author: [email protected] Published online: 26 September 2018 To cite this article: Liew, Z.R. and Che Omar, H. 2018. Understanding fansub as one of the audiovisual translation methods. KEMANUSIAAN the Asian Journal of Humanities 25(2):109–127, https://doi. org/10.21315/kajh2018.25.2.6 To link to this article: https://doi.org/10.21315/kajh2018.25.2.6 Abstract. The advancement of multimedia and internet technologies has led to the development of a new form of communication known as fansubbing that connects anime fans from around the globe. Fansubbing can be regarded as a form of collaborative practice because it provides anime fans the opportunity to explain the infusion of foreign cultural elements in local culture, thus enabling the target audience to better understand the anime shows that they are watching. As a translation method, fansubbing also allows anime fans to share their knowledge and interests through a free digital network. In relation to this context, this paper aims to discuss the position of fansubbing as a community’s contribution towards understanding anime in a different culture. The discussion is mainly centred upon the ideas posited by Hatcher (2005) and O’Hagan (2009) on fansub for Japanese anime and technology, Diaz-Cintas and Muñoz Sánchez’s (2006) and Pérez-González’s (2007, 2012) on the conventional subtitling and fansubbing approaches, and Chiaro’s (2008) on viewers experience.
    [Show full text]