Spoken Language Translator: Phase Two Report (Draft)

Total Page:16

File Type:pdf, Size:1020Kb

Spoken Language Translator: Phase Two Report (Draft) Spoken Language Translator: Phase Two Report (Draft) Ralph Becket , Pierrette Bouillon , Harry Bratt , Ivan Bretan , David Carter , Vassilios Digalakis , Robert Eklund , Horacio Franco , Jaan Kaja , Martin Keegan , Ian Lewin , Bertil Lyberg , David Milward , Leonardo Neumeyer , Patti Price , Manny Rayner , Per Sautermeister , Fuliang Weng and Mats Wirén February 1997 SRI Project 6393 This is a joint report by SRI International and Telia Research AB; published simultaneously by Telia and SRI Cambridge. SRI International, Cambridge SRI International, Menlo Park Telia Research ISSCO, Geneva Executive summary Spoken Language Translator (SLT) is a project whose long-term goal is the construc- tion of practically useful systems capable of translating human speech from one lan- guage into another. The current SLT prototype, described in detail in this report, is ca- pable of speech-to-speech translation between English and Swedish in either direction within the domain of airline flight inquiries, using a vocabulary of about 1500 words. Translation from English and Swedish into French is also possible, with slightly poorer performance. A good English-language speech recognizer existed before the start of the project, and has since been improved in several ways. During the project, we have constructed a Swedish-language recognizer, arguably the best system of its kind so far built. This has involved among other things collection of a large amount of Swedish training data. The recognizer is essentially domain-independent, but has been tuned to give high performance in the air travel inquiry domain. The main version of the Swedish recognizer is trained on the Stockholm dialect of Swedish, and achieves near-real-time performance with a word error rate of about 7%. Techniques developed partly under this project make it possible to port the recognizer to other Swedish dialects using only modest quantities of training data. On the language-processing side, we had at the start of the project a substantial domain-independent language-processing system for English, a preliminary Swedish version, and a sketchy set of rules to permit English to Swedish translation. We now have good versions of the language-processing system for English, Swedish and French, and fair to good support for translation in five of the six possible language- pairs. Translation is carried out using a novel robust architecture developed under the project. In essence, this translates as much of the input utterance as possible using a sophisticated grammar-based method, and then employs a much simpler set of word- to-word translation rules to fill in the gaps. The language-processing modules are all generic in nature, are based on large, linguistically motivated grammars, and can fairly easily be tuned to give good perfor- mance in new domains. Much of the work involved in the domain adaptation process can be carried out by non-experts using tools developed under the project. Formal comparisons are problematic, in view of the different domains and lan- guages used and the lack of accepted evaluation criteria. None the less, the evidence at our disposal suggests that the current SLT prototype is no worse than the German Verbmobil demonstrator, in spite of a difference in project budget of more than an order of magnitude. i Acknowledgements The Spoken Language Translation project was funded by Telia Nät Tjänster (Swedish Telecom Networks Division) and the work was carried out jointly by Telia Research and SRI International. We would like to thank Christer Samuelsson for making available to us the LR compiler described in Chapter 6, and Steve Pulman and Christer Samuelsson for helpful comments on the material in 6. The work described in Appendix A was partly funded by the Defence Research Agency, Malvern, UK, under Strategic Research Project AS04BP44. We are grateful to Małgorzata Stys´ for comments on the material in that appendix and also for providing her analysis of the Polish examples in it. The greater part of the work described in Chapter 11 was carried out under funding from SRI International (at SRI Cambridge) and Suissetra (at ISSCO, Geneva). Much of the material in this report is based on published papers. Permission to re-use parts of those papers is gratefully acknowledged. Chapter 2 uses material from [IVTTA96], c IEEE; Chapter 5 uses Rayner and Carter (1997), c IEEE; Chapters 5 and 6 draw on Rayner and Carter (1996), and Appendix A uses Carter (1995), which are both c Association for Computational Linguistics; Chapter 11 is an edited version of Rayner, Carter and Bouillon (1996); Chapter 12 uses Rayner and Bouillon (1995); Chapter 14 is based on Rayner et al (1996); part of Chapter 16 uses Carter et al (1996) and Chapter 17 uses Rayner et al (1994), both of which are c IEEE. ii Contents Executive summary . ............................ i Acknowledgements . ............................ ii Table of contents . ............................ iii List of tables . ................................ xi List of figures . ................................ xiii 1 Introduction 1 1.1 Overview of the project . ........................ 1 1.1.1 Why do spoken language translation? ............. 1 1.1.2 What are the basic problems? ................. 2 1.1.3 What is it realistic to attempt today? . ............. 3 1.1.4 What have we achieved? . ................. 4 1.2 Overall system architecture . ................. 5 1.3 An SLT session . ............................ 9 1.3.1 The Interface Elucidated . ................. 9 1.3.2 Synthesis ............................ 13 1.3.3 Final Comments ........................ 13 2 Language Data Collection 15 2.1 Rationale and Requirements . ................. 15 2.2 Methodology . ............................ 17 2.2.1 Wizard-of-Oz Simulations . ................. 17 2.2.2 American ATIS Simulations . ................. 17 2.2.3 Swedish ATIS Simulations . ................. 17 2.3 Translations of American WOZ Material . ............. 18 2.3.1 Translations... A First Step . ................. 19 2.3.2 Email Corpus . ........................ 19 2.3.3 Sundry Comments on Email Corpus Editing . ......... 21 2.4 A Comparison of the Corpora . ................. 21 2.5 Concluding Remarks . ........................ 23 3 Speech Data Collection 25 3.1 Rationale and Requirements . ................. 25 3.2 Text Material . ............................ 26 3.2.1 ATIS Material: LDC CD-ROMs . ............. 26 iii iv 3.2.2 Swedish Material: The Stockholm-Umeå Corpus . 26 3.3 Text Material Used in Data Collection ................. 26 3.3.1 Practice Sentences . ................. 27 3.3.2 Calibration Sentences . ................. 27 3.3.3 Calibration Sentences – ATIS ................. 28 3.3.4 Calibration Sentences – Newspaper Texts . ......... 28 3.3.5 ATIS Sentences . ........................ 28 3.3.6 Expanded Phone Set . ................. 28 3.4 Dialect Areas . ............................ 30 3.4.1 Subjects . ............................ 30 3.5 Recording Procedures . ........................ 30 3.5.1 Computer ............................ 30 3.5.2 Headset . ............................ 31 3.5.3 Telephones . ........................ 31 3.5.4 The SRI Generic Recording Tool . ............. 32 3.5.5 Settings . ............................ 32 3.6 Checking ................................ 32 3.7 Lexicon . ................................ 33 3.8 Concluding Remarks . ........................ 35 4 Speech Recognition 36 4.1 The Swedish Speech Corpus . ................. 37 4.2 The Swedish Lexicon . ........................ 37 4.2.1 Introduction . ........................ 37 4.2.2 Phone Set ............................ 37 4.2.3 Morphology . ........................ 38 4.2.4 Lexicon Statistics ........................ 39 4.3 The English/Swedish Speech Recognition System . ......... 39 4.3.1 Diagnostic Experiments . ................. 39 4.3.2 Speed Optimization . ................. 41 4.3.3 Swedish Recognition . ................. 43 4.3.4 Summary ............................ 44 4.4 Dialect Adaptation . ........................ 44 4.4.1 Dialect Adaptation Methods . ................. 45 4.4.2 Experimental Results . ................. 46 4.4.3 Summary ............................ 50 4.5 Language Modeling . ........................ 50 4.5.1 Interpolating In-domain LMs with Out-of-domain LMs . 50 4.5.2 Class-Based Language Modeling . ............. 51 4.5.3 Compound Splitting in the Swedish System . ......... 51 4.6 Development of a phone backtrace for the n-best decoding algorithm . 54 4.7 The Bilingual Speech Recognition System . ............. 56 4.7.1 Experimental Setup . ................. 57 4.7.2 Multilingual Recognition . ................. 57 4.7.3 Language Identification . ................. 58 4.7.4 Summary ............................ 59 v 5 Overview of Language Processing 60 5.1 Introduction . ............................ 60 5.2 Linguistically Motivated Robust Parsing . ............. 61 5.3 Semi-automatic domain adaptation of grammars . ......... 63 5.3.1 Rational Development of Rule Sets . ............. 63 5.3.2 Training by Interactive Disambiguation . ......... 64 5.4 Summary ................................ 65 6 Customization of Linguistic Knowledge 67 6.1 Linguistic Analysis in the Core Language Engine . ......... 67 6.2 Constituent Pruning . ........................ 69 6.2.1 Discriminants for Pruning . ................. 69 6.2.2 Deciding which Edges to Prune . ............. 72 6.2.3 Probability Estimates for Pruning . ............. 72 6.2.4 Relation to other pruning
Recommended publications
  • Volvo Cars' Assembly Instructions Evaluation
    DF Volvo Cars’ assembly instructions evaluation Identification of issues related to the assembly instructions used at Volvo Cars’ final assembly plant in Torslanda, Sweden. Master’s thesis in Production Engineering FANNY JOHANSSON, JEANETTE MALM Industrial and Materials Sciences CHALMERS UNIVERSITY OF TECHNOLOGY Gothenburg, Sweden 2020 Volvo Cars’ assembly instructions evaluation Identification of issues related to the assembly instructions used at Volvo Cars’ final assembly plant in Torslanda, Sweden. FANNY JOHANSSON JEANETTE MALM DF Department of Industrial and Materials Sciences Division of Design and Human Factors Chalmers University of Technology Gothenburg, Sweden 2020 Volvo Cars’ assembly instructions evaluation Identification of issues related to the assembly instructions used at Volvo Cars’ final assembly plant in Torslanda, Sweden. FANNY JOHANSSON JEANETTE MALM © FANNY JOHANSSON, JEANETTE MALM, 2020. Supervisor: Cecilia Berlin, Industrial and Materials Sciences Examiner: Cecilia Berlin, Industrial and Materials Sciences Department of Industrial and Materials Sciences Division of Design and Human Factors Chalmers University of Technology SE-412 96 Gothenburg Telephone +46 31 772 1000 Cover: The three main issues found related to the assembly instructions. Typeset in LATEX Printed by Chalmers Digitaltryck Gothenburg, Sweden 2020 iv Volvo Cars’ assembly instructions evaluation Identification of issues related to the assembly instructions used at Volvo Cars’ final assembly plant in Torslanda, Sweden. FANNY JOHANSSON, JEANETTE MALM Department of Industrial and Materials Sciences Chalmers University of Technology Abstract This thesis has investigated the development, implementation, and usage of the as- sembly instructions within Volvo Cars, focusing on the humans and their behaviors and experiences. The purpose was to reveal, analyze and give recommendations on how to handle issues connected to the instructions.
    [Show full text]
  • Facköversättaren
    organ för sveriges facköversättarförening | nr 4 2018 | årgång 29 facköversättaren Tema terminologi Har grönländskan verkligen 50 termer för snö? TYSK SPRÅKINDUSTRI 12 | DEN SORGLIGA SAGAN OM TNC:S HÄDANFÄRD 16 Nr 4 2018 Årgång 29 Innehållet får gärna återges om källan Skribenter i anges. Den kompletta och oförändrade PDF-filen får distribueras och länkas fritt med oförändrat filnamn. De åsikter som uttrycks i signerade Facköversättaren artiklar i Facköversättaren är upphovspersonens egna och utgör inte ett officiellt ställningstagande från nr 4/2018 Sveriges Facköversättarförenings sida. Björn Olofsson Arnaq Grove Kansli Översätter från Lektor, ph.d. vid Postadress engelska, tyska, Institut for Kultur, SFÖ italienska, norska Sprog & Historie Box 44082 och danska till Afdeling for 100 73 Stockholm svenska, främst Oversættelse & inom teknik och Tolkning, Grönlands Tel: 08-522 963 00 vetenskap. universitet i Nuuk. Öppettider: mån–fre 09.00−16.00 Lärare på Tolk- och översättarinstitutet Presenteras närmare här. Lunchstängt: 12.00–13.00 vid Stockholms universitet. Chefredaktör E-post: [email protected] för Facköversättaren och tidigare med- Thea Döhler Webbplats: www.sfoe.se lem i SFÖ:s styrelse. Driver Tecnita AB. Fortbildare och konsult för språk- PlusGiro Mats Dannewitz industrins aktörer 630266-5 Linder och deras yrkes- Översätter från organisationer i Tysk- Bankgiro engelska, franska, land och utomlands. 5163-0267 tyska, danska och Affärsadministratör, norska till svenska. diplomerad pedagog, systemisk coach Auktoriserad och marknadsföringsexpert med kärlek Redaktion translator från engel- till språket. Ansvarig utgivare ska till svenska. Medlem av Facköversätta- Elin Nauri Skymbäck rens redaktion. Driver Nattskift Konsult. Bodil Bergh Översätter från Chefredaktör Katarina Lindve engelska, danska och Björn Olofsson Masterexamen i italienska till svenska Barometergatan 13 översättningsveten- och bloggar frekvent 723 50 Västerås skap från Stock- om diverse aktualite- Tel: 070-432 68 20 holms universitet, ter.
    [Show full text]
  • Dialect Recognition in a Noisy Environment: Preliminary Data
    Dialect recognition in a noisy environment: preliminary data Erik J. Eriksson and Kirk P. H. Sullivan Department of Philosophy and Linguistics, Umeå University Abstract The ability to identify the dialect spoken by an individual by voice alone has been widely investigated for many languages. The studies suggest a large degree of in- dividual variation in dialect recognition ability and that factors such as perceptual distance and competence in the language interact with this variation. Trained listen- ers have also been shown to be unstable in their judgements of dialect. In the study presented here naïve listeners attending a trade fair in Umeå were asked to select between eight given dialects when identifying two dialects. The test environment was noisey. The level of the background noise changed with the flow of people and activ- ities associated with the trade fair. The noisy environment did not appear to affect the listeners responses. Introduction . it became clear when collating the data that some judges were far less Many studies have examined the listener’s abil- able to successfully identity dialec- ity to place a person’s dialect and national back- tal accents or dialectal colouring in ground based on voice alone (e.g. Bayard et al., accents than the rest of the group. 2001; Bayard and Sullivan, 2005; Clopper and This was despite assurances from all Pisoni, 2004; Clopper et al., 2006; Cunningham- judges that they had a good ear for Andersson, 1996; Doeleman, 1998; Markham, Swedish accents. (p. 297) 1999; Preston, 1993; Remez et al., 2004; Sulli- van and Karst, 1996; Williams et al., 1999).
    [Show full text]
  • From Prototype to Product. the Development of Low
    This report is published within the KFB Program on Biobased Fuels for Vehicles TfflS DOOM# 6 wr'l From Prototype to Product The development of low emission natural gas- and biogas buses Mats Ekelund Strateco DISTRP^- - - •'$ DOCUMENT IS UNLIMITED uiidiiiN SALES PROHIBITED A follow up of the KFB- and Nordic Industrial Fund sponsored "Co-Nordic GasBuss Project" KFB-Rapport 1997:19 FORFATTARE/AUTHOR SERIE/SERIES Rapport 1997:19 Mats Ekelund ISBN 91-88868-41-9 ISSN 1104-2621 utel /htle PUBUCERINGSDATUM/DATE PUBLISHED From Prototype to Product January 1998 UTGIVARE/PUBLISHER KFB - Kommunikationsforsknings- KFBs DNR 88-194-732 and 1993-0270 beredningen REFERAT (Syfte, Metod, Resultat) Syftet med rapporten ar att beskriva utvecklingen for natur- och biogasdrivna bussar och andra tunga fordon sedan “Nordiska GasBuss Projetet”, for vilken TFB/KFT var en huvud- finansiar och en av initiativtagarna. Idag finns ca 325 tunga metangasdrivna fordon i landet. Projetets uppstallda mal naddes, da signifikant reduktion av avgasnivan kunde demonstre- ras. Volvo, och senare Scania, omsatte de laga nivaerna pa saval bussar som i lastbilar. Hela den etablerade busstilverkarindustrin har darefter, foljt efter till samma laga avgasnivaer. Sve­ rige leder utvecklingen av biogas som drivmedel och har narmare 100 fordon. Biogas ar ur fdrsorjningssynpunkt en undervarderad fdrsdrjningskalla som teoretiskt kan ersatta minst halften av dieselanvandningen i Sverige. Utvinningen av biogas kommer att oka ju mer avfall som sorteras och atervinns. Scanai och Volvo har tillverkat ca 500 gasbussar till 6 olika lander. Den utveckling som behover aga rum omfattar; kontrollen av avgasutslappen for att oka sta- bilitieten av over tiden, den omfattar utvecklingen mot lattare och billigare tankar och kompres- sor-Zreningsanlaggningar, fdrbattringen av verkninsgraden for att na lagre drivmedelsforbruk- ning och avgasutslapp av bla CO2, NOx och CH4.
    [Show full text]
  • 17 Using the World Wide Web to Research Spoken Varieties of English: the Case of Pulmonic Ingressive Speech Peter Sundkvist Stockholm University
    17 Using the World Wide Web to Research Spoken Varieties of English: The Case of Pulmonic Ingressive Speech Peter Sundkvist Stockholm University 1. Introduction Undoubtedly, one of the most significant developments for human interaction and communication within the last thirty years or so is the internet. Its origins may be traced back to various experimental com- puter networks of the 1960s (Crystal 2013: 3). It has since grown by a phenomenal rate: between 2000 and 2012 the number of internet users increased by an average of 566.4% across the globe, and as of 2012 34.3% of the world’s population was estimated to have access to the internet (Internet World Stats 2012). Partly as a result of its phenom- enal growth the internet has also established itself as an important tool and source of information within the fields of linguistics and English studies. While much linguistically-oriented work has focused on fea- tures and developments related to the electronic medium of commu- nication itself, inquiry has gradually extended into further domains, as researchers become more creative in exploring the full potential of the internet. In particular, the World Wide Web (WWW), defined as “the full collection of all the computers linked to the internet which hold documents that are mutually accessible through the use of a standard protocol (Hypertext transfer protocol, or HTTP)” (Crystal 2013: 13), has opened up a range of new possibilities with regard to linguistic research. Thus far, however, less attention has been devoted to its potential for research on spoken varieties of English. The aim of this paper, therefore, is to explore the World Wide Web as a source of evidence for a specific feature of spoken language.
    [Show full text]
  • Preparation Kit
    PREPARATION KIT TURKU 2018 - National Session of EYP Finland 5-8 January 2018 EUROPEAN YOUTH PARLIAMENT SUOMI FINLAND Dear Delegates, ​ ​ On behalf of the whole Chairs’ Team of Turku 2018, I welcome you to share our excitement by presenting to you this Academic Preparation Kit, which includes the Topic Overviews for Turku 2018 – National Session of EYP Finland. ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ The Chairs’ Team has been working hard over the past weeks in order to give you a good introduction to the topics, to important discussions that touch upon the most recent events taking place in Europe under the theme “Towards a Better European Community with Nordic ​ Collaboration”. I want to extend my gratitude for the Vice-Presidents Kārlis Krēsliņš and Mariann ​ ​ Jüriorg for creating great foundations for the academic concept. Additionally, there are two external scrutinisers Henri Haapanala and Viktor M. Salenius, who have to be thanked for their academic prowess and immense help they provided with this Preparation Kit. ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ We encourage you to look into all of the Committees’ Topic Overviews, in order for you to have a coherent picture of all the debates in which you will be participating at the General Assembly. ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ In addition to your Committee’s Topic Overview, make sure you read the explanations on how the European Union works. It is essential for fruitful conversation that you know how the structure and institutional framework of the EU functions. ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ I hope to see you all in person very soon! ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ Yours truly, ​ ​ Tim Backhaus ​ ​ President of Turku 2018 – National Session of EYP Finland ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ 1 Committee Topics of Turku 2018 - National Session of EYP Finland ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ ​ 1.
    [Show full text]
  • Appropriate Tone Accent Production in L2-Swedish by L1-Speakers of Somali?
    Proceedings of the International Symposium on the Acquisition of Second Language Speech Concordia Working Papers in Applied Linguistics, 5, 2014 © 2014 COPAL Appropriate Tone Accent Production in L2-Swedish by L1-Speakers of Somali? Mechtild Tronnier Elisabeth Zetterholm Lund University, Sweden Linnæus University, Sweden Abstract It has been suggested that speakers with an L1 with lexical tones may have an advantage when it comes to perceptually discriminating between different tones in another tone language (Kaan, Wayland, Bao & Barkley, 2007). Other studies in L2‐learning show that this is not entirely the case (van Dommelen & Husby 2009, So & Best 2010). A model of typological pitch prominence (Schaefer & Darcy, 2013) suggests that speakers of an L1 with a higher pitch prominence can perceive tonal contrast in another tone language better than those with an L1 of a lower pitch prominence. This study addresses the question: if Somali L1‐speakers make a systematical distinction in the tonal pattern when producing Swedish words with the two tonal accents – as both languages are of similar pitch prominence according to Schaefer and Darcy – and also to what extent they produce a tonal pattern assigned to either one of the tone accents. The adequate distinction is identified as such by native speakers/listeners of Swedish. Results revealed that a big discrepancy still remains between the number of correct identifications of the stimuli produced by the L1‐speakers of Swedish and those produced by L2‐speakers of Swedish with Somali as their L1. Having a typologically similar L1 does not seem to give enough support to handle the tone accent distinction in Swedish L2 adequately.
    [Show full text]
  • A Bidialectal Experiment on Voice Identification 149 148 Sjostrom, Eriksson, Zetterholm & Sullivan
    Lund University, Dept. of Linguistics and Phonetics 145 144 BENGT SIGURD & DAMRONG TAYANIN Working Papers 53 (2008), 145-158 It seems proper to end this paper by a (translated) quote from Ryszard Kapuscinski's book Ebony (p. 63 in the Swedish translation Ebenholts): In many African societies the children get names after an event which A bidialectal experiment on voice has happened at the birthday. The name Edu was short for education, as the first school in the child's village was opened the same day. Where Christianity and Islam were not rooted the richness of names identification was infinite. The poetic talents of the parents were displayed when they e.g. gave the name 'Bright morning' (if bom in the early morning) and 'Acacia shadow' (if born in the shadow of an acacia tree). If a child had Maria Sjostrom, Erik J. Eriksson, Elisabeth Zetterholm and been bom when Tanganyika had gained its independence the child could have been christined to Uhuru (Swahili for 'independent'). If the parents Kirk P. H. Sullivan were adherents of president Nyerere the child could be given the name Nyerere. The introduction of Christianity and Islam had reduced these opulent worlds of poetry and history to a few dozens of names from the Bible 1 Introduction and the Koran. Since then there are only long rows of James and Patrick Our voice and speech are part of our identity and we can distinguish different or Ahmed and Ibrahim. humans through their voices. The voice gives the listener a cue about the speaker's gender, age and regional background, and socio-economic class.
    [Show full text]
  • Language, Culture, and Persistence in Maine’S Swedish Colony
    Maine History Volume 40 Number 3 Challenge and Change in Maine's Article 4 Communities 9-1-2001 “You Speak Very Good English for a Swede”: Language, Culture, and Persistence in Maine’s Swedish Colony Katherine Hoving Adirondack Museum Follow this and additional works at: https://digitalcommons.library.umaine.edu/mainehistoryjournal Part of the Cultural History Commons, and the United States History Commons Recommended Citation Hoving, Katherine. "“You Speak Very Good English for a Swede”: Language, Culture, and Persistence in Maine’s Swedish Colony." Maine History 40, 3 (2001): 219-244. https://digitalcommons.library.umaine.edu/mainehistoryjournal/vol40/iss3/4 This Article is brought to you for free and open access by DigitalCommons@UMaine. It has been accepted for inclusion in Maine History by an authorized administrator of DigitalCommons@UMaine. For more information, please contact [email protected]. “YOU SPEAK VERY GOOD ENGLISH FOR A SWEDE”: LANGUAGE, CULTURE, AND PERSISTENCE IN MAINE’S SWEDISH COLONY By Katherine Hoving In the summer of 1870, a small group of Swedish immigrants ar­ rived in northern Maine with the intention of establishing a farming colony in a place they called New Sweden. Despite many difficulties, the community has persisted and maintained a strong sense of its Swedish heritage. A demographic study of those who lived in the Swedish colony during its first sixty years suggests that language re­ tention played an important role in keeping both the community and its identity alive. Katherine Hoving completed her Masters of Arts in History at the University of Maine, Orono, in December 2001.
    [Show full text]
  • Information to Users
    INFORMATION TO USERS This manuscript has been reproduced from the microfilm master. UMI films the text directly from the original or copy submitted. Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer. The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleedthrough, substandard margins, and improper alignment can adversely afreet reproduction. In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. Oversize materials (e.g., maps, drawings, charts) are reproduced by sectioning the original, beginning at the upper left-hand comer and continuing from left to right in equal sections with small overlaps. Each original is also photographed in one exposure and is included in reduced form at the back of the book. Photographs included in the original manuscript have been reproduced xerographically in this copy. Higher quality 6” x 9” black and white photographic prints are available for any photographs or illustrations appearing in this copy for an additional charge. Contact UMI directly to order. UMI A Bell & Howell InfoTmation Company 300 North Zed) Road, Ann Arbor MI 48106-1346 USA 313/761-4700 800/521-0600 T he R epresentation o f V o w el H eig h t in P h o n o lo g y D issertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio State University By- Frederick Brooke Parkinson, B.A., M.A.
    [Show full text]
  • University of Copenhagen
    Factors influencing the linguistic development in the Øresund region Gregersen, Frans Published in: International Journal of the Sociology of Language Publication date: 2003 Document version Peer reviewed version Citation for published version (APA): Gregersen, F. (2003). Factors influencing the linguistic development in the Øresund region. International Journal of the Sociology of Language, (159), 139-152. Download date: 24. sep.. 2021 Factors influencing the linguistic development in the Øresund region FRANS GREGERSEN Abstract The paper summarizes post–World War II research on the understanding of Swedish by Danes and vice versa. Self-evaluation results as well as test results point to increased mutual comprehension. Preliminary results from ongoing investigations on both sides of the Øresund are given. They suggest a number of possible future scenarios, the most realistic prediction being that the well- known language problems may be overcome in enterprises employing both Danes and Swedes. With single speakers employed at predominantly mono- lingual work places various interlanguage varieties will develop. Whether such varieties will eventually crystallize into one accepted Swedo-Danish interlanguage variety is still an open question. Introduction The subject of this paper is the linguistic relation between two nation states and two regions, viz. Sweden and Denmark and Scania and the Copenhagen metropolitan region respectively. Historical research, Willerslev (1981a, 1981b), documents that despite some regional inte- gration manifesting itself in a substantial immigration of poor Scanians to Copenhagen at the close of the nineteenth century, in the twentieth century neither the nation states nor the regions were integrated and the immigrants linguistically speaking left no trace at all.
    [Show full text]
  • Selskab for Østnordisk Filologis Ostnordische
    ORGANISATION Anja Blode M.A. [email protected] Dr. Elena Brandenburg [email protected] Dr. Regina Jucknies [email protected] ADRESS SELSKAB FOR Institut für Skandinavistik/Fennistik Philosophische Fakultät ØSTNORDISK FILOLOGIS Universität zu Köln 4. internationale konference Albertus-Magnus-Platz 50923 Köln 4. Internationale Tagung E-POST [email protected] der Gesellschaft für OSTNORDISCHE INFO PHILOLOGIE MER v 12. - 14. JUNI 2019 Handschrift: Det Kgl. Bibliotek/Royal Danish Library, E don. var. 3 8°, 205 E don. var. Handschrift: Det Kgl. Bibliotek/Royal Danish Library, SELSKAB FOR ØSTNORDISK FILOLOGIS 4. internationale konference Köln 12. - 14. juni 2019 PROGRAMPROGRAMPROGRAMPROGRAMPROGRAMPROGRAMPROGRAMM Philosophikum 17.30-19.00 Workshop: Handbok i östnordisk filologi onsdag 12. juni 2019 S 58 19.15 Nämndens möte 8.45-9.15 Registrering 9.15-9.30 Välkomstord: Prof. Dr. Gabriele von Glasenapp torsdag 13. juni 2019 (Vice Dean for Staff, Equality and Diversity) 9.00-10.30 9.30-10.30 Tekstvittnen och lingvistik 1 Paleografi och kodikologi 1 Anna Catharina Horn (Universitetet i Oslo): key note: Bo-A. Wendt (Lunds universitet): Vad är filologi – vem är filolog? Materialitet og kontinuitet i de Louise Faymonville (Stockholms universitet): Riddare, äwintyr och faghra vestgötiske lovene Karl G. Johansson (Universitetet i Oslo): ordh – bruket av ett höviskt ordförråd i fornsvenska texter Miracles and more. The composite manuscript Cod Holm A 110 Anja Ute Blode (Universität zu Köln): 10.30-11.00 Kaffe “Well known, but not known well?” – Recent discoveries in two Old Danish manuscripts 11.00-12.30 Tekstvittnen och lingvistik 2 10.30-11.00 Odd Einar Haugen (Universitetet i Bergen): En felles grammatik for eldre Kaffe nordisk? 1 11.00-12.30 Luca Panieri (Libera Università di Lingue e Comunicazione IULM - Milano): En Paleografi och kodikologi 2 Roger Andersson fælles grammatik for ældre nordisk? 2 (Stockholms universitet): Små tungor ger stor kunskap Joseph M.
    [Show full text]