From Old Texts to Modern Spellings: an Experiment in Automatic Normalisation

Total Page:16

File Type:pdf, Size:1020Kb

From Old Texts to Modern Spellings: an Experiment in Automatic Normalisation Iris Hendrickx, Rita Marquilhas From old texts to modern spellings: an experiment in automatic normalisation We aim to tackle the problem of spelling variations in a corpus of personal Portugese letters from the 16th to the 20th century. We investigated the extent to which the task of normalising Portuguese spelling can be accom- plished automatically. We adapted VARD2 (Baron and Rayson, 2008), a statistical tool for normalising spelling, for use with the Portuguese language and studied its performance over four different time periods. Our results showed that VARD2 performed best on the older letters and worst on the most modern ones. In an extrinsic evaluation, we measured the usefulness of automatic normalisation for the linguistic task of automatic POS-tagging and showed that automatic normalisation of spelling helps improve the performance of the POS-tagger. 1 Introduction Our goal was to reduce the problem of spelling variations in the Portuguese CARDS-FLY corpus of personal letters written over a period of approximately 500 years, from the 16th to the 20th century. As the letters were written by a diverse group of authors, some of whom were semi-illiterate, and most of the manuscripts predate the first standardisation of Portuguese spelling which only took place in 1911, they contain many spelling variations. We wanted to make the corpus available for further linguistic research and also make it accessible to a wider community. As the corpus is being prepared for research into language change, given the excep- tional, near-spoken status of the texts, it was advisable to preserve the original spellings in a paleographic edition appropriate for research into sound variations and change, and for discourse analysis focussing on the specific behaviour of the social agents. However, a standardised corpus is essential for lexical and grammatical research, since this is the only format that can be given the necessary mark-up, such as POS tagging, semantic tagging, and syntactic parsing, enabling it to be used as an empirical tool for testing theories of lexical and syntactic change. On the other hand, the rarity of the documents in question also make them valuable to the lay public, since they represent part of the country’s cultural heritage, reflecting the everyday lives of ordinary people, especially those who suffered hardship. The 16th to 19th century corpus comprises original epistolary texts produced by servants, children, wives, lovers, thieves, soldiers, artisans, priests, political campaigners and many other social agents who fell foul of the Inquisition and the civil courts, two institutions that habitually seized personal correspondence for use as evidence. The letters in the JLCL 2011 – Band 26 (2) – 65-76 Hendrickx, Marquilhas 20th century corpus come from personal collections compiled by the families of former soldiers, emigrants, and political prisoners. Editions intended for the lay public have the understandable obligation to provide readers with a clean text, free of distracting variations in spelling. Our aim was therefore to add an additional orthographic component to the historical documents, which involved modernising the spelling. Modernising or normalising spelling is not an arbitrary step: if done manually it involves an enormous workload that can only be carried out by a specialist in historical linguistics and if done automatically, errors inevitably occur. Here we investigate the extent to which the task of modernising Portuguese spelling can be accomplished automatically. We adapted VARD2 (Baron and Rayson, 2008) a well-studied statistical tool for normalising spelling, for use with the Portuguese language and studied its performance over four different time periods. The performance of this automatic normalisation tool was evaluated using intrinsic and extrinsic methods. Firstly, the automatically normalised text was compared with manually normalised text. Secondly, as an evaluation of use, the effect and usefulness of automatic spelling normalisation was evaluated in terms of the task of automatic grammatical tagging. The performance of a POS-tagger used on a data set with the original, non- standardised spelling was compared with the effect of automatically normalised text and, as the upper bound, on manually normalised text. This research is similar to the work of Rayson et al. (2007) who investigated the usefulness of pre-processing with VARD2 for POS-tagging historical English and showed that normalisation does help to improve the performance of the POS-tagger. The paper is structured as follows. In the next section we first discuss previous approaches to historical spelling normalisation. In Section 3 the experimental setup is explained, in particular the adaptation of the VARD2 tool to Portuguese in Section 3.1. Section 3.2 describes the historical corpus and provides additional background information on historical spelling changes in Portuguese. In Sections 3.3 and 5 the results of the experiments in normalisation and POS-tagging are presented and the conclusion appears in 6. 2 Related work The problem of spelling variations in historical texts has been investigated from different perspectives and with different aims. Automatic text retrieval on historical data suffers severely from spelling variation and a common approach to this problem is not to modernise the full text collection, but to expand the search query to cover lexical variants (e.g. (Koolen et al., 2006; Hauser and Schulz, 2007; Ernst-Gerlach and Fuhr, 2007; Gotscharek et al., 2011)). Other approaches attempt to modernise the spelling in the historical documents themselves. The VARD tools were developed for corpus linguistic research into Early Modern English. The original VARD tool consisted of a list of manually created mappings between historical variants and their modern versions. VARD2 (Baron and 66 JLCL From old texts to modern spellings Rayson, 2008) has an additional module that can search for variants and mappings. In the work of Rayson et al. (2005) the ability of VARD to detect spelling variants and suggest the correct modern spelling is compared with two commercial spelling correctors, MS-Word and Aspell, showing that VARD works better for historical texts since it detects fewer false positives. VARD2 will be discussed in more detail in Section 3.1. Kestemont et al. (2010) describe an automatic approach to normalise the spelling in Middle Dutch Text from the 12th century. In this case however, they chose not to convert historical word forms to their modern counterparts, but to their modern lemma. They used machine learning to discover how to transform one spelling variant into another to resolve intra-lemma variation. Several studies of spelling variation in Portuguese historical documents have been conducted and we were grateful to be able to re-use some of the resources already developed for historical Portuguese. We will briefly discuss this previous work and, in Section 3.1, explain how we used the available resources. The Historical Dictionary of Brazilian Portuguese (HDBP) is constructed on the basis of a historical Portuguese corpus of 1,733 texts and approximately 5 million tokens. As there was no standard spelling at the time (16th to 19th century), it is not easy to create lexicographic entries on the basis of the corpus or produce reliable frequency counts. A rule-based method was therefore developed for the automatic detection of spelling variation in the corpus (Giusti et al., 2007). The HDBP researchers compared their automatic variant detection method with Agrep1 using a small test set and showed that their method was more precise, whereas Agrep had much better recall. These experiments also led to a spelling variants dictionary containing approximately 30K clusters of variants. Another resource available for Portuguese is the Digital Corpus of Medieval Portuguese (CIPM- Corpus Informatizado do Português Medieval)2 which covers the 12th to the 16th century and counts around 2 million words. Rocio et al. (2003) describe how they annotated part of the CIPM with linguistic information such as POS-tags, morphological analysis and partial parse information. They did not proceed with modernisation but used automatic tools on the historical data as such, followed by a manual correction phase. The Tycho Brahe Parsed Corpus of Historical Portuguese3 is an electronic corpus of historical texts with prose from different text genres from the Middle Ages to the Late Modern era. The TBCHP contains 52 source texts but not all of them are annotated in the same way. Some of the texts maintain the original spelling variations, while in other texts, intended for part-of-speech and syntactic annotation, the spelling was standardised. 1Agrep: http://www.tgries.de/agrep/ 2CIPM corpus: http://cipm.fcsh.unl.pt/ 3TBCHP: http://www.tycho.iel.unicamp.br/∼tycho/corpus/en JLCL 2011 – Band 26 (2) 67 Hendrickx, Marquilhas 3 Data and Methods This section describes the adaptation of the VARD2 spelling standardisation tool for use with Portuguese, the corpus in question, which is a historical corpus of private letters written in Portuguese, and the experimental setup for normalisation and POS-tagging. 3.1 VARD2 for Portuguese The aim of this study was to evaluate the performance and usefulness of the VARD2 tool for historical Portuguese. As mentioned in Section 2, VARD2 was developed for historical forms of English. It combines several resources to detect and replace spelling variants with a normalised form and, where possible, we adapted every module for use with the Portuguese language. VARD2 uses a modern lexicon, a spelling variants dictionary list that matches variants against their modern counterparts, a list of letter replacement rules and a phonetic matching algorithm to predict normalised candidates for each variant detected, using an edit distance algorithm to determine the most likely candidate. The tool can be configured, since each module can be assigned a certain weight and can be individually configured in favour of recall or precision. When training the tools on a specific data set, new words and variants are added to the lists and the module weights are adapted accordingly.
Recommended publications
  • History of the Portuguese Music: an Overview
    HISTORY OF THE PORTUGUESE MUSIC: AN OVERVIEW Rui César Vilão Physics Department, University of Coimbra, PORTUGAL Abstract A brief overview of the history of the Portuguese music is given. In memoriam José Carlos Travassos Cortez 1. PREVIOUS REMARKS Presenting a communication on such a vast field as the subject of this work is usually a task for specialists. Only these are able to smoothly draw a global and coherent picture of their field of study. So, if you are not a specialist (even if you have a deep interest for the field) and you occupy your days working in an entirely different subject (say the physics of muonium states in semiconductors), you are surely in trouble! There are two possible solutions for this dilemma: a) ask a specialist and b) do it à la Newton (even if you're not one), climbing in the shoulders of giants. The mountaineering option was adopted here, and four giants helped me in viewing higher: Rui Vieira Nery and Paulo Ferreira de Castro with their História da Música [1], João de Freitas Branco with his História da Música Portuguesa [2] and José Carlos Travassos Cortez with his passioned and illuminating lessons on the history of Portuguese music that I was lucky to attend in the Conservatório de Musica de Coimbra [3]. I am very pleased to dedicate this work to his memory. Apart from Refs. [1-3], which are available in Portuguese only, all the other references are relative to introductory notes of compact-discs and are generally available in English as well. Of course these discographic indications are not extensive, but I believe they constitute a basis (surely not the only one!) for a good start in Portuguese-specialized melomania.
    [Show full text]
  • The Portuguese Art of War in Northern Morocco During the 15 Century
    Athens Journal of History - Volume 3, Issue 4 – Pages 321-336 The Portuguese Art of War in Northern Morocco during the 15th Century By Vitor Luís Gaspar Rodrigues This paper not only reviews the motives underlying the Portuguese expansionist project in Morocco in the 15th century, but also the political, economic, and particularly the social reasons that were in the basis of the Portuguese art of war in Morocco in that period. During the Iberian Reconquest (Reconquista), warfare was usually practiced by means of cavalcades, raids (razias) and ambushes, alongside with some siege actions. We will try to demonstrate that the Portuguese were forced to adopt a model of restricted territorial occupation and repeat the same technics and tactics of combat in Morocco, chiefly based on guerrilla war (guerra guerreada), as well as on siege and privateering actions, either offshore or onshore, by means of amphibian landings (saltos). We will also approach some of the changes that occurred in the defence systems of the Portuguese strongholds in North Africa, particularly at the turn of the 15th to the 16th centuries, as a result of the need to respond to the new challenges by the Moroccan armies equipped with fire weaponry. Keywords: Guerrilla war; maritime war; Northern Morocco; strongholds; siege war; fire weaponry. Portuguese Expansion in Morocco in the 15th Century: Main Causes The expansion project to Morocco by the Christian kingdoms from Iberia goes back to late 13th century, as attested by the Treaty of Soria, of 1291, signed by the monarchs of Castile and Aragon, which defined the areas to be occupied in the future by both kingdoms in North Africa, leaving the territorial stripe in the west of Ceuta to Portugal.
    [Show full text]
  • Linguateca's Infrastructure for Portuguese and How It Allows
    J. B. Johannessen (ed.), Language Variation Infrastructure, Oslo Studies in Language 3(2), 2011. 113–128. (ISSN 1890-9639) http://www.journals.uio.no/osla linguateca’s infrastructure for portuguese and how it allows the detailed study of language varieties DIANASANTOS SINTEF, Oslo & FCCN, Lisboa abstract In this paper I present briefly Linguateca, an infrastructure project for Por- tuguese which is over ten years old, showing how it provides several possi- bilities to study grammatical and semantic differences between varieties of the language. After a short history of Portuguese corpus linguistics, presenting the main projects in the area, I discuss in some detail the AC/DC project1 and what is called the AC/DC cluster (encompassing other related corpus projects shar- ing the same core). Emphasizing its potential for language variation stud- ies, the paper also (i) describes CONDIVport’s integration as an impetus for new capabilities, and (ii) provides a sketch of newly added functionalities to AC/DC. [1] an infrastructure for portuguese language technology In line with the main audience of the Linguateca project, there have already been several descriptions of Linguateca as an infra-structure for Portuguese, in Por- tuguese (Santos 2009), as well as substantial reporting2. However, an interna- tional audience has only seen traces and scattered references so far, so this paper intends to fill this gap in what corpus resources are concerned. It all started in 1998, with a small project (1998-1999) for preparing the future of the computational processing of the Portuguese language hosted by SINTEF, as an area to be taken specially good care of in the future science and technol- ogy programme (for the White book on Science and Technology created by the then Ministry of Science and Technology in Portugal), and wrote a small memo to be discussed publicly by all interested parties (Santos 1999a).
    [Show full text]
  • Orientalism Reconsidered 'If the World Were a Golden Ring, Hormuz
    Iranian Studies, vol. 50 / 2017 Ghoncheh Tazmini The Persian-Portuguese Encounter in Hormuz: Orientalism Reconsidered ‘If the world were a golden ring, Hormuz would be the jewel in it’1 Abstract In 2015, the quincentennial commemoration of the Portuguese arrival on the island of Hormuz in the Persian Gulf (1515-1622) revealed the underlying presupposition among Iranians that the Portuguese presence on the island was the harbinger of a long-term pattern of Western imperialism. This analysis questions the accuracy of this narrative by advancing a new interpretative framework that does not reduce the holding of Hormuz to simply another dark episode of European colonial history. Circumscribed and limited in aim and reach, Lusitanian activities on Hormuz cannot be brought under the generic rubric of “Orientalism”, which is embedded in European colonial tradition, and which by extension, buttresses Iranian nationalist sentiment about the Persian-Portuguese entanglement. My research demonstrates that Portuguese objectives diverged from the 18th and 19th century rationalist scientific traditions of the British, French and Germans professing a civilising mission as a rationale for colonial policies. Whereas the Portuguese operated from a worldview that combined profit, dynastic pride, and religious rhetoric the Portuguese mission to Hormuz was not guided by a grand discourse of civilising the “other”. While there was a complex interplay of commercial interests and brutal methods on this strategic êntrepot, Portuguese ambitions in Hormuz were confined
    [Show full text]
  • Annotating COMPARA, a Grammar-Aware Parallel Corpus
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Repositório Comum Annotating COMPARA, a Grammar-aware Parallel Corpus Diana Santos, Susana Inácio Linguateca, www.linguateca.pt, Oslo node at SINTEF ICT SINTEF ICT, Pb 124 Blindern, N-0314 Oslo, Norway [email protected], [email protected] Abstract In this paper we describe the annotation of COMPARA, currently the largest post-edited parallel corpora which includes Portuguese. We describe the motivation, the results so far, and the way the corpus is being annotated. We also provide the first grounded results about syntactical ambiguity in Portuguese. Finally, we discuss some interesting problems in this connection. syntactic function (not revised). The annotation was 1. Introduction automatically performed by the PALAVRAS parser COMPARA (www.linguateca.pt/COMPARA/) is a large developed by Eckhard Bick, followed by a thorough parallel corpus based on a collection of intellectual revision which has resulted in parser Portuguese-English and English-Portuguese literary improvements and new versions being applied to the same source texts and translations, which has been developed material and to new additions to COMPARA. Work on and post-edited ever since 1999 (Frankenberg-Garcia & annotating the English side of COMPARA, using the Santos, 2003). COMPARA has been designed with a view CLAWS PoS tagger (Rayson & Garside, 1998) is just to be an aid in language learning, translation training, starting. contrastive and monolingual linguistic research and Let us give the readers a flavour of what COMPARA language engineering. offers as new search functionalities: it is possible to look This paper has two aims: first, to present for the first time for the part of speech distribution of forms known to be the syntactic annotation of COMPARA and its intellectual ambiguous between grammatical categories, as well as revision (or post-edition), after its automatic annotation select concordances of only one grammatical with PALAVRAS (Bick, 2000) and a post-processing interpretation.
    [Show full text]
  • Cavite Chabacano Philippine Creole Spanish: Description and Typology
    Cavite Chabacano Philippine Creole Spanish: Description and Typology By Marilola Pérez A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Associate Professor Lev D. Michael, Chair Associate Professor Richard A. Rhodes Professor William F. Hanks Fall 2015 Abstract Cavite Chabacano Philippine Creole Spanish: Description and Typology by Marilola Pérez Doctor of Philosophy in Linguistics University of California, Berkeley Associate Professor Lev D. Michael This dissertation provides a grammatical description and sociohistorical account of the Cavite variety of Philippine Creole Spanish (PCS), also known as Cavite Chabacano (CC); and analyzes how this language informs standard typological characterizations of contact languages. CC is one of three surviving varieties of Chabacano, a Spanish-lexified contact language of the Philippines. The unique status of Chabacano as the only Spanish-lexified creole in Asia presents a number of typological challenges to standard views of colonial contact languages based on prototypical plantation creoles. Most work on Chabacano assumes that it is a creole language, and only a few recent works on the Zamboanga variety of Chabacano have questioned this classification. The current work reexamines the status of Chabacano as a creole language by providing linguistic data from an understudied Chabacano variety and examining it from a typological perspective. On the descriptive front, the dissertation provides a sketch grammar that constitutes the most complete description of the language to this date. The linguistic description is supplemented with a sociohistorical reconstruction that proposes different stages in the development of CC: an initial period of koineization, a period of hispanization or ‘decreolization’, and a latter period in which more Tagalog forms were incorporated from the adstrate.
    [Show full text]
  • The Indo-Portuguese Creoles of the Malabar: Historical Cues and Questions
    ** NOTE: This is a pre-print version. The published version of the chapter can be found in: - Cardoso, Hugo C. 2019. The Indo-Portuguese creoles of the Malabar: Historical cues and questions. In Pius Malekandathil, Lotika Varadarajan & Amar Farooqi (eds.), India, the Portuguese, and maritime interactions, vol. II [Religion, language and cultural expression], 345-373. Delhi: Primus Books. ** --- The Indo-Portuguese Creoles of the Malabar Historical Cues and Questions Hugo C. Cardoso THIS ESSAY PROVIDES a state-of-the-art of current research on the Indo-Portuguese creoles of the Malabar. Having been given up as extinct, these creoles have been off the radar of linguists and historians alike for a long while. Yet, they are particularly important as potential descendants of the earliest forms of contact varieties of Portuguese that formed in Asia in the sixteenth century, and raise questions that interact with a social historiography of the Indo-Portuguese communities of the region. This essay will focus on four aspects of the study of these languages which operate on a linguistic-historical interface: (a) the social conditions required for their formation; (b) their course after the end of Portuguese colonial rule; (c) their putative foundational role in the context of Luso-Asian creoles; and (d) the social and linguistic stratification encapsulated in modern and late nineteenth-century records. This discussion is meant as a step towards the integration of linguistic evidence into the study of Indo-Portuguese social history, and of historical evidence into the study of Indo-Portuguese linguistics. Introduction Starting in the early sixteenth century, the colonial involvement of Portugal with Asia introduced the Portuguese language in the region.
    [Show full text]
  • 1 22. European Portuguese and Brazilian
    22. European Portuguese and Brazilian Portuguese: how quickly may languages diverge? CHARLOTTE GALVES 22.1 Introduction In 1500, Portuguese-speaking sailors arrived to South American coasts where people spoke Tupi languages. Although in the first part of the colonization period the influence was more from the Tupi speakers towards the Portuguese speakers, as the new incomers learn the language of the natives, the “Terra de Santa Cruz vulgarmente chamada Brasil”i will eventually be the greatest Portuguese-speaking language in the world. However, Brazilian Portuguese is in many aspects as different from European Portuguese as from Galician or Spanish. We shall even argue below that from a grammatical point of view, it is much more different, since it displays syntactic features typical of typologically different languages. Portuguese provides us therefore with a case of recent split between two different realizations of a formerly unique language whose effects are observable in the phonetic, lexical and syntactic aspects of the modern languages and can be argued to characterize two radically different languages. In this chapter we shall describe the results and the dynamics of such a split, addressing ultimately the issue of the span of time necessary for languages to diverge. It will be argued that at the core of this question lies the dialectic relationship between I(nternal)-language and E(xternal)-language, in such a way that it is not possible to approach this process without taking into consideration both aspects and the way they interact. This means that we should understand not only the grammatical (I-Language) changes occurred in the history of the new languages and trace back their emergence, but also what provoked them and how they came to be diffused in the population and eventually became sufficiently homogeneous in their output (E-language) to be recognized as a new common language.
    [Show full text]
  • Diachronic Generative Syntax Conference
    DiGS14 14th DIACHRONIC GENERATIVE SYNTAX CONFERENCE http://digs14.clul.ul.pt/ Universidade de Lisboa 4-6 July 2012 [email protected] | Instituto para a Investigação Interdisciplinar | Av. Prof. Gama Pinto, 2 DiGS 14 14th Diachronic Generative Syntax Conference Lisbon, 4-6 July 2012 CLUL Centro de Linguística da Universidade de Lisboa center of linguistics of-the university of Lisbon ‘Linguistic Center of the University of Lisbon’ Table of contents Organizing Committee 2 Scientific Committee 3 Invited Speakers 4 Authors & emails 5 Program 6 Abstracts by author’s last name alphabetical order 9 Organizing Committee ∙ Adriana Cardoso ∙ Juliana Costa Moreira ∙ Rita Marquilhas ∙ Luís Graça ∙ Ana Maria Martins ∙ Sílvia Pereira (∙) Grupo de Dialectologia e Diacronia do CLUL group of dialectology and diachrony of-the CLUL ‘Dialectology and Diachrony Research Group of CLUL’ 2 Scientific Committee Adam Ledgeway Ioanna Sitaridou Adriana Cardoso Joel Wallenberg Ana Maria Brito John Sundquist Ana Maria Martins John Whitman Ann Taylor Judy Bernstein Ans van Kemenade Katalin Kiss Anna Bartra Kaufmann Lobke Aelbrecht Anne Breitbarth Marcel den Dikken Anthony Kroch Maria Francisca Xavier Beatrice Santorini Maria Lobo Caroline Heycock Marie Labelle Charlotte Galves Maria Llüisa Hernanz Chiara Gianollo Marit Julien Chris Reintges Mary Kato Dalina Kallulli Montse Batllori David Lightfoot Paola Crisma David Willis Paul Hirschbühler Edith Aldridge Roland Hinterhölz Elly van Gelderen Sonia Cyrino Eric Haeberli Susan Pintzuk Esther Rinke Susann Fischer Gertjan
    [Show full text]
  • PORTUGUESE LANGUAGE the Finals Honours Course in Portuguese Aims to Develop an Active and Passive Command of Correct Spoken
    PORTUGUESE LANGUAGE The Finals Honours course in Portuguese aims to develop an active and passive command of correct spoken and written Portuguese, with some appreciation of the differences between the European and Brazilian norms, and different stylistic registers. This is done by means of regular obligatory classes in translation to and from Portuguese, instruction in writing essays in Portuguese, grammar classes as needed, and classes allowing you to work through a range of oral/aural exercises. In the examinations for the Final Honour School, all those reading Portuguese take the following language papers: Paper I Prose Composition and Essay Paper II A Unseen translation (Portuguese) Paper II B Unseen translation (Brazilian) Oral Examination – which consists of an aural comprehension exercise, and discourse and conversation in Portuguese. Undergraduates reading Portuguese sole must also take Paper III (Year Abroad Project and Translation from Older forms of Portuguese). Paper I: Translation into Portuguese and Essay Classes in translation from English into Portuguese, which undergraduates are required to attend fortnightly throughout the course, are organized centrally. Tuition for this subject will help you to handle more complex syntactical structures, acquire a richer active vocabulary and gain a command of abstract written Portuguese, as well as of narrative and descriptive prose. For the Essay, instruction is provided in the form of classes, organized in small groups. Tuition in this subject is designed to enable you to address sophisticated political and cultural issues in clear, coherent, and complex Portuguese. Paper II A and II B: Translation from Portuguese Classes in translation from Portuguese into English, which undergraduates are required to attend fortnightly throughout the course, are organized centrally.
    [Show full text]
  • The Evolution of Brazil Compared with That of Spanish and Anglo-Saxon
    TO PRESIDENT JOHN CASPER BRANNER DISTINGUISHED SCIENTIST, EMINENT SCHOLAR AND TRUE FRIEND OF BRAZIL 'IC TOKE" OF ADMIRATION AND AFFECTION LELAND STANFORD, JUNIOR UN1VERSITY PUBLICATIONS UNIVERSITY SERIES The Evolution of Brazil cOlnpared. with that of Spanish and An,gIo-Saxon' Alnerica BY l\\ANOEL DE OLIVEIRA LIMA '! Edited with Introduction and Notes BY PERCY ALVIN MARTIN Assistant Professor 01 History Leland Stanford Junior University (Issued June, I914) , Stanford University, California Published by the University 1914 STANFORD UNIVERSITY PRESS CONTENTS PAGE INTRODUCTION 9 '. LECTURE I. The conquest of America.-Religious defence of the native element.--Indians and negroes.-The color problem and the discrimination against the col­ onists.-The institution of slavery and the conditions of political inde­ pendence in the Spanish and Portuguese colonies, affecting diversely the abolition of slavery.-The first Spanish American civil war and the verdict of history in regard to it.-The social organization in the pos­ sessions of the New World.-The Indians and the c1ergy.--Th'e part taken by the Jesuits.-The fusion of the races and the neo-European product.-Causes of the separation: disregard of nationality and eco­ nomic exploitation.-Monopolies and prohibitions.-Spirituat tutelage and emancipation.-Historical reasons for the Catholic intolerance.­ Intellectual revival of the Iberian Peninsula during the Spanish reign of Charles III, and under the Portuguese dictatorship of the Marquis de Pombal.-Influence of this revival in the colonies.
    [Show full text]
  • Portuguese at Yale an Historical Sketch
    Portuguese at Yale an Historical Sketch K. David Jackson Yale University [email protected] The presence of Portuguese at Yale can be traced to the hiring of Henry Roseman Lang, an Austrian with a doctorate in romance philology from Strasbourg (1890), in the Department of Romance Languages. There is no evidence that Professor Lang ever taught Portuguese, although he worked with Portuguese medieval poetry and corresponded with Carolina Michaela de Vasconcellos, Adolfo Coelho, and other famous Portuguese linguists and philologists of the time. In 1908, he received Joaquim Nabuco, then the first Brazilian ambassador to the United States, for two lectures at Yale on “The Place of Camões in Literature” and “The Spirit of Nationality in Brazil.” Professor Lang had a long and distinguished career at Yale, from Instructor, 1892-93, Assistant Professor, 1893-96, Professor of Romance Philology, 1896-1906, to Benjamin F. Barge Professor of Romance Languages & Linguistics, 1906-1922. He was emeritus professor from his retirement in 1922 until his death in 1934. The Department of Romance Languages, in which Lang headed the Spanish section, would continue to oversee teaching of the various romance languages at Yale until 1973. There was only a minor in Spanish during Lang’s time. In 1929 Romance Languages subdivided into a Department of French and a Department of Spanish and Italian. Spanish and Italian would continue as a unit until 1970. From 1971-73 Portuguese was added to become the Department of Spanish, Italian, and Portuguese, directed by Emir Rodríguez Monegal (Ph.D. Montevideo, 1956), a Uruguayan critic who arrived in 1969.
    [Show full text]