Towards a Corsican Basic Language Resource Kit

Total Page:16

File Type:pdf, Size:1020Kb

Towards a Corsican Basic Language Resource Kit Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), pages 2726–2735 Marseille, 11–16 May 2020 c European Language Resources Association (ELRA), licensed under CC-BY-NC Towards a Corsican Basic Language Resource Kit Laurent Kevers, Stella Retali-Medori UMR CNRS 6240 LISA, Universita` di Corsica - Pasquale Paoli Avenue Jean Nicoli, 20250 Corte, France fkevers l, medori [email protected] Abstract The current situation regarding the existence of natural language processing (NLP) resources and tools for Corsican reveals their virtual non-existence. Our inventory contains only a few rare digital resources, lexical or corpus databases, requiring adaptation work. Our ob- jective is to use the Banque de Donnees´ Langue Corse project (BDLC) to improve the availability of resources and tools for the Corsican language and, in the long term, provide a complete Basic Language Ressource Kit (BLARK). We have defined a roadmap setting out the actions to be undertaken: the collection of corpora and the setting up of a consultation interface (concordancer), and of a language detection tool, an electronic dictionary and a part-of-speech tagger. The first achievements regarding these topics have already been reached and are presented in this article. Some elements are also available on our project page (http://bdlc.univ-corse.fr/tal/). Keywords: less-resourced languages, Corsican, corpora, lexical resources, language identification, POS tagging We will first contextualize our research and present some a polynomic approach (Marcellesi, 1984) that encompasses general information about the Corsican language (1.1., its all dialectal variants, the writing of the language is not stan- digital presence (1.2.) and the BDLC project (1.3.), as pre- dardised, which is an obstacle for its automatic processing. viously done in Kevers et al. (2019) and Kevers and Retali- Medori (2020). Not without having drawn up a brief state 1.2. Digital presence of the art (2.1.), we will present the achievements concern- Nowadays, Corsican is, with French, part of a diglossic lan- ing the NLP resources and tools for Corsican (2.2.). In par- guage environment, and its use is declining. The devel- ticular, we will address corpora collection (2.3.), the set up opment of tools is necessary for its preservation, enhance- of a consultation interface in the form of a concordancer ment, transmission and promotion1. Public policy encour- (2.4.) and of a language detection tool (2.5.), the electronic ages the exploitation of new technologies to this end. Sev- dictionary building (2.6.) and POS tagging (2.7.). These eral tools and resources exist for the learning and linguistic developments are guided by a roadmap that we have de- description of Corsican dialects, but their inclusion in the fined (Kevers et al., 2019) and that is largely in line with digital humanities domain remains insufficient2. the steps proposed by Ceberio Berger et al. (2018). In particular, sites and applications dedicated to translation, 1. The Corsican language and its digital lexicon and syntax contain little data in comparison with the richness and complexity of the language. On the other presence hand, this wealth is found on databases such as the Banque 1.1. Corsican language de Donnees´ Langue Corse (BDLC) and Infcor3 (Banca di Corsican is a Latin language and is part of the Italo- dati di a lingua corsa). The latter was conceived in an Romance domain. It has known various contacts and associative context by ADECEC, an organisation active in linguistic influences. Due to the Pisan domination (9th- the field of Corsican language and culture. It has been de- 13th centuries), it is particularly linked to medieval Tuscan veloped and was made available to the general public as which constitutes its superstratum. Corsican has borrowed a website as well as a smartphone application wich offer 4 from other Italo-Romance and even Romance varieties as numerous lexical records and multi-criteria query modes . well as from Germanic and Arabic languages. Regarding the BDLC, it is a tool designed in a scientific From a dialectal point of view, four or even five ar- context, associated with the production of a linguistic atlas, eas are identifiable (Dalbera-Stefanaggi, 2002; Dalbera- the NALC. Stefanaggi, 2007). The extreme southern area even crosses the borders of Corsica, extending into Gallura, in the north 1Recommendation of the UNESCO (Brenzinger et al., 2003) of Sardinia. However, these five areas constitute a contin- 2Corsican language can be found in various forms on the uum and do not prevent interunderstanding between speak- web: interfaces in Corsican language (Qwant and Google search ers, even with those of the central and southern varieties of engines, Facebook social network), websites or application like Italy. Google Translate, Wikipedia or Wiktionary, smartphone apps for tourism, education or translation, online educational content such The genetic and historical relation between Corsican and as Corsican language learning websites, blogs about Corsican lan- Tuscan languages led the latter to be the writing language guage and culture. of the islanders from the Middle Ages until the emergence 3http://infcor.adecec.net of a conscious writing in Corsican language in the 19th 4Altough all the lexical data would require a revision of the century. The spelling of Corsican is therefore, with some information related to variation, meanings, morphology and ety- adaptations, based on the Italian graphic system (Retali- mology, the material contained in this database is essential and Medori, 2015). However, despite the implementation of could help in the development of new tools. 2726 1.3. The BDLC project els) or durm`ıa / durmia / “he was sleeping” (hiatus The Nouvel Atlas Linguistique et ethnographique de la accentuation); Corse (NALC) was initiated by the CNRS in 1975 and transmitted to Marie-Jose´ Dalbera-Stefanaggi in 1981 at the 2. Development of NLP resources and tools opening of the University of Corsica. In 1986, in response for Corsican to a request from the Regional Assembly of Corsica, the 2.1. State of the art 5 Banque de Donnees´ Langue Corse (BDLC) was created To our knowledge, there are very few resources and tools 6 and was naturally linked to the atlas . designed for Natural Language Processing (NLP) in Corsi- The NALC-BDLC hosts linguistic data related to Corsi- can. The ELDA 2014 report on linguistic resources dedi- can know-how and cultural traditions throughout the is- cated to the languages of France (Leixa et al., 2014) lists land. During field surveys with local speakers, these data 93 resources for Corsican. More than a third of these are 7 are collected through thematic questionnaires made up recordings and transcriptions from the BDLC project. The 8 of French wordlists : for example la vigne (“the vine”), rest is made up of various documents: blogs, scientific pa- le cep (“the vine stock”), tailler la vigne (“pruning the pers, institutional sites, newspapers sites, etc. There are vine”), le tonneau (“the barrel”). Based on a question also some lexicons, including Infcor or the Corsican ver- such as Cumu si dici “tailler la vigne” in corsu? (“How sion of the Wiktionary. In addition to this inventory, there 9 do we say ‘to prune the vine’ in Corsican language?” ), are a few other contributions, including the Speaking Atlas the corresponding translations into Corsican are recorded of the Regional Languages of France (Boula de Mareuil¨ et and a semi-directed interview entirely in Corsican, and re- al., 2018), the W2C corpus (Majlis and Zabokrtsky,´ 2012), lating to practices, is undertaken in order to collect eth- the BabelNet semantic network (Navigli and Ponzetto, notexts (testimonies). This data is then processed, lin- 2012), which offers a number of units in Corsican, and the guistically analysed and put online on the website http: Corsican resources created within the Crubad´ an´ Project by //bdlc.univ-corse.fr. Scannell (2007). Except for the last three, the majority of If this database constitutes a real asset for the development available resources are not directly usable for NLP. of NLP applied to Corsican, one of the major difficulties Corsican therefore falls into the category of less-resourced comes from the rich variation characterising the Corsican languages. These languages are an active area of re- language. According to the objects, significant lexical vari- search. The TALN conference hosted several events ded- ations are documented: for example, 25 lemmas were col- icated to this issue, including the workshop Traitement au- lected to describe the act of pricking the vine out. These tomatique des langues minoritaires et des petites langues lemmas will in turn be affected by variable transcriptions, (Streiter, 2003), as well as the TALaRE workshops (Morin particularly as a result of the non-standardisation of the and Esteve,` 2013; Vergez-Couret et al., 2015). Similarly, Corsican language and as a production carried out by vari- the LREC conference hosted multiple workshops, the most ous transcribers during the 30 years of the project existence. recent of which are SaLTMiL (Alegria et al., 2010; De Pauw The different writing choices meet objectives such as: et al., 2012) and CCURL (Pretorius et al., 2014; Soria et • keep the dialectal richness, for example to name ”the al., 2016; Soria et al., 2018). Recently, the TAL journal also jar”, according to the pronunciation in the localities published a thematic issue on the subject (Bernhard and So- surveyed, we will find the forms cerra and gerra from ria, 2018). However, the place of the Corsican language in the same etymology; these publications is almost non-existent. • in some cases, explicitly indicate the placement of the 2.2. Initiatives for Corsican tonic accent as well as the opening of the vowels, Given this observation, we have decided to work to im- for example in trovula´ rather than trovula / “bowl”, prove the situation of the Corsican language with regard compulu` / compulu / “shelter”, pergula` / pergula / “ar- to its place in the digital world, and more particularly in the bour”, tepidu´ / tepidu / “tepid” (proparoxytones vow- field of Natural Language Processing.
Recommended publications
  • Preparation and Exploitation of Bilingual Texts Dusko Vitas, Cvetana Krstev, Eric Laporte
    Preparation and exploitation of bilingual texts Dusko Vitas, Cvetana Krstev, Eric Laporte To cite this version: Dusko Vitas, Cvetana Krstev, Eric Laporte. Preparation and exploitation of bilingual texts. Lux Coreana, 2006, 1, pp.110-132. hal-00190958v2 HAL Id: hal-00190958 https://hal.archives-ouvertes.fr/hal-00190958v2 Submitted on 27 Nov 2007 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Preparation and exploitation of bilingual texts Duško Vitas Faculty of Mathematics Studentski trg 16, CS-11000 Belgrade, Serbia Cvetana Krstev Faculty of Philology Studentski trg 3, CS-11000 Belgrade, Serbia Éric Laporte Institut Gaspard-Monge, Université de Marne-la-Vallée 5, bd Descartes, 77454 Marne-la-Vallée CEDEX 2, France Introduction A bitext is a merged document composed of two versions of a given text, usually in two different languages. An aligned bitext is produced by an alignment tool or aligner, that automatically aligns or matches the versions of the same text, generally sentence by sentence. A multilingual aligned corpus or collection of aligned bitexts, when consulted with a search tool, can be extremely useful for translation, language teaching and the investigation of literary text (Veronis, 2000).
    [Show full text]
  • Nationalism in the French Revolution of 1789
    The University of Maine DigitalCommons@UMaine Honors College 5-2014 Nationalism in the French Revolution of 1789 Kiley Bickford University of Maine - Main Follow this and additional works at: https://digitalcommons.library.umaine.edu/honors Part of the Cultural History Commons Recommended Citation Bickford, Kiley, "Nationalism in the French Revolution of 1789" (2014). Honors College. 147. https://digitalcommons.library.umaine.edu/honors/147 This Honors Thesis is brought to you for free and open access by DigitalCommons@UMaine. It has been accepted for inclusion in Honors College by an authorized administrator of DigitalCommons@UMaine. For more information, please contact [email protected]. NATIONALISM IN THE FRENCH REVOLUTION OF 1789 by Kiley Bickford A Thesis Submitted in Partial Fulfillment of the Requirement for a Degree with Honors (History) The Honors College University of Maine May 2014 Advisory Committee: Richard Blanke, Professor of History Alexander Grab, Adelaide & Alan Bird Professor of History Angela Haas, Visiting Assistant Professor of History Raymond Pelletier, Associate Professor of French, Emeritus Chris Mares, Director of the Intensive English Institute, Honors College Copyright 2014 by Kiley Bickford All rights reserved. Abstract The French Revolution of 1789 was instrumental in the emergence and growth of modern nationalism, the idea that a state should represent, and serve the interests of, a people, or "nation," that shares a common culture and history and feels as one. But national ideas, often with their source in the otherwise cosmopolitan world of the Enlightenment, were also an important cause of the Revolution itself. The rhetoric and documents of the Revolution demonstrate the importance of national ideas.
    [Show full text]
  • Machine-Translation Inspired Reordering As Preprocessing for Cross-Lingual Sentiment Analysis
    Master in Cognitive Science and Language Master’s Thesis September 2018 Machine-translation inspired reordering as preprocessing for cross-lingual sentiment analysis Alejandro Ramírez Atrio Supervisor: Toni Badia Abstract In this thesis we study the effect of word reordering as preprocessing for Cross-Lingual Sentiment Analysis. We try different reorderings in two target languages (Spanish and Catalan) so that their word order more closely resembles the one from our source language (English). Our original expectation was that a Long Short Term Memory classifier trained on English data with bilingual word embeddings would internalize English word order, resulting in poor performance when tested on a target language with different word order. We hypothesized that the more the word order of any of our target languages resembles the one of our source language, the better the overall performance of our sentiment classifier would be when analyzing the target language. We tested five sets of transformation rules for our Part of Speech reorderings of Spanish and Catalan, extracted mainly from two sources: two papers by Crego and Mariño (2006a and 2006b) and our own empirical analysis of two corpora: CoStEP and Tatoeba. The results suggest that the bilingual word embeddings that we are training our Long Short Term Memory model with do not improve any English word order learning by part of the model when used cross-lingually. There is no improvement when reordering the Spanish and Catalan texts so that their word order more closely resembles English, and no significant drop in result score even when applying a random reordering to them making them almost unintelligible, neither when classifying between 2 options (positive-negative) nor between 4 (strongly positive, positive, negative, strongly negative).
    [Show full text]
  • International Computer Science Institute Activity Report 2007
    International Computer Science Institute Activity Report 2007 International Computer Science Institute, 1947 Center Street, Suite 600, Berkeley, CA 94704-1198 USA phone: (510) 666 2900 (510) fax: 666 2956 [email protected] http://www.icsi.berkeley.edu PRINCIPAL 2007 SPONSORS Cisco Defense Advanced Research Projects Agency (DARPA) Disruptive Technology Offi ce (DTO, formerly ARDA) European Union (via University of Edinburgh) Finnish National Technology Agency (TEKES) German Academic Exchange Service (DAAD) Google IM2 National Centre of Competence in Research, Switzerland Microsoft National Science Foundation (NSF) Qualcomm Spanish Ministry of Education and Science (MEC) AFFILIATED 2007 SPONSORS Appscio Ask AT&T Intel National Institutes of Health (NIH) SAP Volkswagen CORPORATE OFFICERS Prof. Nelson Morgan (President and Institute Director) Dr. Marcia Bush (Vice President and Associate Director) Prof. Scott Shenker (Secretary and Treasurer) BOARD OF TRUSTEES, JANUARY 2008 Prof. Javier Aracil, MEC and Universidad Autónoma de Madrid Prof. Hervé Bourlard, IDIAP and EPFL Vice Chancellor Beth Burnside, UC Berkeley Dr. Adele Goldberg, Agile Mind, Inc. and Pharmaceutrix, Inc. Dr. Greg Heinzinger, Qualcomm Mr. Clifford Higgerson, Walden International Prof. Richard Karp, ICSI and UC Berkeley Prof. Nelson Morgan, ICSI (Director) and UC Berkeley Dr. David Nagel, Ascona Group Prof. Prabhakar Raghavan, Stanford and Yahoo! Prof. Stuart Russell, UC Berkeley Computer Science Division Chair Mr. Jouko Salo, TEKES Prof. Shankar Sastry, UC Berkeley, Dean
    [Show full text]
  • Le Rattachement D'avignon Et Du Combat Vénaissin À La France
    Le rattachement d’Avignon et du Combat Vénaissin à la France. Approche juridique : 1789-1791 Jean-Jacques Clère To cite this version: Jean-Jacques Clère. Le rattachement d’Avignon et du Combat Vénaissin à la France. Approche juridique : 1789-1791. Annales historiques de la Révolution française, Armand Colin, 1992, pp.587. halshs-00203206 HAL Id: halshs-00203206 https://halshs.archives-ouvertes.fr/halshs-00203206 Submitted on 10 Apr 2017 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Annales historiques de la Révolution française Le rattachement d'Avignon et du Comtat à la France : approche juridique (1789-1791) Jean-Jacques Clere Citer ce document / Cite this document : Clere Jean-Jacques. Le rattachement d'Avignon et du Comtat à la France : approche juridique (1789-1791). In: Annales historiques de la Révolution française, n°290, 1992. pp. 571-587; doi : 10.3406/ahrf.1992.1525 http://www.persee.fr/doc/ahrf_0003-4436_1992_num_290_1_1525 Document généré le 14/03/2016 Résumé Jean- Jacques Clere, Le Rattachement d'Avignon et du Comtat à la France : approche juridique (1789- 1791). L'auteur propose une relecture, sous le regard du juriste, du débat qui s'est tenu à la Constituante, sur le problème du rattachement à la France d'Avignon et du Comtat Venaissin.
    [Show full text]
  • The Iafor European Conference Series 2014 Ece2014 Ecll2014 Ectc2014 Official Conference Proceedings ISSN: 2188-1138
    the iafor european conference series 2014 ece2014 ecll2014 ectc2014 Official Conference Proceedings ISSN: 2188-1138 “To Open Minds, To Educate Intelligence, To Inform Decisions” The International Academic Forum provides new perspectives to the thought-leaders and decision-makers of today and tomorrow by offering constructive environments for dialogue and interchange at the intersections of nation, culture, and discipline. Headquartered in Nagoya, Japan, and registered as a Non-Profit Organization 一般社( 団法人) , IAFOR is an independent think tank committed to the deeper understanding of contemporary geo-political transformation, particularly in the Asia Pacific Region. INTERNATIONAL INTERCULTURAL INTERDISCIPLINARY iafor The Executive Council of the International Advisory Board IAB Chair: Professor Stuart D.B. Picken IAB Vice-Chair: Professor Jerry Platt Mr Mitsumasa Aoyama Professor June Henton Professor Frank S. Ravitch Director, The Yufuku Gallery, Tokyo, Japan Dean, College of Human Sciences, Auburn University, Professor of Law & Walter H. Stowers Chair in Law USA and Religion, Michigan State University College of Law Professor David N Aspin Professor Emeritus and Former Dean of the Faculty of Professor Michael Hudson Professor Richard Roth Education, Monash University, Australia President of The Institute for the Study of Long-Term Senior Associate Dean, Medill School of Journalism, Visiting Fellow, St Edmund’s College, Cambridge Economic Trends (ISLET) Northwestern University, Qatar University, UK Distinguished Research Professor of Economics,
    [Show full text]
  • Kernerman Kdictionaries.Com/Kdn DICTIONARY News the European Network of E-Lexicography (Enel) Tanneke Schoonheim
    Number 22 ● July 2014 Kernerman kdictionaries.com/kdn DICTIONARY News The European Network of e-Lexicography (ENeL) Tanneke Schoonheim On October 11th 2013, the kick-off meeting of the European production and reception of dictionaries. The internet offers Network of e-Lexicography (ENeL) project took place in entirely new possibilities for developing and presenting Brussels. This meeting was the outcome of an idea ventilated dictionary information, such as with the integration of sound, a year and a half earlier, in March 2012 in Berlin, at the maps or video, and various novel ways of interacting with European Workshop on Future Standards in Lexicography. dictionary users. For editors of scholarly dictionaries the new The workshop participants then confirmed the imperative to medium is not only a source of inspiration, it also generates coordinate and harmonise research in the field of (electronic) new and serious challenges that demand cooperation and lexicography across Europe, namely to share expertise relating standardization on various levels: to standards, discuss new methodologies in lexicography that a. Through the internet scholarly dictionaries can potentially fully exploit the possibilities of the digital medium, reflect on reach large audiences. However, at present scholarly the pan-European nature of the languages of Europe and attain dictionaries providing reliable information are often not easy a wider audience. to find and are hard to decode for a non-academic audience; A proposal was written by a team of researchers from
    [Show full text]
  • A Resource of Corpus-Derived Typed Predicate Argument Structures for Croatian
    CROATPAS: A Resource of Corpus-derived Typed Predicate Argument Structures for Croatian Costanza Marini Elisabetta Ježek University of Pavia University of Pavia Department of Humanities Department of Humanities costanza.marini01@ [email protected] universitadipavia.it Abstract 2014). The potential uses and purposes of the resource range from multilingual The goal of this paper is to introduce pattern linking between compatible CROATPAS, the Croatian sister project resources to computer-assisted language of the Italian Typed-Predicate Argument learning (CALL). Structure resource (TPAS1, Ježek et al. 2014). CROATPAS is a corpus-based 1 Introduction digital collection of verb valency structures with the addition of semantic Nowadays, we live in a time when digital tools type specifications (SemTypes) to each and resources for language technology are argument slot, which is currently being constantly mushrooming all around the world. developed at the University of Pavia. However, we should remind ourselves that some Salient verbal patterns are discovered languages need our attention more than others if following a lexicographical methodology they are not to face – to put it in Rehm and called Corpus Pattern Analysis (CPA, Hegelesevere’s words – “a steadily increasing Hanks 2004 & 2012; Hanks & and rather severe threat of digital extinction” Pustejovsky 2005; Hanks et al. 2015), (2018: 3282). According to the findings of initiatives such as whereas SemTypes – such as [HUMAN], [ENTITY] or [ANIMAL] – are taken from a the META-NET White Paper Series (Tadić et al. shallow ontology shared by both TPAS 2012; Rehm et al. 2014), we can state that and the Pattern Dictionary of English Croatian is unfortunately among the 21 out of 24 Verbs (PDEV2, Hanks & Pustejovsky official languages of the European Union that are 2005; El Maarouf et al.
    [Show full text]
  • Language Transmission in France in the Course of the 20Th Century
    https://archined.ined.fr Language transmission in France in the course of the 20th century François Héran, Alexandra Filhon et Christine Deprez Version Libre accès Licence / License CC Attribution - Pas d'Œuvre dérivée 4.0 International (CC BY- ND) POUR CITER CETTE VERSION / TO CITE THIS VERSION François Héran, Alexandra Filhon et Christine Deprez, 2002, "Language transmission in France in the course of the 20th century", Population & Societies: 1-4. Disponible sur / Availabe at: http://hdl.handle.net/20.500.12204/AXCwcStFDxhNsJusqTx6 POPU TTIION &SOCIÉTTÉÉS No.FEBRU ARY37 20062 Language transmission in France in the course of the 20th century François Héran *, Alexandra Filhon ** and Christine Deprez *** “French shall be the only language of education”, pro- Institute of Linguistics’ languages of the world data- claimed the Ministerial Order of 7 June 1880 laying base [4]. The ten most frequently recalled childhood down the model primary school regulations. “The lan- languages are cited by two thirds of the respondents, guage of the Republic is French”, recently added arti- while a much larger number are recalled by a bare cle 2 of the Constitution (1992). But do families follow handful. the strictures of state education and institutions? What The Families Survey distinguishes languages were the real linguistic practices of the population of “usually” spoken by parents with their children (fig- France in the last century? Sandwiched between the ure 1A) from those which they “also” spoke to them, monopoly of the national language and
    [Show full text]
  • UC Berkeley UC Berkeley Electronic Theses and Dissertations
    UC Berkeley UC Berkeley Electronic Theses and Dissertations Title The Gascon Énonciatif System: Past, Present, and Future. A study of language contact, change, endangerment, and maintenance Permalink https://escholarship.org/uc/item/12v9d1gx Author Marcus, Nicole Publication Date 2010 Peer reviewed|Thesis/dissertation eScholarship.org Powered by the California Digital Library University of California The Gascon Énonciatif System: Past, Present, and Future A study of language contact, change, endangerment, and maintenance by Nicole Elise Marcus A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor of Philosophy in Linguistics in the Graduate Division of the University of California, Berkeley Committee in charge: Professor Gary Holland, Chair Professor Leanne Hinton Professor Johanna Nichols Fall 2010 The Gascon Énonciatif System: Past, Present, and Future A study of language contact, change, endangerment, and maintenance © 2010 by Nicole Elise Marcus Abstract The Gascon Énonciatif System: Past, Present, and Future A study of language contact, change, endangerment, and maintenance by Nicole Elise Marcus Doctor of Philosophy in Linguistics University of California, Berkeley Professor Gary Holland, Chair The énonciatif system is a defining linguistic feature of Gascon, an endangered Romance language spoken primarily in southwestern France, separating it not only from its neighboring Occitan languages, but from the entire Romance language family. This study examines this preverbal particle system from a diachronic and synchronic perspective to shed light on issues of language contact, change, endangerment, and maintenance. The diachronic source of this system has important implications regarding its current and future status. My research indicates that this system is an ancient feature of the language, deriving from contact between the original inhabitants of Gascony, who spoke Basque or an ancestral form of the language, and the Romans who conquered the region in 56 B.C.
    [Show full text]
  • Franco-American History by Fecteau
    COMMUNITY OF WATERVILLE, MAINE BY ALBERT C. FECTEAU B.A., St. Michael's College, 1951 A THESIS Submitted in Partial Fulfillment of the Requirements for the Degree of Master of Arts (in History) Division of Graduate Study University of Maine Orono June, 1952 June, 1952 M.A. History THE FRENCH CANADIAN COMMUNITY OF WATERVILLE, MAINE Albert C. Fecteau ABSTRACT In the early 1800's, there was a slow migratory movement from Lower Canada into the United States. The major causes for this flow of immigrants were first: a flourishing Republic adjacent to the Canadian border which offered economic opportunity; second: the existing colonial dissatisfaction or ethnic difficulties between the French Canadians and the British rulers; and third: the dismal economic outlook within Canada. Most of Waterville's early French Canadian immigrants were led to this locality by one or all of these motives. They came a few at a time and some occasionally returned to their Canadian farms or villages. Several developments quickened the migration. The first was the building of the `Kennebec Road' in 1830. Twenty-five years later, the construction of railroads made travel ever easier. Then came the Civil War in 1861 with its many offers of employment. However, it was the building of the cotton mills in 1874 and in 1882 that attracted the greatest number of French Canadian immigrants into Waterville. These people, generally poor and largely illiterate, suffered many hardships. Many of these were common to industrial communities of the Northeastern section of the United States in a period when laisson- faire was the prevailing philosophy.
    [Show full text]
  • Trimestriel Ete 2010 Sommaire
    TRIMESTRIEL ETE 2010 SOMMAIRE ÉDITO P04 Vie communale P33 Vie scolaire, Petite enfance, Jeunesse Mairie annexe Chers Amis Biotois, GUPII Les Bachettes Vous avez entre vos mains la nouvelle formule de Biot Infos. La maquette, plus aérée, La crèche familiale rend votre magazine municipal encore plus agréable à lire. Parmi les principales Complexe sportif évolutions, nous avons voulu valoriser l’actualité de votre ville avec une nouvelle mise en page offrant une meilleure place à l’image, et un rédactionnel enrichi grâce des Combes une typographie fine. Les fondements de votre Biot Infos ne changent pas : vous retrouverez vos rubriques habituelles et les espaces réservés aux associations. La parution désormais trimestrielle, calquée sur les saisons, nous permet de déployer en parallèle des supports de communication plus réactifs, tels que notre site internet ou la prochaine lettre d’information électronique, pour laquelle vous pouvez dès à présent vous inscrire sur www.biot.fr. Parmi les sept grands projets de la municipalité présentés lors des réunions publiques, j’en soulignerai deux ici. Le premier est ouvert au public depuis le 16 juin : il s’agit de votre nouvelle Mairie Annexe, implantée au cœur du quartier sophipolitain de P47 Etat-civil Saint-Philippe, dont l’accueil est en continu de 9h à 17h. Dès septembre, la Police Autorisation de Municipale prendra possession de l’étage inférieur dans des locaux neufs et modernes, tandis qu’une annexe sera maintenue au centre historique. Le second projet concerne sortie de territoire le complexe sportif des Combes, pour lequel le jury de concours a désigné l’équipe de maîtrise d’œuvre.
    [Show full text]