Moldavian Vs. Romanian Cross-Dialect Topic Identification

Total Page:16

File Type:pdf, Size:1020Kb

Moldavian Vs. Romanian Cross-Dialect Topic Identification SC-UPB at the VarDial 2019 Evaluation Campaign: Moldavian vs. Romanian Cross-Dialect Topic Identification Cristian Onose, Dumitru-Clementin Cercel and Stefan Trausan-Matu Faculty of Automatic Control and Computers University Politehnica of Bucharest, Romania fonose.cristian, [email protected], [email protected] Abstract et al., 2017, 2018). This year (Zampieri et al., 2019), the problem of discriminating between Ro- This paper describes our models for the Mol- manian and Moldavian dialects was introduced as davian vs. Romanian Cross-Topic Identifica- a series of three subtasks. It involves the pro- tion (MRC) evaluation campaign, part of the cessing of the MOROCO dataset (Butnaru and VarDial 2019 workshop. We focus on the three subtasks for MRC: binary classification Ionescu, 2019) to construct several language clas- between the Moldavian (MD) and the Roma- sification models. The dataset contains text sam- nian (RO) dialects and two cross-dialect multi- ples from online news outlets in the Romanian class classification between six news topics, (RO) language or the Moldavian (MD) dialect. All MD to RO and RO to MD. We propose several the subtasks are closed, meaning that the use of ex- deep learning models based on long short-term ternal datasets is not allowed. Additionally, inter- memory cells, Bidirectional Gated Recurrent nal data, available for the MRC subtasks, must not Unit (BiGRU) and Hierarchical Attention Net- works (HAN). We also employ three word em- be used between tasks. Thus, the first subtask is a bedding models to represent the text as a low binary classification between the two dialects. The dimensional vector. Our official submission second subtask involves a cross-dialect multi-class includes two runs of the BiGRU and HAN classification between six topics. More precisely, models for each of the three subtasks. The the classifier is trained using Moldavian dialect in best submitted model obtained the following order to classify samples from the Romanian di- macro-averaged F1 scores: 0.708 for subtask alect. The third subtask is similar to the second 1, 0.481 for subtask 2 and 0.480 for the last one, here the use of the dialects is reversed. one. Due to a read error caused by the quoting behaviour over the test file, our final submis- Generally, such tasks are approached using tra- sions contained a smaller number of items than ditional machine learning algorithms, which un- expected. More than 50% of the submission fortunately require handcrafted features. Recently, files were corrupted. Thus, we also present the deep learning methods, where features are learned results obtained with the corrected labels for from the data, have been proposed (Ali, 2018). To which the HAN model achieves the following address the MRC shared task, we propose the use results: 0.930 for subtask 1, 0.590 for subtask of three state of the art deep learning architectures 2 and 0.687 for the third one. for text classification: Long Short-Term Mem- 1 Introduction ory cells (LSTM) (Hochreiter and Schmidhuber, 1997), Bidirectional Gated Recurrent Unit (Bi- The task of discriminating between two dialects GRU) (Graves and Schmidhuber, 2005) and Hier- or different languages is a popular research topic archical Attention Networks (HAN) (Yang et al., which has attracted a lot of interest from the 2016). The submission results are based only on research community. Specifically, the VarDial BiGRU and HAN models for each of the three competition proposed in recent years a num- subtasks. After the competition deadline an er- ber of shared tasks on different languages such ror, caused by the quoting behaviour over the test as dialect identification for Arabic or German, file, was discovered. As a result our final submis- Indo-Aryan language identification, distinguish sions contained a smaller number of labels than between Mainland and Taiwan Mandarin or dis- expected, with approximately 50% of the files be- criminating between Dutch and Flemish (Zampieri ing corrupted. Thus, we present both the official 172 Proceedings of VarDial, pages 172–177 Minneapolis, MN, June 7, 2019 c 2019 Association for Computational Linguistics Subtask Training Validation Test ures from both countries, Romania or the Republic 1 21719 11845 5923 of Moldova, are anonymized. 2 9968 5435 5923 Besides the default processing, we also took ex- 3 11751 6410 5923 tra steps to clean up the dataset. Text usually con- tains expressions which carry little to no meaning, Table 1: Dataset sample distribution between training, thus, we choose to remove the following: stop validation and test for each of the three subtasks. words, special characters and punctuation marks all except end of sentence. Additionally, we re- submissions as well as later work, that includes move the named entity identifier as they interfere the correction of this problem. with the text representations. Another important The study of Romanian dialects was first ap- aspect is given by how we deal with the diacritics. proached by Ciobanu and Dinu(2016). They During our experiments we analyze their impact construct binary classifiers to distinguish between on the performance. Romanian and three dialects (Macedo-Romanian, Megleno-Romanian and Istro-Romanian) by ex- 3 Deep learning models ploring information provided by a set of 108 word In recent years, with the increasing availability pairs. Consequently, Butnaru and Ionescu(2019) of computational resources, deep neural networks proposed a first Moldavian and Romanian Dialec- became successful for classification and regres- tal Corpus (MOROCO) assembled from multi- sion problems (LeCun et al., 2015). At first, sim- ple news websites. On top of this dataset they ple feedforward networks were used. These net- construct several deep learning models for di- works lack loops or cycles and the information alect identification: Character level Convolutional moves only forward, from the input to the out- Neural Network (CharCNN) and an improvement put nodes. The switch to other types of rep- CNN model using squeeze and excitation blocks resentations, namely Recurrent Neural Networks (Hu et al., 2018). Additionally, they also investi- (RNNs), was made because of the need to map in- gate shallow string kernel methods (Ionescu et al., put and output nodes of varying types and sizes. 2016). They conclude that string kernels achieve best performance among the studied methods. Recurrent neural networks. RNNs are neural The remainder of this paper is organized as networks that form connections between nodes follows. In Section2 we briefly discuss the along a sequence. This allows the network to dataset for the three tasks. Section3 describes the exhibit internal memory with respect to the in- methodology behind our solution, while the exper- puts which in turn enables the prediction of fu- imental setup and the results are presented in Sec- ture steps. Due to this memory, RNNs are the pre- tion4. Finally, Section5 contains details regard- ferred method for processing sequential data such ing our conclusions. as time series, text or video. Unfortunately, RNNs can suffer from training instability, exploding and 2 Dataset vanishing gradients. The MOROCO dataset contains Moldavian and Long short-term memory. Long Short-Term Romanian samples collected from one of the fol- Memory (LSTM) units, introduced by Hochreiter lowing news categories: culture, finance, politics, and Schmidhuber(1997), are used in recurrent science, sports and technology. It is divided be- neural networks as a way to prevent vanishing or tween training, validation and test for each of the exploding gradients. The units allow the errors three tasks as described in Table1. The test set is to flow backwards through endless virtual layers combined for all the subtasks such that the labels which are unfolded in space. Besides the usual in- for the first task can not be inferred. This is neces- put and output gates, LSTM units are augmented sary because the second and the third subtasks are by recurrent gates called forget gates which regu- based entirely on just one of the dialects. late the movement of information through the cell The data samples are provided preprocessed by (Gers et al., 2000). replacing named entities, which could act as biases Gated recurrent unit. Similar to LSTM, the for the classifiers, with a special identifier: $NE$. Gated Recurrent Unit (GRU) was introduced by For instance, city names or important public fig- Cho et al.(2014) as a method to solve the van- 173 Name Vector size Min. word count Unique tokens Diacritics Training algorithm CoRoLa 300 20 250942 Yes FastText NLPL 100 10 2153518 No Word2Vec Skipgram CC 300 - 2000000 Yes FastText Table 2: Word embeddings: statistics regarding training methods and dataset/parameters details. ishing gradient problem that occurs when using fiers. In order to achieve this, we rely on a number standard RNNs. These types of units are closely of pretrained word vector models: Romanian Lan- related to LSTM having similar performance and guage Corpus (CoRoLa) introduced by Mititelu design. The GRU layers are popular due their et al.(2018), Nordic Language Processing Labo- simpler structure, which results in faster training ratory (NLPL) word embedding repository (Kutu- time. A bidirectional extension for such recurrent zov et al., 2017) and Common Crawl (CC) word layers was proposed by Graves and Schmidhuber vectors (Grave et al., 2018). The relevant details (2005). It connects two hidden layers of opposite for each word vector representation model can be directions in a backward and forward manner to viewed in Table2. the same output. This is useful for text process- LSTM and BiGRU Models. The input for the ing since it can encode the context present in such RNN flavour models is computed by taking the structures: characters and words. mean of all word embeddings present in the text. Hierarchical Attention Networks. Hierarchi- Missing words are considered zero valued vectors. cal Attention Networks (HAN) were introduced The result is a representation of the whole news for document classification by Yang et al.(2016).
Recommended publications
  • Aleksey A. ROMANCHUK ROMANIAN a CINSTI in the LIGHT of SOME
    2021, Volumul XXIX REVISTA DE ETNOLOGIE ȘI CULTUROLOGIE E-ISSN: 2537-6152 101 Aleksey A. ROMANCHUK ROMANIAN A CINSTI IN THE LIGHT OF SOME ROMANIAN-SLAVIC CONTACTS1 https://doi.org/10.52603/rec.2021.29.14 Rezumat определенное отражение следы обозначенного поздне- Cuvântul românesc a cinsti în lumina unor contacte праславянского диалекта (диалектов). В частности, к româno-slave таким следам, возможно, стоит отнести как украинские Pornind de la comparația dintre cuvântul ucrainean диалектные чандрий, шандрий, чендрий, так и диалект- частувати ‚a trata’ și românescul a cinsti‚ a trata (cu vin), ное (зафиксировано в украинском говоре с. Булэешть) / a bea vin, se consideră un grup de împrumuturi slave cu мон|золетеи/ ‘мусолить; впустую теребить’. /n/ epentetic în limba română, căreia îi aparține și a cin- Ключевые слова: славяне, румыны, лексические sti. Interpretând corpul de fapte disponibil, putem presu- заимствования, этническая история, украинские диа- pune că convergență semantică dintre cuvintele честь и лекты, Молдова, Буковина. угощение a apărut în perioada slavică timpurie. Cuvântul românesc a cinsti este un argument important pentru da- Summary tarea timpurie a apariției acestei convergențe semantice. Romanian a cinsti in the light of some Așadar, cuvântul ucrainean частувати, ca și cuvântul po- Romanian-Slavic contacts lonez częstowac, apar independent unul de celălalt, la fel A group of Slavic loanwords with epenthetic /n/ in the ca și de cuvântul românesc a cinsti. Cuvântul românesc a Romanian language, to which a cinsti belongs, is consid- cinsti, ca și, în general, grupul menționat de împrumuturi ered. Interpreting the existing set of facts, the author sup- slave cu /n/ epentetic în limba română, reprezintă un rezul- poses that the semantic convergence between the Slav- tat al contactelor timpurii ale limbii române cu un dialect ic честь ‘honour’ and угощение ‘treat’ appeared as far back (dialecte) slav vechi, pentru care era caracteristică tendința as the Late Slavic period.
    [Show full text]
  • The Moldavian and Romanian Dialectal Corpus
    MOROCO: The Moldavian and Romanian Dialectal Corpus Andrei M. Butnaru and Radu Tudor Ionescu Department of Computer Science, University of Bucharest 14 Academiei, Bucharest, Romania [email protected] [email protected] Abstract In this work, we introduce the Moldavian and Romanian Dialectal Corpus (MOROCO), which is freely available for download at https://github.com/butnaruandrei/MOROCO. The corpus contains 33564 samples of text (with over 10 million tokens) collected from the news domain. The samples belong to one of the following six topics: culture, finance, politics, science, sports and tech. The data set is divided into 21719 samples for training, 5921 samples for validation and another 5924 Figure 1: Map of Romania and the Republic of samples for testing. For each sample, we Moldova. provide corresponding dialectal and category labels. This allows us to perform empirical studies on several classification tasks such as Romanian is part of the Balkan-Romance group (i) binary discrimination of Moldavian versus that evolved from several dialects of Vulgar Romanian text samples, (ii) intra-dialect Latin, which separated from the Western Ro- multi-class categorization by topic and (iii) mance branch of languages from the fifth cen- cross-dialect multi-class categorization by tury (Coteanu et al., 1969). In order to dis- topic. We perform experiments using a shallow approach based on string kernels, tinguish Romanian within the Balkan-Romance as well as a novel deep approach based on group in comparative linguistics, it is referred to character-level convolutional neural networks as Daco-Romanian. Along with Daco-Romanian, containing Squeeze-and-Excitation blocks.
    [Show full text]
  • Politiche E Pianificazioni Linguistiche in Bessarabia: Romenità
    UNIVERSITÀ DEGLI STUDI DI UDINE Corso di Dottorato in Scienze Linguistiche e Letterarie Ciclo XXV TESI DI DOTTORATO DI RICERCA Politiche e pianificazioni linguistiche in Bessarabia: romenità, russificazione, moldovenismo Dottorando Alessandro Zuliani Relatori Prof. Fabiana Fusco Prof. Celestina Zenobia Fanella Anno Accademico 2012-2013 INTRODUZIONE Tesi di dottorato di Alessandro Zuliani, discussa presso l'Università degli Studi di Udine 2 La presente ricerca verte sulle politiche e le pianificazioni linguistiche che hanno interessato la Bessarabia, regione europea che rappresenta l'estremità orientale della Romània continua, già parte del Principato di Moldavia e che oggi coincide pressappoco con i confini della Repubblica di Moldavia, stato nato dalla dissoluzione dell'Unione Sovietica nel 1991. Nel corso del nostro studio siamo partiti dal 1812, data di annessione della Bessarabia all’Impero russo, e abbiamo cercato di ripercorrere alcune tappe importanti che hanno portato all’attuale realtà sociolinguistica della Repubblica di Moldavia. Soffermandoci sulle politiche linguistiche della Russia zarista e sulle pianificazioni linguistiche sovietiche, abbiamo rilevato l'iniziale processo di profonda russificazione subito dalla popolazione autoctona dalla Bessarabia, cui è seguito il tentativo, in parte riuscito, di creare un nuovo popolo e un nuovo idioma. La cosiddetta lingua moldava altro non è che l'espressione della volontà di separare, anche linguisticamente, i romeni della Bessarabia dal resto della Romania, di fatto sancendo l'esistenza di due nazionalità e di due idiomi ben distinti. L’unità etnolinguistica romena dell’area compresa tra i fiumi Tibisco, Danubio, Dniestr e il Mar Nero, già affermata dagli storici e dai cronicari a partire dal XVI secolo, viene dunque messa in discussione e avversata in modo esplicito dalle tesi del moldovenismo, fenomeno linguistico e culturale incentrato sulla differenziazione etnica e linguistica tra moldavi e romeni.
    [Show full text]
  • MOROCO: the Moldavian and Romanian Dialectal Corpus
    MOROCO: The Moldavian and Romanian Dialectal Corpus Andrei M. Butnaru and Radu Tudor Ionescu Department of Computer Science, University of Bucharest 14 Academiei, Bucharest, Romania [email protected] [email protected] Abstract In this work, we introduce the Moldavian and Romanian Dialectal Corpus (MOROCO), which is freely available for download at https://github.com/butnaruandrei/MOROCO. The corpus contains 33564 samples of text (with over 10 million tokens) collected from the news domain. The samples belong to one of the following six topics: culture, finance, politics, science, sports and tech. The data set is divided into 21719 samples for training, 5921 samples for validation and another 5924 Figure 1: Map of Romania and the Republic of samples for testing. For each sample, we Moldova. provide corresponding dialectal and category labels. This allows us to perform empirical studies on several classification tasks such as Romanian is part of the Balkan-Romance group (i) binary discrimination of Moldavian versus that evolved from several dialects of Vulgar Romanian text samples, (ii) intra-dialect Latin, which separated from the Western Ro- multi-class categorization by topic and (iii) mance branch of languages from the fifth cen- cross-dialect multi-class categorization by tury (Coteanu et al., 1969). In order to dis- topic. We perform experiments using a shallow approach based on string kernels, tinguish Romanian within the Balkan-Romance as well as a novel deep approach based on group in comparative linguistics, it is referred to character-level convolutional neural networks as Daco-Romanian. Along with Daco-Romanian, containing Squeeze-and-Excitation blocks.
    [Show full text]
  • The Csang6 in Ghime~-Faget: Boundary and Ethnic Definition in Transylvania
    The Csang6 in Ghime~-Faget: Boundary and Ethnic Definition in Transylvania An Honors Thesis (HONRS 499) by Lydia Spotts Thesis Advisor Dr. James M. Nyce Ball State University Muncie, Indiana December 2008 December 20, 2008 Spotts ii Abstract: L A short ethnographic field study in Transylvania led to interesting questions about self­ identification in a traditionally multi-ethnic region-one that has been contested for most of the region's history. The complicated and dichotomous accounts of Transylvania's history and the development of national consciousness in the two nations (Hungary and Romania) who claim the region are examined. This framework contextualizes the issues surrounding an enigmatic and equally contested ethnic minority found in the Ghime~-Faget area of Transylvania. The Csang6 have been rejected and claimed by both nations throughout history, but lack internal definition and presently do not often self-identify in society. The information gajned through field study observation and interviews is analyzed following a discussion of the historical contestation of Transylvania and the Csang6 found in the English literature. Acknowledgements: I would like to thank Dr. Nyce for his patience and assistance during the months of research and writing process. I am also grateful to Dr. Bader and the members of the 2008 Ghime~-Faget field study who submitted field notes and interview transcriptions, making them available in the common corpus. Spotts iii Outline 1. Introduction: a field study in Transylvania II. A brief history of the contested region of Transylvania A. Geography of Transylvania and Ghime~-Faget B. A transitory border I. Ancient occupation to the medieval era 2.
    [Show full text]
  • The Modernization of the Romanian Standard Language According to G
    article doi:10.17684/i11A162en DIACRONIA ISSN: 2393-1140 Impavidi progrediamur! www.diacronia.ro The modernization of the Romanian standard language according to G. Ivănescu Cătălin Nicolau? Faculty of Letters, “Alexandru Ioan Cuza” University, Bd. Carol I 11, 700506 Iași, Romania Article info Abstract History: Our survey synthetically exposes G. Ivănescu’smain ideas with respect to the no- Received April 15, 2020 tion of standard language in general and its modernization process in particular. Accepted May 6, 2020 This undertaking is a component of a larger research of Ivănescu’s diachronic Published June 10, 2020 conception, which I have worked on for the last few years. As a result, the concept of standard language is approached here from a holistic perspective, that Key words: of the entire history of Romanian language, which, after all, it actually belongs written dialect to. This is the reason why some elements of the written language are related to literary language the history of the spoken language and the principles of its research. modernization Our aim is pointing out the originality of this conception, which is diver- rhotacized text gent from that of the linguists in Bucharest, against a background characterized by a decreasing interest in in diachronic linguistics as compared to that in syn- chronic studies. Ivănescu succeeds in developing his original linguistic theory, relying on historical and socio‐linguistic supporting arguments, which deepens, adds up to, and sometimes even contradicts the ideas of both his forerunners and his contemporaries. 1. Introduction – the notion of standard language In this survey, we aim at exposing the way in which G.
    [Show full text]
  • Credo Reference, Url=
    Romanian Americans from Gale Encyclopedia of Multicultural America View article on Credo OVERVIEW Romanian Americans are immigrants or descendants of people from Romania, an Eastern European country on the Black Sea. Romania is bordered by Ukraine to the north; Moldova to the northeast; Bulgaria to the south; and Hungary, Serbia, and Montenegro to the west. The Carpathian Mountains dominate the Romanian landscape, and the Danube River flows through southern Romania before entering the Black Sea. Slightly smaller than the state of Oregon, Romania measures 91,699 square miles (237,500 square kilometers). In 2011, according to the National Statistics Institute of Romania, Romania had a population of slightly over 19 million. This is a decrease of more than 2 million since 2002. The decline is a result of hundreds of thousands of Romanians leaving the country to find jobs and settling in various European countries, including Spain, Italy, France, Germany, and Austria, as well as in Israel and the United States. The overwhelming majority of Romanian citizens—88.8 percent—are members of the Romanian Orthodox Church. Approximately 10 percent are Catholics of the Byzantine rite, and there are also small groups of Protestants, Muslims, and Jews. Like many other former members of the Soviet bloc, the Romanian economy is in the midst of transition from communism to capitalism, which has led to uneven growth and has left the country with a lower-middle-income economy. However, the country is rich in natural resources and has a large agricultural sector. Romanians began to arrive in the United States in significant numbers in the late nineteenth and early twentieth centuries, settling mainly in urban areas and working in factories.
    [Show full text]
  • Revista Hiperboreea
    RREEVVIISSTTAA HHIIPPEERRBBOORREEEEAA Journal of History Vol. I, No. 2 December 2014 București 2014 RREEVVIISSTTAA HHIIPPEERRBBOORREEEEAA Journal of History Vol. I, No. 2 December 2014 București 2014 SCIENTIFIC BOARD Professor Ecaterina Lung (University of Bucharest) Professor Anca Irina Ionescu (University of Bucharest) Associate Professor Marian Petcu (University of Bucharest) Assistant Professor Luminiţa Diaconu (University of Bucharest) Assistant Professor Ioana Munteanu (University of Bucharest) Assistant Dean & Director of the Law Library Radu D. Popa (New York University School of Law) EDITORIAL BOARD Editor-in-Chief: Mihai Dragnea Editors: Mirela Nazîru, Raluca Anca Bratu, Irina Trușcă Hiperboreea is an online scholarly journal that publishes studies and articles from the field of History as well as book reviews and evaluations of scholarly conferences. Without limiting the scope of the research to a particular geographical area or historical period, the journal covers a wide range of interdisciplinary topics related to Cultural History, Political History, Military History, Social History, Economic History and Archeology. Since its first publication in 2012, Hiperboreea has a monthly periodicity (January 2012 – March 2013) and quarterly publication (until the end of 2013). Since 2014, the journal appears biannually. Hiperboreea journal is indexed in international databases such as Index Copernicus, EBSCO, ProQuest, Ulrich, Directory of Research Journals Indexing, Journal Impact Factor, Genamics Journal Seek and AWOL. All articles published in Hiperboreea Open Access journal are peer reviewed and upon acceptance are permanently free for everyone to read, print, distribute and download. The method of reviewing is Open peer review in which reviewers are aware of the identity of the authors, and authors are also aware of the identity of reviewers.
    [Show full text]
  • The Csángó Minority Culture in Moldavia
    PARLIAMENTARY ASSEMBLY ASSEMBLEE PARLEMENTAIRE Council of Europe / Conseil de l’ Europe F-67075 Strasbourg Cedex Tel : +33 (0)3 88 41 20 00 Fax : +33 (0)3 88 41 27 76 E-mail : [email protected] http://stars.coe.fr AS/Cult (2001) 06 2 March 2001 Or. English The Csángó Minority Culture in Moldavia Preliminary draft report Committee on Culture, Science and Education Rapporteur: Mrs Tytti Isohookana-Asunmaa, Finland Contents Paragraphs I. Introduction ........................................................................................................1-2 II. Who are the Csángós........................................................................................3-11 III. Historical background ....................................................................................12-13 IV. The language of the Csángós .........................................................................14-18 V. Folklore and popular ornamental art ..............................................................19-22 VI. The religious aspect........................................................................................23-26 VII. Education issues .............................................................................................27-31 VIII. Practical proposals for the preservation of the Csángó culture...........................32 2 I. INTRODUCTION 1. The term “Csángó” (Ceangai in Romanian) is used to identify a non-homogeneous group of Roman Catholic people of Hungarian origin living in Moldavia. This ethnic group is a relic from the Middle Ages
    [Show full text]
  • Swedish Journal of Romanian Studies
    “1 Decembrie 1918” University of Alba Iulia Lund University Romanian Language Institute, Bucharest SWEDISH JOURNAL OF ROMANIAN STUDIES Vol. 4 No 1 (2021) ISSN 2003-0924 Cover: Florian Rares Tileaga, travel journalist & photographer, film and theatre critic, ISIA ski instructor, Web: www.pasiliberi.ro The cover photo was taken on April 20th, 2011 and represents the window to the reconstructed house of Fefeleaga in Bucium, Alba County, Romania SWEDISH JOURNAL OF ROMANIAN STUDIES Vol. 4 No 1 (2021) ISSN 2003-0924 SWEDISH JOURNAL OF ROMANIAN STUDIES Table of Contents Editorial ………………………………………………………………. 7 Introduction for contributors to Swedish Journal of Romanian Studies ………………………………………………………………… 10 Literature Alina Buzatu Genul (literar/ discursiv). Pagini de istorie conceptuală (I) / (Literary /discursive) genre. Pages of conceptual history (I) ……………………. 13 Lucreţia Pascariu „Astra. Roman epistolar” de Dito şi Idem – un caz de istorie literară/ “Astra. Epistolary novel” by Dito and Idem – a case of literary history … 26 Dan Horaţiu Popescu Pentru o istorie a P.E.N. Club-ului Român: 3/ For a History of the Romanian P.E.N. Club: 3 ........................................................................ 38 Brânduşa Juică, Virginia Popović, Marinel Negru Banatul, temă lirică în literatura română din Voivodina/ Banat as a lyric theme in Romanian literature in Vojvodina ……………………... 50 Cristina Hermeziu Le désir de langue. Les dimensions poétique et politique de la diglossie dans le roman « Le jardin de verre » de Tatiana Țîbuleac/ The desire for language. The poetic and political dimensions of diglossia in the novel “The Glass Garden” by Tatiana Țîbuleac ……… 64 Bianca-Maria Bucur Tincu The concept of bovarism illustrated by a postmodern prototype: Gheorghe Crăciun’s Pupa Russa ……………………………………… 75 Lucian Vasile Bâgiu Amândoi by Liviu Rebreanu: Multicultural settings and sources of suspense ……………………………………………………………….
    [Show full text]
  • Me XXVI-1988 N° 3 (Juillet-Septembre) REPUBLICII SOCIALISTE ROMANIA
    ACADEMIE DFS SCIENCES SOCIALES ET POLMQUES INSTITUT D'ÉTUDES SUD-EST EUROPEENNES me XXVI-1988 N° 3 (Juillet-Septembre) Voyageurs et réalités sud-est européennes Mentalitéscollectives Relations linguistiques ED1TURA ACADEMIEI REPUBLICII SOCIALISTE ROMANIA www.dacoromanica.ro COMITÉ DE RÉDACTION ALEXANDRIT DUTITrddakteur responsable ; Membres du comitd :AL. ELIAN, VALENTIN AL. GEORGESCII, GHEORGHE I. IONITI, COSTIN MURGESCIT,D. M. PrPPIDI, MDIAI POP, AL. ROSETTI, ELENA SCAR- LITOIII, EUGEN STINESCU Se,crétaire du comité: LIDIA SIMEON La REVUE DES ATUDES SUD-EST EUROPÉENNES parait 4 tots par an. Toute commande de rétranger(fasciculesou abonnements) sera adressée A e Rompresfilatelia ., Departamentul Export-Import PresA, P.O. Box 12-201, télex 10376, Bucure§ti, prsfi r Calea Grivitei nr. 64-66 ou A ses représentants A rétranger. Le prix d'un abonnement est de 62 $ par an. La correspondance, les manuscrits et les publications (livres, revues, etc.) envoyés pour comptes rendus seront adressés A la REVUE DES ÈTUDES SUD-EST EUROPÉENNES CAsula po§talA 22.159, 71100 Bucure.gti Les article,s seront rem's dactylographiés en deux exemplaires. Les collaborateurs sont priés de ne pas dépasser les limites de 15-20 pages dactylographiées pour les articles et 5-6 pages pour les comptes rendus. EDITURA ACADEMIEI REPUI3LICII SOCIALISTE ROMANIA Calea Victoriel n° 125, téléphone 50 76 80 Bucure.gtiRomAnia www.dacoromanica.ro :UROPBE:1111ES TOME XXVI 1988 N° 3 JuilletSeptembre SOMMAIRE V oyageurs et réalites sud-est europeenn es MARIA HOLBAN, Autour du Journal inedit du Sieur de la Croix, II 187 GIUSEPPE PICCILLO (Università di Catania), La langue roumaine dans les écrits des missionnaires italiens XVII 1e siécles) 205 XeniaWes collectives EUGEN CIZEK, L'univers mental des Romains 215 Relations linguistiques ZAMF1RA MIHAIL, Quelques termes pour e métier à tisser *en perspective ethnolinguis- tique sud-est européenne 229 CATALINA VATMESCU, Les denominations des parties de l'instrument aratoire en roumain et en albanais 237 ELENA SCARLATOIU, Le mégléno-roumain parlé en Dobroudia.
    [Show full text]
  • Csángos in Moldavia
    Vilmos Tánczos: Csángós in Moldavia This study was prepared originally for the booklet titled "Magyar nemzeti kisebbségek Kelet- Közép-Európában" (Hungarian National Minorities in East-Central Europe) [1] (Published under the title "Hányan vannak a moldvai csángók?" in Magyar Kisebbség, No. 1-2, 1997. (III), Pages 370-390). Significantly expanded and modified version. 1. The Term "Csángó" The eastern province of Roumania, called Moldavia - namely in counties Bacau, Botosani, Iasi, Neamt, Vaslui and Vrancea - has nearly a quarter of a million (exactly 243133) Catholic inhabitants, on the basis of the data of the Roumanian population census in 1992. The Hungarian and the international literature in this subject unanimously agree that the Catholic population living in Moldavia - apart from the small proportion of assimilated people - elements of Roumanian, German, Polish, Italian and Gypsy origin - are of Hungarian origin. This fact is acknowledged also by significant Roumanian researchers. [2] Furthermore, it is also probable that a certain part of the Greek Orthodox Roumanian population of Moldavia also used to belong to the Hungarian ethnic group some time, however, we cannot speak of any scientific research that was carried out specially in this subject. [3] In the lack of adequate results of such investigations today we may only guess that the assimilation of the Hungarian Moldavian Catholics in the 16-18th century was not only of linguistic but of religious nature, as well: in certain villages the people lost not only their language but - upon the pressure of the Boyars and of the princes, and as a result of the lack of priests - their religion, as well.
    [Show full text]