Development of Text-To-Speech System for Latvian
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
VALTS ERNÇSTREITS (Riga) LIVONIAN ORTHOGRAPHY Introduction the Livonian Language Has Been Extensively Written for About
Linguistica Uralica XLIII 2007 1 VALTS ERNÇSTREITS (Riga) LIVONIAN ORTHOGRAPHY Abstract. This article deals with the development of Livonian written language and the related matters starting from the publication of first Livonian books until present day. In total four different spelling systems have been used in Livonian publications. The first books in Livonian appeared in 1863 using phonetic transcription. In 1880, the Gospel of Matthew was published in Eastern and Western Livonian dialects and used Gothic script and a spelling system similar to old Latvian orthography. In 1920, an East Livonian written standard was established by the simplification of the Finno-Ugric phonetic transcription. Later, elements of Latvian orthography, and after 1931 also West Livonian characteristics, were added. Starting from the 1970s and due to a considerable decrease in the number of Livonian mother tongue speakers in the second half of the 20th century the orthography was modified to be even more phonetic in the interest of those who did not speak the language. Additionally, in the 1930s, a spelling system which was better suited for conveying certain phonetic phenomena than the usual standard was used in two books but did not find any wider usage. Keywords: Livonian, standard language, orthography. Introduction The Livonian language has been extensively written for about 150 years by linguists who have been noting down examples of the language as well as the Livonians themselves when publishing different materials. The prime consideration for both of these groups has been how to best represent the Livonian language in the written form. The area of interest for the linguists has been the written representation of the language with maximal phonetic precision. -
The Incidence and Evolution of Palatalized Consonants in Latvian
City University of New York (CUNY) CUNY Academic Works All Dissertations, Theses, and Capstone Projects Dissertations, Theses, and Capstone Projects 5-2015 The Incidence and Evolution of Palatalized Consonants in Latvian Linda Zalite Graduate Center, City University of New York How does access to this work benefit ou?y Let us know! More information about this work at: https://academicworks.cuny.edu/gc_etds/1198 Discover additional works at: https://academicworks.cuny.edu This work is made publicly available by the City University of New York (CUNY). Contact: [email protected] THE INCIDENCE AND EVOLUTION OF PALATALIZED CONSONANTS IN LATVIAN by LINDA ZALITE A master’s thesis submitted to the Graduate Faculty in Linguistics in partial fulfillment of the requirements for the degree of Master of Arts, The City University of New York 2015 © 2015 Linda Zalite All Rights Reserved ii This manuscript has been read and accepted for the Graduate Faculty in Linguistics in satisfaction of the dissertation requirement for the degree of Master of Arts. Juliette Blevins Date Thesis Advisor Gita Martohardjono Date Executiv e Officer THE CITY UNIVERSITY OF NEW YORK iii Abstract The Incidence and Evolution of Palatalized Consonants in Latvian by Linda Zalite Advisor: Professor Juliette Blevins This thesis traces the evolution of the palatalized rhotic /rj/ in Baltic languages with focus on the continuation of this segment in Latvian and its recent neutralization with /r/. Historical, phonological, phonetic, and synchronic data is gathered as evidence to further our understanding of the Latvian palatalized rhotic and its near-disappearance in the 20th century. Previous typological works of Endzelīns (1922, 1951), Dini (1997), Rūķe-Draviņa (1994) and Ābele (1929) were considered intending to answer three central questions. -
Nominal Compounds in Old Latvian Texts in the 16Th and 17Th Centuries
!"#$ #%&"" ' ( ( )#"& !"# * + , #- #$ &* + , . &* + &. + + . + &* + . &* , + , & ( + . + & * . (+ + / ( ( ( ( & ( + & ,0&120&1& ( ,&3 + , . ( ,+ 4 & ( . + &3 4 ( + + (+ , &* ( . + &* ( + . , ( 5 & ( , & . ( / + + ( &6(/ + ( . + , & $ % !"#$ 722 && 2 8 9 77 7 7 ##: $" 3;<$ <#$-:< $!< 3;<$ <#$-:< $=- 3;"! #%:$ & '( # #( )*) & #) + (#"-<# NOMINAL COMPOUNDS IN OLD LATVIAN TEXTS IN THE 16TH AND 17TH CENTURIES Kristina Bukelskytė-Čepelė Nominal Compounds in Old Latvian Texts in the 16th and 17th Centuries Kristina Bukelskytė-Čepelė ©Kristina Bukelskytė-Čepelė, Stockholm University 2017 ISBN print 978-91-7649-872-9 ISBN PDF 978-91-7649-873-6 ISSN 0281-5478 Cover: "Die Statt Riga" in S. Münster’s "Cosmographia" (ca. 16th c) Printed in Sweden by Universitetsservice US-AB, Stockholm 2017 Distributor: Department of Slavic and Baltic Studies, Finnish, Dutch -
Comments on Cedilla and Comma Below (Revision 2)
Comments on cedilla and comma below (revision 2) Denis Moyogo Jacquerye <[email protected]> 2013-07-30 1. Summary The document provides comments on the recent discussion about cedilla and comma below and the proposal to add characters with invariant cedilla. With the evidence found, our conclusion is that the cedilla can have several forms, including that of the comma below or similar shapes. Adding characters with invariant cedilla would only increase the confusion and could lead to instable data as is the case with Romanian. The right solution to this problem is the same as for other preferred glyph variation: custom fonts or multilingual smart fonts, better application support for language tagging and font technologies. 2. Discussion In the Unicode Standard the cedilla and comma below are two different combining diacritical marks both inherited from previous character encodings. This distinction between those marks has not always been made, like in ISO 8859-2, or before computer encodings where they were used interchangibly based on the font style. The combining cedilla character is de facto a cedilla that can take several shapes or forms, and the combining comma below is a non contrastive character only representing a glyph variant of that cedilla as historically evidence shows. Following discussion about the Marshallese forms used in LDS publications on the mpeg-OT-spec mailing list, Eric Muller’s document (L2/13-037) raises the question of the differentiation between the cedilla and the comma below as characters in precomposed Latvian characters with the example of Marshallese that uses these but with a different preferred form. -
Overview and Rationale
Integration Panel: Maximal Starting Repertoire — MSR-4 Overview and Rationale REVISION – November 09, 2018 Table of Contents 1 Overview 3 2 Maximal Starting Repertoire (MSR-4) 3 2.1 Files 3 2.1.1 Overview 3 2.1.2 Normative Definition 3 2.1.3 Code Charts 4 2.2 Determining the Contents of the MSR 5 2.3 Process of Deciding the MSR 6 3 Scripts 7 3.1 Comprehensiveness and Staging 7 3.2 What Defines a Related Script? 8 3.3 Separable Scripts 8 3.4 Deferred Scripts 9 3.5 Historical and Obsolete Scripts 9 3.6 Selecting Scripts and Code Points for the MSR 9 3.7 Scripts Appropriate for Use in Identifiers 9 3.8 Modern Use Scripts 10 3.8.1 Common and Inherited 11 3.8.2 Scripts included in MSR-1 11 3.8.3 Scripts added in MSR-2 11 3.8.4 Scripts added in MSR-3 or MSR-4 12 3.8.5 Modern Scripts Ineligible for the Root Zone 12 3.9 Scripts for Possible Future MSRs 12 3.10 Scripts Identified in UAX#31 as Not Suitable for identifiers 13 4 Exclusions of Individual Code Points or Ranges 14 4.1 Historic and Phonetic Extensions to Modern Scripts 14 4.2 Code Points That Pose Special Risks 15 4.3 Code Points with Strong Justification to Exclude 15 4.4 Code Points That May or May Not be Excludable from the Root Zone LGR 15 4.5 Non-spacing Combining Marks 16 5 Discussion of Particular Code Points 18 Integration Panel: Maximal Starting Repertoire — MSR-3 Overview and Rationale 5.1 Digits and Hyphen 19 5.2 CONTEXT O Code Points 19 5.3 CONTEXT J Code Points 19 5.4 Code Points Restricted for Identifiers 19 5.5 Compatibility with IDNA2003 20 5.6 Code Points for Which the -
Revisiting Proto-Indo-European Schwebeablaut
UNIVERSITY OF CALIFORNIA Los Angeles Revisiting Proto-Indo-European Schwebeablaut A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Indo-European Studies by Kaspars Ozoliņš 2015 © Copyright by Kaspars Ozoliņš 2015 ABSTRACT OF THE DISSERTATION Revisiting Proto-Indo-European Schwebeablaut by Kaspars Ozoliņš Doctor of Philosophy in Indo-European Studies University of California, Los Angeles, 2015 Professor H. Craig Melchert, Chair This dissertation examines the phenomenon of Proto-Indo-European schwebeablaut (German Schwebeablaut “floating vowel gradation”), whereby a number of reconstructed forms are observed to alternate in their root shape between CeRC (termed State I) and CReC (termed State II). This mechanism of Proto-Indo-European (PIE) root ablaut has long been tacitly accepted (in one form or another) by scholars; however, the only comprehensive treatment has been Anttila (1969), and the matter therefore merits a thorough review. This dissertation reviews material from the daughter languages considered to be evidence for schwebeablaut by using some of the same techniques employed in the work of Anttila, only in an updated fashion. A large majority of the remaining cases are explained without requiring recourse to schwebeablaut, while several more difficult forms are discussed in individual chapters. The second part of the dissertation examines a unique subtype of roots extended by an s-formant that exhibit the following alternation: CeRC : CReC-s. It is argued that the descriptive schwebeablaut inherent in these formations may legitimately be traced to PIE. An Optimality Theoretic account for this phonologically motivated metathesis is offered, which likewise eliminates the need to invoke schwebeablaut (as classically defined). -
Overview and Rationale
Integration Panel: Maximal Starting Repertoire — MSR-5 Overview and Rationale REVISION – APRIL 06, 2021 – PUBLIC COMMENT Table of Contents 1 Overview 3 2 Maximal Starting Repertoire (MSR-5) 3 2.1 Files 3 2.1.1 Overview 3 2.1.2 Normative Definition 4 2.1.3 HTML Presentation 4 2.1.4 Code Charts 4 2.2 Determining the Contents of the MSR 5 2.3 Process of Deciding the MSR 6 3 Scripts 8 3.1 Comprehensiveness and Staging 8 3.2 What Defines a Related Script? 9 3.3 Separable Scripts 9 3.4 Deferred Scripts 9 3.5 Historical and Obsolete Scripts 9 3.6 Selecting Scripts and Code Points for the MSR 10 3.7 Scripts Appropriate for Use in Identifiers 10 3.8 Modern Use Scripts 11 3.8.1 Common and Inherited 12 3.8.2 Scripts included in MSR-1 12 3.8.3 Scripts added in MSR-2 12 3.8.4 Scripts added in MSR-3 through MSR-5 13 3.8.5 Modern Scripts Ineligible for the Root Zone 13 3.9 Scripts for Possible Future MSRs 13 3.10 Scripts Identified in UAX#31 as Not Suitable for identifiers 14 4 Exclusions of Individual Code Points or Ranges 16 4.1 Historic and Phonetic Extensions to Modern Scripts 16 4.2 Code Points That Pose Special Risks 17 4.3 Code Points with Strong Justification to Exclude 17 4.4 Code Points That May or May Not be Excludable from the Root Zone LGR 17 4.5 Non-spacing Combining Marks 18 Integration Panel: Maximal Starting Repertoire — MSR-3 Overview and Rationale 5 Discussion of Particular Code Points 20 5.1 Digits and Hyphen 20 5.2 CONTEXT O Code Points 21 5.3 CONTEXT J Code Points 21 5.4 Code Points Restricted for Identifiers 21 5.5 Compatibility -
Proposal to Encode 10 Latin Letters with Diagonal Stroke
Universal Multiple-Octet Coded Character Set International Organization for Standardization Organisation Internationale de Normalisation Международная организация по стандартизации Doc Type: Working Group Document Title: Proposal to encode 10 Latin letters with diagonal stroke Source: Karl Pentzlin – [email protected] Status: Individual Contribution Action: For consideration by JTC1/SC2/WG2 and UTC Date: 2008-10-30 Additions for pre-1921 Latvian orthography U+A796 LATIN CAPITAL LETTER G WITH DIAGONAL STROKE U+A797 LATIN SMALL LETTER G WITH DIAGONAL STROKE U+A798 LATIN CAPITAL LETTER K WITH DIAGONAL STROKE OVERLAY U+A799 LATIN SMALL LETTER K WITH DIAGONAL STROKE OVERLAY → A743 latin small letter k with diagonal stroke · "overlay" in the name indicates contrast to U+A743 U+A79A LATIN CAPITAL LETTER N WITH DIAGONAL STROKE U+A79B LATIN SMALL LETTER N WITH DIAGONAL STROKE U+A79C LATIN CAPITAL LETTER R WITH DIAGONAL STROKE U+A79D LATIN SMALL LETTER R WITH DIAGONAL STROKE U+A79E LATIN CAPITAL LETTER S WITH DIAGONAL STROKE · Also used in pre-1950 Lower Sorbian orthography U+A79F LATIN SMALL LETTER S WITH DIAGONAL STROKE → A79E latin small letter long s with diagonal stroke · Also used for Luiseño (California, USA) Informative notes and cross references to be added for existing characters: U+1E9C LATIN SMALL LETTER LONG S WITH DIAGONAL STROKE → A79F latin small letter s with diagonal stroke · Medievalist use for several abbreviations · Also used in Fraktur types of pre-1921 Latvian orthography · Also used in Fraktur types of pre-1950 Lower Sorbian orthography U+A743 LATIN SMALL LETTER K WITH DIAGONAL STROKE → A799 latin small letter k with diagonal stroke overlay · Medievalist use for abbreviations of "kalendas" and "karta" · The diagonal stroke crosses the right leg of the k All these letters are used in the orthography of Latvia used before 1921 [3] [8] [4], see fig. -
Latvian Academy of Sciences Yearbook 2017
LATVIAN ACADEMY OF SCIENCES Akadēmijas laukums 1 Tel.: (371) 67225361. Fakss: (371) 67821153 Rīga, LV 1050 E-mail: [email protected] Latvia http://www.lza.lv The 2017 Latvian Academy of Sciences (LAS) Yearbook is a follow-up edition — already the fi fteenth one under this title. The previous issues refl ected the situation in 1991, 1992, 1994, 1995, 1996, 1997, 1998/1999, 2000/2001, 2002/2003, 2004/2005, 2006/2007, 2008/2009, and 2010/2011, 2012/2013, 2016/2017 (until 15 March 2017). The Yearbook 2017 contains basic information on the Academy, including the membership directory. The contents of the Yearbook are also available on the Website of the LAS (http://www.lza.lv). See section About Academy/Annual Reports Compiled by: Kristaps Broks, Daina Daija, Alma Edžiņa, Līva Griņeviča, Sofja Negrejeva, Andrejs Siliņš (supervisor), Ilze Stengrevica, Viktorija Zaļūksne Photos from the collection of the LAS. Photographers Māris Bartulāns, Jānis Brencis, Alma Edžiņa, Edgars Liepiņš, Virginija Valuckiene Edited by Antra Legzdiņa Computer design by Elmārs Lange Published by “Zinātne”: Akadēmijas laukums 1, Rīga LV-1050, Latvia. Reg. no. 40003576967 ISSN 1407-0383 ISBN 978-9934-549-35-9 © Latvian Academy of Sciences, 2017 C O N T E N T S Preface Ojārs Spārītis ................................................................................................................... 5 Transformation of the Academy of Sciences (1990s) and accomplishments in Latvian science Jānis Stradiņš ..........................................................................................8 -
Livonian – the Most Endangered Language in Europe?
ESUKA – JEFUL 2014, 5–1: 61–75 LIVONIAN – THE MOST ENDANGERED LANGUAGE IN EUROPE? Christopher Moseley School of Slavonic and East European Studies, University College London Abstract. Drawing on my experience as General Editor of the UNESCO Atlas of the World’s Languages in Danger, as well as my own research interest in the Livonian language, the situation of Livonian is here compared with that of some other threatened languages of Europe, most notably Manx Gaelic, spoken on the Isle of Man in the British Isles, which presents some interesting parallels with the situation of Livonian. Keywords: Livonian, Manx Gaelic, endangered languages, language revival, ortho- graphic norms DOI: http://dx.doi.org/10.12697/jeful.2014.5.1.04 1. Introduction In my work as editor of the third edition of the UNESCO Atlas of the World’s Languages in Danger, I have encountered several controversies about the status of languages in danger, and even caused some. This third edition, in its on-line version, invites participation from its users and because language, as an emblem of ethnic identity, is such an emotive issue, some heated debates have arisen about whether certain languages are extinct – and by implication whether the nations that bear their names are extinct too. The UNESCO Atlas grew out of the concept of the ‘Red Book’, which originally was meant to provide a world-wide alert to the loss of biological diversity. By the early nineteen-nineties, linguists and anthropologists were beginning to notice a parallel between the losses sustained by nature and the losses sustained by human culture. -
Critical Analysis of English-Latvian Lexicographic Tradition
University of Latvia Faculty of Humanities Laura Karpinska CRITICAL ANALYSIS OF ENGLISH-LATVIAN LEXICOGRAPHIC TRADITION ANGĻU-LATVIEŠU LEKSIKOGRĀFISKĀS TRADĪCIJAS KRITISKA ANALĪZE Doctoral Thesis Scientific Adviser: Dr. philol. Jeļena Dorošenko Rīga 2012 Contents Introduction ........................................................................................................................... 3 CHAPTER 1 Types of Bilingual Dictionaries, Their Functions and Users’ Needs........... 10 1.1 Dictionary Typologies Focusing on Bilingual Dictionaries ...................................... 11 1.1.1 Ščerba’s Typology.............................................................................................. 11 1.1.2 Zgusta’s Typology.............................................................................................. 15 1.1.3. Al-Kasimi’s Typology....................................................................................... 17 1.1.4 Yong and Peng’s Typology................................................................................ 19 1.2 Functions of Bilingual Dictionaries Based on Users’ Needs .................................... 21 1.2.1 Users of Bilingual Dictionaries and Their Needs............................................... 21 1.2.2 Functions of Bilingual Dictionaries (Active and Passive Dictionaries) ............. 23 1.2.2.1 Metalanguage and Information Types in Active vs. Passive Dictionaries .. 25 1.2.2.2. Number of Bilingual Dictionaries per Language Pair – in Theory and Practice ................................................................................................................... -
Contemporary Approaches to Baltic Linguistics Trends in Linguistics Studies and Monographs
Peter Arkadiev, Axel Holvoet, Björn Wiemer (Eds.) Contemporary Approaches to Baltic Linguistics Trends in Linguistics Studies and Monographs Editor Volker Gast Editorial Board Walter Bisang Jan Terje Faarlund Hans Henrich Hock Natalia Levshina Heiko Narrog Matthias Schlesewsky Amir Zeldes Niina Ning Zhang Editors responsible for this volume Volker Gast Volume 276 Contemporary Approaches to Baltic Linguistics Edited by Peter Arkadiev Axel Holvoet Björn Wiemer ISBN 978-3-11-034376-2 e-ISBN (PDF) 978-3-11-034395-3 e-ISBN (EPUB) 978-3-11-039498-6 ISSN 1861-4302 Library of Congress Cataloging-in-Publication Data A CIP catalog record for this book has been applied for at the Library of Congress. Bibliographic information published by the Deutsche Nationalbibliothek The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available on the Internet at http://dnb.dnb.de. © 2015 Walter de Gruyter GmbH, Berlin/Boston Typesetting: Compuscript Ltd., Shannon, Ireland Printing: CPI books GmbH, Leck ♾ Printed on acid-free paper Printed in Germany www.degruyter.com Contents Contributors vii Peter Arkadiev, Axel Holvoet and Björn Wiemer 1 Introduction: Baltic linguistics – State of the art 1 Hans Henrich Hock 2 Prosody and dialectology of tonal shifts in Lithuanian and their implications 111 Anna Daugavet 3 The lengthening of the first component of Lithuanian diphthongs in an areal perspective 139 Ineta Dabašinskienė and Maria Voeikova 4 Diminutives in spoken Lithuanian and Russian: Pragmatic functions and structural properties 203 Daiki Horiguchi 5 Latvian attenuative pa-verbs in comparison with diminutives 235 Cori Anderson 6 Non-canonical case patterns in Lithuanian 263 Axel Holvoet 7 Non-canonic al subjects in Latvian: An obliqueness-based approach 299 Ilja A.