The Role of Zero in the Prosodic Hierarchy
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Information to Users
INFORMATION TO USERS This manuscript has been reproduced from the microfilm master. UMI films the text directly from the original or copy submitted. Thus, some thesis and dissertation copies are in typewriter face, while others may be from any type of computer printer. The quality of this reproduction is dependent upon the quality of the copy submitted. Broken or indistinct print, colored or poor quality illustrations and photographs, print bleedthrough, substandard margins, and improper alignment can adversely afreet reproduction. In the unlikely event that the author did not send UMI a complete manuscript and there are missing pages, these will be noted. Also, if unauthorized copyright material had to be removed, a note will indicate the deletion. Oversize materials (e.g., maps, drawings, charts) are reproduced by sectioning the original, beginning at the upper left-hand corner and continuing from left to right in equal sections with small overlaps. Each original is also photographed in one exposure and is included in reduced form at the back of the book. Photographs included in the original manuscript have been reproduced xerographically in this copy. Higher quality 6" x 9" black and white photographic prints are available for any photographs or illustrations appearing in this copy for an additional charge. Contact UMI directly to order. UMI University Microfilms International A Bell & Howell Information Company 300 Nortti Zeeb Road. Ann Arbor. Ml 48106-1346 USA 313.'761-4700 800/ 521-0600 Order Number 9401286 The phonetics and phonology of Korean prosody Jun, Sun-Ah, Ph.D. The Ohio State University, 1993 300 N. Zeeb Rd. -
Building a Universal Phonetic Model for Zero-Resource Languages
Building a Universal Phonetic Model for Zero-Resource Languages Paul Moore MInf Project (Part 2) Interim Report Master of Informatics School of Informatics University of Edinburgh 2020 3 Abstract Being able to predict phones from speech is a challenge in and of itself, but what about unseen phones from different languages? In this project, work was done towards building precisely this kind of universal phonetic model. Using the GlobalPhone language corpus, phones’ articulatory features, a recurrent neu- ral network, open-source libraries, and an innovative prediction system, a model was created to predict phones based on their features alone. The results show promise, especially for using these models on languages within the same family. 4 Acknowledgements Once again, a huge thank you to Steve Renals, my supervisor, for all his assistance. I greatly appreciated his practical advice and reasoning when I got stuck, or things seemed overwhelming, and I’m very thankful that he endorsed this project. I’m immensely grateful for the support my family and friends have provided in the good times and bad throughout my studies at university. A big shout-out to my flatmates Hamish, Mark, Stephen and Iain for the fun and laugh- ter they contributed this year. I’m especially grateful to Hamish for being around dur- ing the isolation from Coronavirus and for helping me out in so many practical ways when I needed time to work on this project. Lastly, I wish to thank Jesus Christ, my Saviour and my Lord, who keeps all these things in their proper perspective, and gives me strength each day. -
Improving Machine Translation of Null Subjects in Italian and Spanish
Improving Machine Translation of Null Subjects in Italian and Spanish Lorenza Russo, Sharid Loaiciga,´ Asheesh Gulati Language Technology Laboratory (LATL) Department of Linguistics – University of Geneva 2, rue de Candolle – CH-1211 Geneva 4 – Switzerland {lorenza.russo, sharid.loaiciga, asheesh.gulati}@unige.ch Abstract (2000) has shown that 46% of verbs in their test corpus had their subjects omitted. Continuation Null subjects are non overtly expressed of this work by Rello and Ilisei (2009) has found subject pronouns found in pro-drop lan- that in a corpus of 2,606 sentences, there were guages such as Italian and Spanish. In this study we quantify and compare the oc- 1,042 sentences without overtly expressed pro- currence of this phenomenon in these two nouns, which represents an average of 0.54 null languages. Next, we evaluate null sub- subjects per sentence. As for Italian, many anal- jects’ translation into French, a “non pro- yses are available from a descriptive and theoret- drop” language. We use the Europarl cor- ical perspective (Rizzi, 1986; Cardinaletti, 1994, pus to evaluate two MT systems on their among others), but to the best of our knowledge, performance regarding null subject trans- there are no corpus studies about the extent this lation: Its-2, a rule-based system devel- 2 oped at LATL, and a statistical system phenomenon has. built using the Moses toolkit. Then we Moreover, althought null elements have been add a rule-based preprocessor and a sta- largely treated within the context of Anaphora tistical post-editor to the Its-2 translation Resolution (AR) (Mitkov, 2002; Le Nagard and pipeline. -
A Comparative Study of Subject Pro-Drop in Old Chinese and Modern Chinese
University of Pennsylvania Working Papers in Linguistics Volume 10 Issue 2 Selected Papers from NWAVE 32 Article 18 2005 A Comparative Study of Subject Pro-drop in Old Chinese and Modern Chinese Zhiyi Song Follow this and additional works at: https://repository.upenn.edu/pwpl Recommended Citation Song, Zhiyi (2005) "A Comparative Study of Subject Pro-drop in Old Chinese and Modern Chinese," University of Pennsylvania Working Papers in Linguistics: Vol. 10 : Iss. 2 , Article 18. Available at: https://repository.upenn.edu/pwpl/vol10/iss2/18 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/pwpl/vol10/iss2/18 For more information, please contact [email protected]. A Comparative Study of Subject Pro-drop in Old Chinese and Modern Chinese This working paper is available in University of Pennsylvania Working Papers in Linguistics: https://repository.upenn.edu/pwpl/vol10/iss2/18 A Comparative Study of Subject Pro-drop in Old Chinese and Modern Chinese Zhiyi Song 1 Introduction Chinese is a Subject Pro-drop language in that the subject of a clause need not be overt. Thus a Chinese speaker has the choice of using either a null subject or an overt pronoun in the subject position of a sentence, as in ta kanjian yige nuhaizi, 0/ta daizhe yiding xiaohongmao. he see one-classifier girl, 0/she wear one-classifier small red hat. 'He saw a girl; she is wearing a red hat. ' Chinese differs from other Pro-drop languages such as Italian or Turkish in that the language has no inflections to mark subject-verb agreement. -
The Syllable Structure in Nyangatom
Studies in Ethiopian Languages, 6 (2017), 1-12 Article The Syllable Structure in Nyangatom Moges Yigezu (Addis Ababa University) [email protected] Abstract The syllable structure in Nyangatom is a key phonological unit that plays a central role in the organization of the phonological structure. The core syllable has a fairly complex structure consisting of a branching onset, an obligatory nucleus and a non- branching coda which is optional. The question of syllabification, that is, the division of words into syllables, both in complex onsets and reduplicated words, appear to be an interesting aspect of the phonology. The analysis of the internal structure of the syllable structure of Nyangatom shows that the Mora theory, rather than the traditional onset-rhyme conception, provides an insightful explanation in determining the syllable boundaries of disyllabic and multisyllabic words involving sequences of consonants as well as in the analysis of reduplicated words. 1 Introduction Nyangatom is an Eastern Nilotic language classified as a member of the Turkana- Teso cluster along with Toposa and Jie (Vossen 1982). Literature on the grammatical description of the language is still scanty. Some preliminary studies include Dimmendaal (2007), Kadanya & Schroder (2011) and Moges (2016). The current contribution is an attempt to add to the scanty literature by describing an aspect of the phonology of the language focusing on the syllable structure. The syllable structure in Nynagatom, as in other languages, is a key phonological unit and plays a pivotal role in the overall organization of the phonology of the language. In describing the internal structure of the syllable in Nyangatom, the Mora theory has been employed. -
How Radical Is Pro-Drop in Mandarin?
How radical is pro-drop in Mandarin? A quantitative corpus study on referential choice in Mandarin Chinese Masterarbeit im Masterstudiengang General Linguistics in der Fakultät Geistes- und Kulturwissenschaften der Otto-Friedrich-Universität Bamberg Verfasserin: Maria Carina Vollmer Prüfer: Prof. Dr. Geoffrey Haig Zweitprüferin: PD Dr. Sonja Zeman Contents Contents 1 List of Abbreviations 3 List of Figures 4 1Introduction 5 2Theoreticalbackground 8 2.1 Pro-drop........................... 8 2.2 Radical pro-drop . 14 2.2.1 Free distribution of zero arguments . 14 2.2.2 Frequency of zero arguments . 17 2.3 Referential choice . 18 2.3.1 Factors influencing referential choice . 21 2.3.1.1 Syntactic Function . 21 2.3.1.2 Animacy . 22 2.3.1.3 Topicality . 23 2.3.1.4 Person . 25 2.3.1.5 Antecedent-related factors . 26 2.3.2 ReferentialchoiceinMandarin. 28 2.4 Interim conclusion . 29 CONTENTS 3Methods 32 3.1 Research questions and hypotheses . 32 3.2 Thecorpus ......................... 34 3.2.1 Multi-CAST (Haig & Schnell 2019) . 35 3.2.2 Languages . 37 3.2.3 Mandarin ...................... 39 3.2.3.1 Jigongzhuan (jgz)............ 40 3.2.3.2 Liangzhu (lz). 41 3.2.3.3 Mulan (ml)................ 41 3.2.3.4 Corpusannotation . 42 3.3 Quantitative Analysis . 49 4Results 56 4.1 Frequency of zero arguments . 56 4.1.1 Distribution of noun phrase, pronoun and zero . 57 4.1.2 Distribution in different syntactic functions . 59 4.1.3 Frequency of only pronoun and zero . 64 4.1.4 Interim discussion and conclusion . 65 4.2 Probabilistic constraints . -
Treatment of Zero Pronouns in the Framework of Lexical-Functional Grammar 1
Selected Papers of the 18th Conference of Pan-Pan Pacific Association of Applied Linguistics Treatment of Zero Pronouns in the Framework of Lexical-Functional Grammar 1 Masanori Oya Mejiro University [email protected] Abstract This study presents both a Lexical-Functional Grammar (LFG) treatment and a typed-dependency grammar treatment of zero pronouns in the Japanese language. Japanese uses elliptic sentences quite often, and their functional-structural representation is problematic because they seem to violate the “completeness condition,” which states that all the core arguments of a predicate must be present in its functional structure. This study shows through a number of examples that this problem can be solved by positing the presence of zero pronouns with various semantic functions. Keywords Lexical-Functional Grammar, Dependency Grammar, Japanese, Zero Pronouns 1 Introduction This study presents both a Lexical-Functional Grammar (LFG) treatment and a typed-dependency grammar treatment of zero pronouns in the Japanese language. Japanese often uses elliptic 2 sentences , that is, sentences in which nouns with core grammatical functions such as subject or object are absent (Kanatani 2002; Mikami 1972; Toyama 1973, among others). In terms of functional well-formedness (Kaplan & Bresnan 1982), elliptic sentences are problematic because their functional structure seems to violate the completeness constraint , which states that a predicate and all its arguments must be present in an f-structure (Kaplan & Bresnan 1982, p. 211–212). Oya (2010) argued that such elliptic sentences contain zero pronouns that are the values of the core grammatical-function attributes, and that the completeness constraint is in fact observed because of the presence of zero pronouns in the functional structures of these sentences. -
Topics in Turkish Phonology.Pdf
TOPICS IN TURKISH PHONOLOGY Harry van der Hulst and Jeroen van de Weijer 0. INTRODUCTION In this chapter we offer a discussion of some aspects of the phonology of Turkish. Turkish phonology has played a significant role in theoretical discussions on the nature of phonological representation and rule formalism. In particular, the formal description of vowel harmony has attracted a considerable amount of attention in the phonological literature since the 1940s, and we, too, will devote a separate section to this topic. In section 1 we provide a synopsis of the general facts of Turkish phonology. Besides giving an overview of the phonemes of Turkish, we illustrate its syllabic structure and stress pattern. We also present a number of the phonological rules of Turkish, all of which have received earlier treatment in the literature, in particular compensatory lengthening (section 1.4.3). A number of linguists have provided analyses of the process of vowel harmony which pervades the Turkish language. In section 2 we lay out the basic facts, discuss some of the earlier analyses, and then provide our own account, which departs from the earlier approaches mainly by availing itself of unary elements which may extend over suprasegmental domains like the word. We believe that significant generalizations can be captured under this approach. 1. ASPECTS OF TURKISH PHONOLOGY 1.1 THE PHONEMIC INVENTORY 1.1.1 Vowels Turkish has eight vowel phonemes which may be plotted on the familiar triangular vowel diagram as follows (cf. Lass 1984: 145; Maddieson 1984: 277): (1) high i,y u,uu mid o lower mid e,oe low a Following all earlier writers (e.g. -
GOO-80-02119 392P
DOCUMENT RESUME ED 228 863 FL 013 634 AUTHOR Hatfield, Deborah H.; And Others TITLE A Survey of Materials for the Study of theUncommonly Taught Languages: Supplement, 1976-1981. INSTITUTION Center for Applied Linguistics, Washington, D.C. SPONS AGENCY Department of Education, Washington, D.C.Div. of International Education. PUB DATE Jul 82 CONTRACT GOO-79-03415; GOO-80-02119 NOTE 392p.; For related documents, see ED 130 537-538, ED 132 833-835, ED 132 860, and ED 166 949-950. PUB TYPE Reference Materials Bibliographies (131) EDRS PRICE MF01/PC16 Plus Postage. DESCRIPTORS Annotated Bibliographies; Dictionaries; *InStructional Materials; Postsecondary Edtmation; *Second Language Instruction; Textbooks; *Uncommonly Taught Languages ABSTRACT This annotated bibliography is a supplement tothe previous survey published in 1976. It coverslanguages and language groups in the following divisions:(1) Western Europe/Pidgins and Creoles (European-based); (2) Eastern Europeand the Soviet Union; (3) the Middle East and North Africa; (4) SouthAsia;(5) Eastern Asia; (6) Sub-Saharan Africa; (7) SoutheastAsia and the Pacific; and (8) North, Central, and South Anerica. The primaryemphasis of the bibliography is on materials for the use of theadult learner whose native language is English. Under each languageheading, the items are arranged as follows:teaching materials, readers, grammars, and dictionaries. The annotations are descriptive.Whenever possible, each entry contains standardbibliographical information, including notations about reprints and accompanyingtapes/records -
A Network Science Approach to Bilingual Code-Switching
Proceedings of the Society for Computation in Linguistics Volume 4 Article 3 2021 A Network Science Approach to Bilingual Code-switching Qihui Xu Graduate Center, City University of New York, [email protected] Magdalena Markowska Stony Brook, State University of New York, [email protected] Martin Chodorow Hunter College, City University of New York, [email protected] Ping Li The Hong Kong Polytechnic University, [email protected] Follow this and additional works at: https://scholarworks.umass.edu/scil Part of the Computational Linguistics Commons Recommended Citation Xu, Qihui; Markowska, Magdalena; Chodorow, Martin; and Li, Ping (2021) "A Network Science Approach to Bilingual Code-switching," Proceedings of the Society for Computation in Linguistics: Vol. 4 , Article 3. DOI: https://doi.org/10.7275/raze-1b18 Available at: https://scholarworks.umass.edu/scil/vol4/iss1/3 This Paper is brought to you for free and open access by ScholarWorks@UMass Amherst. It has been accepted for inclusion in Proceedings of the Society for Computation in Linguistics by an authorized editor of ScholarWorks@UMass Amherst. For more information, please contact [email protected]. A Network Science Approach to Bilingual Code-switching Qihui Xu Magdalena Markowska Department of Psychology Department of Linguistics Graduate Center, CUNY Institute for Advanced Computational Science [email protected] Stony Brook University [email protected] Martin Chodorow Ping Li Department of Psychology Department of Chinese and Bilingual Studies Hunter College, CUNY The Hong Kong Polytechnic University [email protected] [email protected] Abstract introduces a new possible angle for code-switching research. -
Spoken Content Retrieval Beyond Pipeline Integration of Automatic Speech Recognition and Information Retrieval David N. Racca
Spoken Content Retrieval Beyond Pipeline Integration of Automatic Speech Recognition and Information Retrieval David N. Racca Bachelor's in Computer Science A dissertation submitted in fulfilment of the requirements for the award of Doctor of Philosophy (Ph.D.) to the Dublin City University School of Computing Supervisor: Prof. Gareth J.F. Jones July 2018 I hereby certify that this material, which I now submit for assessment on the programme of study leading to the award of Ph.D. is entirely my own work, and that I have exercised reasonable care to ensure that the work is original, and does not to the best of my know- ledge breach any law of copyright, and has not been taken from the work of others save and to the extent that such work has been cited and acknowledged within the text of my work. Signed: (Candidate) ID No.: Date: Contents List of Tables vi List of Figures viii Abstract xi Acknowledgements xii 1 Introduction 1 1.1 Overview of spoken content retrieval (SCR) . 2 1.1.1 Information access and retrieval from spoken content . 2 1.1.2 SCR system overview . 3 1.1.3 Open problems in SCR . 9 1.2 Research questions . 13 1.3 Thesis structure . 14 2 Review of Fundamental Technologies in SCR 17 2.1 Information retrieval (IR) . 18 2.1.1 Text pre-processing and indexing . 18 2.1.2 Frameworks for ranked retrieval . 19 2.1.3 Evaluation of ranked retrieval . 24 2.2 Automatic speech recognition (ASR) . 27 2.2.1 Overview . 27 2.2.2 Speech units, signal processing, and feature extraction . -
Introductory Phonology
9781405184120_1_pre.qxd 06/06/2008 09:47 AM Page iii Introductory Phonology Bruce Hayes A John Wiley & Sons, Ltd., Publication 9781405184120_4_C04.qxd 06/06/2008 09:50 AM Page 70 4 Features 4.1 Introduction to Features: Representations Feature theory is part of a general approach in cognitive science which hypo- thesizes formal representations of mental phenomena. A representation is an abstract formal object that characterizes the essential properties of a mental entity. To begin with an example, most readers of this book are familiar with the words and music of the song “Happy Birthday to You.” The question is: what is it that they know? Or, to put it very literally, what information is embodied in their neurons that distinguishes a knower of “Happy Birthday” from a hypothetical person who is identical in every other respect but does not know the song? Much of this knowledge must be abstract. People can recognize “Happy Birth- day” when it is sung in a novel key, or by an unfamiliar voice, or using a different tempo or form of musical expression. Somehow, they can ignore (or cope in some other way with) inessential traits and attend to the essential ones. The latter include the linguistic text, the (relative) pitch sequences of the notes, the relative note dura- tions, and the musical harmonies that (often tacitly) accompany the tune. Cognitive science posits that humans possess mental representations, that is, formal mental objects depicting the structure of things we know or do. A typical claim is that we are capable of singing “Happy Birthday” because we have (during childhood) internalized a mental representation, fairly abstract in character, that embodies the structure of this song.