Inductive Learning of Phonotactic Patterns

Total Page:16

File Type:pdf, Size:1020Kb

Inductive Learning of Phonotactic Patterns University of California Los Angeles Inductive Learning of Phonotactic Patterns A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Linguistics by Jeffrey Nicholas Heinz 2007 c Copyright by Jeffrey Nicholas Heinz 2007 The dissertation of Jeffrey Nicholas Heinz is approved. Bruce Hayes D. Stott Parker Colin Wilson Kie Zuraw, Committee Co-chair Edward P. Stabler, Committee Co-chair University of California, Los Angeles 2007 ii To Mika iii Table of Contents 1 Introduction ................................. 1 1 Thesis .................................. 1 1.1 LocalityandLearning ..................... 2 1.2 FactoringtheLearningProblem . 4 2 Other Approaches to Phonotactic Learning . 5 2.1 Learning with Principles and Parameters . 7 2.2 Learning with Optimality Theory . 8 2.3 Learning with Connectionist Models . 10 2.4 LearningwithStatisticalModels . 11 2.5 LocalSummary......................... 12 3 Overview................................. 12 Appendices ................................. 16 A–1 MathematicalPreliminaries . 16 A–1.1 Sets ............................... 16 A–1.2 RelationsandPartiallyOrderedSets . 17 A–1.3 Equivalence Relations and Partitions . 18 A–1.4 Functions and Sequences . 18 A–1.5 StringsandFormalLanguages . 20 2 Establishing the Problem and Line of Inquiry ............ 22 1 Phonotactic Patterns and Phonotactic Knowledge . .. 22 iv 1.1 Patterns over Contiguous Segments . 23 1.2 Patterns over Non-contiguous Segments . 28 1.3 StressPatterns ......................... 29 1.4 Nonarbitrary Character of Phonotactic Patterns . 31 2 PhonotacticGrammars......................... 32 2.1 TheChomskyHierarchy . .. .. 33 2.2 PhonotacticPatternsasRegularSets . 34 2.3 Examples ............................ 37 2.4 LocalSummary......................... 39 3 Addressing the Learning Problem . 40 3.1 TheGoldLearningFramework . 42 3.2 The Probably-Approximately Correct (PAC) Framework . 44 3.3 SummaryofNegativeResults . 45 3.4 PositiveResults......................... 46 4 AResearchStrategy .......................... 47 5 Summary ................................ 50 Appendices ................................. 51 B–1 AFormalTreatmentoftheGoldFramework . 51 B–1.1 Definitions............................ 51 B–1.2 Any Superfinite Class is not Gold-learnable . 52 B–2 AFormalTreatmentofthePACFramework . 56 B–2.1 Definitions............................ 56 B–2.2 TheVCDimension ....................... 57 v B–2.3 The Class of Finite Languages is not PAC-learnable . .. 58 B–3 FiniteStateAcceptors . 59 B–3.1 Definition ............................ 59 B–3.2 Extending the Transition Function . 59 B–3.3 TheLanguageofanAcceptor . 60 B–3.4 BinaryOperations . 60 B–3.5 ReverseAcceptors . 61 B–3.6 Forward and Backward Deterministic Acceptors . 61 B–3.7 Relations Between Acceptors . 62 B–3.8 StrippedAcceptors . 62 B–3.9 CyclicAcceptors . .. .. 62 B–3.10 Languages and the Machines which Accept Them . 63 B–3.11TailCanonicalAcceptors. 64 B–3.12 The Myhill-Nerode Theorem . 64 B–3.13HeadCanonicalAcceptors . 66 3 Patterns over Contiguous Segments .................. 73 1 Overview................................. 73 2 N-gramGrammarsandLanguages. 74 3 Learning N-gram Languages as String Extension Learning . ... 77 4 GeneralizingbyStateMerging. 80 4.1 TheBasicIdea ......................... 80 4.2 PrefixTrees ........................... 81 4.3 StateMerging.......................... 83 vi 5 LearningN-gramLanguageswithStateMerging . 84 6 AreN-gramsAppropriateforPhonotactics? . 87 6.1 N-gramModelsCountto(n–1) . 87 6.2 Resolvable Consonant Clusters . 88 6.3 Discussion............................ 90 7 Summary ................................ 91 Appendices ................................. 92 C–1 StringExtensionGrammars . 92 C–1.1 Definitions and Properties of Lf ................ 92 C–1.2 Natural Gold-learning of Lf for Finite A ........... 95 C–2 AFormalTreatmentofN-grams. 96 C–2.1 The N-Contiguous Set Function . 96 C–2.2 N-GramGrammarsand N-gramLanguages . 97 C–2.3 PropertiesofN-gramLanguages. 98 C–2.4 ASimpleLearner . 101 C–3 AFormalTreatmentofStateMerging . 101 C–3.1 PrefixTrees ........................... 101 C–3.2 StateMerging. .. .. 102 C–3.3 TheStateMergingTheorem . 103 C–4 Learning Ln−gram viaStateMerging. 106 C–4.1 FiniteStateRepresentations . 106 C–4.2 Towards a Language-theoretic Characterization . 108 C–4.3 Learning N-gram Languages by State Merging . 109 vii C–4.4 Obtaining the Canonical FSA for a N-gram Grammar . 111 4 Patterns Over Non-contiguous Segments ............... 114 1 Overview................................. 114 2 LongDistanceAgreement . .. .. 115 2.1 KindsofLDA.......................... 116 2.2 InadequacyofN-gramModels . 120 2.3 Long Distance Agreement or Spreading? . 120 3 PrecedenceGrammars ......................... 121 3.1 TheBasicIdea ......................... 121 3.2 LearningPrecedenceLanguages . 123 3.3 LocalSummary......................... 126 4 Learning Precedence Languages by State Merging . 126 5 PropertiesofLDA ........................... 129 5.1 Phonetically Unnatural LDA Patterns . 131 5.2 DistanceEffects......................... 132 5.3 BlockingEffects......................... 133 6 Learning LDA Patterns and Patterns over Contiguous Segments . 136 7 Summary ................................ 136 Appendices ................................. 138 D–1 AFormalTreatmentofPrecedence . 138 D–1.1 PrecedenceRelationsandSets . 138 D–1.2 Precedence Grammars and Precedence Languages . 140 viii D–1.3 PropertiesofPrecedence Languages . 141 D–1.4 Learning Precedence Languages by String Extension . 144 D–2 Learning Lprec viaStateMerging . 144 D–2.1 FiniteStateRepresentation . 144 D–2.2 Towards a Language-theoretic Characterization . 146 D–2.3 Learning Precedence Languages by State Merging . 147 D–2.4 Obtaining the Canonical FSA for a Precedence Grammar . 149 D–3 Extending Precedence Grammars to Handle Local Blocking ..... 150 D–3.1 DefinitionsandExamples . 150 D–3.2 Properties of Relativized Bigram Precedence Languages. 152 D–3.3 LearningwithStringExtension . 152 5 Stress Patterns ............................... 153 1 Overview................................. 153 2 StressPatternsintheWorld’sLanguages . 154 2.1 TheStressTypology ...................... 154 2.2 SummaryoftheTypology . 164 2.3 Inadequacy of N-gram and Precedence Learners . 164 2.4 UnattestedStressPatterns. 165 3 Neighborhood-Distinctness . 165 3.1 Definition ............................ 166 3.2 Universality ........................... 168 3.3 Discussion............................ 168 4 TheLearner............................... 170 ix 4.1 TheForwardNeighborhoodLearner. 171 4.2 SuffixTrees ........................... 173 4.3 The Forward Backward Neighborhood Learner . 174 5 Discussion................................ 177 5.1 Basic Reasons Why the Forward Backward Learner Works . 177 5.2 InputSamples.......................... 178 5.3 UnlearnableUnattestedPatterns . 179 5.4 UnlearnableAttestedPatterns. 181 5.5 Addressing the Unlearnable Attested Patterns . 182 5.6 OtherPredictions........................ 185 5.7 ComparisontootherLearningModels . 186 6 Summary ................................ 189 Appendices ................................. 191 E–1 AFormalTreatmentofSuffixTrees . 191 E–2 Results of the Neighborhood Learning Study . 192 6 Deconstructing Neighborhood-distinctness .............. 203 1 Overview................................. 203 2 GeneralizingtheNeighborhood . 204 2.1 Preliminaries .......................... 204 2.2 Neighborhood-distinctness . 205 3 Subsets of Neighborhood-distinct Languages . 207 3.1 PrecedenceLanguages . 207 x 3.2 N-gramLanguages . .. .. 208 3.3 LocalSummary......................... 210 4 Neighborhood-distinctness not Preserved Under Intersection . 212 5 Towards a Compositional Analysis of Neighborhood-distinctness . 214 5.1 Strategy............................. 214 5.2 Merging States in Prefix Trees with Same Incoming Paths of Length n ............................ 215 5.3 Merging States in Suffix Trees with Same Outgoing Paths of Length n ............................ 216 5.4 Merging States in Prefix Trees with Same Outgoing Paths of Length n ............................ 216 5.5 Merging States in Suffix Trees with Same Incoming Paths of Length n ............................ 216 5.6 MergingFinalStatesinPrefixTrees . 217 5.7 MergingStartStatesinSuffixTrees. 224 5.8 MergingStartStatesinPrefixTrees . 228 5.9 MergingFinalStatesinSuffixTrees. 228 5.10 Merging Nonfinal States in Prefix Trees . 228 5.11 Merging Nonstart States in Suffix Trees . 228 5.12 Merging Nonstart States in Prefix Trees . 229 5.13 Merging Nonfinal States in Suffix Trees . 229 5.14 LocalSummary ......................... 230 6 Summary ................................ 232 xi 7 Conclusion .................................. 233 1 Results.................................. 233 2 LookingAhead ............................. 236 ⋆ Appendix: The Stress Typology .................... 239 xii List of Figures 2.1 TheChomskyHierarchy . .. .. 35 2.2 ∗CCCinYawelmaniYokuts . 38 2.3 NavajoSibilantHarmony. 38 2.4 TheStressPatternofPintupi . 39 2.5 TheLearningProcess.......................... 40 2.6 Locating Human Languages in the Chomsky Hierarchy . .. 48 2.7 Locating Phonotactic Patterns in the Regular Languages (I) .... 49 2.8 Locating Phonotactic Patterns in the Regular Languages (II). 49 2.9 A Tail-canonical Acceptor for L = {1, 10, 010, 0100, 01000,...} ... 70 2.10 A Head-canonical Acceptor for L = {1, 10, 010, 0100, 01000,...} .. 70 2.11 A Tail-canonical Acceptor for L={λ, 00, 11, 0011, 1100,
Recommended publications
  • Toward a Typology of Ranking Elements of Narrative Discourse in Languages and Cultures: a Cross- Linguistic Survey
    Volume 6, No. 1, Art. 2, May 2017 Toward a Typology of Ranking Elements of Narrative Discourse in Languages and Cultures: A Cross- Linguistic Survey Marla Perkins, Independent Scholar Keywords: Abstract: It has been noted (Perkins, 2009; Zwaan, 1999; Zwaan & Radvansky, 1998) Narrative that causality, character, location, and time are the four main aspects of narrative discourse, discourse, even if not attended to by listeners or readers in equal ways. For example, Hobongan, character is highly ranked, and the locational/spatial components have often been typology, underestimated for English narratives (see Perkins, 2009, for a review). Relative to the discourse ranking, there is no inherent reason why character needs to be highly ranked, and analysis, locational/spatial information is in fact important in English narrative discourse (Perkins, information 2009). I instead suggest that there are linguistic and cultural factors in the ranking of management these aspects of discourse. Specifically, I suggest that causality is (probably) the highest ranked component, in languages that have a ranking, with the other three elements being linked to causality more or less strongly, depending on linguistic and cultural factors; it is possible that some languages do not rank narrative elements or that some elements are ranked as highly as others. In English, the strongest link is between causality and character. However, this is not universal. In a survey of fifty-eight languages from thirty language families, including an in-depth study of Hobongan, an Austronesian language spoken by approximately two thousand people on the island of Borneo that I am in the process of describing, it is found that there is a great deal of cross-linguistic variation, to the extent that it is possible that each logically possible combination of narrative elements is present in the world’s languages.
    [Show full text]
  • Contribution of Agroforestry to the Plant Communities and Community Welfare in Ternate
    Advances in Engineering Research, volume 194 5th International Conference on Food, Agriculture and Natural Resources (FANRes 2019) Contribution of Agroforestry to the Plant Communities and Community Welfare in Ternate 1,* 1 1 Abdul Kadir Kamaluddin , Fadila Tamnge , Mahdi Tamrin 1Department of forestryFaculty of Agriculture, University of Khairun Ternate, Indonesia *Corresponding author. Email: [email protected] ABSTRACT An agroforestry system is land use developed to provide economic, ecological and social benefits to improve the welfare of the community. The aim of this study are (1) to determine the contribution of agroforestry to plant diversity, and (2) to calculate the contribution of agroforestry to community welfare in Ternate. Plant diversity was Collected by using a combination method. Data of welfare community was collected by interview method. Plant diversity was analyzed by using index of Shannon Wienner and Jaccard. Data of welfare community was analyzed by using farmer income variable. There are 18, 14, and 13 types of vegetation were recorded, each of which was found in Tabona, Gambesi, and SasaVillages (Tabona; mean= 76.94, SD= 80.27; Gambesi, mean = 30.35, SD = 24.27; Sasa; mean = 28.07; SD= 51.43).The highest contribution of agroforestry to community income is in strata II with a percentage of 99.66%. Keywords: agroforestry, plant diversity, Ternate purposive sampling method, where the research location was known to have agroforestry land. To collect plant I. INTRODUCTION diversity (amount of individu and species) use vegetation An agroforestry system is land use developed to provide analysis. To collect data of community welfare use economic, ecological and social benefits to improve the interview method to 90 respondents.
    [Show full text]
  • Pengumuman Abstrak PRASASTI III Yth. Bapak/Ibu Penulis
    THE 3rd PRASASTI INTERNATIONAL SEMINAR CURRENT RESEARCH IN LINGUISTICS Secretariat: Doctoral Program in Linguistics, Postgraduate, Universitas Sebelas Maret Jl. Ir. Sutami 36A Kentingan, Surakarta 57126, Telp/Fax (0271) 632450 Psw 377 Email: [email protected] ; [email protected] Website: s3linguistik.pasca.uns.ac.id Nomor : 05/PRASASTI/III/S3LG/2016 s3s3linguistik.pasca.uns.ac.idLampiran : 1 (satu) eksemplar Hal : Pengumuman abstrak PRASASTI III Yth. Bapak/Ibu Penulis Makalah Seminar Internasional PRASASTI III Dengan hormat, Bersama ini kami sampaikan bahwa abstrak Bapak/Ibu dinyatakan diterima untuk dapat disajikan/ dipresentasikan pada Seminar Internasional PRASASTI III yang diselenggarakan oleh Program Studi S3 Linguistik Pascasarjana Universitas Sebelas Maret. Sehubungan dengan hal tersebut, dimohon Bapak/Ibu memperhatikan poin-poin sebagai berikut. 1. Mengirimkan abstrak beserta fullpaper sesuai dengan ketentuan melalui email [email protected] (cc: [email protected]) dengan subjek dan nama file Nama_FullPaperPrasasti3 dalam format doc. atau rtf (jangan mengirim pdf) paling lambat pada tanggal 8 Juli 2016. 2. Melakukan pembayaran sesuai dengan ketentuan. 3. Mengisi formulir pendaftaran melalui yang formulirnya akan kami kirimkan melalui email dan/atau melalui website s3linguistik.pasca.uns.ac.id 4. Mengirim bukti pembayaran melalui email [email protected] (cc: [email protected]) dengan subjek Pembayaran_Nama_NoHP 5. Bukti transfer mohon disimpan dan dibawa pada saat seminar (2-3 Agustus 2016) Berikut kami lampirkan daftar nama penulis abstrak yang dinyatakan lolos seleksi untuk menuliskan fullpaper. Demikian pemberitahuan dari kami. Kami sampaikan terima kasih atas kerjasama Bapak/Ibu. Kami mengharapkan kehadiran Bapak/Ibu pada acara seminar tersebut tanggal 2 – 3 Agustus 2016, di Syariah Hotel Solo.
    [Show full text]
  • Completed Research, Studies, and Instructional Materials for Language Development Under the National Defense Education Act of 1958, Title VI, Section 602. a Bibliography, List No. 6
    DOCUMENT RESUME ED 036 213 FL 001 429 AUTHOR PETROV, JULIA A., COLE. ITTLE COMPLETED BESEAiCH, STUDIES, AND INSTRUCTIONAL MATLEIALS BC R LANGUAGE DEVELOPMENT UNDER THE NATIONAL DEFENSE EDUCATION ACT OF 1958, TITLE VI, SECTION 602.A BIBLIOGRAPHY, LIST NO. 6. INSTITUTION OFFICE CF EDUCATION (DhEW), WASHINGTON, D.C. INST. OF INTERNATIONAL STULIES. REPORT NO 0E-12016-69 PUB LATE 69 NOTE 144P. AVAILABLE FRU. SUPERINTENDENT CF DOCUFENTS, U.S. GOVERNMENT PRINTING OIFICL, WASHINGTON, D.C. 20402 (GPO FS-5.212:12016-69, $1.25) EDRS PRICE EDES PRICE MF-4,0.75 EC NOT AVAILABLE FROM EDRS. DESCRIPTORS *ANNOTATED BIBLIOGRAPHIES, AREA STUDIES, BIBLIOGRAPHIC CITATIONS, CONFERENCE REPORTS, *DIRECTORIES, GOVERNMENT PUELICATIONS, INDEXES (LOCATERS) , *INSTRUCTIONAL MATERIALS, *LANGUAGE DEVELOPMENT, *LANGUAGE RESEARCH, LIBRARY SERVICES, RESEARCH, SURVEYS, TEACdING METHODS, UNCOMMONLY TAUGHT LANGUAGES AESIRACT THIS CUMULATIVE BIBLIOGRAPHY, CONTAINING 542 ENTRIES, CCVIES PRIMAR_LIY THE NINE-YEAR FISCAL PERIOD PRIOR TC JULY 1, 1968. AlEXTENSIVE INDEX ChCSS REFERENCES AUTHORS AND THEIR ORGINILATICNthi AFFILIATIONS, LANGUAGES, TYeES CF TEXT MATERIALS, RESEAECH SUBJECTS, AND GEOGRAPHICAL AREAS. COPSIDELABLE ATTENTION IS DEVOTED TO SPECIALIZED MATERIALS, INCLUDING COMMONLY AND UNCOMMONLY TAUGHT LANGUAGES AND FOREIGN AREA STULlES. CONFERENCES, STUDIES, SURVEYS, AND METHODS CF INSTRUCTION ARE GROUPED TOGETHER. EACH ENTRY LISTS THE MAJCR SOURCES FCR THE ITEM AND THE PUBLISHER'S OR AUTHGE'S ADDRESS.A DIRECTORY OF THOSE LIBRARIES TAKING PART IN THE INTEEIIBRARY LCAN SERVICE IS FURNISHED. SOME REPOEIS AND MATERIALS COMPLETED AMER JULY 1968 ARE INCLUDED. (AT) U.S. DEPARTMENT Of HEALTH,EDUCATION & WELFARE OFFICE OF EDUCATION rwmi 0E-12016-69 REPRODUCED EXACTLY AS RECEIVEDFROM THE THIS DOCUMENT HAS BEEN N POINTS OF VIEW OR OPINIONS PERSON OR ORGANIZATIONORIGINATING IT, O REPRESENT OFFICIAL OFFICE OfEDUCATION STATED DO NOT NECESSARILY r'r\ POSITION OR POLICY.
    [Show full text]
  • 002.Alex.Comitato.2A Bozza
    Xaverio Ballester /A/ Y EL VOCALISMO INDOEUROPEO Negli últimi quarant’anni la linguística indo- europea si è in gran parte perduta dietro al mito delle laringali, di cui non intendo tenere alcun conto, e dello strutturalismo, di cui ten- go un conto molto limitato. Giuliano Bonfante, I dialetti indoeuropei, p. 8 De la regla a la ley o de mal en peor Bien digna de mención entre las primerísimas descripciones del mode- lo vocálico indoeuropeo es la propuesta de un inventario fonemático con únicamente tres timbres vocálicos /a i u/, una propuesta empero que fue desgraciadamente y demasiado pronto abandonada, siendo quizá la más conspicua consecuencia de este abandono el hecho de que para la Lin- güística indoeuropea oficialista la ausencia de /a/ devino en la práctica un axioma, de modo que, casi en cualquier posterior propuesta sobre el voca- lismo indoeuropeo, se ha venido adoptando la idea de que no hubiese exis- tido nunca la vocal /a/, y explicándose los ineluctables casos de presencia de /a/ en el material indoeuropeo con variados y bizarros argumentos del tipo de vocalismo despectivo, infantil o popular. Sin embargo, si conside- rada hoy spregiudicatamente, la argumentación que motivó el desalojo de la primitiva /a/ indoeuropea no presenta, al menos desde una perspectiva fonotipológica hodierna, ninguna validez en absoluto. Invocaremos un testimonio objetivo del tema para exponer brevemen- te la cuestión. Escribía O. Szemerényi: «Sotto l’impressione dell’arcai- cità del sanscrito, i fondatori dell’indoeuropeistica e i loro immediati suc- cessori pensavano che il sistema triangolare del sanscrito i–a–u rappre- sentasse la situazione originaria.
    [Show full text]
  • Re-Analyzing the Function of Demonstrative Reference in Tajik
    University of Montana ScholarWorks at University of Montana Graduate Student Theses, Dissertations, & Professional Papers Graduate School 2016 Re-analyzing the function of demonstrative reference in Tajik Kelly E. Bowman Follow this and additional works at: https://scholarworks.umt.edu/etd Part of the Semantics and Pragmatics Commons Let us know how access to this document benefits ou.y Recommended Citation Bowman, Kelly E., "Re-analyzing the function of demonstrative reference in Tajik" (2016). Graduate Student Theses, Dissertations, & Professional Papers. 10704. https://scholarworks.umt.edu/etd/10704 This Thesis is brought to you for free and open access by the Graduate School at ScholarWorks at University of Montana. It has been accepted for inclusion in Graduate Student Theses, Dissertations, & Professional Papers by an authorized administrator of ScholarWorks at University of Montana. For more information, please contact [email protected]. RE-ANALYZING THE FUNCTION OF DEMONSTRATIVE REFERENCE IN TAJIK By KELLY ELIZABETH BOWMAN B.A. in Linguistics and Germanic Languages and Literature, University of Kansas Lawrence, Kansas, 2013 Thesis presented in partial fulfillment of the requirements for the degree of Master of Arts in Linguistics The University of Montana Missoula, MT May 2016 Approved by: Scott Whittenburg, Dean of The Graduate School Graduate School Dr. Irene Appelbaum, Chair Linguistics Dr. Mizuki Miyashita Linguistics Dr. Rebecca Wood Anthropology Abstract Bowman, Kelly, M.A., Spring 2016 Linguistics Re-analyzing the function of demonstrative reference in Tajik Chairperson: Dr. Irene Appelbaum This thesis presents a re-analysis of Tajik demonstratives based on an alternative to the widely accepted framework for understanding demonstrative reference. In this framework, demonstrative reference is categorized according to two criteria: the anchor relative to which reference is made, and the number of spatial distinctions the system has for encoding distance from the anchor (Levinson 2004, O’Grady 2010).
    [Show full text]
  • Timucua Students Learn How Historical Resources Teach Us About the Timucua People and Their Technology
    CHHAPTERISTORY AND THETEN TIMUCUA STUDENTS LEARN HOW HISTORICAL RESOURCES TEACH US ABOUT THE TIMUCUA PEOPLE AND THEIR TECHNOLOGY. LET’S TALK ABOUT HISTORICAL DOCUMENTS In the unit titled “Theodore de Bry’s Timucua Engravings – Fact or Fiction?” we spent plenty of time analyzing the authenticity of his work. Other historical documents provide more reliable information about the Timucua. The earliest were the memoirs of French and Spanish explorers. French explorer, René Laudonnière (Low-don-YARE): He wrote an account of his experiences in Florida from 1562 – 1565. This account seems to have been written as an administrative report, to be read by King Charles IV or Admiral Coligny. Laudonnière never submitted it for general publication, though several other people had published their own memoirs. Because he was not publishing for profit, he had less reason to wildly embellish what he observed about the Timucua. When we read crazy things in his text – like the time he met a 250-year-old Timucua man whose own father was still alive – we can chalk that up to miscommunication or misunderstanding – not outright fabrication. It’s Laudonnière’s account that explains how the Timucua got their name. When Europeans asked the Timucua “what is the name of your people?” the Timucua responded with “we are us” or “this is our land.” Since they couldn’t call every native group, “we are us,” the French and Spanish usually referred to each village by the name of its leader. Headchief Saturiwa resided in the village Saturiwa. His enemy, Outina, lived near modern day Green Cove Springs, in a village named…you guessed it, Outina.
    [Show full text]
  • Partitioning the Timeline a Cross-Linguistic Survey of Tense
    Partitioning the timeline A cross-linguistic survey of tense Viveka Velupillai Justus Liebig University Giessen The database The following gives the languages and their values in my database. The lan- guage names are primarily based on the source(s) and WALS Online (Dryer & Haspelmath 2013); where these differ from theEthnologue (Lewis et al. 2013) lan- guage names the Ethnologue name has been given in parenthesis. The ISO-639–3 codes are given in square brackets after the language name. I have relied onWALS Online where possible for the family and genus affiliation; where this was not pos- sible I relied on the source(s) and Ethnologue. The values for fusion type were tak- en from Bickel and Nichols (2013a) where possible; those languages are marked with an asterisk (*). For those languages that were not in Bickel & Nichol’s (2013a) database, I coded the fusion type according to the principles set out in Bickel & Nichols (2013b) using the reference indicated in the ‘Source’ column. Studies in Language 40:1 (2016), 1–42. doi 10.1075/sl.40.1.04ve2 issn 0378–4177 / e-issn 1569–9978 © John Benjamins Publishing Company 2 Viveka Velupillai Viveka No tense Language Genus Family Fusion Source Abui [abz] Greater Alor Timor-Alor-Pantar Isolating/Concatenative (Kratochvíl 2007: 209ff, 350) Achumawi [acv] Palaihnihan Hokan Concatenative (Angulo & Freeland 1930: 89ff, 111) Ainu [ain] Ainu Ainu Concatenative (Shibatani 1990: 80) Apinajé (Apinayé) [apn] Ge-Kaingang Macro-Ge Isolating (Cunha de Oliveira 2005: 170f) Arandai [jbj] South Bird’s Head Marind
    [Show full text]
  • Reduplication: Form, Function and Distribution Carl Rubino
    Reduplication: Form, function and distribution Carl Rubino The systematic repetition of phonological material within a word for se- mantic or grammatical purposes is known as reduplication, a widely used morphological device in a substantial number of the languages spanning the globe. This paper will provide an overview of the types of reduplicative constructions found in the languages of the world and the functions they portray. Finally, a subset of the world's languages, will be categorized as to whether or not they employ reduplicative constructions productively and illustrated in a world map'. 1. Form For purposes of the accompanying typological map, two types of reduplica- tion are distinguished based on the size of the reduplicant: full vs. partial. Full reduplication is the repetition of an entire word, word stem (root with one or more affixes), or root, e.g. Tausug (Austronesian, Philippines) full word lexical reduplication dayang 'madam' vs. dayangdayang 'princess'; laway 'saliva' vs. laway-laway 'land snail', or full root reduplication, shown here with the verbalizing affixes mag- and -(h)un which do not par- ticipate in the reduplication: mag-bichara 'speak' vs. mag-bichara-bichara 'spread rumors, gossip'; mag-tabid 'twist' vs mag-tabid-tabid 'make cas- sava rope confection'; suga-hun 'be heated by sun' vs. suga-suga-hiin 'de- velop prickly heat rash' (Hassan et al 1994). Partial reduplication may come in a variety of forms, from simple con- sonant gemination or vowel lengthening to a nearly complete copy of a base. In Pangasinan (Austronesian, Philippines) various forms of redupli- cation are used to form plural nouns.
    [Show full text]
  • Cover Page the Following Handle Holds Various Files of This Leiden
    Cover Page The following handle holds various files of this Leiden University dissertation: http://hdl.handle.net/1887/67094 Author: Pache, M.J. Title: Contributions to Chibchan historical linguistics Issue Date: 2018-12-05 657 References ABARCA, ROCÍO. 1985. Análisis fonológico del guaymí movere. Estudios de Lingüística Chibcha 4: 7–46. ABBOTT, MIRIAM, AND PATRICK FOSTER. 2015. Macushi dictionary. In: The Intercontinental Dictionary Series, ed. Mary Ritchie Key and Bernard Comrie. Leipzig: Max Planck Institute for Evolutionary Anthropology. <http://ids.clld.org>. ADAM, LUCIEN. 1897. Matériaux pour servir a l’établissement d’une grammaire comparée des dialectes de la famille kariri. (Bibliothèque linguistique américaine, 20.) Paris: J. Maisonneuve. ADELAAR, WILLEM F.H. 1977. Tarma Quechua: Grammar, Texts, Dictionary. Lisse: Peter de Ridder Press. _____. 1984. Grammatical vowel length and the classification of Quechua dialects. International Journal of American Linguistics 50 (1): 25–47. _____. 1995. Les catégories verbales ‘conjugaison’ et ‘genre’ dans les grammaires de la langue chibcha. In: La ‘découverte’ des langues et des écritures d’Amérique: actes du colloque international, Paris, 7–11 septembre 1993. Amerindia 19/20: 173–182. _____. 2000. Propuesta de un nuevo vínculo genético entre dos grupos lingüísticos indígenas de la Amazonía occidental: harakmbut y katukina. In: Actas I Congreso de Lenguas Indígenas de Sudamérica, ed. Luis Miranda Esquerre, vol. 2, pp. 219–236. Lima: Universidad Ricardo Palma, Facultad de Lenguas Modernas. _____. 2004. The Languages of the Andes, with Pieter C. Muysken. Cambridge/New York: Cambridge University Press. _____. 2005. Verbos de baja especificación semántica y expresiones idiomáticas en la lengua muisca. In: Actas del II Congreso de la Región Noroeste de Europa de la Asociación de Lingüística y Filología de América Latina, ed.
    [Show full text]
  • Theodore De Bry's Timucua Engravings
    “Timucua Technology” - A Middle Grade Florida Curriculum Theodore de Bry’s Timucua Engravings – Fact or Fiction? Students discover how an inaccurate interpretation of historical documents can have far-reaching consequences. STUDENT LEARNING GOAL: Students will review and understand evidence that indicates that de Bry’s engravings of native peoples in the New World were intended as exciting travel tales and religious propaganda, not as reflections of reality. SUNSHINE STATE STANDARDS ASSESSED: Social Studies • SS.8.A.1.2 Analyze charts, graphs, maps, photographs and timelines; analyze political cartoons; determine cause and effect. • SS.8.A.1.7 View historic events through the eyes of those who were there as shown in their art, writings, music, and artifacts. • SS.8.A.2.1 Compare the relationships among the British, French, Spanish, and Dutch in their struggle for colonization of North America. Language Arts • LA.7.1.6.2 The student will listen to, read, and discuss familiar and conceptually challenging text. • LA.7.4.2.2 The student will record information (e.g., observations, notes, lists, charts, legends) related to a topic, including visual aids to organize and record information, as appropriate, and attribute sources of information. • LA.8.1.6.2 The student will listen to, read, and discuss familiar and conceptually challenging text. • LA.8.4.2.2 The student will record information (e.g., observations, notes, lists, charts, legends) related to a topic, including visual aids to organize and record information, as appropriate, and attribute sources of information. • LA.8.6.4.1 The student will use appropriate available technologies to enhance communication and achieve a purpose (e.g., video, digital technology).
    [Show full text]
  • Inductive Learning of Phonotactic Patterns
    University of California Los Angeles Inductive Learning of Phonotactic Patterns A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Linguistics by Jeffrey Nicholas Heinz 2007 c Copyright by Jeffrey Nicholas Heinz 2007 The dissertation of Jeffrey Nicholas Heinz is approved. Bruce Hayes D. Stott Parker Colin Wilson Kie Zuraw, Committee Co-chair Edward P. Stabler, Committee Co-chair University of California, Los Angeles 2007 ii To Mika iii Table of Contents 1 Introduction ................................. 1 1 Thesis .................................. 1 1.1 LocalityandLearning ..................... 2 1.2 FactoringtheLearningProblem . 4 2 Other Approaches to Phonotactic Learning . 5 2.1 Learning with Principles and Parameters . 7 2.2 Learning with Optimality Theory . 8 2.3 Learning with Connectionist Models . 10 2.4 LearningwithStatisticalModels . 11 2.5 LocalSummary......................... 12 3 Overview................................. 12 Appendices ................................. 16 A–1 MathematicalPreliminaries . 16 A–1.1 Sets ............................... 16 A–1.2 RelationsandPartiallyOrderedSets . 17 A–1.3 Equivalence Relations and Partitions . 18 A–1.4 Functions and Sequences . 18 A–1.5 StringsandFormalLanguages . 20 2 Establishing the Problem and Line of Inquiry ............ 22 1 Phonotactic Patterns and Phonotactic Knowledge . .. 22 iv 1.1 Patterns over Contiguous Segments . 23 1.2 Patterns over Non-contiguous Segments . 28 1.3 StressPatterns ......................... 29 1.4 Nonarbitrary Character of Phonotactic Patterns . 31 2 PhonotacticGrammars......................... 32 2.1 TheChomskyHierarchy . .. .. 33 2.2 PhonotacticPatternsasRegularSets . 34 2.3 Examples ............................ 37 2.4 LocalSummary......................... 39 3 Addressing the Learning Problem . 40 3.1 TheGoldLearningFramework . 42 3.2 The Probably-Approximately Correct (PAC) Framework . 44 3.3 SummaryofNegativeResults . 45 3.4 PositiveResults......................... 46 4 AResearchStrategy .........................
    [Show full text]