Perception and Acquisition of Linguistic Rhythm by Infants

Total Page:16

File Type:pdf, Size:1020Kb

Perception and Acquisition of Linguistic Rhythm by Infants Speech Communication 41 (2003) 233–243 www.elsevier.com/locate/specom Perception and acquisition of linguistic rhythm by infants Thierry Nazzi a,*, Franck Ramus b a Laboratoire Cognition et Developpement, CNRS, Universite Paris 5, 71, Avenue Edouard Vaillant, 92774 Boulogne Billancourt Cedex, France b Institute of Cognitive Neuroscience, University College London, UK Abstract In the present paper, we address the issue of the emergence in infancy of speech segmentation procedures that were found to be specific to rhythmic classes of languages in adulthood. These metrical procedures, which segment fluent speech into its constitutive word sequence, are crucial for the acquisition by infants of the words of their native lan- guage. We first present a prosodic bootstrapping proposal according to which the acquisition of these metrical seg- mentation procedures would be based on an early sensitivity to rhythm (and rhythmic classes). We then review several series of experiments that have studied infantsÕ ability to discriminate languages between birth and 5 months, in an attempt to specify their sensitivity to rhythm and the implication of rhythm perception in the acquisition of these segmentation procedures. The results presented here establish infantsÕ sensitivity to rhythmic classes (from birth on- wards). They further show an evolution of infantsÕ language discriminations between birth and 5 months which, though not inconsistent with our proposal, nevertheless call for more studies on the possible implication of rhythm in the acquisition of the metrical segmentation procedures. Ó 2002 Elsevier Science B.V. All rights reserved. 1. Introduction necting of these sound patterns to the lexical representations stored in the lexicon. Hence for The focus of the present paper is on the issue of adults, this task could be facilitated by the lexicon the segmentation of fluent speech into words, and itself, making it a problem of word recognition as more particularly on the development of speech well as (or rather than) a problem of word seg- segmentation procedures during infancy. For mentation. However, the segmentation task has to adults, speech segmentation involves language- be different for the infant who, starting with no specific phonological procedures (see below, and lexicon and no language-specific phonological Cutler, McQueen and Norris, this volume) that knowledge, has to discover, rather than recognize, allow for the retrieval of the acoustic sound pat- the words in the input and learn the speech seg- terns of words from fluent speech, and the con- mentation procedure appropriate to the language to be learnt. In the following, we present several studies that have recently investigated the issue of word segmentation in early infancy. These studies * Corresponding author. Tel.: +33-1-5520-5992; fax: +33-1- 5520-5985. have first established that speech segmentation E-mail address: [email protected] (T. emerges between 6 and 7.5 months of age in En- Nazzi). glish-learning American infants, hence months 0167-6393/02/$ - see front matter Ó 2002 Elsevier Science B.V. All rights reserved. doi:10.1016/S0167-6393(02)00106-1 234 T. Nazzi, F. Ramus / Speech Communication 41 (2003) 233–243 before the onset of lexical comprehension and Finally, it was shown that 9- to 10.5-month-old production (Jusczyk and Aslin, 1995). They have American infants rely on the typical stress pattern then set out to specify the kind of information that of English to group syllables into word-like units infants rely on to postulate word boundaries in (Morgan and Saffran, 1995) and to remember fa- fluent speech. miliar sequences of syllables (Echols et al., 1997). Some of these studies have established that The ability to retrieve familiar words from fluent young American infants are sensitive to different speech was also found to depend on their stress kinds of potential word-boundary markers that are pattern. Indeed, Jusczyk et al. (1999b) found that language-specific: allophonic information (the fact infants begin segmenting strong–weak nouns (e.g., that the distribution of allophones within words is ‘‘doctor’’ and ‘‘candle’’) from fluent speech at 7.5 position-dependent), phonotactic information (the months, but begin segmenting weak–strong nouns fact that some but not all phonemic sequences are (e.g., ‘‘guitar’’ and ‘‘beret’’) only at 10.5 months. legal at the lexical level), and prosodic/metrical These authors suggested that this processing ad- information (the fact that lexical stress is predom- vantage of strong–weak words might result from inantly word initial in English). A sensitivity to the specification of the predominant stress pattern allophonic differences was found in infants as of English, and the emergence by 7.5 months of a young as 2 months of age (Hohne and Jusczyk, segmentation procedure, appropriate for English, 1994), as attested by their ability to discriminate and based on its metrical properties. Following between pairs such as ‘‘nitrate’’ and ‘‘night rate’’. this procedure, infants would place a word Infants have also been found to become sensitive to boundary before the occurrence of every strong phonotactic properties of their native language syllable in the speech stream, allowing for the de- between 6 and 9 months of age (Friederici and tection of strong–weak but not weak–strong Wessels, 1993; Jusczyk et al., 1993b, 1994; Mattys words. Note that this procedure is language spe- et al., 2001). This is shown by the emergence of a cific, as it would not be appropriate for the ac- preference for legal or frequent sequences of pho- quisition of French in which the metrical structure nemes in their native language with respect to ille- is different. 1 Later, that is between 7.5 and 10.5 gal or infrequent ones (e.g., the frequent and months, infants would become sensitive to the infrequent non-word sequences ‘‘chun’’ and (language-general) distributional properties of ‘‘yush’’). Last but not least, a preference for words syllables in the speech stream, which would then with the predominant English strong–weak stress allow them to segment weak–strong words by pattern (e.g., ‘‘porter’’) over less frequent weak– grouping these two syllables together. Note that strong words (e.g., ‘‘report’’) also emerges between further support that infants can use statistical these two ages (Jusczyk et al., 1993a; Turk et al., regularities in the order of syllables forming a 1995), revealing the emergence of a sensitivity to continuous sequence to build cohesive word-like native word stress patterns. units comes from a study by Saffran et al. (1996) Other studies have investigated whether, once on 8-month-old infants. they are sensitive to these language-specific Hence, the studies above provide good evidence markers, infants actually use them to infer word- that the very onset of word learning/segmentation like units. First, some studies showed that al- is under the influence of language-specific seg- though 9-month-old infants only use allophonic mentation procedures, particularly a procedure cues to locate familiar words in fluent speech when based on the prosodic/metrical properties of the they are guided by distributional cues, 10.5- month-old can rely on these sole allophonic cues (Jusczyk et al., 1999a). Moreover, a recent study 1 Unfortunately, it is not possible to specify the kind of has shown that the fact of providing infants with segmentation procedures developed by infants growing up in a French-speaking environment, given that at this point all the word-boundary phonotactic information helps studies on the emergence of speech segmentation were done them extract familiarized words from fluent speech with infants learning (American) English, with the exception of (Mattys and Jusczyk, 2001). one study with Dutch-learning infants (Kuijpers et al., 1998). T. Nazzi, F. Ramus / Speech Communication 41 (2003) 233–243 235 native language (the strong–weak segmentation Finally, adultsÕ segmentation procedures appear to procedure for English). At this point, and in spite be deeply embedded in their language-specific of its importance, this finding is problematic unless competence, and acquired at a very young age. This we explain how infants come to specify these is attested by the fact that the metrical procedure phonological properties of their native language. they use is determined by their native language ra- Indeed, if we are trying to explain how infants ther than by the language they are listening to: once come to start segmenting fluent speech, we cannot they have mastered a particular language, adults say that they use some ÔknowledgeÕ of the fact that rely on procedures appropriate to that language words are stressed initially in their language unless even when listening to a foreign language (Cutler we can explain how they discovered that fact in- et al., 1986; Otake et al., 1993). Moreover, it has dependently of segmentation: a classical boot- also been shown that even very proficient bilinguals strapping problem. In this paper, we propose the are dominant in one of their languages, and have existence of such an independent mechanism for developed specialized metrical segmentation pro- the acquisition of the metrical segmentation pro- cedures in only one of their languages (Cutler et al., cedure. This proposal was initially inspired by data 1992). from the adult speech segmentation literature and It has been suggested that each of the different the phonetic literature, which we review in the types of metrical segmentation procedures is op- following. Then, we propose a prosodic boot- timally adapted to the processing of a particular strapping account of the acquisition of the metri- rhythmic class of languages (Cutler and Mehler, cal segmentation procedures, and then turn to 1993; Otake et al., 1993; see also Sebastiaan-Gall ees recent data that we have started to gather in sup- et al., 1992; Vroomen et al., 1996), even if minor port of this proposal. processing differences can be found within a Several studies have looked at the way adults class.
Recommended publications
  • Unsupervised Syntactic Chunking with Acoustic Cues: Computational Models for Prosodic Bootstrapping
    Unsupervised syntactic chunking with acoustic cues: computational models for prosodic bootstrapping John K Pate ([email protected]) Sharon Goldwater ([email protected]) School of Informatics, University of Edinburgh 10 Crichton St., Edinburgh EH8 9AB, UK Abstract tags or punctuation. However, infants do have ac- cess to certain cues that have not been well explored Learning to group words into phrases with- by NLP researchers focused on grammar induction out supervision is a hard task for NLP sys- from text. In particular, we consider the cues to syn- tems, but infants routinely accomplish it. We hypothesize that infants use acoustic cues to tactic structure that might be available from prosody prosody, which NLP systems typically ignore. (roughly, the structure of speech conveyed through To evaluate the utility of prosodic information rhythm and intonation) and its acoustic realization. for phrase discovery, we present an HMM- The idea that prosody provides important ini- based unsupervised chunker that learns from tial cues for grammar acquisition is known as the only transcribed words and raw acoustic cor- prosodic bootstrapping hypothesis, and is well- relates to prosody. Unlike previous work on established in the field of language acquisition unsupervised parsing and chunking, we use (Gleitman and Wanner, 1982). Experimental work neither gold standard part-of-speech tags nor punctuation in the input. Evaluated on the has provided strong support for this hypothesis, for Switchboard corpus, our model outperforms example by showing that infants begin learning ba- several baselines that exploit either lexical or sic rhythmic properties of their language prenatally prosodic information alone, and, despite pro- (Mehler et al., 1988) and that 9-month-olds use ducing a flat structure, performs competitively prosodic cues to distinguish verb phrases from non- with a state-of-the-art unsupervised lexical- constituents (Soderstrom et al., 2003).
    [Show full text]
  • Bootstrapping the Syntactic Bootstrapper
    Bootstrapping the Syntactic Bootstrapper Anne Christophe, Isabelle Dautriche, Alex de Carvalho, and Perrine Brusini In 1985, Laudau & Gleitman first outlined the syntactic bootstrapping hypothesis, in their book Language and Experience – Evidence from a blind child, followed in 1990 by Lila Gleitman’s article “The structural sources of verb meanings” (in Language Acquisition). They proposed that young children might learn the meaning of words (and in particular, verbs), by paying attention to the syntactic structures in which these words occur. This highly counter- intuitive hypothesis earned Lila a lot of flak from the community, and paved the way for the broader research framework that is now known as ‘synergies in language acquisition’: the general idea that even impoverished knowledge in one area of language might help children refine their representations in another (e.g., even a very crude proto-lexicon will help you learn your phonological system, see for instance Martin, Peperkamp, & Dupoux, 2013). Thirty years later, syntactic bootstrapping is widely accepted and has been supported by many experimental results – even though a lot remains to be discovered. In this paper, we will examine the ways in which very young children may start gathering the relevant syntactic facts on which to base their acquisition of word meanings – or, in other words, how to ‘bootstrap the syntactic bootstrapper’. In order to exploit syntactic structure to figure out the meaning of unknown words, very young children have to be able to recover at least some elements of the syntactic structure of the sentences they hear, even when they do not know all their content words.
    [Show full text]
  • Mehler Et Al. 31-05-00 a RATIONALIST APPROACH to THE
    Mehler et al. 31-05-00 A RATIONALIST APPROACH TO THE STUDY OF DEVELOPMENT. Humans are helpless at birth but in a few years they learn a great many things, gain increasing autonomy, and acquire motor and perceptual skills as well as language and numerical abilities. How does such a change come about? Observers often marvel at the speed and predictability of growth. For instance, the tri-dimensional organization of space emerges sometime between 16 and 18 weeks after birth (see Held, Birch, & Gwiazda, 1980). Motor learning unfolds predictably, bipedal gait being attained at the end of the first year of life (see Adolph, 1995). Likewise, language unfolds after only a few years of contact with the environment (see Brown, 1973). Major deviations from this schedule usually signal neurological impairment. Psychologists, borrowing terms from biology, tend to refer to these changes as development. It was biologists who first began to describe the predictable changes that take place as the fertilized egg progressively grows into the fully-fledged organism. In this sense, the term development is theoretically neutral : it is simply a word that describes the changes that take place during growth. A survey of the first half of the twentieth century shows that behavioral scientists slighted the study of development until Piaget’s influence began to be felt, roughly, at the time when Cognitive Psychology was beginning to gain a wider audience. Up until that time, most psychologists believed that living creatures learn by extracting regularities from their environment. Association was the mechanism offered to explain learning. It is not readily obvious, however, how association can explain the origin of species-specific behavioral dispositions (see Tinbergen, 1951; and also Gallistel, 1990).
    [Show full text]
  • Epenthesis and Prosodic Structure in Armenian
    UNIVERSITY OF CALIFORNIA Los Angeles Epenthesis and prosodic structure in Armenian: A diachronic account A dissertation submitted in partial satisfaction of the requirements for the degree Doctor of Philosophy in Indo-European Studies by Jessica L. DeLisi 2015 © Copyright by Jessica L. DeLisi 2015 ABSTRACT OF THE DISSERTATION Epenthesis and prosodic structure in Armenian: A diachronic account by Jessica L. DeLisi Doctor of Philosophy in Indo-European Studies University of California, Los Angeles, 2015 Professor H. Craig Melchert, Chair In this dissertation I will attempt to answer the following question: why does Classical Armenian have three dierent reexes for the Proto-Armenian epenthetic vowel word- initially before old Proto-Indo-European consonant clusters? Two of the vowels, e and a, occur in the same phonological environment, and even in doublets (e.g., Classical ełbayr beside dialectal ałbär ‘brother’). The main constraint driving this asymmetry is the promotion of the Sonority Sequenc- ing Principle in the grammar. Because sibilants are more sonorous than stops, the promo- tion of the Sonority Sequencing Principle above the Strict Layer Hypothesis causes speak- ers to create a semisyllable to house the sibilant extraprosodically. This extraprosodic structure is not required for old consonant-resonant clusters since they already conform to the Sonority Sequencing Principle. Because Armenian has sonority-sensitive stress, the secondary stress placed on word-initial epenthetic vowels triggers a vowel change in all words without extraprosodic structure, i.e. with the old consonant-resonant clusters. Therefore Proto-Armenian */@łbayR/ becomes Classical Armenian [èł.báyR] ‘brother,’ but Proto-Armenian */<@s>tipem/ with extraprosodic <@s> becomes [<@s>.tì.pém] ‘I rush’ because the schwa is outside the domain of stress assignment.
    [Show full text]
  • 1 from Sound to Syntax: the Prosodic Bootstrapping Of
    From Sound to Syntax: The Prosodic Bootstrapping of Clauses Item Type text; Electronic Dissertation Authors Hawthorne, Kara Eileen Publisher The University of Arizona. Rights Copyright © is held by the author. Digital access to this material is made possible by the University Libraries, University of Arizona. Further transmission, reproduction or presentation (such as public display or performance) of protected items is prohibited except with permission of the author. Download date 25/09/2021 05:22:33 Link to Item http://hdl.handle.net/10150/283672 1 FROM SOUND TO SYNTAX: THE PROSODIC BOOTSTRAPPING OF CLAUSES by Kara Hawthorne _____________________ Copyright © Kara Hawthorne 2013 A Dissertation Submitted to the Faculty of the DEPARTMENT OF LINGUISTICS In Partial Fulfillment of the Requirements For the Degree of DOCTOR OF PHILOSOPHY In the Graduate College of THE UNIVERSITY OF ARIZONA 2013 2 THE UNIVERSITY OF ARIZONA GRADUATE COLLEGE As members of the Dissertation Committee, we certify that we have read the dissertation prepared by Kara Hawthorne entitled From Sound to Syntax: The Prosodic Bootstrapping of Clauses and recommend that it be accepted as fulfilling the dissertation requirement for the Degree of Doctor of Philosophy _______________________________________________________________________ Date: 11/13/2012 Dr. LouAnn Gerken _______________________________________________________________________ Date: 11/13/2012 Dr. Rebecca Gomez _______________________________________________________________________ Date: 11/13/2012 Dr. Heidi Harley _______________________________________________________________________ Date: 11/13/2012 Dr. Diane Ohala Final approval and acceptance of this dissertation is contingent upon the candidate’s submission of the final copies of the dissertation to the Graduate College. I hereby certify that I have read this dissertation prepared under my direction and recommend that it be accepted as fulfilling the dissertation requirement.
    [Show full text]
  • PERSPECTIVES Child Language Acquisition: Why Universal Grammar Doesn’T Help
    PERSPECTIVES Child language acquisition: Why universal grammar doesn’t help BEN AMBRIDGE JULIAN M. PINE ELENA V. M. LIEVEN University of Liverpool University of Liverpool University of Manchester In many different domains of language acquisition, there exists an apparent learnability prob- lem to which innate knowledge of some aspect of universal grammar (UG) has been proposed as a solution. The present article reviews these proposals in the core domains of (i) identifying syntactic categories, (ii) acquiring basic morphosyntax, (iii) structure dependence, (iv) subja- cency, and (v) the binding principles. We conclude that, in each of these domains, the innate UG- specified knowledge posited does not, in fact, simplify the task facing the learner. Keywords: binding principles, child language acquisition, frequent frames, parameter setting, prosodic bootstrapping, semantic bootstrapping, structure dependence, subjacency, syntax, mor- phosyntax, universal grammar 1. Introduction. Many leading theories of child language acquisition assume innate knowledge of universal grammar (e.g. of syntactic categories such as noun and verb, constraints/principles such as structure dependence and subjacency, and parameters such as the head-direction parameter). Many authors have argued either for or against uni- versal grammar (UG) on a priori grounds such as learnability (e.g. whether the child can acquire a system of infinite productive capacity from exposure to a finite set of ut- terances generated by that system) or evolutionary plausibility (e.g. linguistic principles are too abstract to confer a reproductive advantage). Our goal in this article is to take a step back from such arguments, and instead to con- sider the question of whether the individual components of innate UG knowledge pro- posed in the literature (e.g.
    [Show full text]
  • English-Speaking Preschoolers Can Use Phrasal Prosody for Syntactic Parsing
    de Carvalho et al.: JASA Express Letters [http://dx.doi.org/10.1121/1.4954385] Published Online 23 June 2016 English-speaking preschoolers can use phrasal prosody for syntactic parsing Alex de Carvalho,1,a) Jeffrey Lidz,2 Lyn Tieu,1 Tonia Bleam,2 and Anne Christophe1 1Laboratoire de Sciences Cognitives et Psycholinguistique (ENS, EHESS, CNRS), Departement d’Etudes Cognitives, Ecole normale superieure - PSL Research University, 29 rue d’Ulm, 75005 Paris, France [email protected], [email protected], [email protected] 2Department of Linguistics, University of Maryland, College Park, Maryland 20742, USA [email protected], [email protected] Abstract: This study tested American preschoolers’ ability to use phrasal prosody to constrain their syntactic analysis of locally ambigu- ous sentences containing noun/verb homophones (e.g., [The baby flies] [hide in the shadows] vs [The baby] [flies his kite], brackets indicate pro- sodic boundaries). The words following the homophone were masked, such that prosodic cues were the only disambiguating information. In an oral completion task, 4- to 5-year-olds successfully exploited the sen- tence’s prosodic structure to assign the appropriate syntactic category to the target word, mirroring previous results in French (but challenging previous English-language results) and providing cross-linguistic evi- dence for the role of phrasal prosody in children’s syntactic analysis. VC 2016 Acoustical Society of America [DDO] Date Received: January 29, 2016 Date Accepted: May 3, 2016 1. Introduction According to the prosodic bootstrapping hypothesis,1 phrasal prosody (the rhythm and melody of speech) may provide a useful source of information for parsing the speech stream into words and phrases.
    [Show full text]
  • Final Thesis
    The Effect of Meter-Syntax Alignment on Sentence Comprehension, Sensorimotor Synchronisation, and Neural Entrainment Courtney Bryce Hilton The University of Sydney Faculty of Arts and Social Sciences A thesis submitted to fulfil requirements for the degree of Doctor of Philosophy Abstract Why rhythm in language? And why is linguistic rhythm grasped by means of meter (a temporal grid structure) in the human mind? And specifically, why does the human mind prefer to align meter to language in certain ways and not others? This thesis examines the alignment of meter to syntactic phrase structure and its effect on language comprehension. This is investigated empirically in four experiments, whose results are situated within the relevant linguistic, musicological, cognitive, and neuroscientific literatures. The first two experiments show that meter-syntax alignment indeed affects sentence comprehension, and the second also shows an effect on sensorimotor synchronisation. The third experiment behaviourally replicates the comprehension result while also recording electroencephalography (EEG). This neural measurement shows how delta oscillations track the perceived meter rather than syntactic phrase structure: contradicting some recent theories. The final experiment applies meter-syntax alignment to an algebraic grouping task. By using simpler (better controlled) non-linguistic stimuli, the results in this experiment better constrain the mechanistic interpretation of the results so far. Specifically, I suggest that the effect of meter (and its alignment to syntax) on comprehension is mediated by an effect on short-term/working memory. The broader theoretical and practical implications of these experiments is finally discussed, especially with regard to theories of language processing, music-language parallels, and education.
    [Show full text]
  • Intonational Pitch Accent Distribution in Egyptian Arabic Samantha Jane
    Intonational pitch accent distribution in Egyptian Arabic Samantha Jane Hellmuth Submitted to the Department of Linguistics in partial fulfilment of the requirements for the degree of Doctor of Philosophy in Linguistics and Phonetics School of Oriental and African Studies University of London March 2006 ProQuest Number: 10672847 All rights reserved INFORMATION TO ALL USERS The quality of this reproduction is dependent upon the quality of the copy submitted. In the unlikely event that the author did not send a com plete manuscript and there are missing pages, these will be noted. Also, if material had to be removed, a note will indicate the deletion. uest ProQuest 10672847 Published by ProQuest LLC(2017). Copyright of the Dissertation is held by the Author. All rights reserved. This work is protected against unauthorized copying under Title 17, United States C ode Microform Edition © ProQuest LLC. ProQuest LLC. 789 East Eisenhower Parkway P.O. Box 1346 Ann Arbor, Ml 48106- 1346 Abstract Egyptian Arabic (EA) is a stress-accent language with postlexical intonational pitch accents. This thesis investigates EA pitch accents within the autosegmental-metrical (AM) framework (Ladd 1996). The goal of the study is to identify the place of EA in the spectrum of cross-linguistic prosodic variation, and to resolve the challenge it presents to existing phonological accounts of pitch accent distribution. In a corpus of read and (semi-)spontaneous EA speech a pitch accent was found on (almost) every content word, and in the overwhelming majority of cases the same pitch accent type is observed on every word. The typological implications of EA pitch accent distribution are explored in the context of the typology of word-prosodic variation (Hyman 2001) and variation in the domain of pitch accent distribution is proposed as a new parameter of prosodic variation.
    [Show full text]
  • Prosodic Cues to Word Order: What Level of Representation?
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE ORIGINAL RESEARCH ARTICLE published: 30provided October by 2012 Frontiers - Publisher Connector doi: 10.3389/fpsyg.2012.00451 Prosodic cues to word order: what level of representation? Carline Bernard 1 and Judit Gervain1,2* 1 Laboratoire Psychologie de la Perception (UMR 8158), Université Paris Descartes, Sorbonne Paris Cité, Paris, France 2 Laboratoire Psychologie de la Perception (UMR 8158), CNRS, Paris, France Edited by: Within language, systematic correlations exist between syntactic structure and prosody. Claudia Männel, Max Planck Institute Prosodic prominence, for instance, falls on the complement and not the head of syntactic for Human Cognitive and Brain Sciences, Germany phrases, and its realization depends on the phrasal position of the prominent element. Reviewed by: Thus, in Japanese, a functor-final language, prominence is phrase-initial, and realized as ^ Mohinish Shukla, University of increased pitch ( Toky¯o¯ ni “Tokyo to”), whereas in French, English, or Italian, functor-initial Massachusetts Boston, USA languages, it manifests itself as phrase-final lengthening (to Rome). Prosody is readily avail- Jean-Remy Hochmann, Harvard able in the linguistic signal even to the youngest infants. It has, therefore, been proposed University, USA that young learners might be able to exploit its correlations with syntax to bootstrap lan- *Correspondence: Judit Gervain, Laboratoire guage structure. In this study, we tested this hypothesis, investigating how 8-month-old Psychologie de la Perception (UMR monolingual French infants processed an artificial grammar manipulating the relative posi- 8158), CNRS & Université Paris tion of prosodic prominence and word frequency. In Condition 1, we created a speech Descartes, 45 rue des Saints Pères, stream in which the two cues, prosody and frequency, were aligned, frequent words being 75006 Paris, France.
    [Show full text]
  • Clusters and Classes in the Rhythm Metrics Russell Horton First Reader: Amalia Arvaniti Readers: Roger Levy, Andrew Kehler
    Clusters and Classes in the Rhythm Metrics Russell Horton First reader: Amalia Arvaniti Readers: Roger Levy, Andrew Kehler This co-authored paper incorporates work from my final paper for LIGN201, co-authored with Cody Brimhall and Emily Morgan, which was presented as a poster at the Acoustic Society of America in 2010, and from an associated paper draft co-authored with Brimhall, Morgan and Amalia Arvaniti. Abstract. The classic formulation of the rhythm class hypothesis holds that human languages are either stress-timed, with roughly equal duration of periods in between stresses, or syllable-timed, with all sylla- bles tending toward equal duration. This strict isochrony failed to be confirmed empirically, and various metrics have been proposed to quantify the typology in terms of durational variation of segments instead. We show that a Naive Bayes classifier and a graph-based clusterer can use certain metrics to discriminate between stress-timed English and German and syllable-timed Italian and Spanish with greater than 80% accuracy. They cannot, however, consistently and strongly predict rhythm types for sentences from the unclassified languages Greek and Korean, indicating that the rhythm typology may not generalize beyond the prototypical exemplars of each class. Although the purported success of the metrics in modeling the rhythm typology has been used as a proof of the existence of the typology itself, this is unwarranted, as the main argument that the metrics are valid measures of rhythm is the fact that they partially accord with the predictions of the rhythm typology. Further, the success of the metrics is in proportion to the degree that they correlate with tempo, suggesting that they may merely reflect the reduced durational variability of segments at the high speaking rates that characterize syllable-timed languages, rather than reflecting inherent differences in rhythm class timing.
    [Show full text]
  • Why Is Language Unique to Humans?
    c16.qxd 10/31/2005 6:15 PM Page 251 Why is language unique to humans? Jacques Mehler*†1, Marina Nespor‡, Mohinish Shukla* and Marcela Peña* * International School for Advanced Studies, Trieste, Italy, † Ecole des Hautes Etudes en Sciences Sociales, Paris, France and ‡ Universita di Ferrara, Ferrara, Italy Abstract. Cognitive neuroscience has focused on language acquisition as one of the main domains to test the respective roles of statistical vs. rule-like computation. Recent studies have uncovered that the brain of human neonates displays a typical signature in response to speech sounds even a few hours after birth. This suggests that neuroscience and lin- guistics converge on the view that, to a large extent, language acquisition arises due to our genetic endowment. Our research has also shown how statistical dependencies and the ability to draw structural generalizations are basic processes that interact intimately. First, we explore how the rhythmic properties of language bias word segmentation. Second, we demonstrate that natural speech categories play specific roles during language acquisition: some categories are optimally suited to compute statistical dependencies while other categories are optimally suited for the extraction of structural generalizations. 2005 Percept, decision, action: bridging the gaps. Wiley, Chichester (Novartis Foundation Symposium 270) p 251–284 Linguists and psychologists have studied language acquisition; the former have elaborated the most sophisticated formal theories to account for how this unique competence arises specifically in humans. For instance, Chomsky (1980) formulated the Principles and Parameters theory (hereafter, P&P) to account for the acquisi- tion of language given the poverty of the linguistic data the learner receives.
    [Show full text]