Linguistic

Some ‘suprasegmental’ phenomena involve categorical form-meaning mappings, and are tightly integrated in the grammars/lexicons of languages: - Lexical - Lexical & metrical structure - Some intonational phenomena - and more… Lexical tone

• Use of pitch patterns to distinguish words or grammatical forms of words (e.g. singular vs. plural nouns) Tone systems

• Two-way division 1. tone systems: no contour tones 2. Contour tone systems Register tone systems

IPA • Fixed points in speaker’s pitch range (level tones) • 2 tones (H, L) (Ex: Luganda) • 3 tones (H, M, L) (Ex: Yoruba) • Rarer: 4-5 tones

• Present in many African languages (Bantu, Niger-Congo, Nilo-Saharan) Yoruba (Atlantic-Congo) [apá] MH arm [àpá] LH scar [bàtà] LL shoe [bàtá] LH type of drum [aya] MM wife [àyà] LL chest

• Three register tones Contour tone systems

• Systems with contour tones: • Movement between 2 or more targets IPA : • Register/contour: c.f. mono/

• Notation: • Pitch targets • Sequence of 1-5, e.g. “214” • Tone letters: graphical schematic • Tone number: Arbitrary • East/SE Asian languages (Mandarin, Vietnamese, Burmese…) IPA : • Also • Oto-Manguean langs (Central America) • Khosian languages (S. Africa) … Mandarin Chinese (Sino-Tibetan)

Tone 1

Tone 2

Tone 3

Tone 4

• Four contour tones • 妈妈骂马的麻吗? (IPA: [má ma mâ ma᷉ tə mǎ ma]) • “Is mother scolding the horse’s hemp?” Which waveform goes with which Mandarin tone? • 55, 35, 214, 51 Which waveform goes with which Mandarin tone? • 55, 35, 214, 51

35 214

55 51 • Tonal contrasts often involve changes in type. Ex. Burmese:

Name Pitch Duration Phonation Low L Med Modal [kʰà] “shake” High H Long Modal/breathy [kʰá́ ] “be bier” Creaky H Med Creaky [kʰa] “fee” Checked H Short Glottalized [kʰaʔ] “draw off”

• Also: • Zapotecan languages (SW Mexico) • Latvian (Balto-Slavic) …. White Miao Hmong (Hmong-Mien)

• 7 level + contour tones / pitch registers: pitch, phonation, duration Pitch accent systems

• Sub-type of tone systems • Used contrastively, but 0-1 tones per word (usually) • Intuition: • Uses tone (pitch) roughly like “stress accent” languages (English) use stress • Between stress-accent and tone languages

• Controversial (but widely used) classification • Tokyo Japanese: L accent R accent Unaccented [háʃi] “chopscks” [haʃí] “bridge” [haʃi] “edge”

Word A

Word B Postlexical phenomena

• Applying at the utterance level rather than the word level: • Downtrends • • Coarticulation • … Pitch donwtrends

• Declination:TAPS PPhoneticroceedings (gradient) Connell decrease of F0 along utterance successively lower pitch levels, though one might • But underFigure speaker1: Declinati ocontrol!n illustrated in a sequence of consider this to be incidental. Figures 2 and 3 Hausa High tones in the sentence ‘Muudii yaa zoo illustrate automatic and non-automatic gidaa’ (Muudii came home), adapted from Lindau downstep, respectively, in Ibibio. (1986). • Downstep2.2:. DDropownstep register observed in H tonesTh efollowing phenomenon HL of dsequencesownstep was first recognized in print well over a century ago by 340 320 • ObservedChris tinalle rregister (1875:15) languageswho, in his di scussion of 300 280 260 Fante grammar, talks of three tones, high, middle 240 220 and low, but explains the middle tone as “high 200 180 tones abating by one step or successive steps”, 160 Hz thereby explicitly recognizing the underlying ms 300 600 900 1200 1500 sameness of the middle and high tones. In the modern era, it was work by scholars such as Figure 2: Illustration of automatic downstep in the Ibibio phrase Welmers (1959), Winston (1960), and especially (HHHHLHHHH) ‘thought and self-rule’. (Speaker Stewart (1965) that brought the nature of EU.) downstep into focus as the lowering of a high tone in certain specifiable circumstances (it was soon discovered that there are languages in which tones other than high tones are also subject to downstep). One of Stewart’s important 340 320 contributions was the recognition that the 300 280 lowering of the second H in a HH sequence 260 240 paralleled the lowering of the second H in a HLH 220 200 180 sequence. In the latter case, the lowering was 160 Hz attributed to the influence of the intervening L; in ms 300 600 900 1200 1500 the former it was argued that the lowering was due either to an underlying (floating) L, or one Figure 3: Illustration of non-automatic downstep in that had been lost historically. These two types of the Ibibio phrase downstep Stewart referred to as ‘automatic’ and (HHH!HH!HH!H) ‘hand of cassava of king’s kingship’. (Speaker EU.) ‘non-automatic’, respectively. A second

important feature of downstep is that, within 2.3. specifiable bounds, it sets a new ‘ceiling’ for all Downdrift is somewhat more difficult to Hs subsequent to the one downstepped. In other characterize precisely because, as mentioned words, subsequent Hs do not rise above the earlier, the term has been used in different senses. height of the one downstepped, and hence the Most commonly, it has been used synonymously descriptive label ‘terracing’. The limit of with Stewart’s automatic downstep. Hombert downstep is frequently the entire utterance, but (1974), for example, characterizes downdrift as may be defined by a syntactic boundary within “the progressive lowering of a high tone after a the utterance, at which point a ‘reset’ occurs; F0 low tone” (p. 171), and in a footnote explicitly regains its original starting height. The term ‘key- equates it to automatic downstep. Similar views lowering’ was adopted by Stewart to describe are expressed by a range of authors including, this effect, and it has often been described as a more recently, Snider & van der Hulst (1993) downward shift in register (e.g. Snider 1990, and Hyman (2001). However, Hombert also Snider & van der Hulst 1993). A third attributes an intonational element to downdrift, characteristic of downstep is its cumulative observing that Ls also descend, and then suggests nature: successive result in that the term downdrift refer to “the lowering of

Tone sandhi Linguistic prosody

• A range of ‘suprasegmental’ phenomena appear to involve categorical form-meaning mappings, and to be tightly integrated in the grammars/lexicons of languages: - Lexical tone - Lexical stress and metrical structure - Intonational - and more… Lexical stress

• Lexical stress is an abstract property of specific : Stressed syllables are more metrically-prominent than unstressed syllables

• Phonetically, stressed syllables tend to be: • Louder than unstressed syllables • Longer than stressed syllables • More resistant to coarticulation / reduction Phonetic cues to stress

• What about F0/pitch? - Stressed syllables can be marked with a pitch accent - The type of pitch accent (high, low, etc) will depend on the contour chosen by the speaker in each particular utterance Quasi-neutralization of stress contrasts in Spanish Francisco Torreira Miquel Simonet José Ignacio Hualde Max Planck Institute for University of Arizona $ University of Illinois Psycholinguistics $ ' at Urbana-Champaign

Introduction Statistical comparison$ st nd Spanish is a lexical stress language, but the correlates of Differentials between 1 and 2 . Statistical differences only in: stress in Spanish are subtler than in other languages: Syl. duration ß = 22.3, t = 5.66, p < .0001 Quasi-neutralization of• No stress systematic vowel contrasts reduction in unstressed in Spanishsyllables. Intensity ß = 1.77, t = 7.42, p < .0001 Francisco Torreira Miquel• Short Simonet durational differences between José stressed Ignacio and HualdeF1 ß = 21.51, t = 4.1, p < .0001 unstressed syllables. Max Planck Institute for University of Arizona $ University of Illinois Voicing throughout closure: Psycholinguistics$ • Stress placement often difficult toat learn Urbana-Champaign for L2 speakers. 1st syl.: + stress (33%); - stress (46%) z = 3.12, p < .005 ' nd Pitch accents appear to be the best cue to lexical stress. 2 syl.: + stress (33%); - stress (54%) z = 3.24, p < .005 However, pitch accents are not always present in all words. Introduction In many types of utterancesStatistical (questions, continuation comparison$ Automatic classification st nd Spanish is a lexical stress language, but the correlates statements),of Differentials words do not between usually carry1 and a pitch 2 syllable.accent in StatisticalLogistic regression in leave-one-out cross-validation procedure to phrase-medialdifferences position: 'only in: assess contribution of different cues: stress in Spanish are subtler than in other languages: Syl. duration ß = 22.3, t = 5.66, p < .0001 ''''''''''''''''a)'[Siempre$que$miro$la$hora] , …' 90 • No systematicQuasi-neutralization in unstressed ofsyllables. stress contrastsIntensity in Spanish IP ' ß = 1.77, t = 7.42, p < .0001 ''''''''''''''''''''‘Every'4me'I'look'up'the'4me,'…’' • Short durationalFrancisco differences Torreira between stressed Miquel and Simonet F1 José Ignacio Hualde ß = 21.51, t = 4.1, p < .000180 Max Planck Institute for University of''''''''''''''''b)'[ Arizona Siem $ pre$que University$miró$la$ho ofra] IPIllinois,'…' unstressed syllables. Psycholinguistics Voicing throughout closure: 70 $ '''''''''''''''''''''‘Every'4me'she'looked'up'the'4me,'…’ at Urbana-Champaign • Stress placement often difficult to learn for L2 speakers. st Our observations1 syl.: from + stress a corpus (33%); of spontaneous - stress (46%) z = 3.12, p < .00560 conversations2nd suggestStatisticalsyl.: + stress that stresscomparison (33%); contrasts - stress might (54%) be lost in z = 3.24, p < .005 Pitch accents appearIntroduction to be the best cue to lexical stress. $ 50 suchDifferentials contexts: between! 1st and 2nd syllable. Statistical Spanish is a lexical stress language, but the correlates of correctclassification%of However, pitch accents are not always present in all words.differences only in: In manystress intypes Spanish of utterancesare subtler than (questions, in other languages: continuation To!what!extent!does!the!lack!of!pitch!accents!in!phrase4medial!Automatic classification Syl.posi6on!in!Spanish!lead!to!a!neutraliza6on!of!stress!contrasts?! duration ß = 22.3, t = 5.66, p < .0001 statements),• No systematic words vowel do reduction not usually in unstressed carry a syllables.pitch accent in Logistic regression in leave-one-out cross-validation procedure to Intensity assess contribution ß of = different1.77, t = 7.42,cues: p < .0001 phrase-medial position:' F1 'ß = 21.51, t = 4.1, p < .0001 Perception experiment • Short durational differences between stressed and unstressed syllables. Production experiment Thirteen listeners do a forced-choice task (oxytone vs. paroxytone) Voicing throughout closure: on 100 utterances from previous experiment. ''''''''''''''''a)'[Siempre$que$miro$la$hora]IP,'…' 90 • Stress placement often difficult to learn for L2 speakers. We'elicit'stress'contrasts'in'disyllabic'verbal'forms'with'/st ptk/' ''''''''''''''''''''‘Every'4me'I'look'up'the'4me,'…’' 1 syl.: + stress (33%); - stress (46%) z = 3.12, p < .005 Listeners’ accuracy above chance level: consonants:'nd Pitch accents appear to be the best cue to lexical stress. 2 syl.: + stress (33%);80 - stress (54%) z = 3.24, p < .005 62.9% correct classification (range: 55% to 70%). ''''''''''''''''b)'[However, Siempitch accentspre$que are$mi rónot$la$ alwayshora ]presentIP,'…' in all words. $$$$$$$$$$$$$$$$Paroxytone :$ $tapo $ $toco $ $corto$ 70 Analysis by items: 16 items below 50%; 6 below 30%. ''''''''''''''''''''‘Every'4me'she'looked'up'the'4me,'…’In many types of utterances (questions, continuation $$$$$$$$$$$$$$$$Oxytone Automatic:$ $ $tapó -classification-tocó - -cortó - statements), words do not usually carry a pitch accent in Logistic regression in leave-one-out cross-validation procedure to Listeners rely on intensity and duration cues in a similar degree. Our observations from a corpus of spontaneous 60 Automatic classification of listeners responses: phrase-medial position:' Nine'speakers'produce'interroga4ve'sentences'with'informa4on'assess contribution of different cues: conversations suggest that stress contrasts might be loston'screen'(ques4ons'are'normally'produced'as'a'single' in 50 ''''''''''''''''a)'[Siempre$que$miro$la$hora]IP,'…' 90 90 such contexts:! intona4onal 'phrase):' correctclassification%of ''''''''''''''''''''‘Every'4me'I'look'up'the'4me,'…’'Neutralization of stress contrasts in Spanish? 80 To!what!extent!does!the!lack!of!pitch!accents!in!phrase4medial!$$$$$$$$$$$$$$PASADO/ él:'¿Cómo/tocar/tu$estrella?' 80 ''''''''''''''''b)'[Siempre$que$miró$la$hora]IP,'…'

posi6on!in!Spanish!lead!to!a!neutraliza6on!of!stress!contrasts?!'''''''''''''''‘PAST/he:''How/touch/your'star?''70 ''''''''''''''''''''‘Every'4me'she'looked'up'the'4me,'…’ 70 ' ' Our observations from a corpus of spontaneous 60 Perception experiment conversations suggest that stress contrasts might be lost in Thirteenvoicing?' listeners do a forced-choice task (oxytone vs. paroxytone60 ) • SpanishProduction is a lexical stress languageexperiment 50 such contexts:! on 100 utterances from previous experiment. • In citation mode, pitch appears to provide the correctclassification%of 50 We'elicit'stress'contrasts'in'disyllabic'verbal'forms'with'/To!what!extent!does!the!lack!of!pitch!accents!in!phrase4medial!ptk/' syllable'

strongest cue to stress Listeners’ accuracy above chance level: correctclassification%of dura4ons' consonants:'posi6on!in!Spanish!lead!to!a!neutraliza6on!of!stress!contrasts?! 90 62.9% correct classification (range: 55% to 500070%).

' ) intensity' F $$$$$$$$$$$$$$$$Paroxytone:$ $tapo $ $toco $ $corto$ B Perception experiment r d e (

Analysis by items: 16 items below 50%; 6 belowq 30%. y

t (

Thirteeni listeners do a forced-choice task (oxytone vs. paroxytone) H

Production experiment s Conclusions z n )

$$$$$$$$$$$$$$$$Oxytone:$ $ $tapó - -tocó - -cortó- e

on 100t utterances from previous experiment. n We'elicit'stress'contrasts'in'disyllabic'verbal'forms'with'/ptk/' I Listeners rely on intensity and duration cues in a similarPhonetic degree. differences (mostly in duration and intensity) Listeners’30 accuracy above chance level: 0 Nine'speakers'produce'interroga4ve'sentences'with'informa4on'consonants:' 225 AutomaticF1'and'F2' classification of listeners responses: between stress patterns in phrase-medial position. No 62.9% correct classification (range: 55% to 70%). differences in f0. on'screen'(ques4ons'are'normally'produced'as'a'single'$$$$$$$$$$$$$$$$Paroxytone:$ $tapo $ $toco $ $corto$ Analysis by items: 16 itemsf0' below 50%; 6 below 30%. • Stress contrasts are difficult to perceive in 90 Substantial amount of phonetic overlap (~25% of intona4onal$$$$$$$$$$$$$$$$Oxytone'phrase):':$ $ $tapó - -tocó - -cortó- ) z

ListenersH rely on intensity and duration cues in a similar degree. misclassifications). ( phrase-medial position 0

AutomaticF classification of listeners responses: $$$$$$$$$$$$$$PASADO/Nine'speakers'produce'interroga4ve'sentences'with'informa4on'él:'¿Cómo/tocar/tu$estrella?' 80 Correct classification above chance level in perception

'''''''''''''''‘PAST/he:''How/touch/your'star?''on'screen'(ques4ons'are'normally'produced'as'a'single' experiment with native listeners, but numerous errors (almost intona4onal'phrase):' !!kwa90ndo '''''''''kor70to ''''''''''tu'''''''''''''θirkulo'''''' 40% of the data!). ' 50 $$$$$$$$$$$$$$PASADO/él:'¿Cómo/tocar/tu$estrella?' 80 Discussion: voicing?' 60 Time (s) '''''''''''''''‘PAST/he:''How/touch/your'star?'' 70 Is the observed degree of overlap/ambiguity unusual for a ' phonological contrast (cf. e.g. voicing contrast in Rome 50 voicing?' 60 Italian, mid vowels in French and Italian)? syllable' % of correctclassification%of dura4ons' How well are Spanish stress contrasts produced and 50 90 syllable' 5000 perceived online in everyday conversation?

)

intensity' correctclassification%of F

B r

d dura4ons' e (

q y 5000 t 90 ( i H s ) intensity' Conclusions z F n B ) r e d e t

(

q n y

I t ( i H

s PhoneticConclusions differences (mostly in duration and intensity) z n

30 ) e 0 t

225 n I F1'and'F2' Phonetic differencesbetween (mostlystress in patterns duration andin phrase-medial intensity) position. No 30 0 225 F1'and'F2' between stressdifferences patterns in in phrase-medial f0. position. No differences in f0. f0' ' f0' Substantial amount of phonetic overlap (~25% of ) Substantial amount of phonetic overlap (~25% of z ) H z misclassifications). (

H misclassifications). ( 0

0 F F Correct classificationCorrect classificationabove chance level above in perception chance level in perception experimentexperiment with native listeners, with native but numerous listeners, errors but (almost numerous errors (almost !!kwando'''''''''korto''''''''''tu'''''''''''''θirkulo'''''' 40% of the 40%data!). of the data!). !!50kwando'''''''''korto''''''''''tu'''''''''''''θirkulo'''''' 50 Discussion: Time (s) Discussion: Time (s) Is the observed degree of overlap/ambiguity unusual for a phonologicalIs contrastthe observed (cf. e.g. voicingdegree contrast of overlap/ambiguity in Rome unusual for a Italian, mid phonologicalvowels in French contrast and Italian)? (cf. e.g. voicing contrast in Rome How well areItalian, Spanish mid stress vowels contrasts in French produced and and Italian)? perceived onlineHow inwell everyday are Spanish conversation? stress contrasts produced and perceived online in everyday conversation?

'

' Quasi-neutralization of stress contrasts in Spanish Francisco Torreira Miquel Simonet José Ignacio Hualde Max Planck Institute for University of Arizona $ University of Illinois Psycholinguistics $ ' at Urbana-Champaign

Introduction Statistical comparison$ st nd Spanish is a lexical stress language, but the correlates of Differentials between 1 and 2 syllable. Statistical differences only in: stress in Spanish are subtler than in other languages: Syl. duration ß = 22.3, t = 5.66, p < .0001 • No systematic vowel reduction in unstressed syllables. Intensity ß = 1.77, t = 7.42, p < .0001 • Short durational differences between stressed and F1 ß = 21.51, t = 4.1, p < .0001 unstressed syllables. Voicing throughout closure: • Stress placement often difficult to learn for L2 speakers. 1st syl.: + stress (33%); - stress (46%) z = 3.12, p < .005 nd Pitch accents appear to be the best cue to lexical stress. 2 syl.: + stress (33%); - stress (54%) z = 3.24, p < .005 However, pitch accents are not always present in all words. In many types of utterances (questions, continuation Automatic classification Quasi-neutralization ofstatements), stress words contrasts do not usually in carry Spanish a pitch accent in Logistic regression in leave-one-out cross-validation procedure to phrase-medial position:' assess contribution of different cues: Francisco Torreira Miquel Simonet José Ignacio Hualde Max Planck Institute for University of Arizona $ University of Illinois ''''''''''''''''a)'[ Siempre$que$miro$la$hora]IP,'…' 90 Psycholinguistics $ ''''''''''''''''''''‘Every'4me'I'look'up'the'4me,'…’'' at Urbana-Champaign 80 ''''''''''''''''b)'[Siempre$que$miró$la$hora]IP,'…' Statistical comparison$ 70 Introduction ''''''''''''''''''''‘Every'4me'she'looked'up'the'4me,'…’ st nd Spanish is a lexical stress language, but the correlates of Differentials between 1 and 2 syllable. Statistical Our observationsdifferences onlyfrom in: a corpus of spontaneous 60 stress in Spanish are subtler than in other languages: conversations suggest that stress contrasts might be lost in Syl. duration ß = 22.3, t = 5.66, p < .0001 50 • No systematic vowel reduction in unstressed syllables.such contexts:Intensity! ß = 1.77, t = 7.42, p < .0001 correctclassification%of • Short durational differences between stressed and To!what!extent!does!the!lack!of!pitch!accents!in!phrase4medial!F1 ß = 21.51, t = 4.1, p < .0001 unstressed syllables. posi6on!in!Spanish!lead!to!a!neutraliza6on!of!stress!contrasts?!Voicing throughout closure: • Stress placement often difficult to learn for L2 speakers. 1st syl.: + stress (33%); - stress' (46%) z = 3.12, p < .005 Perception experiment nd Pitch accents appear to be the best cue to lexical stress. 2 syl.:Production + Productionstress (33%); - stress experimentexperiment (54%) z = 3.24, p < .005 Thirteen listenersPerception do a forced-choice experiment task (oxytone vs. paroxytone) on 100 utterances from previous experiment. However, pitch accents are not always present in all words.We'elicit'stress'contrasts'in'disyllabic'verbal'forms'with'/ ptk/' In many types of utterances (questions, continuation Automatic classification Listeners’ accuracy above chance level: consonants:' statements), words do not usually carry a pitch accent in Logistic regression in leave-one-out cross-validation procedure to 62.9% correct• classification62% of correct (range: classification 55% to 70%). phrase-medial position:' $$$$$$$$$$$$$$$$assessParoxytone contribution:$ $tapo of different$ $toco cues:$ $corto$ Analysis by items: 16 items below 50%; 6 below 30%. $$$$$$$$$$$$$$$$ Oxytone:$ $ $tapó - -tocó - -cortó- ''''''''''''''''a)'[Siempre$que$miro$la$hora]IP,'…' 90 Listeners rely on intensity and duration cues in a similar degree. ''''''''''''''''''''‘Every'4me'I'look'up'the'4me,'…’' AutomaticLogistic classification regression of listenersof participants’ responses: responses Nine'speakers'produce'interroga4ve'sentences'with'informa4on' 80

''''''''''''''''b)'[Siempre$que$miró$la$hora]IP,'…' on'screen'(ques4ons'are'normally'produced'as'a'single' 70 90 ''''''''''''''''''''‘Every'4me'she'looked'up'the'4me,'…’ intona4onal 'phrase):'

Our observations from a corpus of spontaneous 60 $$$$$$$$$$$$$$PASADO/ él:'¿Cómo/tocar/tu$estrella?' 80 conversations suggest that stress contrasts might be lost in '''''''''''''''‘PAST/he:''How/touch/your'star?'' 50 such contexts:! 70 ' correctclassification%of To!what!extent!does!the!lack!of!pitch!accents!in!phrase4medial! voicing?' 60 posi6on!in!Spanish!lead!to!a!neutraliza6on!of!stress!contrasts?! ' Perception experiment 50 Thirteen listeners do a forced-choicesyllable' task (oxytone vs. paroxytone)

Production experiment correctclassification%of on 100 utterances from previousdura4ons' experiment. We'elicit'stress'contrasts'in'disyllabic'verbal'forms'with'/ptk/' 90 5000

) Listeners’ accuracyintensity' above chance level: F B r d

consonants:' e (

q y

62.9% correct classification (range: 55% to 70%). t ( i H

s Conclusions z $$$$$$$$$$$$$$$$Paroxytone:$ $tapo $ $toco $ $corto$ n ) e t

n Analysis by items: 16 items below 50%; 6 below 30%. I Phonetic differences (mostly in duration and intensity) $$$$$$$$$$$$$$$$Oxytone:$ $ $tapó - -tocó - -cortó- 30 0 225 Listeners rely onF1'and'F2' intensity and duration cues in a similar degree. between stress patterns in phrase-medial position. No Nine'speakers'produce'interroga4ve'sentences'with'informa4on' Automatic classification of listeners responses: differences in f0. on'screen'(ques4ons'are'normally'produced'as'a'single' f0' Substantial amount of phonetic overlap (~25% of

) 90 intona4onal'phrase):' z H misclassifications). (

0 $$$$$$$$$$$$$$PASADO/él:'¿Cómo/tocar/tu$estrella?' F 80 Correct classification above chance level in perception

'''''''''''''''‘PAST/he:''How/touch/your'star?'' experiment with native listeners, but numerous errors (almost 70 ' !!kwando'''''''''korto''''''''''tu'''''''''''''θirkulo'''''' 40% of the data!). 50 voicing?' 60 Discussion: Time (s) 50 Is the observed degree of overlap/ambiguity unusual for a syllable' phonological contrast (cf. e.g. voicing contrast in Rome

% of correctclassification%of dura4ons' Italian, mid vowels in French and Italian)? 90 5000

) intensity' F B How well are Spanish stress contrasts produced and r d e ( q y t ( i H

s Conclusions perceived online in everyday conversation? z n ) e t n I Phonetic differences (mostly in duration and intensity) 30 0 225 F1'and'F2' between stress patterns in phrase-medial position. No differences in f0. f0' Substantial amount of phonetic overlap (~25% of ) z

H misclassifications). (

0 F Correct classification above chance level in perception experiment with native listeners, but numerous errors (almost ' !!kwando'''''''''korto''''''''''tu'''''''''''''θirkulo'''''' 40% of the data!). 50 Time (s) Discussion: Is the observed degree of overlap/ambiguity unusual for a phonological contrast (cf. e.g. voicing contrast in Rome Italian, mid vowels in French and Italian)? How well are Spanish stress contrasts produced and perceived online in everyday conversation?

' Which word receives an accent?

• In English, it depends on context:

• Q1: What types of foods are a good source of vitamins? • A1: LEGUMES are a good source of vitamins.

• Q2: Are legumes a source of vitamins? • A2: Legumes are a GOOD source of vitamins.

• Q3: I’ve heard that legumes are healthy, but what are they a good source of ? • A3: Legumes are a good source of VITAMINS. Phonetic cues to stress in English

• What about F0/pitch? This depends on the context: - Stressed syllables can be marked with a pitch accent - The type of pitch accent (high, low, etc.) will depend on the intonation contour chosen by the speaker in each particular utterance 2.2 Pitch accents and prominence 49 the term ‘pitch accent’ was first proposed by Bolinger (1958) and resurrected by Pierrehumbert (1980), and is now in general use. It should not be confused with a competing sense of the term, denoting the lexically specified pitch features of languages like Japanese, some Bantu languages, and some European languages, including Swedish, Norwegian, and Slovenian. Where necessary, the two senses can be distinguished as intonational pitch accent and lexical pitch accent. Bolinger originally proposed the notion of (intonational) pitch accent as a way of making sense of experimental data (in particular the findings of Fry 1955, 1958) showing that pitch change is a major cue to the perception of stress. In Bolinger’s view, ‘stress’ is nothing but an abstract lexical property of individual syllables, while ‘pitch accent’ is actual prominence in an utterance. If a word is prominent in a sentence, this prominence is realised as a pitch accent on the ‘stressed’ syllable of the word. Much the same view was put forth early in the development of the IPO model, where pitch accents were known as ‘prominence-lending pitch movements’, and little consideration was given to supposed differences of prominence not involving pitch movement. This view is attractive for its simplicity, but there is now overwhelming evidence that it is empirically inadequate. Specifically, as we shall see in the next section, it seems clear that linguistically significant prominence distinctions can be signalled in avarietyofwaysthatdonotinvolvepitchaccent.Toclarifythisissue,weneed to take a more detailed look at the whole problem of stress.

2.2.1 The phonetic nature of stress Probably no topic in the general area of intonation and suprasegmentals has posed such a puzzle as stress. A great deal of experimental work going back many decades has sought to establish both physiological and acoustic correlates of stress as a phonetic phenomenon. Lehiste, writing in 1970, says that ‘of the three suprasegmental features [quantity, tonal features, and stress], stress has for a long time been the most elusive one’ (p. 106). Asimpleexamplemakescleartheextentoftheproblem.Considerastress from English, like permit (noun) versus permit (verb). In citation form, the stress contrast is signalled most conspicuously by differences in :

(3)

(a) permit(noun) (b) permit (verb) 50 2Fundamentalautosegmental–metricalconcepts

The pitch rises to a peak on the stressed syllable, followed by a rapid fall. There are also differences of vowel quality, intensity, and especially syllable duration between the two words, but, as we just saw, Fry (1958) showed that the pitch differences are the cues that listeners use most reliably to make judgements about stress in isolated words when the suprasegmental features are manipulated experimentally. Yet if we change the citation forms into questions, the pitch contours are much less distinct:

(4)

(a) p e r m i t (noun) (b) p e r m i t (verb)

Moreover, the question contours are completely different from the statement contours; we can no longer say that the stressed syllable is cued by a pitch peak. And if we put the words in context after the main intonational peak of an utterance, there may be no pitch distinction at all, yet we still clearly perceive the different stress patterns of the noun and the verb:

(5)

(a) I TOLD you the permit had expired!

(b) I TOLD you they’d permit him to retire!

A long-standing interpretation of Fry’s results is that ‘stress’ – in the sense of perceived prominence on a syllable – is a complex amalgam of F0,duration, and intensity, with F0 generally the most important, but with uncomfortable exceptions made for certain intonational contexts such as (4) and (5). This view has remained remarkably widespread in certain quarters since the time of Fry’s experiments, and a good deal of instrumental work has concentrated on trying to find better characterisations of the correlation between acoustic cues and perceived prominence (e.g. Gay 1978). From the point of view of the AM approach, though, much of this work is misdirected, and ignores basic theoretical problems. The central such problem is the assumption that ‘stress’ is a scalar phonetic property of individual syllables,andthatpitchisoneof the components of that scalar phonetic property. Instead of adopting this idea, the AM view assumes that ‘stress’ is – or is partly based on – an abstract Yes-No question tune

550 500 450 400 H% 350 300 250 200 150 100 L* L* 50 are legumes a good source of VITAMINS

Rise from nuclear L* accent to H% boundary tone Yes-no question tune

550 500 450 400 350 300 H% 250 200 150 L* L* 100 are LEGUMES a good source of vitamins 50

Nuclear L* accents too can shift locations!! Yes-no question tune

550 500 450 400 350 H% 300 250 200 150 L* L* 100 are legumes a GOOD source of vitamins 50

Nuclear L* accents too can shift locations!! 2.2 Pitch accents and prominence 55

Beaver et al. (2007) reaches similar conclusions, though on the basis of rather different methodology and a rather different theoretical focus. Although there is still no consensus on the details, then, it seems likely that we may appropriately speak of ‘stress’ as a phonetic reality independent of pitch accent. This still leaves the question of the relation between the two phenomena, and whether it is appropriate to speak of multiple levels or degrees of stress. These are the topics of the next two subsections.

2.2.2 Tune–text association If, as just suggested, utterance-level prominence patterns (‘stress’) can be char- acterised independently of the distribution of pitch accents, then a complete account of intonational phonology within the AM approach will have to incor- porate a theory of stress or linguistic prominence. The search for such a theory is the focus of much work within what has come to be known as . Metrical phonology begins with Liberman’s notion that linguistic promi- nence crucially involves a relation between nodes in a binary-branching tree structure (Liberman 1975; Liberman and Prince 1977). According to Liberman and Prince, in any such relation one node is strong and the other weak:

(6)

ws sw permit (verb) permit (noun)

It is important to emphasise (because this is a point on which there has fre- quently been misunderstanding) that no absolute degree of prominence is implied by the labels ‘strong’ and ‘weak’. There is no direct phonetic inter- pretation whatsoever of either label, but only of whole structures. What the notation means is that one node is structurally stronger than the other; this rel- ative strength may be manifested phonetically in a great variety of ways. This abstract structural understanding of ‘strength’ – explored in depth by Beckman (1986: chs. 2 and 3) – is crucial for the metrical interpretation of experimental studies of the perception of stress. For example, consider the two words in (7) in the form they have when 56 2Fundamentalautosegmental–metricalconcepts pronounced as separate words.

In(7) citation form, the ‘strong’ syllables of these two words are phonetically very prominent, with substantial pitch excursions, and the ‘weak’ syllables are conspicuouslys lessw so, with energy droppings offw very rapidly and the pitch trailing awaybaby to the bottom of the speaker’s sitter range. But now let us combine them into a single compound:

(8)

sw

sw sw b a b y s i t t e r

(This representation says that ba-isstrongerthan-by, sit-isstronger than -ter,andba-isstrongerthansit-. It leaves the relative strength of -by and sit- unspecified; I will return to the significance of this point below.) The phonetic realisation of this more complex structure, unlike the simple case, allows no single phonetic interpretation of the notion ‘strong’ syllable. How- ever, it is still possible to state reasonably clearly how the prominence relations are realised. The structure in (8) defines a single strongest point or peak of prominence for the phrase as a whole, on ba-. In Liberman and Prince’s ter- minology (1977: 259), this peak of prominence is known as the Designated Terminal Element or DTE. Like the citation forms, the compound has a single DTE, and, as in the citation forms, the sole pitch accent occurs at the DTE. This means that in the compound only ba-ismarkedbyapitchaccent.Wecan still identify sit-asstrongerthan-ter in the compound, but the phonetic cues are a great deal more subtle, and certainly do not involve any significant pitch change. Pitch change, in other words, serves as a cue to prominence when the structure allows it, but prominence relations can also be cued in other ways. There is thus no problem in principle with describing both ba- and sit- as stressed syllables in this phrase, even though our understanding of the details of how stress is signalled is still incomplete. The idea that pitch accents are elements of an intonational tune, and occur at syllables whose prominence in the utterance is somehow independently definable, is an aspect of what Liberman called ‘tune–text association’. This notion can be motivated by analogy to the relation between musical tunes and song texts. In setting texts to music, linguistically stressed syllables are not necessarily associated with long or high-pitched or saliently loud notes – as would be expected if stress is primarily a matter of specific acoustic cues. Rather, what is crucial is for linguistically stressed syllables to be associated 122 Taehong Cho

the remainder of this section, I will review a structural view of prosody (Section 1.1) and intro- duce the phonetics–prosody interface along with the notion of prosodic strengthening (Section 1.2). I will then briefly review preboundary lengthening as kind of boundary-related prosodic strengthening (Section 2) and engage in an in-depth discussion on post-boundary (domain-initial) strengthening effects within and across languages (Section 3). At the end, I will summarize the paper (Section 4).

1.1. A STRUCTURAL VIEW OF PROSODY It is now a widely received view that “prosody” is a grammatical structure that has to be parsed in its own right (e.g., Beckman 1996). Under this structural view of prosody, the term prosody no longer refers merely to lower-order suprasegmental features such as pitch, duration, and amplitude, but it embraces an abstract notion of a higher-order grammatical structure definable as “a hierarchically organized structure of phonologically defined constituents and heads” (Beckman 1996:19). It therefore provides a “frame” for articulation with two functions: a delimitative function regarding how smaller phonological units or prosodic constituents (, syllables) are grouped together to form a larger prosodic constituent (a prosodic word or a phrase) and a culminative function regarding which of the prosodic constituents in the utterance should be the “head” of the phrase [e.g., the intermediate phrase (ip)] to be pro- nounced more prominently than others. A plausible prosodic structure that might stipulate the prosody of the utterance in (1b) (a possible phonetic output in Figure 1b) is given in Figure 2. The delimitative function of prosodic structure is ref lected in Figure 2 in terms of how phonologically units are hierarchically organized: The segments at the bottom of the hierarchy are grouped into an immediately next-higher unit, the syllable (σ) and further into a progres- sively higher unit in the order of the prosodic word (PWd), the ip, and the Intonational Phrase (IP). It is worth noting at this point that prosodic constituents are also referred to as prosodic domains as they often serve as domains of tonal distribution and of applications of phonological Metrical structurerules (cf. Selkirk 1984, 1995; Jun 1998). For the sake of completeness, IP recursion is introduced in the figure, so that an IP can dominate one or more IPs (Krivokapić and Byrd 2012), although

In phonological theory, lexical stress is usually seen as part of a more complex metrical constituent structure.

Cho (2016) reviews different kinds of phonetic effects attributed to such structure

Fig. 2. A prosodic structure of When danger threatens your children, call the police. It depicts a hierarchically-nested organization of phonological units of the utterance in terms of phrasing (or prosodic grouping) and prominence distribution. [Here, an Intonational Phrase (IP) is assumed to be recursive (following Krivokapić and Byrd 2012), so that an IP may govern one or more IPs]. Note that ‘-’ in the association line indicates stressed syllables; H* refers to an H-tone pitch accent as a phrase-level stress; L-, a phrase tone at the end of an intermediate phrase (ip); and L% or H%, a boundary tone at the end of an IP. (The as a possible above the syllable is omitted.) Cho (2015) © 2016 The Author Language and Linguistics Compass 10/3 (2016): 120–141, 10.1111/lnc3.12178 Language and Linguistics Compass © 2016 John Wiley & Sons Ltd Linguistic prosody

• A range of ‘suprasegmental’ phenomena appear to involve categorical form-meaning mappings, and to be tightly integrated in the grammars/lexicons of languages: - Lexical tone - Lexical stress - Intonational phonology - and more… Intonational phonology Ladd 2008

[…] the use of suprasegmental phonetic features to convey ‘postlexical’ or sentence-level pragmatic meanings in a linguistically structured way. Ladd, 2008:4 Intonational phonology Ladd 2008

- Suprasegmental: cued mainly by F0/pitch, but also by duration and intensity

- Postlexical: property of utterances, not words

- Linguistically structured: based on categorical contrasts, and integrated with other layers of linguistic structure (e.g. segmental phonology, syntax) British school

• Palmer (1922), O’Connor & Arnold (1973), Crystal, Halliday

• Didactic goal: Improve the pronunciation of non-native speakers

• At the utterance level, the tune as the basic unit of intonation

• Tonal movements (falls, rises) are the phonetic primitives

• Tunes have a structure, and that only certain tunes can be observed in English

• The different components of tunes have specific pragmatic meanings The structure of English tunes O’Connor & Arnold (1973)

• Nucleus: Stressed syllable of last accented word • Tail: all syllables after nucleus • Head: from stressed syllable of first accented word up to nucleus • Prehead: before head

The seven nuclear tunes of English O’Connor & Arnold (1973)

Heads O’Connor & Arnold (1973)

1. The low head 2. The high head

3. The falling head 4. The rising head

Preheads O’Connor & Arnold (1973)

1. The high pre-head 2. The low pre-head

Tone groups O’Connor & Arnold (1973)

• Not all combinations of preheads, heads, and nuclei (n=103) are attested • Ten frequent tone groups (i.e. combinations of heads and nuclei) are described:

Tone groups

Intonation and meaning O’Connor & Arnold (1973)