<<

Prosodic influences on formant frequencies of Polish

Patrycja Strycharczuk & Peter Jurgec CASTL, University of Tromsø, N-9037 Tromsø, Norway [email protected], [email protected]

The 3rd Newcastle Postgraduate Conference in Theoretical and Applied July 9, 2008

1 Overview

paper analyzes formant frequencies of (Standard) Polish vowels.

• Prosodic effects were studied; in particular, how formant frequencies are affected by and the position within the prosodic word.

• The paper follows the standard organization; Introduction is followed by the Method, Results, Discussion and Conclusion sections.

2 Introduction

The Polish system is traditionally described as one consisting of 6 oral and 2 nasal vowels (Jassem 2003). This is shown in (1). Only oral vowels were the subject of the study.

(1) Polish vowels (Jassem 2003) i 1 u

e ˜e o ˜o

a

Stress typically falls on the penultimate vowel of the word. In (2), we see that the stress shifts right, if a suffix is added.

1 Prosodic influences on formant frequencies of Polish vowels 2

(2) Penultimate stress "zrobiw ‘he did’ zro"biw-1 ‘they (female) did’

Polish has no phonological . In other words, the inventory of unstressed vowels is identical to the vowels that can appear in stressed positions (see Crosswhite 2001 for a cross-linguistic typology of phonological reduction). Similarly, phonetic reduction is generally considered limited, if not completely absent (Jassem 1962, 2003, Sawicka 1995). This is contrary to the generalizations found in most other languages.

• In an early study, Lindblom (1963) shows that vowels’ formant frequencies (in Swedish) are dependent on the duration of the vowel: the shorter the vowel, the more-schwa like it is (i.e., its formant frequencies are closer to that of schwa). This happens because of the influence of the neighboring segments, and is since known as (phonetic) undershoot.

• Gay (1978) shows that undershoot is caused primarily by stress. This is also because stressed vowels are normally longer than unstressed. He shows that speech tempo has significantly less influence on formant frequencies in comparison to word stress. Later studies confirmed that (Tuller et al. 1982, Engestrand 1988, Miller 1989, Fourakis 1991).

• The phonetic reduction depends on duration and stress, while particular environment (i.e. segments preceding and following a particular vowel) has a less significant effect (Lindblom 1963, Moon & Lindblom 1994, Erickson 2002, Sluijter & van Heuven 1996). For individual languages see also Fourakis et al. 1999, Adank et al. 2004, Hirata & Tsukada 2004, inter alia.

In this study we examined the correlation between vowels’ formant frequencies and their position with respect to word stress.

3 Method

3.1 Corpus

• 57 words selected from the online dictionary of Polish (S lownik je˛zyka polskiego PWN at http://sjp.pwn.pl/). All of the words were nouns.

• 18 were monosyllables of the type CVC, while 39 were trisyllables, ideally of the type CVCVCV.

• The words were chosen with a preference for stops and fricatives in the onsets. Segmentation is considerably easier this way. 3 Strycharczuk, Jurgec

• In trisyllables, 54 tokens were measured. All the tokens were in open .

• Examined positions were prestressed, stressed and poststressed. There were three words per position per vowel.

• Two tokens were disregarded, due to the differences in expected and actual pronun- ciations.

3.2 Speakers

• 6 speakers were chosen, 3 male and 3 female.

• They came from different parts of Poland, and all were speakers of standard Polish.

• All the speakers were students aged 20–25.

3.3 Procedure

• Corpus material was randomized using and online data randomizer.

• Two Power Point presentations were prepared: one for monosyllabic and one for trisyllabic words.

• Words were presented in separate slides. All the words were written in black font on a white background; the position of a word in the screen varied. The slides were changing automatically at 3 s and 4 s intervals for monosyllabic words and trisyllabic words respectively. For each presentation there were three words added initially that were not subsequently measured. Each word appeared twice non- consecutively.

• The speakers were given oral instructions by the experimenter and encouraged to correct themselves, if they had any concerns with the pronunciation of any particular word.

• Recordings were made at the University of Tromsø language laboratory (i.e., a sound treated room).

• Sampling frequency was 44.1 kHz, at a 16-bit rate. The recordings were stored on a portable recording device and transferred to computer for analysis.

• The acoustic analysis was performed using the Praat LPC analysis software (version 4.5).

• F1, F2 and F3 values were measured. Whenever possible, the formant steady state was measured. Otherwise, the central point was measured, or the mean value was calculated. Whenever the reading of a formant was not possible (e.g. too low intensity), the results were dismissed. Prosodic influences on formant frequencies of Polish vowels 4

• A total of 2,526 readings out of 2,592 tokens (or 97.5 %) were finally acknowledged. • The data were then statistically analyzed. Mean formant values for all the vowels in all positions were calculated (together for male and female speakers). Stan- dard deviation and confidence intervals were calculated. Finally, an ANOVA was performed.

4 Results

The average measurements for individual vowels depending on the condition (monosyl- lables, prestressed, stressed, poststressed) are graphically represented with a standard F1/F2 vowel plot (figure 1).

Figure 1: Average F1 and F2 of Polish vowels depending on the prosodic condition

For each vowel all the possible positions with respect to stress were analysed and compared against each their. The values of particular formants in the four different positions for each vowel were tested for statistical significance with respect to one another. The results of ANOVA analysis are summarized in 1. Only p-values are shown. The results were mostly statistically significant (p < 0.05). 5 Strycharczuk, Jurgec

Table 1: p-values of ANOVA depending on the condition. Vowel F1 F2 F3 i 0.56 0.13 <0.001 1 <0.001 0.025 0.31 e <0.001 0.007 0.99 a 0.002 0.41 0.27 o <0.001 0.007 0.06 u 0.64 <0.001 0.06

Note that standard deviations, and consequently also confidence intervals are consider- ably high. Mean formant values were calculated for male and female speakers together. This was a source of great variation in values within a sample and the cause for the high standard deviation. These differences, however, are quite consistent across the popula- tion, so they are considered to have a limited effect for the relative vowels positions.

5 Discussion

• As expected, monosyllables are the most peripheral, both in F1 and F2, with the exception of /u/, for which the poststressed position has the lowest F2. This is surprising, but consistent with the findings for other languages (see references in section 2).

• Vowels in stressed position are used in the following analysis as reference points for the vowels in unstressed positions, and also as points determining the shape of the Polish vowel space.

• According to the data, /1/ is quite front. The featural description of /1/ is prob- lematic and sources disagree in that respect. Rubach (1984) argues that the vowel is [back]. However, the data here point towards the vowel being front centralised, which is in agreement with the labelling by Sawicka (1995).

• The vowel /e/ is quite low. This agrees with the data from Jassem (2003), where /e/ is mid-low, i.e. [E]. Note that no token was /e/ preceded by a palatalized consonant. Had this been the case, a coarticulatory effect would be expected, with a considerable lowering of the F1 value. In constrast, /o/ is slightly higher. This is consistent with the vowel dispersion theories (see, for example, Boersma & Hamann 2007 for an Optimality Theoretic account).

• The vowels in unstressed positions tend to have a lower F2 value than the corre- sponding vowels in stressed positions. The tendency is observable for all non-low vowels. The only exception is pre-stressed /e/, which has an F2 higher than the corresponding stressed vowel. The vowels’ relative behaviour with respect to F1 is Prosodic influences on formant frequencies of Polish vowels 6

rather inconsistent. For the vowel /a/ the unstressed vowels have a lower F1 value than the corresponding stressed vowel, while there is little difference in F2 value.

• The respective positions of pre-stressed and post-stressed vowels are difficult to generalize. Poststressed vowels tend to have a lower F2 (exceptions: /i/ and /1/). The differences with respect to F1 are even more inconsistent, and, in most cases, they are not significant.

• The difference in F1 of [1] and [e] is considerably reduced, but still highly significant (p < 0.001) even in F1. Thus, all vowels are contrastive within each prosodic position.

6 Conclusion

As regards prosody, vowels in monosyllables are distinctly peripheral, while the differences between pre- and post-stressed vowels are mostly not statistically significant. In short, there is no phonological neutralization within each prosodic condition; Polish data show some tendency for phonetic neutralization, but vowels are contrastive in all examined conditions.

References

Adank, Patti, Roeland van Hout & Roel Smits (2004). An acoustic description of the vowels of Northern and Southern Standard Dutch. The Journal of the Acoustical society of America 116. 1729–1738.

Boersma, Paul & Silke Hamann (2007). The evolution of auditory contrast. Ms. Available on Rutgers Optimality Archive, ROA 909, http://roa.rutgers.edu.

Crosswhite, Katherine (2001). Vowel Reduction in Optimality Theory. New York, London: Routledge.

Engestrand, Olle (1988). Articulatory correlates of stress and speaking rate in Swedish VCV utterances. The Journal of the Acoustical society of America 83. 1863–1975.

Erickson, Donna (2002). Articulation of extreme formant patterns for emphasized vowels. Phonetica 49. 134–149.

Fourakis, Marios (1991). Tempo, stress, and vowel reduction in American English. The Journal of the Acoustical society of America 90. 1816–1827.

Fourakis, Marios, Antonis Bonitis & Maria Katsaiti (1999). Acoustic characteristics of Greek vowels. Phonetica 56. 28–43. 7 Strycharczuk, Jurgec

Gay, Thomas (1978). Effect of speaking rate on vowel formant movements. The Journal of the Acoustical society of America 63. 223–230.

Hirata, Yukari & Kimiko Tsukada (2004). The effects of speaking rate and on formant movements in Japanese. In Proceedings of the 2003 Texas linguistics society conference, Somerville, MA: Cascadilla Press. 73–85.

Jassem, Wiktor (1962). Akcent je˛zyka polskiego. Prace je˛zykoznawcze 31. 1–116.

Jassem, Wiktor (2003). Polish. Journal of the International Phonetic Association 33. 103–107.

Lindblom, Bj¨orn (1963). Spectrographic study of vowel reduction. The Journal of the Acoustical society of America 35. 1773–1781.

Miller, James D. (1989). Auditory-perceptual interpretation of the vowel. The Journal of the Acoustical society of America 85. 2114–2134.

Moon, Seung-Jae & Bj¨orn Lindblom (1994). Interaction between duration, context, and speaking style in English stressed vowels. The Journal of the Acoustical society of America 96. 40–55.

Rubach, Jerzy (1984). Cyclic and lexical : the structure of Polish. Dordrecht: Foris.

Sawicka, Irena (1995). Fonologia. In Henryk Wr´obel (ed.) Gramatyka wsp´o lczesnego je˛zyka poskiego. Fonetyka i fonolgia, Warszawa: Wydawnictwo Instytutu Je˛zyka Pol- skiego PAN. 105–194.

Sluijter, Agaath M. C. & Vincent J. van Heuven (1996). Spectral balance as a acoustic correlate of linguistic stress. The Journal of the Acoustical society of America 100. 2471–2485.

Tuller, Betty, Katherine S. Harris & J. A. Scott Kelso (1982). Stress and rate: Differential transformations of articulation. The Journal of the Acoustical society of America 71. 1534–1543.