Voicing”-Tone Interaction in Xiangxiang Chinese
Total Page:16
File Type:pdf, Size:1020Kb
“Voicing”-tone interaction in Xiangxiang Chinese Ting Zeng City University of Hong Kong This paper investigates the phonetic characteristics of the voiced stops and discusses questions about the general assumptions of “voicing”-tone interaction. The resulting data shows that for the voiced stops the VOT values are distributed along a continuum from long-lead to short-lag values, with short-lag values overlapping with the voiceless unaspirated stops. A closer examination of the acoustic characteristics of the voiced stops reveals four patterns of variation for each speaker: simple voiceless stop; voiced stop; voiced implosive and voiced stop where voicing during the oral closure dies out before the release of the stop, instead of continuing into the following vowel. There are seven tones in XXC and voiced stops co-occur only with low tones. These results raise questions about the usual assumption about the mapping between phonological representations of features and their physical correlates, as well as the relationship between variation and historical sound change. 1. Introduction Xiangxiang Chinese (hereafter XXC) is a sub-dialect of the Xiang Dialects of Chinese. According to the impressionistic description in Zeng (2001) and Zeng (2005), there is a three-way distinction of stops in Xiangxiang Chinese: voiceless unaspirated, voiceless aspirated and voiced (shown in Table 1); and voiced stops co-occur only with low tones. Table 1: The three series of stops in XXC Voiceless unaspirated Voiceless aspirated Voiced p ph b t th d k kh This paper aims at (i) investigating the phonetic characteristics of the voiced stops; (ii) conducting an acoustic experiment on tones; (iii) raising questions about the general assumptions of “voicing”-tone interaction based on the data in XXC. T W Toronto Working Papers in Linguistics 28: 395–405 P L Copyright © 2008 Ting Zeng TING ZENG 2. Methodology The wordlist for an investigation of the stops includes all monosyllabic words in XXC which begin with any of the nine stops, and that for the study of tones was given in Table 2. Four native speakers, two male and two female, provided the speech data of the stops, and two of them, one male and one female, provided the speech data of the tones. They all age around 50, and were born and grew up in Xiangxiang. Recording was done in a quiet room by using a Sony PCM-R700 digital audio recorder and a Shure SM-58 microphone. The speakers were asked to read each test word four times with an interval of 2~3 seconds and in a natural manner. Table 2: Test words for the study of tones Test tones Test words Group 1 Group 2 Group 3 Yinping i55 衣 u55 乌 y55 淤 Yangping di23 题 du23 图 dy23 局 thi23 踢 thu23 凸 thy23 出 Ciyangping i34 一 u34 吴 y34 余 Shangsheng i21 已 u21 五 y21 与 Yinqu i45 意 u45 物 y45 玉 Ciyinqu thi25 替 thu25 兔 thy25 处 Yangqu i22 异 u22 务 y22 寓 Note: Ping, Shang, Qu and Ru were four tones that were established since Middle Chinese (M.C. 200 AD - 900 AD). Tonal split then took place resulting in different registers, known in traditional terminology as Yin, Yang, Ciyin and Ciyang. To get the F0 values of each tone, 11 points were sampled at every 10% of the overall duration of the F0 contour for each monosyllabic word. The F0 values were then converted to pitch values by using the formula shown in (1): (1) Ti = 5 × lg xi –lg xmin lgxmax – lgxmin 3. Results 3.1. Stops Phonetic data emerging from the acoustic study of the stops in XXC show several patterns. Table 3 gives the mean VOT values and Standard Deviations of each stop for each speaker. The voiceless aspirated stops can be well distinguished from the other two categories by their long-lag VOT values. The voiced category, as indicated by shading, shows considerable intra-speaker variation along the VOT dimension for each speaker: some tokens have long-lead values, while others have short-lag values overlapping with the voiceless unaspirated category. 396 “VOICING”-TONE INTERACTION IN XIANGXIANG CHINESE Table 3: Mean VOT values and Standard Deviations of each stop for each speaker speaker 1 stop p t k ph th kh mean 9 11 20 78 85 97 SD 1 2 5 17 24 19 stop b b d d g g mean -83 10 -82 7 -86 22 SD 36 5 35 4 38 5 speaker 2 stop p t k ph th kh mean 8 6 12 56 60 80 SD 2 1 2 12 11 24 stop b b d d g g mean -50 6 -41 6 -34 26 SD 36 5 32 2 25 16 speaker 3 stop p t k ph th kh mean 6 6 17 71 63 78 SD 1 2 4 23 20 24 stop b b d d g g mean -97 7 -77 8 -65 16 SD 46 4 34 3 38 5 speaker 4 stop p t k ph th kh mean 9 6 18 88 73 99 SD 2 1 6 15 14 17 stop b b d d g g mean -60 9 -57 7 -71 14 SD 32 1 42 3 37 8 The Figures below further show that for each speaker the VOT values are distributed along a continuum from long-lead to short-lag values, with short-lag values overlapping with the voiceless unaspirated stops. (For each figure, the horizontal axis is the VOT dimension, and the vertical dimension is the percentage occurrence.) chen chen 1.4 1 b d p 0.9 t 1.2 Speaker 1 Speaker 1 0.8 1 0.7 0.6 0.8 0.5 0.6 0.4 0.4 0.3 0.2 0.2 0.1 0 0 -160 -140 -120 -100 -80 -60 -40 -20 0 20 -180 -160 -140 -120 -100 -80 -60 -40 -20 0 20 time (ms) time (ms) 397 TING ZENG chen shen 0.9 1.4 g b 0.8 k p Speaker 1 1.2 Speaker 2 0.7 1 0.6 0.5 0.8 0.4 0.6 0.3 0.4 0.2 0.2 0.1 0 0 -200 -150 -100 -50 0 50 -160 -140 -120 -100 -80 -60 -40 -20 0 20 time (ms) time (ms) shen shen 1.4 0.7 d g t k 1.2 Speaker 2 0.6 Speaker 2 1 0.5 0.8 0.4 0.6 0.3 0.4 0.2 0.2 0.1 0 0 -160 -140 -120 -100 -80 -60 -40 -20 0 20 -100 -80 -60 -40 -20 0 20 40 60 time (ms) time (ms) ding ding 1.4 0.8 b d p t 1.2 Speaker 3 0.7 Speaker 3 0.6 1 0.5 0.8 0.4 0.6 0.3 0.4 0.2 0.2 0.1 0 0 -250 -200 -150 -100 -50 0 50 -160 -140 -120 -100 -80 -60 -40 -20 0 20 time (ms) time (ms) ding huang 0.9 0.9 g b 0.8 k Speaker 3 0.8 p Speaker 4 0.7 0.7 0.6 0.6 0.5 0.5 0.4 0.4 0.3 0.3 0.2 0.2 0.1 0.1 0 0 -140 -120 -100 -80 -60 -40 -20 0 20 40 -120 -100 -80 -60 -40 -20 0 20 time (ms) time (ms) 398 “VOICING”-TONE INTERACTION IN XIANGXIANG CHINESE huang huang 1.5 0.9 d g t Speaker 4 0.8 k Speaker 4 0.7 1 0.6 0.5 0.4 0.5 0.3 0.2 0.1 0 0 -180 -160 -140 -120 -100 -80 -60 -40 -20 0 20 -200 -150 -100 -50 0 50 time (ms) time (ms) Figure 1: VOT measurements of the voiced and voiceless unaspirated stops A closer examination of the acoustic characteristics of the voiced stops revealed four patterns of variation for each speaker: • simple voiceless stop • voiced stop • voiced implosive • stop where voicing during the oral closure dies out before the voiceless release of the stop, instead of continuing into the following vowel. Figures 2-6 show the waveforms and spectrograms of the four types of variation of the prevocalic voiced stops. Figure 2 shows the first variant—a simple voiceless stop—with a short lag VOT value of 5 ms. The stops in Figures 3 and 4 have VOT values of –108 ms and –137 ms respectively. The difference lies in that in Figure 3 the amplitude of vibrations during the time when the oral closure is maintained remains relatively the same, while in Figure 4 the amplitude of vibrations decreases into the following vowel. Both Figures show variation of the second type—voiced stops. The decrease of amplitude in Figure 4 can be explained by the fact that the trans-glottal pressure is not sufficient for high amplitude of vocal cord vibration to be maintained for a long time. The account for the variant in Figure 6 in which the voicing during the oral closure dies out before the release instead of continuing into the following vowel is also aerodynamic: the closure is too long for positive trans-glottal pressure to be maintained, and as a result the vocal cord vibration will stop if trans-glottal pressure becomes zero or even negative before the release. 399 TING ZENG Figure 2: VOT = 5 ms Figure 3: VOT = -108 ms Figure 4: VOT = -137ms Figure 5: VOT = -101 ms Figure 6 In Figure 5 the stop has a VOT value of –101 ms, and the amplitude of vibrations increases during the time when the oral closure is maintained.