Tongue Shape and Airstream Contrasts in N|Uu Clicks: Predictable Information Is Phonologically Active
Total Page:16
File Type:pdf, Size:1020Kb
Tongue Shape and Airstream Contrasts in N|uu Clicks: Predictable Information is Phonologically Active Amanda L. Miller Cornell University [email protected] Outline of Presentation • Introduction - Phonetic questions raised by click phonology • Methods and Benefits of High-speed ultrasound w/ software mixing • Results – Tongue shape & C-V coarticulation • Conclusions Statement of Problem Clicks are consonants produced with two constrictions – one anterior and one posterior. Phonetic and Phonological studies have all asserted that only the anterior constriction varies by click type. However, anterior constriction differences do not straightforwardly account for phonological patterns seen. N|uu Lingual and Mixed Airstream Segments LINGUAL Labio-uvular Denti- Alveo-uvular Palato- pharyngeal Central Lateral pharyngeal Stop ʘ ǀ ǃ ǁ ǂ Nasal LINGUO-PULMONIC Stop ʘ͡ q ǀ͡q ǃ͡ q ǁ͡ q ǂ͡ q Affricate Lexical Frequency 100 120 140 160 180 200 20 40 60 80 •Uvular-i sequences donot occur – •Posterior place does predict the phonological pattern – •Anterior placedoesnot predictthephonological patterns. N|uu 0 occur. Labial p,b Alveolar P s,z Click Phonology: TheBack u S l m t Palatal ops oni c,j Vowel Constraint c Velar A k,g i r Uvular s C t q V r e V / a Labio- m uvular C / ʘ V I upper pharyngeal –i sequences do Li Alveolo- VC n S it ngua t uvular ! o ia V R ps l C Denti- l phar. | oot P l Palato- s a c phar.= e Labio- L i n uvular Ba Fr guo- Alveolo- o ck V S n t V t pul uvular ops o o w Denti- w m e e l l oni phar. |q c Palato- phar. =q Hypotheses • POSTERIOR PLACE OF CLICKS ACCOUNTS FOR BVC PATTERNS. • CLICKS PREVIOUSLY TRANSCRIBED AS ‘VELAR’ vs. ‘UVULAR’ CLICKS DIFFER ONLY IN AIRSTREAM. N|UU AIRSTREAM CONTRASTS Lingual burst Lingual burst Pulmonic burst (b) Dental (a) Dental 0 Time (s) 0.075 0 Time (s) 0.1 (c) Central alveolar (b) Central alveolar 0 Time (s) 0.075 0 Time (s) 0.1 (d) Lateral alveolar (c) Lateral alveolar 0 Time (s) 0.075 0 Time (s) 0.1 (e) Palatal (d) Palatal 0 Time (s) 0.075 0 Time (s) 0.1 LINGUAL STOPS LINGUO-PULMONIC STOPS – contour segments in airstream Phases for Stops and Clicks (3 Speakers) ! Anterior Release Posterior Closure q ! Posterior Release VOT = q = ent | egm S |q || O k 0.00 0.05 0.10 0.15 Duration (in seconds) Question Do N|uu lingual (so-called VELAR) and linguo-pulmonic (so-called UVULAR) clicks also differ in place of the pulmonic portion of the segment, as claimed by Traill (1985) and Ladefoged and Traill (1984) for similar segments in !Xoo? Methods New Architecture • High-speed ultrasound is needed in order to investigate the dynamics of click consonants, and their co-articulation with following vowels. • Since we can’t capture audio in the ultrasound machine, we need to do software mixing, in order to have both high speed, and accurate articulatory – acoustic alignment. DICOM DATA Path / Post Data Collection Software Mixing • High Frame rate DICOM New SysteSystem Diagraagram path with perfect fidelity to US machine. – Today’s data 51 fps Ultrasound (Loqicbook) – Next machine 100-160 fps Microphone Ethernet • Software mixing cable DICOM (Adobe Premier Pro) 1/8 in stereo cable SVGA cable – Powerful, precise, flexible • Redundant h / w mixed audio / Video Cam Canopus Twin 100 video path (s / w mixing aid) Audio AND Video • Video CAM path for IEEE 1374(firwire ) Firewire palataglossatron spatial correction, with audio. Spatial Correction Helmet with Palataglossatron • Articulate Instrument’s Ultrasound Helmet with Palataglossatron spatial correction (Mielke et al. 2005) – Greatly reduces tongue / palate position uncertainty • Typically correcting 2-4 mm within 5- 10 frames . – Particularly required for high frame rate, extended-times, data integrity • Increased temporal alignment and visibility precision would be less valuable without it. Ultrasound helmet results • The ultrasound helmet limited the amount of movement to about 2-3 mm. for 2 out of 3 speakers, and a bit more for a 3rd subject. • The larger amount of movement is due to the ill-fitting nature of the helmet for the 3rd speaker. Place of articulation Offset in mm by word and speaker 7.00 AK GS KE ) 6.00 m 5.00 et (m s f f O n 4.00 o i t a l 3.00 cu ti r a f 2.00 ce o a l P 1.00 0.00 Palate !uu !qui 'ashes' =uuke 'fly' =quu 'neck' 'camelthorn acacia' Word Tongue Height Offset in mm by word and speaker 7.00 AK GS KE 6.00 ) m m 5.00 ( t e s f 4.00 Of ht g i 3.00 He ue 2.00 ng To 1.00 0.00 Palate !uu !qui 'ashes' =uuke 'fly' =quu 'neck' 'camelthorn acacia' Word Rotation angle in degrees by word and speaker 0.90 0.80 AK GS KE 0.70 ) . g e 0.60 d ( e 0.50 gl an 0.40 on i t a t 0.30 o R 0.20 0.10 0.00 Palate !uu !qui 'ashes' =uuke 'fly' =quu 'neck' 'camelthorn acacia' Word High speed, greater visibility • 50 FPS implies 20 msecs resolution instead of 33 msecs – 33 milliseconds is long for some events – 100 FPS yields 10 msecs resolution • To the frame alignment, along 9 seconds of speech – Removes audio / US mapping concerns – Opens up study of individual token differences, and other precision issues. • Adobe tools make observing 20 msecs differences easy – higher spatial image quality 51 FPS Frame clarity – see demo Resolution of spatio-temporal inaccuracies of video based ultrasound ref Alan Wrench and Jim Scobbie* • Many artifacts and 51 FPS distortions documented DICOM (Wrench and Scobbie 2007) are removed by this method • NO external video used in final data • DICOM preserves the internal cine loop quality 30 FPS at ANY frame rate. Video – B-mode sweep preserved * – Notice artifact due to multiple scan times in same external video image Dynamic Alveolar click [!] release To the single frame, temporal alignment with high frame rate DICOM • DICOM perfect cine loops at any speed • H/w mixed audio / 30 fps cine loops • Align audio (adobe slide tool) with DICOM cine loop – Mixed US as guide – Linguistic knowledge along entire 9 sec loop Frame Mapping Demo • Adobe "to the Frame" Alignment Visual Estimate* : frame alignment accuracy for all independent events along a timeline is better than the most accurate event Assumed: the time alignment error is a dependent event • During 9 seconds of speech there are multiple independent events , but the alignment precision of all is dependent and equal. • Each token has a known probability of being accurate “to a frame” by observation • Probability theory for dependent events (each alignment error) predicts ¾ Given that event B has happened ( and A is dependent on B) , then the probability A will happen is greater . ¾ P{A|B}=P{A ∩ B} / P{B} ¾ Given token B is aligned “within a frame”, the probability that token A will be aligned as well is greater than without B. ¾ Implies potential for “probable” sub-frame alignment, by looking at all events on the timeline. *Resmi Gupta, Cornell Statistical Consulting Unit [email protected] Error band A Error band B Error band C Palataglossatron Data Prep: All Frames Aligned (except palate) • Palette (LB) N/A •51 Hz (LB) •30 Hz (LT) • Video cam (HV) Results Head Correction results • In data uncorrected for head movement, [k] and [q] do not look much different. • In data corrected for head movement, [q] has a clearly further back constriction than [k]. • The [!] back constriction is more like [q] than [k]. Alveolar Click [!] Correction BEFORE AFTER 0 3 - 200 5 5 9 3 1 0- 4 190 - 1] 1] 2, , 5 2, e[ 4 185 , [ e ac r c t . a r t - 0 new - 0- 18 5 - 5 175 5 k k 0- C Clo 170 6 C Clo - u u q q 20 40 60 80 100 110 120 130 140 150 160 170 trace[, 1, 1] new.trace[, 1, 1] Before correction, [!] is more similar to [k]. After correction, [!] is more similar to [q]. [q] and [!] similarity arises from tongue root shape. Alveolar Click [!] correction BEFORE AFTER 0 3 - 0 1 2 40 - ] 1 1] , 200 2, [, 2 , [ 0 e e 5 c - c a a r .tr t - w e n - 0 9 1 60 - 0 0 k 7 18 k - C Clo u C Clo q u q 40 60 80 100 40 60 80 100 120 trace[, 1, 1] new.trace[, 1, 1] In this case, the difference is not as large, but [!] still becomes more similar to [q] after correction, then before correction. We see that the hump of the [u] 40 ms after [! ] is the width only of the posterior constriction in [!], not both constrictions. Correction effects- Palatal click • Correction for head movement allows us to see tongue root raising implicit in the palatal click more clearly. • Tongue root raising captures similarity of [ǂ], with the production of [u] described by Esling (2005). Palatal Click [ǂ] Correction BEFORE AFTER 220 0 3 - 0 21 0 ] -4 1 , 1] [, 2 2, e , [ c e a 0 c .tr 0 a r 2 w t - e n 0 - 5 - 0 19 0 -6 k k C Clo C Clo u u q q 30 40 50 60 70 80 90 100 40 50 60 70 80 90 100 110 trace[, 1, 1] new.trace[, 1, 1] The palatal click posterior constriction is farther back in the corrected plot than in the uncorrected plot.