Valence, Arousal, and Task Effects in Emotional Prosody Processing
Total Page:16
File Type:pdf, Size:1020Kb
ORIGINAL RESEARCH ARTICLE published: 21 June 2013 doi: 10.3389/fpsyg.2013.00345 Valence, arousal, and task effects in emotional prosody processing Silke Paulmann 1*, Martin Bleichner 2 and Sonja A. Kotz 3,4* 1 Department of Psychology and Centre for Brain Science, University of Essex, Colchester, UK 2 Department of Neurology and Neurosurgery, Rudolf Magnus Institute of Neuroscience, University Medical Center Utrecht, Utrecht, Netherlands 3 Department of Neuropsychology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany 4 School of Psychological Sciences, University of Manchester, Manchester, UK Edited by: Previous research suggests that emotional prosody processing is a highly rapid and Anjali Bhatara, Université Paris complex process. In particular, it has been shown that different basic emotions can Descartes, France be differentiated in an early event-related brain potential (ERP) component, the P200. Reviewed by: Often, the P200 is followed by later long lasting ERPs such as the late positive complex. Lorena Gianotti, University of Basel, Switzerland The current experiment set out to explore in how far emotionality and arousal can Lucia Alba-Ferrara, University of modulate these previously reported ERP components. In addition, we also investigated the South Florida, USA influence of task demands (implicit vs. explicit evaluation of stimuli). Participants listened *Correspondence: to pseudo-sentences (sentences with no lexical content) spoken in six different emotions Silke Paulmann, Department of or in a neutral tone of voice while they either rated the arousal level of the speaker or Psychology, Centre for Brain Science, University of Essex, their own arousal level. Results confirm that different emotional intonations can first be Wivenhoe Park, Colchester CO4 differentiated in the P200 component, reflecting a first emotional encoding of the stimulus 3SQ, UK possibly including a valence tagging process. A marginal significant arousal effect was e-mail: [email protected]; also found in this time-window with high arousing stimuli eliciting a stronger P200 than Sonja A. Kotz, Max Planck Institute low arousing stimuli. The P200 component was followed by a long lasting positive ERP for Human Cognitive and Brain Sciences, Stephanstraße 1a, 04103 between 400 and 750 ms. In this late time-window, both emotion and arousal effects were Leipzig, Germany found. No effects of task were observed in either time-window. Taken together, results e-mail: [email protected] suggest that emotion relevant details are robustly decoded during early processing and late processing stages while arousal information is only reliably taken into consideration at a later stage of processing. Keywords: P200, LPC, ERPs, arousal, task demands, emotion, prosody INTRODUCTION thereby ignoring the possible contribution of arousal during There is a recent increase in studies informing about the com- emotional prosody processing. Thus, the present investigation plexity and diversity of how the brain processes emotional aims to start filling this gap in the literature by studying how information from the voice. Much progress has been made in and when these two dimensions impact on emotional prosody depicting which brain structures are implied during emotional processing. prosody processing, that is the variation of acoustic cues such Event-related brain potentials (ERPs) have been widely used as fundamental frequency (F0), amplitude (or intensity), tim- to define the temporal processes involved in emotional prosody ing, and voice quality (energy distribution) during speech (see processing. For instance, early studies on vocal emotion process- e.g., Kotz and Paulmann, 2011 for recent review). In addition, ing have focused on assessing when stimuli of different valences electrophysiological studies have investigated the time-course can be distinguished from one another (e.g., Wambacq and Jerger, or speed with which emotional prosodic information is pro- 2004; Schirmer et al., 2005a). Later studies have explored when cessedtoensureappropriatesocialbehavior(e.g.,Pihan et al., language stimuli expressing so-called basic emotions (anger, fear, 1997; Schirmer et al., 2002, 2005b; Schirmer and Kotz, 2003; disgust, sadness, surprise, happiness) can be differentiated from Bostanov and Kotchoubey, 2004; Wambacq et al., 2004; Kotz neutral stimuli and/or each other (e.g., Paulmann and Kotz, 2008; and Paulmann, 2007; Paulmann and Kotz, 2008; Paulmann and Paulmann et al., 2011). Generally speaking, ERP findings sup- Pell, 2010). However, although most researchers would agree port the notion that valence information is detected and analyzed that emotional information as conveyed by the voice (or other rapidly (within the first 200 ms after stimulus encounter) from non-verbal channels such as face or body posture) can be prosody (e.g., Schirmer et al., 2005a, 2013; Paulmann and Kotz, described in a two-dimensional space, that is, with regard to 2008; Garrido-Vásquez et al., in press), irrespective of speaker valence (pleasant – unpleasant) and arousal (sometimes referred voice (Paulmann and Kotz, 2008; Paulmann et al., 2008b), and to as activation: high – low; see e.g., Feldman-Barrettetal., even when information is not task-relevant (Wambacq et al., 2007 for review of emotion theories), most electrophysiological 2004; Kotz and Paulmann, 2007), or when it is processed pre- research on vocal emotion processing has concentrated on attentively (Schirmer et al., 2005a). Under attentive processing exploring when emotional or valence attributes are processed, conditions, the process of rapid emotional salience detection has www.frontiersin.org June 2013 | Volume 4 | Article 345 | 1 Paulmann et al. Emotional prosody and arousal effects repeatedly been linked to the P200 component, a fronto-centrally relevant stimuli are preferentially processed over irrelevant (c.f., distributed positivity reaching its peak approx. 200 ms after stim- Hinojosa et al., 2009). This view has also received support from ulus onset. studies using non-language emotional stimuli such as pictures While the early P200 component is assumed to reflect (see Olofsson et al., 2008 for review). It should not go unmen- enhanced attention to emotional stimuli so that they can be tioned that there is also some sparse evidence that arousal of preferentially processed if need be, (concurrent) later ERP com- language stimuli can modulate early ERP components: Hofmann ponents are often linked to more in depth processing mechanisms et al. (2009) reported that high-arousal negative words elicit an (e.g., meaning evaluation, access to memory representation). increased early negative ERP between 80 and 120 ms after word Specifically, late emotional prosody effects have been observed in onset in contrast to neutral and low-arousal negative words when several late ERP components including the P300 (e.g., Wambacq participants performed a lexical decision task. This greater ERP and Jerger, 2004), N300 (e.g., Bostanov and Kotchoubey, 2004), negativity was linked to an early effect of arousal on lexical access N400 (e.g., Schirmer et al., 2002, 2005a; Schirmer and Kotz, 2003; processing. Specifically, the authors interpreted the ERP ampli- Paulmann and Pell, 2010), and a late positive complex (LPC; tude differences between high-arousal negative and neutral words Kanske and Kotz, 2007; Schirmer et al., 2013), depending on stim- to reflect early facilitative lexical access for arousing negative stim- uli, tasks, and experimental designs used. Thus, a growing body of uli suggesting an early influence of arousal on affective word literature suggests that emotion signaling features such as valence processing. The same effect, however, was not found for positive or even emotional category knowledge are rapidly extracted and word stimuli, that is, the general influence of arousal on early analyzed during emotional prosody processing. However, next emotional word processing mechanisms remains to be further to nothing is known about additional emotion relevant parame- investigated. Given that different studies applied different tasks, ters that could potentially influence this early evaluation process. it can also not be excluded that varying task demands (explicit Specifically, so-called circumplex models of emotion propose that emotional/arousal focus, implicit emotional/arousal focus) could both valence and arousal dimensions are crucial when describ- partly account for the equivocal time-course findings in the ing how someone feels (e.g., Feldman-Barrett and Russell, 1998; literature. Feldman-Barrett, 2006), that is both dimensions should modulate We are unaware of electrophysiological studies exploring the how emotions are perceived from speech. influence of arousal and valence on emotional prosody processing While previous electrophysiological research on vocal emo- in a combined experimental design. Thus, the current study tested tional language processing has either ignored the dimension of the influence of arousal and valence on both early (P200) and arousal altogether or has tried to control for arousal by keeping late (LPC) ERP components by using pseudo-sentence stimuli activation attributes of stimuli similar, ERP research on visual intoned in six distinct emotional tones (anger, disgust, fear, sad- emotional language processing has already started to explore ness, surprise, happiness). Given that emotions can be expressed the (combined) influence of valence and arousal on processing with either high or low arousal (e.g.,