Worlds of Musics: Cognitive Ethnomusicological Inquiries
on Experience of Time and Space in Human Music-making
Dissertation
Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy
in the Graduate School of The Ohio State University
By
Yong Jeon Cheong
Graduate Program in Music
The Ohio State University
2019
Dissertation Committee
Udo Will, Advisor
Georgia Bishop
Graeme Boone
Copyrighted by
Yong Jeon Cheong
2019
2
Abstract
This dissertation is a cognitive ethnomusicological investigation regarding how each individual creates his or her own world via different musical behaviors. The goal of this thesis is to contribute to a model of our sense of time and space from an interdisciplinary perspective. There is a long tradition that we use two cognitive constructs,
‘time’ and ‘space’, when talking about the world. In order to understand how we humans construct our own worlds cognitively via music-making, I first distinguished two behaviors in music performance (singing vs. instrument playing). I looked at how the different modes of music-making shape our body in a distinctive way and modifies our perception of time and space.
For the cognitive sections (chapters 2 & 3), I discussed not only building blocks of temporal experience but also features of space pertaining to the body. In order to build a comparative perspective (chapter 4), I examined various ancient understandings of time and space in different cultures. In terms of music evolution (chapter 5), I looked at the transformative power of music-making and speculated about potentially different modulatory processes between singing and instrument playing. The discussion in the cognitive sections provided the basic ideas for my ‘Hear Your Touch’ project consisting of two behavioral experiments (chapter 6). I focused not only on two elements of temporal experience: 1) event detection, and 2) perception of temporal order, but also on several ii elements of spatial experience: 1) body space, 2) audio-tactile integration, and 3) space pertaining to hands. Both simple reaction time and temporal order judgment experiments provide supporting evidence for differences in spatiotemporal processing between musicians and non-musicians as well as between vocalists and instrumentalists. The simple reaction time experiment suggests that instrumental musical training contributes to enhanced multisensory integration through co-activation. The temporal order judgment experiment indicates not only that musical training changes response to audio-tactile stimuli but also that instrumental training modifies the perception of temporal order.
Compared to non-musicians and vocalists, instrumentalists showed significantly lower absolute and difference thresholds. These demonstrate different effects of specific musical training on our perceptions of time and space. My experimental findings support that, although they are often considered as distinctive cognitive constructs (chapter 4), time and space are established together through our bodily experiences. In connection with music evolution (chapter 5), it is highly likely that the use of both vocal and non-vocal sounds in a communication system might have had significant influence on the development of human cognition by transforming our bodies, our perception of, and our action toward the world. This work suggests that there are many musics that allow us to have different worlds.
iii
Dedication
To my family
iv
Acknowledgments
I would like to express my sincere gratitude to my Cognitive Ethnomusicology guru, Dr.
Dr. Udo Will who has been patiently helping me to find my own path. Without his guidance and supports, I would not have completed my studies here at The Ohio State University.
I thank my committee member, Dr. Georgia Bishop, for giving me have my foundation in neuroscience.
I thank my committee member, Dr. Graeme Boone, for broadening my understanding of music and emotion.
I thank sincerely Dr. Hyun Kyung Chae for being always supportive of my academic journey.
I thank profoundly Seymour Fink and his wife Beth Owen for having me and my Miss
Daisy as part of their family.
v
I thank my friend, Darrell Joseph who has enriched my Columbus life with his help, humor, and kindness.
I thank my djembe master, Mr. Balla Sy, who initiated me into a new world of music- making.
I thank all of my programmer friends, Stephan Wolf, Tim Vets, Qianli Feng, Gopi
Tummala, Jessie Zhao, Leon Durrenberger, and Jack McHugh. Without them, I could not run my experiments.
I thank people who were willing to be my guinea pigs for tedious experiments.
I thank Dr. McCoy, the director of OSU Voice Teaching and Research Lab, who helped me to have my singer participants.
I thank Nancy McDonald-Kenworthy for her insightful editorial advice.
I thank Steven Brown, Daniel Everett, Lara Pearson, and Sundeep Teki for the quick reprint permission.
vi
I thank the Musicology department, the School of Music, and the College of Arts and
Science for the financial supports and all the opportunities for me.
I thank my Miss Daisy to be my perfect lab mate.
Above all, I thank my family who love and support me unconditionally throughout my life.
vii
Vita
2014 – 2018 Graduate Research & Teaching Associate, The Ohio State University
2013 M.A in Musicology & M.M in Composition, The Ohio State University
2006 M.M in Composition, Ewha Womans University
2003 B.M in Composition, Ewha Womans University
Publications
Cheong, Y. J., & Will, U. (2018). Music, space and body: the evolutionary history of vocal and instrumental music. Proceedings of 15th International Conference on Music Perception and Cognition 10th triennial conference of the European Society for the Cognitive Sciences of Music. Montréal, Canada: University
Cheong, Y. J., Will, U., & Lin, Y-Y. (2017). Do vocal and instrumental primes affect word processing differently: An fMRI study on the influence of melodic primes on word processing in Chinese musicians and non-musicians. Proceedings of 25th Anniversary Conference of the European Society for the Cognitive Sciences of Music, 35-39. Ghent, Belgium: University of Ghent
Klyn, N. A., Will, U., Cheong, Y. J., & Allen, E. T. (2015). Differential short-term memorisation for vocal and instrumental rhythms. Memory, 24(6). 766-791.doi: 10.1080/09658211.2015.1050400
Fields of Study
Major Field: Music
Area of Emphasis: Cognitive Ethnomusicology
viii
Table of Contents
Abstract ...... ii Dedication ...... iv Acknowledgments ...... v Vita ...... viii List of Tables ...... xii List of Figures ...... xiii Chapter 1. Introduction ...... 1 Chapter 2. Concerning time ...... 7 Time in cognitive sciences ...... 9 Psychological building blocks of time ...... 11 Event detection ...... 11 Perception of temporal order: simultaneity vs. succession ...... 13 Duration perception ...... 17 Duration estimation ...... 21 Psychological present ...... 22 Rhythm perception ...... 24 Conclusion ...... 26 Chapter 3. Space and music-making bodies ...... 28 Body space ...... 30 Postural schema ...... 32 Superficial schema ...... 33 Body schema vs. Body image ...... 35 Peripersonal space ...... 37 Multisensory integration ...... 40 Body-part centered specificity ...... 41 ix
Sensorimotor coupling ...... 44 Plasticity ...... 45 Two modes of spatial processing: sensorimotor vs. representational ...... 48 Embodied spaces in music-making bodies ...... 50 Chapter 4. The origins of time and space concepts ...... 55 Babylonia ...... 57 India ...... 61 China ...... 71 Greece ...... 88 Mythological views on time and space ...... 90 Paradigmatic views on time and space ...... 92 Conclusion ...... 104 Chapter 5. Transformative power of music-making and the origins of music-making .. 107 Transformative power of music-making ...... 107 The origins of vocal and instrumental music in the human history ...... 117 The vocal and non-vocal communications in the human prehistory ...... 121 Prehistory vocal communication ...... 121 Prehistory non-vocal communication ...... 126 The vocal and non-vocal communications in the animals ...... 131 Animal vocal communication ...... 131 Animal non-vocal sound communication ...... 142 Functions of animal sound communication and their implications on the origins of music ...... 143 Competition-sexual selection hypothesis ...... 144 Cooperation-social cohesion hypothesis ...... 145 Emotion ...... 152 Music vs. Language ...... 157 Non-vocal language: Speech surrogate ...... 160 The design feature analysis of vocal vs. non-vocal music and speech vs. speech surrogate ...... 163 Do two modes of music-making transform our experience of the world differently? 199 Chapter 6. Hear Your Touch: Experimental investigation of embodied time and space in music-making ...... 201
x
Simple Reaction Time (SRT) Experiment ...... 205 Methods...... 206 Results ...... 211 Discussion ...... 226 Temporal Order Judgment (TOJ) experiment ...... 231 Methods...... 232 Results ...... 236 Discussion ...... 257 General Discussion ...... 262 Chapter 7. Conclusion and suggestions for future research ...... 268 References ...... 273 Appendix A. Jajangga text with transliteration and translation ...... 304 Appendix B. Simple Reaction Time (SRT) experiment mean reaction time ANOVA table ...... 305
xi
List of Tables
Table 1. Comparison of spaces pertaining to the singing vs. playing-an-instrument bodies ...... 53 Table 2. Design features of language and music ...... 198 Table 3. Simple Reaction Time (SRT) experiment ANOVA summary ...... 212 Table 4. Significant coefficient estimates of GLMM of SRT experiment ...... 216 Table 5. Temporal Order Judgment (TOJ) experiment reaction time (RT) ANOVA summary...... 238 Table 6. Significant coefficient estimates of GLMM of TOJ experiment RT ...... 241 Table 7. TOJ experiment accuracy ANOVA summary ...... 244 Table 8. Significant coefficient estimates of for GLMM of TOJ experiment accuracy . 250 Table 9. PSE and JND estimates for instrumentalists, non-musicians and vocalists ..... 254 Table 10. PSE and JND estimates for ALIGN, LOCATION, and ARM in instrumentalists ...... 255 Table 11. PSE and JND estimates for ALIGN, LOCATION, and ARM in vocalists .... 256 Table 12. PSE and JND estimates for ALIGN, LOCATION, and ARM in non-musicians ...... 256
xii
List of Figures
Figure 1. Unified model of time perception ...... 20 Figure 2. Two parieto-premotor peripersonal space (PPS) networks on a monkey brain template ...... 39 Figure 3. Ancient Mesopotamia geography ...... 58 Figure 4. Taittirīya upaniṣad’s model of continuum of cosmos ...... 66 Figure 5. Sāṅkhya’s model of continuum of cosmos ...... 68 Figure 6. Qin notation - excerpt from Flowing water (Liushui 流水) ...... 85 Figure 7. Human vocalization system ...... 123 Figure 8. Laryngeal duplication and migration model ...... 126 Figure 9. Phase relationship between singing and rocking in The Song Family’s lullaby performance ...... 150 Figure 10. Semiotic progression model ...... 174 Figure 11. Varying pitch in Karnatic vocal music rendition ...... 179 Figure 12. Three examples of the Kele Drum language and enphrasing technique ...... 184 Figure 13. Extended duality of patterning: Linearity and hierarchy...... 187 Figure 14. Sixteen stimulus configurations of the SRT experiment ...... 207 Figure 15. Presentation and registration microcontrollers for SRT experiment ...... 208 Figure 16. BIOPAC tactile stimulator TSD 190 ...... 209 Figure 17. Simple Reaction Time experiment setup ...... 211 Figure 18. Reaction time (RT) for the factor MODALITY of SRT experiment ...... 213 Figure 19. RT for LOCATION:MODALITY interaction of SRT experiment ...... 214 Figure 20. RT for STATUS:MODALITY interaction of SRT experiment ...... 217 Figure 21. RT for STATUS:ARM:MODALITY interaction of SRT experiment ...... 218 xiii
Figure 22. RT for STATUS:LOCATION:MODALITY interaction of SRT experiment ...... 219 Figure 23. Redundancy gain plot: RT for multisensory and unisensory conditions in three participant groups ...... 221 Figure 24. Predicted multisensory facilitation violation of the RMI for instrumentalists ...... 222 Figure 25. Difference in the joint and multisensory cumulative probability for vocalists ...... 224 Figure 26. Difference in the joint and multisensory cumulative probability for non- musicians ...... 225 Figure 27. Group difference in the joint and multisensory cumulative probability between instrumentalists and non-musicians ...... 225 Figure 28. Eighty stimulus configurations of the Temporal Order Judgment (TOJ) experiment ...... 233 Figure 29. Presentation and registration microcontrollers for TOJ experiment ...... 234 Figure 30. TOJ experiment setup ...... 236 Figure 31. RT for the factor SOA of TOJ experiment ...... 239 Figure 32. RT for LOCATION:MODALITY interaction of TOJ experiment ...... 239 Figure 33. RT for ALIGN: MODALITY interaction of TOJ experiment ...... 240 Figure 34. RT for STATUS of TOJ experiment ...... 242 Figure 35. RT for STATUS:SOA interaction of TOJ experiment ...... 243 Figure 36. % correct for the factor MODALITY of TOJ experiment ...... 245 Figure 37. % correct for LOCATION of TOJ experiment ...... 245 Figure 38. % correct for ALIGN :LOCATION interaction of TOJ experiment ...... 246 Figure 39. % correct for LOCATION:MODALITY interaction of TOJ experiment .... 247 Figure 40. % correct for the factor SOA of TOJ experiment ...... 248 Figure 41. % correct for LOCATION:ALIGN:MODALITY interaction of TOJ experiment ...... 248 Figure 42. % correct for the factor STATUS of TOJ experiment ...... 251
xiv
Figure 43. % correct for STATUS:SOA interaction of TOJ experiment ...... 252 Figure 44. % correct for STATUS: MODALITY:SOA interaction of TOJ experiment 252 Figure 45. % correct of ‘sound first’ responses of instrumentalists, non-musicians, and vocalists...... 255 Figure 46. % correct of ‘sound first’ responses for ARM in non-musicians ...... 256
xv
Chapter 1. Introduction
In this dissertation, I intend to propose a cross-cultural model that can explain how music-making affects the cognitive construction of the world we live in, and how music- making contributes to shaping of time and space in both perceptual and conceptual aspects.
Many people have expressed that their experience of the world has been changed by music.
For instance, one of Gabrielsson’s (2011) interviewees described her experience of listening to Tchaikovsky’s Pathétique as “It moved me from the very first note. I felt as if
I was lifted up into another world. Time and space disappeared; perhaps that is what eternity is like” (p. 46). This personal report indicates that music-making transforms our experience of time and space and thereby leads to individually different mental representations of the world. As a cognitive ethnomusicologist, I want to have a holistic picture of the effects of music-making on the cognitive reconstruction of the world. The way I am going to approach the research questions leads me to apply different methodologies. Using a comparative approach, I will discuss how various cultures conceptualize time and space differently. I will also examine the roles of music-making in the establishment of time and space concepts in these cultures. For the cognitive component,
1
I will introduce two behavioral experiments with which I investigated the effects of specific musical training on spatiotemporal processing at the perceptual level.
In order to understand the relation between different musical behaviors and emergence of different world reconstructions, I will first introduce two primary cognitive constructs of the world, time and space, from a cognitive science perspective. In chapter 2,
I will discuss six psychological building blocks of time that shape our temporal experience.
They include 1) event detection, 2) perception of temporal order: simultaneity vs. succession, 3) duration perception, 4) duration estimation, 5) the psychological present, and 6) rhythm perception. As a precondition of other building blocks, event detection is the recognition of change happening in the environment. Perception of temporal order allows us to have temporal experience. Whether two events are happening at the same time or successively requires involvement of different processes. Duration perception relates to the detection of state changes, and I will discuss both duration-based and beat-based timing mechanisms for duration perception. Compared to duration perception, duration estimation is associated with memory. As the duration of experiential process (Fraisse, 1984), the psychological present allows us to keep the multiple percepts as one unit. These psychological building blocks enable rhythm perception.
Chapter 3 deals with spatial cognition with a focus on space as it pertains to the body, and introduces two concepts relevant for this thesis, that of body space and peripersonal space. Body space is the space of the body and peripersonal space is the space immediately surrounding the body. Body space consists of postural and superficial schemata. Superficial schema is related to body surface. For both schemata, a movement
2 component, action, plays an important role. We can see a distinctive character between postural and superficial schemata in terms of perception. The former is associated with proprioception and the latter is connected with touch. Peripersonal space serves two functions of 1) body protection and 2) goal directed action. It is characterized by four features: 1) multisensory integration, 2) body-part centered specificity, 3) sensorimotor coupling, and 4) plasticity. Sensory inputs retrieved from different modalities are integrated.
Various body parts (i.e., head, hand, trunk, etc.) function as different reference frames.
Perceived objects or events via multisensory integration lead to possible actions. The scope of peripersonal space is plastic, for example, tool use can enlarge it. I will discuss potential spatial differences between singing and instrument-playing. The features of audio-tactile perception and hand-centered space (i.e., perihand space) constitute two conspicuous differences between them.
In order to have a better understanding of diverse views of our worlds, I will discuss time and space as cognitive constructs. I will examine how various cultures conceptualize time and space in chapter 4. For this, I will analyze early documents on cosmology from selected cultures. They inform us of how each culture see time and space because ancient cosmology talks about how temporal order and/or spatial order had been established from chaos. Despite of culturally different views on the world, the fragments all together suggest an importance of rituals where music plays a role, which further implies a significance of human action in the construction of the world that is meaningful for each culture. The ancient texts allow us to understand each culture’s own view of time and space because time and space concepts have changed over human history. My discussion about the origins
3 of time and space concepts is heavily in debt on numerous scholarly works of the ancient
Babylonia, Indian, China, and Greek cosmology and philosophy. My analysis will show not only cultural diversity but also some shared characteristics (i.e., eternity, ephemerality,
& cyclicity) in ancient understanding of time and space.
In chapter 5, I will return to two behaviors in music performance (i.e., singing and instrument playing) and trace back their different evolutionary trajectories. I will try to demonstrate consistency in the distinction between singing and instrument playing from historical, prehistorical, and comparative perspectives. We use vocal and non-vocal modes of acoustic communication in music and language, which are human-specific. Using design features of Hockett (1960a, 1960b), I will analyze vocal music, instrumental music, speech and speech surrogate (i.e., non-vocal form of language) in order to have a better understanding regarding how human language, music, and cognition have evolved together.
Chapter 6 introduces two behavioral experiments that are directly connected to the discussions in chapters 2 & 3. For my ‘Hear Your Touch’ project, I conducted a simple reaction time (SRT) and a temporal order judgment (TOJ) experiment. The most important question of experiments concerns the effects of specific musical training on spatiotemporal processing. Therefore, I recruited three different participant groups, vocalists, instrumentalists, and non-musicians In terms of psychological building blocks of time, the
SRT experiment is based on event detection. For this, I delivered unisensory auditory, unisensory tactile, and multisensory audio-tactile stimuli. The TOJ experiment is based on a perception of temporal order. For this, audio- tactile stimuli are coupled and presented in a pair but with different stimulus onset asynchronies. In connection with chapter 3, both
4 experiments consider several features of space pertaining to the body. In terms of body space, the experimental task of crossed vs. uncrossed arms was used to test effect of postural schemata. In terms of peripersonal space, I focused on audio-tactile integration and hand-centered specificity. For this, I presented audio stimuli near the hand and tactile stimuli on the hand. The experimental results show supporting evidence for the different effects of specific musical training on spatiotemporal processing.
Pickering (2017) introduced the term ‘different worlds’ to mean “the fact that other social groups understand and act in the world different from ‘us’” (p.2). He further pointed out that ‘different worlds’ are associated with plural ontologies and argued that ontologies are not schemes of classification and representation but enacted or performed in practice.
My discussion on time and space as cognitive constructs and my experimental finding of differential spatiotemporal processing between vocalists and instrumentalists seem to be in line with Pickering’s (2017) notion of different worlds. This dissertation shows that time and space arise from our minds and bodies that can be shaped in a distinctive way through specific modes of musical practice. Although there are multiple ways of music-making
(e.g., performance, listening, writing, dancing, etc.), this dissertation shows that, at least in music performance, two different modes (i.e., singing vs. instrument playing) allow us to have multiple mental representations of the world. It recalls my conversation with Dr.
McCoy, the director of Voice Teaching and Research Lab. While I conducted my experiments, he helped me with recruiting singer participants. The other day I ran into him in the elevator. I told him “I found singers are different from instrumentalists in my
5 experiments”. Then he said “It isn’t surprising at all, is it?”. There are indeed many worlds and many musics.
6
Chapter 2. Concerning time
I wondered to what extent the cosmic phenomena of night controlled our periods of sleep and activity. In short, I wanted to investigate time - that most inapprehensible and irreversible thing. I wanted to investigate the notion of time which has haunted humanity since its beginning.
From Michel Siffre’s Beyond Time (1964, p. 25)
What is time? Before being asked this question, most of us think that we know time well. A report by a French geologist, Michel Siffre, challenges our naivety on time. In 1962,
Siffre spent two months of a hot summer in a cave of the Alps to conduct underground experiments while being absolutely isolated from the external world. He might have thought that he had comprehended time until he stepped out of the cave and discovered that he lost track of time. He was twenty-five days behind compared to the outside.
Personally, a lecture by my college professor, Dr. Lee, prompted me to think seriously about time. As a composition student, I sat in his ‘Advanced Music Analysis’ class. One day arguing that music is the art of time, he asked us a series of questions about the relationship between time and music. One of his questions I have never forgotten is whether my composition wastes the time of other people. Dr. Lee, a follower of Arnold Schönberg
(1874 - 1951), Theodor W. Adorno (1903 - 1969), and a student of Milton B. Babbitt (1916
- 2011), emphasized that we, composers, should be responsible for what we write. While regarding ‘craftmanship’ as the virtue of a composer, he said that a composer should
7 respect others’ time. According to him, bad music can steal five hours of fifty people’s precious time with a six-minute long piece. Although I accepted his view on a composer’s responsibility to write good music, I could not agree with his perspective on time. To me, it seems that he ignored the subjectivity of human experience of time and the influence of music on it. In terms of subjective experience of time and music, it is common that people say different things about the same music. As shown in the introduction section, listening to Tchaikovsky’s Pathétique moves some people. Other people may not listen to it in the same way. They may say “Suddenly a terrible feeling of confusion came upon me. In what
I believe was only a few seconds, a whole series of conflicting thoughts began rushing around in my mind” (Gabrielsson, 2011, p. 143). This leads me to keep questioning what time is and how music changes our experience of time. Now as a cognitive ethnomusicologist and researcher of time, I investigate how we experience time, how time relates to music, and how music transforms human temporal experience from different perspectives rather than a composer’s view.
To begin with, I would like to examine definitions of time from two widely- accepted dictionaries in order to get general ideas about how time has been seen in the modern world. According to Encyclopædia Britannica, time is “a measured or measurable period, a continuum that lacks spatial dimensions” (Markowitz, Smart, & Toynbee, n.d.).
This definition reflects three features of time. It is 1) quantifiable, 2) a period or duration that exists in the world, and 3) independent from space. Citing various sources including
Oxford, Merriam and Webster, Collins, American Heritage, internet Encyclopedia of
8
Philosophy, and The Stanford Encyclopedia of Philosophy, Wikipedia defines time more comprehensibly:
Time is the indefinite continued progress of existence and events that occur in apparently irreversible succession from the past through the present to the future. Time is a component quantity of various measurements used to sequence events, to compare the duration of events or the intervals between them, and to quantify rates of change of quantities in material reality or in the conscious experience. Time is often referred to as the fourth dimension, along with the three spatial dimensions.
Similar to Encyclopædia Britannica, time as presented by Wikipedia is quantifiable.
The duration of events is the measurable component of time. Wikipedia also distinguishes time from space and implicates that time and space are two components that constitute the world. Furthermore, Wikipedia considers time has an absolute existence with a linear sequence of past, present, and future. It is interesting to me that Wikipedia alludes to the existence of time in both external (i.e., material reality) and internal (i.e., conscious experience) worlds. This examination of definitions of time show how confusing our concept of time is. In the following discussion, my focus is the last feature of time in
Wikipedia, that is, how time is experienced in human mind. For this, I will discuss the notion of time in cognitive sciences and examine psychological building blocks of time.
Time in cognitive sciences
In the late 19th century, the term ‘time sense’ (Zeitsinn in German; sens du temps in
French) referred to the apprehension of any attributes of temporal experience (e.g., duration, change, order of events, etc.). It was studied with an instrument called a ‘time-sense apparatus’ in order to determine the accuracy of time estimation. The early time researchers sometimes had equated time sense with time perception. According to Roeckelein’s (2000, 9
2008) review of the concepts of time in psychology, however, time sense was applied to the capacity of apprehending the attributes of time while time perception denoted specific occurrences of apprehending. Later, time researchers rejected the term ‘time sense’ due to the misrepresentation of the term ‘sense’. Ornstein (1969) pointed out, for instance, that there is no specific sensory organ that processes time information. He also noted that the perception of duration differs in different senses. In line with Ornstein, Friedman (2000) remarked that there is no sensory organ that receives temporal stimulation in the way that eyes or ears transduce light or sound respectively. He argued that time perception is just a metaphor. Current cognitive science studies about time are heavily obliged to Paul Fraisse
(1911 - 1996). According to him, we do not have direct experience of time but only of sequences and rhythms, therefore there is no time sense (Roeckelein, 2000, 2008). Fraisse
(1984) clarified the notion of time:
The notion of time applies to two different concepts which may be clearly recognized from our personal experience of change: (a) the concept of succession, which corresponds to the fact that two or more events can be perceived as different and organized sequentially; it is based on our experience of the continuous changing through which the present becomes the past; (b) the concept of duration, which applies to the interval between two successive events. Duration has no existence in and of itself but is the intrinsic characteristics of that which endures. (p.2)
Here a prerequisite of both succession and duration is event detection. This is the most fundamental one among psychological building blocks that allows us to have temporal experience (Will, 2017). In the following section, I will discuss six psychological building blocks that include 1) event detection, 2) perception of temporal order, 3) duration perception, 4) duration estimation, 5) psychological present, and 6) rhythm perception.
10
Psychological building blocks of time
Event detection
We all live in the world which continuously changes. Change of the world is the precondition of our experience of time. Studies about human temporal experience have shown an importance of perceptible changes, that is, the interaction between senses and the external world. In the caveman story above, for instance, Siffre completely isolated himself for sixty-three days. No access to external cues from the environment (e.g., sun light) disturbed his biological clock. His biological cycle of sleep-wake was prolonged due to an absence of outside cues. Therefore, his circadian clock was de-coupled from the external cycle of day and night. At the end of the experiment, researchers of the caveman study also observed that Siffre’s estimation of two minutes in the cave took actually five minutes in addition to the twenty-five day difference between the actual calendar and Siffre’s estimation of the date. It demonstrates that both sensory deprivation and biological changes affect significantly our experience of time.
We do not detect every single change as an event. Sensory organs detect events depending on features that are specific to each modality. A change of features should reach a threshold to be perceived as an event. Visual features are luminance, size, and position, etc. Auditory features include amplitude, frequency, and source location, etc. Tactile features contain temperature, texture, and size, etc. Additionally, an interaction between sensory information retrieved from different modalities affects our perception of events.
Using a simple reaction time, researchers reported a faster reaction to multisensory inputs
11 than unisensory inputs. This is known as the Redundant Signal Effect. I will investigate this facilitatory effect of multisensory stimuli in chapter 6.
Not all detectable events exist only in the external world; some are generated internally. For example, memory can drive some sensations. Anderson-Barnes, McAuliffe,
Swangerg, & Tsao (2009) proposed proprioceptive memory in order to explain the symptoms of phantom limb pain with which patients report lingering sensations after amputation or severe pain of their missing limbs. According to the authors, proprioceptive memory refers to long-term memory where proprioceptive information has been stored sub-consciously. In other words, this long-term memory contributes a generation of phantom limb pain. Music imagery and earworms are other examples of internally generated detectable events. Composers have musical imagery. When writing my first orchestral piece Color of Sky, I heard in my mind a piercing trumpet melody that followed by strings playing a C major chord. As Sacks (2007) reported in his book Musicophilia, earworms, sticky music or catchy tunes in the mind, can be a compulsive, even pathological phenomenon.
In sum, event detection is one of the fundamental psychological building blocks of human temporal experiences. We detect events, changes in the world, via our senses. Event detection is based on a feature analysis of stimuli. Stimuli can be either external or internal.
Memory, imagery, etc., may generate internal ones.
12
Perception of temporal order: simultaneity vs. succession
If multiple events are detected, how does our mind organize them? Let me take a musical example. Imagine two different notes, namely, C and E. When these notes are sounding simultaneously, we call it a chord. If they are sounding successively, we call it a melody. As this example shows, the perception of event order is important for us to construct time because the temporal order gives us hints of how to interpret detected events.
Simultaneous events seem to require a simpler process compared to successive events because they may be causally related, which indicates an involvement of different processes.
Simultaneity
Citing Poincaré who had claimed that the perception of simultaneity is infinite and omnipresent intelligence, Fraisse (1963) pointed out that the order of events in the physical world does not correspond to the order that our sensory organs detect. For instance, we hear the sound of thunder after seeing lightening, although both thunder and lightening happen at the same time in the same place. One might argue that our different perceptions are due to the difference in speed of transmission between sound and light. However, it is important to note that sensory organs have their own modality-specific receptors. Each receptor has a different time interval to convert external stimuli to electric and/or chemical signals so neurons can transmit transduced signals to the central nervous system. Moreover, some senses have more than one receptor type. This suggests that, even in one sensory modality, different conduction times may exist. For instance, the somatosensory system
13 has four different types of mechanoreceptors. They are anatomically distinctive and their response behaviors are different. Specifically, the mechanoreceptors show different firing rates depending on the stimuli to which their neural response adapt (Gescheider, Wright,
& Verrillo, 2009). In particular, there are several types: the Meissner corpuscle (i.e.,
Rapidly Adapting (RA)), the Merkel neurite complex (i.e., Slowly Adapting (SA) type I), the Ruffini corpuscle end organ (i.e., SA type II), and the Pacinian corpuscle. The somatosensory system has four groups of the peripheral axons. Each group is classified by a type of somatosensory information and a thickness of axon myelin. The thicker the myelin is, the faster the conduction velocity is. Carrying proprioceptive information, Group
I has the thickest myelin. Group II is associated with touch information. Group II has smaller diameters than Group I does. Group III involves pain and temperature. Group III is less myelinated than Group II. Unmyelinated Group IV carries information regarding temperature, pain, and itchiness (Bautista & Lumpkin, 2011).
Minimum spacing between events is required to identify them as separate events.
This spacing, called a simultaneity threshold, varies depending on types of sensory modalities and stimulus features as discussed above. Since the late 19th century (e.g., Exner,
1875 cited in Hirsh, 1959; Hirsh & Scherrick, 1961), the auditory system has been considered to have the most superior acuity in the processing of temporal order.
Researchers have noted that, in the auditory system, events separated by less than 2 to 3 ms are perceived as simultaneous if duration of stimuli is brief (e.g., two clicks) (Hirsh,
1959). In their classical study on the cross-modal order perception, Hirsh & Scherrick
(1961) concluded that the auditory system is the one with the best temporal resolution,
14 tactile is less than auditory, and visual has the lowest resolution in order to decide whether two stimuli are simultaneous or not. According to Occelli, Spencer, & Zampini’s 2011 review on audio-tactile temporal order judgment studies, Gescheider performed a series of experiments to measure auditory and tactile temporal resolution for the first time in the late
1960’s. He concluded that simultaneity thresholds for auditory and tactile are about 2 ms and 10 ms respectively.
If two events are perceived as simultaneous although being physically apart, then it means that the events are fused. Multiple features of stimuli can affect fusion. Fusion occurs due to neural responses. For the auditory system, binaural fusion is associated with the subjective perception of one or two acoustic events. In other words, each ear is independently stimulated at the peripheral level, but the brain does not distinguish two events within a certain time window. In binaural fusion studies, stimuli are binaurally presented with interaural time or interaural intensity differences. In experiments investigating binaural fusion, duration and amplitudes of stimuli are controlled because these features allow us to detect an acoustic event. Arguing that a temporal perception of a sound’s onset is influenced by its duration, its spatial configuration, and its intensity,
Schimmel & Kohlrausch (2008) investigated the onset perception of binaural sounds and the role of interaural differences. They noted a systematic relationship between perceived and physical onsets in connection with interaural differences: a longer duration of a sound delays the perceived onset whereas a shorter one does not have a significant delay. This finding implies that the perceptual synchrony can be achieved when a long stimulus starts earlier than short one.
15
Succession
Regardless of types of sensory modalities, we can determine the order of successive events. Early studies reported that at least 20 ms is required to decide the order of two events (Hirsh & Scherrick, 1961; Fraisse, 1984). Interestingly, there is a time window called the temporal fusion within which we can detect multiple events but cannot determine the order of successive events. The temporal fusion occurs between simultaneity (i.e., 2 to
3 ms) and temporal order thresholds (i.e., 30 to 50 ms). Gescheider found 30 ms as threshold for temporal order decision in both auditory and tactile systems (Occelli, et al.,
2011). Using the temporal order judgment task, I investigated the audio-tactile temporal order threshold (see chapter 6). With more than three events, the threshold increases to 100 to 300 ms depending on the relatedness of the events (Will, 2017).
According to Fraisse’s 1984 review, a decision of the temporal order of multiple events requires an integration of various mechanisms including attention and perception, etc. For example, decision time of the order of successive events may correspond to the length of time that is required to attend the sequence of stimuli. This explanation is based on the attention-switching hypothesis. According to the discrete moment hypothesis, however, the decision time involves two separate processes: 1) the features of stimuli affect event detection, and 2) the detected events should be treated separately in order to determine the orders of events. This perspective postulates that the perception of time is not continuous but discrete. The discrete moment hypothesis also posits that events below the temporal order threshold cannot establish discrete functional units, so temporal integration occurs. It is important to note that each event should be marked as a functional
16 unit not to fuse the order of the events. This implies that the threshold of temporal order varies depending on what processes that are involved to build a functional unit for each event. Building a functional unit for temporal order can be associated not only with perception (e.g., stimulus features, the relatedness across events, etc) but also with action
(e.g., sensorimotor synchronization) (Will, 2017).
To conclude, the mind classifies multiple events in three ways, 1) simultaneous events, 2) distinct but not temporally ordered events (i.e., temporal fusion), and 3) temporally ordered successive events. Events are fused below the simultaneity threshold even when they are physically separated. Beyond the simultaneity threshold, events can be perceived with or without temporal order depending on whether the events reach temporal order threshold. Temporal fusion refers to the fact that people can detect multiple events but cannot decide on the order in which the events happened. Determination of temporal order involves several processes including event detection, attention, discretion of functional units, etc. An establishment of functional units relates to processes of perception and/or action.
Duration perception
Duration, the interval between two successive events, is an intrinsic characteristic of an organization of succession (Fraisse, 1963, 1984). Given that we perceive changes through our senses (i.e., event detection), the term ‘duration’ can be generalized as the length from one state-change to the next. There are two types of durations: the duration between state changes and the duration (extent) of a state change itself. These durations are
17 frequently labeled as ‘empty’ and ‘filled’. An empty interval is bounded by two successive brief sensory signals. A filled interval is characterized by a continuous signal that lasts from its onset to offset. Since the late 19th century (e.g., Meumann, 1896 cited in Fraisse,
1963), researchers have reported a phenomenon called filled duration illusion (FDI). Filled intervals are perceived longer than empty ones although the intervals are of equal length.
FDI can be explained by three components of the scalar timing theory: clock, memory, and comparator. Clock consists of a pacemaker that generates regular pulses. Memory stores the number of pulses generated by a pacemaker. Comparator determines whether the current number of pulses matches with that stored in memory. Accepting the scalar timing theory and focusing on a pacemaker rate, Wearden, Norton, Martin, & Monford-Bebb
(2007) argued for difference in pacemaker rates between filled and empty intervals. The authors discussed that a rate for filled stimuli is faster than empty ones. In other words,
FDI occurs because filled intervals accumulate more pulses than empty ones do.
Additionally, the perceived duration can be influenced by various factors. In terms of features of signals, stronger stimuli seem to lengthen the perceived duration compared to weaker ones (e.g., Goldstone, Lhamon, & Sechzer, 1978, cited in Frassie, 1984). By modifying Israeli’s 1930 study, Nakajima, ten Hoopen, & van der Wilk (1991) first demonstrated an auditory illusion called the time-shrinking illusion, that is, a substantial underestimation in duration judgment. When two auditory empty intervals are presented serially, participants’ judgment of the second interval is substantially influenced by the duration of the first. When the first interval is shorter than the second one, the second duration is remarkably underestimated. Investigating the influence of actions on the
18 perceived duration, Press, Berlot, Bird, Ivry, & Cook (2014) argued that a sensory prediction mechanism, essential for action, can distort the perceived duration of sensory events produced by actions. They also demonstrated that we overestimate the duration of our own actions.
There are two distinct timing mechanisms that have been proposed for duration perception: one is absolute, duration-based timing, and the other is relative, beat-based.
The former is associated with discrete encoding of the absolute duration of each time interval (ΔTi). The latter has been alluded to by a founding father of American psychology,
William James. In his discussion of empty time, James (1890) wrote, “subdividing the time by beats of sensation aids our accurate knowledge of the amount of it (time) that elapses”
(vol.1, p.619). This beat-based timing presumes not only that duration perception is facilitated by a regular beat but also that individual time intervals are encoded relative to the beat (ΔTi /Tbeat). Investigating the relationship between timing and control of movements in the cerebellum, Ivry, Spencer, Zelaznik, & Diedrichsen (2002) proposed event timing that is based on explicit temporal representation. In contrast, temporal regularitiy without explicit temporal representation is achieved via emergent timing which requires emergent or secondary properties to control motor movements. In other words, event timing is associated with a discrete representation of a time interval. It is needed for the temporal control of a series of discrete movements. Emergent timing relates to the processing of a temporal regularity or a beat. It is transformed into another control parameter that allows continuous movement or perform beat-based timing tasks in experimental set-ups (e.g., rhythm discrimination task in Grahn & Brett, 2007; tempo
19 judgment task in Grahn & McAuley, 2009). Teki, Grube, Kumar, & Griffiths (2011) argued distinctive brain areas subserving 1) perception of absolute duration of discrete time interval, and 2) perception of time intervals relative to a regular beat. They found that duration-based timing is correlated with neural activation in the olivo-cerebellar network whereas beat-based timing involves the striato-thalamo-cortical circuits (see fig.1). The olivo-cerebellar network includes the inferior olive, the cerebellar lobules IX and X, the vermis, and the deep cerebellar nuclei such as the dentate nucleus, the superior temporal gyri, and the cochlear nucleus. The striato-thalamo-cortical network consists of the putamen, the caudate nucleus, the thalamus, the supplementary motor area, the dorsal premotor cortex, and the dorsolateral prefrontal cortex. Teki, Grube, & Griffiths (2012) proposed a unified model of time perception in which the neural networks for both duration-based and beat-based timing mechanisms are interconnected and mediate for precision to process the timing signal.
Figure 1. Unified model of time perception (Teki et al., 2012): Blue = the striato-thalamo- cortical network; Green = the olivocerebellar network; Orange = dopaminergic pathways; Red = inhibitory projections; Solid black = excitatory projection; Dashed black = anatomical connections; IO = inferior olive; VTA= ventral tegmental area; GPe = external globus pallidus; GPi = internal globus pallidus; STN= subthalamic nucleus; SNpc = substantia nigra pars compacta; SNpr = substantia nigra pars reticulata: Reprint permission granted by the first author. 20
To conclude, studies of perceived duration have demonstrated various types of distortions in time interval judgment (e.g., filled duration illusion, time-shrinking illusion, etc.). Timing mechanisms proposed for duration perception include duration-based and beat-based in the olivocerebellar and the striato-thalamo-cortical circuits respectively.
Duration estimation
In his The Principles of Psychology, James (1890) established a foundation of the concept of time by making a distinction between experienced time and remembered time, although some of his terminology (e.g., time sense) is questionable. As James (1890) wrote
“I shall deal with what is sometimes called ‘internal perception’ or ‘the perception of time’, and of events as occupying a date therein, especially when the date is a past one, in which case the perception in question goes by the name of ‘memory’” (vol.1, p. 605). He dealt with these two aspects of temporal experience in the chapters entitled “The Perception of
Time” and “Memory”. Experienced time is based on sensory inputs and associated with the psychological building blocks of time. In contrast, remembered time involves internally generated events in the mind. According to James (1890), these events created by the mind are ideas as he noted “To remember a thing as past, it is necessary that the notion of ‘past’ should be one of our ‘ideas’” (vol.1, p. 605). Our chronological construction of time is based on the ideas that are connected like a string of beads.
Similarly, Fraisse (1984) distinguished duration perception from duration estimation on the basis of the involvement of memory. In his discussion about duration estimation, Fraisse (1984) did not specify a type of memory, but noted “estimation of
21 duration takes place when memory is used either to associate a moment in the past with a moment in the present or to link two past events, whereas perception of duration involves the psychological present” (p. 9) and “it will nonetheless be seen that in the case of durations which go beyond perception, new problems arise” (p. 19). This quote refers to the involvement of sensory, working and/or long-term memory. Researchers have considered estimation as one form of time judgment in addition to reproduction and production in experimental contexts (Fraisse, 1984; Friedman, 2000; Grondin, 2010).
These forms have been further explored in prospective and retrospective timing paradigms.
In a prospective timing task, participants are informed in advance that they will be asked to make a time related judgment. In a retrospective timing task that is associated primarily with memory, participants do not know in advance they will be asked to judge an interval of time. They are asked to judge the remembered duration. Methods of both prospective and retrospective paradigms include verbal estimation, reproduction, and production of an interval of time.
Psychological present
The discourse on the psychological present has shown that it has many names including the ‘specious present’, the ‘sensible present’, the ‘psychic present’, the ‘mental present’, the ‘actually present’, the ‘perceived present’ (Fraisse, 1963), and the ‘subjective present’ (Pöppel, 2009). We can find the origin of these terms in The Principles of
Psychology. James (1890) introduced Mr. E. R. Clay’s term ‘specious present’. Clay noted
“Time, then, is considered relatively to human apprehension, consists of four parts, viz.,
22 the obvious past, the specious present, the real present, and the future” (James, 1890, vol.1., p. 609). It is unclear how Clay made a distinction between the specious present and the real present. After his speculation about the specious present and its duration, James proposed the sensible present as “the original paragon and prototype of all conceived time is the short duration of which we are immediately and incessantly sensible” (1890, vol.1, p. 631). The mental present by Piéron (1923) is “a durable present…in which we apprehend a succession of diverse facts in a single mental process which embraces, in the present, a certain interval of time” (Fraisse, 1963, p. 86). Currently, the most widely used term is the psychological present. Fraisse described it in various ways including “the duration of experiential process” (1984, p. 10) and “the duration of the organization of stimuli which we perceive as one unit” (1963, p. 84). In other words, the multiple percepts are kept as a whole within the psychological present. For this, several mechanisms work together.
Sensory memory, allowing us access to information obtained from the senses, plays an important role for the psychological present. Other mechanisms including affect (Fraisse,
1963), attention (Fraisse, 1963; Allman & Mareschal, 2016) and working memory (Fraisse,
1984; Allman & Mareschal, 2016) contribute to the integration of various percepts.
Given the discussion above, a question comes up regarding how long the psychological present lasts. As the involvement of multiple mechanisms implies, duration of the psychological present is plastic. Fraisse (1984) asserted that there is no fixed duration for the psychological present but it ranges from 100 ms to 5 s. After converging the results from different experiments, the psychological present is currently considered to last approximately 2 to 3 s and its upper limit is about 5 s (Pöppel, 2009). The shorter duration
23 of the psychological present relates mainly to sensory memory whereas the longer duration suggests the involvement of additional temporal mechanisms such as affect, attention, and working memory (Pöppel, 2009).
Rhythm perception
Rhythm refers to the perceived and conceived temporal relationships of successive events. Rhythm perception is complex because it encompasses the psychological building blocks we discussed above. Within the width of the psychological present we can directly perceive rhythm but, beyond the psychological present, we do not conceive a sequence of successive events as temporal gestalts. Following the early time studies in which rhythm was discussed with regard to human movements (e.g., Mach, 1865 & Vierordt, 1868, cited in Fraisse, 1982), Fraisse (1982) focused on tempo, a perceptual aspect of rhythmic organization. He further explained that rhythm generated by periodic activities (e.g., walking, swimming, etc.) relates to spontaneous tempo, that is, the tempo that people select when they are asked to tap with minimal instructions. MacDougall & Moore (2005) found the temporal spectrum of human locomotor movements shows a dominant periodicity of 2 beats/sec, corresponding to 2 Hz, which is known as resonant frequency or internal periodicity. This can be used as a reference in perceptual and motor tasks for timing.
Interestingly, van Noorden & Moelants (1999) found that the tempo of many musical pieces is centered around periodicity of 2 beats/sec (i.e., 2 Hz). In his analysis of qin performance, Will (2014, 2018) argued that the performer’s action on this instrument also reflects a resonance frequency of body movement, that is 2Hz.
24
Noting that synchronization is fundamental in dancing and music-making (e.g.
McNeill’s (1995) muscular bonding), Fraisse (1982) argued that sensory inputs affect the periodic movements via synchronization in which temporal anticipation, not regularity of events, plays an important role. In terms of temporal anticipation, tapping experiments have reported that participants’ taps tend to precede isochronous tones by a few tenths of milliseconds. This is known as negative mean asynchrony (for a review, see Repp, 2005;
Repp & Su, 2013). This phenomenon can be explained by how external and internal periodicities interact. The relationship between external and internal periodicities has been investigated in entrainment studies. Entrainment refers to the process by which two or more independent temporal systems synchronize with each other through mutual interaction
(Kung, 2017). Jones (1976) and her colleagues (Jones & Boltz, 1989; Large & Jones 1999) argued that attentional cycles mediate these two periodicities. Clayton, Sager, & Will (2005) introduced the concept of entrainment to ethnomusicology. Their chronometric analyses on music performance from different cultures showed the influence of cultural and personal factors on entrainment.
In terms of rhythm processing, a study by Hung (2011) explored whether acoustic rhythms are processed differently depending on type of the sound source. For the past few decades, cognitive studies have consistently reported that vocal sounds are processed differently from non-vocal ones. Neuroimaging studies have observed different activations in response to human vocal sounds in the superior temporal sulcus and the superior temporal gyrus than to non-vocal sounds (Belin, Zatorre, Lafaille, Ahad, & Pike, 2000;
Belin, Zatorre, & Ahad, 2002; Belin & Zatorre, 2003). A magnetoencephalographic study
25 by Gunji et al. (2001) showed that greater source strength of a sustained field is elicited by the vocal stimuli around the Heschl’s gyri. In line with those studies, Lee, Peelle, Kraemer,
Lloyd, & Granger (2015) reported that the brain activity pattern for a human voice is distinguishable from that for non-vocal sounds in the superior temporal gyrus. Levy and his colleagues suggested neurophysiological evidence for differential processing between vocal and non-vocal sounds. They identified a voice specific response, a positive component with a latency around 320 ms only for voice (Levy, Granot, & Bentin, 2001,
2003). Expanding voice sensitivity and specificity into a temporal domain, Hung (2011) demonstrated differential processing between vocal versus instrumental rhythms functionally and behaviorally. Extending her finding to a memory domain, Klyn, Will,
Cheong, & Allen (2015) showed behavioral evidence for differential memorization between vocal and instrumental rhythms.
In sum, rhythm perception involves not only mechanisms for all psychological buildings of time but also biological (i.e., resonance frequency of bodily movements), and cultural factors (e.g., Clayton et al., 2005) In terms of acoustic rhythm perception, it interacts with the source of sounds.
Conclusion
Our experience of time emerges from the way we experience the world that is ever- changing. In other words, time arises from our interaction with the world that constantly changes. The caveman experiment shows how disconnection from the external world confuses not only the body but also the mind. In order to understand how our mind
26 constructs time, I examined various psychological building blocks of time including 1) event detection, 2) perception of temporal order: simultaneity vs. succession, 3) duration perception, 4) duration estimation, 5) the psychological present, and 6) rhythm perception.
As mentioned briefly, I investigated event detection and perception of temporal order with my “Hear Your Touch” project and studied empirically the effect of different types of musical training on these two building blocks with behavioral experiments (see chapter 6).
This discussion suggests that both perceptible changes of environments in the range of the psychological present and our action and rhythmic bodily movements play a role in our experience of time. My examination of these psychological building blocks of time also demonstrates that temporal experience relies on multiple cognitive processes of perception, attention, memory, etc., that are intertwined in a complex way.
27
Chapter 3. Space and music-making bodies
Space plays a role in all our behavior. We live in it, move through it, explore it, defend it. We find it easy enough to point to bits of it: the room, the mantle of the heavens, the gap between two fingers, the place left behind when the piano finally gets moved.
From O’Keefe & Nadel’s Hippocampus as a Cognitive Map (1978, p. 5)
I sit in the arm chair in the living room and place my MacBook on my lap. I see my dog, Miss Daisy, enjoying a sunbath at her favorite spot of the front patio. I hear the siren of a firetruck passing by. I smell a fresh brownie on a plate. I touch and grab a coffee mug on the side table. Vision lets me know where Daisy is and how close my laptop is to the mug. Olfaction informs me of the existence of coffee and the brownie around me. Sweet smells from the coffee and brownie make me localize where they are and then lead me to reach for them. So, I enjoy them without spilling coffee over the laptop. As shown in this ordinary scene, integrated sensory information establishes space. Inputs retrieved via different sensory modalities guides us what to do next. Knowing not only where things are with regard to the body but also how to interact with an environment gives rise to a meaningful perception of space.
Although we conceive of space as a homogenous phenomenon, it emerges from multiple sensory data. In other words, a unitary mental representation of space is constructed by inputs from all sense organs. However, it is important to note that there are
28 significant differences in roles among the sensory modalities in terms of space. Compared to other senses, the somatosensory system plays a distinctive role when it comes to space pertaining to the body. Other sensory modalities give spatial information outside body whereas the somatosensory system directly relates to the space of the body. It is not only because proprioception, a part of the somatosensory system, gives us the sense of the position of the body and its parts but also because touch, another part of the system, requires direct physical contacts with objects in the environment. Skin, as the only sensory organ entirely exposed to the environment, collects information from the surface of the body and proprioception. Writing “active touch refers to what is ordinarily called touching. This ought to be distinguished from passive touch, or being touched.... Active touch is an exploratory rather than a merely receptive sense” (p. 447), Gibson (1962) made interesting points about active touch. That is, direct contact to the surrounding via the somatosensory system has a close tie with movement.
In the following section, I will first introduce two types of space pertaining to the body, that is, body space and peripersonal space. Each space will be examined with its components. Firstly, body space consists of two schemata: postural and surface-related (i.e., superficial) schemata. In terms of perception, the schemata are associated with proprioception and touch respectively. In the history of research about space pertaining to the body, body schema has been used interchangeably with body image. I make a distinction between the two terms, ‘body schema’ and ‘body image’. Secondly, peripersonal space, that is, space immediately surrounds body, is characterized by 1) multisensory integration, 2) body-part centered specificity, 3) sensorimotor coupling, and
29
4) plasticity. Thirdly, I briefly examine two different processes of spatial information: sensorimotor and representational modes. Finally, I will apply my discussion about the space pertaining to the body of singing vs. instrument playing in order to introduce two types of embodied spaces in music-making.
Body space
In terms of space, the body is a special object. Similar to our temporal experience, we perceive the body on the basis of sensory inputs collected internally and externally, which contributes to an establishment of space. One move’s one’s body, which changes the space outside body (i.e., external space). Although the perception of the body remains often un- or subconscious; sometimes, some practices like playing music, dancing or doing yoga let us perceive our body consciously. For example, I have been practicing yoga for years. I have learned the grudasana (i.e., the eagle pose) for which I put my left leg on top of the right while placing my right arm on top of the left, or vice versa. When I learned the eagle pose the first time, my standing leg was wobbling but now I can take this pose quite firmly. It was interesting for me to see how my everyday practice consolidates the memory of the pose. In other words, training can alter the awareness of the bodily experience of space to be un- or subconscious. This shows the complexity of our experience of space because it is associated with many psychological processes (e.g., perception, consciousness, memory, etc.) and other contributing factors (e.g., training, culture, etc.).
The perception of the space of the internal body also relies on the somatosensory system. When feeling sick, for example, we can tell whether a stabbing-like pain comes
30 from the head or stomach, although internal localization of pain is not as precise as that of touch. In spite of the spatial components of the sub-modalities of our somatosensory system
(e.g., temperature & pain), as Berlucchi & Aglioti (2010) noted, touch and proprioception have been the main concerns in the research field of the body and space. I will follow this tradition and concentrate on touch and proprioception. The reason for this is that one of the goals of this dissertation is to have a better understanding of space pertaining to the music- making body. Spaces emerging from singing and instrument playing bodies are different and music-making seems to be primarily associated with touch and proprioception, although pain may accompany the mastery of certain musical instruments.1 Other sub- modalities of the somatosensory system may be less relevant than proprioception and touch in terms of music-related behaviors. In terms of body space, the first constituent of space pertaining to the body, I would like to introduce the concept of body schema, consisting of a postural schema and a surface-related (i.e., superficial) schema which are primarily associated with proprioception and touch respectively. I will discuss differences between body schema and body image in detail, which would minimize the confusion between these two terms.
1 While writing this dissertation, I have been taking djembe lessons from Mr. Balla Sy, a director of GOREE Drum and Dance company. Playing a djembe is associated with pain. It causes my hands to bleed due to cuts. When thick calluses are established, there is no longer pain (only fun!) and hands become insensitive to touch. I found that my arm movement was automatized. Once the sequence of arm movement is internalized, I controlled my performance only with acoustic information (e.g., bass, slap, & tone). In other words, my playing is not based on touch. 31
Postural schema
Philosophers and scientists have been intrigued by the question of how the body is represented in the mind. Early studies suggested the term ‘coenesthesia’ for a sense of body arisen from deep sensory impressions of the viscera, muscles, joints, and skin (e.g.,
Brissaud 1895; Deny & Camus, 1905, cited in Maravita, 2006). In other words, coenesthesia designates bodily sensation. The French neurologist Bonnier (1893, cited in
Maravita, 2006) proposed the term ‘schema’ to indicate some spatial qualities in addition to the bodily sensation. Bonnier’s (1893) schema involves the spatial orientation and localization of the body with regard to external objects. However, the term ‘body schema’ became famous with the work by Head & Holmes (1911-2):
For this combined standard, against which all subsequent changes of posture are measured before they enter consciousness, we proposed the word “schema.” By means of perpetual alternations in position we are always building up the postural model of ourselves which constantly changes. Every new posture or movement is recorded on this plastic schema, and the activity of the cortex brings every fresh group of sensations evoked by altered posture into relation with it. Immediate postural recognition follows as soon as the relation is complete. (Head & Holmes, 1911-2, p. 187)
Head & Holmes (1911-2) described several important aspects of the postural schema, one component of body schema. First, the postural schema does not involve consciousness. Rather, it is associated with sub- or un-conscious awareness of body postures and movements. For some researchers (e.g., Gallager), an involvement of consciousness in bodily sensation is a criterion of body schema vs. body image distinction.
This will be discussed in detail below. Second, the postural schema is not static but plastic.
It is because each body posture changes through body movements over time and movements successively lead to a new posture in a continuous manner. Seemingly static
32 yoga postures also involve the continuous changes due to minuscule body movements.
When practicing the grudasana, a yogi takes this posture for several minutes. Since a practitioner stands with one foot, keeping balance is important. A yogi should constantly adjust his or her body not to fall down. Especially, the standing foot makes incessant micro- movements to control balance.
Additionally, Head & Holmes (1911-2) implied that the postural schema is associated with both perception and movement, which reminds me of Gibson’s (1962) active touch. The perception aspect of the postural schema is similar to the proprioceptive system that receives inputs from mechanoreceptors in joints, muscles and tendons.
Although proprioceptive inputs provide an estimate about the relative relationship between body parts, it is insufficient to localize body parts precisely with respect to external space
(Longo, Azañón, & Haggard, 2010; Dijkerman, 2017). With regards to the motor component of the postural schema, different configurations of body parts change not only body space but also the space outside the body. Only when both motor and proprioceptive components work together, the postural schema fully functions.
Superficial schema
Head & Holmes (1911-2) observed that their patient Hn (case 14) could not report his hand position but was successfully able to localize the stimulated spots on the surface of his body. On the basis of this observation, they argued that there is another schema besides the postural schema and proposed that this additional schema involves the processing of the localization of the stimulated spots on surface of the body. Later, some
33 researchers called this surface-related schema the superficial schema. According to Paillard
(1999), the superficial schema is a central mapping of somatotopic information from the tactile inputs, which suggests that the superficial schema is associated primarily with touch.
Probably, the superficial schema relates to somatotopy because the somatotopic map refers to spatial patterns in the functional organization of neuronal responses in the somatosensory cortex (Wilson & Moore, 2015). It is important to note that the organization of somatotopy is not proportional to the physical size of body parts (e.g., cortical homunculus). It rather reflects the size of the tactile receptive field (RF). In the primary somatosensory cortex, for instance, a hand is represented larger than a torso because the size of RF of a hand is bigger than that of a torso. Longo et al. (2010) proposed the term
‘superficial map’, where they emphasized a bi-directional localization, in bottom-up and top-down fashions, between a neural representation of body surface and actual surface of the body.
As mentioned previously, Gibson (1962) distinguished passive touch from active touch. The former is known as tactile perception whereas the latter is called haptic, exploratory, and dynamic perception. Promoting an ecological psychology, Gibson (1966) wrote, “active exploratory touch permits both the grasping of an object and a grasp of its meaning” (p.123). The haptic touch is associated not only with perception of physical properties of an object in environment but also with awareness of its ecological meaning.
Tangible properties of an object cannot be grasped by other sensory systems. For example, both vision and touch allow us to perceive geometrical features of an object (e.g., shape, dimensions and proportions, etc.) but touch gives us more precise information about its
34 surface texture (e.g., rough vs. soft), and material features (e.g., heavy vs. light). Through these tangible properties, we can obtain a holistic understanding of an object. With this, we can interact with the environment. Therefore, the haptic touch encompasses tactile perception, motor capacity, and cognition. Developing Gibson’s idea, Aho (2016) discussed the haptic exploration in the context of music-making. In his book The Tangible in Music, Aho (2016) pointed out the importance of tactile perception arisen from bodily movements in the performance of traditional Finnish instruments, Kantele.
In sum, I have discussed body schema in terms of body space. Two components of body schema, the postural and superficial schemata, are primarily associated with proprioception and touch respectively. Although sensory disturbance studies (e.g., patient
Hn case) showed an independence of the postural and superficial schemata, the body schema relies on proprioception, touch, the interaction between two systems, and other systems (e.g., motor system) as well.
Body schema vs. Body image
In the literature of body research, the term ‘body schema’ has often been used interchangeably with another term, namely, ‘body image’. This has led to confusions in various disciplines including neurology, psychology, phenomenology, etc., and production of a series of new terms. According to Gallagher (1986), Fisher, the inventor of the Fisher
Body Distortion Questionnaire, for instance, used the terms of body concept, body scheme, body perception, and body image interchangeably. Body image or body percept, for Gibson, refers to physical poses that are equivalent to the postural schema (Gallagher, 1986). The
35 disagreement about the definitions of the terms and multiple associative terms has become an obstacle of development of study on body.
Following Head & Holmes (1911-2), several researchers have endeavored to make a distinction between body schema and body image depending on involvements of consciousness (Gallagher, 1986, 2005; Paillard, 1999; Longo et al., 2010). Paillard (1999) succinctly defined that body image is an internal representation in the conscious experience of visual, tactile, and motor information of corporeal origin. For Gallagher (1986), body schema is never fully represented in consciousness or conceptualized because the dynamics of the body organize its own spatiality within its surroundings. According to Gallagher
(1986), body image is more complex than body schema because it requires three different components: perception, cognition, and emotion. For Gallagher’s (1986) idea of body image, the body needs to be consciously perceived, conceptually constructed, and related to emotional attitudes and feelings. According to Berlucchi & Aglioti (2010) accepting
Gallagher’s view, when any of the three components malfunctions, it will produce psychological problems such as anorexia nervosa, an eating disorder caused by disturbances of one’s body image.
Interestingly, Gallagher (1986, 2005) noted an interaction between body schema and body image. As discussed earlier, training plays a role in an alternation between body schema and body image. Let us imagine that you are a yogi and practice the ardha chandrasana. It is known as the half-moon pose for which body limbs are radiated in all different directions. If you are a novice yogi, you should check your limb positions while projecting your arms, legs, and head not to fall down. You may look at a mirror or compare
36 your pose with your instructor’s. In this checking process, you may notice that your leg in the air is not in parallel to the ground. This requires an involvement of your consciousness, which means body image. When you master the ardha chandrasana, your body knows where to position your limbs automatically. At this stage, only your postural schema is active because you can accomplish the pose without conscious reflection.
In sum, there are many terms associated with body schema. Body schema and body image can be distinguished by the involvement of consciousness. Given its un- or sub conscious status, body schema is strongly associated with postural schema that also closely works with the motor system. Space pertaining to the body emerges from continuous changes of postural schema. Haptic touch has a strong a tie with perception and action.
Training can alter the body image to body schema.
Peripersonal space
Space outside of the body is not homogenous. External space can be at least divided into two: peripersonal space vs. extrapersonal space. Some researchers (e.g., Lourenco,
Longo, & Pathman, 2011) use a near vs. far space distinction for this heterogeneous external space. One criterion for this distinction in the field of space research is whether hands can either reach or grasp objects. Peripersonal space is associated with an egocentric frame of reference (e.g., in relation to the body) whereas extrapersonal space is characterized by the use of an allocentric frame of reference (e.g., relations between other objects or events) (de Vignemont & Iannetti, 2015). In this dissertation, I will focus
37 peripersonal space because human music-making involves primarily control of and action in body space and peripersonal space.
The earliest notice of the special zone around the body was made in 1955 by the biologist and the director of Zurich zoo, Heini Hediger. He observed not only that an animal escapes when its enemy or predator approaches within a certain distance, but also that it is not just a detection of a potential danger but a certain distance from an animal’s body leads the animal to escape. Hediger formulated this as an escape distance or a flight distance that corresponds to a margin of safety around the body. Cléry, Guipponi, Wardak, & Hamed
(2015) and de Vignemont & Iannetti (2015) summarized that peripersonal space subserves two functions: 1) body protection and 2) goal directed action. Hediger’s observation on escape distance shows defensive function of peripersonal space. Therefore, it implies body protection function according to Cléry et al. (2015) and de Vignemont & Iannetti (2015).
When it comes to hand-centered peripersonal space, examples of goal directed actions include grasping an object, playing a musical instrument, etc.
The two distinctive functions of peripersonal space are associated with different neural representations. In their review, Cléry et al. (2015) discussed the two distinct parieto-premotor networks of peripersonal space. A network subserves body protection function by encoding a safety boundary around the body and contains area 7b (i.e., a subregion of the inferior parietal lobule), the anterior part of the intraparietal sulcus (AIP), and area F5 (i.e., the rostral part of ventral premotor cortex (PMVc)) of monkey brain.
Thus, this space is known as protective or defensive space. In contrast, a network for goal- directed actions consists of the ventral section of the intraparietal sulcus (VIP) and the F4
38 area (i.e., the caudal part of ventral premotor cortex (PMVc)). This network is important for functions of reaching space, grasping space, working space, action space, etc. Fig. 2 visually presents how the networks are involved in different functions in a monkey brain.
Figure 2. Two parieto-premotor peripersonal space (PPS) networks on a monkey brain template: AIP= anterior part of the intraparietal sulcus; VIP= ventral section of the intraparietal sulcus
The term ‘peripersonal’ originates from a series of electrophysiological studies by
Rizzolatti, Scandolara, Matelli, & Gentilucci (1981) who discovered the existence of bimodal neurons responding to both tactile and visual stimuli in the arcuate sulcus of a macaque monkey. They not only called these multimodal neurons peripersonal but also reported that the neurons are activated by stimuli in the space within the animal’s reaching distance.
As implied in Hediger (1955) and Rizzolatti et al. (1981)’s observation, peripersonal space is the space that pertains to animal’s body. In other words, peripersonal space, like body space, uses an egocentric frame of reference. Incorporating existing 39 studies, Coello, Bourgeois, & Iachini (2012) defined peripersonal space the most comprehensively:
Peripersonal space contains the objects with which one can interact in the here and now, specifies our private area during social interactions and encompasses the obstacles or dangers to which the organism must pay attention in order to preserve its integrity.
Researchers come to the conclusion that the features of peripersonal space
(henceforth PPS) include 1) multisensory integration, 2) body-part centered specificity, 3) sensorimotor coupling and 4) plasticity, (Brozzoli, Makin, Cardinali, Holmes, & Farnè,
2012; Clèry, et al., 2015; de Vignemont & Iannetti, 2015). In the following section, I will examine these features of PPS.
Multisensory integration
Since the seminal work of Rizzolatti et al. (1981), different disciplines including electrophysiology, psychology, and neuropsychology have presented converging evidence for an important role of multisensory integration in the representation of PPS (for review, see Maravita, Spence, & Driver, 2003; Brozzoli, et al., 2012). A perception of events and objects within PPS triggers multisensory integration (de Vignemont & Iannetti, 2015). In other words, PPS is characterized by a high degree of integration of sensory inputs retrieved from different modalities. Multimodal processing involves both subcortical and cortical components. Subcortically, the superior colliculus (SC) plays a role as the mediating station of visual, auditory, and somatosensory inputs in the early stage processing
(Maravita et al., 2003; Stein & Standford, 2008; Stein, Standford, & Rowland, 2009; Stein, 40
Standford, & Rowland, 2014). In terms of the laminar structure of the SC, Stein and his colleagues discussed that the superficial layers (I to III) receive unisensory inputs while the deep layers (IV-VI) are associated with multimodal information. Brozzoli et al., (2012) argued that the putamen, rather than the SC, is relevant for processing multisensory events in PPS. As alluded to by the two PPS networks, cortical components responsible for multimodal integration include sub-regions of the parietal and premotor cortices. This will be discussed in detail in the sensorimotor coupling section.
Body-part centered specificity
About a decade prior to the study of Rizzolatti et al. (1981), a neurophysiological single cell study by Hyvärinen & Pranen (1974) reported that some neurons in the parietal area 7 of an awake monkey responds to both tactile stimuli delivered from a specific body part and visual stimuli presented close to the same body part (for review, see Cardinali,
Brozzoli, & Farnáe, 2009; Dijkerman, 2017). Since then, researchers not only found that multisensory neurons have tactile receptive fields (RFs) centered on specific body parts but also reported that the tactile RFs overlap with visual and/or auditory RFs. Rizzolatti,
Fadiga, Fogassi, & Gallese (1997) noted that the visual RFs work with the tactile RFs for coding body-parts coordinates. They pointed out that body movements not only affect both the visual and tactile RFs but also establish our experiential PPS.
A reference frame refers to the center of a coordinate system to represent locations of events or objects (Cohen & Anderson, 2002). Although there are disagreements about how many body-part-centered reference frames exist and which body part functions as a
41 common reference frame, it has been accepted that a peripersonal representation consists of multiple body-part specific reference frames (Holmes & Spence, 2004). Cohen &
Anderson (2002) noted four different reference frames: 1) body-centered, 2) eye-centered,
3) head-centered, and 4) limb-centered. Considering the eye-centered reference frame as the common system, Cohen & Anderson (2002) proposed that the posterior parietal cortex is the brain area where the body-part reference frames transform into the common reference frame. Pointing out that the brain constructs multiple and modifiable representations of space centered on different body parts, di Pellegrino & Làdavas (2015), however, proposed only three reference systems including 1) head-centered, 2) hand-centered, and 3) trunk- centered reference frames. They are known as perihead, perihand, and peritrunk respectively. di Pellegrino & Làdava’s (2015) proposal is interesting in regard with two different modes of music-making. Although both singing and playing an instrument need to control the whole body, which involves perihead and peritrunk spaces, compared to singing, playing an instrument strongly relies on perihand space because we use hands to play musical instruments. Therefore, it would be possible that two types of music training lead to different PPS representation depending on a specific body part.
At any rate, a behavior study by Serino et al., (2015) provides a better understanding of body-part centered reference frames and a common system of PPS. In order to measure the scope of PPS, Serino and his colleagues developed an audio-tactile interaction paradigm. The main assumption of this paradigm is that PPS is multisensorially constructed.
In this paradigm, a proxy of PPS is where sounds enhance tactile processing. Participants are asked to respond as fast as possible to a tactile stimulus while task-irrelevant
42 approaching or receding auditory stimuli are presented to their bodies. After a sound with different temporal delays, the target touch stimulus is delivered. The authors found that the size of PPS varies depending on which body part is stimulated. Specifically, trunk stimulation produces the largest volume of PPS while hand stimulation has the smallest
PPS. The task-irrelevant approaching auditory stimuli to the participants modulate tactile processing for head and trunk (e.g., the multisensory facilitation effect) whereas the receding auditory stimuli alter only hand tactile processing, which implies that different body-centered spaces respond differently to moving objects within PPS. Thirdly, perihead, perihand, and peritrunk spaces are not completely independent from each other. Rather, these spaces interact in a specific way. Serino et al. (2015) noted that perihead space is based on visual RFs that are anchored to the head thus the position of eyes and head direction are important in perihead space. The authors argued that an interaction of perihead and perihand spaces (i.e., an arm-anchored visual RFs) is associated with the computation of the arm position relative to both eyes and head. Lastly, the findings imply that the trunk, not the eyes, plays a role as the common reference frame because the size of peritrunk space is constant compared to that of perihand and perihead spaces, which vary in accordance to relative positioning and stimuli congruency. Further, Serino et al. (2015) argued that perihand space collapses peritrunk space when the hands are placed close to the trunk. Interestingly, Brozzoli et al. (2012) mentioned that near vs. far upper limb movements are represented separately in F4 and F5, the two sub areas of the premotor area respectively. This implies that F4 is probably associated with peritrunk space while F5 may involve perihand space independently. Given the stability and constancy of trunk-centered
43
PPS, Serino et al. (2015) concluded that peritrunk space comprises a whole-body reference frame relative to which a global egocentric representation of space is formed.
Sensorimotor coupling
Sensorimotor coupling links to the previously discussed two features of PPS. First, in regard to multisensory integration, Rizzolatti et al. (1981) pointed out that multimodal neurons are involved in organizing sequences of movements. Second, the body-part- centered coordinates play an important role in the sensory guidance of motor behavior in terms of the interaction with objects near the body.
Sensorimotor coupling refers to the fact that perceived objects or events can be represented in terms of possible actions. Objects or events in PPS guide body movements.
Paillard (1987, 1991) proposed two modes of spatial processing. He argued that, in a sensorimotor mode, spatial information is neurally encoded and derived from body movements in space. While making a connection between sensorimotor coupling and body-part-centered specificity, Cohen & Anderson (2002) argued that sensorimotor movements require neural computations for transformation between body-part-centered reference frames and a common reference frame in order to guide movements.
Neurophysiologically, di Pellegrino & Làdavas (2015) explained that the multimodal neurons in the putamen, and cortical components including the ventral section of intraparietal sulcus (VIP) and the macaque inferior area 6 have both multisensory and motor functions. Additionally, mirror neurons, a special class of motor neurons with visual properties, exist in area F5 (see fig. 2). According to a summary of peripersonal space by
44
Brozzoli, et al. (2012), a subset of the mirror neurons is particularly associated with the execution of a specific motor movement (e.g., precise grasping) and the same action of other bodies after observation. This implies that mirror neurons play an important role in the parieto-premotor network subserving for goal-directed action (see the blue network in fig. 2).
Plasticity
The boundary of PPS is plastic, flexible and dynamic. Hediger’s (1955) flight distance, for example, is to let an animal run away from a potential threatening danger.
However, this is not true for my dog, Daisy. She has a flight distance of zero towards cars that, for wild animals, are considered as a potential danger. Squirrels on campus climb up on a tree even when a car approaches them within about five meters. It seems that Daisy’s previous experience with cars modifies her flight distance to a car. This raises the question of what factors can modulate PPS. de Vignemont & Iannetti (2015) argued that 1) emotions and 2) tool use have modulatory effects on the PPS representation.
Concerning the effect of emotion on space, claustrophobia is a persistent and irrational fear of enclosed places or of being confined (APA Dictionary of Psychology,
2015). Patients with claustrophobia report panic symptoms (e.g., feelings of suffocation, sweating, fears of losing control, etc). Correlating claustrophobic fear to the size of near space in terms of the defensive function, Lourenco, et al. (2011) argued that claustrophobia distorts the representation of PPS. Behaviorally, they found that a relatively large near space correlates with an elevated anxiety in normal healthy people. The author further
45 argued that an enlarged PPS activates easily the defense mechanism, which causes anxiety of enclosed spaces, that is, claustrophobic emotion. Coello et al., (2012) investigated how dangerous objects affect the size of PPS. In their behavioral experiment, the participants rated whether objects (e.g., a knife, scissors, etc.) were dangerous depending on whether the objects are pointing away from or towards the participants. They found that the size of
PPS reduces when participants perceive an object as a threatening one. With regard to the study of Lourenco et al. (2011), the results can be interpreted to indicate that the perception of danger may cause anxiety, which subsequently leads to a shrinkage of the PPS.
In terms of two functions of PPS, both Lourenco et al. (2011) and Coello et al.
(2012) explored the relation between anxiety and a defensive space. Graydon, Linkenauger,
Teachman, & Proffitt (2012) investigated the effect of anxiety on a goal-directed PPS.
Investigating whether anxiety alters the perception of action capability (e.g., reaching, grasping, passing hands through holes, etc.) in a near space, Graydon et al. (2012) showed that the participants experiencing anxiety made a more conservative judgment on their action capability compared to the control, which reduces the size of the working PPS.
In addition to the modulative effect of emotions (e.g., anxiety) on the size of PPS, tool use is also a well-studied factor contributing to PPS plasticity. While anxiety is primarily associated with the reduction of the space, tool uses allow an animal to incorporate extrapersonal space in to its PPS. In other words, tool use stretches the size of near space. Neurophysiological studies suggest that the brain considers tools as extended body parts by activating neural networks associated with the putative body schema and consequently changing the PPS (Maravita & Iriki, 2004). In the first study reporting the
46 plasticity of the PPS due to tool use, Iriki, Tanaka, & Iwamura (1996) found that a monkey’s manipulation of a long rack with the hand in order to acquire a distant food rewards modulates the PPS representation. In other words, the rack extends the animal’s reaching distance by assimilating the rack with the body. Activation of bimodal neurons has been observed in the medial anterior intraparietal sulcus and in the post-central gyrus
(Iriki et al., 1996; Maravita & Iriki, 2004). Using a positiron emission tomography (PET),
Obayashi et al. (2001) investigated which areas of monkey’s brain would be activated after rake use training for several weeks. They found that tool use is associated with neural activities in the intraparietal region, the basal ganglia, the presupplementary motor area
(especially F4), the premotor cortex (especially F5), and the cerebellum.
In their review on the relationship between training and tool use in terms of PPS,
Brown & Goodale (2013) proposed a motor knowledge hypothesis. They argued that motor knowledge acquired through training plays a significant role in the appearance of near-tool effect that is associated with the adapted representation of PPS around the tool. The acquisition of motor knowledge requires to connect the planned motor movements and their corresponding sensory consequences after the execution. In other words, motor knowledge is established through a reliable predictive relationship between motor input and sensory output. Similarly, motor learning for tool manipulation involves the reinforcement of the predictive ability based on sensorimotor coupling when animals use tools. Motor knowledge of tool use allows its user to predict the spatial location of the tools as it is moved by linking limbs, hands and the tool position. According to Brown & Goodale
(2013), an animal with unfamiliar tools will not show altered spatiotemporal processing
47 because its sensorimotor system cannot predict the relationship between the tool’s action controlled by the animal and the sensory signals produced by the moving tool.
In sum, PPS plays an important role in body protection and goal-directed action that involve different neural networks. PPS is characterized by four features that are inseparable from each other as well as the two schemata of body space. First, multisensory integration has been identified with multimodal neurons. Second, body-part-centered specificity shows multiple and modifiable representations of PPS centered on different body parts. This feature is associated with postural schema and implies how body space interacts with PPS. Third, multisensorially perceived objects or events in PPS guide action or movements. Last, anxiety and tool use can modulate PPS. The last two features relate to the discussion about haptic touch.
Two modes of spatial processing: sensorimotor vs. representational
As mentioned previously, Paillard (1987, 1991) proposed that there exist both a sensorimotor and a representational mode of spatial processing. Although these two modes coexist, they generate and store their own mapping of space. The sensorimotor mode concerns the direct dialogue between an animal and the physical world which is attuned by its sensorimotor apparatus. Thus, the sensorimotor mode contributes to the continuous updating of a body-centered mapping of external space where things are located and to which actions are guided. Similar to Gibson’s active touch, Paillard (1987, 1991) noted that perceived sensory information directs body movement in space and coined the term
‘action space’.
48
In contrast, the representational mode derives from neural activities which explore and consult an internal representation of the physical environment that is embodied in memory stores. This mode is associated with a mental representation of local maps (e.g., the 18th Avenue library is on the OSU central campus), spatial relationships of routes relative to landmarks (e.g., the Arps garage from the library), relative position between objects (e.g., the Arps garage entrance and the exits) and the position of the body itself in relation to its stationary environmental frame (e.g., which floor I am in the garage). The representational mode is similar to what is known as a cognitive map.
In their phenomenal book, The Hippocampus as a Cognitive Map, O’Keefe &
Nadel (1978) argued that the psychological space is produced intrinsically and operated by the mind. The psychological space includes 1) a particular set of sensations transduced by a specialized spatial sense organ, 2) organized sensory arrays which derive their structure from the nature of peripheral receptors, 3) organizing principles that impose unified perceptions upon otherwise diverse sensory inputs, 4) abstractions from sensations, and 5) the concepts that the mind builds on the basis of reflections on experience. The first three components of the psychological space of O’Keefe & Nadel seem to correspond to the sensorimotor mode by Paillard (1987, 1991). The significance difference between the two studies is that the psychological space by O’Keefe & Nadel (1978) focused only on the perception of sensory inputs, thus missing the motor component, whereas Paillard (1987,
1991) emphasized action space that is guided by perception. Paillard’s (1987, 1991) representational mode seems to be associated with the remaining last two elements of the psychological space by O’Keefe & Nadel (1978).
49
Additionally, the different modes of spatial processing seem to be parallel to two modes of temporal processing. As discussed in chapter 2, many researchers of time (e.g.,
James, etc.) have distinguished perceived time from remembered time. Time perception is different from representation of time (Friedman, 2000). For the sensorimotor mode of spatial processing and time perception, both space and time emerge from sensory inputs and body movements. For the representation mode of spatial processing and representation of time, space and time heavily rely on patterns that are acquired from previous experience, namely, memory.
Embodied spaces in music-making bodies
With regard to music performance, one can distinguish two types of music-making:
1) singing and 2) playing an instrument. In the previous chapter, I briefly discussed the involvement of differential processing not only for vocal vs. non-vocal sounds but also for vocal vs. non-vocal rhythms. The distinction between singing and instrument playing offers us a better understanding of the origins of music. This and the different evolutionary paths of singing and playing an instrument will be discussed in chapter 5.
This chapter proposes a new perspective on the differences between singing and instrument playing in terms of space pertaining to the body. From ethnomusicological perspectives, Baily & Driver (1992) developed their argument for spatio-motor thinking on the basis of a connection between musical creativity and the way we play an instrument.
The idea behind their spatio-motor thinking is that the manner of music-making contributes to the representation of the body in a distinctive way and transform our spatial experience
50 accordingly to the instruments, which further influence musical styles. Aho (2016) proposed that tangibility may play an important role in instrumental music performance, which this implies different transformative powers of different types of music-making. He wrote:
the instrument can even seem to evolve into an organic extension of the body. In fact, a basic feature distinguishing playing an instrument from singing, the oldest and the most basic means of producing music, is the way a musical instrument provides the player with a means of transcending the limits of the physiological body, producing a sound that may be ultimately may be enough to fill a sport stadium. (Aho, 2016, p.4)
Would singing and instrument playing lead to different spatial experiences? If then, how? I limit the application of the postural and superficial schema to perceptual consideration, in spite of an importance of motor component in body space and peripersonal space. This is for a purely practical reason, in order to see clearly the differences in spaces in the two performing bodies. In terms of postural schema of the singing body, we un- or subconsciously control our vocal organs to produce vocal sounds.
Signing does not make use of the superficial schema of hands, which seem to play an important role in active touch. Singing may lead to a vibrating surface of the body but the vibration is limited to the torso and probably the head. Therefore, the singing body may have a different peritrunk space compared to the playing an instrument body. In contrast, the instrument playing body involves all of the components of both body space and peripersonal space. Additionally, it requires both postural and superficial schemata of body space due to the important role of direct tactile contact with, and haptic exploration of a musical instrument. For the instrument-playing body, multiple sensory information, predominantly from auditory and tactile inputs, is constantly integrated. Although visual 51 inputs play a role in music-making in general, I would not discuss visual system and its interaction with other sensory systems because my main research interest here is involved in the potential differences in spatial processing between the singing and the instrument playing body. And another most remarkable of differences between them, in addition to audio-tactile integration, is perihand space where I cannot think of any role of the visual system. In instrument playing, hands play an important role and the perihand space can be expected to occupy a prominent position in the body-part centered specificity of the peripersonal space. The coupled audio and tactile inputs at perihand space imply an interaction between the perihead and perihand spaces. In terms of sensorimotor coupling, a playing-an-instrument body interacts with a musical instrument in an action-perception feedback loop. Musical instruments, the most special tools that humans have ever invented, seem to alter the peripersonal space around our limbs and music instruments. As Brown &
Goodale (2013) pointed out, playing an instrument involves motor knowledge, specifically for limb movement, that is combined with specific spatial information near and on an instrument. Furthermore, some instruments are equipped with tools (e.g., drum stick, string bow) that extend perihand space (see Table 1).
52
Music-making bodies Singing body Playing-an-instrument body
Spaces pertaining to the body and components
Postural schema ✓ ✓ Body Superfical schema ? ✓
Multisensory ✓ integration (audio-tactile) Body part ? ✓ specificity (peritrunk, perihead) (perihand) Peripersonal Sensorimotor coupling ✓
Plasticity & near- ✓ tool effect (e.g., drum stick, string bow, etc.)
Table 1. Comparison of spaces pertaining to the singing vs. playing-an-instrument bodies
As some music researchers (e.g., Schäfer, Fachner, & Smukalla, 2013) noted, although space plays an important role in music-making, space has been largely ignored in music research. This is probably because music has been considered as an art of time so space has not been a primary interest in music research. Therefore, space has not been defined well in music research and the use of space is different depending on researchers’ arguments (e.g. pitch class space, etc). Early studies on space in music research do not give us a clear idea of how to study space in music-making bodies. At various conferences, I
53 proposed that we should look at space pertaining to the music-making bodies. However, some researchers I encountered at these conferences preferred to see this in a different way, depending on their interests. For instance, I met a composer who runs an electronic laptop orchestra project where people collaboratively improvise through remote access. For him, virtual space is the main concern. His understanding of space (i.e., virtual space) is different from space I propose in this chapter. Space pertaining to the body is neither abstracted nor virtual, but it emerges from our perception of and action on the world surrounding the body.
I hope my proposal here contributes future studies on space in music-making.
54
Chapter 4. The origins of time and space concepts
One of the main arguments of the previous chapters is that time and space emerge from integrated sensory information that is retrieved from an interaction with environments.
In other words, time and space are shaped by human experience. Our temporal and spatial experiences primarily rely on our perception of and action to the world. On the basis of our perception and action, we cognitively construct and/or reconstruct time and space, that is, the world, in a meaningful way. If we agree that time and space are cognitive constructs, then they should be discussed in terms of biology, experience and environment, that is, culture (Will, 2017). In chapters 2 and 3 I approached time and space from a cognitive science perspective, which may cover a portion of biological and experiential aspects of time and space. In order to understand the human experience of time and space holistically, it would be interesting to see how time and space have been seen in different cultures.
However, it is not easy to find cultural differences in an understanding of time and space because these two concepts have changed throughout human history (e.g., the shift from
Newton’s absolute time to Einstein’s relative time in physics in the West). Among many approaches, one possible way to find cultural differences regarding time and space is to look at the ancient writing fragments about cosmology. The ancient texts on the creation of the world tell us about the origins of the time and space concepts with each culture’s 55 own view. Rosen (2004) asserted a necessity of comparative studies on the study of time in early human history in the opening chapter of Time and Temporality in the Ancient
World because it allows us to speculate how ancient individual cultures reconstructed, conceptualized, and formulated time differently. Although there are some comparative studies on time (e.g., Rosen, 2004), there seems to be no equivalent research of the concept of space in antiquity. Therefore, I will discuss how various cultures have understood time and space differently through an examination of a few fragments from ancient Babylonia,
India, China, and Greece as well as relevant commentaries on those texts. My analysis here is piggybacked on scholarly works by the expertise of these cultures’ cosmology and philosophy. To begin with, it is worth noting that the majority of the ancient scripts describe the creation of the world as setting or recovering temporal and/or spatial orders from chaos.
It means that the creation of the world has been considered as that of the ordered world, that is, the cosmos or universe. Ephemerality, eternity, and cyclicity play an important role in an achievement of temporal and/or spatial order, however, each culture put different weights on one of these properties. As temporal and spatial orders are critical in the cosmos, time and space in ancient societies served didactic roles to control human action and behaviors, for example, how to make music in ritual ceremonies. Therefore, I will also look at the role of music described in the ancient cosmological scripts if relating ancient documents are accessible.
56
Babylonia
When skies above were not yet named, Nor earth below pronounced by name, Apsu, the first one, their begetter, And maker Tiamat, who bore them all, Had mixed their waters together, But had not formed pastures, nor discovered reed-beds; When yet no gods were manifest, Nor names pronounced, nor destinies decreed, Then gods were born within them. Lahmu (and) Lahamu emerged, their names pronounced. As soon as they matured, were fully formed, Ansher (and) Kishar were born, surpassing them. They passed the days at length, they added to the years. Anu their first-born son rivalled his forefathers: Anshar made his son Anu like himself, And Anu begot Nudimmud in his likeness.
(Tablet I:1-16, Dalley, 2008, p. 233)
This is the opening of the Babylonian genesis, Enūma Eliš, literally meaning ‘when above’, (Heidel, 1951; Jacobsen, 1957). 2 The Enūma Eliš indicates how the ancient
Babylonians experienced and constructed the world on the basis of their everyday experience. The Enūma Eliš reflects a unique geographical environment of the
Mesopotamian civilization. Mesopotamia had been built on a slit between the Tigris and
Euphrates rivers over thousand years ago (see fig.3). Although the rivers contributed to a formation of fertile marshlands, which played a crucial role in the rise of civilization, sweet water from the two rivers might often blended with sea water from the Persian gulf
(Jacobsen, 1957). It is very likely that the ancient Mesopotamians suffered from the violence and severity of water-related natural disasters, including torrential rains and devastating floods, which might have been almost impossible to control (Whitrow, 2004).
2 It was recorded on seven clay tablets around the second millennium BCE in Akkadian, whose dialects include Assyrian and Babylonian, a Semitic language. 57
Figure 3. Ancient Mesopotamia geography
The epic beautifully recounts the early Babylonians’ experience of the cruelty of the nature. According to the quote above, there was neither heaven nor earth in the beginning of the world. But there existed only chaos made by the sweet water and the salty water. The two waters were deified as Apsu and Tiamat, respectively. The mixed waters brought chaos. They also generated a slit that was represented as the twin gods Lahmu and
Lahamu. Ansher and Kishar derived from the twin gods brought a horizon. The horizon was circumscribed by heaven and earth. Therefore, the god of heaven, Anu, was born from
Ansher and Kishar. The god of earth, Nudimmud, was born from Anu. This genealogy of the gods is suggestive of how significant it was to secure a living space from water disasters for the ancient Babylonians. Accordingly, the ordered space seemed to precede time in the process of the story of the creation of the world. In Babylonia, the ordered world, cosmos,
58 was heavily indebted to Marduk, the son of the earth and was born within the line of the sweet water.3 Marduk has been known as the savior of the world from destructive forces of the salty water (i.e., Tiamat) which brought about chaos and evils to the world. 4
Championing the battle against Tiamat, Marduk acquired a power to control violent nature including flooding, winds, storm, etc. (Jacobsen, 1968). Throughout the history of
Babylonia, Marduk’s creation of cosmos has been extolled on the fourth day of the akītu festival5 where the Enūma Eliš is recited (Robson, 2004):
He [Marduk] fashioned stands for the great gods. As for the stars, he set up constellations corresponding to them. He designated the year and marked out its divisions. Apportioned three starts each to twelve months. When he had made plans of the days of the year, He founded the stands of Neberu to mark out their courses, So that none of them could go wrong or stay. . . . He made the crescent moon appear, entrusted night (to it). And designated it the jewel of the night to mark out the days.
‘Go forth every month without fail in a corona, At the beginning of the month, to glow over the land You shine with horns to mark out six days; On the seventh day the crown is half. The fifteenth day shall always be in the mid-point, the half of each month.
3 And inside Apsu, Marduk was created; Inside pure Apsu, Marduk was born. Ea, his father created him, Damki[na] his mother bore him. (Tablet 1: 81-4, Dalley, 2008, p. 235). Nidmmud’s another name is Ea, the father of Marduk. 4 How did Tiamat become the origin of chaos and evils in the epic? When the children of Apsu and Tiamat, the younger gods, played in her belly, they were so boisterous that Apsu could not quell them. He requested Tiamat to kill the children, which enraged Tiamat. Apsu convened the counsel of gods to kill the children. After the younger gods knew Apsu’s plan, Ea, the god of magic and Marduk’s father, casted a spell that made Apsu sleep. While Apsu was sleeping, Ea took his power and killed him. Marduk played with winds, which disturbed Tiamat who was enraged and promised to avenge her husband’s death by destroying everything. For this, she created demonic monsters. 5 According to an extensive survey on the akītu festival by Bidmead, (2014), the akītu is one of the oldest ceremonies in the ancient Near East. Records show that the early akītu began as an agricultural harvest festival and took place twice a year for the grain harvest and for wheat harvest. The festival developed from two semiannual agricultural celebrations to the one new year celebration. In the first millennium BCE, the festival ended up with twelve days of celebrations involving rituals, prayers, sacrifice, royal processions, recitation of the Enūma Eliš, and prophecies for the upcoming year. 59
When Shamash looks at you from the horizon, Gradually shed your visibility and begin to wane. Always bring the day of disappearance close to the path of Shamash, And on the thirtieth day, the [year] is always equalized, for Shamash is (responsible for) the year.’
(Tablet V:1-7 & 12-22, Dalley, 2008, pp.255-7)
The fifth tablet demonstrates how Marduk set cosmic orders. As delineated above, what Marduk conducted at first was to put the heavenly bodies in a proper position. Given that each star stands for his ancestor gods, Marduk’s first work could be interpreted as placing his ancestral tablets in his family shrine. After setting the spatial order, Marduk operated the stars in a regular motion that gives rise to cyclic time (Robson, 2004). The stars designated twelve months. Next, Marduk created the moon while providing how it ought to change the shape of its body throughout a month within a year. According to the temporal order established by Marduk, the ideal length of the months consisted of thirty days so the year had 360 days in total, which implies a cyclicity of time. However, reality does not correspond to the ideal model of cyclical time set by Marduk. Speculating how the early Mesopotamians conceived time and space with regard to celestial divination,
Brown (2000) argued that there was a significant development in the measuring of celestial movements by way of temporal and spatial units as indicated by the Mesopotamia cuneiform records. The observations of the movements of heavenly bodies by these units revealed that there were systematic variations from Marduk’s sacred system. The difference between reality and the ideal system was interpreted as being ominous.
Therefore, the discordance between the two systems thought to be reconciled through ritual ceremonies. Babylonian rulers used this discrepancy between the systems for administrative purposes. In line with Brown (2000), Robson (2004) argued that the 60
Bablylonians had religious ceremonies at the temples in order to reconcile the real and sacred times and to adjust the calendars by moving clothing ceremonies and animal sacrifice forward or backward a day according to the length of the month to avoid bad omens.
In sum, the Enūma Eliš reflects the ancient Babylonians’s experiences with a severe environment and their desire for ordered space. This epic describes how Marduk arranged the spatial order of his ancestral gods into heaven and determined the operation of star movements, which are associated with a cyclical temporal order of the cosmos. For the
Babylonians, the temporal order created by Marduk had been considered ideal and sacred.
The discrepancy between ideal time, established by the god, and worldly time is being considered ominous, therefore the ancient Babylonians have tried to recover and restore cosmic order through ceremonies including the akītu festival.
India
The ancient Indian understanding of the world is complex. In this section, I will look at that complexity through the Ṛgveda, a hymn collection dedicated to several deities and the oldest text of the four vedas (ca. 1,200 - 1,000 BCE). Yanchevskaya & Witzel
(2017) examined the Ṛgveda in order to reconstruct the ancient Indian views of the world.
The authors argue that the Ṛgveda is the root of the Indian time and space concepts. For example, Prasad (1992) argued that, on the basis of the vedas, the Upaniṣad and the
Ved!"nta, the post-vedic writings, show not only a radical development of thinking about time but also a diversity of understanding of the world in ancient India. This divergent
61 development can be seen in the emergence of the Brahmanical and Buddhist schools. Each school formulated its own perspectives on time and space after contemplating the origin of the world (Baslev, 2009). In regard to the concept of time, Yanchevskaya & Witzel (2017) pointed out two important properties of time in the ancient Indian perspective of the world:
1) eternity and 2) ephemerality. In the following section, I will touch upon those two temporal properties and add another one, cyclicity, which is associated with eternity. Next,
I will discuss the ancient Indian understanding of space within a continuum of time and space in the Ṛgveda. In the context of the ritual ceremonies, the continuum connects the inner world with the external world.
According to Yanchevskaya & Witzel (2017), there was no abstract term for time in the Ṛgveda where k!"la, which denotes ‘in a [proper] moment of time’, appeared once and samaya, another word for ‘time’, was not used at all. Examining the Ṛgveda,
Yanchevskaya &Witzel (2017) noted two different understandings of time in ancient India and proposed a term ‘two times’ to discuss two properties of time, that is, eternity and ephemerality. Eternity refers to something everlasting. Eternity is an imperceptible feature of time. In chapter 2, I discussed the fact that human temporal experience is based on perception of changes. Through perception, we detect events. An ensemble of different sensory modalities allows us to notice changes around us. As the fundamental psychological building blocks of time, event detection involves discerning a change of state of something. However, eternity or everlastingness is associated with an unchanging, stable, and constant state of an object. It implies that there is no event to be detected. As a purely
62 abstract concept, eternity is characterized by its atemporality. The penultimate hymn of the tenth maṇḍala titled ‘creation’ of the Ṛgveda represents this eternity property of time:
1. From fervor (tapas) kindled to its height Eternal Law(ṛta) and Truth (satya) were born: Thence was the Night (rātri) produced, and thence the billowy flood of sea arose. 2. From that same billowy flood of sea the Year (saṃvatsara) was afterwards produced, Ordainer of the days nights, Lord over all who close the eye. 3. Dh!"tṛ, the great Creator, then formed in due order Sun and Moon. He formed in order Heaven and Earth, the regions of the air, and light. (RV10.190, Griffith, 1896)
In the Ṛgveda, eternity is associated with ṛta. As the principle ṛta is the cosmic order in collaboration with satya (Truth/Reality). Another important term in this hymn is tapas (fervor/heat) that stands for certain spiritual or religious practices. It produced ṛta and satya. Both ṛta and satya were the preconditions for rātri (the night) and the billowy flood of sea. From the primordial chaos by the night and the billowy flood of sea, saṃvatsara (year) was born. According to Yanchevskaya & Witzel (2017), saṃvatsara denotes “all-powerful time-eternity in the Ṛgveda” (p. 24). After the birth of saṃvatsara, the world found its temporal organization that has governed movements of heavenly bodies.
In sum, ṛta is the organizing principle of the universe and its medium is saṃvatsara.
Saṃvatsara implies that cyclicity is a immanent property of in ṛta. Although cyclicity is not considered in Yanchevskaya & Witzel (2017)’s discussion, Brown (1968) argued that an Indian idea of cyclical time within the ever-revolving wheel of time goes back to the Ṛgveda. For example, a famous riddle of the universe hymn, asya vāmasya
(RV1.164) alludes to cyclicity of ṛta by using a wheel metaphor. This riddle hymn gives a description of the sun’s chariot whose wheel runs around heaven. Like the eternal world, heaven never changes. So the wheel running around heaven transcends both time and space. 63
The wheel is composed of several parts that are specified with certain numbers symbolizing different things in a cycle of a year. For example, the three naves of the wheel symbolize three seasons or three worlds, which are associated with the solemn vedic ritual.6 The five spokes or five feet of the wheel relates to five seasons that are distinguished in the vedic tradition.7 The twelve spokes or fellies of the wheel indicate twelve months.8 The 360 pegs that are used to put the wheel spokes together relate to the days of a year (see footnote 6).
In a cycle of a year that is represented as the wheel and involves solar rotation, all creatures come to life and die.9 In terms of time, the atemporal ṛta reveals its temporal manifestations
(e.g., solar movement) through cyclicity. The strong connection of eternity and cyclicity is also reflected in Indian music-making. In his discussion of time in Indian music, Clayton
(2000) said, “the ultimate nature of the rāg is thought of as unchanging, while it is constantly renewed in performance as cycle inevitably follows cycle” (p.16).
Let me return to the Ṛgveda’s creation hymn in order to understand the ancient
Indian perspective on space. There is an interesting difference in the process of world creation between the ancient Babylonia and India. I argued that the Enūma Eliš reflects the geographic environment where the early Babylonians probably suffered from frequent
6 Twelve are the fellies, and the wheel is single; three are the naves. What man hath understood it? Therein are set together spokes three hundred and sixty, which in nowise can be loosened (RV 1.164.48, Griffith, 1896). Three seasons include summer, monsoon, and winter while three worlds are composed of heaven, earth and underworld. 7 Upon this five-spoked wheel revolving ever all living creatures rest and are dependent. Its axle, heavy-laden, is not heated: the nave from ancient time remains unbroken (RV 1.164.13, Griffith, 1896). 8 Formed with twelve spokes, by length of time, unweakend, roll round the heaven this wheel of during Order (RV 1.164.11, Griffith, 1896). 9 The wheel revolves, unwasting, with its felly: ten draw it, yoked to the far-stretching car-pole. The Sun's eye moves encompassed by the region: on him dependent rest all living creatures (RV 1.164.14, Griffith, 1896); also see footnote 6. 64 water floods so they might have valued space more than time. I speculated that this may be the reason why an establishment of the spatial order by Marduk preceded that of temporal order in the fifth tablets of the Enūma Eliš. In contrast, the Ṛgveda prioritizes time. Time organized space. Associated with eternity and cyclicity, time regulated everything in the world and surpassed space. Although both the Ṛgveda and the Enūma Eliš described chaos as the primordial state that was associated with water, the Ṛgveda’s prioritization of time over space was opposite to the Babylonian origins story and its Jewish parallel, the Book of Genesis. The Enūma Eliš set spatial order from the watery chaos before temporal order.
In this sense, commonalities can be found in the Book of Genesis where God made the heavens and the earth by clearing the waters. Then God filled space with his creations in a timely manner:
1. In the beginning, when God created the heavens and the earth— 2. [b]and the earth was without form or shape, with darkness over the abyss and a mighty wind sweeping over the waters— 3. Then God said: Let there be light, and there was light. 4. God saw that the light was good. God then separated the light from the darkness. 5. God called the light “day,” and the darkness he called “night.” Evening came, and morning followed—the first day. (Genesis 1:1-5, New American Bible)
With regard to space represented in the Ṛgveda, Yanchevskaya & Witzel (2017) pointed out an importance of the Puruṣasūkta hymn of the tenth maṇḍala. Puruṣa is the cosmic giant who sacrificed himself10 and his body gave rise to all elements that filled space.11 For example, heaven, earth, and interspaces, the sun and the moon, and living
10 When Gods prepared the sacrifice with Puruṣa as their offering, Its oil was spring, the holy gift was autumn; summer was the wood. (RV 1.10.5, Griffith, 1896). 11 A thousand heads hath Puruṣa, a thousand eyes, a thousand feet. On every side pervading earth he fills a space ten fingers wide. 65 beings originated from his sacrificed body. However, it is important to note that Puruṣa’s sacrifice gave him eternity12 and cyclicity13 so that the cosmic giant Puruṣa created spaces and the materials filling in the time-space continuum.
Over time, this time-space continuum model had been developed in the ancient
Indian philosophy. One of the earliest models is found in a section of the Yuriveda, one of four Vedas describing actions related to rituals. For example, the Taittirīya upaniṣad, part of the Yuriveda, that suggests five layers inserted in the continuum of cosmos (see fig.4).
The layers contain the elements coming from the sacrificed Puruṣa’s body. Above all, the
Taittirīya paniṣad’s model introduced an existence of another continuum between inner cosmos and outer cosmos. I will discuss a modified and more elaborated model below.
Figure 4. Taittirīya upaniṣad’s model of a continuum of the cosmos (Rowell, 1992, p.16): Reprint permission granted by the publisher
Another property of the ancient Indiam time is ephemerality. This property reflects a transitory and concrete property of time. Time is often marked as an event or event units of the year, such as seasons, month, day, nights, etc. As implied in the second meaning of
12 This Puruṣa is all that yet hath been and all that is to be; The Lord of Immortality which waxes greater still by food. (RV 1.10.2, Griffith, 1896). 13 From him Viraj was born; again Puruṣa from Viraj was born. As soon as he was born he spread eastward and westward o'er the earth. (RV 1.10.5, Griffith, 1896). 66 tapas, that is, spiritual or religious practices, we can demarcate various time units especially in religious settings. In doing so, a proper moment (ṛtu) in time becomes conceivable to people participating in a certain ritual ceremony. Pointing out that Sāyaṇa, the commentator of the Ṛgveda, described a proper moment with the term ‘kāla’ referring to a moment in time, Yanchevskaya & Witzel (2017) argued that the term ‘ṛtu’ explains ephemerality of time and emphasized its connection with rituals in the Ṛgveda. For example, the second hymn dedicated to Agni, the god of fire, writes, “To the Gods’ pathway have we travelled, ready to execute what work we may accomplish. Let Agni, for he knows, complete the worship. He is the Priest: let him fix rites and seasons” (RV 10.002.3, Griffith,
1896). Similar to that the akītu festival plays an important role in recovery of the cosmic order, the Priest, Agni, organizes time via rites. In his monumental work The Ritual Process,
Turner (1977) analyzed three different units of rituals: 1) separation, 2) liminality, and 3) aggregation. In the second unit of a ritual, that is in liminality, a person experiences a moment of being neither here nor there. To me, this unit seems closely related to the proper moment.
Eliade’s (1992) proposed sacred time and its counterpart, profane time. The former takes eternity into consideration and the latter is “the continuous and irreversible time of our everyday, desacralized existence” (p.97). On the basis of Elaide, Prasad (1992) argued sacred and profane times relate to each other because the re-actualization of myth, namely rituals, would bring sacred time back. Again, this is reflected in the Babylonian akītu festival. In both cultures, rituals bring back the ideal time, that is, sacred time. Then how can rituals revive it? A possible explanation for this can be found in Kak’s (2009)
67 investigation of a role of the vedic temples. To begin with, Kak (2009)’s interpretation of the vedic temples is based on Sāṅkhya’s model of a continuum of the cosmos (see fig.5).
Therefore, let me discuss Sāṅkhya first and then return to Kak (2009).
Figure 5. Sāṅkhya’s model of a continuum of the cosmos modified from Rowell (1992, p. 30): Reprint permission granted by the publisher.
Over many centuries, models of a time-space continuum developed in various ways depending on different schools in ancient India. One of the earliest models is shown in
Taittirīya upaniṣad (fig. 4). According to Rowell (1992), Sāṅkhya, one of the Brahmanical schools14, suggested the most fully developed view on the processes of how the outer cosmos transforms the inner one by introducing the tripartite doorkeeper model. Compared to the Taittirīya upaniṣad where its five layers have their own inner world and corresponding outer world in a cosmos continuum, the Sāṅkhya model consists of the inner
14 Other schools include Nyāya, Vaiśeṣika, Yoga, Mīmāṁsā, and Vedānta. 68 world as associated with the mind and the outer world as related to the external world.
Furthermore, the inner world is refined with the three door keepers, which refers to the mind. The five layers in the Taittirīya upaniṣad model were reorganized, too. Sāṅkhya, suggested the five layers of the sense and motor organs, which correspond to perception and action respectively. The five layers of the five subtle and physical elements represent the external world. The most important aspects of Sāṅkhya’s expanded model are not only a connection of sense (i.e., perception) with motor organs (i.e., action) but also a distinction of mind from ego in terms of the involvement of consciousness. This relates to the concepts of time and space that are widely accepted in the field of the cognitive science. In my previous chapters, I discussed a strong tie between perception and action in human experience of time and space. We perceive changes in our surrounding. We react to the changes. Our bodily movements create other changes in a surrounding environment. We perceive new changes. This ongoing perception-action feedback loop allows temporal and spatial experience. Gibson’s active touch and Paillard’s action space are the best example of this. In addition to this, I also discussed the role of consciousness in the perception of space (e.g., body schema vs. body image). In the Sāṅkhya’s model, manas and ahaṅkāra seem to correspond to body schema and body image respectively. Manas interprets raw data retrieved from sensory modality. This does not require an involvement of conscious awareness. Therefore, manas is close to body schema. Ahaṅkāra involves consciousness so it may relate to body image. Sāṅkhya could be considered as a pioneer of time and space in cognitive science.
69
Let me return to a discussion about the relationship between sacred time and rituals in ancient India. With regards to the vedic temple, Kak (2009) not only described the temple that is the place binding time and space but also argued ritual ceremonies that unite an inner cosmos (i.e., the human mind) and an outer cosmos (i.e., external physical world). Kak
(2009) focused on architectural aspects of the vedic temples and noted three different shapes of altars: circular, half-moon, and square. The circular altar is a symbol of the earth for the outer cosmos and of the body for the inner cosmos. The half-moon altar is a symbol of the atmosphere for outer cosmos and of heart for the inner one. The square alter is a symbol of heaven for the outer cosmos and head for the inner one (Kak, 1995). Additionally, the agnicayana altar (i.e., the fire altar), the main altar in the vedic rituals, consists of a thousand bricks built in five layers. The five layers signifies the five great elements (e.g., earth, water, fire, air, and ether) that are associated with the five senses (Kak, 2009), which coincides with the Sāṅkhya’s cosmos continuum model.
To sum up, the ancient Indian understanding of time is more complex than that of the Babylonians. The Ṛgveda indicates three different properties of time: 1) eternity, 2) cyclicity, and 3) ephemerality while the Enūma Eliš suggests that time is marked by the movements of heavenly bodies. Time in ancient India seemed to be more powerful than the Babylonian time. The ancient Indian time as eternal time was the law that organized orders of the world. However, this required a scarification. By sacrificing himself, the cosmic giant Puruṣa acquired eternity and cyclicity so he was able to create space and all elements in the space. In terms of the ephemerality of time, a concrete event or event unit was often marked by the ritual ceremonies. Through the rituals, sacred time came back
70 and the outer and inner cosmos were in accord. Models of the continuum of cosmos had been developed from Taittirīya upaniṣad of the Yuriveda to Sāṅkhya. In terms of time and space, Sāṅkhya’s expanded model showed similarities to cognitive science. First, both
Sāṅkhya and cognitive science consider that time and space emerge from a sensorimotor connection. Second, both views differentiate mental representations of the world depending on an involvement of consciousness.
China
Many sinologists (e.g., Marcel Granet, Joseph Needham, A. C. Graham, etc.) speculated about the ancient Chinese view of a continuum of time and space. They pointed out that the ancient Chinese view of the world might have been shaped by the correlative, associative, or metaphoric mode of thinking about the world.15 Bodde (1991) argued that the correlative thinking creates a series of connected ideas and is based on metaphor. In line, Wu (1995) discussed the fact that the ancient Chinese constructed space and time within the web of experience, that is, in a contextualized world. According to Pankenier
(2004) who applied the metaphorical term ‘fabric of space-time’, the early Chinese synthesized time and space via a metaphoric way of thinking, as an art of weaving. These accounts show that the early Chinese built the concepts of space and time in reflection of their understanding of the world in a holistic way by connecting interrelated items and
15 Fung (2010) reviewed the ideas of correlative thinking developed by Western sinologists including Granet, Needham, Graham, Hall, and Ames, and pointed out a problem of their dualistic view of correlative vs. analytic way of thinking. As shown in the main text, recent scholars tend to avoid the term ‘correlative’ and to exchange it with ‘metaphoric’. Although Fung (2010)’s arguments on the premise of correlative thinking are valuable, in this chapter, I do not intend to delve into different scholars’ assumptions. 71 grouping them. One of the earliest notes about this was made by Granet, one of the most influential figures in the study of the Chinese mind. In his La Pensée chinoise, Granet
(1934), asserted that the ancient Chinese would not only see that time and space consist of blocks but also would think that a unit with the correlated blocks create an assemblage.
Although Granet’s remark on blocks of time and space seems to reflect on the conception of space and time as container of the physical material world, which is related to modern science (Mondragon & Lopez, 2012), the most important aspect in Granet’s discussion is that time and space in ancient China were associated with events caused by concrete actions :
All [Chinese thinkers] prefer to see in time an assemblage of eras, seasons and epochs, and in space a complex regions, climates and directions. In each such directions, extension [i.e., space] particularizes itself by assuming the attributes peculiar to a single climate or region. In the same way, duration [i.e., time] differentiate itself into varied time periods, each bearing the characterization appropriate to a single season or era. (Granet, 1934, p. 86, trans. by Bodde, 1991, p.104)
Time and space are never conceived apart from concrete actions…. The words shih [“occasion” or “timeliness”] and fang [“direction” or “regions”] apply respectively to all portions and parts of duration and extension- each and every one of which, however, is in each instance viewed under its own distinctive aspect. The two terms are evocative neither of space nor of time per se. Shih calls to mind the idea of circumstance or occasion (which may be either propitious or unpropitious for a give action); fang, that of direction or location (which may be either favorable or unfavorable for a particular instance. Thus time and space form a complex of symbolic conditions, both determining and determined; they are always imagined as an assemblage of concrete and diverse groupings of locations and occasions. (Granet, 1934, p. 88-9, trans. by Bodde, 1991, p.104)
Granet’s argument regarding the role of correlative relationship in building an assembled block is also reflected in the Chinese word for space consisting of two Han characters yu ( ) and chu ( ). They share a radical, ‘ ’, that designates a ‘roof’ and establishes a link between yu and chu. When the two characters are independent from each other, yu means ‘eaves’, ‘room’, or ‘world’ while chu refers to ‘roof timbers’, ‘house’, or 72
‘eternity’. The compound word yuchu ( ) denotes the eternal world, which can be understood as an infinite universe, i.e., the cosmos. Yuchu also signifies all things and happenings in nature. This suggests that the early Chinese understanding of the cosmos was different from that of ancient India. In my earlier discussion I pointed out that, in ancient India, time had three properties: eternity, cyclicity, and ephemerality. As the eternal law, ṛta organizes the world. Space and all filling elements of space cannot be created without the cosmic giant’s sacrifice. Puruṣa attained eternity through his sacrifice. The idea of an acquisition of eternality via God’s self-sacrifice is absent in the Chinese yuchu.
According to Granet, the early Chinese might not have considered eternity because there is no concrete action that can be correlated to eternity. As alluded to above, abstract time was difficult to think of for the ancient Chinese (Bodde, 1991). Then, how did the early
Chinese understand time? This can be seen etymologically with a word shijian ( ). The first character shi ( ) consists of three components, the radical ri ( ) standing for the sun, tu ( ) meaning the earth, and cun ( ) denoting something small or a radial artery pulse at the wrist. Shi refers to a moment or a happening of an event established by a concrete action of budding or blossoming. Composed of gate (men ) and light (ri ), jian ( ) means ‘between’. Putting these components together, shijian indicates a temporal interval between specific moments. In other words, duration as a temporal block is defined by a relationship between particular events. This way of understanding of time demonstrates that the ancient Chinese people conceived of time holistically while associating time with concrete actions or events.
73
In contrast to this holistic and correlative understanding of time and space, there existed a philosophical school looking at space and time as abstract concepts (Bodde, 1991).
This school is called Mohist, that was founded by Mozi ( ca. 479 -381 BCE). The
Mohists’ extensive discussion showed a different speculation on time and space compared to the holistic views (Harbsmeier, 1995). They considered time and space in relationship with movements of an object. This abstract way of thinking about time and space
(Needham, 1966) was not widely accepted by the early Chinese (Bodde, 1991):16
Time Canon I 40 : Duration (chiu :) includes all particular (different) times (shih ). Exposition : Former times, the present times, the morning and the evening, are combined together to form duration.17 (Needham,1966, p.93)
Space Canon I 41 : Space (yu ) includes all the different places (so ) Exposition : 'East, west, south and north, all are enclosed in space.18 (Needham,1966, p.93)
Movement in space (frames of reference) Canon II 63 : When an object is moving in space, we cannot say (in an absolute sense) whether it is coming nearer or going further away. The reason is given under “spreading (fu )” (i.e., setting up coordinates by pacing). Exposition : Talking about space, one cannot have in mind only some special district (chhü 傴). It is merely that the first step (of a pacer) is nearer and his later steps further away. (The idea of space is like that of ) duration (chiu :). (One can select a certain point in time or space as the beginning, and reckon from it within a certain period or
16 Yuan (2006) argued that the Later Mohist Canon does not deal with time as an abstract concept by analyzing the letters etymologically. She also discussed that the comparative philosophers’ approach to the text not only misrepresents the ancient Chinese concept of time but also excludes the subjective character of time. However, Yuan’s interpretation of subjective time in the Mohism seems problematic. First, the Mohists’ distinction of duration from time that is marked by events is not properly reflected in Yuan’s translation. Second, the etymology of chiu (:) does not take the main argument on time by the Mohists into account. 17 40 : : 18 41 : : 74
region, so that in this sense) it has boundaries, (but time and space are alike) without boundaries.19 (Needham,1966, p.93)
Movement and duration Canon II 64 : Movement in space requires duration. The reason is given under “earlier and later (hsien hou ).” Exposition : In movement, the motion (of an observer) must first be from what is nearer, and afterwards to what is further. The near and far constitute space. The earlier and later constitute duration. A person who moves space requires duration. 20 (Needham, 1966, p.94)
In terms of time, the Mohists differentiated duration (chiu :) from event
(shih ).21 This seems to correspond to two psychological building blocks, duration perception and event detection, as discussed in chapter 2. Although the Mohists’ perspective on time and space was different from other schools, they probably did not assume a complete independence between time and space. Needham (1966) wrote,
“Perhaps the Mohists envisaged something like what we should now speak of as a universal space-time continuum within which an infinite number of local reference frames coexist, and guessed that the universe would look very different to different observers according to their positions in the whole” (p. 95).:
19 63 : : 傴 20 64 : : : : : 21 Here I did not take Needham’s translation of shih ( ) of time, that is, change. I would rather interpret it as event. The reason for this can be found in the above etymological discussion of shih ( ). 75
Space and Time Canon II 13 : The boundaries of space (the spatial universe) are constantly shifting. The reasons is given under ‘extension (chhang ).’ Exposition : There is the South and the North in the morning, and again in the evening. Space, however, has long changed its place.22 (Needham, 1966, p.94)
Canon II 33 : Spatial positions are names for that which is already past. The reason is given under ‘reality (shih ) Exposition : Knowing that ‘this’ is no longer ‘this’, and that ‘this’ is no longer ‘here’, we still call it South and ‘North. That is, what is already past is regarded as if it were still present. We called it South then and therefore we continue to call it South now.23 (Needham, 1966, p.94-5)
In the following section, I will return my discussion regarding the representative ancient Chinese view on time and space with historical evidence. For this, I will first examine the early documents, including the Book of Change (yijing ) and its commentaries. This will allow us to understand the ancient Chinese holistic views of the world. I will also discuss how time and space are established in music-making in qin performance.
The transition from the Feudal age (Zhou dynasty 1030 - 221 BCE)24 to the early imperial periods (Qin dynasty 221 - 207 BCE and Han dynasty 202 BCE - 220 AC) contributed significantly to the formation of the two branches of Chinese thoughts, that is,
Confucianism and Taoism. Although the Book of Change, an ancient Chinese divination text from the early Zhou, is too enigmatic to derive the definitive Chinese concepts of time
22 13 : : : : 23 33 : : 24 According to Needham (1962, Vol.4, part 1., p.431), the Zhou dynasty consists of three different periods including the Early Zhou (1030 - 772 BCE), the Chhun Chhiu (722 - 480 BCE), and the Warring State (480 - 221 BCE). 76 and space, its commentary called the Ten Wings (shiyi ) provides us with some hints regarding how the early Chinese had shaped their own views of the world. After his review of several excerpts of the Book of Change and the Ten Wings, Lin (1995) argued that the former used time and space to deliver oracles and that the latter demonstrated how scholarly interpretations transformed the divination text into a philosophical discussion of life and the world.
To begin with, the Book of Change describes the world as neither stable nor fixed.
Rather it continuously changes. However, the principle that mandates the changes of the world is constant. In other words, two opposite but complementing life forces (qi ), yin
( ) and yang ( ) are engendered by the great ultimate (taiji ). Dynamics between yin and yang lead to the change of all creations in the world (Chan, 1963). Then, how could the principle, which is unchanging and stable, create changes? Both yin and yang wax and wane but they do not completely overwhelm the other. The constant changes arising from two opposite forces create the repetitive cycles of nature. The Book of Change indicates that the constancy of the principle is achieved by the cycles of day and night, four seasons, etc. The relationship between the unchanging principle and its manifestation, cyclicity, seems to be parallel to that between eternality of ṛta and cyclicity associated with ṛta in ancient India. The early Chinese also considered wood, fire, soil, metal, and water as the five elements (wu xing ). These elements change gradually in the course of the cycle.25
25 Bodde (1991) discussed how the yin-yang theory was combined in the five elements theory in ancient China. He viewed that the two theories are entirely independent of each other in the beginning. The yin-yang theory first appeared during the early Chou period and had no cosmic significance. Similarly, Redmond (2017) speculated the yin-yang theory arose from multiple sources and argued that the Book of Change may be one of them. The five elements theory was discussed a little later in the Zuozhuan ( ). It describes only 77
This seems to correspond to the five layers of a continuum of cosmos described in models of Taittirīya upaniṣad and Sāṅkhya.
Let me examine how the Book of Change and its commentaries explain the principle in the context of the Chinese philosophy. In the Book of Change, yin and yang are transformed into the eight trigrams 26 and expanded to the 64 hexagrams. The first hexagram is qian ( ), a symbol of heaven and dragon. Its judgment text is yuan heng li zhen ( )27 that has been known to all educated Chinese people. Translating yuan heng li zhen as “begin with an offering; beneficial to divine” (p.63), Redmond (2017) interpreted this phrase as an introductory invocation for the divination act and a proclamation to the spirits. However, an earlier translation of this invocation suggested that qian is the fundamental organizing principle of the world. For example, Legge (1882) regarded qian as “what is great and originating, penetrating, advantageous, correct and firm” and Wilhelm & Baynes (1967) translated yuan heng li zhen as “the creative works sublime success, furthering through perseverance” (p. 6). The first of the Ten Wings, Tuan zhuan( ) commented on this judgment as follows:
material substances that are not regarded as constituents of the world. Zou Yan ( ; 305 – 240 BC) united two theories for the first time. However, the five elements were not treated in philosophical writings before the late third century BC. After the Qin dynasty was established, the ideas of the two theories have been adopted in all schools of thoughts and integrated with Confucian moral and social values into an all- embracing system. 26 The eight trigrams include qian ( , heaven), zhen( , thunder), kan ( , water), gen ( , mountain), kun ( , earth), xun( , wind), li( , fire), and dui ( , a collection of water). Each trigram is combined with another thus making 64 hexagrams in total. (Redmond, 2017) 27 Althougth the hexagram qian ( ) are assigned to the fourth month from May to June (Wilhelm & Baynes, 1967, p.3), its judgment yuan heng li zhen ( ) are discussed in terms of seasons. The first character, yuan ( ), as spring, denotes the beginning of things in the universe. Second, heng ( ) is associated with summer and signifies growth of all things. As fall, li ( ) refers to achievement. For winter, zhen ( ) means completion. Yuan heng li zhen signifies a cycle of life. 78
Vast is the ‘great and originating (power)’ indicated by Qian! All things owe to it their beginning: - it contains all the meaning belonging to (the name) heaven. The clouds move and the rain is distributed; the various things appear in their developed forms. (The sages) grandly understand (the connexion between) the end and the beginning, and how (the indications of) the six lines (in the hexagram) are accomplished, (each) in its season. (Accordingly) they mount (the carriage) drawn by those six dragons at the proper times, and drive through the sky. The method of Qian is to change and transform, so that everything obtains its correct nature as appointed (by the mind of Heaven); and (thereafter the conditions of) great harmony are preserved in union. The result is ‘what is advantageous, and correct and firm.’ (The sage) appears aloft, high above all things, and the myriad states all enjoy repose.28 (Legge, 1882)
It is important to note that qian as the principle of the world not only governs cyclical movements in nature (e.g., clouds, rain, etc.) but also acts on the world of men.
The parallels between the universe and the world of men (Bodde, 1991) implies the Chinese correlative way of thinking. Specifically, the Wenyan zhuan ( )29, the seventh of the
Ten Wings, focuses on the world of men. Chan (1963) discussed the fact that the early
Confucianist Chinese might have attempted to operate the forces that they could control rather than the forces that are governed by nature. Qian in the human world is the assemblage of goodness (shan ), excellences (jia ), righteousness (yi ), and action
(shi ). According to the Wenyan zhuan’s interpretation, yuan heng li zhen is:
What is called (under qian) 'the great and originating' is (in man) the first and chief quality of goodness; what is called 'the penetrating' is the assemblage of excellences; what is called 'the advantageous' is the harmony of all that is right; and what is called 'the correct and firm' is the faculty of action. The superior man, embodying benevolence, is fit to preside over men; presenting the assemblage of excellences, he is fit to show in himself the union of all propriety; benefiting (all) creatures, he is fit to exhibit the harmony of all that is right; correct and firm, he is fit to manage (all) affairs. The fact that the superior man practises these four virtues justifies the application to him of
28 29 The wenyan zhuan provides commentary only for the two hexagrams, qian ( , heaven) and kun ( , earth). 79
the words – ‘Qian represents what is great and originating, penetrating, advantageous, correct and firm’.30 (Legge, 1882)
Lewis (2006) focused on how early China constructed a spatial order and emphasized the role of human actions. The spatial order can be set up hierarchically by cultivating bodies, organizing families, building cities, forming regional networks, and establishing an empire. To begin with, he introduced luan ( ) referring to chaos. In ancient
Chinese history, luan and unity (i.e., the ordered state) are associated with the Warring states and the Imperial period respectively. It is important to note not only that spatial order is not naturally given but also there is a repetitive cycle between luan and unity. This means that the early Chinese continuously strived to achieve or recover the spatial order through their actions, which might have been developed as combining the two theories of Taoism and Confucianism. The early Chinese people viewed that the ordered spaces are morally organized and achieved via self-cultivation (e.g., the way dao ). Morality shows a
Confucianist influence while self-cultivation relates to Taoism. The ancient Chinese perspectives on the ordered spaces appeared the beginning of the second chapter of the
Great Learning (da xue ), one volume of the Book of Rites (li ji ):
The ancients who wished to illustrate illustrious virtue throughout the world, first ordered well their own states. Wishing to order well their states, they first regulated their household. Wishing to regulate their household, they first cultivated their body. Wishing to cultivate their body, they first rectified their mind. Wishing to rectify their mind, they first sought to be sincere in their thoughts. Wishing to be sincere in their thoughts, they first extended to the utmost their knowledge. Such extension of knowledge lay in the investigation of things. Things being investigated, knowledge became complete. Their knowledge being complete, their thoughts were sincere. Their thoughts being sincere, their hearts were then rectified. Their hearts being rectified, their persons were
30 80
cultivated. Their persons being cultivated, their families were regulated. Their families being regulated, their states were rightly governed. Their states being rightly governed, the whole kingdom was made tranquil and happy. From the Son of Heaven down to the mass of the people, all must consider the cultivation of the person the root of everything besides. It cannot be, when the root is neglected, that what should spring from it will be well ordered. It never has been the case that what was of great importance has been slightly cared for, and, at the same time, that what was of slight importance has been greatly cared for.31 (modified from Legge, 1882)
This quote makes several points in connection to the previous discussions about the ancient Babylonian and Indian views on time and space. First, both ancient China and
Babylonia seem to consider spatial order to be important. Compared to the Babylonian spatial order set by the god, Marduk, Chinese spatial order was created by human actions.
Second, the Chinese idea of space seems unstable because of its alternation between luan and unity due to the dynamics between yin and yang. However, the Enūma Eliš described that, once the spatial order set up from the primordial watery chaos, there was no reverse.
Third, each unit of space in ancient China is placed on a continuum of the cosmos (Bodde,
1991). The units include the world (tianxia : literally means ‘under sky’), state (guo ), household (jia ), body (shen ), mind (xin ), thought (yi ) and knowledge (zhi ).
These units of the continuum were similar to the ancient Indian views of the world. In specific, the last three units seem to correspond to the three door keepers in the Sāṅkhya models. Xin ( mind), yi ( thought) and zhi ( , knowledge) match to manas (mind), ahaṅk!"ra (ego) and buddhi (knowledge, intellect)
31 81
Then what was the role of music in the ancient China in conceiving of space, time, and the cosmos? In terms of the Chinese idea of space, music plays an important role in building the spatial order by bringing peace to the world. More precisely, music was composed by an ancient king as a part of ritual service that let the world restore an equilibrium through enthusiasm. The sixteenth hexagram yu ( ) in the Book of Change alludes to this calming function of music. The top trigram zhen ( ) means an activity of thunder at the beginning of summer that causes the arousal of the earth. The bottom trigram kun ( ) symbolizes the arousal of the earth. Therefore, the hexagram yu ( ) is a symbol of enthusiasm, the devotion to the movement of the nature. The third of the
Ten wings, the xiang zhuan ( ) provides the commentary on the image of yu as follows:
(The trigrams for) the earth and thunder issuing from it with its crashing noise form yu. The ancient kings, in accordance with this, composed their music and did honor to virtue, presenting it especially and most grandly to God, when they associated with Him (at the service) their highest ancestor and their father.32 (Legge, 1882)
The above comment clearly demonstrates that the ruler makes music to commemorate virtue so he can please spirit of god and ancestors. Music in rites bridges the gap between the universe and the world of men. A post-Confucian work, the Records of
Music (yueji ) which is a part of the Book of Rite, reflects Confucian views on the role of music in establishing spatial order that is discussed above. The Records of Music not only suggests a connection between music, governance, and the natural order but also implies an importance of harmony between them (Cook, 1995). Specifically, Cook (1995)
32 82 argued that the Records of Music differentiates sheng ( ), yin ( ), and yue ( ).
According to him, there is no equivalent term corresponding to these different levels. He wrote, “at times as different stages the development of music in terms of its moral qualities, and its relation to governance, society, and the harmony of the natural world” (p. 20).33 For example, sheng is just an audible sound like a human scream or animal call. When sounds are ordered and have a meaning, they are no longer sheng, but yin. Yue is the final stage of the music development because it carries virtue. The transformations from sheng to yin and from yin to yue are associated with the level of virtue, the moral quality. Yue correlates to the human action that contributes to virtue.
DeWoskin(1982) suggested correlative thinking in the Chinese views on arts and aesthetics, and states that “both music and ritual address the twin concerns of self- cultivation and social order, appropriately contextualized in the prevailing cosmic order”
(p.175). As implied in the hexagram yu, the principal concern of music and rituals is human emotions in order to reach the ideal state of mind and an ideal life. For this, music and rituals should be properly coordinated. In a discussion about the role of music, self- discipline and self-cultivation of emotion are important to achieve the ideal state of mind as rites aim to control social concerns. Therefore, musical harmony is as important as cosmic harmony. There are shared values between music and ritual. Ya ( ) which means
33 旄 In all cases, the arising of music (yin) is born in the hearts of men. The movement of men’s hearts is made so by [external] things. They are touched off by things and move, thus they take shape in [human] sound (sheng). Sounds respond to each other, and thus give birth to change. Change forms a pattern, and this is called music (yin). The music is brought close and found enjoyable, and reaches the point of shields and axes, feathers and pennants, and this is called music (yue) (Cook ,1995, p. 24-5). 83 elegance is one of them. Pointing out that there is no duality between culture and nature in
Chinese way of thinking, DeWoskin (1982) explained that elegance means being a unity with nature. As Lewis’s (2006) argument above shows, being a whole with nature requires human actions, namely, practice. Strange as it may sound, being a unity with nature is not naturally given. This seems to resonate Granet’s remark on the importance of action in the
Chinese understanding of time and space by stating “Time and space are never conceived apart from concrete action” (trans. by Bodde 1991, p. 104).
This view that space and time emerge from concrete actions, more precisely, the ordered space and time emerge from human action for being in harmony with nature, is reflected in the qin ( ) practice. Since the qin, the seven-string zither, has been the most favored musical instrument of the elite class, it is well documented in literary sources and manuscript notation.34 The most characteristic feature of the qin notation is that it notates notate neither temporal/durational values nor pitches. Rather it informs a player how to act on or interact with the instrument. Specifically, it tells how each finger of both hands moves on the instrument. The finger movements are described with symbols that are known as zhifa ( ), namely, a finger technique. Yung (1997) exemplified zhifa with the symbol