Spatial Cross-Correlation

Total Page:16

File Type:pdf, Size:1020Kb

Spatial Cross-Correlation Biol. Cybern. 47, 149-163 (1983) Biological Cybernetics @ Springer-Verlag 1983 Spatial Cross-Correlation A Proposed Mechanism for Acoustic Pitch Perception Gerald E. Loeb*, Mark W. White, and Michael M. Merzenich Coleman Laboratory, Department of Otolaryngology, University of California at San Francisco, San Francisco, California, USA * Laboratory of Neural Control, IRP, National Institute of Neurological and Communicative Disorders and Stroke, Bethesda, Maryland, USA Abstract. We propose in this paper a new class of Profoundly deaf subjects have been chronically im- model processes for the extraction of spectral infor- planted with intracochlear electrode arrays in an at- mation from the neural representation of acoustic tempt to bypass their defective hair cells, which nor- signals in mammals. We are concerned particularly mally transduce mechanical vibrations traveling along with mechanisms for detecting the phase-locked ac- the basilar membrane into a temporospatial pattern of tivity of auditory neurons in response to frequencies discharges in the distributed auditory nerve fiber pop- and intensities of sound associated with speech per- ulation (Simmons, 1966; Michelson, 1971 ; Clark et ception. Recent psychophysical tests on deaf human al., 1977; Tonndorf, 1977; Eddington et al., 1978; subjects implanted with intracochlear stimulating elec- Merzenich et al., 1979; Tong et al., 1979; Michelson trodes as an auditory prosthesis have produced results and Schindler, 1981). As we shall discuss here, per- which are in conflict with the predictions of the ceptions of sound evoked by electrical stimulation in classical place-pitch and periodicity-pitch theories. In these patients cannot be reconciled with currently our model, the detection of synchronicity between two dominant theories of how the nervous system normally phase-locked signals derived from sources spaced a extracts pitch information from patterns of neural finite distance apart on the basilar membrane can be activity generated by acoustic input. Although this is a used to extract spectral information from the new and unfamiliar source of psychophysical data, it spatiotemporal pattern of basilar membrane motion. represents a powerful tool whose underlying physical Computer simulations of this process suggest an op- processes are beginning to be better understood (see timal spacing of about 0.3-0.4 of the wavelength of Merzenich, 1973 ; Ranck, 1975 ; Marks, 1977 ; Pollen et the frequency to be detected. This interval is consistent al., 1977; Kiang et al., 1979; Black et al., 1981). with a number of psychophysical, neurophysiological, We have been exploring the possibility that pa- and anatomical observations, including the results of tients' pitch sensations arise from changes in the high resolution frequency-mapping of the anteroven- synchronicity of firing among the neurons reaching tral cochlear nucleus which are presented here. One threshold over fairly extended regions of the auditory particular version of this model, invoking the bin- nerve array. A neuronal model of normal acoustic aurally sensitive cells of the medial superior olive as pitch perception can be built on such a concept which the critical detecting elements, has properties which are is simple and which can account for psychophysical useful in accounting for certain complex binaural effects that are difficult to explain by invoking either psychophysical observations. "place-pitch" or "periodicity-pitch" models (see below). This spatiotemporal model is consistent with the de- scribed neuroanatomy and physiology of brainstem auditory nuclei. This latter point is particularly impor- Introduction tant, as current place- and periodicity-pitch theories are abstract constructs of signal theory which address It has recently become possible to directly test theories complex psychophysical problems (e.g. see Wightman, of the neural mechanisms underlying pitch perception 1973) but ignore or even contradict anatomical and by eliminating the complex transduction processes of physiological descriptions of the auditory nervous the middle and inner ear and directly activating the system. A previous attempt to invoke spatial crosscor- sensory neurons in a localized and controlled manner. relation to account for sharpness of tuning in cochlear 150 MECHANISMS FOR PITCH PERCEPTION Fig. 1. Human auditory perceptual range 120 (top) divided into three regions likely to be subserved by different pitch perception LOUDNESS mechanisms. Pure place pitch (Helmholtzian) is adequate only for high frequencies (sharply ::L\ ::L\ tuned) and low amplitudes (below neural saturation). Pure rate pitch is available only for frequencies below the limits of sustained 10 neural firing rates (less than 500 pps). The central reg&, subserving most of the speech TONE NEURAL PHASE HUMAN perception requirements, utilizes the phase- FUSION FIRING LOCKING HEARING locking of neural discharge which is LlMlT LlMlT - LlMlT increasingly stable at higher sound intensities. SINGLE NEURON The typical tuning curve of single auditory nerve fibers and AVCN cells (middle) demonstrates very little frequency selectivity at intensities typical of normal speech. This, combined with the tendency of neurons to 10 saturate at their maximal firing rates, C.F. accounts for the flat profile of neural activity in the auditory nerve for high amplitude, ACOUSTIC STIMULUS FREQUENCY - single frequency stimuli (bottom). Note that the basilar membrane tuning of decreasing frequency with increasing distance from the MEAN AUDITORY cochlear base has been indicated by reversing NERVE ACTIVITY the frequency axis in the bottom figure, PPS 250 pivoting it around the characteristic frequency t of the middle figure. In this and all other figures, basilar membrane position is given in millimeters from the basal entry point of the BASE 5 10 15 20 25 30 SEX (20,000 Hz) (20~~)traveling wave, rather than as the more BASllAR MEMBRANE POSITION, mm conventional but reversed cochlear place TRAVELING WAVE - (distance from the apex) afferents called for a complex synaptic interaction human sound perception, covering the range between inner and outer hair cell afferents which now 20-20,000Hz (Fig. 1). Each appears to involve dif- seems unlikely (Nieder, 197 la, b). ferent information extraction strategies in the auditory In this paper, the kinds of spectral information system. We here use the term "pitch" to denote any normally present in the auditory nerve array and the perceptual experience for which a subject could de- previously proposed neural models for extracting this termine a matching sinusoidal tone. We are primarily information are reviewed. Using new data on the concerned with mechanisms by which the spectral activity patterns induced in the auditory system by frequencies present are converted to an internal neu- intracochlear electrical stimulation, the perceptions ronal representation which is approximately isorepre- reported by auditory prosthesis patients are contrasted sentational. We also assume that the transduction and with the predictions of these models. A new class of pitch detection mechanisms of most mammals includ- spectral detectors which extract "synchronicity-pitch" ing man are comparable in their general form. by a process of spatial crosscorrklation is then de- Place Pitch. The highest frequencies, about 5000 to scribed. Some psychophysical and neuroanatomical 20,000 Hz, are probably detected and discriminated as predictions derived from modeling this process and a result of mechanically tuned resonance along the data consistent with these predictions are reviewed. basilar membrane of the cochlea, somewhat as pro- Finally, the utility of this theory in accounting for posed by Helmholtz (1863). Sound energy from a tone complex psychophysical phenomena is briefly dis- is coupled into the basilar membrane at the base of the cussed. Preliminary discussions of some aspects of this spiral and travels apically through progressively lower theory and some data have been presented elsewhere frequencies of resonance until it reaches the region (White, 1980; White et al., 1980, 1981; Loeb et al., tuned to its frequency. There the amplitude of vi- 1980, 1981 ; Merzenich et al., 1980). brations reaches a maximum, and there local hair cells and auditory nerve fibers are excited, giving rise to the Previous Models "place pitch" (Bekesy and Rosenblith, 1951). The The psychophysics and neurophysiology of pitch per- traveling wave is then very rapidly damped by a ception suggest that there are three distinct bands of process which is still pdorly understood (Wilson and Johnstone, 1972). The psychophysics of pitch per- back onto the dorsal cochlear nucleus (Adams and ception in this band are consistent with the tuning Warr, 1976; see Brugge and Geisler, 1978).] curves of auditory nerve fibers (Moore, 1973). We here It is probable that the nervous system makes use of use the term "place pitch" to indicate the class of pitch phase-locking of discharges generated by these mid- extraction theories based solely on a spatial gradient of range frequencies (Anderson et al., 197 1). For acoustic neural activity, ignoring temporal cues. Auditory nerve stimuli from about 500 to 5000 Hz, no single neuron fibers have sharp enough tuning curves and broad can fire fast enough to follow each stimulus cycle, but enough dynamic range to account for the relatively rather each tends to fire on randomly changing sub- modest frequency discrimination limens achievable at harmonics of the incoming frequency.
Recommended publications
  • L Atdment OFFICF QO9ENT ROOM 36
    *;JiQYL~dW~llbk~ieira - - ~-- -, - ., · LAtDMENT OFFICF QO9ENT ROOM 36 ?ESEARC L0ORATORY OF RL C.f:'__ . /j16baV"BLI:1!S INSTITUTE 0i'7Cn' / PERCEPTION OF MUSICAL INTERVALS: EVIDENCE FOR THE CENTRAL ORIGIN OF THE PITCH OF COMPLEX TONES ADRIANUS J. M. HOUTSMA JULIUS L. GOLDSTEIN LO N OPY TECHNICAL REPORT 484 OCTOBER I, 1971 MASSACHUSETTS INSTITUTE OF TECHNOLOGY RESEARCH LABORATORY OF ELECTRONICS CAMBRIDGE, MASSACHUSETTS 02139 The Research Laboratory of Electronics is an interdepartmental laboratory in which faculty members and graduate students from numerous academic departments conduct research. The research reported in this document was made possible in part by support extended the Massachusetts Institute of Tech- nology, Research Laboratory of Electronics, by the JOINT SER- VICES ELECTRONICS PROGRAMS (U. S. Army, U. S. Navy, and U.S. Air Force) under Contract No. DAAB07-71-C-0300, and by the National Institutes of Health (Grant 5 PO1 GM14940-05). Requestors having DOD contracts or grants should apply for copies of technical reports to the Defense Documentation Center, Cameron Station, Alexandria, Virginia 22314; all others should apply to the Clearinghouse for Federal Scientific and Technical Information, Sills Building, 5285 Port Royal Road, Springfield, Virginia 22151. THIS DOCUMENT HAS BEEN APPROVED FOR PUBLIC RELEASE AND SALE; ITS DISTRIBUTION IS UNLIMITED. MASSACHUSETTS INSTITUTE OF TECHNOLOGY RESEARCH LABORATORY OF ELECTRONICS Technical Report 484 October 1, 1971 PERCEPTION OF MUSICAL INTERVALS: EVIDENCE FOR THE CENTRAL ORIGIN OF COMPLEX TONES Adrianus J. M. Houtsma and Julius L. Goldstein This report is based on a thesis by A. J. M. Houtsma submitted to the Department of Electrical Engineering at the Massachusetts Institute of Technology, June 1971, in partial fulfillment of the requirements for the degree of Doctor of Philosophy.
    [Show full text]
  • Contribution of the Conditioning Stage to the Total Harmonic Distortion in the Parametric Array Loudspeaker
    Universidad EAFIT Contribution of the conditioning stage to the Total Harmonic Distortion in the Parametric Array Loudspeaker Andrés Yarce Botero Thesis to apply for the title of Master of Science in Applied Physics Advisor Olga Lucia Quintero. Ph.D. Master of Science in Applied Physics Science school Universidad EAFIT Medellín - Colombia 2017 1 Contents 1 Problem Statement 7 1.1 On sound artistic installations . 8 1.2 Objectives . 12 1.2.1 General Objective . 12 1.2.2 Specific Objectives . 12 1.3 Theoretical background . 13 1.3.1 Physics behind the Parametric Array Loudspeaker . 13 1.3.2 Maths behind of Parametric Array Loudspeakers . 19 1.3.3 About piezoelectric ultrasound transducers . 21 1.3.4 About the health and safety uses of the Parametric Array Loudspeaker Technology . 24 2 Acquisition of Sound from self-demodulation of Ultrasound 26 2.1 Acoustics . 26 2.1.1 Directionality of Sound . 28 2.2 On the non linearity of sound . 30 2.3 On the linearity of sound from ultrasound . 33 3 Signal distortion and modulation schemes 38 3.1 Introduction . 38 3.2 On Total Harmonic Distortion . 40 3.3 Effects on total harmonic distortion: Modulation techniques . 42 3.4 On Pulse Wave Modulation . 46 4 Loudspeaker Modelling by statistical design of experiments. 49 4.1 Characterization Parametric Array Loudspeaker . 51 4.2 Experimental setup . 52 4.2.1 Results of PAL radiation pattern . 53 4.3 Design of experiments . 56 4.3.1 Placket Burmann method . 59 4.3.2 Box Behnken methodology . 62 5 Digital filtering techniques and signal distortion analysis.
    [Show full text]
  • Large Scale Sound Installation Design: Psychoacoustic Stimulation
    LARGE SCALE SOUND INSTALLATION DESIGN: PSYCHOACOUSTIC STIMULATION An Interactive Qualifying Project Report submitted to the Faculty of the WORCESTER POLYTECHNIC INSTITUTE in partial fulfillment of the requirements for the Degree of Bachelor of Science by Taylor H. Andrews, CS 2012 Mark E. Hayden, ECE 2012 Date: 16 December 2010 Professor Frederick W. Bianchi, Advisor Abstract The brain performs a vast amount of processing to translate the raw frequency content of incoming acoustic stimuli into the perceptual equivalent. Psychoacoustic processing can result in pitches and beats being “heard” that do not physically exist in the medium. These psychoac- oustic effects were researched and then applied in a large scale sound design. The constructed installations and acoustic stimuli were designed specifically to combat sensory atrophy by exer- cising and reinforcing the listeners’ perceptual skills. i Table of Contents Abstract ............................................................................................................................................ i Table of Contents ............................................................................................................................ ii Table of Figures ............................................................................................................................. iii Table of Tables .............................................................................................................................. iv Chapter 1: Introduction .................................................................................................................
    [Show full text]
  • An Infrasonic Missing Fundamental Rises at 18.5Hz Christopher D
    Papers & Publications: Interdisciplinary Journal of Undergraduate Research Volume 2 Article 11 2013 An Infrasonic Missing Fundamental Rises at 18.5Hz Christopher D. Lacomba University of North Georgia Steven A. Lloyd University of North Georgia Ryan A. Shanks University of North Georgia Follow this and additional works at: http://digitalcommons.northgeorgia.edu/papersandpubs Part of the Behavior and Behavior Mechanisms Commons, Biological and Chemical Physics Commons, Biological Psychology Commons, Cognition and Perception Commons, Cognitive Neuroscience Commons, Cognitive Psychology Commons, Computational Neuroscience Commons, Dynamic Systems Commons, Investigative Techniques Commons, Mental Disorders Commons, Non-linear Dynamics Commons, Numerical Analysis and Computation Commons, Psychological Phenomena and Processes Commons, Speech and Hearing Science Commons, and the Systems Neuroscience Commons Recommended Citation Lacomba, Christopher D.; Lloyd, Steven A.; and Shanks, Ryan A. (2013) "An Infrasonic Missing Fundamental Rises at 18.5Hz," Papers & Publications: Interdisciplinary Journal of Undergraduate Research: Vol. 2 , Article 11. Available at: http://digitalcommons.northgeorgia.edu/papersandpubs/vol2/iss1/11 This Article is brought to you for free and open access by the Center for Undergraduate Research and Creative Activities (CURCA) at Nighthawks Open Institutional Repository. It has been accepted for inclusion in Papers & Publications: Interdisciplinary Journal of Undergraduate Research by an authorized editor of Nighthawks Open Institutional Repository. The Missing Fundamental (MF) phenomenon creates perceptible tones, which do not actually exist in the audible range, from harmonic series. Sounds that follow a harmonic series are perceptually significant since they stand out from random background noise and frequently represent sounds of biological origin and importance. For example, both animal vocalizations and human speech produce harmonic series of sounds.
    [Show full text]
  • 11 – Music Temperament and Pitch
    11 – Music temperament and pitch Music sounds Every music instrument, including the voice, uses a more or less well defined sequence of music sounds, the notes, to produce music. The notes have particular frequencies that are called „pitch“ by musicians. The frequencies or pitches of the notes stand in a particular relation to each other. There are and were different ways to build a system of music sounds in different geographical regions during different historical periods. We will consider only the currently dominating system (used in classical and pop music) that was originated in ancient Greece and further developed in Europe. Pitch vs interval Only a small part of the people with normal hearing are able to percieve the absolute frequency of a sound. These people, who are said to have the „absolute pitch“, can tell what key has been hit on a piano without looking at it. The human ear and brain is much more sensitive to the relations between the frequencies of two or more sounds, the music intervals. The intervals carry emotional content that makes music an art. For instance, the major third sounds joyful and affirmative whereas the minor third sounds sad. The system of frequency relations between the sounds used in music production is called music temperament . 1 Music intervals and overtone series Perfect music intervals (that cannot be fully achieved practically, see below) are based on the overtone series studied before in this course. A typical music sound (except of a pure sinusiodal one) consists of a fundamental frequency f1 and its overtones: = = fn nf 1, n 3,2,1 ,..
    [Show full text]
  • Impairment of the Missing Fundamental Phenomenon in Individuals with Alzheimer’S Disease: a Neuropsychological and Voxel-Based Morphometric Study
    EXTRA Dement Geriatr Cogn Disord Extra 2018;8:23–32 DOI: 10.1159/000486331 © 2018 The Author(s) Received: November 17, 2017 Published by S. Karger AG, Basel Accepted: December 7, 2017 www.karger.com/dee Published online: February 1, 2018 This article is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 Interna- tional License (CC BY-NC-ND) (http://www.karger.com/Services/OpenAccessLicense). Usage and distribu- tion for commercial purposes as well as any distribution of modified material requires written permission. Original Research Article Impairment of the Missing Fundamental Phenomenon in Individuals with Alzheimer’s Disease: A Neuropsychological and Voxel-Based Morphometric Study a a, b a a Makiko Abe Ken-ichi Tabei Masayuki Satoh Mari Fukuda c d a Hironobu Daikuhara Mariko Shiga Hirotaka Kida a, b Hidekazu Tomimoto a Department of Dementia Prevention and Therapeutics, Graduate School of Medicine, b Mie University, Mie, Japan; Department of Neurology, Graduate School of Medicine, c Mie University, Mie, Japan; Graduate School of Medicine, Mie University, Mie, Japan; d Mie Prefectural Dementia-Related Disease Medical Center, Mie, Japan Keywords Alzheimer’s disease · Dementia · Pitch perception · Missing fundamental phenomenon · Magnetic resonance imaging · Voxel-based morphometry Abstract Background/Aims: The missing fundamental phenomenon (MFP) is a universal pitch percep- tion illusion that occurs in animals and humans. In this study, we aimed to determine wheth- er the MFP is impaired in patients with Alzheimer’s disease (AD) using an auditory pitch per- ception experiment. We further examined anatomical correlates of the MFP in patients with AD by measuring gray matter volume (GMV) on magnetic resonance images via voxel-based morphometric analysis.
    [Show full text]
  • Locomotion-Induced Sounds and Sonations: Mechanisms, Communication Function, and Relationship with Behavior
    Chapter 4 Locomotion-Induced Sounds and Sonations: Mechanisms, Communication Function, and Relationship with Behavior Christopher James Clark Abstract Motion-induced sound is an intrinsic byproduct of essentially all animal behavior. Locomotion-induced sounds that have evolved specialization for commu- nication are termed sonations. The null hypothesis is that locomotion-induced sounds are noncommunicative (adventitious), produced by nonspecialized mor- phology, and are involuntary. A sound is a sonation if it is produced by specialized morphology, or is produced voluntarily. The production of locomotion-induced sound can be examined at two levels: the animal motions (kinematics) that lead to sound production and the physical acoustic mechanism(s) that generate(s) the sound itself. The physical acoustics of locomotion induced sound are diverse, with both aerodynamic and structural mechanisms, including aeroelastic flutter, percussion, stridulation, and presumably many other undescribed mechanisms. There is a direct sound–motion correspondence between aspects of an animal’s motions and ensuing locomotion-induced sounds, especially in the time domain. This correspondence has two implications. One is experimental: sound recordings are a useful and perhaps underutilized source of data about animal locomotion. The second is behavioral: locomotion-induced sounds intrinsically contain information about an animal’s motions (such as wingbeat frequencies) that may be of interest to other animals. Therefore, locomotion-induced sounds are intrinsically suited to be mechanisms by which animals directly evaluate the locomotor performance of other animals, such as during courtship. The sound–motion correspondence is also a constraint. Sonations seem less acoustically diverse than vocalizations. Because they require discrete behaviors to be produced, animals also have somewhat fewer opportunities to produce sonations strategically, and few sonations are frequency-­ modulated.
    [Show full text]
  • Ghost Stochastic Resonance with Distributed Inputs in Pulse-Coupled Electronic Neurons
    PHYSICAL REVIEW E 73, 021101 ͑2006͒ Ghost stochastic resonance with distributed inputs in pulse-coupled electronic neurons Abel Lopera,1 Javier M. Buldú,1,* M. C. Torrent,1 Dante R. Chialvo,2 and Jordi García-Ojalvo1,† 1Departamento de Física i Enginyeria Nuclear, Universitat Politècnica de Catalunya, Colom 11, E-08222 Terrassa, Spain 2Department of Physiology, Northwestern University, Chicago, Illinois 60611, USA ͑Received 29 September 2005; published 8 February 2006͒ We study experimentally the phenomenon of ghost stochastic resonance in pulse-coupled excitable systems, for input signals distributed among different elements. Specifically, two excitable electronic circuits are driven by different sinusoidal signals that produce periodic spikes at distinct frequencies. Their outputs are sent to a third circuit that processes these spiking signals and is additionally perturbed by noise. When the input signals are harmonics of a certain fundamental ͑that is not present in the inputs͒ the processing circuit exhibits, for an optimal amount of noise, a resonant response at the frequency of the missing fundamental ͑ghost frequency͒. In contrast with the standard case in which the signals being directly integrated are sinusoidal, this behavior relies here on a coincidence-detection mechanism. When the input signals are homogeneously shifted in frequency, the processing circuit responds with pulse packages composed of spikes at a frequency that depends linearly on the frequency shift. Expressions for the dependence of the package period and duration on the frequency shift and spike width, respectively, are obtained. These results provide an experimental verification of a recently proposed mechanism of binaural pitch perception. DOI: 10.1103/PhysRevE.73.021101 PACS number͑s͒: 05.40.Ϫa, 05.45.Ϫa I.
    [Show full text]
  • The Sounds of Music: Science of Musical Scales∗ 1
    SERIES ARTICLE The Sounds of Music: Science of Musical Scales∗ 1. Human Perception of Sound Sushan Konar Both, human appreciation of music and musical genres tran- scend time and space. The universality of musical genres and associated musical scales is intimately linked to the physics of sound, and the special characteristics of human acoustic sensitivity. In this series of articles, we examine the science underlying the development of the heptatonic scale, one of the most prevalent scales of the modern musical genres, both western and Indian. Sushan Konar works on stellar compact objects. She Introduction also writes popular science articles and maintains a Fossil records indicate that the appreciation of music goes back weekly astrophysics-related blog called Monday Musings. to the dawn of human sentience, and some of the musical scales in use today could also be as ancient. This universality of musi- cal scales likely owes its existence to an amazing circularity (or periodicity) inherent in human sensitivity to sound frequencies. Most musical scales are specific to a particular genre of music, and there exists quite a number of them. However, the ‘hepta- 1 1 tonic’ scale happens to have a dominating presence in the world Having seven base notes. music scene today. It is interesting to see how this has more to do with the physics of sound and the physiology of human auditory perception than history. We shall devote this first article in the se- ries to understand the specialities of human response to acoustic frequencies. Human ear is a remarkable organ in many ways. The range of hearing spans three orders of magnitude in frequency, extending Keywords from ∼20 Hz to ∼20,000 Hz (Figure 1) even though the sensitivity String vibration, beat frequencies, consonance-dissonance, pitch, tone.
    [Show full text]
  • Part D: Characterizing Sound
    Part D : Characterizing Sound Part D: Characterizing Sound Chapter 36. Parameters and Perception Simple Harmonic Motion of an object is itself not sound, but a vibrating object can be a source of sound. Each of the physical parameters of SHM vibration is closely related to a perceptual characteristic of the sound produced. • The duration of a vibration, how long it lasts, is mainly determines the duration of the resulting sound. • Vibrations with larger amplitudes create sounds that are louder, all other things being equal. • Vibrations with higher frequencies cause sounds that are higher in pitch. • Since period is the reciprocal of frequency, it also relates directly to pitch. A longer period corresponds to a lower pitch. • The initial phase of a vibration has no bearing on the perception of the sound. Since the value of initial phase depends on a choice made by an observer (specifically, the choice of the time origin), it can’t have real-world consequences. Recall from Chapter 21 that phase differences can be physically meaningful, but those will only arise when multiple sounds combine. These relationships are not exclusive, as each of these perceptions (duration, loudness, and pitch) can be influenced by all of the physical parameters except initial phase. For instance, perceived loudness depends slightly on frequency. Nevertheless, the connections above are the principal ones. The characteristics of a sound can vary over time. Over the course of seconds, like notes in a song, you might consider that separate sounds, or one changing sound. On a more rapid scale, the exact variations over the first few milliseconds of a sound, called the attack in some contexts, can be very important for recognizing the source.
    [Show full text]
  • Tu.P3.1 II - 1739 Fundamental Frequencies Were Mistuned by ∆F Hz the Normalized ACF Calculated by In-Phase and Maintaining a Harmonic Relationship
    Multiple Missing Fundamental Phenomenon as a Monaural-Beat Perception Ryota Shimokura a and Yoichi Ando b Graduate School of Science and Technology, Kobe University Now at: a Department of Energy, Nuclear and Environmental Control Engineering, Bologna University [email protected] Now at: bDepartment of Architecture and Civil Engineering, Kumamoto University [email protected] Abstract phenomenon of periodicity pitch or residue pitch has played roles to explain the mechanism of pitch It is found that a monaural beat induced by two missing perception [1][2]. The fundamental frequency is fundamentals of complex tones (A and B) may be absent in the spectrum, however the pitch of it can be perceived, even in the condition without the envelope perceived. Schouten mixed a pure tone 206 Hz on a component of beat. This was realized by mixing the complex tone with fundamental frequency 200 Hz and complex tone A with the missing fundamental frequency indicated that the beat corresponding to 6 Hz did not f0a and the complex tone B with the missing occur [2]. In this experiment, we shall examine the fundamental frequency f0b. When we listen together, missing fundamental phenomenon perceived as a beat, then an additional missing fundamental frequency ∆f (= which was caused by two fundamental frequencies. Two f0b - f0a) may be perceived as the beat. The ∆f was different fundamental frequencies (f0a and f0b) by two changed to 2, 4, 8 and 16 Hz. The total tone mixed with different complex tones may induce an additional two complex tones has the envelope fluctuation fundamental frequency: ∆f (= f0a - f0b).
    [Show full text]
  • A. Acoustic Theory and Modeling of the Vocal Tract
    A. Acoustic Theory and Modeling of the Vocal Tract by H.W. Strube, Drittes Physikalisches Institut, Universität Göttingen A.l Introduction This appendix is intended for those readers who want to inform themselves about the mathematical treatment of the vocal-tract acoustics and about its modeling in the time and frequency domains. Apart from providing a funda­ mental understanding, this is required for all applications and investigations concerned with the relationship between geometric and acoustic properties of the vocal tract, such as articulatory synthesis, determination of the tract shape from acoustic quantities, inverse filtering, etc. Historically, the formants of speech were conjectured to be resonances of cavities in the vocal tract. In the case of a narrow constriction at or near the lips, such as for the vowel [uJ, the volume of the tract can be considered a Helmholtz resonator (the glottis is assumed almost closed). However, this can only explain the first formant. Also, the constriction - if any - is usu­ ally situated farther back. Then the tract may be roughly approximated as a cascade of two resonators, accounting for two formants. But all these approx­ imations by discrete cavities proved unrealistic. Thus researchers have have now adopted a more reasonable description of the vocal tract as a nonuni­ form acoustical transmission line. This can explain an infinite number of res­ onances, of which, however, only the first 2 to 4 are of phonetic importance. Depending on the kind of sound, the tube system has different topology: • for vowel-like sounds, pharynx and mouth form one tube; • for nasalized vowels, the tube is branched, with transmission from pharynx through mouth and nose; • for nasal consonants, transmission is through pharynx and nose, with the closed mouth tract as a "shunt" line.
    [Show full text]