<<

Classifying soundscapes using a multifaceted

Christopher Trudeau McGill University, School of Information Studies, Montréal, Canada. Catherine Guastavino McGill University, School of Information Studies, Montréal, Canada.

Summary Multiple organizational principles have been proposed to classify information in the field of information studies. Hierarchical classification systems are the most familiar, but they require relational consistency between the different levels and mutual exclusivity across classes. Attempts to organize soundscapes within hierarchical classification are challenging for these two reasons. Listening test results suggest that people cross-categorize everyday sounds according to the sound source, the action generating the sound, and/or the context in which these sounds are perceived. Moreover, soundscapes are complex sound scenes with multiple, concurrent sound sources. This paper proposes a faceted taxonomy that reflects the way people cross-categorize sounds. Five facets are used in the taxonomy, each reflecting a fundamental characteristic of the sound: agent, source, action, context, and acoustic features. Within some facets, a of terms can be established, and thus the facets must allow for a hierarchical relationship between the terms. This taxonomy will be implemented using a relational database (MySQL) and a web interface for users to navigate the content and structure of the taxonomy. The proposed database will be populated using existing sound datasets, such as freesound.org. In this taxonomy, a soundscape can be described as a set of sounds, eliminating the challenge of isolating a soundscape to one placement in a taxonomy. We will analyze the free format verbal descriptions accompanying each sound in terms of the facets of the proposed taxonomy, and explore new facets from emerging categories. Thus, the proposed taxonomy supports the ongoing efforts to standardize and report soundscape research by offering a way to systematically describe a given soundscape.

PACS no. 43.50.Qp, 43.66.Lj

1. Introduction1 or the action (e.g. someone is mad and slammed the door) or based on the agent (e.g. Joe left). is a critical process to make sense of This paper proposes a faceted taxonomy of sounds the world around us by dividing it into meaningful to account for cross-categorization. Here, the facets categories. Everyday sound categorization allows represent the categorization principles that people us to make inferences about the presence of sound use, while allowing for a supplementary sources, agents and actions and subsequently guide hierarchical structure. The goal of the proposed action (e.g. avoid an approaching car or attend to a taxonomy is to support ongoing research on crying baby). everyday sounds and soundscapes, as well as to Classification schemes have been proposed to provide a toolkit that can assist urban planners in model categorization principles that emerge from their consideration of the sonic dimension. empirical and laboratory research. However, current taxonomies have some difficulty representing cross-categorization: our ability to categorize sounds differently according to the situation to infer particular information. For example, the sound of a door slamming, can be categorized based on the source (e.g. door sound),

Copyright © 2018 | EAA – HELINA | ISSN: 2226-5147 - 2487 - All rights reserved Euronoise 2018 - Conference Proceedings

high-level semantic classes (e.g. voice) and then 1.1. Literature review specific sounds (e.g. laughter, an action, or Everyday sound categorization has garnered children, a source). increased research attention within the ecological Taxonomies for more complex sound scenes have approach to auditory perception [1] and in the field also been proposed. Specifically, Brown, Kang and of soundscape research. In contrast to artificial, Gjestland proposed a taxonomy that includes synthetic sounds, everyday sounds are defined as different categorization principles at different sounds occurring in real life environments [2]; they levels [13]. They first distinguish by location – are also referred to as environmental sounds in the indoor or outdoor – and then by type – urban, rural, literature (e.g. [3], [4], [5]) or domestic sounds for wilderness or underwater. The third level in the everyday sounds typically heard inside the home taxonomy is a function of whether or not the sound [6]. A variety of everyday sound categorization is created by human activity. Finally, the lowest principles have been reported in the literature (e.g. branches and leaves offer some general and specific [5], [7], [6], [8], [9]). There is converging evidence sound sources. that people categorize everyday sounds based on sound source, the agent or the action producing 1.2. Review of taxonomies sounds (see [10] for a review). A taxonomy is a semantic classification scheme As well as cross-categorizing, we also build that provides a knowledge map of its domain. In within these categories. Results from order to meet these criteria, a taxonomy should be free sorting tasks of everyday sound recordings comprehensive, predictable and navigable [14]. indicated different levels of abstraction in everyday There are many types of taxonomies available, each sound categorization, consistent with Rosch’s with their own strengths and challenges. The theory of natural categories grouped into classification systems analyzed in the above superordinate level (e.g. furniture), basic level (e.g. literature review are all hierarchical taxonomies. chair), and subordinate level (e.g. office chair). This type of taxonomy is generally pre-coordinated Research on the categorization of urban and enumerative, that is, the domain is mapped soundscapes indicates that complex sound scenes before the items are classified and the mapping is are also subject to cross-classification. A study on exhaustive of the entire domain. Furthermore, soundscape conceptualization found that city users hierarchical taxonomies have the following four categorized urban sounds according to four main characteristics: inclusiveness, relational categories: human sounds, traffic sounds, natural consistency, inheritance and mutual exclusivity sounds and music [11]. Studies using free-sorting [14]. Inclusiveness implies that parent categories tasks of soundscape recordings indicate that include all of the elements of the subcategories. categories are based on the presence or absence of Relational consistency holds that the relationship human activity [10]. between each level is of exactly the same kind. In While there is a general convergence in other words, there is a consistency in the divisions categorization research toward a set of fundamental throughout the entire taxonomy. Inheritance holds factors, there is still a need for a standard that the subordinate categories inherit the attributes classification scheme of sounds. Gaver proposed of the superordinate categories. Finally, mutual categories of everyday sounds according to the exclusivity holds that a single item must be placed state of the sound source – liquid, gas, solid or unambiguously in one branch of the hierarchy. mechanical [7] at a superordinate level (e.g. liquid), In particular, the principle of mutual exclusivity is then by actions at a basic level (e.g. leak), then by difficult to enforce for sounds and soundscapes. source at a subordinate level (e.g. faucet). Guyot et There is abundant evidence in the literature al., 1997, focusing on domestic sounds, found reviewed that people cross-classify sounds similar evidence that people cross-classify based on according to several factors. Outside of a laboratory the excitation (e.g. mechanical or electrical), the setting, sound scenes are complex and typically action (e.g. rubbing, scratching) and the source involve multiple sound sources concurrently. To (e.g. dishes, Velcro). Salamon, Jacoby and Pablo some extent, current taxonomies try to account for Bello, recently proposed a taxonomy of urban cross-classification by using sound sources at one sounds including a combination of sound sources level of the taxonomy and sound events at another and sound-generating actions [13]. Sounds within level, or even a mixture of these at the same level. an urban acoustic environment are first divided The result is a taxonomy that presents the user with according to four categories: human, nature, difficulties during the classification process. mechanical and music. Subordinate to these are Furthermore, the size and potential for unending

- 2488 - Euronoise 2018 - Conference Proceedings

growth mean that the taxonomy is ill suited to act single taxonomy complicates the process of as a knowledge map. The field of information populating the classifier. Second, any given science offers a taxonomy that is capable of acoustic environment can be described by its set of meeting the specific challenges posed by the sound events. Thus, we can fully describe any given classification of sounds. sound scene by listing its component sounds. The research on sound categorization can be used 2. Faceted taxonomies to map the knowledge domain and inform our choice of facets. Thus, the proposed facets for Originally proposed by S.R. Ranganathan, faceted isolated sound events are: agent, source, action and classification breaks with the pre-coordinated and location. The agent is the human or animal causing enumerative model of hierarchical classification the activity that generates the sound (e.g. child). [15]. Facets are the fundamental dimensions (or The source is the physical material whose attributes) of the item that is being classified [14]. vibrations are at the origins of the sound (e.g. toy). In order to avoid confusion, each facet needs to be The action (e.g. falling) is the cause of the orthogonal to every other facet in the taxonomy – excitation of the physical material. Finally, the that is, the attributes of one facet cannot overlap location is a description of the immediate context in with those of another. For example, a physical item which the sound event occurs (e.g. playroom). might have colour and texture as two separate Each facet will be represented as a single table of a facets because there is no conceptual overlap. As MySQL database, with the arrangement of the the knowledge domain gets smaller, the facets can items by primary key. The names of the tables be more specific and more detailed. mostly correspond to the facet names: location, Prior to the development of the digital environment, agent, src (source) and act (action). The the opportunities for implementing faceted presentation of the sounds and the taxonomic taxonomies were limited. In particular, libraries structure can be done through a web interface, need to have a unique location for each resource, allowing some flexibility to the user to engage and even when there are multiple copies of these. interact with the data. However, with the advent of digital technology, Implementation into a database requires that we faceted taxonomies are able to leverage the use of determine the entities that the database needs and to aid in implementation [14]. As well, what relationship these entities have with each replication costs are nil, making it possible to list other. Agent, source, action, and location entities the same item in several locations. Therefore, can each be attached to multiple sounds. However, digital technology makes the use of a faceted each sound can only be attached to one of these. taxonomy easier. Therefore, we have a one-to-many relationship Through the use of multiple facets, a between the sound event entity and the facet comprehensive description of a single sound can be entities. As such, each facet is simply created as a created. In this way, the challenge of classifying separate entity in the database. Finally, one entity sounds that are being cross-categorized is representing the sound event is used to connect minimized because these factors of categorization each of the facets of the taxonomy together. can be turned into facets. The database contains two entities that help to track The drawback of a faceted taxonomy is that it is information about the sounds, but that do not inherently difficult to visualize. When the number represent facets: setting and origin. The setting of facets is limited to two (sometimes called a entity is used to distinguish between the acoustic matrix) or even three, it is possible to visualize the environments that are defined by the Brown, Kang content of a facet in 2-space or 3-space. Taxonomies with more than three facets present significant challenges in this area. That said, web technology offers a platform that can be used to visualize the content of a faceted taxonomy. 2.1. Implementation Given the preceding discussion, it is now possible to propose a faceted taxonomy for urban sounds. The subject of this taxonomy will be sound events and not soundscapes for two important reasons. First, combining both sounds and soundscapes in a

- 2489 - Euronoise 2018 - Conference Proceedings

Figure 1: Database Entity-Relationship Diagram. The crow’s feet indicate the direction of the one-to-many relationship. The primary keys of each table are represented by the yellow key icons. The foreign keys are identified through a combination of foreign table name and foreign field, in the format: table_name$field_name. and Gjestland taxonomy [13]. Therefore, it is the content of the database, as well as defining restricted largely to the following entries: urban, relationships within these broad categories. suburban, rural, wilderness and underwater. Three tables implement recursive relationships to The setting table is created separately from location provide the described hierarchical organization for for reasons that have more to do with database normalization than with the faceted taxonomy. Database design principles hold that redundant entries should not exist. For example, we could have sounds produced on the road, and thus our database will require a location entry ‘road’. However, if location contains the setting as a top- level item, then we need to consider the urban road as distinct from the suburban road, causing there to be two or more entries for the same item, road. The table labelled origin is added to the database to track the provenance of the sound events that are described. This includes previous empirical studies and online sound repositories.

2.2. Hierarchies within facets A hierarchy of items can be defined for a specific Figure 2: Example of hierarchies for location, source facet. These hierarchies provide further structure to and action facets.

- 2490 - Euronoise 2018 - Conference Proceedings

Figure 3: Modelling sound scenes in the database by grouping multiple sounds. the facets of each sound. These tables are: source, working in this domain. For example, regulations action and location. For example, the sound of an and planning documents use a combination of engine can either be listed under engine or under sound sources and actions to carry out noise vehicle. Through the use of a recursive relationship, reduction strategies. These documents can be we can define a hierarchical, broader-narrower term mined to extract details and then compared against relationship between the engine and the vehicle. the database to provide a categorical assessment of Thus, we can define a hierarchical relationship coverage and thoroughness. within each facet that provides a better description In order to fully benefit from the implementation of of the sounds. this taxonomy, more work is required to add In the early stages, the database will be validated acoustical and hedonic dimensions. However, once using the taxonomies proposed by previous these are fully implemented, the database could research into the classification of sounds. potentially be used to predict judgements of Eventually, the database will be further populated soundscapes and sound scenes. with sounds taken from the online repository, freesound.org. This site provides a large number of 4. Conclusions sounds with textual descriptions that can be mined to fill the database tables. There is evidence that people cross-categorize sounds according to a number of factors. Based on 3. Discussion these results, researchers have proposed a number of classification systems for sounds, some of which A faceted taxonomy approach to sound use a combination of sound source and sound event. classification has applications to research The hierarchical nature of these taxonomies makes generally, but also practical uses for soundscape the actual classification process challenging. planning. Given the limitations of hierarchical taxonomies, Soundscape research uses different conceptual this paper proposes a faceted taxonomy that reflects approaches, making it difficult to directly compare the way people cross-categorize sounds. Four the results that are obtained. A faceted taxonomy facets are used in the current taxonomy: agent, allows for common framework for the comparison source, action, and location. Other facets that play of this research. For example, more research is a role in categorization include hedonic factors, necessary to understand why certain categorization familiarity, and acoustic descriptors. The current principles are considered to be more important than taxonomy does not consider these properties, and others. The proposed taxonomy and accompanying their addition is an important element in future database can be used to examine patterns in sound phases. The taxonomy has been implemented using categorization. an open-source relational database management The benefits to soundscape planners depend on the system (MySQL). A web interface will be goals of the project. In general, the taxonomy can implemented shortly to allow users to navigate the be used to compare the areas that are included in content and structure of the taxonomy. sound and noise documents. This provides the opportunity for an assessment toolkit for anyone

- 2491 - Euronoise 2018 - Conference Proceedings

References [1] J.G. Neuhoff: Introduction and history. In: J.G. Neuhoff, (ed.) Ecological Psychoacoustics, Brill, Leiden, 2004. [2] J.A. Ballas: Common factors in the identification of an assortment of brief everyday sounds. J. Exp. Psychol. Hum. Percept. Perform. 19 (1993) 250–267. [3] C. Guastavino: Categorization of environmental sounds. Can. J. Exp. Psychol. 61(1), 54–63 (2007). [4] B. Gygi, G. R. Kidd, & C. S. : Similarity and categorization of environmental sounds. Perception & Psychophysics, 69, 2007 839–855. [5] N.J. VanDerveer: Ecological acoustics: human perception of environmental sounds. Ph.D. thesis, Cornell University, 1979. Unpublished Dissertation [6] F. Guyot, M. Castellengo, B. Fabre: Étude de la catégorisation d’un corpus de bruits domestiques. In: Dubois, D. (ed.) Catégorisation et cognition: de la perception au discours, Kimé, Paris, 1997. [7] W.W. Gaver: What in the world do we hear?: an ecological approach to auditory event perception. Ecol. Psychol. 5, 1993, 1–29. [8] D. Dubois: Categories as acts of meaning: The case of categories in olfaction and audition. Cognitive Science Quarterly 1 (2000) 35–68. [9] O. Houix, G. Lemaitre, N. Misdariis, P. Susini, & I. Urdapilleta: A lexical analysis of environmental sound categories. Journal of Experimental Psychology: Applied, 18 (2012) 52-80. [10] C. Guastavino: Everyday Sound Categorization. In Computational Analysis of Sound Scenes and Events. Cham: Springer, 2018. [11] C. Guastavino: The ideal urban soundscape: Investigating the sound quality of French cities. Acta Acustica United with Acustica, 92, 2006, 945-951. [12] J. Salamon, C. Jacoby, and J.P. Bello: A dataset and taxonomy for urban sound research, Proc. 22nd ACM International Conference on Multimedia, 2014. [13] A. L. Brown, J. Kang, & T. Gjestland: Towards standardization in soundscape preference assessment. Applied Acoustics, 72, 2011, 387-392. [14] P. Lambe: Organising knowledge: Taxonomies, knowledge and organisational effectiveness. Chandos Publishing: Oxford, 2007. [15] S. R. Ranganathan: Colon classification. Madras: Madras Library Association, 1957. [16] L. Spiteri: A simplified model for facet analysis: Ranganathan 101. Canadian Journal of Information and Library Science, 23, 1998, 1-30.

- 2492 -