Web Audio Evaluation Tool: a Browser-Based Listening Test Environment

WEB AUDIO EVALUATION TOOL: A BROWSER-BASED LISTENING TEST ENVIRONMENT Nicholas Jillings Brecht De Man David Moffat Joshua D. Reiss [email protected], fb.deman, d.j.moffat, [email protected] Centre for Digital Music, Queen Mary University of London ABSTRACT Various listening test design tools are already available, see Table1. A few other listening test tools, such as OPA- Perceptual evaluation tests where subjects assess certain QUE [16] and GuineaPig [17], are described but not avail- qualities of different audio fragments are an integral part able to the public at the time of writing. of audio and music research. These require specialised Many are MATLAB-based, useful for easily processing software, usually custom-made, to collect large amounts and visualising the data produced by the listening tests, of data using meticulously designed interfaces with care- but requiring MATLAB to be installed to run or - in the fully formulated questions, and play back audio with rapid case of an executable created with MATLAB - at least cre- switching between different samples. New functionality ate the test. Furthermore, compatibility is usually limited in HTML5 included in the Web Audio API allows for in- across different versions of MATLAB. Similarly, Max re- creasingly powerful media applications in a platform in- quires little or no programming background but it is pro- dependent environment. The advantage of a web applica- prietary software as well, which is especially undesirable tion is easy deployment on any platform, without requiring when tests need to be deployed at different sites. More any other application, enabling multiple tests to be eas- recently, BeaqleJS [12] makes use of the HTML5 audio ily conducted across locations. In this paper we propose capabilities and comes with a number of predefined, es- a tool supporting a wide variety of easily configurable, tablished test interfaces such as ABX and MUSHRA [18]. multi-stimulus perceptual audio evaluation tests over the BeaqleJS provides a number of similar features including web with multiple test interfaces, pre- and post-test sur- saving of test data to a web server. The main difference veys, custom configuration, collection of test metrics and is that with BeaqleJS, the configuration is done through other features. Test design and setup doesn’t require pro- writting a JavaScript file holding a JavaScript Object of the gramming background, and results are gathered automati- notation. Instead our presented system uses the XML doc- cally using web friendly formats for easy storing of results ument standard, which allows configuration outside of a on a server. web-centric editor. The results are also presented in XML again allowing 3rd party editors and programs to easily ac- 1. INTRODUCTION cess. Finally, the presented system does not require web access to run, instead being deployed with a Python server Perceptual evaluation of audio plays an important role in script. This is particularly useful in studios where ma- a wide range of research on audio quality [1,2], sound chines may not, by design, be web connected, or use in synthesis [3,4], audio effect design [5], source separation locations where web access is limited. [6,7], music and emotion analysis [8,9], and many oth- A browser-based perceptual evaluation tool for audio has ers [10]. a number of advantages. First of all, it doesn’t need any other software than a browser, meaning deployment is very Table 1. Available audio perceptual evaluation tools easy and cheap. As such, it can also run on a variety of de- Name Language Ref. vices and platforms. The test can be hosted on a central APE MATLAB [11] server with subjects all over the world, who can simply go BeaqleJS HTML5/JS [12] to a webpage. This means that multiple participants can HULTI-GEN Max [13] take the test simultaneously, potentially in their usual lis- MUSHRAM MATLAB [6] tening environment if this is beneficial for the test. Natu- Scale MATLAB [14] rally, the constraints on the listening environment and other WhisPER MATLAB [15] variables still need to be controlled if they are important to the experiment. Depending on the requirements a survey or a variety of tests preceding the experiment could establish whether remote participants and their environments are ad- Copyright: c 2015 Nicholas Jillings et al. This is an equate for the experiment at hand. open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported License, which permits unre- The Web Audio API is a high-level JavaScript Applica- stricted use, distribution, and reproduction in any medium, provided the original tion Programming Interface (API) designed for real-time author and source are credited. processing of audio inside the browser through various processing nodes 1 . Various web sites have used the Web in terms of any subjective property, as well as a comment Audio API for creative purposes, such as drum machines box for every marker, and any extra text boxes for extra and score creation tools 2 , others from the list show real- comments. The reason for such an interface, where all time captured audio processing such as room reverberation stimuli are presented on a single rating axis (or multiple tools and a phase vocoder from the system microphone. axes if multiple subjective qualities need to be evaluated), The BBC Radiophonic Workshop shows effects used on is that it urges the subject to consider the rating and/or famous TV shows such as Doctor Who, being simulated in- ranking of the stimuli relative to one another, as opposed side the browser 3 . Another example is the BBC R&D per- to comparing each individual stimulus to a given reference, sonalised compressor which applies a dynamic range com- as is the case with e.g. a MUSHRA test [18]. As such, it pressor on a radio station that dynamically adjusts the com- is ideal for any type of test where the goal is to carefully pressor settings to match the listener’s environment [19]. compare samples against each other, like perceptual evalu- In contrast with the tools listed above, we aim to pro- ation of different mixes of music recordings [21] or sound vide an environment in which a variety of multi-stimulus synthesis models [4], as opposed to comparing results of tests can be designed, with a wide range of configurabil- source separation algorithms [6] or audio with lower data ity, while keeping setup and collecting results as straight- rate [18] to a high quality reference signal. forward as possible. For instance, the option to provide The markers on the slider at the top of the page are po- free-text comment fields allows for tests with individual sitioned randomly, to minimise the bias that may be in- vocabulary methods, as opposed to only allowing quantita- troduced when the initial positions are near the beginning, tive scales associated to a fixed set of descriptors. To make end or middle of the slider. Another approach is to place the tool accessible to a wide range of researchers, we aim the markers outside of the slider bar at first and have the to offer maximum functionality even to those with little or subject drag them in, but the authors believe this doesn’t no programming background. The tool we present can set encourage careful consideration and comparison of the dif- up a listening test without reading or adjusting any code, ferent fragments as the implicit goal of the test becomes to provided no new types of interfaces need to be created. audition and drag each fragment in just once, rather than to Specifically, we present a browser-based perceptual eval- compare all fragments rigorously. uation tool from which any kind of multiple stimulus audio See Figure1 for an example of the interface. evaluation tool where subjects need to rank, rate, select, or comment on different audio samples can be built. We also 3. ARCHITECTURE include an example of the multiple stimulus user interface included with the APE tool [11], which presents the sub- The tool uses entirely client side processing utilising the ject with a number of axes on which a number of mark- new HTML5 Web Audio API, supported by most major ers, corresponding to audio samples, can be moved to re- web browsers. The API allows for constructing audio pro- flect any subjective quality, as well as corresponding com- cessing elements and connecting them together to produce ment boxes. However, other graphical user interfaces can a high quality, real time signal process to manipulate audio be put on top of the engine that we provide with minimal streams. The API supports multichannel processing and or no modifications. Examples of this are the MUSHRA has an accurate playback timer for precise, scheduled play- test [18], single or multiple stimulus evaluation with a two- back control. The API is controlled through the browser dimensional interface (such as valence and arousal dimen- JavaScript engine and is therefore highly configurable. Pro- sions), or simple annotation (using free-form text, check cessing is all performed in a low latency thread separate boxes, radio buttons or drop-down menus) of one or more from the main JavaScript thread, so there is no blocking audio samples at a time. In some cases, such as method due to real time processing. of adjustment, where the audio is processed by the user, or The web tool itself is split into several files to operate: AB test, where the interface does not show all audio sam- • index.html: The main index file to load the scripts, ples to be evaluated at once [20], the back end of the tool this is the file the browser must request to load.

Load more