BeaqleJS: HTML5 and JavaScript based Framework for the Subjective Evaluation of Audio Quality

Sebastian Kraft and Udo Z¨olzer Department of Signal Processing and Communications Helmut-Schmidt-University Holstenhofweg 85, 22043 Hamburg, Germany [email protected]

Abstract published under the GPLv3 open source license 1 Subjective listening tests are an essential tool for and its source code is available on GitHub . the evaluation and comparison of audio processing The following section 2 will first introduce algorithms. In this paper we introduce BeaqleJS, some background information about listening a framework based on HTML5 and JavaScript to tests and common standards in general. After- run listening tests in any modern web browser. This wards, the BeaqleJS framework is described in allows an easy distribution of the test environment to section 3 and section 4 outlines advanced usage a significant amount of participants in combination scenarios like modifying or implementing new with simple configuration and good expandability. test schemes as well as server side evaluation Keywords and data collection. Finally section 5 will give a conclusion and outlook. listening test, subjective audio evaluation, HTML5, JavaScript 2 Listening test standards and basics 1 Introduction The difficulty in setting up a listening test comes from the fact that humans are rarely ob- Frequently used physical measures to judge the jective in their judgements. Therefore, the chal- quality of audio signals, like the signal to noise lenge is to design a test environment that mini- ratio or signal distortion, do not correlate well mizes external influences and yields non-biased with the perception of quality by the human results. To avoid mistakes it is advised to stick hearing system. Therefore, listening tests, also close to standardised instructions and test pro- named subjective audio evaluations, play an im- cedures as they are for example defined by the portant role in the comparison of signal process- ITU in [1][2][3]. ing algorithms like audio effects and codecs. Test items should be presented in random or- The setup of a test environment and the se- der together with neutral names avoiding any lection of items under test is crucial to yield sig- association to the underlying algorithms. If sev- nificant and non-biased results. Some guidance eral different algorithms are compared in one and standards can be found for example in the test, the corresponding items should always ap- International Telecommunication Union (ITU) pear at a random position to prevent that the recommendations and in particular in [1]. Still, listeners recognize or learn the connection be- one of the biggest problems is to address an ade- tween a rating and its item position. quate number of qualified participants. Closely It is also necessary to find a way to judge the connected is the problem of distributing the test ability of the participants to understand the test environment to the various platforms of the par- procedure or to even recognize if they are able ticipants and how the results could be merged to perceive any differences between the items at and evaluated afterwards. all. For this purpose, a hidden reference and In this paper BeaqleJS (browser based an anchor signal can be mixed among the test evaluation of audio quality and comparative items. In valid test results, the participants listening environment) is presented, which is a should always rate the hidden reference with the framework to easily setup and run listening tests same quality as the visible reference. In con- in any modern web browser. To achieve this, trary, an anchor signal is an obviously bad test BeaqleJS purely relies on open like HTML5 and JavaScript, without the need 1GitHub is a source code hosting platform using the of further browser plugins or extensions. It is git version control system http://www.github.com item, for example heavily lowpass filtered, that Local Browser, index. is expected to always catch the worst rating and HTML framework and DOM will set the bottom end of the scale. The experience of the participants can have