Tarsos Manual

Joren Six
Royal Academy of Fine Arts & Royal Conservatory, University College Ghent,
Hoogpoort 64, 9000 Ghent - Belgium
March 1, 2012

Contents

1 Introduction
  1.1 What is Tarsos?
  1.2 Organization of this Manual
2 Getting Started
  2.1 Requirements
  2.2 Installation
  2.3 Detecting a Tone Scale With Tarsos
  2.4 Scala Files and Tarsos
  2.5 Export Results
    2.5.1 Annotations
    2.5.2 Pitch Histogram
    2.5.3 Pitch Class Histogram
    2.5.4 Pitch Class Data
3 Advanced Use
  3.1 Connecting a MIDI Keyboard
  3.2 External Pitch Detectors
  3.3 Alternative Peak Detection Schemes
4 Processing Datasets with Tarsos
  4.1 Directory and file naming structure
  4.2 Command line applications
  4.3 Scripting Tarsos
    4.3.1 The Tarsos API
    4.3.2 Example scripts
A Pitch, Pitch Interval & Pitch Ratio Representation
  A.1 Pitch & Pitch Interval Representation
  A.2 Pitch Ratio Representation
  A.3 Conclusion
B Maqams

Chapter 1 Introduction

This is the manual for Tarsos, a modular software platform to extract and analyze pitch organization in musical audio. Tarsos can be used to automatically detect the tone scale of most music. The manual is geared towards:

• Musicologists: to identify tone scales in musical pieces, either through detailed, manually assisted analysis of one piece or through automatic analysis of a large number of pieces.
• Musicians: to improve their intonation.
• Composers: to experiment with (micro)tonality.

1.1 What is Tarsos?

Tarsos is a modular software platform to extract and analyze pitch and scale organization in music, especially aiming at the analysis of non-Western music. Tarsos aims to be a user-friendly, graphical tool to explore tone scales and pitch organization in music of the world. With Tarsos, pitch annotations are extracted from an audio signal and then processed to form musicologically meaningful representations.
These representations cover more than the typical Western 12 pitch classes, since the octave is represented at a fine-grained resolution of 1200 cents.

The Tarsos API creates opportunities to analyse large sets of - ethnic - music automatically. With the Application Programmers Interface, tasks can be automated by programming scripts. For examples see section 4.3.1. The graphical user interface can be used for detailed, manually adjusted analysis of specific songs. Section 2.3 contains details on how to work with the interface.

Several output modalities make Tarsos an interesting tool for musicological analysis, educational purposes and even for artistic productions. The different output options are listed in section 2.5.

Tarsos is open source and runs on any recent Java Runtime. JRE (Java Runtime Environment) 5 is supported; version 6 or higher is advised. Tarsos is available on the Tarsos website¹ and is developed at the Royal Academy of Fine Arts & Royal Conservatory, University College Ghent, Belgium². To cite our work please refer to [8]:

    @inproceedings{six2011tarsos,
      author    = {Joren Six and Olmo Cornelis},
      title     = {{T}arsos - a {P}latform to {E}xplore {P}itch {S}cales in {N}on-{W}estern and {W}estern {M}usic},
      booktitle = {Proceedings of the 12th ISMIR Conference},
      year      = {2011}
    }

1.2 Organization of this Manual

This text is structured as follows: after this introduction, the next chapter contains a down-to-earth explanation of how to get started. It features some basic use cases. More advanced use is explained in the following chapter. In the final chapter the graphical interface is left behind: it covers how to process large sets of audio from the command line or by writing scripts yourself.

Text that is present somewhere on the GUI (Graphical User Interface) is represented by a monospaced font. E.g. the File-Open... menu. If you are reading this manual digitally, the URLs used throughout this text are clickable, e.g.
the Tarsos website³; the full URL is embedded as a footnote, for reading on paper. Bold is used to give a quick summary of a paragraph: the TL;DR. E.g. typographical conventions. Source code is embedded as in Listing 1.1.

Listing 1.1: Source code example

    if (true) { Console.println("Source") }

¹ http://tarsos.0110.be
² http://cons.hogent.be
³ http://tarsos.0110.be

Chapter 2 Getting Started

2.1 Requirements

Tarsos is programmed in Java. To run Tarsos you need a recent Java Runtime Environment (JRE) on your computer. JRE version 5.0 or newer is required; JRE version 6 is advised. To check if Java is installed on your machine, use the tools provided on the Java website¹. There you can also find which version of Java is installed on your machine. If Java is not installed, you can find installation instructions for different operating systems on the same website.

Apart from a Java Runtime, there are few other requirements. Tarsos should work on any operating system and is regularly tested on Ubuntu 11.04, Windows XP, Windows 7 and Mac OS X 10.7. As is to be expected, analysis of large files is faster on a more recent, more powerful computer.

2.2 Installation

To run Tarsos you need a recent Java Runtime Environment (JRE) on your computer. See section 2.1 to check if you have Java installed or for installation instructions.

The installation procedure for Tarsos is straightforward: you need to download one executable file (a JAR file) and double-click it. The executable can be found on the Tarsos website². If double-clicking does not work, please check if Java is properly installed on your system.

Linux or Unix systems using the PulseAudio sound system do not cooperate very well with the Sun (Oracle) Java 6 Runtime Environment. To alleviate this annoyance it is recommended to install the PulseAudio to Java bridge.
Documentation on how to get PulseAudio support³ for Sun Java 6 on an Ubuntu system with an AMD64 processor architecture is available. The procedure for other processor architectures or other Unix-like operating systems is similar.

¹ http://java.com
² http://tarsos.0110.be/attachment/tarsos.jar
³ http://tarsos.0110.be/artikels/lees/PulseAudio_Support_for_Sun_Java_6_on_Ubuntu

2.3 Detecting a Tone Scale With Tarsos

Once Tarsos is correctly installed and up and running, you can start detecting tone scales. The following procedure documents how you can find the tone scale of a piece of music. A more formal description of this task is: how to detect the most commonly used pitch classes within a piece of music. This set of pitch classes can be a subset of the complete scale.

The procedure is based on peak detection on a pitch class histogram, which is only useful for music with pitch organized in octaves. Music without octave equivalence is much less common but can be analysed using the pitch histogram, which contains the same information, not reduced to one octave. In [3] the following is stated about pitch relationships and octave equivalence:

    "... the use of discrete pitch relationships, as well as the concept of octave equivalence seem, while not universal in early and prehistoric music (Nettl 1956; Sachs 1962), rather common to current musical systems (Burns 1999)."

To be able to interpret the results, one needs to fully understand both the limitations of the pitch detection algorithm used and how a pitch (class) histogram is constructed. Knowledge about the music under analysis is also a boon.

To execute this task you need an audio file with an interesting tone scale. An example audio file with a tone scale based on an octave division in 10 equal parts of 120 cents⁴ can be found here⁵. Examples using this file return throughout this text.

1. Using the File-Open... menu to choose a file is the first step to detect a tone scale.
Tarsos can also open audio files by drag and drop. The audio can be in almost any format. In the background FFmpeg⁶ is used to convert audio to WAV. This is indicated by 1 in Figure 2.1.

2. The second step is simple: wait until the file is analysed by one or more pitch detection algorithms. By default a platform-independent implementation of YIN [1] is used; it should 'just work'. See section 3.2 if you want to try out other pitch detectors. After the file is analysed it starts to play and a pitch class histogram emerges. This histogram represents how many times each pitch class is detected. This is indicated by 2 in Figure 2.1.

⁴ See appendix A for more information about the cents unit. There you can also find conversion formulas and pitch representation conventions used within Tarsos.
⁵ http://tarsos.0110.be/attachment/cons/200/bach_BWV_1007_120.mp3
⁶ FFmpeg is used by Tarsos as a cross-platform solution to convert audio. It has support for MP3, FLAC, Monkey's Audio, WMAv2, Theora, MusePack, etc. The supported audio formats depend on which version of FFmpeg is used by your system.

3. The third step is to detect the pitch classes that are most frequently used. In practice this means peak detection on the pitch class histogram. This can be done automatically and is visualised in Figure 2.1 as step 3. The sliders modify parameters of the peak detection strategy. Immediate visual feedback shows the detected locations in the pitch class histogram: the detected pitch classes. Manual adjustment is sometimes required and is therefore possible:

• To add a pitch class, hold down alt while hovering the mouse pointer over the pitch class histogram. Pressing the left mouse button while hovering fixes it to the current location.
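
The pitch class histogram and the peak detection described in steps 2 and 3 can be illustrated with a small, self-contained Java sketch. This is not the Tarsos API and not necessarily the peak detection strategy Tarsos uses: the class name PitchClassSketch, its methods, the reference frequency for 0 cents (assumed here to be MIDI note 0 with A4 at 440 Hz) and the simple windowed local-maximum rule are all assumptions made for illustration. The window and minCount parameters loosely play the role of the sliders in the interface.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: hypothetical names, not the Tarsos API.
final class PitchClassSketch {

    // Assumption: 0 cents corresponds to MIDI note 0 (about 8.18 Hz), with A4 = 440 Hz.
    static final double REF_HZ = 440.0 * Math.pow(2.0, -69.0 / 12.0);

    // Folds a frequency in Hz into one octave: a pitch class in cents, in [0, 1200).
    static double pitchClassInCents(double hz) {
        double absoluteCents = 1200.0 * Math.log(hz / REF_HZ) / Math.log(2.0);
        double folded = absoluteCents % 1200.0;
        return folded < 0 ? folded + 1200.0 : folded;
    }

    // Counts how many times each pitch class occurs, using 1200 bins of one cent.
    static int[] histogram(double[] pitchesInHz) {
        int[] bins = new int[1200];
        for (double hz : pitchesInHz) {
            if (hz <= 0) {
                continue; // skip unpitched or invalid estimates
            }
            int bin = (int) Math.round(pitchClassInCents(hz)) % 1200;
            bins[bin]++;
        }
        return bins;
    }

    // Returns bins that hold at least minCount and are the maximum within
    // +/- window cents. The histogram is circular: bin 1199 neighbours bin 0.
    static List<Integer> peaks(int[] bins, int window, int minCount) {
        List<Integer> result = new ArrayList<>();
        int n = bins.length;
        for (int i = 0; i < n; i++) {
            if (bins[i] < minCount) {
                continue;
            }
            boolean isPeak = true;
            for (int d = 1; d <= window && isPeak; d++) {
                int left = bins[Math.floorMod(i - d, n)];
                int right = bins[(i + d) % n];
                // Strict on the left, non-strict on the right, so that
                // equal values inside one window yield a single peak.
                isPeak = bins[i] > left && bins[i] >= right;
            }
            if (isPeak) {
                result.add(i);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Three A4 estimates, one A5 and two C4 estimates.
        double[] pitches = {440.0, 440.0, 440.0, 880.0, 261.6255653, 261.6255653};
        int[] bins = histogram(pitches);
        System.out.println(peaks(bins, 15, 2)); // prints [0, 900]
    }
}
```

In this sketch A4 and A5 fall into the same bin (900 cents), which is exactly the octave reduction that distinguishes a pitch class histogram from a pitch histogram. Real pitch estimates are noisy, so a practical implementation would typically smooth the histogram before looking for peaks.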
