Automatic music transcription using sequence to sequence learning Master’s thesis of B.Sc. Maximilian Awiszus At the faculty of Computer Science Institute for Anthropomatics and Robotics Reviewer: Prof. Dr. Alexander Waibel Second reviewer: Prof. Dr. Advisor: M.Sc. Thai-Son Nguyen Duration: 25. Mai 2019 – 25. November 2019 KIT – University of the State of Baden-Wuerttemberg and National Laboratory of the Helmholtz Association www.kit.edu Interactive Systems Labs Institute for Anthropomatics and Robotics Karlsruhe Institute of Technology Title: Automatic music transcription using sequence to sequence learning Author: B.Sc. Maximilian Awiszus Maximilian Awiszus Kronenstraße 12 76133 Karlsruhe
[email protected] ii Statement of Authorship I hereby declare that this thesis is my own original work which I created without illegitimate help by others, that I have not used any other sources or resources than the ones indicated and that due acknowledgement is given where reference is made to the work of others. Karlsruhe, 15. M¨arz 2017 ............................................ (B.Sc. Maximilian Awiszus) Contents 1 Introduction 3 1.1 Acoustic music . .4 1.2 Musical note and sheet music . .5 1.3 Musical Instrument Digital Interface . .6 1.4 Instruments and inference . .7 1.5 Fourier analysis . .8 1.6 Sequence to sequence learning . 10 1.6.1 LSTM based S2S learning . 11 2 Related work 13 2.1 Music transcription . 13 2.1.1 Non-negative matrix factorization . 13 2.1.2 Neural networks . 14 2.2 Datasets . 18 2.2.1 MusicNet . 18 2.2.2 MAPS . 18 2.3 Natural Language Processing . 19 2.4 Music modelling .