A Laser Triangulation Approach for Optical Audio Reconstruction of Phonograph Records Kristofer Janukiewicz
Total Page:16
File Type:pdf, Size:1020Kb
LiU-ITN-TEK-A-16/044--SE A Laser Triangulation Approach for Optical Audio Reconstruction of Phonograph Records Kristofer Janukiewicz 2016-09-22 Department of Science and Technology Institutionen för teknik och naturvetenskap Linköping University Linköpings universitet nedewS ,gnipökrroN 47 106-ES 47 ,gnipökrroN nedewS 106 47 gnipökrroN LiU-ITN-TEK-A-16/044--SE A Laser Triangulation Approach for Optical Audio Reconstruction of Phonograph Records Examensarbete utfört i Medieteknik vid Tekniska högskolan vid Linköpings universitet Kristofer Janukiewicz Handledare Joel Kronander Examinator Jonas Unger Norrköping 2016-09-22 Upphovsrätt Detta dokument hålls tillgängligt på Internet – eller dess framtida ersättare – under en längre tid från publiceringsdatum under förutsättning att inga extra- ordinära omständigheter uppstår. Tillgång till dokumentet innebär tillstånd för var och en att läsa, ladda ner, skriva ut enstaka kopior för enskilt bruk och att använda det oförändrat för ickekommersiell forskning och för undervisning. Överföring av upphovsrätten vid en senare tidpunkt kan inte upphäva detta tillstånd. All annan användning av dokumentet kräver upphovsmannens medgivande. För att garantera äktheten, säkerheten och tillgängligheten finns det lösningar av teknisk och administrativ art. Upphovsmannens ideella rätt innefattar rätt att bli nämnd som upphovsman i den omfattning som god sed kräver vid användning av dokumentet på ovan beskrivna sätt samt skydd mot att dokumentet ändras eller presenteras i sådan form eller i sådant sammanhang som är kränkande för upphovsmannens litterära eller konstnärliga anseende eller egenart. För ytterligare information om Linköping University Electronic Press se förlagets hemsida http://www.ep.liu.se/ Copyright The publishers will keep this document online on the Internet - or its possible replacement - for a considerable time from the date of publication barring exceptional circumstances. The online availability of the document implies a permanent permission for anyone to read, to download, to print out single copies for your own use and to use it unchanged for any non-commercial research and educational purpose. Subsequent transfers of copyright cannot revoke this permission. All other uses of the document are conditional on the consent of the copyright owner. The publisher has taken technical and administrative measures to assure authenticity, security and accessibility. According to intellectual property law the author has the right to be mentioned when his/her work is accessed as described above and to be protected against infringement. For additional information about the Linköping University Electronic Press and its procedures for publication and for assurance of document integrity, please refer to its WWW home page: http://www.ep.liu.se/ © Kristofer Janukiewicz Master of Science Thesis in Media Technology Department of Science and Technology, Linköping University, 2016 A Laser Triangulation Approach for Optical Audio Reconstruction of Phonograph Records Kristofer Janukiewicz Master of Science Thesis in Media Technology A Laser Triangulation Approach for Optical Audio Reconstruction of Phonograph Records Kristofer Janukiewicz LiTH-ITN-EX--YY/NNNN--SE Supervisor: Joel Kronander itn, Linköpings Universitet Mattias Johannesson SICK IVP Examiner: Jonas Unger itn, Linköpings universitet Media and Information Technology Department of Science and Technology Linköping University SE-601 74 Norrköping, Sweden Copyright © 2016 Kristofer Janukiewicz Abstract This thesis introduces a method for contact-free optical audio reconstruction of phonograph records using laser triangulation. The reconstruction is done using a 3D-profile camera and by scanning the surface of record in a single circumfer- ence. The depth-map created by the camera is then used to decode the audio information stored in the record. To evaluate the quality of the decoded audio in- formation, it is tested against digital copies of the same record in order to analyze the correlation between it and the extracted sound. The result of this thesis presents a decoded and high correlated audio which is recognizable to an original digital copy. The method is well suited for fast real-time or faster than real-time decoding implementations. iii Acknowledgments My thanks goes to all the people involved in this project. Not the least to my su- pervisor at SICK IVP, Mattias Johanesson, who have been a great help to construct the setups, exchange ideas and introduce me to the company. But also forming this thesis idea and choosing me to do it. A lot of thanks goes to my supervisor, Joel Kronander, at Linköping University who has guided and gave me a lot of introductory ideas on were to start and handle this thesis but also in finalizing it. A big thanks to my examiner, Jonas Unger, who approved of this thesis. A loving thanks to Tiffani Höglind, who have put up with me during this thesis project. A thanks to my thesis-colleagues at SICK IVP who have brought great joy during this thesis work: Jens Edhammer, Mikael Zackrisson and Richard Bonde- mark. And of course, a huge thanks to everyone at SICK IVP for letting me use their equipment, experience and technology for this thesis. v Contents 1 Introduction 1 1.1 Optical Audio Reconstruction of Phonograph Records . 1 1.2 Related Work . 2 1.3 Project’s Contribution . 4 1.4 Proposed Method Pipeline . 4 2 Background 9 2.1 A Brief History of Audio Recording . 9 2.2 Disc Formats . 10 2.3 Groove . 11 2.4 Error in analog decoding . 14 3 Hardware Setup 17 3.1 Camera . 17 3.1.1 Sheet of Light Triangulation . 18 3.1.2 Camera Lens . 18 3.1.3 Ranger Studio . 19 3.1.4 Laser . 20 3.2 Arduino Controlled Turntable . 21 3.3 Photo of the Setup . 22 3.4 Illustrations of the setup . 23 3.5 Simulating a Phonograph Record with Archimedes’ Spiral . 24 3.6 Measurements . 27 4 Method 29 4.1 Scanning . 29 4.1.1 Laser width and scanning frequency . 34 4.2 Artifacts in Scans . 35 4.3 Artifact Removal . 39 4.4 Segmentation . 41 4.4.1 Path Segmentation . 42 4.5 Decoding . 44 4.5.1 Groove Extremas . 44 vii viii Contents 4.5.1.1 Special Shaped Samples . 45 4.5.1.2 Finer Groove Extremas . 46 4.5.2 Intersection Line . 49 4.5.3 Stylus Position in Sample . 52 4.5.4 Modulation to Sound . 53 4.5.5 Filtering . 53 4.5.5.1 Moving Average - Smoothing . 54 4.5.5.2 Maximum Suppression filter - Click Removal . 55 4.6 Sound Stitching . 56 4.6.1 Mono or Stereo . 57 4.6.2 Calculating the Sample Rate . 58 5 Result 59 5.1 Ground truth comparison . 60 5.2 Decoded Sound from Record - Reversed Ordinary Setup . 61 5.3 Decoded Sound from Record - Ordinary Setup . 62 5.4 Comparison to Digitally Mastered Audio . 63 5.5 Comparison to Cell Phone Recorded Audio from Gramophone Player 65 6 Discussion 69 6.1 Disc Modulation . 69 6.2 Noise in the Beginning . 71 6.3 Noise Analysis . 71 6.4 Sample Rate Artifacts . 73 6.5 RIIA Playback Curves - Equalization . 73 6.6 Real-Time Approach . 73 6.7 Future Work . 74 7 Conclussion 77 Bibliography 79 1 Introduction Libraries across the world have large collections of old music records that need to be digitized for preservation. For example the National Library of Sweden has, approximately, 146 000 such records [Vinnova, 2014]. Reading and digitizing these with conventional methods takes prohibitively long time and furthermore risks destroying the old artifacts, and therefore; more efficient methods are re- quested. SICK1 develops high-performance 3D profiling products, that uses a laser tri- angulation technique, which has the potential to read and decode audio informa- tion on records. This thesis deals with contact-free reading of music on 78 rpm phonograph records using a 3D profiling camera provided by SICK IVP2; furthermore inves- tigates the challenges laser triangulation will have on these kind of records and possible ways to overcome them. 1.1 Optical Audio Reconstruction of Phonograph Records Contact-free reading of phonograph records is a technology that has existed in the past few decades. The topic is well researched and analyzed, however there is still room for new approaches and optic technologies to be improved/invented. Achieving a contact-free reading is advantageous; as it does not degrade or ruin the information stored in a record and it also has the potential to do it more rapidly than a traditional stylus. Meaning; digitizing a single record quicker than the total audio time recorded into a disc. For a huge library, this is crucial as each 1https://www.sick.com/us/en 2https://www.sick.com/se/sv/w/sick/ 1 2 1 Introduction 78 rpm disc can store up to a few minutes of audio information on each side 1 and 33 3 almost an half hour of music on each side. Some older records, in the libraries, are fragile or degraded to a point where the traditional stylus is not an option, as it could further damage the stored information. In this thesis, the above topics are analyzed using a 3D-profiling approach, in contrast to the more common 2D approach. Creating a depth map of each scanned record directly with the camera opens up new possibilities of decoding the information stored in a phonograph record. 1.2 Related Work Creating an optical audio reconstruction of records is a well researched topic where different methods and approaches have been studied and proposed for a few decades. In a historical perspective, the first contact-free approach for reading records was presented by William K. Heine [Heine, 1976]. The proposed procedure is to use a laser beam to create an interference/diffraction pattern of the records groove walls and the information of magnitude and displacement to decode the track. His research was very beneficial to achieve an interest in optical decod- ing of phonograph records and William K.