Optical Music Recognition for Dummies

Total Page:16

File Type:pdf, Size:1020Kb

Optical Music Recognition for Dummies Optical Music Recognition for Dummies A Gentle Introduction to Optical Music Recognition Watch on YouTube Image by Yngve Bakken Nilsen, Flickr Speakers Jorge Jan Alexander Ichiro Calvo-Zaragoza Hajič jr. Pacha Fujinaga Introduction The History of OMR Introduction to Optical Music Recognition Ichiro Fujinaga Music Technology Area, Schulich School of Music McGill University Introduction ❖ What is Optical Music Recognition (OMR)? ❖ Why is OMR important? ❖ History of OMR research ❖ Obstacles and challenges ISMIR 2018 Fujinaga 5 /45 Optical Music Recognition (OMR) A process of converting images of music scores into a symbolic computer representation, such as MIDI, MusicXML, or MEI (Music Encoding Initiative). OMR ISMIR 2018 Fujinaga 6 /45 Steps Involved in OMR Image Music Symbol Music Notation Preprocessing Recognition Reconstruction Final Binarization Staves Symbol Digitized Processing Combination Output Noise Removal Score Semantic Structural Symbol Assignment Analysis Segmentation (pitch, value) Musical Image Symbol Structure Segmentation Classification Reconstruction ISMIR 2018 Fujinaga 7 /45 Why is OMR important? ❖ Automatic playback ❖ Rearrangements ❖ Transpositions ❖ Change of mode (Major /Minor) ❖ Symbolic music analysis ❖ Searching ❖ Data mining ❖ Distant reading ❖ Reprints (for publishers) ❖ Braille output ❖ Score following ISMIR 2018 Fujinaga 8 /45 History of OMR Research: The Pioneers ❖ Dennis Pruslin ❖ Automatic recognition of sheet music (1966) Sc.D. Dissertation, MIT ❖ David Prerau ❖ Computer pattern recognition of standard engraved music notation (1970). Ph.D. Dissertation, MIT Denis PruslinDavid Prerau AKA The Tool Denis Pruslin with grandson Kevin, 2010/11 Baker House, c. 1959 ISMIR 2018 Fujinaga 9 /45 The first published digital scan of music (Prerau 1970) ISMIR 2018 Fujinaga 1 /45 0 1984: Wabot-2 https://www.scaruffi.com/mind/ai/wabot.jpg ISMIR 2018 Fujinaga 1 /45 1 OMR Theses ❖ 1966 Denis Pruslin (MIT) ❖ 1970 David Prerau (MIT) ❖ 1988 Ichiro Fujinaga (McGill: MA) ❖ 1989 Nicholas Carter (University of Surrey) ❖ 1996 Ichiro Fujinaga (McGill) ❖ 1996 Kia Ng (University of Leeds) ❖ 1996 Bertrand Coüasnon (Université de Rennes) ❖ 1997 David Bainbridge (University of Edinburgh) ❖ 2006 Laurent Pugin (Université de Genève) ❖ 2009 Alicia Fornés (Universitat Autònoma de Barcelona) ❖ 2012 Ana Rebelo (Universidade do Porto) ❖ 2014 Andrew Hankinson (McGill) ❖ 2016 Jorge Calvo Zaragoza (Universitat d’Alacant) ISMIR 2018 Fujinaga 1 /45 2 Scanner at U.S. Census Data (c.1960) ISMIR 2018 Fujinaga 13/45 Flatbed Scanner from Fujitsu (1983) ISMIR 2018 Fujinaga 14/45 How I did OMR without a scanner in 1983 1. Made a photocopy of a score to a transparency 2. Bought lots of graph papers 3. Found a large classroom (at night) 4. Taped the graph papers to the blackboard 5. Put the transparency of the score on a overhead projector 6. Projected the score onto the blackboard (with graph papers) 7. Manually blacked out the graph paper wherever there were black pixels 8. Punched cards using a run-length encoding method 9. Wrote an OMR program in Pascal and ran it on a mainframe computer ISMIR 2018 Fujinaga 15/45 Desktop scanners ❖ Datacopy 90 (1984): $9,945 (2018: $24,000) ❖ Datacopy 700 (1985): $3,950 (2018: $9,200) ❖ Datacopy 730 (1987): $1,800 (2018: $4,000) ❖ 200 DPI, 8-bit greyscale ❖ Canon LiDE 120 (2018): $60 ❖ 2400 DPI, 24-bit color ISMIR 2018 Fujinaga 16/45 ISMIR 2018 Fujinaga 17/45 ISMIR 2018 Fujinaga 18/45 Notes on Image File Sizes ❖ Scanner: 200 DPI (dots per inch) @ 8-bit/dot or 1 byte / pixel ❖ 8 1/2 x 11 paper (~A4): 8.5 x 200 x 11 x 200 x 1 byte= ~3.7MB ❖ Projector: 800x600 @ 24bit/pixel ❖ 480,000 x 3 bytes = ~1.4MB ❖ iPhone 6: 750 x 1334 @ 24bit/pixel ❖ 1,000,500 x 3 bytes = ~3 MB ❖ Color scanner: 600 DPI (dots per inch) @ 24-bit/pixel ❖ 8 1/2 x 11 paper (~A4): 8.5 x 600 x 11 x 600 x 3 byte= ~100MB ISMIR 2018 Fujinaga 19/45 Size of RAM on PCs in the 1980s ❖ IBM PC XT: 256KB (1985), 640KB (1986), 10MB hard disk ❖ Apple Macintosh: 512KB (1984), 1MB (1986), 20MB external hard disk ❖ Scanner: 200 DPI (dots per inch) @ 1 byte / pixel ❖ 8 1/2 x 11 paper (~A4): 8.5 x 200 x 11 x 200 x 1 byte= ~3.7MB ❖ 640,000/ (200DPI * 6 inches) = 533 pixels (~2.5 inch) ISMIR 2018 Fujinaga 20/45 2000: Gamera ❖ Framework for creation of structured document recognition system ❖ Designed for domain experts ❖ Image processing tools (filters, binarizations, etc.) ❖ Document segmentation and analysis ❖ Symbol segmentation and classification ❖ Portable, extensible, simple, open-source, GUI, batch Generalized Algorithms and Methods for Enhancement and Restoration of Archives ISMIR 2018 Fujinaga 21/45 2000: Gamera ISMIR 2018 Fujinaga 22/45 Birth of Gamera ❖ Christened Gamera on 1 April 2001 (17th Anniversary!) ❖ Based on Fujinaga’s Adaptive Optical Music Recognition algorithms ❖ First public mention of Gamera at the 1st Joint Conference on Digital Libraries (JCDL: June 2001) in Roanoke, VA ❖ First paper presented at the 2nd International Symposium on Music Information Retrieval (ISMIR: October 2001) in Bloomington, IN ISMIR 2018 Fujinaga 23/45 Early Gamera Screenshot (Linux) ca. June 2002 ISMIR 2018 Fujinaga 24/45 Some Features of Gamera c. 2008 ISMIR 2018 Fujinaga 25/45 Preprocessing Brightness Enhancement ISMIR 2018 Fujinaga 26/45 Preprocessing Binarization ISMIR 2018 Fujinaga 27/45 Staffline Removal ISMIR 2018 Fujinaga 28/45 Staffline Removal Four-line hand-drawn staff example ISMIR 2018 Fujinaga 29/45 Staffline Removal Difficult ISMIR 2018 Fujinaga 30/45 Staffline Removal Lute tablature ISMIR 2018 Fujinaga 31/45 Symbol classifier / Gamera ISMIR 2018 Fujinaga 32/45 Lute tablature symbol recognition ISMIR 2018 Fujinaga 33/45 2008: Other Applications Optical Recognition of Psaltic Byzantine Chant Notation Christoph Dalitz · Georgios K. Michalakis · Christine Pranzas ISMIR 2018 Fujinaga 34/45 2009: Other Applications Optical Recognition of Lute Tablature Christoph Dalitz · Thomas Karsten ISMIR 2018 Fujinaga 35/45 2002: Aruspix ❖ Developed by Laurent Pugin ❖ Specialized for typographic music ❖ Uses HMM (Hidden Markov Model) ❖ Does not remove staff lines ISMIR 2018 Fujinaga 36/45 ISMIR 2018 Fujinaga 37/45 2011: Liber Usualis Project Full-text search of 2,000 pages of Latin text and square notation ISMIR 2018 Fujinaga 38/45 Preprocessing: Aruspix ISMIR 2018 Fujinaga 40/45 Music recognition: Gamera ISMIR 2018 Fujinaga 41/45 Text recognition: Ocropus ISMIR 2018 Fujinaga 42/45 Pitch correction: Aruspix ISMIR 2018 Fujinaga 43/45 Web interface: Diva.js ISMIR 2018 Fujinaga 44/45 2012: Rodan Andrew Hankinson ❖ Remote Online Document Analysis Network (RODAN) ❖ Workflow management system for large-scale optical music recognition processes ❖ File management ❖ Cropping ❖ Noise removal ❖ Binarization ❖ Staff recognition and segmentation ❖ Symbol recognition ❖ Music reconstruction / Semantic assignment ❖ Corrections ❖ Build indices for searching ISMIR 2018 Fujinaga 45/45 Rodan: OMR Workflow Management System ISMIR 2018 Fujinaga 46/45 2016: Cantus Ultimus ISMIR 2018 Fujinaga 47/45 punctum Greyscale d d c dc cb c dedcd dfd edc Binarization C clef Border Removal Lyric Removal Staff Removal Shape Classification Music Reconstruction Shape/Image Alignment Conclusions ❖ What is Optical Music Recognition (OMR)? ❖ Why is OMR important? ❖ History of OMR research ISMIR 2018 Fujinaga 49/45 Challenges of OMR What Makes Music Notation Complicated? Lossy Knowledge Representation ● Any knowledge representation of the real world that the computer can handle has limits and inevitable bias ● MIDI representation very limited ● Common Western Music Notation (CWMN) is extremely complex ● The rules of CWMN are often violated ○ Minor violations and simplifications (e.g., omitted 3’s in tuplets) ○ Major violations (e.g., invalid meter) ○ See Donald Byrd’s Gallery for more examples “The Rite of Spring” by Igor Stravinsky Understanding the Complexity Graphical complexity (e.g, dense scores, artifacts from image capturing procedure, overlapping symbols, ambiguities) Structural complexity ● Inherently complicated content ● Syntactic and semantic rules ● Violations of these rules Nocturne (Op. 15, no. 2) by Frédéric Chopin ➢ Natural limit of OMR: where musicians disagree ➢ Building a rule-based system nearly impossible Complexity In Other Notation Systems ● Many historical and modern music notation systems ● Mensural notation: Interpretation of the same glyph depends on period of time From Wikipedia by Bitethesilverbullet Guqin Notation from Wikipedia From https://en.wikipedia.org/wiki/Mensural_notation Comparing OMR to OCR OMR is often compared to Optical Character Recognition (OCR): Wikipedia Image of Text OCR Text Die freie Enzyklopädie Image of Encoded OMR Music Scores Music Challenges of OMR The OMR domain The OMR Domain One of the key points for understanding OMR as a research field is its heterogeneity regarding the input domain and finding a suitable output representation. Icons by Freepik from Flaticon (Pack: Interaction) Input Signal Offline: Image of the sheet Online: Signal captured by an electronic pen Engraving Mechanism Typeset by a machine (“Printed”) Handwritten (typically over printed staff lines) Notational Type Common Western Preceding notations (mensural, Music Notation neumes, ...) Instrument-specific notation Others (Numbered, (tabulature, drums, ...) Yogyakarta, Surakarta, ...) Graphical Complexity Ideal conditions Camera-based scenario (blurring, skewing, 3D distortions, low resolution, ...) Degraded documents (heterogeneous background, inkblots, mold, ...) Structural Complexity Single-staff single-voice
Recommended publications
  • Improving Optical Music Recognition by Combining Outputs from Multiple Sources
    IMPROVING OPTICAL MUSIC RECOGNITION BY COMBINING OUTPUTS FROM MULTIPLE SOURCES Victor Padilla Alex McLean Alan Marsden Kia Ng Lancaster University University of Leeds Lancaster University University of Leeds victor.padilla. a.mclean@ a.marsden@ k.c.ng@ [email protected] leeds.ac.uk lancaster.ac.uk leeds.ac.uk ABSTRACT in the Lilypond format, claims to contain 1904 pieces though some of these also are not full pieces. The Current software for Optical Music Recognition (OMR) Musescore collection of scores in MusicXML gives no produces outputs with too many errors that render it an figures of its contents, but it is not clearly organised and unrealistic option for the production of a large corpus of cursory browsing shows that a significant proportion of symbolic music files. In this paper, we propose a system the material is not useful for musical scholarship. MIDI which applies image pre-processing techniques to scans data is available in larger quantities but usually of uncer- of scores and combines the outputs of different commer- tain provenance and reliability. cial OMR programs when applied to images of different The creation of accurate files in symbolic formats such scores of the same piece of music. As a result of this pro- as MusicXML [11] is time-consuming (though we have cedure, the combined output has around 50% fewer errors not been able to find any firm data on how time- when compared to the output of any one OMR program. consuming). One potential solution to this is to use Opti- Image pre-processing splits scores into separate move- cal Music Recognition (OMR) software to generate sym- ments and sections and removes ossia staves which con- bolic data such as MusicXML from score images.
    [Show full text]
  • OCR Pwds and Assistive Qatari Using OCR Issue No
    Arabic Optical State of the Smart Character Art in Arabic Apps for Recognition OCR PWDs and Assistive Qatari using OCR Issue no. 15 Technology Research Nafath Efforts Page 04 Page 07 Page 27 Machine Learning, Deep Learning and OCR Revitalizing Technology Arabic Optical Character Recognition (OCR) Technology at Qatar National Library Overview of Arabic OCR and Related Applications www.mada.org.qa Nafath About AboutIssue 15 Content Mada Nafath3 Page Nafath aims to be a key information 04 Arabic Optical Character resource for disseminating the facts about Recognition and Assistive Mada Center is a private institution for public benefit, which latest trends and innovation in the field of Technology was founded in 2010 as an initiative that aims at promoting ICT Accessibility. It is published in English digital inclusion and building a technology-based community and Arabic languages on a quarterly basis 07 State of the Art in Arabic OCR that meets the needs of persons with functional limitations and intends to be a window of information Qatari Research Efforts (PFLs) – persons with disabilities (PWDs) and the elderly in to the world, highlighting the pioneering Qatar. Mada today is the world’s Center of Excellence in digital work done in our field to meet the growing access in Arabic. Overview of Arabic demands of ICT Accessibility and Assistive 11 OCR and Related Through strategic partnerships, the center works to Technology products and services in Qatar Applications enable the education, culture and community sectors and the Arab region. through ICT to achieve an inclusive community and educational system. The Center achieves its goals 14 Examples of Optical by building partners’ capabilities and supporting the Character Recognition Tools development and accreditation of digital platforms in accordance with international standards of digital access.
    [Show full text]
  • Musical Notation Codes Index
    Music Notation - www.music-notation.info - Copyright 1997-2019, Gerd Castan Musical notation codes Index xml ascii binary 1. MidiXML 1. PDF used as music notation 1. General information format 2. Apple GarageBand Format 2. MIDI (.band) 2. DARMS 3. QuickScore Elite file format 3. SMDL 3. GUIDO Music Notation (.qsd) Language 4. MPEG4-SMR 4. WAV audio file format (.wav) 4. abc 5. MNML - The Musical Notation 5. MP3 audio file format (.mp3) Markup Language 5. MusiXTeX, MusicTeX, MuTeX... 6. WMA audio file format (.wma) 6. MusicML 6. **kern (.krn) 7. MusicWrite file format (.mwk) 7. MHTML 7. **Hildegard 8. Overture file format (.ove) 8. MML: Music Markup Language 8. **koto 9. ScoreWriter file format (.scw) 9. Theta: Tonal Harmony 9. **bol Exploration and Tutorial Assistent 10. Copyist file format (.CP6 and 10. Musedata format (.md) .CP4) 10. ScoreML 11. LilyPond 11. Rich MIDI Tablature format - 11. JScoreML RMTF 12. Philip's Music Writer (PMW) 12. eXtensible Score Language 12. Creative Music File Format (XScore) 13. TexTab 13. Sibelius Plugin Interface 13. MusiXML: My own format 14. Mup music publication program 14. Finale Plugin Interface 14. MusicXML (.mxl, .xml) 15. NoteEdit 15. Internal format of Finale (.mus) 15. MusiqueXML 16. Liszt: The SharpEye OMR 16. XMF - eXtensible Music 16. GUIDO XML engine output file format Format 17. WEDELMUSIC 17. Drum Tab 17. NIFF 18. ChordML 18. Enigma Transportable Format 18. Internal format of Capella (ETF) (.cap) 19. ChordQL 19. CMN: Common Music 19. SASL: Simple Audio Score 20. NeumesXML Notation Language 21. MEI 20. OMNL: Open Music Notation 20.
    [Show full text]
  • Using Smartscore 2.Pdf
    Using SmartScore Scanning Music Be sure you have the necessary scanner drivers installed before attempting to scan from inside SmartScore. Most scanners come with software that enable programs such as SmartScore to control them. TWAIN drivers and/or Mac plug-ins are normally included in the software packaged with most scanners. It may be necessary for certain Mac users to perform a “Custom > TWAIN” installation from the CD accompanying your scan- ner; depending on the manufacturer. NOTE: Scanner drivers are often updated by scanner manufacturers and posted on their web sites. If problems occur during scanning, it is always a good idea to check the Internet for updated scanner drivers before calling Musitek Technical Support. Mac Users: Skip the next section. Turn to “Scanning in Macintosh” on page 6. Scanning in Windows: Using the SmartScore Scanning Interface a. Push the Scan button in the Navigator or in the Main Toolbar. Figure 1: Scan Button b. If there is no response, go to File > Scan Music > Select Scanner and choose appropriate TWAIN driver. If you do not see anything listed in the Select Scanner window, your drivers are probably not installed. Install or replace TWAIN driver from scanner CD or from “Driver Download” area of scanner manufacturer’s website. c. If the scanner still does not operate properly, go to “Choosing an alternative scanning interface” on page 5. USING SmartScore 1 Help > Using SmartScore Your scanner should immediately begin to operate with Scan or Acquire. A low-resolution pre-scan should soon appear in the Preview window. FIGURE 2: SmartScore scanning interface d.
    [Show full text]
  • Gradu04243.Pdf
    Paperilomakkeesta tietomalliin Kalle Malin Tampereen yliopisto Tietojenkäsittelytieteiden laitos Tietojenkäsittelyoppi Pro gradu -tutkielma Ohjaaja: Erkki Mäkinen Toukokuu 2010 i Tampereen yliopisto Tietojenkäsittelytieteiden laitos Tietojenkäsittelyoppi Kalle Malin: Paperilomakkeesta tietomalliin Pro gradu -tutkielma, 61 sivua, 3 liitesivua Toukokuu 2010 Tässä tutkimuksessa käsitellään paperilomakkeiden digitalisointiin liittyvää kokonaisprosessia yleisellä tasolla. Prosessiin tutustutaan tarkastelemalla eri osa-alueiden toimintoja ja laitteita kokonaisjärjestelmän vaatimusten näkökul- masta. Tarkastelu aloitetaan paperilomakkeiden skannaamisesta ja lopetetaan kerättyjen tietojen tallentamiseen tietomalliin. Lisäksi luodaan silmäys markki- noilla oleviin valmisratkaisuihin, jotka sisältävät prosessin kannalta oleelliset toiminnot. Avainsanat ja -sanonnat: lomake, skannaus, lomakerakenne, lomakemalli, OCR, OFR, tietomalli. ii Lyhenteet ADRT = Adaptive Document Recoginition Technology API = Application Programming Interface BAG = Block Adjacency Graph DIR = Document Image Recognition dpi= Dots Per Inch ICR = Intelligent Character Recognition IFPS = Intelligent Forms Processing System IR = Information Retrieval IRM = Image and Records Management IWR = Intelligent Word Recognition NAS = Network Attached Storage OCR = Optical Character Recognition OFR = Optical Form Recognition OHR = Optical Handwriting Recognition OMR = Optical Mark Recognition PDF = Portable Document Format SAN = Storage Area Networks SDK = Software Development Kit SLM
    [Show full text]
  • JETIR Research Journal
    © 2019 JETIR June 2019, Volume 6, Issue 6 www.jetir.org (ISSN-2349-5162) IMAGE TEXT CONVERSION FROM REGIONAL LANGUAGE TO SPEECH/TEXT IN LOCAL LANGUAGE 1Sayee Tale,2Manali Umbarkar,3Vishwajit Singh Javriya,4Mamta Wanjre 1Student,2Student,3Student,4Assistant Professor 1Electronics and Telecommunication, 1AISSMS, Institute of Information Technology, Pune, India. Abstract: The drivers who drive in other states are unable to recognize the languages on the sign board. So this project helps them to understand the signs in different language and also they will be able to listen it through speaker. This paper describes the working of two module image processing module and voice processing module. This is done majorly using Raspberry Pi using the technology OCR (optical character recognition) technique. This system is constituted by raspberry Pi, camera, speaker, audio playback module. So the system will help in decreasing the accidents causes due to wrong sign recognition. Keywords: Raspberry pi model B+, Tesseract OCR, camera module, recording and playback module,switch. 1. Introduction: In today’s world life is too important and one cannot loose it simply in accidents. The accident rates in today’s world are increasing day by day. The last data says that 78% accidents were because of driver’s fault. There are many faults of drivers and one of them is that they are unable to read the signs and instructions written on boards when they drove into other states. Though the instruction are for them only but they are not able to make it. So keeping this thing in mind we have proposed a system which will help them to understand the sign boards written in regional language.
    [Show full text]
  • Improved Optical Music Recognition (OMR)
    Improved Optical Music Recognition (OMR) Justin Greet Stanford University [email protected] Abstract same index in S. Observe that the order of the notes is implicitly mea- This project focuses on identifying and ordering the sured. Any note missing from the output, out of place, or notes and rests in a given measure of sheet music using a erroneously added heavily penalizes the percentage. More novel approach involving a Convolutional Neural Network qualitatively, we can render the output of the algorithm as (CNN). Past efforts in the field of Optical Music Recogni- sheet music and visually check how it compares to the input tion (OMR) have not taken advantage of neural networks. measure. The best architecture we developed involves feeding pro- Multiple commercial OMR products exist and their ef- posals of regions containing notes or rests to a CNN then fectiveness has been studied to be imperfect enough to pre- removing conflicts in regions identifying the same symbol. clude many practical applications [4, 20]. We run the test We compare our results with a commercial OMR product images through one of them (SmartScore [12]) and use the and achieve similar results. evaluation criteria described above to serve as a benchmark for the proposed algorithm. 1. Introduction 2. Related Work Optical Music Recognition (OMR) is the problem of The topic of OMR is heavily researched. The research converting a scanned image of sheet music into a symbolic can be broadly broken up into three categories: evaluation representation like MusicXML [9] or MIDI. There are many of existing methods, discovery of novel solutions for spe- obvious practical applications for such a solution, such as cific parts of OMR, and classical approaches to OMR.
    [Show full text]
  • Efficient Optical Music Recognition Validation Using MIDI Sequence Data by Janelle C
    Efficient Optical Music Recognition Validation using MIDI Sequence Data by Janelle C. Sands Submitted to the Department of Electrical Engineering and Computer Science in partial fulfillment of the requirements for the degree of Master of Engineering in Electrical Engineering and Computer Science at the MASSACHUSETTS INSTITUTE OF TECHNOLOGY May 2020 ○c Massachusetts Institute of Technology 2020. All rights reserved. Author................................................................ Department of Electrical Engineering and Computer Science May 12, 2020 Certified by. Michael Scott Cuthbert Associate Professor Thesis Supervisor Accepted by . Katrina LaCurts Chair, Master of Engineering Thesis Committee ii Efficient Optical Music Recognition Validation using MIDI Sequence Data by Janelle C. Sands Submitted to the Department of Electrical Engineering and Computer Science on May 12, 2020, in partial fulfillment of the requirements for the degree of Master of Engineering in Electrical Engineering and Computer Science Abstract Despite advances in optical music recognition (OMR), resultant scores are rarely error-free. The power of these OMR systems to automatically generate searchable and editable digital representations of physical sheet music is lost in the tedious manual effort required to pinpoint and correct these errors post-OMR, or evento just confirm no errors exist. To streamline post-OMR error correction, I developeda corrector to automatically identify discrepancies between resultant OMR scores and corresponding Musical Instrument Digital Interface (MIDI) scores and then either automatically fix errors, or in ambiguous cases, notify the user to manually fix errors. This tool will be open source, so anyone can contribute to further improving the accuracy of OMR tools and expanding the amount of trusted digitized music. Thesis Supervisor: Michael Scott Cuthbert Title: Associate Professor iii iv Acknowledgments Thank you to my advisor, Professor Cuthbert, for sharing your enthusiasm, expertise, and encouragement throughout this project.
    [Show full text]
  • Understanding Optical Music Recognition
    1 Understanding Optical Music Recognition Jorge Calvo-Zaragoza*, University of Alicante Jan Hajicˇ Jr.*, Charles University Alexander Pacha*, TU Wien Abstract For over 50 years, researchers have been trying to teach computers to read music notation, referred to as Optical Music Recognition (OMR). However, this field is still difficult to access for new researchers, especially those without a significant musical background: few introductory materials are available, and furthermore the field has struggled with defining itself and building a shared terminology. In this tutorial, we address these shortcomings by (1) providing a robust definition of OMR and its relationship to related fields, (2) analyzing how OMR inverts the music encoding process to recover the musical notation and the musical semantics from documents, (3) proposing a taxonomy of OMR, with most notably a novel taxonomy of applications. Additionally, we discuss how deep learning affects modern OMR research, as opposed to the traditional pipeline. Based on this work, the reader should be able to attain a basic understanding of OMR: its objectives, its inherent structure, its relationship to other fields, the state of the art, and the research opportunities it affords. Index Terms Optical Music Recognition, Music Notation, Music Scores I. INTRODUCTION Music notation refers to a group of writing systems with which a wide range of music can be visually encoded so that musicians can later perform it. In this way, it is an essential tool for preserving a musical composition, facilitating permanence of the otherwise ephemeral phenomenon of music. In a broad, intuitive sense, it works in the same way that written text may serve as a precursor for speech.
    [Show full text]
  • Finale Songwriter Free Download Mac
    Finale songwriter free download mac Download Finale Notepad for free and get started composing, arranging and printing your own sheet music today. Download a free trial of Finale 25, the latest release, and experience MakeMusic's professional music notation software for 30 days. With SongWriter music notation software, you can listen to the results before the rehearsal. When you can hear what you've written you can. With SongWriter music notation software, you can listen to the results before the rehearsal. When you can hear what you've written you can quickly refine your. Finale for Mac, free and safe download. Finale latest version: Professional music composition software. If you're looking for the most professional music. Finale Songwriter , Commercial Software, App, Download Finale Songwriter Download Update. Mac Universal Binary, Finale Songwriter products supported on Mac OS X Finale , , Finale , Free Finale SongWriter Download,Finale SongWriter is Its. Shop for the Makemusic Finale SongWriter Software Download in and receive free shipping and guaranteed lowest OS X (Mac-Intel or Power PC). Search the Finale download library for updates, documentation, free trial versions and more. Download free Finale SongWriter for Mac, free Finale SongWriter download for Mac OS X. Finale SongWriter (Electronic Software Delivery) Please Note: Once the order print, and make minor edits with the free, downloadable Finale NotePad®. Finale Free Download. Finale Songwriter download for Mac. Finale songwriter keygen downloadanyfiles com. Download Finale Songwriter by MakeMusic. Finale for Mac, free and safe download. Finale latest version: In questo video vi spiegherò come installare Finale SongWriter sul vostro pc Finale SongWriter.
    [Show full text]
  • An Open-Source Tool Providing a (Semi-)Automatic OCR Workflow For
    Article OCR4all - An Open-Source Tool Providing a (Semi-)Automatic OCR Workflow for Historical Printings Christian Reul 1, Dennis Christ 1, Alexander Hartelt 1, Nico Balbach 1, Maximilian Wehner 1, Uwe Springmann 2, Christoph Wick 1, Christine Grundig 3, Andreas Büttner 4, and Frank Puppe 1 1 Chair for Artificial Intelligence and Applied Computer Science, University of Würzburg, 97074 Würzburg, Germany 2 Center for Information and Language Processing, LMU Munich, 80538 Munich, Germany 3 Institute for Modern Art History, University of Zurich, 8006 Zurich, Switzerland 4 Institute for Philosophy, University of Würzburg, 97074 Würzburg, Germany * Correspondence: [email protected] Version September 11, 2019 submitted to Appl. Sci. Abstract: Optical Character Recognition (OCR) on historical printings is a challenging task mainly due to the complexity of the layout and the highly variant typography. Nevertheless, in the last few years great progress has been made in the area of historical OCR, resulting in several powerful open-source tools for preprocessing, layout recognition and segmentation, character recognition and post-processing. The drawback of these tools often is their limited applicability by non-technical users like humanist scholars and in particular the combined use of several tools in a workflow. In this paper we present an open-source OCR software called OCR4all, which combines state-of-the-art OCR components and continuous model training into a comprehensive workflow. A comfortable GUI allows error corrections not only in the final output, but already in early stages to minimize error propagations. Further on, extensive configuration capabilities are provided to set the degree of automation of the workflow and to make adaptations to the carefully selected default parameters for specific printings, if necessary.
    [Show full text]
  • Evaluation of Music Notation Packages for an Academic Context
    EVALUATION OF MUSIC NOTATION PACKAGES FOR AN ACADEMIC CONTEXT Pauline Donachy Dept of Music University of Glasgow [email protected] Carola Boehm Dept of Music University of Glasgow [email protected] Stephen Brandon Dept of Music University of Glasgow [email protected] December 2000 EVALUATION OF MUSIC NOTATION PACKAGES FOR AN ACADEMIC CONTEXT PREFACE This notation evaluation project, based in the Music Department of the University of Glasgow, has been funded by JISC (Joint Information Systems Committee), JTAP (JISC Technology Applications Programme) and UCISA (Universities and Colleges Information Systems Association). The result of this project is a software evaluation of music notation packages that will be of benefit to the Higher Education community and to all users of music software packages, and will aid in the decision making process when matching the correct package with the correct user. Six main packages are evaluated in-depth, and various others are identified and presented for consideration. The project began in November 1999 and has been running on a part-time basis for 10 months, led by Pauline Donachy under the joint co-ordination of Carola Boehm and Stephen Brandon. We hope this evaluation will be of help for other institutions and hope to be able to update this from time to time. In this aspect we would be thankful if any corrections or information regarding notation packages, which readers might have though relevant to be added or changed in this report, could be sent to the authors. Pauline Donachy Carola Boehm Dec 2000 EVALUATION OF MUSIC NOTATION PACKAGES FOR AN ACADEMIC CONTEXT ACKNOWLEDGEMENTS In fulfillment of this project, the project team (Pauline Donachy, Carola Boehm, Stephen Brandon) would like to thank the following people, without whom completion of this project would have been impossible: William Clocksin (Calliope), Don Byrd and David Gottlieb (Nightingale), Dr Keith A.
    [Show full text]