Musicdigger: a Tool for Music Discovery

Musicdigger: a Tool for Music Discovery

MusicDigger: A tool for Music Discovery M. Adil Yalc.ın, Preeti Bhargava, Sravanthi Bondugula, Varun K. Nagaraja and Marcelo Velloso Dept. of Computer Science University of Maryland, College Park [email protected], [email protected], [email protected], [email protected], [email protected] Abstract—There are currently many information visualization two major visualizations to accomplish this: a timeline visual- applications present for various fields that deal with large ization to view the releases of an artist/label over time and a amounts of data but one such area that has often gone over- network visualization to view the connections of a particular looked is the area of music. Musicians are a very collaborative group and they will very quickly generate complex networks of artist/label to other artists/labels. collaborations with other musicians and a large set of releases, Section 2 discusses previous work that has been done in also known as discographies. In conjunction with our faculty visualizing music data and how our application differs from collaborator, Robert Gibson (Director, University of Maryland these other solutions. Section 3 discusses our application in School of Music), we have worked to create our own information detail, the design issues we faced, and how we implemented visualization tool to browse music data called MusicDigger. MusicDigger is an visualization application that allows users the application. Finally, section 4 describes the user evaluation to discover discographies and browse the collaborative networks study that was conducted and section 5 presents the conclu- of artists using the Discogs database. Discogs is a web site that sions we gained from this study and the future work for this contains information on artists and their releases contributed by project. the many users who support the web site. Our tool focuses on music discovery rather than music recommendation. II. RELATED WORK I. INTRODUCTION There has been extensive work done in the area of visual- izing music data but much of it can be tailored to two main The current musical world we live in is a vast collection types of tools: music recommendation systems and tools for of data that is not readily available to everyone. Many pro- exploring personal music collections. The tools are discussed fessional musicians do not know exactly what is out there below. because what is made more readily available are those artists who work with major labels and their major releases. There A. Music Recommendation Systems are many talented individuals who often perform under the Music recommendation systems arose because there was a radar and there are also many interesting collaborations major market for users who were interested in listening to new music artists make that are not as well advertised. These are the sorts similar to artists they already liked. There are two popular web of questions professional musicians would like to answer but sites that do music recommendation: Last.fm2 and Pandora3. only with the dawn of the Internet has it been possible to Last.fm is a web site that uses a social networking approach access so much music data. to music recommendation. When a user creates their account 1 We make extensive use of a web site called Discogs to on Last.fm they are asked to import the data of the music access user-submitted data regarding the releases of musicians. (scrobbling) they listen to from their computer or on internet To contribute to this site users must first find a release that radio and Last.fm will then find other users who listen to does not already exist in the database and create a new entry. the same music. Once Last.fm encounters users who listen The users of the community then rate how correct the data is to a lot of the same songs Last.fm will recommend songs the through a voting system and as long as the data is determined user has not listened to yet from the play-lists of these other to be incomplete it is color coded red in the database. Through users. Last.fm plays these songs using the same format as radio this system of community vigilance the data is determined to stations, it does not allow the user to pause a song but users be correct according to the masses, much like how web sites can skip songs they do not like. such as wikipedia work. MusicDigger, our solution, uses the Pandora is a web site that uses the data in the Music Genome Discogs database of user-submitted data in the background Project to make its recommendations. The Music Genome and adds functionality to visualize the information. We have Project has isolated over 400 different musical attributes and it tags songs and artists with these attributes. The user is then 1http://www.discogs.com/ given the option to create radio stations based on a particular song/artist. Pandora finds other artists/songs that are similar to a particular song/artist and plays its recommendations to the user like a music radio. This method of recommendation differs from Last.fm because the Music Genome Project has 2http://www.last.fm/ Fig. 1. MusicDigger Logo 3http://www.pandora.com/ come up with musical attributes and classified songs/artists remotely request the data of certain releases if the release id using experts in the field of music as opposed to a social of the Discogs database is known. We have made use of data networking approach. from the latest database dump made on April 2011. B. Tools for Exploring Personal Music Collections IV. MUSICDIGGER -DESIGN AND IMPLEMENTATION Tools such as MuVis [4] and MusicBox [5] allow the user to explore their personal music collections in different ways. MusicDigger is an application written in C++ that allows MuVis focuses on visualizing a large collection of music using users to discover discographies and browse the networks treemaps. The ordering of the treemap is based on a pivot artist of artists using the Discogs database. Our code has many selected by the user that gets displayed in the top-left corner. moving parts but the main two visualizations are the timeline Other artists in the music collection are mapped according to visualization and the network visualization. their similarity to the pivot artist. The user is allowed to control 1) Timeline visualization: The releases of an artist or a label other variables such as the size and color of each node in the are displayed on a timeline making it easier to visualize treemap and the user can filter results by the duration of the along with various other filtering and view controls. track, the release year, the genre, the beat, and the mood. 2) Network visualization: Connections are identified be- MusicBox, on the other hand, maps a music collection onto tween a artist of choice to other artists based on col- 2D space by applying principal components analysis (PCA) laborations in releases, links through other artists/labels to contextual and content-based features. After projecting the etc. songs in a collection onto 2D space, the MusicBox application In order to feed the data to these two visualizations we then plots a scatter diagram of the songs. The guiding intuition required code to parse XML files, populate our own database in navigating this visualization is that similar songs are closer using Discogs data, write interfacing functions to query the together. database and parse the retrieved information, and code a C. How MusicDigger is Different module to make release and image requests in the background Our tool is different from the others out there in that so we would not have to store all that data on a users machine. it is not a recommendation system or a tool for exploring A. XML Parser personal music collections. Our tool focuses on allowing users to discover music through the connections of artists. In that The XML data available at discogs is required to be parsed sense our tool allows users to explore the music collection of to extract necessary fields used in the application. These fields the Discogs database. are mentioned in Appendix A. We use a stream based parser, Expat to parse the XML sequentially as the huge size of III. DATA DESCRIPTION the data disallows the use of a traditional tree based parser. As mentioned earlier, we have made extensive use of data Chunks of data is sent to the parser that stores the state of found on the Discogs web site. Discogs identifies the data the parse elements. This state preserved by the parser enables under three main categories - Artists, Labels and Releases. the parsing of sequential buffered data. The data stream is Artists can be individuals (Ex: Eric Clapton) or groups of encoded in UTF-8 format and any illegal character is skipped artists (Ex: Beatles, Pink Floyd) who have a set of releases by the parser. Different schema’s present with the label, artist which is referred to as discographies. Each recording can and release data has been incorporated into the parser. The be released on many formats: Vinyl, CD, DVD, etc. Other parser is analogous to a plug-in service. It can be used by information included with the release is the date of production, any application which provides an XML and the type of the the country of production, the genre and styles, the credits, and data. As each structure is parsed, the database populates the the track list. The genre and style determine what category the corresponding tables. Also, the release data loaded in the music falls under: rock, pop, jazz, etc. The track list shows memory by the DataMiner uses the Parser to extract release the number of songs recorded for a particular release and the track information.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    6 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us