Arxiv:1704.05815V1 [Physics.Soc-Ph] 19 Apr 2017 Prox

Headphones on the wire Statistical patterns of music listening practices Thomas Louail1, 2, ∗ and Marc Barthelemy3, 4 1CNRS, UMR 8504 Géographie-cités,13 rue du four, FR-75006 Paris 2Instituto de F´ısica Interdisciplinar y Sistemas Complejos IFISC (CSIC-UIB), Campus UIB, ES-07122 Palma de Mallorca 3Institut de Physique Théorique,CEA, CNRS-URA 2306, FR-91191 Gif-sur-Yvette 4CAMS (CNRS/EHESS) 190-198, avenue de France, FR-75244 Paris Cedex 13 We analyze a dataset providing the complete information on the effective plays of thousands of music listeners during several months. Our analysis confirms a number of properties previously highlighted by research based on interviews and questionnaires, but also uncover new statistical patterns, both at the individual and collective levels. In particular, we show that individuals fol- low common listening rhythms characterized by the same fluctuations, alternating heavy and light listening periods, and can be classified in four groups of similar sizes according to their temporal habits - \early birds", \working hours listeners", \evening listeners" and \night owls". We provide a detailed radioscopy of the listeners' interplay between repeated listening and discovery of new content. We show that different genres encourage different listening habits, from Classical or Jazz music with a more balanced listening among different songs, to Hip Hop and Dance with a more heterogeneous distribution of plays. Finally, we provide measures of how distant people are from each other in terms of common songs. In particular, we show that the number of songs S a DJ should play to a random audience of size N such that everyone hears at least one song he/she currently listens to, is of the form S ∼ N α where the exponent depends on the music genre and is in the range [0:5; 0:8]. More generally, our results show that the recent access to virtually infinite catalogs of songs does not promote exploration for novelty, but that most users favor repetition of the same songs. The reasons why human beings like listening to mu- ing the uses of music and their self-declared impor- sic, the variety of emotions music can arouse, its uses tance as relevant classifiers [26, 27]. Statistical physi- and functions in human societies: those are some long cists also contributed by highlighting structural prop- lasting questions which have been discussed by music erties of artists and genres communities that emerge critics and by scientists belonging to a wide range of from the analysis of personal libraries of audio files disciplines. From the early musicology [1] to popular [28]. music studies [2] through sociology of cultural prac- However, relatively little is known about how pre- tices [3], geography [4, 5], music history [6, 7], cul- cisely we listen to recorded music on a daily basis. By tural economics [8, 9], educational and cognitive psy- how we refer here to some kind of detailed, quantified chology [10{13], physiology and neurosciences [14, 15], radioscopy of our contemporary listening practices of an eclectic scientific litterature has illuminated many recorded music, an important aspect of the relation different facets of music listening. At a collective we entertain with music. level, it has been demonstrated several times that Until recently, any empirical research willing to an- statistical relations between inherited social charac- swer to questions pertaining to daily listening prac- teristics of individuals and their musical preferences tices had to rely on surveys and interviews. The exist [3, 16{18]. At the individual level, studies rely- technological and societal evolutions have sustained ing on questionnaires, interviews or experiments con- the development of new mobile devices, online tools ducted in controled environments have documented and listening possibilities, as well as new actors in both the functions attributed to listening and the the music industry. Music-on-demand services have emotions aroused, in various situations of daily life quickly gained in popularity over the last few years, and in different contexts [10, 13, 15, 19]. The influ- and for example, according to a recent report of the ence of the device on the listening practice [20], the French national syndicate of phonographic publishing effects of listening on a number of daily activities { [29], more than three million of French residents (ap- e.g. performance at work [21], driving [22], coping arXiv:1704.05815v1 [physics.soc-ph] 19 Apr 2017 prox. 4% of the total population) were subscribing to and regulating emotions [12, 23] { or the rewarding as- an on-demand streaming music platform in 2016, and pects of music-evoked sadness [24] are other examples roughly 1=3 of the total french population regularly of listening-related research. Classifications of listen- stream audio content. The data recorded by stream- ers have been proposed, with some authors concluding ing platforms offer great possibilities to analyze and about the existence of a direct relation between musi- hopefully better understand individual and collective cal preferences and cognitive styles [25], other stress- listening practices. Whenever an individual plays a song through such a service, with a web browser or dedicated applica- ∗ Correspondence and requests for materials should be ad- tion, online or offline, all known information associ- dressed to TL (email: [email protected]) ated with the stream are logged in the company's 2 database. For example, when Amélieplays The Roots' pute for each individual its average t¯i over all days Kool On on her mobile phone, a new entry is added to and we show on Fig. 1b the normalized (ti=t¯i) distri- the database, informing that at 8:23 am on 2/10/2015 butions for the three groups. These normalized dis- she played the song (Kool On) by The Roots, from tributions collapse onto a single curve, indicating that the album Undun. Incidentally, we also know the whatever how much music they listen to, individu- genres tags associated to the song, the stream's du- als display a common behavior characterized by the ration (and consequently if she listened to the song same fluctuations in listening times, alternating days entirely or not), the information declared during reg- with relatively few music listened and days of heavy istration, including her age and city of residence, and listening. These distribution are peaked and can be possibly some additional contextual information (e.g. fitted by an exponential function, suggesting a Pois- if the song was part of one of Amélie'splaylists, if she son nature of the listening behavior, in contrast with was online or offline when listening to the song, possi- previous results on daily human behavior [30]. bly the city where Améliewas located when listening, We then wonder at what time of the day people etc.). Such a record of information is added to the listen to music. We introduce u(h) which counts database for each single play of the millions of regis- the number of individuals listening to music at time tered users of the service. Once anonymized, users' h. In order to highlight the collective rhythm we listening history data can be processed and analyzed, R h+1 R 23 plot the normalized values h u(h)= 0 u(h), where and those digital traces passively produced while lis- 23 R u(h) is the total number of unique individuals that tening consequently constitute an unprecedented em- 0 listened to music during the day (see Supplementary pirical source to study quantitatively contemporary Figure 1). Like other daily activities which have been listening practices. heavily studied from individual traces [31, 32] the ag- In the following we analyze the listening history gregated curves of activity display two characteristic data of one of the major streaming platforms. The patterns, one for weekdays and another for weekends. data correspond to the entire listening history of about However not everyone listens to music at the same five thousands users during one hundred days (see Ma- hours, and for each listener i we calculate his/her pro- terial for details). We note that for some of these portion ph of plays that occurred between h and h+1, people the music streamed may represent only a lim- i averaged over the 100 days period. We then construct ited subset of all the recorded music they listened to the 24-values vector (p0; : : : ; p23) and using these time during that period, and their data might not be rep- i i profiles we cluster the listeners, and find four typical resentative of their entire listening practice [20]. In groups whose average profiles are shown in Fig. 1c order to control this bias we selected a set of anony- (see the Appendix for details on the clustering). These mous users among those who displayed a frequent use time profiles are very specific: in one group individuals of the service (see Appendix). For these listeners we listen to music mostly in the morning (\Early birds"); can reasonably assume that the streaming platform, in another we observe two listening peaks, one in the if not exclusive, constitutes a daily source of music. morning and the other in the afternoon (\Working hours listeners"); there are also individuals listening RESULTS to music mostly at the end of the day (\Evening listeners"); and finally those whose listening peak is late in the evening and during the night (\Night owls"). Rhythms We start by quantifying the relation that individuals have with music listening as a daily activity, and Difference and repetition the rhythms and typical hours at which they perform this activity. For all individuals, we compute the total We now investigate how individuals are distributed time they spent listening during the 100 days period along various dimensions of music listening.

Load more