Characteristics of Streaming Media Stored on the Web Mingzhe Li Worcester Polytechnic Institute, [email protected]
Total Page:16
File Type:pdf, Size:1020Kb
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by DigitalCommons@WPI Worcester Polytechnic Institute DigitalCommons@WPI Computer Science Faculty Publications Department of Computer Science 5-1-2003 Characteristics of Streaming Media Stored on the Web Mingzhe Li Worcester Polytechnic Institute, [email protected] Mark Claypool Worcester Polytechnic Institute, [email protected] Robert Kinicki Worcester Polytechnic Institute, [email protected] James Nichols Worcester Polytechnic Institute, [email protected] Follow this and additional works at: http://digitalcommons.wpi.edu/computerscience-pubs Part of the Computer Sciences Commons Suggested Citation Li, Mingzhe , Claypool, Mark , Kinicki, Robert , Nichols, James (2003). Characteristics of Streaming Media Stored on the Web. Retrieved from: http://digitalcommons.wpi.edu/computerscience-pubs/133 This Other is brought to you for free and open access by the Department of Computer Science at DigitalCommons@WPI. It has been accepted for inclusion in Computer Science Faculty Publications by an authorized administrator of DigitalCommons@WPI. WPI-CS-TR-03-18 May 2003 Characteristics of Streaming Media Stored on the Web by Mingzhe Li Mark Claypool Robert Kinicki Jim Nichols Computer Science Technical Report Series WORCESTER POLYTECHNIC INSTITUTE Computer Science Department 100 Institute Road, Worcester, Massachusetts 01609-2280 1 Characteristics of Streaming Media Stored on the Web Mingzhe Li, Mark Claypool, Robert Kinicki and James Nichols lmz,claypool,rek,jnick ¡ @cs.wpi.edu Computer Science Department Worcester Polytechnic Institute Worcester, MA, 01609, USA Abstract— The increasing power and connectivity of to- mated that 350,000 hours of online entertainment was be- day’s computers have spurred the growth in streaming audio ing broadcast each week over the Internet, and this statis- and video available on the Internet through the Web. While tic does not include the volume of additional hours down- there is substantial research characterizing the performance loaded on-demand by Web users around the world. of streaming media and characterizing documents stored on the Web, there have been few studies characterizing stream- CAIDA [2] emphasized in 2002 the significant frac- ing audio and video stored on the Web. We crawled over tion of Internet link capacities that were already being 17 million Web pages from key geographic locations and ex- allocated to support streaming media applications. An- tracted nearly 30,000 streaming audio and video clips for nouncements such as RealNetworks’ [3] press release to analysis. Using custom built tools, we analyzed the charac- support the advancement of streaming multimedia appli- teristics of these multimedia objects, determining such in- cations over wireless cellular networks have added to the formation as media type, encoding format, playout dura- tion, bitrate, resolution, and codec. We find proprietary au- concern among Internet performance experts about be- dio and video formats dominate all multimedia content, pri- ing able to support even more readily available access to marily content by RealNetworks followed next by Microsoft streaming media clips through the Web. This anxiety over and with Apple following just behind MP3. The playout du- future streaming media applications significantly restrict- rations of streaming audio and video clips are long-tailed, ing performance of other Web users has translated into a suggesting streaming media may contribute self-similar traf- variety of research papers that propose new network pro- fic on the Internet. More than half of all streaming media tocols [4], [5] or more sophisticated network router al- clips on the Web are video, with 90% of videos targeted gorithms that seek to lessen the impact of streaming me- for broadband connections. Video resolutions are consider- ably smaller than typical monitor resolutions, implying that dia [6], [7], [8], [9]. Several recent research efforts [10], video bitrates, which are directly related to resolutions, have [11], [12], [13], [14], [15], [16] have focused on captur- enormous potential to increase. The detailed results from ing the characteristics of current streaming application be- this study should be useful for future studies characterizing havior to better understand its impact. Only by knowing the performance of streaming media on the Web and also the nature of commercial streaming products and how they valuable for those interested in generating more accurate In- typically stream multimedia traffic can researchers begin ternet traffic simulations. to prepare for the next generation of Web users. Keywords— Multimedia, Streaming, RealNetworks Re- Unfortunately, there is little recent published work on alPlayer, Microsoft Windows Media Player, Apple Quick- Time, Self-similarity, Long-tailed the exact characteristics of streaming media stored on the Web. While there have been studies characterizing Web content [17], [18] measured at the client side, there I. INTRODUCTION have been no recent studies on the general characteristics Improvements in the power and connectivity of today’s of streaming media stored at the Web server. In 1997, computers have enabled the growth in Web users who Acharya and Smith [19] characterized video content stored cross cultural and national boundaries to stream multime- on the Web by analyzing every video available in the (then dia applications from far away Web servers to browsers popular) Alta Vista search engine. However, the nature on their desktops. Whether it is news, sports or entertain- of streaming media has changed considerably since that ment clips, the newest generation of Web users have come time. For example, Acharya and Smith [19] found that to expect audio and video streams at their fingertips by in 1997 the Internet could not support real-time streaming simply clicking on a browser link to automatically start given the encoded bitrates and available last-mile data ca- playing streaming media. In 2001, Real Networks [1] esti- pacities. Today, RealNetworks RealPlayer and Microsoft 2 Media Player, two popular streaming media products [20] tent providers use to deal with limited capacities at last- that did not even exist in 1997, have significantly improved mile connections. Web users’ ability to stream multimedia to home comput- ¢ What fraction of the many streaming media codecs ers. available are being used? Innovative compression tech- Two papers, one by Ousterhout et al [21] and the other nologies in new codecs have the potential to deliver higher by [22], provide good examples where fundamental re- quality video with lower bitrates. Moreover, new codecs search on understanding the nature of data stored in file incorporate technologies that yield more sophisticated be- systems and studying how these files were likely to be ac- haviors that adapt to network conditions to improve quality cessed, proved to be influential in the design of new file and performance. Understanding the percentage of older systems and distributed file systems. The accessibility to codecs that persist on the Internet provides information media clips on the Web through RealNetworks and Win- as to the speed at which new codec technologies are de- dows Media Players has reached such a state that simi- ployed. lar fundamental research on the nature of streaming media This paper provides detailed information to answer stored on the Web is critical to understanding the impact these questions on streaming media characteristics on the of streaming traffic on future Internet performance. Web today. Since commercial products have had a signifi- This investigation built customized tools to answer the cant influence on streaming traffic, our analysis focuses on following questions about the characteristics of streaming commercial streaming products such as Microsoft’s Media media stored on the Web today: Player, Real Networks’ RealPlayer and Apple QuickTime. ¢ What are the most popular streaming technologies? Pre- Unlike other measurement studies that have viewed real vious research [12] has shown that proprietary encoded streaming traffic by monitoring behavior near clients or media products significantly differ in their impact on servers [10], [13], [14], [27], [28], this investigation seeks streaming network traffic, even when the products utilize the wider perspective of reviewing streaming content at the same network bitrates. Similar to the situation in 1997 media servers world-wide. While there is substantial audio when the large user base for MPEG, AVI and QuickTime and video content stored on peer-to-peer (p2p) file sharing was an obstacle for incoming streaming technologies, by systems [29], [30], this content is not typically streamed quantifying today’s dominant technologies one can un- at a target bitrate, but is typically first downloaded as fast cover current obstacles for future media applications. as capacity will allow and subsequently played. Thus, the ¢ What are the relative amounts of streaming audio clips network traversal behavior for p2p file sharing systems is versus streaming video clips? The type of media, whether more similar to bulk file transfer than to streaming. Since audio or video, has a large impact on performance require- this study is focused on the characteristics of streaming ments. Audio often requires only modest bitrates but typ- media that is played out in real-time, analysis of the con- ically has very discrete encoded bitrates. Video, on the tent characteristics of audio and video stored