Exploiting Facebook, Flickr, and Picasa : Utilizing Photo Sharing
Total Page:16
File Type:pdf, Size:1020Kb
University of Konstanz Department of Computer and Information Science Distributed Systems Group Master Thesis Exploiting Facebook, Flickr, and Picasa: Utilizing Photo Sharing Websites as Cloud Storage Backends Scientific Thesis in Fulfillment of the Requirements for the Degree of Master of Science (M.Sc.) Author: Wolfgang Miller (01/612437) First Assessor: Professor Dr. Marcel Waldvogel Second Assessor: Professor Dr. Dietmar Saupe Advisor: Sebastian Graf, M.Sc. July 2013 Wolfgang Miller Exploiting Facebook, Flickr, and Picasa: Utilizing Photo Sharing Websites as Cloud Storage Backends, c July 2013 Abstract A major application of cloud computing is to supply storage that is often advertised by providers as being reliable, available, functional, and cost efficient because it relies on a massive infrastructure available in the cloud. Photo sharing websites rely on similar infrastructures to handle the amount of uploaded image data; however, in contrast to conventional cloud storage providers, they offer free storage capacities for images. Due to the fact that images can be used as data containers to store arbitrary information, it is possible to exploit these services as free cloud storage. In this thesis we describe our approach for the development of a fully functional photosharing website driven cloud storage system based on twelve encoding approaches. These encoding approaches allow for adjusting our system to the requirements of a specific photo sharing website and to gain the best trade-off between the actual data ratio of an image and its robustness against image compression. The system’s performance has been tested with the three major photo sharing websites Facebook, Flickr, and Picasa Web Albums, and has been compared with Amazon’s professional cloud storage service AWS S3. The results show that the system can be an alternative in scenarios in which the overall performance is not that important, such as backup systems. Our developed photosharing website driven cloud storage system is accessible as an open source and implemented as part of the widely used jClouds framework. III IV IV Acknowledgements My special thanks goes to my family, my parents Willi and Anneliese Miller, my brother Sebastian Miller, and my grandmother Adelheid Rothmund, with whom I lived during the first years of studying in Konstanz. They made it possible for me to study by giving me always great support. I would like to thank Professor Dr. Marcel Waldvogel and the members of the DiSy work- ing group for the excellent support. I would also like to thank Professor Dr. Dietmar Saupe for taking the time to assess this thesis. Special thanks goes to Sebastian Graf. Throughout my studying I became to know and appreciate Sebastian in several roles: As a fellow student, a lecturer, a colleague, the su- pervisor of this thesis, and most importantly as a valuable friend. Thanks Sebastian for everything and good luck for your little family! During my time in Konstanz, I met many nice people whom I studied with, and with whom I spend very good times. A special thanks to all my friends, who made me feel at home in Konstanz. Particularly I want to mention my master clique: Anja Fauth, Laura Lorenz, Volker Rehberg, Markus Hankh, Alexander Walter, Benno Geißelmann, Jochen Budzinski, Lin Shao, and Roland Jungnickel. I also want to give special thanks to my very good friends Johanna Harde and Andreas Kraft, who gave me a great support during the writing of this thesis. Last but not least, I would like to especially thank Anna Dowden-Williams for her more than valuable input in writing this thesis in English. Thank you Anna for all the time you invested in helping me and for your friendship. V VI VI Contents 1. Introduction1 1.1. Motivation . .1 1.2. Contribution of the Thesis . .2 1.3. Key Points of the Thesis . .3 1.4. Related Work . .4 1.4.1. Image Steganography . .5 1.4.2. 2D-Codes . .6 1.4.3. Projects with similar Orientation . .8 1.4.4. Conclusion and Comparison . .9 2. Encoding Approaches 11 2.1. Intention . 12 2.2. First Encoding . 13 2.2.1. Encoding Workflow . 14 2.2.2. Decoding Workflow . 15 2.2.3. Results . 16 2.3. Advanced Encoding . 17 2.3.1. Basics on Numeral Systems . 18 2.3.2. Relationship between Radix γ and Data Ratio . 18 2.3.3. Results . 20 2.4. Single-Layer-Approaches . 21 2.4.1. Encoding . 22 2.4.2. Decoding . 26 2.4.3. Results . 31 2.5. Multi-Layer-Approaches . 32 2.5.1. Encoding . 34 2.5.2. Decoding . 39 VII VIII Contents 2.5.3. Results . 39 2.6. Images and Colors . 40 2.6.1. Types of Images . 40 2.6.2. Predefined Colors Single-Layer-Approaches . 42 2.6.3. Generated Colors Multi-Layer-Approaches . 46 2.7. Summary . 47 2.7.1. Benchmarks . 48 2.7.2. Results . 51 3. Implementation of our Photosharing Website driven Cloud Storage System 53 3.1. Photosharing Website driven Cloud Storage System . 53 3.1.1. Error Correcting Code (Reed Solomon) . 54 3.1.2. Areas and Sectors of generated Images . 56 3.1.3. Integration in jClouds and System Workflows . 59 3.2. Facebook . 62 3.2.1. Features . 63 3.2.2. Implementation . 63 3.2.3. Benchmarks . 64 3.2.4. Results . 67 3.3. Flickr . 67 3.3.1. Features . 68 3.3.2. Implementation . 69 3.3.3. Benchmarks . 69 3.3.4. Results . 74 3.4. Picasa Web Albums . 74 3.4.1. Features . 74 3.4.2. Implementation . 75 3.4.3. Benchmarks . 75 3.4.4. Results . 78 3.5. Comparison Facebook, Flickr, and Picasa Web Albums . 79 3.5.1. Feature Comparison . 79 3.5.2. Overhead Comparison . 81 3.5.3. Performance Comparison . 82 3.5.4. Results . 84 4. Conclusion 87 4.1. Results . 87 VIII Contents IX 4.2. Future Work . 88 4.2.1. Benchmarks with larger Data and Flickr’s new Service . 88 4.2.2. Enhance Images with Visual Information . 89 4.2.3. Automatic Determination of Best Encoding Parameters . 90 4.2.4. Step from Images to Videos . 90 Appendices 91 A. Random Input Bytes 93 B. Encoding Approaches Naming Convention 95 Bibliography 97 IX X Contents X 1 Introduction 1.1. Motivation In the last years, cloud computing has become one of the most influencing and widely discussed trends in the IT sector. Big players like Apple, Amazon, Google, and Microsoft developed new cloud services and pushed the idea to outsource to third party providers launching marketing campaigns to gain the attention of the end consumer market. Con- sequently, today the term “cloud computing“ is well-known not only by computer en- thusiasts. The pull-effect of the cloud trend also brought up new business-concepts and start-up companies with fresh ideas. The credo seems to be “everything is better with the cloud“. Through this background a huge variation of services around cloud-computing have mushroomed. From services for companies to exclude a lot of their internal IT- infrastructure to trustful third parties, e.g. the local provider Adlon Datenverarbeitung Sys- tems GmbH [Adl13], to collaborative file storage for end-users, e.g. like Dropbox [Dro13]. Even gaming, whereas the complex graphics of video games are rendered in the cloud and users stream the precalculated scenes via client. A fitting example would therefore be Sony’s cloud-based gaming service Gaikai [com13]. Besides additional computing power, the major application of cloud services is to supply storage. This storage is often advertised as being reliable, available, functional, and cost efficient. Provider offers for end-users extend from services like the mentioned Dropbox or Apple’s iCloud [App13] to external storage, data backup, synchronizing and sharing general data, to professional big data NoSQL storage like Amazon’s AWS S3 [Ama13] or Microsoft’s Azure [Mic13]. With the cloud storage another type of service exploited the availability of large storage areas, namely: photo sharing websites. Photo sharing websites are storage providers specialized to store and share photos. How- 1 2 1.2. Contribution of the Thesis ever, these sites rely on massive server infrastructures that utilize the same reliable stor- age techniques like other cloud storage services, they do not require fees. Examples for such services are Facebook [Fac13a], Yahoo’s Flickr [Fli13a], and Picasa Web Albums [Goo13a] from Google. Taking these facts about photo sharing websites into account, we questioned whether it is possible to exploit these services and use them as free cloud storage for any kind of data. Whereby the major issue was finding a way of how to negotiate the constraint of only being able to upload photos. In this thesis the development of a fully functional photosharing website driven cloud storage system will be shown. The last part of the thesis will provide a closer look at the available providers implemented in our system, namely Facebook, Flickr, and Picasa Web Albums, and we will analyze how well they perform in our cloud storage scenario. Flickr has shown that it is worth thinking about utilizing photo sharing websites as cloud storage. On May 20th, 2013 Flickr completely renewed its service offer [Fli13c]. Since that day Flickr allows to upload and store image data up to a size of 1TB for free instead of their previous monthly limit of 300MB. Another important change is that they now allow to access the original uploaded images and no longer apply JPEG-compression, which, as we will see, makes it a lot easier to store data. Since Flickr’s restart took place during the time this thesis was written, we only refer to features before the restart.