Measurement-Based Modeling of Distributed Systems

Total Page:16

File Type:pdf, Size:1020Kb

Measurement-Based Modeling of Distributed Systems Measurement-Based Modeling of Distributed Systems Meßbasierte Modellierung verteilter Systeme Der Technischen Fakultät der Universität Erlangen-Nürnberg zur Erlangung des Grades DOKTOR-INGENIEUR vorgelegt von Kai-Steffen Jens Hielscher Erlangen - 2008 Als Dissertation genehmigt von der Technischen Fakultät der Universität Erlangen-Nürnberg Tag der Einreichung: 12. März 2008 Tag der Promotion: 21. April 2008 Dekan: Prof. Dr.-Ing. habil. Johannes Huber Berichterstatter: Prof. Dr.-Ing. Reinhard German Prof. Dr.-Ing. Wolfgang Schröder-Preikschat Contents List of Figures7 List of Tables9 Abstract 11 Zusammenfassung 15 1 Introduction 19 2 Related Work 23 ó.Õ Measurements.............................. óì ó.ó Time Synchronization.......................... ó¦ ó.ì Input Modeling.............................. ó¢ ó.¦ Performance Evaluation of Web Servers............... ó¢ 3 The Web Cluster Laboratory 27 ì.Õ e Linux Virtual Server System.................... ó ì.ó Hardware Setup............................. ìþ 4 Measurement Concepts 33 ¦.Õ Computer Clocks............................ ì ¦.ó Clock Errors............................... ìÉ ¦.ó.Õ Classication in the Frequency Domain........... ¦ä ¦.ó.ó Classication in the Time Domain.............. ¦ ¦.ì Reference Clocks............................. ¢Õ ¦.ì.Õ NTP............................... ¢ó ¦.ì.ó Time Sources.......................... ¢¦ ¦.ì.ì e PPS API.......................... ¢¢ ì Contents 5 Dedicated Measurement Infrastructure 59 ¢.Õ PPS Pulse Latency............................ äì ¢.Õ.Õ Echo Feedback......................... ä¢ ¢.ó Oine Synchronization......................... äÉ ¢.ì Instrumentation............................. ßì ¢.ì.Õ IP Stack Instrumentation................... ßì ¢.ì.ó Web Server Instrumentation................. ßß ¢.ì.ì Load Generator Instrumentation............... ßß ¢.ì.¦ Application Server Instrumentation............. ß ¢.ì.¢ Summary Performance Data................. Õ ¢.¦ Analysis of the Traces.......................... ó ¢.¢ Example Measurement Results..................... ì 6 Advanced Input Modeling 91 ä.Õ Traces and Empirical Distributions.................. Éó ä.ó Outlier Values.............................. ɦ ä.ì Autocorrelation............................. É¢ ä.¦ Standard eoretical Distributions.................. Éß ä.¢ Multimodal Distributions........................ ÕþÕ ä.ä Multimodal Distributions with Phases................ Õþó ä.ß Bézier Distributions........................... Õþ¢ ä. A New Model for Autocorrelated Data................ Õþß 7 Simulation Model 117 ß.Õ Model Structure............................. ÕÕ ß.ó TCP.................................... ÕÕÉ ß.ó.Õ RFC ßÉì............................. Õóþ ß.ó.ó RFC ÕÕóó............................. Õóì ß.ó.ì RFC Õìóì............................. Õó¦ ß.ó.¦ RFC ó¢Õ............................. Õó¢ ß.ó.¢ RFC óÉ............................ Õóä ß.ì Client................................... Õó ß.ì.Õ Application........................... Õó ß.ì.ó TCP............................... ÕóÉ ß.ì.ì Processor............................ ÕóÉ ß.¦ Network Channels............................ Õìþ ß.¢ Load Balancer.............................. Õìó ¦ Contents ß.ä Servers.................................. Õìó ß.ä.Õ Processes............................ Õìì ß.ä.ó System Processes........................ Õì¦ ß.ä.ì Processor............................ Õì¢ ß.ß Utility Classes and Execution Control................. Õìä ß. Experiments............................... Õìß 8 Conclusions and Future Work 143 Bibliography 147 ¢ Contents ä List of Figures ì.Õ Distributed Web Server Architecture................. óß ì.ó Load Balancing via NAT........................ óÉ ¦.Õ Hardware Monitoring.......................... ì¢ ¦.ó Soware Monitoring.......................... ìä ¦.ì Hybrid Monitoring........................... ìß ¦.¦ Latencies for Reading the Time.................... ìÉ ¦.¢ Frequency Changes with Temperature................ ¦Õ ¦.ä Frequency Variation........................... ¦ó ¦.ß Frequency Distribution......................... ¦ì ¦. Phase Errors............................... ¦¦ ¦.É UDP Delays................................ ¦¢ ¦.Õþ Power-Law Spectral Densities..................... ¦ß ¦.ÕÕ Allan Deviation............................. ¢þ ¦.Õó NTP Time Transfer........................... ¢ó ¦.Õì NTP Architecture............................ ¢¦ ¦.Õ¦ NTP and the PPS API.......................... ¢ä ¢.Õ Detail of UDP Delays.......................... äÕ ¢.ó Synchronization System......................... äó ¢.ì Interrupt Latencies............................ ä¢ ¢.¦ External Clock.............................. ää ¢.¢ Time Deviation............................. ä ¢.ä Oine Synchronization......................... ßþ ¢.ß IP Stack Instrumentation........................ ߢ ¢. Application Server Instrumentation Architecture.......... þ ¢.É Illustration of Delays in the Object System.............. ä ¢.Õþ Trace Plot of Measured Delays..................... ß ¢.ÕÕ Trace Plots of Individual Delays.................... ß List of Figures ¢.Õó Delay Components for Requests.................... É ¢.Õì Summary Statistics the Delays..................... Éþ ä.Õ Histograms of Observed Delays.................... Éì ä.ó Correlation Plots (lag ≤ ¢þþ)...................... Éä ä.ì Correlation Plots (lag ≤ ¦þ)....................... É ä.¦ Trace Plots Sorted by Real Server...................ÉÉ ä.¢ Distribution Comparison for Delay óó................ ÕþÕ ä.ä Distribution Comparison for Delay ì................. Õþì ä.ß State Chart for Phase Transitions................... Õþ¦ ä. Distribution Comparison for Delay ÕÉ................ Õþ¢ ä.É Screenshot of PRIME.......................... Õþä ä.Õþ Distribution Comparison for Delay Õ................ Õþ ä.ÕÕ Histogram Ho of the Deltas for Delay ¢................ ÕþÉ ä.Õó Trace Plot of Delay ¢........................... ÕÕþ ä.Õì Delta over the Values of Delay ¢.................... ÕÕÕ ä.Õ¦ ìD Histogram of Delta ¢........................ ÕÕó ä.Õ¢ Weighting Areas............................. ÕÕì ä.Õä Weighting Factors............................ ÕÕ¦ ä.Õß Original and Weighted Histogram for Delta ¢............ ÕÕ¦ ä.Õ Distribution Comparison for Delay ¢................. ÕÕ¢ ß.Õ Conceptual Model............................ ÕÕÉ ß.ó TCP.................................... Õóþ ß.ì Model of a TCP Segment........................ ÕóÕ ß.¦ Central TCP State Chart receive_packet ........... Õóó ß.¢ Structure of the Client Object..................... Õó ß.ä Conceptual Model of the Network Channels............. ÕìÕ ß.ß Server Model and Embedded Objects................. Õìì ß. Graphical Comparison of the Results................. Õ¦þ List of Tables ¦.Õ Slope Characteristics.......................... ¢Õ ¢.Õ Quantile Summary for Delays in Microseconds........... ¢ ä.Õ Fitted Standard eoretical Distributions.............. Õþþ ä.ó Fitted Multimodal Distributions.................... Õþó ä.ì Fitted Multimodal Distributions with Phases............ Õþ¦ ß.Õ Core Simulation Parameters...................... ÕìÉ ß.ó Quantile Comparisons in Milliseconds................ Õ¦Õ ß.ì CPU Load Comparison......................... Õ¦ó É List of Tables Õþ Abstract Nowadays, distributed systems are ubiquitous. Since the delays during processing in such systems are oen essential, many research projects deal with performance analyses of these systems. Most of them treat the systems from an abstract point of view and oarse-grained models are built. ese do not include results of detailed measurement studies of real systems. e goal of this work is to demonstrate a methodology that allows to create precise models of distributed systems that are parametrized, calibrated and validated from ne-grained measurements of a laboratory setup. e approach is exemplied on a cluster-based web server system. e resulting model contains many details that inuence the behavior and performance of the system like one-way delays or system activity caused by the hardware. Since network aspects play a central role in distributed systems, it is important to be able to capture the timing characteristics of packet delays in the network exactly. erefore, a modular system has been developed for the Linux operating system that allows to record sent and received TCP segments and to generate timestamps for the sending and receiving actions. For that purpose, the netlter framework has been extended to insert packet headers and corresponding timestamps into a ring buer in kernel space. e timestamps in the resulting event trace are generated using the clock of the object system that is observed. To calculate one- way delays for packets, it is necessary to synchronize the clocks of the nodes, because the timestamps for the sending and the receiving event are taken from dierent clocks. is can be achieved with standard solutions like the use of NTP during the measurement. An alternative that is more suitable in many situations is the use of a dedicated oine synchronization. is method is an own development based on an algorithm classically used for online synchronization of computer clocks. e method allows to use the cycle counter of the processor (TSC) of the object system for timestamping. Due to this feature, a context switch for obtaining kernel clock timestamps can be avoided. erefore, the latencies for ÕÕ Abstract reading the clocks are minimized. To implement this solution, the PPS output of a GPS receiver is connected to the nodes of the object system that need to be synchronized. PPS
Recommended publications
  • Model Driven Scheduling for Virtualized Workloads
    Model Driven Scheduling for Virtualized Workloads Proefschrift voorgelegd op 28 juni 2013 tot het behalen van de graad van Doctor in de Wetenschappen – Informatica, bij de faculteit Wetenschappen, aan de Universiteit Antwerpen. PROMOTOREN: prof. dr. Jan Broeckhove dr. Kurt Vanmechelen Sam Verboven RESEARCH GROUP COMPUTATIONAL MODELINGAND PROGRAMMING Dankwoord Het behalen van een doctoraat is een opdracht die zonder hulp onmogelijk tot een goed einde kan gebracht worden. Gelukkig heb ik de voorbije jaren de kans gekregen om samen te werken met talrijke stimulerende collega’s. Stuk voor stuk hebben zij bijgedragen tot mijn professionele en persoonlijke ontwikkeling. Hun geduldige hulp en steun was essentieel bij het overwinnen van de vele uitdagingen die met een doctoraat gepaard gaan. Graag zou ik hier dan ook enkele woorden van dank neerschrijven. Allereerst zou ik graag prof. dr. Jan Broeckhove en em. prof. dr. Frans Arickx bedanken om mij de kans te geven een gevarieerde en boeiend academisch traject te starten. Bij het beginnen van mijn doctoraat heeft Frans mij niet alleen geholpen om een capabel onderzoeker te worden, ook bij het lesgeven heeft hij mij steeds met raad en daad bijgestaan. Na het emiraat van Frans heeft Jan deze begeleiding overgenomen en er voor gezorgd dat ik het begonnen traject ook succesvol kon beeïndigen. Beiden hebben ze mij steeds grote vrijheid gegeven in mijn zoektocht om interessante onderzoeksvragen te identificeren en beantwoorden. Vervolgens zou ik graag dr. Peter Hellinckx en dr. Kurt Vanmechelen bedanken voor hun persoonlijke en vaak intensieve begeleiding. Zelfs voor de start van mijn academische carrière, bij het schrijven van mijn Master thesis, heeft Peter mij klaar- gestoomd voor een vlotte start als onderzoeker.
    [Show full text]
  • Towards Better Performance Per Watt in Virtual Environments on Asymmetric Single-ISA Multi-Core Systems
    Towards Better Performance Per Watt in Virtual Environments on Asymmetric Single-ISA Multi-core Systems Viren Kumar Alexandra Fedorova Simon Fraser University Simon Fraser University 8888 University Dr 8888 University Dr Vancouver, Canada Vancouver, Canada [email protected] [email protected] ABSTRACT performance per watt than homogeneous multicore proces- Single-ISA heterogeneous multicore architectures promise to sors. As power consumption in data centers becomes a grow- deliver plenty of cores with varying complexity, speed and ing concern [3], deploying ASISA multicore systems is an performance in the near future. Virtualization enables mul- increasingly attractive opportunity. These systems perform tiple operating systems to run concurrently as distinct, in- at their best if application workloads are assigned to het- dependent guest domains, thereby reducing core idle time erogeneous cores in consideration of their runtime proper- and maximizing throughput. This paper seeks to identify a ties [4][13][12][18][24][21]. Therefore, understanding how to heuristic that can aid in intelligently scheduling these vir- schedule data-center workloads on ASISA systems is an im- tualized workloads to maximize performance while reducing portant problem. This paper takes the first step towards power consumption. understanding the properties of data center workloads that determine how they should be scheduled on ASISA multi- We propose that the controlling domain in a Virtual Ma- core processors. Since virtual machine technology is a de chine Monitor or hypervisor is relatively insensitive to changes facto standard for data centers, we study virtual machine in core frequency, and thus scheduling it on a slower core (VM) workloads. saves power while only slightly affecting guest domain per- formance.
    [Show full text]
  • Big Data Benchmarking Workshop Publications
    Benchmarking Datacenter and Big Data Systems Wanling Gao, Zhen Jia, Lei Wang, Yuqing Zhu, Chunjie Luo, Yingjie Shi, Yongqiang He, Shiming Gong, Xiaona Li, Shujie Zhang, Bizhu Qiu, Lixin Zhang, Jianfeng Zhan INSTITUTE OFTECHNOLOGY COMPUTING http://prof.ict.ac.cn/ICTBench 1 Acknowledgements This work is supported by the Chinese 973 project (Grant No.2011CB302502), the Hi- Tech Research and Development (863) Program of China (Grant No.2011AA01A203, No.2013AA01A213), the NSFC project (Grant No.60933003, No.61202075) , the BNSFproject (Grant No.4133081), and Huawei funding. 2/ Big Data Benchmarking Workshop Publications BigDataBench: a Big Data Benchmark Suite from Web Search Engines. Wanling Gao, et al. The Third Workshop on Architectures and Systems for Big Data (ASBD 2013) in conjunction with ISCA 2013. Characterizing Data Analysis Workloads in Data Centers. Zhen Jia, et al. 2013 IEEE International Symposium on Workload Characterization (IISWC-2013) Characterizing OS behavior of Scale-out Data Center Workloads. Chen Zheng et al. Seventh Annual Workshop on the Interaction amongst Virtualization, Operating Systems and Computer Architecture (WIVOSCA 2013). In Conjunction with ISCA 2013.[ Characterization of Real Workloads of Web Search Engines. Huafeng Xi et al. 2011 IEEE International Symposium on Workload Characterization (IISWC-2011). The Implications of Diverse Applications and Scalable Data Sets in Benchmarking Big Data Systems. Zhen Jia et al. Second workshop of big data benchmarking (WBDB 2012 India) & Lecture Note in Computer
    [Show full text]
  • An Experimental Evaluation of Datacenter Workloads on Low-Power Embedded Micro Servers
    An Experimental Evaluation of Datacenter Workloads On Low-Power Embedded Micro Servers Yiran Zhao, Shen Li, Shaohan Hu, Hongwei Wang Shuochao Yao, Huajie Shao, Tarek Abdelzaher Department of Computer Science University of Illinois at Urbana-Champaign fzhao97, shenli3, shu17, hwang172, syao9, hshao5, [email protected] ABSTRACT To reduce datacenter energy cost, power proportional- This paper presents a comprehensive evaluation of an ultra- ity [47] is one major solution studied and pursued by both low power cluster, built upon the Intel Edison based micro academia and industry. Ideally, it allows datacenter power servers. The improved performance and high energy effi- draw to proportionally follow the fluctuating amount of work- ciency of micro servers have driven both academia and in- load, thus saving energy during non-peak hours. However, dustry to explore the possibility of replacing conventional current high-end servers are not energy-proportional and brawny servers with a larger swarm of embedded micro ser- have narrow power spectrum between idling and full uti- vers. Existing attempts mostly focus on mobile-class mi- lization [43], which is far from ideal. Therefore, researchers cro servers, whose capacities are similar to mobile phones. try to improve energy-proportionality using solutions such We, on the other hand, target on sensor-class micro servers, as dynamic provisioning and CPU power scaling. The for- which are originally intended for uses in wearable technolo- mer relies on techniques such as Wake-On-LAN [36] and gies, sensor networks, and Internet-of-Things. Although VM migration [24] to power on/off servers remotely and dy- sensor-class micro servers have much less capacity, they are namically.
    [Show full text]
  • Adaptive Control of Apache Web Server
    Adaptive Control of Apache Web Server Erik Reed Abe Ishihara Ole J. Mengshoel Carnegie Mellon University Carnegie Mellon University Carnegie Mellon University Moffett Field, CA 94035 Moffett Field, CA 94035 Moffett Field, CA 94035 [email protected] [email protected] [email protected] Abstract least load. Typical implementations of load balancing deal with round robin balancing for domain name service Traffic to a Web site can vary dramatically. At the same (DNS) [7]. Much of existing frameworks and previous time it is highly desirable that a Web site is reactive. To research deal with inter-server connections and balanc- provide crisp interaction on thin clients, 150 milliseconds ing, with little or no focus on dynamic parameter adjust- has been suggested as an upper bound on response time. ment of individual servers. Unfortunately, the popular Apache Web server is lim- Focusing on an individual Web server rather than a ited in its capabilities to be reactive under varying traf- cluster of machines, we investigate in this paper an alter- fic. To address this problem, we design in this paper an native approach to ensuring quick and predictable user adaptive controller for the Apache Web server. A modi- response, namely feedback control and specifically Min- fied recursive least squares algorithm is used to identify imum Degree Pole Placement control [5]. This can be in system dynamics and a minimum degree pole placement an environment where inter-server load balancing is al- controller is implemented to adjust the maximum num- ready in place, or when a single server is handling Web ber of concurrent connections.
    [Show full text]
  • Energy Efficiency of Server Virtualization
    Energy Efficiency of Server Virtualization Jukka Kommeri Tapio Niemi Olli Helin Helsinki Institute of Physics, Helsinki Institute of Physics, Helsinki Institute of Physics, Technology program, CERN, Technology program, CERN, Technology program, CERN, CH-1211 Geneva 23, Switzerland CH-1211Geneva 23, Switzerland CH-1211 Geneva 23, Switzerland [email protected] [email protected] [email protected] Abstract—The need for computing power keeps on growing. combine several services into one physical server. In this The rising energy expenses of data centers have made server way, these technologies make it possible to take better consolidation and virtualization important research areas. advantage of hardware resources. Virtualization and its performance have received a lot of attention and several studies can be found on performance. In this study, we focus on energy efficiency of different So far researches have not studied the overall performance virtualization technologies. Our aim is to help the system and energy efficiency of server consolidation. In this paper we administrator to decide how services should be consolidated study the effect of server consolidation on energy efficiency with to minimize energy consumption without violating quality an emphasis on quality of service. We have studied this with of service agreements. several synthetic benchmarks and with realistic server load by performing a large set of measurements. We found out that We studied energy consumption of virtualized servers with energy-efficiency depends on the load of the virtualized service two open source virtualization solutions; KVM and Xen. and the number of virtualized servers. Idle virtual servers do They were tested both under load and idle.
    [Show full text]
  • Remote Profiling of Resource Constraints of Web Servers Using
    Remote Profiling of Resource Constraints of Web Servers Using Mini-Flash Crowds Pratap Ramamurthy Vyas Sekar Aditya Akella Balachander Krishnamurthy UW-Madison CMU UW-Madison AT&T–Labs Research Anees Shaikh IBM Research Abstract often limited, this can be difficult. Hence, large providers Unexpected surges in Web request traffic can exercise who can afford it typically resort to over-provisioning, or server-side resources (e.g., access bandwidth, process- employ techniques such as distributed content delivery or ing, storage etc.) in undesirable ways. Administra- dynamic server provisioning, to minimize the impact of tors today do not have requisite tools to understand the unexpected surges in request traffic. Smaller application impact of such “flash crowds” on their servers. Most providers may trade-off robustness to large variations in Web servers either rely on over-provisioning and admis- workload for a less expensive infrastructure that is provi- sion control, or use potentially expensive solutions like sioned for the expected common case. CDNs, to ensure high availability in the face of flash This approach still leaves operators without a sense crowds. A more fine-grained understanding of the per- of how their application infrastructure will handle large formance of individual server resources under emulated increases in traffic, due to planned events such as annual but realistic and controlled flash crowd-like conditions sales or Web casts, or unexpected flash crowds. While can aid administrators to make more efficient resource these events may not occur frequently, the inability of the management decisions. In this paper, we present mini- infrastructure to maintain reasonably good service, or at flash crowds (MFC) – a light-weight profiling service least degrade gracefully, can lead to significant loss of that reveals resource bottlenecks in a Web server infras- revenue and dissatisfied users.
    [Show full text]
  • Optimal Power Allocation in Server Farms
    Optimal Power Allocation in Server Farms Anshul Gandhi∗ Mor Harchol-Balter∗ Rajarshi Das† Charles Lefurgy†† March 2009 CMU-CS-09-113 School of Computer Science Carnegie Mellon University Pittsburgh, PA 15213 ∗School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA †IBM T.J. Watson Research Center, Hawthorne, NY, USA ††IBM Research, Austin, TX, USA Research supported by NSF SMA/PDOS Grant CCR-0615262 and a 2009 IBM Faculty Award. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of NSF or IBM. Keywords: Power Management, Server Farm, Response Time, Power-to-Frequency, Data Center Abstract Server farms today consume more than 1.5% of the total electricity in the U.S. at a cost of nearly $4.5 billion. Given the rising cost of energy, many industries are now looking for solutions on how to best make use of their available power. An important question which arises in this context is how to distribute available power among servers in a server farm so as to get maximum perfor- mance. By giving more power to a server, one can get higher server frequency (speed). Hence it is commonly believed that for a given power budget, performance can be maximized by oper- ating servers at their highest power levels. However, it is also conceivable that one might prefer to run servers at their lowest power levels, which allows more servers for a given power budget. To fully understand the effect of power allocation on performance in a server farm with a fixed power budget, we introduce a queueing theoretic model, which also allows us to predict the opti- mal power allocation in a variety of scenarios.
    [Show full text]
  • Benchmarking Models and Tools for Distributed Web-Server Systems
    Benchmarking Models and Tools for Distributed Web-Server Systems Mauro Andreolini1, Valeria Cardellini1, and Michele Colajanni2 1 Dept. of Computer, Systems and Production University of Roma “Tor Vergata” Roma I-00133, Italy andreolini,cardellini @ing.uniroma2.it, { } 2 Dept. of Information Engineering University of Modena Modena I-41100, Italy [email protected] Abstract. This tutorial reviews benchmarking tools and techniques that can be used to evaluate the performance and scalability of highly accessed Web-server systems. The focus is on design and testing of lo- cally and geographically distributed architectures where the performance evaluation is obtained through workload generators and analyzers in a laboratory environment. The tutorial identifies the qualities and issues of existing tools with respect to the main features that characterize a benchmarking tool (workload representation, load generation, data col- lection, output analysis and report) and their applicability to the analysis of distributed Web-server systems. 1 Introduction The explosive growth in size and usage of the Web is causing enormous strain on users, network service, and content providers.Sophisticated software components have been implemented for the provision of critical services through the Web. Consequently, many research efforts have been directed toward improving the performance of Web-based services through caching and replication solutions.A large variety of novel content delivery architectures, such as distributed Web- server systems, cooperative proxy
    [Show full text]
  • NCA’04) 0-7695-2242-4/04 $ 20.00 IEEE Interesting Proposals
    WALTy: A User Behavior Tailored Tool for Evaluating Web Application Performance G. Ruffo, R. Schifanella, and M. Sereno R. Politi Dipartimento di Informatica CSP Sc.a.r.l. c/o Villa Gualino Universita` degli Studi di Torino Viale Settimio Severo, 65 - 10133 Torino (ITALY) Corso Svizzera, 185 - 10149 Torino (ITALY) Abstract ticular, stressing tools make what-if analysis practical in real systems, because workload intensity can be scaled to In this paper we present WALTy (Web Application Load- analyst hypothesis, i.e., a workload emulating the activity based Testing tool), a set of tools that allows the perfor- of N users with pre-defined behaviors can be replicated, mance analysis of web applications by means of a scalable in order to monitor a set of performance parameters dur- what-if analysis on the test bed. The proposed approach is ing test. Such workload is based on sequences of object- based on a workload characterization generated from infor- requests and/or analytical characterization, but sometimes mation extracted from log files. The workload is generated they are poorly scalable by the analyst; in fact, in many by using of Customer Behavior Model Graphs (CBMG), stressing framework, we can just replicate a (set of) man- that are derived by extracting information from the web ap- ually or randomly generated sessions, losing in objectivity plication log files. In this manner the synthetic workload and representativeness. used to evaluate the web application under test is repre- The main scope of this paper is to present a methodol- sentative of the real traffic that the web application has to ogy based on the generation of a representative traffic.
    [Show full text]
  • Virtual Machine Reset Vulnerabilities and Hedging Deployed Cryptography
    When Good Randomness Goes Bad: Virtual Machine Reset Vulnerabilities and Hedging Deployed Cryptography Thomas Ristenpart Scott Yilek Dept. of Computer Science and Engineering University of California, San Diego, USA {tristenp,syilek}@cs.ucsd.edu Abstract their relevance to RNG failures, the vulnerabilities are also interesting because they introduce a potentially-widespread Random number generators (RNGs) are consistently a class of practical problems due to virtualization technolo- weak link in the secure use of cryptography. Routine cryp- gies. Our second contribution is a general and backwards- tographic operations such as encryption and signing can compatible framework to hedge against the threat of ran- fail spectacularly given predictable or repeated random- domness failures in deployed cryptographic algorithms. We ness, even when using good long-lived key material. This discuss each contribution more in turn. has proved problematic in prior settings when RNG imple- VM reset vulnerabilities. Virtualization technologies en- mentation bugs, poor design, or low-entropy sources have able significant flexibility in handling the state of guest sys- resulted in predictable randomness. We investigate a new tems (an operating system, user applications, and data). In way in which RNGs fail due to reuse of virtual machine particular, virtual machine (VM) snapshots, i.e. copies of (VM) snapshots. We exhibit such VM reset vulnerabilities the state of the guest, can be used to replicate, backup, trans- in widely-used TLS clients and servers: the attacker takes fer (to another physical system), or reset (to a prior state) advantage of (or forces) snapshot replay to compromise the guest. Snapshots are one reason virtualization is trans- sessions or even expose a server’s DSA signing key.
    [Show full text]
  • Phelps AJ T 2020.Pdf (1.937Mb)
    ink - An HTTP Benchmarking Tool Andrew J. Phelps Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the degree of Master of Science in Computer Science & Application Godmar V. Back, Chair Ali R. Butt Denis Gracanin May 11, 2020 Blacksburg, Virginia Keywords: Networking, Benchmarking, HTTP, Distributed Systems Copyright 2020, Andrew J. Phelps ink - An HTTP Benchmarking Tool Andrew J. Phelps (ABSTRACT) The Hypertext Transfer Protocol (HTTP) is one the foundations of the modern Internet. Because HTTP servers may be subject to unexpected periods of high load, developers use HTTP benchmarking utilities to simulate the load generated by users. However, many of these tools do not report performance details at a per-client level, which deprives developers of crucial insights into a server’s performance capabilities. In this work, we present ink, an HTTP benchmarking tool that enables developers to better understand server performance. ink provides developers with a way of visualizing the level of service that each individual client receives. It does this by recording a trace of events for each individual simulated client. We also present a GUI that enables users to explore and visualizing the data that is generated by an HTTP benchmark. Lastly, we present a method for running HTTP benchmarks that uses a set of distributed machines to scale up the achievable load on the benchmarked server. We evaluate ink by performing a series of case studies to show that ink is both performant and useful. We validate ink’s load generation abilities within the context of a single machine and when using a set of distributed machines.
    [Show full text]