Adaptive Timeout Discovery Using the Network Weather Service∗

Total Page:16

File Type:pdf, Size:1020Kb

Adaptive Timeout Discovery Using the Network Weather Service∗ Adaptive Timeout Discovery using the Network Weather Service∗ Matthew S. Allen Rich Wolski Computer Science Department Computer Science Department University of California, Santa Barbara University of California, Santa Barbara James S. Plank Computer Science Department University of Tennessee, Knoxville In this paper, we present a novel methodol- uled. That is, they are customized software com- ogy for improving the performance and dependabil- ponents that exploit application-specific characteris- ity of application-level messaging in Grid systems. tics to mitigate the negative performance effects of Based on the Network Weather Service, our system Grid dynamism. Tools such as the AppLeS Param- uses non-parametric statistical forecasts of request- eter Sweep Template [6, 5] generalize this approach response times to automatically determine message to incorporate information that is common to a class timeouts. By choosing a timeout based on predicted of applications. Little work, however, has focused on network performance, the methodology improves ap- the use of dynamic techniques to optimize the per- plication and Grid service performance as extraneous formance of Grid services that application schedulers and overly-long timeouts are avoided. We describe and Grid users require. the technique, the additional execution and program- One such Grid Service that is critical to applica- ming overhead it introduces, and demonstrate the ef- tion performance is communication. Grid applica- fectiveness using a wide-area test application. tions must be designed to run on a wide area network, which poses a difficult performance problem. Com- munication times in such an environment fluctuate 1 Introduction dramatically as routers become overloaded and drop packets, networks partition, and congestion control The challenge for the Computational Grid is to ex- algorithms restrict communication. Many conditions tract reliable, consistent performance from a fluctuat- can cause a transmission in progress to take much ing resource pool, and to deliver that performance to longer than expected. One way to deal with this user applications. Dynamic schedulers [12, 14, 2] have problem is to time out communications that exceed been able to \steer" resource usage automatically so some expected elapsed time. Choosing this value, that the application can take the best advantage of however, is often little more than an educated guess. the performance that is available. To do so, many rely on short-term, statistical forecasts of resource per- When network performance is fluctuating, time- formance generated by the Network Weather Service out determination can dramatically affect both ap- (NWS) [17, 16] { a Grid performance data manage- plication and service performance. Timeouts that ment and forecasting service. are premature cause either needless failures or extra messaging overhead as the message is retried. Overly While application-level schedulers work well in long timeouts result in degraded messaging perfor- Grid contexts, to date, they have had to rely on in- mance as communication is delayed unnecessarily. trinsic \knowledge" of the application being sched- Our methodology attempts to \learn" the best time- ∗This work was supported, in part, NSF grants EIA- out value to use based on previously observed com- 9975015, ACI-9876895, and CAREER award 0093166. munication between host pairs. By combining pre- 1 dicted message send-acknowledge times with NWS The rest of this paper is organized as follows. In the forecasting error measures, our system sets timeouts next section, we will discuss the network conditions adaptively, in response to fluctuating network perfor- we hope to improve on with this technique. In Sec- mance. tion 3 we will describe the adaptive timeout discovery In this paper we present a general technique for im- methodology. Section 4 describes the specific set of proving communication times on the wide-area. Our experiments we conducted. In Section 5 we discuss system \predicts" communication times using non- the results of our investigation, and Section 6 draws parametric statistical forecasting techniques. These our final conclusions from this investigation. predictions are then used to determine an appropriate application level timeout for each send. By canceling and reissuing sends that exceed this time limit, we 2 Network Behavior are able to achieve improvements in communication. This methodology's benefits are realized in a general The motivation for this research is the observation and application independent way. that there is a high degree of variance in communi- We used a sample application to present our re- cation times, especially in wide-area networks. Send sults. For this application, we chose a global data times have been observed to loosely follow either a reduction across multiple hosts in the Grid Appli- heavy-tailed or Poisson distribution [10]. Although cation Software Development (GrADS) [1] testbed. there is some debate over which accurately describes GrADS is a Grid software research project studying actual network traffic, either case indicates that there the problem of application development and deploy- can be extremely long send times. Furthermore, the ment. From some of its early results [12], it is clear tail{end sends are not necessarily clustered together, that the performance of the Grid services themselves but often occur among more reasonably behaved com- is a critical component of Grid application perfor- munications [13]. Thus, while the send time across a mance. If the services respond slowly, the applica- link will usually take only a couple of seconds, it may tion using them will also suffer a performance degra- occasionally take significantly longer. dation. Our choice of test application in this work is Most of these delays stem from the behavior of motivated by two observations. First, collective oper- routers at one of the hops in the transmission. For ations [7, 9, 8] are a performance critical component example, a temporarily overloaded router may drop of many distributed scientific and engineering appli- packets or accidentally direct them to the wrong des- cations. Secondly, we observe that frequently, Grid tination. This is particularly a problem when a TCP users and schedulers must collect up-to-date informa- transmission collides with a poorly behaved UDP tion from widely distributed sites, often to pick out stream that fills up a routing queue [3]. Although the \best" choice from a number of possibilities. For overloaded routers are likely to provide consistently example, application schedulers often wish to deter- poor performance, other routers before them in the mine the most lightly loaded resource(s) from a spec- communication path may detect this congestion and ified resource pool when making scheduling decisions. redirect messages to another path. While it may be possible to avoid scheduling collec- Another significant problem is that routers occa- tive operations in a scientific application so that they sionally change their routing tables because of poli- span wide-area network connections, it is not gener- cies or configurations and not detection of a congested ally possible (or desirable) to centralize Grid services network. Sometimes this happens mid{transmission, in the same way. causing packets to arrive out of order and causing As such, this paper will make two significant con- other delays in the communication [11]. This is par- tributions. It will: ticularly evident in a phenomenon known as “flutter- describe an application-independent, high- ing" where a router rapidly switches between differ- • performance, and simple methodology for ent paths to direct packets. Where a few packets may adaptive timeout discovery using NWS forecast- follow a 12 hop route, the next few may follow one ing tools with 25 before the router returns to its original route. This change can occur with every other packet in the demonstrate the effectiveness of this approach in worst cases [11]. • a \live" Grid setting. It is also possible that the TCP/IP congestion con- 2 trol algorithms can delay a single transmission that cs.ucsb.edu arrives at an inappropriate time. In the face of fluc- tuating network traffic, these algorithms can behave cs.utk.edu cs.utk.edu chaotically and unnecessarily throttle single trans- missions [15]. This happens not as a result of an cs.utk.edu cs.uiuc.edu cs.indiana.edu wellesley.edu overall increase in traffic through the network, but in- stead as an anomaly in the algorithm. These anoma- lies quickly are resolved, and subsequent sends will ncni.net cs.uiuc.edu cs.ucsb.edu cs.vu.nl ucsd.edu cs.utk.edu eecs.harvard.educs.utk.edu be unaffected until the algorithm becomes unstable again. cs.uiuc.eduucsd.educs.utk.eduecs.csun.educs.utk.eduucsd.educs.uiuc.educs.uiuc.educs.utk.eduucsd.educs.ucsb.educs.ucsb.educs.uiuc.educs.utk.educs.uiuc.eduucsd.edu 3 Adaptive Timeout Discovery Figure 1: Topology of the reduction Our strategy for computing dynamic timeouts uses the non-parametric techniques of the NWS forecast- method is minimal. ers. For each application-level link the forecaster maintains a unique timeout prediction determined using measurements gathered throughout the exper- 4 Experiment iment's execution. Because the forecaster is con- stantly updated with measurements of the link's per- We chose to implement a global reduction to investi- formance, it is able to produce accurate and current gate the potential impact of various timeout strate- predictions of this link's expected speed. gies on distributed applications. Our reduction is The performance of each connection is monitored structured as a binary tree consisting of 31 nodes as constantly, and the results are used to generate time- depicted in figure 1. Each node in the tree generates out predictions for that link. When a connection is a set of n random values at the beginning of the re- established, a NWS forecaster is created to monitor duction. At the end of the reduction, the hosts have it. It is initialized with a generous prediction of the computed the n maximum values of those generated send time; in our case, 45 seconds.
Recommended publications
  • Commands to Control Device
    Commands to Control Device New OTA firmware upgrade command. To try it via over-the-air update, use these commands: > fwe > fwd B6FMA186 The board will reboot after the first command, this is normal. After the second command wait 1-3 minutes (seems like a long time). After it resets again, go back to BlueFruit and press "info" the software should be 3.0.186 The default wakeup interval is still 4 hours (14400 seconds). To configure a unit for a different wakeup interval: >wakt 600 (this would be 60*10 = 10 minutes) No beeps (Command to turn off the beeps) nvap 6 120000 14400000 15000 5000 0 223 1 3 Command to Restore Beeps nvap 6 120000 14400000 15000 5000 0 223 1 65535 Main Menu commands ---- SubMenus -------------------------- >i - I2C menu >l - Log menu >M - Modem menu ---- Device Control Cmds --------------- >D [n] - set debug level to [n] >info - Show device info >cota - CDMA OTA reprovision >svrm [name] - Set server main >nvmr - Non-volatile memory revert >t - show system uptime >rst - Hard reset ---- General Cmds ---------------------- >c [str] - Execute remote command manually >batq - Query battery state >slp - Go to sleep immediately >slpt [n] - Set sleep timeout to [n] seconds >wakt [n] - Set wake timeout to [n] hours >sus - Suspend immediately >sts - Show statistics and send to server >bt [n] - Bluetooth disable/enable: n=0/1 >dsms [n] - DebugSMS configure 0/1=disabled/enabled >wss [p] [s] - WiMM Server Send [p]ort, [s]tring >wsr - WiMM Server Recv ---- Pos/Log Cmds ---------------------- >pn - Position now >plc {i} {t} - Position log ctrl, [i]nterval, [t]ripId Stop Logging >pla {i} {t} - Position log auto, [i]nterval, [t]ripId >plb {n} - Position log batch [n] records >ple - Position log erase --- Alarm Cmds ------------------------ >alm [m] - Alarm mode: m=0/1/2: Off/On/FireNow ---- Firmware Cmds --------------------- >fwd [v] - Firmware Download (WIMM{v}.hex) >fwl {i} - Firmware Launch (i=0/1: ImgA/B) >fwe - Firmware Erase (ImgB) To change the alarm delay You must be running firmware 145 or later.
    [Show full text]
  • Administering Unidata on UNIX Platforms
    C:\Program Files\Adobe\FrameMaker8\UniData 7.2\7.2rebranded\ADMINUNIX\ADMINUNIXTITLE.fm March 5, 2010 1:34 pm Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta UniData Administering UniData on UNIX Platforms UDT-720-ADMU-1 C:\Program Files\Adobe\FrameMaker8\UniData 7.2\7.2rebranded\ADMINUNIX\ADMINUNIXTITLE.fm March 5, 2010 1:34 pm Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Beta Notices Edition Publication date: July, 2008 Book number: UDT-720-ADMU-1 Product version: UniData 7.2 Copyright © Rocket Software, Inc. 1988-2010. All Rights Reserved. Trademarks The following trademarks appear in this publication: Trademark Trademark Owner Rocket Software™ Rocket Software, Inc. Dynamic Connect® Rocket Software, Inc. RedBack® Rocket Software, Inc. SystemBuilder™ Rocket Software, Inc. UniData® Rocket Software, Inc. UniVerse™ Rocket Software, Inc. U2™ Rocket Software, Inc. U2.NET™ Rocket Software, Inc. U2 Web Development Environment™ Rocket Software, Inc. wIntegrate® Rocket Software, Inc. Microsoft® .NET Microsoft Corporation Microsoft® Office Excel®, Outlook®, Word Microsoft Corporation Windows® Microsoft Corporation Windows® 7 Microsoft Corporation Windows Vista® Microsoft Corporation Java™ and all Java-based trademarks and logos Sun Microsystems, Inc. UNIX® X/Open Company Limited ii SB/XA Getting Started The above trademarks are property of the specified companies in the United States, other countries, or both. All other products or services mentioned in this document may be covered by the trademarks, service marks, or product names as designated by the companies who own or market them. License agreement This software and the associated documentation are proprietary and confidential to Rocket Software, Inc., are furnished under license, and may be used and copied only in accordance with the terms of such license and with the inclusion of the copyright notice.
    [Show full text]
  • (2) G: 9 Timeout Error Layer —T
    US007831732B1 (12) United States Patent (10) Patent No.: US 7,831,732 B1 Zilist et al. (45) Date of Patent: Nov. 9, 2010 (54) NETWORK CONNECTION UTILITY 6,018,724 A 1/2000 Arent 6,182,139 B1 1/2001 Brendel ...................... TO9,226 (75) Inventors: Ira Zilist, Chicago, IL (US); Daniel 6,338,094 B1* 1/2002 Scott et al. .................. 709/245 Reimann, Mt. Prospect, IL (US); Devin 6,393,581 B1 5, 2002 Friedman et al. 6,731,625 B1 5/2004 Eastep et al. Henkel, Chicago, IL (US); Gurpreet 6,742,015 B1 5/2004 Bowman-Amuah Singh, Clarendon Hills, IL (US) 7,082,454 B1* 7/2006 Gheith ....................... TO9,203 2002/0120800 A1* 8/2002 Sugahara et al. .. ... 710,260 (73) Assignee: Federal Reserve Bank of Chicago, 2004/0010546 A1* 1/2004 Kluget al. ............ ... 709,203 Chicago, IL (US) 2004/0192383 A1* 9, 2004 Zacks et al. ................. 455/557 (*) Notice: Subject to any disclaimer, the term of this patent is extended or adjusted under 35 * cited by examiner U.S.C. 154(b) by 1196 days. Primary Examiner Ario Etienne Assistant Examiner Avi Gold (21) Appl. No.: 11/192,991 (74) Attorney, Agent, or Firm—Brinks Hofer Gilson & Lione (22) Filed: Jul. 29, 2005 (57) ABSTRACT Int. C. (51) A system is disclosed for masking errors that may occur G06F 15/16 (2006.01) during a delay of a client connecting with a server on a (52) U.S. Cl. ........................ 709/237; 709/217; 709/219 network. A connection utility requests a connection with the (58) Field of Classification Search ................
    [Show full text]
  • Webserver Cannot Be Started / No Web Access EAGLE20 / Eagleone
    Knowledgebase > Products > Classic Firewalls > Webserver cannot be started / No web access EAGLE20 / EAGLEOne Webserver cannot be started / No web access EAGLE20 / EAGLEOne - 2018-02-09 - Classic Firewalls At the first startup of a brand new EAGLE or at the first startup after “clear certificates” the web certificates are generated. Affected products are EAGLE20 and EAGLEOne in rel. 05.3.00. If the power is removed during the certificate generation is in progress the webserver cannot be started and therefore no web access is possible. The cli command 'show login' displays the following output: !(Hirschmann EAGLE One) #show login Login parameters ---------------- Access per SSH..........................enabled SSH Access port number..................22 DSA Fingerprint for SSH.................""xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:ab:ab:ab:ab:0f "" RSA Fingerprint for SSH.................""xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:xx:5e "" Access per Web (HTTPS)..................disabled Web Access port number (HTTPS)..........443 SNMP version 1..........................disabled SNMP version 2..........................disabled SNMP port number........................161 SNMP over HTTPS tunneling...............disabled RADIUS auth. of SNMP v3 local users.....disabled Inactivity timeout Web (minutes)........5 Inactivity timeout serial (minutes).....5 Inactivity timeout SSH (minutes)........5 Login prompt............................""Hirschmann EAGLE One"" Login banner............................"""" 23: 2013-01-01 01:00:01 [tCfgMgrTask, CRITICAL, WEB-S, 0x02080014] Web Server - start of web server failed 24: 2013-01-01 01:00:01 [tCfgMgrTask, ERROR, WEB-S, 0x02080028] Web Server - directory for https server certificate could not be created Possible solutions: 1. Reformat the flash file system in sysMon1 and to put the operating firmware on the device again.
    [Show full text]
  • SLC Console Manager User Guide Available At
    SLC™ Console Manager User Guide SLC8 SLC16 SLC32 SLC48 Part Number 900-449 Revision J July 2014 Copyright and Trademark © 2014 Lantronix, Inc. All rights reserved. No part of the contents of this book may be transmitted or reproduced in any form or by any means without the written permission of Lantronix. Lantronix is a registered trademark of Lantronix, Inc. in the United States and other countries. SLC, SLB, SLP, SLM, Detector and Spider are trademarks of Lantronix, Inc. Windows and Internet Explorer are registered trademarks of Microsoft Corporation. Firefox is a registered trademark of the Mozilla Foundation. Chrome is a trademark of Google, Inc. All other trademarks and trade names are the property of their respective holders. Warranty For details on the Lantronix warranty replacement policy, please go to our web site at http://www.lantronix.com/support/warranty. Open Source Software Some applications are Open Source software licensed under the Berkeley Software Distribution (BSD) license or the GNU General Public License (GPL) as published by the Free Software Foundation (FSF). Redistribution or incorporation of BSD or GPL licensed software into hosts other than this product must be done under their terms. A machine readable copy of the corresponding portions of GPL licensed source code may be available at the cost of distribution. Such Open Source Software is distributed WITHOUT ANY WARRANTY, INCLUDING ANY IMPLIED WARRANTY OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. See the GPL and BSD for details. A copy of the licenses is available from Lantronix. The GNU General Public License is available at http://www.gnu.org/licenses/.
    [Show full text]
  • TEE Internal Core API Specification V1.1.2.50
    GlobalPlatform Technology TEE Internal Core API Specification Version 1.1.2.50 (Target v1.2) Public Review June 2018 Document Reference: GPD_SPE_010 Copyright 2011-2018 GlobalPlatform, Inc. All Rights Reserved. Recipients of this document are invited to submit, with their comments, notification of any relevant patents or other intellectual property rights (collectively, “IPR”) of which they may be aware which might be necessarily infringed by the implementation of the specification or other work product set forth in this document, and to provide supporting documentation. The technology provided or described herein is subject to updates, revisions, and extensions by GlobalPlatform. This documentation is currently in draft form and is being reviewed and enhanced by the Committees and Working Groups of GlobalPlatform. Use of this information is governed by the GlobalPlatform license agreement and any use inconsistent with that agreement is strictly prohibited. TEE Internal Core API Specification – Public Review v1.1.2.50 (Target v1.2) THIS SPECIFICATION OR OTHER WORK PRODUCT IS BEING OFFERED WITHOUT ANY WARRANTY WHATSOEVER, AND IN PARTICULAR, ANY WARRANTY OF NON-INFRINGEMENT IS EXPRESSLY DISCLAIMED. ANY IMPLEMENTATION OF THIS SPECIFICATION OR OTHER WORK PRODUCT SHALL BE MADE ENTIRELY AT THE IMPLEMENTER’S OWN RISK, AND NEITHER THE COMPANY, NOR ANY OF ITS MEMBERS OR SUBMITTERS, SHALL HAVE ANY LIABILITY WHATSOEVER TO ANY IMPLEMENTER OR THIRD PARTY FOR ANY DAMAGES OF ANY NATURE WHATSOEVER DIRECTLY OR INDIRECTLY ARISING FROM THE IMPLEMENTATION OF THIS SPECIFICATION OR OTHER WORK PRODUCT. Copyright 2011-2018 GlobalPlatform, Inc. All Rights Reserved. The technology provided or described herein is subject to updates, revisions, and extensions by GlobalPlatform.
    [Show full text]
  • Powerview Command Reference
    PowerView Command Reference TRACE32 Online Help TRACE32 Directory TRACE32 Index TRACE32 Documents ...................................................................................................................... PowerView User Interface ............................................................................................................ PowerView Command Reference .............................................................................................1 History ...................................................................................................................................... 12 ABORT ...................................................................................................................................... 13 ABORT Abort driver program 13 AREA ........................................................................................................................................ 14 AREA Message windows 14 AREA.CLEAR Clear area 15 AREA.CLOSE Close output file 15 AREA.Create Create or modify message area 16 AREA.Delete Delete message area 17 AREA.List Display a detailed list off all message areas 18 AREA.OPEN Open output file 20 AREA.PIPE Redirect area to stdout 21 AREA.RESet Reset areas 21 AREA.SAVE Save AREA window contents to file 21 AREA.Select Select area 22 AREA.STDERR Redirect area to stderr 23 AREA.STDOUT Redirect area to stdout 23 AREA.view Display message area in AREA window 24 AutoSTOre ..............................................................................................................................
    [Show full text]
  • U-Connectxpress at Commands Manual
    u-connectXpress AT commands manual Abstract u-blox AT commands reference manual for the short range stand-alone modules. This document lists both the standard and proprietary AT- commands for u-connectXpress based modules with Bluetooth low energy, Bluetooth BR/EDR and Wi-Fi. www.u-blox.com UBX-14044127 - R48 C1-Public u-connectXpress - AT commands manual Document information Title u-connectXpress Subtitle Document type AT commands manual Document number UBX-14044127 Revision and date R48 06-Sep-2021 Disclosure restriction C1-Public u-blox reserves all rights to this document and the information contained herein. Products, names, logos and designs described herein may in whole or in part be subject to intellectual property rights. Reproduction, use, modification or disclosure to third parties of this document or any part thereof without the express permission of u-blox is strictly prohibited. The information contained herein is provided “as is” and u-blox assumes no liability for the use of the information. No warranty, either express or implied, is given, including but not limited, with respect to the accuracy, correctness, reliability and fitness for a particular purpose of the information. This document may be revised by u-blox at any time. For most recent documents, visit www.u-blox.com. Copyright © u-blox AG u-blox is a registered trademark of u-blox Holding AG in the EU and other countries. UBX-14044127 - R48 Page 2 of 176 u-connectXpress - AT commands manual Preface Applicable products This document applies to the following products:
    [Show full text]
  • Command-Line IP Utilities This Document Lists Windows Command-Line Utilities That You Can Use to Obtain TCP/IP Configuration Information and Test IP Connectivity
    Guide to TCP/IP: IPv6 and IPv4, 5th Edition, ISBN 978-13059-4695-8 Command-Line IP Utilities This document lists Windows command-line utilities that you can use to obtain TCP/IP configuration information and test IP connectivity. Command parameters and uses are listed for the following utilities in Tables 1 through 9: ■ Arp ■ Ipconfig ■ Netsh ■ Netstat ■ Pathping ■ Ping ■ Route ■ Tracert ARP The Arp utility reads and manipulates local ARP tables (data link address-to-IP address tables). Syntax arp -s inet_addr eth_addr [if_addr] arp -d inet_addr [if_addr] arp -a [inet_address] [-N if_addr] [-v] Table 1 ARP command parameters and uses Parameter Description -a or -g Displays current entries in the ARP cache. If inet_addr is specified, the IP and data link address of the specified computer appear. If more than one network interface uses ARP, entries for each ARP table appear. inet_addr Specifies an Internet address. -N if_addr Displays the ARP entries for the network interface specified by if_addr. -v Displays the ARP entries in verbose mode. -d Deletes the host specified by inet_addr. -s Adds the host and associates the Internet address inet_addr with the data link address eth_addr. The physical address is given as six hexadecimal bytes separated by hyphens. The entry is permanent. eth_addr Specifies physical address. if_addr If present, this specifies the Internet address of the interface whose address translation table should be modified. If not present, the first applicable interface will be used. Pyles, Carrell, and Tittel 1 Guide to TCP/IP: IPv6 and IPv4, 5th Edition, ISBN 978-13059-4695-8 IPCONFIG The Ipconfig utility displays and modifies IP address configuration information.
    [Show full text]
  • Timeout Control in Distributed Systems Using Perturbation Analysis
    2011 50th IEEE Conference on Decision and Control and European Control Conference (CDC-ECC) Orlando, FL, USA, December 12-15, 2011 Timeout Control in Distributed Systems Using Perturbation Analysis Ali Kebarighotbi and Christos G. Cassandras Division of Systems Engineering and Center for Information and Systems Engineering Boston University Brookline, MA 02446 [email protected],[email protected] Abstract— Timeout control is a simple mechanism used when direct feedback is either impossible, unreliable, or too costly, as is often the case in distributed systems. Its effectiveness is determined by a timeout threshold parameter and our goal is to quantify the effect of this parameter on the behavior of such systems. In this paper, we consider a basic communi- cation system with timeout control, model it as a stochastic flow system, and use Infinitesimal Perturbation Analysis to Fig. 1. Timeout controlled distributed system determine the sensitivity of a “goodput” performance metric with respect to the timeout threshold parameter. In conjunction with a gradient-based scheme, we show that we can determine an optimal value of this parameter. Some numerical examples with an action Ai, a proper response to a timeout event is are included. normally related to the state and action at time t − θi.A simple example is repeating Ai at t because no desirable NTRODUCTION I. I response has been observed in [t − θi; t). Thus, a controller Timeout control is a simple mechanism used in many must have some information about both current and past systems where direct feedback is either impossible, unreli- states and actions of the system.
    [Show full text]
  • System Analysis and Tuning Guide System Analysis and Tuning Guide SUSE Linux Enterprise Server 15 SP1
    SUSE Linux Enterprise Server 15 SP1 System Analysis and Tuning Guide System Analysis and Tuning Guide SUSE Linux Enterprise Server 15 SP1 An administrator's guide for problem detection, resolution and optimization. Find how to inspect and optimize your system by means of monitoring tools and how to eciently manage resources. Also contains an overview of common problems and solutions and of additional help and documentation resources. Publication Date: September 24, 2021 SUSE LLC 1800 South Novell Place Provo, UT 84606 USA https://documentation.suse.com Copyright © 2006– 2021 SUSE LLC and contributors. All rights reserved. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or (at your option) version 1.3; with the Invariant Section being this copyright notice and license. A copy of the license version 1.2 is included in the section entitled “GNU Free Documentation License”. For SUSE trademarks, see https://www.suse.com/company/legal/ . All other third-party trademarks are the property of their respective owners. Trademark symbols (®, ™ etc.) denote trademarks of SUSE and its aliates. Asterisks (*) denote third-party trademarks. All information found in this book has been compiled with utmost attention to detail. However, this does not guarantee complete accuracy. Neither SUSE LLC, its aliates, the authors nor the translators shall be held liable for possible errors or the consequences thereof. Contents About This Guide xii 1 Available Documentation xiii
    [Show full text]
  • June 2021 Uptime Policy
    June 2021 Uptime Policy The xx network is targeting 80% Uptime, 94% Round Success Rate and .5% Realtime Timeout requirements for June 2021. This means that generally, a node needs to be online for at least 80% of the month of May 2021, successfully complete at least 94% of the rounds the node participates in and not fail more than .5% of the Realtime phase of a round to receive compensation for the month. A node will be considered online as long as it is pinging the permissioning server regularly. If at the dashboard, your node is listed as either “Online” or “Error”, it is still online for the purpose of compensation for the month of June 2021. On the individual node pages on the dashboard there are monthly and weekly uptime metrics to inform you of your progress. Node uptime is not tracked while xx network’s servers are down and such periods will not be counted against the minimum. In the event the network is down due to a software update or a bug caused by the xx network team, those periods will also not count towards the 80% uptime minimum. Grace Period Nodes who are joining the network for the first time this month (Orange Team X) will have the entire month of June 2021 as a grace period. This only includes nodes who registered initially to start in June 2021. If this month is a grace period for your node, you will receive an email notifying you. The grace period in June 2021 is designed to allow new nodes adequate time to onboard.
    [Show full text]