The Science DMZ: a Network Design Pattern for Data-Intensive Science

Total Page:16

File Type:pdf, Size:1020Kb

The Science DMZ: a Network Design Pattern for Data-Intensive Science The Science DMZ: A Network Design Pattern for Data-Intensive Science Eli Dart Lauren Rotman Brian Tierney Energy Sciences Network Energy Sciences Network Energy Sciences Network Lawrence Berkeley National Lawrence Berkeley National Lawrence Berkeley National Laboratory Laboratory Laboratory Berkeley, CA 94720 Berkeley, CA 94720 Berkeley, CA 94720 [email protected] [email protected] [email protected] Mary Hester Jason Zurawski Energy Sciences Network Internet2 Lawrence Berkeley National Office of the CTO Laboratory Washington DC, 20036 Berkeley, CA 94720 [email protected] [email protected] Abstract Categories and Subject Descriptors The ever-increasing scale of scientific data has become a sig- C.2.1 [Computer{Communication Networks]: Network nificant challenge for researchers that rely on networks to Architecture and Design; C.2.3 [Computer{Communication interact with remote computing systems and transfer re- Networks]: Network Operations|network management, net- sults to collaborators worldwide. Despite the availability work monitoring; C.2.5 [Computer{Communication Net- of high-capacity connections, scientists struggle with inade- works]: Local and Wide-Area Networks|internet quate cyberinfrastructure that cripples data transfer perfor- mance, and impedes scientific progress. The Science DMZ paradigm comprises a proven set of network design patterns General Terms that collectively address these problems for scientists. We Performance, Reliability, Design, Measurement explain the Science DMZ model, including network architec- ture, system configuration, cybersecurity, and performance 1. INTRODUCTION tools, that creates an optimized network environment for science. We describe use cases from universities, supercom- A design pattern is a solution that can be applied to a puting centers and research laboratories, highlighting the general class of problems. This definition, originating in the effectiveness of the Science DMZ model in diverse opera- field of architecture [1,2], has been adopted in computer sci- tional settings. In all, the Science DMZ model is a solid ence, where the idea has been used in software designs [6] platform that supports any science workflow, and flexibly and in our case network designs. The network design pat- accommodates emerging network technologies. As a result, terns we discuss are focused on high end-to-end network the Science DMZ vastly improves collaboration, accelerating performance for data-intensive science applications. These scientific discovery. patterns focus on optimizing the network interactions be- tween wide area networks, campus networks, and computing systems. The Science DMZ model, as a design pattern, can be adapted to solve performance problems on any existing network. Of these performance problems, packet loss has proven to be the most detrimental as it causes an observable This manuscript has been authored by an author at Lawrence Berkeley Na- and dramatic decrease in data throughput for most applica- tional Laboratory under Contract No. DE-AC02-05CH11231 with the U.S. tions. Packet loss can be caused by many factors including: Department of Energy. The U.S. Government retains, and the publisher, by firewalls that cannot effectively process science traffic flows; accepting the article for publication, acknowledges, that the U.S. Govern- routers and switches with inadequate burst capacity; dirty ment retains a non-exclusive, paid-up, irrevocable, world-wide license to optics; and failing network and system components. In ad- publish or reproduce the published form of this manuscript, or allow others to do so, for U.S. Government purposes. dition, another performance problem can be the misconfig- uration of data transfer hosts, which is often a contributing ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As factor in poor network performance. such, the Government retains a nonexclusive, royalty-free right to publish Many of these problems are found on the local area net- or reproduce this article, or to allow others to do so, for Government pur- works, often categorized as \general-purpose" networks, that poses only. Copyright is held by the owner/author(s). Publication rights are not designed to support large science data flows. Today licensed to ACM. many scientists are relying on these network infrastructures SC13 November 17-21, 2013, Denver, CO, USA to share, store, and analyze their data which is often geo- Copyright 2013 ACM 978-1-4503-2378-9/13/11 ...$15.00. http://dx.doi.org/10.1145/2503210.2503245 graphically dispersed. The Science DMZ provides a design pattern developed to procurement systems, web browsing, and so forth. Second, specifically address these local area network issues and offers these general networks must also be built with security that research institutions a framework to support data-intensive protects financial and personnel data. Meanwhile, these net- science. The Science DMZ model has been broadly deployed works are also used for research as scientists depend on this and has already become indispensable to the present and infrastructure to share, store, and analyze data from many future of science workflows. different sources. As scientists attempt to run their appli- The Science DMZ provides: cations over these general-purpose networks, the result is often poor performance, and with the increase of data set • A scalable, extensible network infrastructure free from complexity and size, scientists often wait hours, days, or packet loss that causes poor TCP performance; weeks for their data to arrive. • Appropriate usage policies so that high-performance Since many aspects of general-purpose networks are dif- applications are not hampered by unnecessary con- ficult or impossible to change in the ways necessary to im- straints; prove their performance, the network architecture must be adapted to accommodate the needs of science applications • An effective \on-ramp" for local resources to access without affecting mission critical business and security op- wide area network services; and erations. Some of these aspects that are difficult to change might include the size of the memory buffers for individual • Mechanisms for testing and measuring, thereby ensur- interfaces; mixed traffic patterns between mail and web traf- ing consistent performance. fic that would include science data; and emphasis on avail- This paper will discuss the Science DMZ from its devel- ability vs. performance and what can be counted on over opment to its role in future technologies. First, Section 2 time for network availability. will discuss the Science DMZ's original development in ad- The Science DMZ model has already been implemented at dressing the performance of TCP-based applications. Sec- various institutions to upgrade these general-purpose, insti- ond, Section 3 enumerates the components of the Science tutional networks. The National Science Foundation (NSF) DMZ model and how each component adds to the overall recognized the Science DMZ as a proven operational best paradigm. Next, Sections 4 and 5 offer some sample illustra- practice for university campuses supporting data-intensive tions of networks that vary in size and purpose. Following, science and specifically identified this model as eligible for Section 6 will discuss some examples of Science DMZ imple- funding through the Campus Cyberinfrastructure{Network mentations from the R&E community. And lastly, Section 7 Infrastructure and Engineering Program (CC-NIE).1 This highlights some future technological advancements that will program was created in 2012 and has since been respon- enhance the applicability of the Science DMZ design. sible for implementing approximately 20 Science DMZs at different locations|thereby serving the needs of the science 2. MOTIVATION community. Another NSF solicitation was released in 2013 and awards to fund a similar number of new Science DMZ's When developing the Science DMZ, several key principles are expected. provided the foundation to its design. First, these design patterns are optimized for science. This means the compo- 2.1 TCP Performance nents of the system|including all the equipment, software The Transmission Control Protocol (TCP) [15] of the and associated services|are configured specifically to sup- TCP/IP protocol suite is the primary transport protocol port data-intensive science. Second, the model is designed used for the reliable transfer of data between applications. to be scalable in its ability to serve institutions ranging TCP is used for email, web browsing, and similar applica- from large experimental facilities to supercomputing sites tions. Most science applications are also built on TCP, so it to multi-disciplinary research universities to individual re- is important that the networks are able to work with these search groups or scientists. The model also scales to serve a applications (and TCP) to optimize the network for science. growing number of users at those facilities with an increas- TCP is robust in many respects|in particular it has so- ing and varying amount of data over time. Lastly, the Sci- phisticated capabilities for providing reliable data delivery ence DMZ model was created with future innovation in mind in the face of packet loss, network outages, and network by providing the flexibility to incorporate emerging network congestion. However, the very mechanisms that make TCP services. For instance, advances in virtual circuit services, so reliable also make it perform poorly
Recommended publications
  • Ipv6 DMZ Web Service Technology Design Guide
    IPv6 DMZ Web Service Technology Design Guide August 2014 Series Table of Contents Preface ........................................................................................................................................1 CVD Navigator .............................................................................................................................2 Use Cases .................................................................................................................................. 2 Scope ......................................................................................................................................... 2 Proficiency .................................................................................................................................. 2 Introduction .................................................................................................................................3 Technology Use Cases ............................................................................................................... 3 Use Case: Enable Native IPv6 Access for Network Traffic Between the Internet and a Web Server DMZ Network.............................................................................................................. 3 Use Case: Enable IPv6 Access for Network Traffic Between the Internet and an IPv4-only Web Server DMZ Network ..................................................................................................... 3 Design Overview ........................................................................................................................
    [Show full text]
  • Iptables with Shorewall!
    Iptables with shorewall! Table of Contents 1. Install swarmlab-sec (Home PC) . 1 2. shorewall . 1 2.1. Installation . 2 3. Basic Two-Interface Firewall. 2 4. Shorewall Concepts . 3 4.1. zones — Shorewall zone declaration file . 3 4.2. interfaces — Shorewall interfaces file. 4 4.3. policy — Shorewall policy file . 4 4.4. rules — Shorewall rules file . 4 4.5. Compile then Execute . 4 5. Three-Interface Firewall. 5 5.1. zones . 6 5.2. interfaces . 6 5.3. policy . 7 5.4. rules . 7 5.5. masq - Shorewall Masquerade/SNAT definition file . 7 5.6. snat — Shorewall SNAT/Masquerade definition file . 8 5.7. Compile and Execute . 8 1. Install swarmlab-sec (Home PC) HowTo: See http://docs.swarmlab.io/lab/sec/sec.adoc.html NOTE Assuming you’re already logged in 2. shorewall Shorewall is an open source firewall tool for Linux that builds upon the Netfilter (iptables/ipchains) system built into the Linux kernel, making it easier to manage more complex configuration schemes by providing a higher level of abstraction for describing rules using text files. More: wikipedia 1 NOTE Our docker instances have only one nic to add more nic’s: create netowrk frist docker network create --driver=bridge --subnet=192.168.0.0/16 net1 docker network create --driver=bridge --subnet=192.168.0.0/16 net2 docker network create --driver=bridge --subnet=192.168.0.0/16 net3 then connect network to container connect network created to container docker network connect net1 master docker network connect net1 worker1 docker network connect net2 master docker network connect net2 worker2 now let’s look at the following image 2.1.
    [Show full text]
  • How to Configure Some Basic Firewall and VPN Scenarios
    AlliedWareTM OS How To | Configure Some Basic Firewall and VPN Scenarios Introduction This document provides examples that illustrate common configurations for security routers. You may want to make changes or enhancements to these configurations to customize them to your particular requirements. However, with the configurations provided here, you can be quickly operational with a reliable and secure Internet connection. What information will you find in this document? The first section provides the basic configuration for two likely methods that will be used for an Internet connection from the security router: z "Script A: basic Ethernet connection" on page 3 z "Script B: basic PPPoE configuration" on page 7 The second section provides three extra configurations to enable the router to support three popular forms of Virtual Private Network (VPN) connection, followed by a configuration for a Mail server on a DMZ. One or more of these additional scripts can be added to either of the basic configuration scripts: z "Script C: internal L2TP Network Server (LNS)" on page 11 z "Script D: IPsec tunnel" on page 13 z "Script E: PPTP server on LAN behind router" on page 16 Then the second section ends with an example in which private IP addresses are used on the DMZ LAN: z "Script F: DMZ using private addresses" on page 17 C613-16069-00 REV B www.alliedtelesis.com Introduction > Related How To Notes These six configuration examples are as general as possible, and no actual IP addresses have been specified. IP addresses are represented by placeholder names in angled brackets, for example, <dmz-ip-address>.
    [Show full text]
  • Guidelines on Firewalls and Firewall Policy
    Special Publication 800-41 Revision 1 Guidelines on Firewalls and Firewall Policy Recommendations of the National Institute of Standards and Technology Karen Scarfone Paul Hoffman NIST Special Publication 800-41 Guidelines on Firewalls and Firewall Revision 1 Policy Recommendations of the National Institute of Standards and Technology Karen Scarfone Paul Hoffman C O M P U T E R S E C U R I T Y Computer Security Division Information Technology Laboratory National Institute of Standards and Technology Gaithersburg, MD 20899-8930 September 2009 U.S. Department of Commerce Gary Locke, Secretary National Institute of Standards and Technology Patrick D. Gallagher, Deputy Director GUIDELINES ON FIREWALLS AND FIREWALL POLICY Reports on Computer Systems Technology The Information Technology Laboratory (ITL) at the National Institute of Standards and Technology (NIST) promotes the U.S. economy and public welfare by providing technical leadership for the nation’s measurement and standards infrastructure. ITL develops tests, test methods, reference data, proof of concept implementations, and technical analysis to advance the development and productive use of information technology. ITL’s responsibilities include the development of technical, physical, administrative, and management standards and guidelines for the cost-effective security and privacy of sensitive unclassified information in Federal computer systems. This Special Publication 800-series reports on ITL’s research, guidance, and outreach efforts in computer security and its collaborative activities with industry, government, and academic organizations. National Institute of Standards and Technology Special Publication 800-41 Revision 1 Natl. Inst. Stand. Technol. Spec. Publ. 800-41 rev1, 48 pages (Sep. 2009) Certain commercial entities, equipment, or materials may be identified in this document in order to describe an experimental procedure or concept adequately.
    [Show full text]
  • Internet DMZ Equipment Policy
    Internet DMZ Equipment Policy CS Department Internet DMZ Equipment Policy 1.0 Purpose The purpose of this policy is to define standards to be met by all equipment owned and/or operated by SUNY Stony Brook CS department located outside SUNY Stony Brook CS department's Internet firewalls. These standards are designed to minimize the potential exposure to SUNY Stony Brook CS department from the loss of sensitive or department confidential data, intellectual property, damage to public image etc., which may follow from unauthorized use of SUNY Stony Brook CS department resources. Devices that are Internet facing and outside the SUNY Stony Brook CS department firewall are considered part of the "de-militarized zone" (DMZ) and are subject to this policy. These devices (network and host) are particularly vulnerable to attack from the Internet since they reside outside the department firewalls. The policy defines the following standards: Ownership responsibility Secure configuration requirements Operational requirements Change control requirement 2.0 Scope All equipment or devices deployed in a DMZ owned and/or operated by SUNY Stony Brook CS department including hosts, routers, switches, etc.) and/or registered in any Domain Name System (DNS) domain owned by SUNY Stony Brook CS department, must follow this policy. This policy also covers any host device outsourced or hosted at external/third-party service providers, if that equipment resides in the "cs.sunysb.edu" domain or appears to be owned by SUNY Stony Brook CS department. All new equipment which falls under the scope of this policy must be configured according to the referenced configuration documents, unless a waiver is obtained from the Director of Labs.
    [Show full text]
  • DMZ Implementations
    DMZ Implementations Table of Contents DMZ – De-Militarized Zone ............................................................................................................. 2 DMZ Reiteration -1.......................................................................................................................... 3 DMZ Reiteration -2.......................................................................................................................... 4 DMZ Implementations .................................................................................................................... 6 Bare DMZ ........................................................................................................................................ 7 Web Servers .................................................................................................................................... 8 DNS Servers ................................................................................................................................... 10 Unified Threat Management (UTM) ............................................................................................. 11 UTM – URL Filter ........................................................................................................................... 13 UTM – Content Inspection ............................................................................................................ 14 UTM – Malware Inspection ..........................................................................................................
    [Show full text]
  • Secure Data Transfer Guidance for Industrial Control and SCADA Systems
    PNNL-20776 Prepared for the U.S. Department of Energy under Contract DE-AC05-76RL01830 Secure Data Transfer Guidance for Industrial Control and SCADA Systems RE Mahan JR Burnette JD Fluckiger CA Goranson SL Clements H Kirkham C Tews September 2011 PNNL-20776 Secure Data Transfer Guidance for Industrial Control and SCADA Systems RE Mahan JR Burnette JD Fluckiger CA Goranson SL Clements H Kirkham C Tews September 2011 Prepared for the U.S. Department of Energy under Contract DE-AC05-76RL01830 PNNL-20776 Pacific Northwest National Laboratory Richland, Washington 99352 Table of Contents Secure Data Transfer Guidance for Industrial Control and SCADA Systems .................................. 1 Introduction .................................................................................................................................... 1 Secure Data Transfer....................................................................................................................... 1 Current Architecture ................................................................................................................... 1 Proposed Architecture ................................................................................................................ 2 SECURITY ZONES ............................................................................................................................. 4 Definition .................................................................................................................................... 4 Recommendations
    [Show full text]
  • NIST SP 800-44 Version 2
    Special Publication 800-44 Version 2 Guidelines on Securing Public Web Servers Recommendations of the National Institute of Standards and Technology Miles Tracy Wayne Jansen Karen Scarfone Theodore Winograd NIST Special Publication 800-44 Guidelines on Securing Public Web Version 2 Servers Recommendations of the National Institute of Standards and Technology Miles Tracy, Wayne Jansen, Karen Scarfone, and Theodore Winograd C O M P U T E R S E C U R I T Y Computer Security Division Information Technology Laboratory National Institute of Standards and Technology Gaithersburg, MD 20899-8930 September 2007 U.S. Department of Commerce Carlos M. Gutierrez, Secretary National Institute of Standards and Technology James Turner, Acting Director GUIDELINES ON SECURING PUBLIC WEB SERVERS Reports on Computer Systems Technology The Information Technology Laboratory (ITL) at the National Institute of Standards and Technology (NIST) promotes the U.S. economy and public welfare by providing technical leadership for the Nation’s measurement and standards infrastructure. ITL develops tests, test methods, reference data, proof of concept implementations, and technical analysis to advance the development and productive use of information technology. ITL’s responsibilities include the development of technical, physical, administrative, and management standards and guidelines for the cost-effective security and privacy of sensitive unclassified information in Federal computer systems. This Special Publication 800-series reports on ITL’s research, guidance, and outreach efforts in computer security, and its collaborative activities with industry, government, and academic organizations. National Institute of Standards and Technology Special Publication 800-44 Version 2 Natl. Inst. Stand. Technol. Spec. Publ. 800-44 Ver.
    [Show full text]
  • NIST Firewall Guide and Policy Recommendations
    Special Publication 800-41 Guidelines on Firewalls and Firewall Policy Recommendations of the National Institute of Standards and Technology John Wack, Ken Cutler, Jamie Pole NIST Special Publication 800-41 Guidelines on Firewalls and Firewall Policy Recommendations of the National Institute of Standards and Technology John Wack, Ken Cutler*, Jamie Pole* C O M P U T E R S E C U R I T Y Computer Security Division Information Technology Laboratory National Institute of Standards and Technology Gaithersburg, MD 20899-8930 *MIS Training Institute January 2002 U.S. Department of Commerce Donald L. Evans, Secretary Technology Administration Phillip J. Bond, Under Secretary for Technology National Institute of Standards and Technology Arden L. Bement, Jr., Director ii Reports on Computer Systems Technology The Information Technology Laboratory (ITL) at the National Institute of Standards and Technology (NIST) promotes the U.S. economy and public welfare by providing tech- nical leadership for the Nation’s measurement and standards infrastructure. ITL de- velops tests, test methods, reference data, proof of concept implementations, and technical analysis to advance the development and productive use of information technology. ITL’s responsibilities include the development of technical, physical, ad- ministrative, and management standards and guidelines for the cost-effective security and privacy of sensitive unclassified information in Federal computer systems. This Special Publication 800-series reports on ITL’s research, guidance, and outreach ef- forts in computer security, and its collaborative activities with industry, government, and academic organizations. National Institute of Standards and Technolog y Special Publication 800-41 Natl. Inst. Stand. Technol. Spec. Publ. 800-41, 75 pages (Jan.
    [Show full text]
  • Firewall Deployment for Scada and Process Control Networks Good Practice Guide
    FIREWALL DEPLOYMENT FOR SCADA AND PROCESS CONTROL NETWORKS GOOD PRACTICE GUIDE 15 FEBRUARY 2005 This paper was previously published by the National Infrastructure Security Co-ordination Centre (NISCC) – a predecessor organisation to the Centre for the Protection of National Infrastructure (CPNI). Hyperlinks in this document may refer to resources that no longer exist. Please see CPNI’s website (www.cpni.gov.uk) for up-to-date information. Disclaimer Reference to any specific commercial product, process or service by trade name, trademark, manufacturer, or otherwise, does not constitute or imply its endorsement, recommendation, or favouring by CPNI. The views and opinions of authors expressed within this document shall not be used for advertising or product endorsement purposes. To the fullest extent permitted by law, CPNI accepts no liability for any loss or damage (whether direct, indirect or consequential and including, but not limited to, loss of profits or anticipated profits, loss of data, business or goodwill) incurred by any person and howsoever caused arising from or connected with any error or omission in this document or from any person acting, omitting to act or refraining from acting upon, or otherwise using, the information contained in this document or its references. You should make your own judgement as regards use of this document and seek independent professional advice on your particular circumstances. Firewall Deployment for SCADA and Process Control Networks Revision History Revision Date Author(s) Description 0.1
    [Show full text]
  • The Strengths and Limitations of Dmzs in Network Security by Cameron Meyer Submitted to the Department of Technology Systems In
    DMZs in Network Security The Strengths and Limitations of DMZs in Network Security By Cameron Meyer Submitted to the Department of Technology Systems In partial fulfillment of the requirements for the degree of Master of Science in Networking Technology At EAST CAROLINA UNIVERSITY November 2017 1 DMZs in Network Security This page intentionally left blank. 2 DMZs in Network Security Abstract A demilitarized zone (DMZ) in terms of a network is a segmented area in the network that is available to the public but is segmented in order to stay separated from a network’s internal private network. In other words, it separates the untrusted public Internet from the trusted network of an organization. This is done through the act of subnetting and is a useful network security design feature within the network architecture. A DMZ can have a completely different IP addressing scheme from the internal network, or it can reside on the same network broken up logically through the use of VLANs. Typically, an organization will place assets that house information, resources or servers that the public needs access to; such as its public web servers in the DMZ. This allows the public can have open access to them and their data or services they provide but integral organizational data is stored on the private network not directly accessible from the public Internet or the assets that are placed within the DMZ. Firewalls play an important role is the security architecture of a network especially in setting up a DMZ. Due to the idea of opening up this segment of the network to the public, the assets here can become compromised.
    [Show full text]
  • Demilitarized Network to Secure the Data Stored in Industrial Networks
    International Journal of Electrical and Computer Engineering (IJECE) Vol. 11, No. 1, February 2021, pp. 611~619 ISSN: 2088-8708, DOI: 10.11591/ijece.v11i1.pp611-619 611 Demilitarized network to secure the data stored in industrial networks José R. Nuñez Alvarez1, Yelena Pérez Zamora2, Israel Benítez Pina3, Eliana Noriega Angarita4 1,4Department of Energy, Universidad de la Costa, Colombia 2,3Department of Automatic, Universidad de Oriente, Cuba Article Info ABSTRACT Article history: Currently, the data and variables of a control system are the most important elements to be safeguarded in an industrial network, so it is vitally important Received Jan 29, 2020 to ensure their safety. This paper presents the design and simulation of Revised Jun 18, 2020 a demilitarized network (DMZ) using firewalls to control access to all Accepted Jul 29, 2020 the information that is stored in the servers of the industrial network of the Hermanos Díaz Refinery in Santiago de Cuba, Cuba. In addition, the characteristics, configurations, methods, and rules of DMZs and firewalls Keywords: are shown, select the configuration with three multi-legged firewalls as the most appropriate for our application, since it allows efficient exchange of Control system data guaranteeing security and avoiding the violation of the control system. Demilitarized network Finally, the simulation of the proposed network is carried out. Electrical network Firewalls Industrial network This is an open access article under the CC BY-SA license. Corresponding Author: José Ricardo Nuñez Alvarez, Department of Energy, Universidad de la Costa, Calle 58 No 55-66. Código Postal: 080002, Barranquilla, Atlántico, Barranquilla, Colombia. Email: [email protected] 1.
    [Show full text]