NorduGrid ARC 6 Information Release ARC6

NorduGrid Collaboration

Sep 16, 2021

CONTENTS

1 ARC Overview
  1.1 ARC CE components and the infrastructure ecosystem around

2 Obtaining the software

3 Support and Community

4 Documentation for Infrastructure Admins
  4.1 ARC Configuration Reference Document
  4.2 ARC CE Deployment and Operation
  4.3 ARCHERY
  4.4 ARC Admin Tools Reference

5 Technical Documents Describing ARC Components
  5.1 ARC Data Services Technical Description
  5.2 ARC Next Generation Accounting Technical Details
  5.3 ARC CE REST interface specification
  5.4 ARCHERY data model and DNS records rendering
  5.5 Old Relevant Technical Documents
  5.6 Legacy JURA Accounting Technical Details
  5.7 ARC Accounting Database Schema

6 Documentation for Developers
  6.1 Implementation Details for Developers
  6.2 Contributing to Documentation

7 Documentation for Infrastructure Users
  7.1 Installing ARC Client Tools
  7.2 Overview of ARC client tools
  7.3 How to submit the job?
  7.4 How to work with data?
  7.5 Job Description Language (xRSL)
  7.6 ARC Client Config Reference
  7.7 ARC SDK Documentation

8 ARC Miscellaneous Pages
  8.1 ARC support for OIDC
  8.2 About the Nordugrid ARC Releases
  8.3 Security Operations
  8.4 ARC 6 Testing Area
  8.5 Release management
  8.6 Changelogs/list of bugs
  8.7 Using ARC packages from nightly builds
  8.8 NorduGrid repository information for ARC 6
  8.9 Work-in-progress Docs

Bibliography


The Advanced Resource Connector (ARC) middleware, developed by the NorduGrid Collaboration, is an open source software solution enabling e-Science computing infrastructures, with an emphasis on processing large data volumes. ARC has been used to enable national and international e-infrastructures since its first release in 2002. This document is dedicated to ARC Version 6 and collects all relevant information in one place: you should be able to find information regarding the code, documentation, testing activities, support channels and so on here. The information is refreshed daily; a snapshot of the development version can be found here. If you are new to ARC, start by reading the Try ARC6 quickstart guide to get an overview of the main operations in a simple test case. For production Computing Element deployment, follow the Installation and Configuration Guide, which contains the structure of and pointers to the precise configuration of every ARC subsystem. In case you are migrating to ARC 6 from an ARC 5 installation, read the Migration Guide. For an overview of the main changes compared to ARC 5, please visit Main changes in ARC 6 compared to ARC 5. The ultimate description of the new ARC 6 configuration can be found in the ARC Configuration Reference Document.


CHAPTER ONE

ARC OVERVIEW

A birds-eye overview of the ARC services, including the architecture figure of the ARC CE, can be found in the following documents:

1.1 ARC CE components and the infrastructure ecosystem around

Fig. 1.1: ARC CE: internals, interfaces and the infrastructure ecosystem services around


CHAPTER TWO

OBTAINING THE SOFTWARE

ARC is available for a variety of GNU/Linux flavors via the stable Repositories, or via Nightly Builds if you want to test the latest development release. The source code is hosted in NeIC's Coderefinery GitLab repository.


CHAPTER THREE

SUPPORT AND COMMUNITY

User support and site installation assistance are provided via the nordugrid-discuss mailing list and the Nordugrid Bugzilla.


CHAPTER FOUR

DOCUMENTATION FOR INFRASTRUCTURE ADMINS

This section contains documentation about the deployment, configuration and operation of all ARC middleware services. If you are looking for ARC Computing Element setup instructions or performance tuning parameters, you are in the right place.

4.1 ARC Configuration Reference Document

4.1.1 General configuration structure

This is the arc.conf REFERENCE DOCUMENT defining the configuration blocks and configuration options for the ARC services. The arc.conf configuration file consists of the following blocks:

[common]
[authgroup:groupname]
[mapping]
[authtokens]
[lrms]
[lrms/ssh]
[arex]
[arex/cache]
[arex/cache/cleaner]
[arex/data-staging]
[arex/ws]
[arex/ws/jobs]
[arex/ws/publicinfo]
[arex/ws/cache]
[arex/ws/candypond]
[arex/ws/argus]
[arex/jura]
[arex/jura/sgas:targetname]
[arex/jura/apel:targetname]
[arex/jura/archiving] (removed in 6.8.0)
[arex/ganglia]
[gridftpd]
[gridftpd/jobs]
[gridftpd/filedir]
[infosys]
[infosys/ldap]
[infosys/nordugrid]
[infosys/glue2]
[infosys/glue2/ldap]
[infosys/glue1]
[infosys/glue1/site-bdii]
[infosys/cluster]
[queue:name]
[datadelivery-service]
[acix-scanner]
[acix-index]
[userlist:name]
[nordugridmap]
[custom:blockname]

[block]

A block configures an ARC service, a service interface, a utility or a subsystem. Enabling (turning on) a functionality, a service or an interface requires the presence of the appropriate configuration block. To disable a service or an interface, simply delete or comment out the related arc.conf block (you may need to rerun the corresponding startup script).

The [common] block is mandatory even if not a single option is specified within it. The presence of the block turns on the default values for the configuration options within the block. As an example, in order to set up a minimalistic ARC CE offering no external interfaces you need to configure at least the [common], [mapping], [arex], [lrms], [infosys] and [queue:name] blocks. As another example, an ARC-based data offloader would require the [common] and the [datadelivery-service] blocks.

A block is identified by its block header. A block header may consist of keywords and optionally block identifiers. Keywords may be separated by / and used to label sub-blocks (e.g. [arex/jura]), while block identifiers are separated by : from keywords. For example, in the [queue:short] block header queue is a keyword while short is an identifier, e.g. the name of the queue. Block headers must be UNIQUE.

A block starts with a unique [keyword:identifier] block header and ends where the next block starts, that is at the next [blockheader] directive. A block may have sub-blocks, e.g. the various interfaces of the A-REX service are configured via sub-blocks (e.g. [arex/ws]). When a sub-block is enabled, the corresponding parent block must also appear in the arc.conf file.

Configuration blocks contain (config option, config value) pairs following this syntax on a single line:

config_option=value element [optional value element]

Note: quotes around the configuration value(s) must NOT be used any longer.

Note: the arc.conf is CASE-SENSITIVE!
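To make this concrete, a minimal arc.conf for a computing element with no external interfaces could look roughly like the sketch below. This is an illustrative sketch only: the authgroup name, the mapped account, the choice of the fork LRMS and the queue name are placeholder assumptions, not recommendations; consult the option reference below and the Installation and Configuration Guide for real deployments.

[common]

[authgroup:all]
all=yes

[mapping]
map_to_user=all nobody:nobody

[lrms]
lrms=fork

[arex]

[infosys]

[queue:fork]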

Space handling syntax in arc.conf for configuration lines:

(stripped space)option(stripped space)=(stripped space)value(saved space)(value)(stripped space)

and for block headers:

[keyword:(stripped space)space is NOT allowed within identifier(stripped space)]

Detailed textual definition:

1. All trailing and leading spaces on each configuration line are stripped and ignored. This applies both to block headers and block content.


2. All spaces around the = sign in option=value kinds of strings (after rule 1 is applied) are stripped and ignored. For example the line hostname = myhost.info is treated as identical to hostname=myhost.info.

3. In block headers of the [keyword] kind (after rule 1 is applied) no additional spaces are allowed around keyword and inside keyword.

4. In block headers of the [keyword:identifier] kind (after rule 1 is applied) no additional spaces are allowed around keyword and inside either keyword or identifier. Spaces ARE allowed around the identifier part and are stripped and ignored.

Mandatory configuration options are indicated by an asterisk prefix to the option name, e.g. *mandatory_configoption. Mandatory options with undefined values will result in a service stop during the startup process.

Each configuration option has a well-defined default that is specified in this reference file. The default can take either a pre-set value, a special substitution or the keyword undefined. Configuration options within an enabled block take their default values in case they are missing (or commented out). Configuration options with undefined defaults take no values. Furthermore, configuration options within disabled blocks take no values either.

Configuration blocks are ORDER-DEPENDENT. To be safe, please use the order as indicated in the list of blocks in this reference. This is especially important for configuration blocks related to authorization and mapping. The order dependency is also honoured within options inside a certain block. This means, for instance, that configuration blocks related to authorization MUST appear before they are used in blocks such as [mapping], [arex/ws/jobs] or [gridftpd/jobs]. Order dependency within a block is for instance important when it comes to authorization decisions, as the first matching rule is used. For more details see the specific block reference.

Below we give a detailed description of all the configuration options of the different configuration blocks. Every configuration option is described in a dedicated paragraph with the following reference syntax notation. This file is parsed at build time to assist the configuration default parsing and validation scripts, so it is important that it follows the agreed syntax: for each block or option please add explanatory text with two ## followed by a space at the beginning of the line, and then an example with a single # and no spaces at the beginning of the line.

example_config_option

Synopsis: example_config_option = value [optional values]

Description: Here comes the explanation of the config option. Mandatory configuration options are indicated by an asterisk prefix to the option name, e.g. *mandatory_configoption vs. optional_configoption. The explanation can be followed by special keywords on a separate line:

• multivalued - used to indicate that the config option can be specified multiple times. This forms a set of values for the same configuration option irrespective of the order of the lines.

• sequenced - used to indicate that the config option is part of a sequence and its effect on the configuration depends on the order of the lines. A sequenced option can be specified several times in the configuration sequence independently.

The absence of such keywords means the config option can only occur once in arc.conf. By default the arc.conf config options are optional and single-valued. For some config options only a fixed set of values is allowed. These are listed on a separate line after the allowedvalues keyword. The default of every config option is explicitly given in the default: line. The default can be a pre-set value, a substitution or the undefined keyword. The last line of the paragraph is always a valid example preceded by a single #.

This option is multivalued.

Allowed values: 12, 34, 56

Default: 34

Example:


example_config_option=56
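For orientation, the build-time annotation format described above would make the reference-file paragraph for this illustrative option look roughly as follows. This is a sketch only: the exact layout of the multivalued, allowedvalues and default lines in the real reference file may differ.

## Here comes the explanation of the config option.
## multivalued
## allowedvalues: 12 34 56
## default: 34
#example_config_option=56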

4.1.2 Configuration blocks and options

[common] block

Common configuration affecting all ARC components, usually related to networking, security or service behaviour. This block is mandatory. The common block options may be overridden by the specific sections of the components later. The [common] block always appears at the beginning of the config file. The config options set within this block are available for all the other blocks and are thus shared by the different components of ARC.

hostname

Synopsis: hostname = string Description: The FQDN of the frontend on which the ARC services are deployed. Default: $EXEC{hostname -f} Example: hostname=myhost.org

Warning: CHANGE: MODIFIED semantics in 6.0.0, not mandatory any longer.

http_proxy

Synopsis: http_proxy = url Description: The http proxy server. This setting affects all client HTTP(s) requests that are initiated by ARC core services, including data staging, SAML communications, and pushing SGAS accounting records. This variable is similar to setting the ARC_HTTP_PROXY environmental variable. Default: undefined Example: http_proxy=proxy.mydomain.org:3128

Warning: CHANGE: MOVED in 6.0.0 from the old [grid-manager] block.

x509_host_key

Synopsis: x509_host_key = path Description: (previously x509_user_key) Server credential location. Sets the full path to the host private key. These variables are similar to the GSI environment variable X509_USER_KEY. If indicated, the variable can be set individually for each service/component in the corresponding block. Default: /etc/grid-security/hostkey.pem Example:


x509_host_key=/etc/grid-security/hostkey.pem

Warning: CHANGE: RENAMED in 6.0.0.

x509_host_cert

Synopsis: x509_host_cert = path Description: (previously x509_user_cert) Server credential location. Sets the full path to the host public certificate. These variables are similar to the GSI environment variable X509_USER_CERT. If indicated, the variable can be set individually for each service/component in the corresponding block. Default: /etc/grid-security/hostcert.pem Example: x509_host_cert=/etc/grid-security/hostcert.pem

Warning: CHANGE: RENAMED in 6.0.0.

x509_cert_dir

Synopsis: x509_cert_dir = path Description: Location of trusted CA certificates. This variable is similar to the GSI environment variable X509_CERT_DIR. If indicated, the variable can be set individually for each service/component in the corresponding block. Default: /etc/grid-security/certificates Example: x509_cert_dir=/etc/grid-security/certificates

x509_voms_dir

Synopsis: x509_voms_dir = path Description: The path to the directory containing *.lsc files needed for verification of the VOMS service signature in the proxy-certificate. Default: /etc/grid-security/vomsdir Example: x509_voms_dir=/etc/grid-security/vomsdir

voms_processing

Synopsis: voms_processing = keyword Description: Defines how to behave if errors are detected in VOMS AC processing. The following keywords are supported:

relaxed - use everything that passed validation.

standard - same as relaxed, but fail if parsing errors took place and the VOMS extension is marked as critical. This is the default.

strict - fail if any parsing error was discovered.

noerrors - fail if any parsing or validation error happened.

Allowed values: relaxed, standard, strict, noerrors Default: standard Example:

voms_processing=strict

[authgroup:groupname] block

These configuration blocks contain authorization rules. An [authgroup:groupname] block always defines a group of users where members of the group are those who satisfy the authorization rules. The rules within the block determine which users belong to the authgroup. Access control and identity mapping of ARC services are then implemented by associating an authgroup with an interface, queue or a mapping rule using one of the allowaccess, denyaccess or [mapping] block parameters. For more info please read the Security Framework of ARC at http://www.nordugrid.org/documents/arc-security-documentation.pdf

The authgroup should not be mistaken for a virtual organisation (VO). An authgroup may match a single VO if only a single check (rule) on VO membership is performed.

IMPORTANT: Rules in an authgroup are processed in their order of appearance. The first matching rule decides the membership of the user to the authgroup being evaluated and the processing STOPS within that authgroup. This does not mean that the same user is not processed for the next authgroup: all [authgroup:groupname] blocks are evaluated, even if a user already has a match with one of the earlier groups.

All the objects used in the rules MUST be defined before they are used. For example, to create a group of authgroups you must first define the child groups.

There are positively and negatively matching rules. If a rule is matched positively then the user tested is accepted into the respective group and further processing is stopped. Upon a negative match the user would be rejected for that group - processing stops too. The sign of a rule is determined by prepending the rule with + (for positive) or - (for negative) signs. + is the default and can be omitted. A rule may also be prepended with ! to invert the result of the rule, which will let the rule match the complement of users. That complement operator (!) may be combined with the operator for positive or negative matching.

Warning: CHANGE: RENAMED the block in 6.0.0.
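To illustrate how the rule types documented below combine, an authgroup block might be sketched as follows. The group name, the banned-users file path, the DN and the VO name are placeholder assumptions; the sketch first rejects users listed in an external file, then accepts a specific certificate subject and any member of the given VO.

[authgroup:testers]
-file=/etc/grid-security/banned_users
subject=/O=Grid/O=Big VO/CN=Main Boss
voms=nordugrid Guests * *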

subject

Synopsis: subject = certificate_subject Description: Rule to match a specific subject of the user's X.509 certificate. No masks, patterns or regular expressions are allowed. This is a sequenced option. Default: undefined Example:

subject=/O=Grid/O=Big VO/CN=Main Boss
subject=/O=Grid/O=Big VO/CN=Deputy Boss


file

Synopsis: file = path Description: Processes a list of DNs stored in an external file, one per line, in grid-mapfile format (see map_with_file from the [mapping] block; the unixname is ignored) and adds those to the authgroup. This is a sequenced option. Default: undefined Example:

file=/etc/grid-security/local_users
file=/etc/grid-security/atlas_users

Warning: CHANGE: MODIFIED semantics in 6.0.0. The external file should only contain DNs, no complex rules. No need to change the code.

voms

Synopsis: voms = vo group role capabilities Description: Match the VOMS attribute in the user's credential. Use * to match any value. This is a sequenced option. Default: undefined Example:

voms=nordugrid Guests * *
voms=atlas students prodman *

authgroup

Synopsis: authgroup = group_name [group_name ...] Description: (previously group) Match a user already belonging to one of the specified authgroups. The authgroup referred to here must be defined earlier in the arc.conf configuration file. Multiple authgroup names may be specified for this rule. That allows creating a hierarchical structure of authorization groups, e.g. all-atlas are those which are atlas-users and atlas-admins. This is a sequenced option. Default: undefined Example:

authgroup=local_admins
authgroup=local_admins remote_users

Warning: CHANGE: RENAMED in 6.0.0.

userlist

Synopsis: userlist = ulist_name [ulist_name ...] Description: (previously vo) Match a user belonging to a ulist_name defined in an earlier [userlist:ulist_name] block. Multiple userlist names are allowed for this rule. This is a sequenced option. Default: undefined Example: userlist=biousers

Warning: CHANGE: RENAMED in 6.0.0.

plugin

Synopsis: plugin = timeout path [arg1 [arg2 [arg3...]]] Description: Run an external executable or a function from a shared library. The rule is matched if the plugin returns 0. Any other return code or a timeout is treated as the rule not matched. The following substitutions are supported in the arguments:

• %D - subject of certificate
• %P - path to proxy

ARC ships with an LCAS plugin that can be enabled with the following plugin configuration. For more information about configuring LCAS refer to the 'Using LCAS/LCMAPS' document. This is a sequenced option. Default: undefined Example:

plugin=10 /usr/libexec/arc/arc-lcas %D %P liblcas.so /usr/lib64 /etc/lcas/lcas.db

authtokens

Synopsis: authtokens = subject issuer audience scope group Description: Match OIDC token claims. Use * to match any value. This is a sequenced option. Default: undefined Example:

authtokens=e83eec5a-e2e3-43c6-bb67-df8f5ec3e8d0 https://wlcg.cloud.cnaf.infn.it/ * * *

Warning: CHANGE: INTRODUCED in 6.6.0.


all

Synopsis: all = yes|no Description: Matches any or no user identity. With the yes argument this rule always returns a positive match. With no it never matches. This is a sequenced option. Default: undefined Example:

all=yes

Warning: CHANGE: MODIFIED options in 6.0.0.

[mapping] block

This block defines the grid-identity to local identity mapping rules used by various ARC components. Rules in the [mapping] block are processed IN A SEQUENCE in line order of the configuration file (from top to bottom). There are two kinds of rules:

• mapping rules that define how the members of a particular authgroup are mapped

• policy rules that modify the processing of the mapping rules sequence

The default policy for mapping rules processing is:

• processing CONTINUES to the next rule if the identity of the user DOES NOT match the authgroup specified in the rule (can be redefined with the policy_on_nogroup option)

• processing STOPS if the identity of the user matches the authgroup specified in the mapping rule. Depending on whether this mapping rule returns a valid UNIX identity, the processing can be redefined with the policy_on_map and policy_on_nomap options.

The policy can be redefined at any point of the configuration sequence and affects all mapping rules defined after the policy rule.

Note: if the mapping process STOPS and there is still no local UNIX identity identified, the user running A-REX will be used.

Note: when the grid identity is mapped to the root account, request processing fails implicitly.

Warning: CHANGE: NEW block in 6.0.0.
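As an illustration of the sequence processing described above, a [mapping] block might be sketched as follows. The authgroup names, accounts and paths are placeholder assumptions; the policy_on_nomap=continue line lets processing fall through to the grid-mapfile rule for the same authgroup if, for example, the pool is depleted.

[mapping]
map_to_user=admins admin:admin
policy_on_nomap=continue
map_to_pool=atlas /etc/grid-security/pool/atlas
map_with_file=atlas /etc/grid-security/grid-mapfile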

map_to_user

Synopsis: map_to_user = authgroup_name unixname[:unixgroup] Description: The users that belong to the specified authgroup are mapped to the unixname local UNIX account, which may optionally be followed by a unixgroup UNIX group. In case of a non-existing unixname account the mapping rule is treated as a rule that did not return a mapped identity (nomap).


This is a sequenced option. Default: undefined Example:

map_to_user=authgroupA nobody:nobody

map_to_pool

Synopsis: map_to_pool = authgroup_name directory Description: The users that belong to the specified authgroup are assigned one of the local UNIX accounts in the pool. Account names that are part of this pool are stored line-by-line in the pool file inside the directory. The directory also contains information about used account names stored in other files. If there are no more available accounts in the defined pool for mapping, then accounts not used for a configurable time period may be reassigned. The pool behaviour, including account reuse, is configurable with the optional directory/config file that has INI syntax (line-by-line key=value). Possible keys of the config file are:

timeout - defines the timeout in days (default is 10) after which the UNIX account can be reassigned to another user if not used. The 0 value means no lease expiration.

This is a sequenced option. Default: undefined Example:

map_to_pool=atlas /etc/grid-security/pool/atlas

map_with_file

Synopsis: map_with_file = authgroup_name file Description: For users that belong to the specified authgroup, the DN of the certificate is matched against a list of DNs stored in the specified file, one per line, followed by a local UNIX account name. The DN must be quoted if it contains blank spaces. This rule can be used to implement the legacy grid-mapfile approach. This is a sequenced option. Default: undefined Example:

map_with_file=authgroupB /etc/grid-security/grid-mapfile

map_with_plugin

Synopsis: map_with_plugin = authgroup_name timeout plugin [arg1 [arg2 [...]]] Description: Run an external plugin executable with the specified arguments to find the UNIX account name to which users that belong to the specified authgroup will be mapped. A rule matches if the exit code is 0 and there is a UNIX account name printed on stdout (optionally followed by a UNIX group name separated by a colon). The exit code 1 designates failed mapping. Any other code or a timeout means fatal failure and will abort any further mapping processing. That will also cause rejection of the corresponding connection. Plugin execution time is limited to timeout seconds. In the arguments the following substitutions are applied before the plugin is started:

• %D - subject of user's certificate,
• %P - path to credentials' proxy file.


ARC ships with an LCMAPS plugin that can be enabled with the corresponding configuration. For more information about configuring LCMAPS refer to the 'Using LCAS/LCMAPS' document. This is a sequenced option. Default: undefined Example:

map_with_plugin=authgroupC 30 /usr/libexec/arc/arc-lcmaps %D %P liblcmaps.so /usr/lib64 /etc/lcmaps/lcmaps.db arc

policy_on_nomap

Synopsis: policy_on_nomap = continue/stop Description: Redefines the mapping rules sequence processing policy in case the identity of the user matches the authgroup specified in the mapping rule and the mapping rule DOES NOT return a valid UNIX identity. The default policy is to stop processing the further rules. For example, this policy will be triggered if the pool is depleted, the certificate subject is missing in the map file used for the defined authgroup, or the plugin execution failed. This is a sequenced option. Default: undefined Allowed values: continue, stop Example:

policy_on_nomap=continue

policy_on_map

Synopsis: policy_on_map = continue/stop Description: Redefines the mapping rules sequence processing policy in case the identity of the user matches the authgroup specified in the mapping rule and the mapping rule returns a valid UNIX identity. The default policy is to stop processing the further rules. This policy will be triggered if the rule successfully returns a result (allocated in pool, matched in map file, plugin call was successful). This is a sequenced option. Default: undefined Allowed values: continue, stop Example:

policy_on_map=stop

policy_on_nogroup

Synopsis: policy_on_nogroup = continue/stop Description: Redefines the mapping rules sequence processing policy in case the identity of the user DOES NOT match the authgroup specified in the mapping rule. The default policy is to continue processing the further rules. This is a sequenced option. Default: undefined Allowed values: continue, stop Example:


policy_on_nogroup=stop

[authtokens] block

This block activates the processing of OIDC tokens as defined in the WLCG profile.

Warning: CHANGE: NEW block in 6.6.0.

[lrms] block

This block specifies the characteristics of the Local Resource Management System (batch system) underneath the ARC CE. This block contains all the LRMS-specific parameters and information. Configuration values in this block are available for A-REX, the backends, the accounting and the infosys ARC subsystems. ARC supports the most common LRMS flavours.

Warning: CHANGE: NEW block in 6.0.0. Contains parameters previously set in [common], [infosys], [cluster], [queue]

lrms

Synopsis: *lrms = lrmstype [defaultqueue] Description: Sets the type of the LRMS (queue system) and optionally the default queue name. ONLY ONE LRMS IS ALLOWED. MULTIPLE LRMS ENTRIES WILL TRIGGER UNEXPECTED BEHAVIOUR. For lrmstype, the following values can be chosen:

• fork - simple forking of jobs to the same node as the server
• sge - (Sun/Oracle) Grid Engine
• condor - Condor
• pbs - PBS (covers Torque and other old PBS flavours e.g. OpenPBS, older PBSPro, etc)
• pbspro - Altair PBS Professional
• lsf - LSF
• ll - LoadLeveler
• slurm - SLURM
• boinc - Boinc
• slurmpy - new EXPERIMENTAL SLURM scripts (contains the ssh remote batch management as well)

The optional defaultqueue parameter specifies the name of an existing LRMS queue in the cluster that will be used by A-REX as the default queue to submit grid jobs in case the client does not specify a queue name during the job submission process. This queue name must match one of the [queue:queue_name] blocks. Allowed values: fork, sge, condor, pbs, pbspro, lsf, ll, slurm, boinc, slurmpy Default: undefined mandatory Example:


lrms=pbspro gridlong
lrms=slurm

Warning: CHANGE: slurmpy NEW in 6.0.0. pbspro NEW in 6.1.0

lrmsconfig

Synopsis: lrmsconfig = text Description: An optional free text field to describe the configuration of your Local Resource Management System (batch system). The value is published in the infosys, and is not used otherwise. Default: undefined Example: lrmsconfig=single job per processor

Warning: CHANGE: MOVED in 6.0.0 from the [cluster] block.

benchmark

Synopsis: benchmark = string Description: Defines the default benchmark specification to store in the accounting AAR records (as is) if per-job data is missing. It is advised to set it to cluster-wide defaults in case of reporting to APEL, to avoid records diversity for failed jobs or buggy backends. Default: HEPSPEC:1.0 Example: benchmark=HEPSPEC:12.26

Warning: CHANGE: NEW in 6.8.0

defaultmemory

Synopsis: defaultmemory = number Description: The LRMS memory request of the job to be set by the LRMS backend scripts, if a user submits a job without specifying how much memory should be used. The order of precedence is: job description -> defaultmemory. This is the amount of memory (specified in MB) that a job will request. Default: undefined Example: defaultmemory=512

Warning: CHANGE: MOVED in 6.0.0 from the [cluster] block.

nodename

Synopsis: nodename = path Description: Redefine the command to obtain the hostname of the LRMS worker node. By default the value is defined at build time and depends on the OS. In most cases /bin/hostname -f will be used.

Note: this way of getting the WN hostname will be used only in case the particular LRMS backend has no native LRMS-defined way.

Default: undefined Example: nodename=/bin/hostname -s

Warning: CHANGE: MOVED in 6.0.0 from the [arex] block.

gnu_time

Synopsis: gnu_time = path Description: Path to the GNU time command on the LRMS worker nodes. If time command exists on the node, jobscript will write additional diagnostic information. Default: /usr/bin/time Example: gnu_time=/usr/bin/time

Warning: CHANGE: MOVED in 6.0.0 from the [arex] block.

movetool

Synopsis: movetool = command Description: Redefine the command used to move files during jobscript execution on the LRMS worker node (the command should be available on WNs). This in particular applies to file movement from the sessiondir to the scratchdir in the shared sessiondir case. Default: mv Example: movetool=rsync -av

Warning: CHANGE: NEW in 6.8

pbs_bin_path

Synopsis: pbs_bin_path = path


Description: The path to the qstat, pbsnodes, qmgr and other PBS binaries; no need to set if PBS is not used. Default: /usr/bin Example: pbs_bin_path=/usr/bin

pbs_log_path

Synopsis: pbs_log_path = path Description: The path of the PBS server logfiles which are used by A-REX to determine whether a PBS job is completed. If not specified, A-REX will use qstat for that. Default: /var/spool/pbs/server_logs Example: pbs_log_path=/var/spool/pbs/server_logs

pbs_dedicated_node_string

Synopsis: pbs_dedicated_node_string = string Description: (previously dedicated_node_string) The string which is used in the PBS node config to distinguish the grid nodes from the rest. Suppose only a subset of nodes are available for grid jobs, and these nodes have a common node property string; in this case the string should be set to this value and only the nodes with the corresponding pbs node property are counted as grid-enabled nodes. Setting the dedicated_node_string to the value of the pbs node property of the grid-enabled nodes will influence how the totalcpus and user freecpus are calculated. You don't need to set this attribute if your cluster is fully available for the grid and your cluster's PBS config does not use the node property method to assign certain nodes to grid queues. You shouldn't use this config option unless you make sure your PBS config makes use of the above described setup. Default: undefined Example: pbs_dedicated_node_string=gridnode

Warning: CHANGE: RENAMED and MOVED in 6.0.0 from [cluster].

condor_bin_path

Synopsis: condor_bin_path = path Description: Path to Condor binaries. Must be set if Condor is used. Default: /usr/bin Example: condor_bin_path=/opt/condor/bin

condor_config

Synopsis: condor_config = path Description: Full path to Condor config file. Must be set if Condor is used and the config file is not in its default location (/etc/condor/condor_config or ~/condor/condor_config). The full path to the file should be given. Default: /etc/condor/condor_config Example: condor_config=/opt/condor/etc/condor_config

condor_rank

Synopsis: condor_rank = ClassAd_float_expression Description: If you are not happy with the way Condor picks nodes when running jobs, you can define your own ranking algorithm by optionally setting the condor_rank attribute. condor_rank should be set to a ClassAd float expression that you could use in the Rank attribute in a Condor job description. Default: undefined Example: condor_rank=(1-LoadAvg/2)*(1-LoadAvg/2)*Memory/1000*KFlops/1000000

condor_requirements

Synopsis: condor_requirements = string Description: Specify additional constraints for Condor resources. The value of condor_requirements must be a valid constraints string which is recognized by a condor_status -constraint ... command. It can reference pre-defined ClassAd attributes (like Memory, Opsys, Arch, HasJava, etc) but also custom ClassAd attributes. To define a custom attribute on a condor node, just add two lines like the ones below in the $(hostname).local config file on the node:

NORDUGRID_RESOURCE=TRUE
STARTD_EXPRS = NORDUGRID_RESOURCE, $(STARTD_EXPRS)

A job submitted to this resource is allowed to run on any node which satisfies the condor_requirements constraint. If condor_requirements is not set, jobs will be allowed to run on any of the nodes in the pool. When configuring multiple queues, you can differentiate them based on memory size or disk space, for example. Default: undefined Example:

condor_requirements=(OpSys == "linux" && NORDUGRID_RESOURCE && Memory >= 1000 && Memory < 2000)

sge_bin_path

Synopsis: sge_bin_path = path Description: Path to Sun Grid Engine (SGE) binaries. By default the qsub command is searched for in the shell PATH. Default: undefined Example:


sge_bin_path=/opt/n1ge6/bin/lx24-x86

sge_root

Synopsis: sge_root = path Description: Path to SGE installation directory. MUST be set if SGE is used. Default: /gridware/sge Example:

sge_root=/opt/n1ge6

sge_cell

Synopsis: sge_cell = name Description: The name of the SGE cell to use. This option is only necessary in case SGE is set up with a cell name different from ‘default’ Default: default Example:

sge_cell=default

sge_qmaster_port

Synopsis: sge_qmaster_port = port Description: The SGE port options should be used in case the SGE command line clients require the SGE_QMASTER_PORT and SGE_EXECD_PORT environment variables to be set. Usually they are not necessary. Default: undefined Example:

sge_qmaster_port=536

sge_execd_port

Synopsis: sge_execd_port = port Description: The SGE port options should be used in case the SGE command line clients require the SGE_QMASTER_PORT and SGE_EXECD_PORT environment variables to be set. Usually they are not necessary. Default: undefined Example:

sge_execd_port=537


sge_jobopts

Synopsis: sge_jobopts = string Description: Additional SGE options to be used when submitting jobs to SGE Default: undefined Example:

sge_jobopts=-P atlas -r yes

slurm_bin_path

Synopsis: slurm_bin_path = path Description: Path to SLURM binaries, must be set if installed outside of normal PATH Default: /usr/bin Example:

slurm_bin_path=/usr/bin

slurm_wakeupperiod

Synopsis: slurm_wakeupperiod = numsec Description: How long should infosys wait before querying SLURM for new data (seconds) Default: 30 Example:

slurm_wakeupperiod=15

slurm_use_sacct

Synopsis: slurm_use_sacct = yes/no Description: Indicates whether ARC should use sacct instead of scontrol to obtain information about finished jobs. Not supported by slurmpy. Allowed values: yes, no Default: yes Example:

slurm_use_sacct=yes

slurm_requirements

Synopsis: slurm_requirements = string Description: Use this option to specify extra SLURM-specific parameters. Default: undefined Example:


slurm_requirements=memory on node >> 200

Warning: CHANGE: NEW in 6.0.0.

slurm_query_retries

Synopsis: slurm_query_retries = number Description: Number of sacct/scontrol retries performed in scan-SLURM-job. If SLURM is overloaded the sacct/scontrol command call may fail. If retries > 1, sacct/scontrol is retried after some seconds for that (those) particular job(s). If all retry attempts fail, the next run of scan-SLURM-job will pick up the job(s) from last time. Default: 1 Example: slurm_query_retries=3

lsf_bin_path

Synopsis: lsf_bin_path = path Description: The PATH to the LSF bin folder. Default: /usr/bin Example: lsf_bin_path=/usr/local/lsf/bin/

lsf_profile_path

Synopsis: lsf_profile_path = path Description: Path to the profile.lsf file. Infoprovider scripts will source profile.lsf to set up the LSF utilities environment. Default: /usr/share/lsf/conf/profile.lsf Example: lsf_profile_path=/usr/local/share/lsf/conf/profile.lsf

lsf_architecture

Synopsis: lsf_architecture = string Description: CPU architecture to request when submitting jobs to LSF. Use only if you know what you are doing. Default: undefined Example: lsf_architecture=PowerPC


Warning: CHANGE: MOVED in 6.0.0 from [queue:name] block.

ll_bin_path

Synopsis: ll_bin_path = path Description: The PATH to the LoadLeveler bin folder Default: /usr/bin Example:

ll_bin_path=/opt/ibmll/LoadL/full/bin

ll_consumable_resources

Synopsis: ll_consumable_resources = yes/no Description: Indicates whether the LoadLeveler setup is using Consumable Resources. Allowed values: yes, no Default: no Example:

ll_consumable_resources=yes

boinc_db_host

Synopsis: boinc_db_host = hostname Description: Connection strings for the boinc database: host Default: localhost Example:

boinc_db_host=localhost

boinc_db_port

Synopsis: boinc_db_port = port Description: Connection strings for the boinc database: port Default: 3306 Example:

boinc_db_port=3306

boinc_db_name

Synopsis: boinc_db_name = db_name Description: Connection strings for the boinc database: db_name


Default: undefined Example: boinc_db_name=myproject

boinc_db_user

Synopsis: boinc_db_user = user Description: Connection strings for the boinc database: db_user Default: undefined Example: boinc_db_user=boinc

boinc_db_pass

Synopsis: boinc_db_pass = pwd Description: Connection strings for the boinc database: pwd Default: undefined Example: boinc_db_pass=password

boinc_app_id

Description: ID of the app handled by this CE. Setting this option makes database queries much faster in large projects with many apps. Default: undefined Example: boinc_app_id=1

[lrms/ssh] block

This sub-block configures the ssh environment for remote batch management. Currently ONLY the python slurm (slurmpy) EXPERIMENTAL batch module is able to utilize the remote ssh feature. Parameters within this block are relevant if the cluster frontend is remotely located with respect to the CE frontend (the machine running A-REX). If specified with the parameters below, the session, cache and runtime directories will be mounted from the cluster frontend on the CE frontend using sshfs. Job submission and management will be done using ssh (Paramiko).

Warning: CHANGE: NEW sub-block in 6.0.0 to configure ssh support in slurmpy as a part of tech preview.
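Putting the options of this sub-block together, a remote SLURM setup could be sketched as follows. The hostname, key path, user name and directories reuse the illustrative values from the option examples below and are placeholders, not recommendations.

[lrms]
lrms=slurmpy gridlong

[lrms/ssh]
remote_host=myremotehost.org
private_key=/etc/grid-security/hostkey-priv.pem
remote_user=grid
remote_sessiondir=/scratch/grid
remote_cachedir=/scratch/cache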

remote_host

Synopsis: *remote_host = hostname Description: Define the remote cluster frontend machine which contains session, cache and runtime directories and where jobs should be submitted to. Default: undefined mandatory


Example: remote_host=myremotehost.org

private_key

Synopsis: *private_key = path Description: Location of the private key which should be used to establish the connection to the machine specified by the 'remote_host' attribute. Default: undefined mandatory Example: private_key=/etc/grid-security/hostkey-priv.pem

remote_user

Synopsis: *remote_user = user Description: User on the remote cluster which should be used for mounting directories, submitting and managing jobs, and gathering information about the cluster. Default: undefined mandatory Example: remote_user=grid

remote_sessiondir

Synopsis: remote_sessiondir = path Description: Session directory on the cluster frontend to be mounted (sshfs) on the CE machine at the directory specified by the 'sessiondir' attribute in the [arex] block. Default: undefined mandatory Example: remote_sessiondir=/scratch/grid

remote_cachedir

Synopsis: remote_cachedir = path Description: Cache directory on cluster frontend to be mounted (sshfs) on CE machine at directory specified by the ‘cachedir’ attribute in the [arex] block. Default: undefined Example:


remote_cachedir=/scratch/cache

ssh_timeout

Synopsis: ssh_timeout = time Description: Timeout of establishing ssh connection. Unit in seconds. Default: 10 Example:

ssh_timeout=10

[arex] block

The [arex] block, together with its various subblocks, configures the A-REX service hosted in arched. A-REX takes care of various middleware tasks on the frontend such as job creation and management, stagein/stageout, LRMS job submission, data caching, etc.

Warning: CHANGE: RENAMED block in 6.0.0.

user

Synopsis: user = user[:group] Description: Switch to a non root user/group after startup. Use with caution because of limited functionality when arex is not run under root. Default: root Example:

user=grid:grid

norootpower

Synopsis: norootpower = yes|no Description: If set to yes, all job management processes will switch to mapped user’s identity while accessing session directory. This is useful if session directory is on NFS with root squashing turned on. Allowed values: yes, no Default: no Example:

norootpower=yes

delegationdb

Synopsis: delegationdb = db_name Description: specify which DB to use to store delegations. Currently supported db_names are bdb and sqlite


Default: sqlite Example:

delegationdb=sqlite

Warning: CHANGE: MODIFIED in 6.0.0 with new default.

watchdog

Synopsis: watchdog = yes/no Description: Specifies if an additional watchdog process is spawned to restart the main process if it is stuck or dies. Allowed values: yes, no Default: no Example:

watchdog=no

loglevel

Synopsis: loglevel = level Description: (previously debug) Set loglevel of the arched daemon hosting A-REX service between 0 (FATAL) and 5 (DEBUG). Defaults to 3 (INFO). Allowed values: 0, 1, 2, 3, 4, 5, FATAL, ERROR, WARNING, INFO, VERBOSE, DEBUG Default: 3 Example:

loglevel=3

Warning: CHANGE: RENAMED in 6.0.0.

logfile

Synopsis: logfile = path Description: Specify A-REX log file location. If using an external log rotation tool be careful to make sure it matches the path specified here. Default: /var/log/arc/arex.log Example:

logfile=/var/log/arc/arex.log

Warning: CHANGE: MODIFIED default (renamed file) in 6.0.0.


joblog

Synopsis: joblog = path Description: Specifies where to store a specialized log about started and finished jobs. If the path is empty the log is NOT written. Controlled by logrotate if the default name is kept. This log is not used by any other part of ARC so it can be safely disabled if you are not interested in storing a jobs log. Default: /var/log/arc/arex-jobs.log Example:

joblog=

fixdirectories

Synopsis: fixdirectories = yes/missing/no Description: Specifies whether during startup A-REX should create all directories needed for its operation and set suitable default permissions. If no is specified then A-REX does nothing to prepare its operational environment. In case of missing A-REX only creates and sets permissions for directories which are not present yet. For yes all directories are created and permissions for all used directories are set to default safe values. Allowed values: yes, missing, no Default: yes Example:

fixdirectories=yes

controldir

Synopsis: controldir = path Description: The directory of the A-REX's internal job metadata files. For heavily loaded computing elements you can consider locating the controldir on a dedicated partition optimized for small random reads and writes. The directory is not needed on the nodes. Default: /var/spool/arc/jobstatus Example:

controldir=/var/spool/arc/jobstatus

sessiondir

Synopsis: sessiondir = path [drain] Description: The directory which holds the sessiondirs of the grid jobs. Multiple session directories may be specified. In this case jobs are spread evenly over the session directories. If sessiondir=* is set, the session directory will be spread over the ${HOME}/.jobs directories of every locally mapped unix user. It is preferred to use common session directories. The path may be followed by drain, in which case no new jobs will be assigned to that sessiondir, but current jobs will still be processed and accessible. This option is multivalued. Default: /var/spool/arc/sessiondir Example:


sessiondir=/scratch/arcsessions drain
sessiondir=*

defaultttl

Synopsis: defaultttl = [ttl [ttr]] Description: The ttl parameter sets the time in seconds for how long a job session directory will survive after job execution has finished. If not specified the default is 1 week. The ttr parameter sets how long information about a job will be kept after the session directory is deleted. If not specified, the ttr default is one month. Default: 604800 2592000 Example:

defaultttl=2592000

shared_filesystem

Synopsis: shared_filesystem = yes/no Description: Specifies if computing nodes can access folders mounted with protocols like NFS with the same pathnames as the frontend. Note that the default ‘yes’ assumes that the paths to the session directories are the same on both frontend and nodes. If these paths are not the same, then one should set the scratchdir option. The option changes the RUNTIME_NODE_SEES_FRONTEND variable in the submission scripts. Allowed values: yes, no Default: yes Example:

shared_filesystem=yes

scratchdir

Synopsis: scratchdir = path Description: The path on the computing node to move the session directory to before execution. If defined it should contain the path to the directory on the computing node which can be used to store a job's files during execution. Sets the environment variable RUNTIME_LOCAL_SCRATCH_DIR. If the variable is not set, then the session dir is not moved before execution. Don't set this parameter unless you want to move the sessiondir to the scratchdir on the node. Default: undefined Example:

scratchdir=/local/scratch/

shared_scratch

Synopsis: shared_scratch = path Description: The path on the frontend where scratchdir can be found. If defined it should contain the path corresponding to that set in scratchdir as seen on the frontend machine. Sets the environment variable RUNTIME_FRONTEND_SEES_NODE. Default: undefined


Example: shared_scratch=/mnt/scratch

tmpdir

Synopsis: tmpdir = path Description: A temporary directory used by A-REX. Default: /tmp Example: tmpdir=/tmp

runtimedir

Synopsis: runtimedir = path Description: The directory which holds the additional runtime environment scripts, added by the system administrator. Several directories can be specified. To enable RTEs to be advertised in the information system and used during submission the arcctl tool should be used. This option is multivalued. Default: undefined Example:

runtimedir=/var/spool/arc/extraruntimes
runtimedir=/cvmfs/vo/arcruntime

maxjobs

Synopsis: maxjobs = number1 number2 number3 number4 number5 Description: Specifies the maximum allowed number of jobs:

number1 - jobs which are not in FINISHED state (jobs tracked in RAM)
number2 - jobs being run (SUBMITTING, INLRMS states)
number3 - jobs being processed per DN
number4 - jobs in the whole system
number5 - LRMS scripts limit (jobs in SUBMITTING and CANCELING)

A parameter set to -1 means no limit. Default: -1 -1 -1 -1 -1 Example: maxjobs=10000 10 2000 -1 -1

Warning: CHANGE: MODIFIED in 6.0.0. Explicitly indicate no limit with -1. Missing number should not be allowed.

maxrerun

Synopsis: maxrerun = number Description: Specifies how many times a job can be rerun if it failed in the LRMS. This is only an upper limit; the actual rerun value is set by the user in the xRSL job description.


Default: 5 Example:

maxrerun=5

statecallout

Synopsis: statecallout = state options plugin_path [plugin_arguments] Description: (previously authplugin) Enables a callout feature of A-REX: every time a job goes to the specified state, A-REX will run the plugin_path executable. The following states are allowed: ACCEPTED, PREPARING, SUBMIT, FINISHING, FINISHED and DELETED. Options consist of key=value pairs separated by commas. Possible keys are:

timeout - defines the timeout in seconds to wait for plugin execution (timeout= can be omitted).

onsuccess, onfailure, ontimeout - define the action that A-REX should take on successful execution (exit code 0), failed execution (exit code is not 0) or execution timeout, respectively. Possible actions are:

• pass - continue executing job,
• fail - cancel job,
• log - write to log about the failure and continue executing job.

It is possible to use the following substitutions to construct the plugin command line:

• %R - session root (value of sessiondir in [arex] block)
• %C - controldir path
• %U - username of mapped UNIX account
• %u - numeric UID of mapped UNIX account
• %g - numeric GID of mapped UNIX account
• %H - home directory of mapped UNIX account as specified in /etc/passwd
• %Q - default queue (see lrms configuration option in [lrms] block)
• %L - LRMS name (see lrms configuration option in [lrms] block)
• %W - ARC installation path (corresponds to the ARC_LOCATION environmental variable)
• %F - path to configuration file for this instance
• %I - job ID (substituted in runtime)
• %S - job state (substituted in runtime)

Plugins included in the ARC distribution:

• arc-blahp-logger - write accounting log for every finished job in BLAH format

This option is multivalued. Default: undefined Example:

statecallout=FINISHED timeout=10,onfailure=pass /usr/libexec/arc/arc-blahp-logger -I %I -U %u -L %C/job.%I.local -P %C/job.%I.proxy

Warning: CHANGE: RENAMED in 6.0.0 from authplugin.

wakeupperiod

Synopsis: wakeupperiod = time Description: Specifies how often A-REX checks for newly arrived jobs, job state change requests, etc., that is, the responsiveness of A-REX. time is the time period in seconds. The default is 3 minutes. Usually there is no need to change this parameter because important state changes also trigger out-of-schedule checks.

Note: this parameter does not affect the responsiveness of the backend scripts - especially the scan-<LRMS>-job scripts. That means that an upper estimate of the time for detecting that a job finished executing is the sum of the responsiveness of the backend script + wakeupperiod. Default: 180 Example: wakeupperiod=180

infoproviders_timelimit

Synopsis: infoproviders_timelimit = seconds Description: (previously infoproviders_timeout) Sets the execution time limit of the infoprovider scripts started by the A-REX. Infoprovider scripts running longer than the specified time limit are gracefully handled by the A-REX (the behaviour depends on the state of the system). Increase this value if you have many jobs in the controldir and the infoproviders need more time to process them. Default: 10800 Example: infoproviders_timelimit=10800

Warning: CHANGED: RENAMED and MOVED in 6.0.0 to [arex] block.

pidfile

Synopsis: pidfile = path Description: Specify the location of the file containing the PID of the daemon process. Default: /run/arched-arex.pid Example: pidfile=/run/arched-arex.pid

mail

Synopsis: mail = email_address Description: Specifies the email address from where the notification mails are sent Default: $VAR{user}@$VAR{[common]hostname} Example:


mail=grid.support@somewhere.org

Warning: CHANGE: MODIFIED in 6.0.0.

gnu_time

Synopsis: gnu_time = path Description: The gnu time command

Warning: CHANGE: MOVED in 6.0.0 to [lrms].

nodename

Synopsis: nodename = path Description: The command to obtain hostname of computing node.

Warning: CHANGE: MOVED in 6.0.0 to [lrms].

helper

Synopsis: helper = user executable arguments Description: By enabling this parameter A-REX will run an external helper program under the specified user account. The program will be kept running: every time the executable finishes it will be started again. As a limitation, currently only '.' is supported as the username, which corresponds to the user running A-REX. Default: undefined Example: helper=. /usr/local/bin/myutility

helperlog

Synopsis: helperlog = path Description: Configuration option to specify the location of log for helpers. Default: /var/log/arc/job.helper.errors Example: helperlog=/var/log/arc/job.helper.errors

Warning: CHANGE: NEW PARAMETER in 6.0.0.


forcedefaultvoms

Synopsis: forcedefaultvoms = VOMS_FQAN Description: Specify the VOMS FQAN which the user will be assigned if his/her credentials contain no VOMS attributes. To assign different values to different queues put this command into the [queue] block. Default: undefined Example:

forcedefaultvoms=/vo/group/subgroup

[arex/cache] block

This subblock enables and configures the cache functionality of A-REX. A-REX can cache input files downloaded as part of the stage-in process of grid jobs, so that subsequent jobs requiring the same file don't have to download it again. The cached file will be symlinked (or copied) into the session directory of the job. To disable the cache functionality simply comment out the [arex/cache] config block. It is a good idea to have the cache on its own separate file system that is shared with the nodes. For more information about the cache functionality of A-REX consult the Data Cache technical description in the online documentation.

Warning: CHANGE: NEW block in 6.0.0.

cachedir

Synopsis: *cachedir = cache_path [link_path] Description: Specifies a directory to store cached data. Multiple cache directories may be specified. Cached data will be distributed evenly over the caches. The optional link_path specifies the path at which the cache_path is accessible on computing nodes, if it is different from the path on the A-REX host. If link_path is set to . files are not soft-linked, but copied to the session directory. If a cache directory needs to be drained, then link_path should specify drain, in which case no new files will be added to the cache and files in the cache will no longer be used. Setting link_path to readonly ensures that no new files are written to this cache, but existing files can still be used. Draining and read-only caches are not cleaned by the A-REX cache cleaner. A restart of A-REX is required when changing cache options. This option is multivalued. Default: undefined Example:

cachedir=/scratch/cache
cachedir=/shared/cache/frontend/jobcache
cachedir=/fs1/cache drain

Warning: CHANGE: Added readonly option in 6.7
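As an illustration only, a minimal cache setup combining the options above could look like the following sketch (the directory paths are hypothetical; the second cache is being drained):

[arex/cache]
cachedir=/shared/cache /cache
cachedir=/fs1/oldcache drain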

[arex/cache/cleaner] block

This subblock enables the cleaning functionality of the cache. If this block is not enabled then the cache will not be cleaned by A-REX. Either cachesize or cachelifetime should also be set to enable cleaning.


Warning: CHANGE: NEW block in 6.0.0.

logfile

Synopsis: logfile = path Description: (previously cachelogfile) sets the filename where output of the cache-clean tool should be logged. Defaults to /var/log/arc/cache-clean.log. Default: /var/log/arc/cache-cleaner.log Example: logfile=/tmp/cache-clean.log

Warning: CHANGE: RENAMED in 6.0.0.

loglevel

Synopsis: loglevel = level Description: (previously cacheloglevel) specifies the level of logging by the cache-clean tool, between 0 (FATAL) and 5 (DEBUG). Defaults to 3 (INFO). Allowed values: 0, 1, 2, 3, 4, 5, FATAL, ERROR, WARNING, INFO, VERBOSE, DEBUG Default: 3 Example: loglevel=4

Warning: CHANGE: RENAMED in 6.0.0.

cachesize

Synopsis: cachesize = max min Description: Specifies high and low watermarks for space used by cache, as a percentage of the space on the file system on which the cache directory is located. When the max is exceeded, files will be deleted to bring the used space down to the min level. It is a good idea to have the cache on its own separate file system. Default: 100 100 Example: cachesize=50 20

calculatesize

Synopsis: calculatesize = filesystem/cachedir Description: (previously cacheshared) specifies the way the space occupied by the cache will be calculated. If set to cachedir then cache-clean calculates the size of the cache instead of using filesystem used space.


Allowed values: filesystem, cachedir Default: filesystem Example:

calculatesize=cachedir

Warning: CHANGE: RENAMED in 6.0.0 .

cachelifetime

Synopsis: cachelifetime = time Description: Turns on time-based file cleaning. Files accessed less recently than the given time period will be deleted. Example values of this option are 1800, 90s, 24h, 30d. When no suffix is given the unit is seconds. Default: undefined Example:

cachelifetime=30d

cachespacetool

Synopsis: cachespacetool = path [options] Description: specifies an alternative tool to df that cache-clean should use to obtain space information on the cache file system. The output of this command must be total_bytes used_bytes. The cache directory is passed as the last argument to this command. Default: undefined Example:

cachespacetool=/etc/getspace.sh

cachecleantimeout

Synopsis: cachecleantimeout = time Description: the timeout in seconds for running the cache-clean tool. If using a large cache or slow file system this value can be increased to allow the cleaning to complete. Defaults to 3600 (1 hour). Default: 3600 Example:

cachecleantimeout=10000
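A hypothetical cleaner setup combining the options above (watermark, lifetime and timeout values are purely illustrative) could look like:

[arex/cache/cleaner]
cachesize=80 70
cachelifetime=30d
cachecleantimeout=7200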

[arex/data-staging] block

This subblock enables and configures the data staging capabilities of A-REX. A subsystem called DTR (Data Transfer Reloaded) is responsible for collecting input data for a job before submission to the LRMS, and for staging out data after the job has finished. The automatic data staging of A-REX is a very powerful feature, and disabling this functionality (by commenting out the subblock) is not recommended.


Warning: CHANGE: RENAMED block in 6.0.0.

loglevel

Synopsis: loglevel = number Description: (previously debug) Sets the log level for transfer logging in job.id.errors files, between 0 (FATAL) and 5 (DEBUG). Default is to use value set by loglevel option in [arex] section. Allowed values: 0, 1, 2, 3, 4, 5, FATAL, ERROR, WARNING, INFO, VERBOSE, DEBUG Default: $VAR{[arex]loglevel} Example: loglevel=4

Warning: CHANGE: RENAMED in 6.0.0.

logfile

Synopsis: logfile = path Description: (previously central_logfile) A central file in which all data staging messages from every job will be collected and logged in addition to their job.id.errors files. If this option is not present or the path is empty the log file is not created. This file is not automatically controlled by logrotate unless you name it as /var/log/arc/datastaging.log. Default: undefined Example: logfile=/var/log/arc/datastaging.log

Warning: CHANGE: RENAMED in 6.0.0.

statefile

Synopsis: statefile = path Description: (previously dtrlog) A file in which data staging state information (for monitoring and recovery purposes) is periodically dumped. Default: $VAR{[arex]controldir}/dtr.state Example: statefile=/tmp/dtr.state

Warning: CHANGE: RENAMED and MODIFIED in 6.0.0, new default value.


usehostcert

Synopsis: usehostcert = yes/no Description: Whether the A-REX host certificate should be used for communication with remote hosts instead of the users’ proxies. Allowed values: yes, no Default: no Example:

usehostcert=yes

maxtransfertries

Synopsis: maxtransfertries = number Description: the maximum number of times download and upload will be attempted per job (retries are only performed if an error is judged to be temporary) Default: 10 Example:

maxtransfertries=20

passivetransfer

Synopsis: passivetransfer = yes/no Description: If yes, gridftp transfers are passive. Setting this option to yes can solve transfer problems caused by firewalls. Allowed values: yes, no Default: yes Example:

passivetransfer=yes

Warning: CHANGE: NEW DEFAULT in 6.0.0.

globus_tcp_port_range

Synopsis: globus_tcp_port_range = port_range Description: In a firewalled environment the software which uses GSI needs to know what ports are available. This parameter is only needed if passivetransfer=no was set. These variables are similar to the Globus environment variables GLOBUS_TCP_PORT_RANGE and GLOBUS_UDP_PORT_RANGE. Default: 9000,9300 Example:

globus_tcp_port_range=9000,12000


Warning: CHANGE: MOVED in 6.0.0.

globus_udp_port_range

Synopsis: globus_udp_port_range = port_range Description: In a firewalled environment the software which uses GSI needs to know what ports are available. This parameter is only needed if passivetransfer=no was set. These variables are similar to the Globus environment variables GLOBUS_TCP_PORT_RANGE and GLOBUS_UDP_PORT_RANGE. Default: 9000,9300 Example:

globus_udp_port_range=9000,12000

Warning: CHANGE: MOVED in 6.0.0.

httpgetpartial

Synopsis: httpgetpartial = yes/no Description: If yes, HTTP GET transfers may transfer data in chunks/parts. If no, data is always transferred in one piece. Allowed values: yes, no Default: no Example:

httpgetpartial=no

Warning: CHANGE: NEW default in 6.0.0.

speedcontrol

Synopsis: speedcontrol = min_speed min_time min_average_speed max_inactivity Description: specifies how slow a data transfer must be to trigger an error. The transfer is cancelled if the speed is below min_speed bytes per second for at least min_time seconds, or if the average rate is below min_average_speed bytes per second, or if no data was transferred for longer than max_inactivity seconds. A value of zero turns the feature off. Default: 0 300 0 300 Example:

speedcontrol=0 300 100 300
speedcontrol=

Warning: CHANGE: MODIFIED in 6.0.0: missing parameter speedcontrol= should also turn the feature off, not just zero value.

maxdelivery

Synopsis: maxdelivery = number Description: Maximum number of concurrent file transfers, i.e. active transfers using network bandwidth. This is the total number for the whole system including any remote staging hosts. Default: 10 Example: maxdelivery=40

maxprocessor

Synopsis: maxprocessor = number Description: Maximum number of concurrent files in each of the DTR internal pre- and post-processing states, e.g. cache check or replica resolution. Default: 10 Example: maxprocessor=20

maxemergency

Synopsis: maxemergency = number Description: Maximum emergency slots which can be assigned to transfer shares when all slots up to the limits configured by the above two options are used by other shares. This ensures shares cannot be blocked by others. Default: 1 Example: maxemergency=5

maxprepared

Synopsis: maxprepared = number Description: Maximum number of files in a prepared state, i.e. pinned on a remote storage such as SRM for transfer. A good value is a small multiple of maxdelivery. Default: 200 Example: maxprepared=250

sharepolicy

Synopsis: sharepolicy = grouping Description: (previously sharetype) Defines the mechanism to be used for the grouping of the job transfers. DTR assigns the transfers to shares, so that those shares can be assigned to different priorities. Possible values for grouping are dn, voms:vo, voms:role and voms:group:


dn - each job is assigned to a share based on the DN of the user submitting the job.
voms:vo - each job is assigned to a share based on the VO specified in the proxy.
voms:role - each job is assigned to a share based on the role specified in the first attribute found in the proxy.
voms:group - each job is assigned to a share based on the group specified in the first attribute found in the proxy.
In case of the voms schemes, if the proxy is not a VOMS proxy, then a default share is used. If sharepolicy is not set then the client-defined priority is applied. Default: undefined Example:

sharepolicy=voms:role

Warning: CHANGE: RENAMED in 6.0.0.

sharepriority

Synopsis: sharepriority = share priority Description: (previously definedshare) Defines a share with a fixed priority, different from the default (50). Priority is an integer between 1 (lowest) and 100 (highest). This option is multivalued. Default: undefined Example:

sharepriority=myvo:students 20
sharepriority=myvo:production 80

Warning: CHANGE: RENAMED in 6.0.0.
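As a sketch, share-based prioritization could be configured by combining sharepolicy and sharepriority as below (the VO and share names simply reuse the documented examples and are not prescriptive):

[arex/data-staging]
sharepolicy=voms:role
sharepriority=myvo:production 80
sharepriority=myvo:students 20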

copyurl

Synopsis: copyurl = url_head local_path Description: Configures DTR to copy certain stage-in files instead of downloading them. URLs starting with ‘url_head’ should be accessible in a different way (most probably plain unix open). The ‘url_head’ part of the URL will be replaced with ‘local_path’, and the file at the resulting path will be copied to the session directory. NOTE: ‘local_path’ can also be of URL type. This option is multivalued. Default: undefined Example:

copyurl=gsiftp://example.org:2811/data/ /data/
copyurl=gsiftp://example2.org:2811/data/ /data/
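To illustrate the rewriting (reusing the hostname and paths from the example above, and assuming a purely hypothetical local layout):

# files under this gsiftp prefix are assumed to be readable locally under /data/
copyurl=gsiftp://example.org:2811/data/ /data/
# with this mapping, an input file gsiftp://example.org:2811/data/set1/file1
# would be copied from /data/set1/file1 instead of being downloaded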

linkurl

Synopsis: linkurl = url_head local_path [node_path] Description: Identical to ‘copyurl’, except that for the matching URLs files won’t be downloaded or copied (as with copyurl); instead a soft-link will be created. The ‘local_path’ specifies the way to access the file from the frontend, and is used to check permissions. The ‘node_path’ specifies how the file can be accessed from computing nodes, and will be used for soft-link creation. If ‘node_path’ is missing, ‘local_path’ will be used. This option is multivalued. Default: undefined Example:

linkurl=gsiftp://somewhere.org/data /data
linkurl=gsiftp://example.org:2811/data/ /scratch/data/

use_remote_acix

Synopsis: use_remote_acix = URL Description: (previously acix_endpoint) If configured then the ARC Cache Index, available at the URL, will be queried for every input file specified in a job description and any replicas found in sites with accessible caches will be added to the replica list of the input file. The replicas will be tried in the order specified by preferredpattern variable. Default: undefined Example: use_remote_acix=https://cacheindex.ndgf.org:6443/data/index

Warning: CHANGE: RENAMED in 6.0.0.

preferredpattern

Synopsis: preferredpattern = pattern Description: specifies a preferred pattern on which to sort multiple replicas of an input file. It consists of one or more patterns separated by a pipe character (|) listed in order of preference. Replicas will be ordered by the earliest match. If the dollar character ($) is used at the end of a pattern, the pattern will be matched to the end of the hostname of the replica. If an exclamation mark (!) is used at the beginning of a pattern, any replicas matching the pattern will be excluded from the sorted replicas. Default: undefined Example: preferredpattern=srm://myhost.ac.uk|.uk$|ndgf.org$|badhost.org$

The following options are used to configure the multi-host data staging deployment scenario. In that setup a number of additional data staging hosts are enabled to off-load data transfers.

deliveryservice

Synopsis: deliveryservice = URL Description: The URL to a remote data delivery service which can perform remote data staging.


Default: undefined Example:

deliveryservice=https://myhost.org:443/datadeliveryservice

localdelivery

Synopsis: localdelivery = yes/no Description: If any deliveryservice is defined, this option determines whether local data transfer is also performed. Allowed values: yes, no Default: no Example:

localdelivery=yes

remotesizelimit

Synopsis: remotesizelimit = size Description: Lower limit on file size (in bytes) of files that remote hosts should transfer. Can be used to increase performance by transferring small files using local processes. Default: undefined Example:

remotesizelimit=100000
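A minimal multi-host sketch, assuming a hypothetical remote delivery host myhost.org and purely illustrative values:

[arex/data-staging]
deliveryservice=https://myhost.org:443/datadeliveryservice
localdelivery=yes
remotesizelimit=100000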

[arex/ws] block

A-REX exposes a set of Web Service interfaces that can be used to create and manage jobs, obtain information about the CE and the jobs, handle delegations, access cache information, and so on. Comment out this block if you don’t want to provide WS-interfaces for various A-REX functionalities.

Warning: CHANGE: NEW block in 6.0.0. Most of the parameters originate from the old [grid-manager] block.

wsurl

Synopsis: wsurl = url Description: (previously arex_mount_point) Specifies the base URL under which the web service interfaces will be available. The URL argument must be a full URL consisting of protocol+host+port+path, e.g. https://<hostname>:<port>/<path>. Make sure the chosen port is not blocked by firewall or other security rules. Default: https://$VAR{[common]hostname}:443/arex Example:

wsurl=https://piff.hep.lu.se:443/arex


Warning: CHANGE: RENAMED and MODIFIED in 6.0.0.

logfile

Synopsis: logfile = path Description: (previously wslogfile) Specify log file location for WS-interface operations. Default: /var/log/arc/ws-interface.log Example: logfile=/var/log/arc/ws-interface.log

Warning: CHANGE: RENAMED in 6.0.0.

pidfile

Synopsis: pidfile = path Description: Specify location of file containing PID of daemon process. Default: /run/arched-arex-ws.pid Example: pidfile=/run/arched-arex-ws.pid

Warning: CHANGE: INTRODUCED in 6.13.0.

max_job_control_requests

Synopsis: max_job_control_requests = number Description: The max number of simultaneously processed job management requests over WS interface - like job submission, cancel, status check etc. Default: 100 Example: max_job_control_requests=100

max_infosys_requests

Synopsis: max_infosys_requests = number Description: The max number of simultaneously processed info requests over WS interface. Default: 1 Example: max_infosys_requests=1


max_data_transfer_requests

Synopsis: max_data_transfer_requests = number Description: The max number of simultaneously processed data transfer requests over WS interface - like data staging. Default: 100 Example:

max_data_transfer_requests=100

[arex/ws/jobs] block

AREX offers a set of web service interfaces implemented via either REST or WS-SOAP. This block enables the job management, info query and delegation protocols through both the REST and EMIES interfaces. Consult http://svn.nordugrid.org/repos/nordugrid/doc/trunk/tech_doc/emi-es/EMI-ES-Specification_v1.16.pdf for the EMIES interface description and http://www.nordugrid.org/documents/arc6/tech/rest/rest.html for the REST interface specification.

Warning: CHANGE: NEW sub-block in 6.0.0.

allownew

Synopsis: allownew = yes/no Description: The ‘allownew’ config parameter sets whether the Computing Element accepts submission of new jobs via the WS-interface. This parameter can be used to close down the CE. Allowed values: yes, no Default: yes Example:

allownew=yes

Warning: CHANGE: NEW in 6.0.0. Implement support in the code for WS-interface.

allownew_override

Synopsis: allownew_override = [authgroup ...] Description: (previously allowsubmit) Defines which authorization groups are allowed to submit new jobs via the WS-interfaces when the CE is closed with allownew=no. Note that it requires allownew=no to be set. This option is multivalued. Default: undefined Example:

allownew_override=biousers atlasusers
allownew_override=yourauthgroup


Warning: CHANGE: RENAMED and MODIFIED behaviour in 6.0.0. Modified behaviour: possible to specify several groups on a single line!
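For example, a CE could be closed for everybody except one authorization group with a sketch like the following (the biousers authgroup is assumed to be defined in a corresponding [authgroup:biousers] block elsewhere in the configuration):

[arex/ws/jobs]
allownew=no
allownew_override=biousers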

allowaccess

Synopsis: allowaccess = authgroup Description: (previously groupcfg) Defines that the specified authgroup members are authorized to access the ARC-CE via this interface. A related config option, denyaccess (see below), can be used to reject access. Multiple allowaccess and denyaccess authorization statements are allowed within a configuration block. These statements are processed sequentially in the order they are specified in the config block. The processing stops on the first allowaccess or denyaccess statement matching the authgroup membership. If there are no authorization statements specified, then no additional restrictions are applied for authorizing user access and the interface is open to everybody authenticated. Default: undefined This option is multivalued. Example:

allowaccess=biousers
allowaccess=atlasusers

denyaccess

Synopsis: denyaccess = authgroup Description: Defines that the specified authgroup members are REJECTED, not authorized to access the ARC-CE via this interface. Note that a related config option, allowaccess (see above), can be used to grant access. Multiple denyaccess and allowaccess authorization statements are allowed within a configuration block. These statements are processed sequentially in the order they are specified in the config block. The processing stops on the first allowaccess or denyaccess statement matching the authgroup membership. If there are no authorization statements specified, then no additional restrictions are applied for authorizing user access and the interface is open to everybody authenticated. Default: undefined This option is multivalued. Example: denyaccess=blacklisted-users

maxjobdesc

Synopsis: maxjobdesc = size Description: specifies maximal allowed size of job description in bytes. Default value is 5MB. Use 0 to set unlimited size. Default: 5242880 Example: maxjobdesc=0


Warning: CHANGE: NEW in 6.0.0.

[arex/ws/publicinfo] block

AREX allows access to public information for non-authorized users. The presence of this block enables this feature.

Warning: CHANGE: NEW block in 6.7.0.

allowaccess

Synopsis: allowaccess = authgroup Description: Defines that the specified authgroup members are authorized to access public information. For more information see the similar configuration option in the [arex/ws/jobs] block. Default: undefined This option is multivalued. Example: allowaccess=monitors

Warning: CHANGE: NEW in 6.7.0.

denyaccess

Synopsis: denyaccess = authgroup Description: Defines that the specified authgroup members are REJECTED, not authorized to access public information. For more information see the similar configuration option in the [arex/ws/jobs] block. Default: undefined This option is multivalued. Example: denyaccess=badactors

Warning: CHANGE: NEW in 6.7.0.

[arex/ws/cache] block

The content of the A-REX cache can be accessed via a WS-interface. Configuring this block will allow reading cache files through a special URL. For example, if the remote file gsiftp://remotehost/file1 is stored in the cache and the WS interfaces (configured above) are available via a wsurl of https://hostname:443/arex/, then the cached copy of the file can be accessed via the following special URL: https://hostname:443/arex/cache/gsiftp://remotehost/file1 Comment out this block if you don’t want to expose the cache content via the WS-interface.


Warning: CHANGE: NEW block in 6.0.0.

cacheaccess

Synopsis: cacheaccess = rule Description: This parameter defines the access control rules for the cache WS-interface, i.e. the rules for allowing access to files in the cache remotely through the A-REX web interface. If not set, then no one can access anything; the default is unset, which means complete denial. A rule has three parts:
1. Regular expression defining a URL pattern
2. Credential attribute to match against a client’s credential
3. Regular expression defining a credential value to match against a client’s credential
A client is allowed to access the cached file if a URL pattern matches the cached file URL and the client’s credential has the attribute and matches the value required for that pattern. Possible values for the credential attribute are dn, voms:vo, voms:role and voms:group. This option is multivalued. Default: undefined Example:

cacheaccess=gsiftp://host.org/private/data/.* voms:vo myvo:production
cacheaccess=gsiftp://host.org/private/data/bob/.* dn /O=Grid/O=NorduGrid/.*
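A minimal sketch of the whole block, assuming a hypothetical VO name myvo that is allowed to read any cached file:

[arex/ws/cache]
cacheaccess=gsiftp://.* voms:vo myvo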

[arex/ws/candypond] block

The CandyPond (Cache and deliver your pilot on-demand data) A-REX Web Service (previously called Cache Service) exposes various useful data-staging related operations for the pilot job model where input data for jobs is not known until the job is running on the worker node. This service is intended to be used by A-REX managed jobs. This service requires the [arex/data-staging] functionality. The CandyPond service is available via the wsurl/candypond URL (e.g. https://hostname:443/arex/candypond)

Warning: CHANGE: NEW block in 6.0.0, and RENAMED service.

[arex/ws/argus] block

The Web Service components of A-REX may directly use the Argus service (https://twiki.cern.ch/twiki/bin/view/EGEE/AuthorizationFramework) for requesting authorization decisions and performing client mapping to a local user account. This block turns on and configures the A-REX WS - Argus integration. When this block is enabled, A-REX will communicate with the Argus PEP or PDP service for every WS interface operation! Comment out this block if you don’t intend to use any external Argus service with A-REX WS interfaces. Using Argus with the gridftp interface is possible only via LCMAPS callout.

Warning: CHANGE: NEW block in 6.0.0.

arguspep_endpoint

Synopsis: *arguspep_endpoint = url


Description: Specifies the URL of the Argus PEPD service to use for authorization and user mapping. It is worth mentioning that if the requireClientCertAuthentication item (default is false) of pepd.ini (the configuration of the Argus PEPD service) is set to ‘true’, then https should be used, otherwise http is sufficient.

Note: Argus will be contacted for every WS interface operation requested!

Default: undefined Example:

arguspep_endpoint=https://somehost.somedomain:8154/authz

arguspep_profile

Synopsis: arguspep_profile = profile_name Description: Defines which communication profile to use while communicating with the Argus PEPD service. Possible values for profile_name are:
direct - pass all authorization attributes (only for debugging)
subject - pass only the subject name of the client
emi - ARC native profile developed in the EMI project (this is the default option)
Allowed values: direct, subject, emi Default: emi Example:

arguspep_profile=emi

Warning: CHANGE: MODIFIED parameter values in 6.0.0: removed CREAM.

arguspep_usermap

Synopsis: arguspep_usermap = yes/no Description: Specifies whether the response from the Argus service may define the mapping of the client to a local account. Default is ‘no’. Note that Argus is contacted after all the other user mapping is performed. Hence it can overwrite all other decisions. Allowed values: yes, no Default: no Example:

arguspep_usermap=no

arguspdp_endpoint

Synopsis: arguspdp_endpoint = url Description: Specifies the URL of the Argus PDP service to use for authorization and user mapping. It is worth mentioning that if the requireClientCertAuthentication item (default is false) of pdp.ini (the configuration of the Argus PDP service) is set to ‘true’, then https should be used, otherwise http is sufficient.


Note: Argus will be contacted for every WS interface operation requested!

Default: undefined Example:

arguspdp_endpoint=https://somehost.somedomain:8152/authz

arguspdp_profile

Synopsis: arguspdp_profile = profile_name Description: Defines which communication profile to use while communicating with the Argus PDP service. Possible values for profile_name are:
subject - pass only the subject name of the client
emi - ARC native profile developed in the EMI project (this is the default option)
Allowed values: subject, emi Default: emi Example:

arguspdp_profile=emi

Warning: CHANGE: MODIFIED parameter values in 6.0.0: remove CREAM.

arguspdp_accpetnotapplicable

Synopsis: arguspdp_accpetnotapplicable = yes/no Description: Specify if the NotApplicable decision returned by Argus PDP service is treated as reason to deny request. Default is ‘no’, which treats NotApplicable as reason to deny request. Allowed values: yes, no Default: no Example:

arguspdp_acceptnotapplicable=no
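A minimal sketch of the block, using a hypothetical Argus host and the documented defaults:

[arex/ws/argus]
arguspep_endpoint=https://argus.example.org:8154/authz
arguspep_profile=emi
arguspep_usermap=no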

[arex/jura] block

A-REX is responsible for collecting accounting measurements from various ARC subsystems, including batch system backends and DTR data staging. Starting from the 6.4 release A-REX writes all accounting data into the local accounting database that can be queried with arcctl accounting. JURA is the accounting record generating and reporting ARC CE module. A-REX periodically executes JURA to create usage records based on the accounting target configuration and using the accounting data.

Note: in ARC releases before 6.4 the accounting measurements were collected in job log files that were only used for publishing records. This functionality was kept for backward compatibility in the 6.4-6.7 releases and was removed in ARC 6.8.0.


Enable and configure this block if you want to send accounting records to accounting services. Note that a dedicated accounting target subblock is needed for every accounting destination. The target subblocks are either of type apel or sgas: [arex/jura/apel:targetname] or [arex/jura/sgas:targetname].
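As an illustration, a setup reporting to both an APEL and an SGAS destination could be sketched as below (the target names egi and nordic and the gocdb_name value are hypothetical; the URLs reuse the documented examples):

[arex/jura]
urdelivery_frequency=3600

[arex/jura/apel:egi]
targeturl=https://msg.argo.grnet.gr
gocdb_name=MY_SITE

[arex/jura/sgas:nordic]
targeturl=https://grid.uio.no:8001/logger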

logfile

Synopsis: logfile = path Description: (previously jobreport_logfile) The name of the jura logfile. Default: /var/log/arc/jura.log Example:

logfile=/var/log/arc/jura.log

loglevel

Synopsis: loglevel = number Description: Log level for the JURA accounting module. Allowed values: 0, 1, 2, 3, 4, 5, FATAL, ERROR, WARNING, INFO, VERBOSE, DEBUG Default: 3 Example:

loglevel=3

vomsless_vo

Synopsis: vomsless_vo = voname[#voissuer] Description: This parameter allows the sysadmin to manually assign VOs during publishing to jobs that were submitted with VOMS-less grid proxies. voname is the VO name to be used in the generated records (the same as expected in the voms-proxy). The optional voissuer value (relevant to SGAS only) is the VOMS server identity (certificate DN). Default: undefined Example:

vomsless_vo=atlas
vomsless_vo=atlas#/DC=ch/DC=cern/OU=computers/CN=lcg-voms.cern.ch

vo_group

Synopsis: vo_group = group Description: Adds additional VO group attribute(s) to the usage records. This option is multivalued. Default: undefined Example:


vo_group=/atlas/production

urdelivery_frequency

Synopsis: urdelivery_frequency = seconds Description: (previously jobreport_period) Specifies how frequently A-REX executes the JURA process. The actual threshold of the record reporting frequency can be defined on a per-target basis. Default: 3600 Example:

urdelivery_frequency=3600

Warning: CHANGE: RENAMED in 6.0.0.

x509_host_key

Synopsis: x509_host_key = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_key} Example:

x509_host_key=/etc/grid-security/hostkey.pem

Warning: CHANGE: NEW in 6.0.0.

x509_host_cert

Synopsis: x509_host_cert = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_cert} Example:

x509_host_cert=/etc/grid-security/hostcert.pem

Warning: CHANGE: NEW in 6.0.0.

x509_cert_dir

Synopsis: x509_cert_dir = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_cert_dir} Example:


x509_cert_dir=/etc/grid-security/certificates

Warning: CHANGE: NEW in 6.0.0.

[arex/jura/sgas:targetname] block

An SGAS sub-block of [arex/jura] enables and configures an SGAS accounting server as a target destination to which JURA will send properly formatted usage records. You need to define a separate block with a unique targetname for every SGAS target server. Note that the block name will be used by JURA to track the latest records sent to this target. Be aware that if you rename the block, the target will be handled as a new one. However, a targeturl change will not trigger new target handling.

targeturl

Synopsis: *targeturl = url Description: The service endpoint URL of SGAS server. Default: undefined Example: targeturl=https://grid.uio.no:8001/logger

Warning: CHANGE: NEW in 6.0.0.

localid_prefix

Synopsis: localid_prefix = prefix_string Description: Sets a prefix value for the LocalJobID ur parameter for the SGAS usage records. Default: undefined Example: localid_prefix=some_text_for_SGAS

Warning: CHANGE: NEW in 6.0.0.

vofilter

Synopsis: vofilter = vo Description: Configures a job record filtering mechanism based on the VO attribute of the jobs. Only job records matching one of the VOs set here will be sent to the target accounting service. This option is multivalued. Default: undefined Example:


vofilter=atlas
vofilter=fgi.csc.fi

Warning: CHANGE: NEW in 6.0.0.

urbatchsize

Synopsis: urbatchsize = number Description: JURA sends usage records not one-by-one, but in batches. This option sets the size of a batch. A zero value means unlimited batch size. Default: 50 Example:

urbatchsize=80

urdelivery_frequency

Synopsis: urdelivery_frequency = seconds Description: (introduced in 6.4.0) Adds an optional minimal threshold for the interval between subsequent record publishing to this target.

Note: the actual delivery interval is a multiple of the urdelivery_frequency defined in the [arex/jura] block, which defines the overall JURA process invocation frequency.

Default: undefined Example:

urdelivery_frequency=3600

Warning: CHANGE: NEW in 6.4.0.

[arex/jura/apel:targetname] block

An APEL sub-block of [arex/jura] enables and configures an APEL accounting server as a target destination to which JURA will send properly formatted usage records. You need to define a separate block with a unique targetname for every APEL target server. Note that the block name will be used by JURA to track the latest records sent to this target. Be aware that if you rename the block, the target will be handled as a new one. However, a targeturl change will not trigger new target handling.

targeturl

Synopsis: *targeturl = url Description: The service endpoint URL of the APEL accounting server. Default: undefined


Example:

targeturl=https://msg.argo.grnet.gr

topic

Synopsis: topic = topic_name Description: Sets the name of the APEL topic to which JURA will publish the accounting records. The AMS destination topic for a compute element is ‘gLite-APEL’. Default: gLite-APEL Example:

topic=/queue/global.accounting.test.cpu.central

gocdb_name

Synopsis: *gocdb_name = name Description: Can be used to specify the GOCDB name of the resource. This value would be seen as Site attribute in the generated APEL records. Default: undefined Example:

gocdb_name=GRID_UIO_NO

apel_messages

Synopsis: apel_messages = type Description: (introduced in 6.4.0) Defines what kind of records JURA will send to APEL services during the regular publishing process. Possible cases are: per-job EMI CAR records (urs) or APEL summary records (summaries). APEL Sync messages are always generated.

Note: on heavily loaded sites with 10k+ jobs weekly, summaries generation has a performance penalty in ARC < 6.8. It is advised to use urs in this case.

Allowed values: urs, summaries Default: summaries Example:

apel_messages=urs

Warning: CHANGE: NEW in 6.4.0.

vofilter

Synopsis: vofilter = vo


Description: Configures a job record filtering mechanism based on the VO attribute of the jobs. Only job records matching one of the VOs set here will be sent to the target accounting service. This option is multivalued. Default: undefined Example:

vofilter=atlas
vofilter=fgi.csc.fi

urbatchsize

Synopsis: urbatchsize = number Description: JURA sends usage records not one-by-one, but in batches. This option sets the size of a batch. A zero value means unlimited batch size. 500 is recommended to avoid too large messages when using AMS. Default: 500 Example:

urbatchsize=500

urdelivery_frequency

Synopsis: urdelivery_frequency = seconds Description: (introduced in 6.4.0) Adds an optional minimal threshold for the interval between subsequent record publishing to this target.

Note: the actual delivery interval is a multiple of the urdelivery_frequency defined in the [arex/jura] block, which defines the overall JURA process invocation frequency.

The APEL recommended value is once per day for summaries. Use smaller values for urs. Default: 86000 Example:

urdelivery_frequency=14000

Warning: CHANGE: NEW in 6.4.0.

[arex/ganglia] block

This block enables the monitoring of ARC-specific metrics. Earlier versions (ARC < 6.0) relied on the standalone tool gangliarc; ganglia support is now integrated into ARC and gangliarc is obsolete. Note that A-REX ganglia (like gangliarc) depends on an existing ganglia installation, as it sends its metrics to a running gmond process.

Warning: CHANGE: RENAMED block in 6.0.0.


gmetric_bin_path

Synopsis: gmetric_bin_path = path Description: (previously gmetric_exec) The path to gmetric executable. Default: /usr/bin/gmetric Example:

gmetric_bin_path=/usr/local/bin/gmetric

Warning: CHANGE: MOVED and RENAMED in 6.0.0 from deleted [gangliarc] block.

metrics

Synopsis: metrics = name_of_the_metrics Description: the metrics to be monitored. metrics takes a comma-separated list of one or more of the following metrics:
staging - number of tasks in different data staging states (not yet implemented)
cache - free cache space
session - free session directory space
heartbeat - last modification time of the A-REX heartbeat
processingjobs - the number of jobs currently being processed by ARC (jobs between PREPARING and FINISHING states) (not yet implemented)
failedjobs - the number of failed jobs per last 100 finished
jobstates - number of jobs in different A-REX stages
all - all of the above metrics
Default: all Allowed values: staging, cache, session, heartbeat, processingjobs, failedjobs, jobstates, all Example:

metrics=all

frequency

Synopsis: frequency = seconds Description: The period between each information gathering cycle, in seconds. Default: 60 Example:

frequency=300

Warning: CHANGE: MODIFIED in 6.0.0. Default increased from 20s to one minute.
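A minimal sketch of the block with an illustrative subset of metrics and a custom gathering period:

[arex/ganglia]
metrics=cache,session,heartbeat,jobstates
frequency=120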

[gridftpd] block

This block enables and configures the gridftp server. The usage of the gridftp server is twofold in connection with ARC: 1) The server together with its custom jobplugin can be used as a job submission and management interface for an ARC CE. 2) The server with the filedirplugin can be used as a very simplistic storage element. This block configures the common server capabilities. To make the gridftp service functional, you need to enable at least one of the plugin subblocks as well.

user

Synopsis: user = user[:group] Description: Switch to a non-root user/group after startup. WARNING: Make sure that the certificate files are owned by the user/group specified by this option. Default: root:root Example: user=grid

loglevel

Synopsis: loglevel = level Description: (previously debug) Set log level of the gridftpd daemon, between 0 (FATAL) and 5 (DEBUG). Default is 3 (INFO). Allowed values: 0, 1, 2, 3, 4, 5, FATAL, ERROR, WARNING, INFO, VERBOSE, DEBUG Default: 3 Example: loglevel=2

Warning: CHANGE: RENAMED in 6.0.0.

logfile

Synopsis: logfile = path Description: Set logfile location of the gridftp server. Default: /var/log/arc/gridftpd.log Example: logfile=/var/log/arc/gridftpd.log

pidfile

Synopsis: pidfile = path Description: Specify location of file containing PID of daemon process. Default: /run/gridftpd.pid Example: pidfile=/run/gridftpd.pid


port

Synopsis: port = bindport Description: Port to listen on. For gridftp-based job submission it is strongly advised to use the default port 2811, because 3rd party clients assume an ARC CE uses that port. Default: 2811 Example:

port=2811

allowencryption

Synopsis: allowencryption = yes/no Description: (previously encryption) Specifies whether data encryption is allowed on client request. Encryption is very heavy, therefore the default is no. Allowed values: yes, no Default: no Example:

allowencryption=no

Warning: CHANGE: RENAMED in 6.0.0.

allowactivedata

Synopsis: allowactivedata = yes/no Description: if no, only passive data transfer is allowed. By default both passive and active data transfers are allowed. Default: yes Example:

allowactivedata=yes

maxconnections

Synopsis: maxconnections = number Description: The maximum number of connections accepted by a gridftpd server. Default: 100 Example:

maxconnections=200

defaultbuffer

Synopsis: defaultbuffer = size Description: Defines size of every buffer for data reading/writing. The actual value may decrease if the cumulative size of all buffers exceeds value specified by maxbuffer. Default: 65536 Example: defaultbuffer=65536

maxbuffer

Synopsis: maxbuffer = size Description: Defines the maximal amount of memory in bytes to be allocated for all data reading/writing buffers. Default is 640kB. The number of buffers is max{3, min{41, 2P + 1}}, where P is the parallelism level requested by the client. Hence, even without parallel streams enabled the number of buffers will be 3. Default: 655360 Example: maxbuffer=655360

globus_tcp_port_range

Synopsis: globus_tcp_port_range = port_range Description: In a firewalled environment the software which uses GSI needs to know what ports are available. If not set a random port is selected. These variables are similar to the Globus environment variables GLOBUS_TCP_PORT_RANGE and GLOBUS_UDP_PORT_RANGE. Default: 9000,9300 Example: globus_tcp_port_range=9000,12000

globus_udp_port_range

Synopsis: globus_udp_port_range = port_range Description: In a firewalled environment the software which uses GSI needs to know what ports are available. If not set a random port is selected. These variables are similar to the Globus environment variables GLOBUS_TCP_PORT_RANGE and GLOBUS_UDP_PORT_RANGE. Default: 9000,9300 Example: globus_udp_port_range=9000,12000


firewall

Synopsis: firewall = hostname Description: The hostname or IP address to use in response to PASV command instead of the IP address of a network interface of computer. Default: undefined Example:

firewall=my.host.org

x509_host_key

Synopsis: x509_host_key = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_key} Example:

x509_host_key=/etc/grid-security/hostkey.pem

Warning: CHANGE: RENAMED in 6.0.0.

x509_host_cert

Synopsis: x509_host_cert = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_cert} Example:

x509_host_cert=/etc/grid-security/hostcert.pem

Warning: CHANGE: RENAMED in 6.0.0.

x509_cert_dir

Synopsis: x509_cert_dir = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_cert_dir} Example:

x509_cert_dir=/etc/grid-security/certificates


[gridftpd/jobs] block

The jobplugin of the gridftp server implements a custom job management and submission interface of ARC CE. This subblock enables and configures that interface. Consult the Technical Reference within the ARC sysadmin guide for the interface specification. Comment out this subblock if you don’t want a gridftp-based jobinterface.

allownew

Synopsis: allownew = yes/no Description: This parameter sets whether the ARC CE accepts submission of new jobs via the gridftp interface. This parameter can be used to close down the ARC CE. Allowed values: yes, no Default: yes Example:

allownew=yes

allownew_override

Synopsis: allownew_override = [authgroup ...] Description: (previously allowsubmit) Defines which authorization groups are allowed to submit new jobs via the gridftp interface when the CE is closed with allownew=no. Note that it requires allownew=no to be set. This option is multivalued. Default: undefined Example:

allownew_override=biousers atlasusers
allownew_override=yourauthgroup

Warning: CHANGE: RENAMED and MODIFIED in 6.0.0. Possible to specify several groups on a single line.

allowaccess

Synopsis: allowaccess = authgroup Description: (previously groupcfg) Defines that the specified authgroup members are authorized to access the ARC-CE via this interface. A related config option, denyaccess (see below), can be used to reject access. Multiple allowaccess and denyaccess authorization statements are allowed within a configuration block. These statements are processed sequentially in the order they are specified in the config block. The processing stops on the first allowaccess or denyaccess statement matching the authgroup membership. If there are no authorization statements specified, then no additional restrictions are applied for authorizing user access and the interface is open to everybody authenticated. Default: undefined This option is multivalued. Example:

allowaccess=biousers
allowaccess=atlasusers


denyaccess

Synopsis: denyaccess = authgroup Description: Defines that the specified authgroup members are REJECTED, not authorized to access the ARC-CE via this interface. Note that a related config option, allowaccess (see above), can be used to grant access. Multiple denyaccess and allowaccess authorization statements are allowed within a configuration block. These statements are processed sequentially in the order they are specified in the config block. The processing stops on the first allowaccess or denyaccess statement matching the authgroup membership. If there are no authorization statements specified, then no additional restrictions are applied for authorizing user access and the interface is open to everybody authenticated. Default: undefined This option is multivalued. Example:

denyaccess=blacklisted-users
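As a sketch of the sequential evaluation, assuming banned and atlasusers authgroups defined in [authgroup:...] blocks elsewhere in the configuration:

[gridftpd/jobs]
denyaccess=banned
allowaccess=atlasusers
# a client matching 'banned' is rejected even if it is also a member of
# 'atlasusers', because the statements are evaluated in the order given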

maxjobdesc

Synopsis: maxjobdesc = size Description: specifies maximal allowed size of job description in bytes. Default value is 5MB. Use 0 to set unlimited size. Default: 5242880 Example:

maxjobdesc=0

[gridftpd/filedir] block

The filedirplugin module of the gridftp server can be used to set up a simplistic grid storage element (SE). This subblock enables and configures such an SE by exporting a directory using the gridftpd’s filedirplugin. Comment out this block if you don’t need an SE.

path

Synopsis: *path = virtdir Description: The name of the virtual directory served by the gridftp server. The exported storage area is accessible as gsiftp://my_server/virtdir. Even / is a valid choice. Default: undefined Example:

path=/topdir

mount

Synopsis: *mount = path Description: The physical directory corresponding to the virtual one: gsiftp://my_server/virtdir will give access to this location.


Default: undefined Example: mount=/scratch/grid

allowaccess

Synopsis: allowaccess = authgroup Description: (previously groupcfg) Defines that the specified authgroup members are authorized to access the gridftp file service. A related config option, denyaccess (see below), can be used to reject access. Multiple allowaccess and denyaccess authorization statements are allowed within a configuration block. These statements are processed sequentially in the order they are specified in the config block. The processing stops on the first allowaccess or denyaccess statement matching the authgroup membership. If there are no authorization statements specified, then no additional restrictions are applied for authorizing user access and the interface is open to everybody authenticated. Default: undefined This option is multivalued. Example:

allowaccess=biousers
allowaccess=atlasusers

denyaccess

Synopsis: denyaccess = authgroup Description: Defines that the specified authgroup members are REJECTED, not authorized to access the gridftp file service. Note that a related config option, allowaccess (see above), can be used to grant access. Multiple denyaccess and allowaccess authorization statements are allowed within a configuration block. These statements are processed sequentially in the order they are specified in the config block. The processing stops on the first allowaccess or denyaccess statement matching the authgroup membership. If there are no authorization statements specified, then no additional restrictions are applied for authorizing user access and the interface is open to everybody authenticated. Default: undefined This option is multivalued. Example: denyaccess=blacklisted-users

dir

Synopsis: dir = path options Description: Specifies access rules for accessing files in path (relative to the virtual and real path) and all the files and directories below. Available permission check options are:
nouser - do not use local file system rights, only use those specified in this line
owner - check only file owner access rights
group - check only group access rights
other - check only others' access rights


If none of the above is specified, the usual unix access rights are applied. Available permission enforcement options are:
read - allow reading files
delete - allow deleting files
append - allow appending to files (does not allow creation)
overwrite - allow overwriting already existing files (does not allow creation, file attributes are not changed)
dirlist - allow obtaining a list of the files
cd - allow making this directory the current one
create owner:group permissions_or:permissions_and - allow creating new files. The file will be owned by owner and the owning group will be group. If * is used, the user/group to which the connected user is mapped will be used. The permissions will be set to permissions_or & permissions_and (the second number is reserved for future usage).
mkdir owner:group permissions_or:permissions_and - allow creating new directories.
The example below shows setting permissions on the mounted / directory and adjusting permissions on the /section1 and /section2 subdirectories. This option is multivalued. Default: undefined Example:

dir=/ nouser read cd dirlist delete create *:* 664:664 mkdir *:* 775:775
dir=/section1 nouser read mkdir *:* 700:700 cd dirlist
dir=/section2 nouser read mkdir *:* 700:700 cd dirlist
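Putting the pieces together, a simplistic SE exporting a scratch area could be sketched as follows (the paths and access rules are hypothetical):

[gridftpd/filedir]
path=/topdir
mount=/scratch/grid
dir=/ nouser read cd dirlist
dir=/incoming nouser read cd dirlist create *:* 664:664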

[infosys] block

This block enables and configures the core part of the information system. It enables the information collection to be used by other ARC components, including interfaces. Parameters in this block apply to all the infosys subsystems.

logfile

Synopsis: logfile = path Description: (previously providerlog) Specifies log file location for the information provider scripts. Default: /var/log/arc/infoprovider.log Example:

logfile=/var/log/arc/infoprovider.log

Warning: CHANGE: RENAMED in 6.0.0.

loglevel

Synopsis: loglevel = number


Description: (previously provider_loglevel) The loglevel for the infoprovider scripts (0-5). Each value corresponds to the following verbosity levels: FATAL => 0, ERROR => 1, WARNING => 2, INFO => 3, VERBOSE => 4, DEBUG => 5 Allowed values: 0, 1, 2, 3, 4, 5, FATAL, ERROR, WARNING, INFO, VERBOSE, DEBUG Default: 3 Example: loglevel=3

Warning: CHANGE: RENAMED in 6.0.0.

validity_ttl

Synopsis: validity_ttl = seconds Description: The published infosys records advertise their validity, i.e. how long the info should be considered up-to-date by the clients. Use this parameter to set the published validity value.

Note: different schemas may render this information differently.

Default: 10800 Example: validity_ttl=10800

[infosys/ldap] block

This infosys subblock enables and configures the ldap hosting service for the infosys functionality. Using an LDAP server with some schema is one way to publish information about your Computing Element. Comment out this block if you don’t want to run an LDAP-based information system.

Warning: CHANGE: NEW block in 6.0.0.

hostname

Synopsis: hostname = FQDN Description: The hostname of the machine running the slapd service; it will be used as the bind for slapd. If not present, it will be taken from the [common] block. Default: $VAR{[common]hostname} Example: hostname=my.testbox


slapd_hostnamebind

Synopsis: slapd_hostnamebind = string Description: May be used to set the hostname part of the network interface to which the slapd process will bind. In most cases there is no need to set it, since the hostname parameter is already sufficient. The example below will bind the slapd process to all the network interfaces available on the server. Default: undefined Example:

slapd_hostnamebind=*

port

Synopsis: port = port_number Description: The port on which the slapd service runs. The default infosys port is assumed to be 2135 by many 3rd party clients, therefore think twice before you change it. Default: 2135 Example:

port=2135

user

Synopsis: user = unix_user Description: Overwrites the unix user running slapd. By default the startup scripts search for well-known ldap users like ldap or openldap, then fall back to root if none is found. Default: undefined Example:

user=slapd

Warning: CHANGE: MOVED in 6.0.0 from [infosys].

slapd

Synopsis: slapd = path Description: explicitly define the path to slapd command. By default the startup scripts search for slapd binary in the system PATH. Default: undefined Example:

slapd=/usr/sbin/slapd

slapd_loglevel

Synopsis: slapd_loglevel = number Description: Sets the native slapd loglevel (see man slapd). Slapd logs via syslog. The default is set to no-logging (0) and it is RECOMMENDED not to change it in a production environment. A non-zero slapd_loglevel value causes a serious performance decrease. Default: 0 Example: slapd_loglevel=0

threads

Synopsis: threads = number Description: The native slapd threads parameter, default is 32. Default: 32 Example: threads=128

timelimit

Synopsis: timelimit = seconds Description: The native slapd timelimit parameter. Maximum number of seconds the slapd server will spend answering a search request. Default is 3600. You probably want a much lower value. Default: 3600 Example: timelimit=1800

idletimeout

Synopsis: idletimeout = seconds Description: The native slapd idletimeout parameter. Maximum number of seconds the slapd server will wait before forcibly closing idle client connections. Its value must be larger than the value of the timelimit option. If not set, it defaults to timelimit + 1. Default: $EVAL{$VAR{timelimit} + 1} Example: idletimeout=1801

infosys_ldap_run_dir

Synopsis: infosys_ldap_run_dir = path Description: The location where NorduGrid/GLUE2 LDAP ldif file will be generated, and where the fifo to sync between infoproviders and BDII will be generated.


Default: /run/arc/infosys Example: infosys_ldap_run_dir=/run/arc/infosys

ldap_schema_dir

Synopsis: ldap_schema_dir = path Description: Allows explicitly specifying an additional path to the schema files. Note that this doesn't override the standard locations, but adds the specified path to the standard locations /etc/ldap and /etc/openldap. Normally it is sufficient to use only the standard schema file locations, and therefore not to set this parameter. Default: undefined Example: ldap_schema_dir=/nfs/ldap/schema/

The following options configure the third-party bdii ldap parameters. In 99% of cases there is no need to change anything; use the defaults. These variables are usually automatically set by ARC, and are here mostly for debug purposes and to tweak exotic BDII installations.

bdii_debug_level

Synopsis: bdii_debug_level = level Description: Set this parameter to DEBUG to check bdii errors in bdii-update.log. At the same time, do not enable slapd logs, thus reducing performance issues. Default: WARNING Example: bdii_debug_level=ERROR

bdii_provider_timeout

Synopsis: bdii_provider_timeout = seconds Description: (previously provider_timeout in bdii block) This variable allows a system administrator to modify the behaviour of bdii-update. This is the time BDII waits for the bdii provider scripts generated by A-REX infosys to produce their output. Default: 10800 Example: bdii_provider_timeout=10800

Warning: CHANGE: RENAMED parameter in 6.0.0.

BDII5 uses these variables. These might change depending on the BDII version. ARC sets them by inspecting distributed bdii configuration files.

Warning: DO NOT CHANGE UNLESS YOU KNOW WHAT YOU’RE DOING

bdii_location

Synopsis: bdii_location = path Description: The installation directory for the BDII. Default: /usr Example: bdii_location=/usr

bdii_run_dir

Synopsis: bdii_run_dir = path Description: Contains BDII pid files and slapd pid files. Default: /run/arc/bdii Example: bdii_run_dir=/run/arc/bdii

bdii_log_dir

Synopsis: bdii_log_dir = path Description: Contains infosys logs. Default: /var/log/arc/bdii Example: bdii_log_dir=/var/log/arc/bdii

bdii_tmp_dir

Synopsis: bdii_tmp_dir = path Description: Contains provider scripts. Default: /var/tmp/arc/bdii Example: bdii_tmp_dir=/var/tmp/arc/bdii

bdii_var_dir

Synopsis: bdii_var_dir = path Description: Contains slapd databases Default: /var/lib/arc/bdii Example:


bdii_var_dir=/var/lib/arc/bdii

bdii_update_pid_file

Synopsis: bdii_update_pid_file = path Description: Allows changing the filename and location of the bdii-update pidfile. Default: $VAR{bdii_run_dir}/bdii-update.pid Example: bdii_update_pid_file=/run/arc/bdii/bdii-update.pid

bdii_database

Synopsis: bdii_database = backend_type Description: Configures which ldap database backend should be used. Default: hdb Example: bdii_database=hdb

bdii_conf

Synopsis: bdii_conf = path Description: Location of the bdii config file generated by ARC. Default: $VAR{[infosys/ldap]infosys_ldap_run_dir}/bdii.conf Example: bdii_conf=/run/arc/infosys/bdii.conf

bdii_update_cmd

Synopsis: bdii_update_cmd = path Description: path to bdii-update script Default: $VAR{bdii_location}/sbin/bdii-update Example: bdii_update_cmd=/usr/sbin/bdii-update

bdii_db_config

Synopsis: bdii_db_config = path Description: path to slapd database configuration file Default: /etc/bdii/DB_CONFIG Example:


bdii_db_config=/etc/bdii/DB_CONFIG

bdii_archive_size

Synopsis: bdii_archive_size = number Description: Sets BDII_ARCHIVE_SIZE in bdii configuration file Default: 0 Example: bdii_archive_size=0

bdii_breathe_time

Synopsis: bdii_breathe_time = number Description: Sets BDII_BREATHE_TIME in bdii configuration file Default: 10 Example: bdii_breathe_time=10

bdii_delete_delay

Synopsis: bdii_delete_delay = number Description: Sets BDII_DELETE_DELAY in bdii configuration file Default: 0 Example: bdii_delete_delay=0

bdii_read_timeout

Synopsis: bdii_read_timeout = number Description: Sets BDII_READ_TIMEOUT in bdii configuration file Default: $EVAL{$VAR{bdii_provider_timeout} + $VAR{[arex]infoproviders_timelimit} + $VAR{[arex]wakeupperiod}} Example: bdii_read_timeout=300

Infosys Schema sub-blocks: The following infosys sub-blocks enable information publishing according to various information schema. In order to publish information in a certain schema, the corresponding sub-block must be defined in addition to the schema-neutral [infosys/cluster] and [queue:name] blocks! Comment out a specific schema block if you don’t want to publish a specific information schema representation. Currently available information model (schema) sub-blocks:


[infosys/nordugrid] - The native ARC info representation of a cluster and its queues
[infosys/glue2] - The GLUE2 information model, both LDAP and XML (the latter is for the WS interface)
[infosys/glue2/ldap] - The LDAP rendering of the GLUE2 model
[infosys/glue1] - The legacy GLUE1 model (only LDAP)
[infosys/glue1/site-bdii] - The site BDII element of the GLUE1 legacy model
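As an illustrative sketch only (not a complete configuration, and assuming the top-level [infosys] and [infosys/ldap] blocks described earlier in this reference are also enabled), a CE publishing both the NorduGrid and GLUE2 LDAP renderings would enable blocks along these lines; the queue name gridlong is a made-up example:

# enable the information system and its LDAP/BDII service
[infosys]
[infosys/ldap]
# schema renderings to publish
[infosys/nordugrid]
[infosys/glue2]
[infosys/glue2/ldap]
# schema-neutral description of the cluster and its queues
[infosys/cluster]
[queue:gridlong]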

[infosys/nordugrid] block

Enables the publication of the NorduGrid information model in the LDAP-based infosys. See the NORDUGRID-TECH-4 for schema definition. The configuration block does not contain any parameter. The information tree is populated based on the contents of the schema-neutral [infosys/cluster] and [queue:name] blocks.

Warning: CHANGE: NEW block in 6.0.0.

[infosys/glue2] block

Enables the publication of the GLUE2 information model both in the LDAP and XML rendering. The information tree is populated based on the contents of the schema-neutral [infosys/cluster] and [queue:name] blocks and the GLUE2 specific schema sub-blocks.

Warning: CHANGE: NEW block in 6.0.0.

AdminDomain entity parameters: admindomain_name = string - The Name attribute for the admindomain. This will show in the top-BDII to group the resources belonging to this cluster. To group a bunch of clusters under the same AdminDomain, just use the same name. If not specified, it will default to UNDEFINEDVALUE. Default: UNDEFINEDVALUE Example: admindomain_name=ARC-TESTDOMAIN

admindomain_description

Synopsis: admindomain_description = text Description: The free-form description of this domain. Default: undefined Example: admindomain_description=ARC test Domain

admindomain_www

Synopsis: admindomain_www = url Description: The URL pointing at a site holding information about the AdminDomain. Default: undefined Example: admindomain_www=http://www.nordugrid.org/


admindomain_distributed

Synopsis: admindomain_distributed = yes/no Description: Set this to yes if the domain is distributed, that is, if the resources belonging to the domain are considered geographically distributed. Allowed values: yes, no Default: no Example:

admindomain_distributed=yes

admindomain_owner

Synopsis: admindomain_owner = email Description: The contact email of a responsible person for the domain Default: undefined Example:

[email protected]

admindomain_otherinfo

Synopsis: admindomain_otherinfo = text Description: Free-form text that fills the GLUE2 OtherInfo field. There is no need to set it; it is used only for future development. Default: undefined Example:

admindomain_otherinfo=Test Other info

ComputingService entity parameters: computingservice_qualitylevel = qlevel - (previously infosys_glue2_service_qualitylevel) Allows a sysadmin to define different GLUE2 QualityLevel values for A-REX. Refer to the GLUE2 documentation for the qualitylevel definitions. Allowed values: production, pre-production, testing, development Default: production Example:

computingservice_qualitylevel=production

Warning: CHANGE: RENAMED in 6.0.0.
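Putting the GLUE2-specific options above together, a sketch of an [infosys/glue2] block could look like the following; all values are taken from the examples above and are purely illustrative:

[infosys/glue2]
admindomain_name=ARC-TESTDOMAIN
admindomain_description=ARC test Domain
admindomain_www=http://www.nordugrid.org/
[email protected]
computingservice_qualitylevel=production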

[infosys/glue2/ldap] block

Enables the publication of the LDAP rendering of the GLUE2 information model. The information tree is populated based on the contents of the schema-neutral [infosys/cluster] and [queue:name] blocks and the GLUE2 specific schema sub-blocks.


Warning: CHANGE: NEW block in 6.0.0.

showactivities

Synopsis: showactivities = yes/no Description: (previously infosys_glue2_ldap_showactivities) Enables GLUE2 ComputingActivities to appear in the LDAP rendering Allowed values: yes, no Default: no Example:

showactivities=no

Warning: CHANGE: RENAMED in 6.0.0.

[infosys/glue1] block

This block enables the publication of the GLUE1 LDAP representation of a CE. The information tree is populated based on the contents of the schema-neutral [infosys/cluster] and [queue:name] blocks and the GLUE1 specific schema sub-blocks. This block holds information that is needed by the GLUE1 rendering in addition to the schema-neutral blocks.

Warning: CHANGE: RENAMED block in 6.0.0.

resource_location

Synopsis: resource_location = string Description: GLUE1 location attribute. IMPORTANT: no slashes or backslashes here! Default: undefined Example:

resource_location=Kastrup, Denmark

resource_latitude

Synopsis: resource_latitude = latitude Description: GLUE1 latitude. Default: undefined Example:

resource_latitude=55.75000

resource_longitude

Synopsis: resource_longitude = longitude Description: GLUE1 longitude. Default: undefined Example: resource_longitude=12.41670

cpu_scaling_reference_si00

Synopsis: cpu_scaling_reference_si00 = number Description: GLUE1 CPU_scaling Default: undefined Example: cpu_scaling_reference_si00=2400

processor_other_description

Synopsis: processor_other_description = string Description: GLUE1 processor description. The text entered under this option is copied as is into the GLUE1 rendering of GlueHostProcessorOtherDescription. Cores is used to calculate the number of physical processors for GlueSubClusterPhysicalCPUs. The Benchmark value entered here is ignored by the rest of the ARC configuration and will NOT be used for accounting. Note that in ARC > 6.3 the proper way of configuring benchmarks for both infosys and accounting is by adding them to the [queue:name] blocks. This generic string is kept here for backward compatibility and most likely will not work as expected (or as used by CREAM). Default: undefined Example: processor_other_description=Cores=3,Benchmark=9.8-HEP-SPEC06

glue_site_web

Synopsis: glue_site_web = url Description: GLUE1 site web url Default: undefined Example: glue_site_web=http://www.ndgf.org


glue_site_unique_id

Synopsis: glue_site_unique_id = siteid Description: GLUE1 site id Default: undefined Example:

glue_site_unique_id=NDGF-T1

[infosys/glue1/site-bdii] block

Enable this block ONLY if you want to publish a semi-fake GLUE1 site-bdii as part of the LDAP server. This block is used to configure ARC to generate a semi-fake site-bdii that can be registered in GOCDB. The default will generate an object that you can reference in the GOCDB GIIS field as: ldap://archost.domain:2135/mds-vo-name=<unique_id>,o=grid Example: ldap://archost.domain:2135/mds-vo-name=NDGF-T1,o=grid Since the original site-bdii listens on port 2170, you may want to port forward 2170 to 2135 in your firewall, but it is not a requirement. This fake site-bdii is only good if you have a single ARC as a service in your organization. It cannot aggregate any information like the real site-bdii does. It just presents a partial rendering of a site-bdii that is good enough for a top-bdii to pick up the information. GOCDB checks against this fake site-bdii should be disabled as it is not fully compliant with the site-bdii specification; they will fail if enabled.

Warning: CHANGE: RENAMED block in 6.0.0.
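A minimal sketch of this block, combining the unique_id and url options described below with the example values from this reference, might look like:

[infosys/glue1/site-bdii]
unique_id=NDGF-T1
url=ldap://host.domain:2135/o=grid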

unique_id

Synopsis: *unique_id = id Description: The unique id used to identify this site, eg NDGF-T1 Default: undefined Example:

unique_id=NDGF-T1

url

Synopsis: *url = url Description: This URL is the insertion point in the ARC GLUE1 LDIF tree for the semi-fake site-bdii LDAP object. The content of the LDAP object is mainly generated from the data in the [infosys/glue1] block. The URL format is ldap://host.domain:ldapport/basedn, for example ldap://host.domain:2135/o=grid. The generated site-bdii information will be attached under the basedn, in the example for o=grid: mds-vo-name=unique-id,o=grid. In most cases you want to leave the default untouched, unless you know what you're doing. Default: ldap://localhost:2135/o=grid Example:


url=ldap://host.domain:2135/o=grid

[infosys/cluster] block

Information schema-neutral blocks [infosys/cluster] and [queue:NAME] contain attributes that describe the computing cluster together with its queues. The parameters are available for every information model/schema representation. This block describes the cluster characteristics of a Computing Element. The information specified here is mostly used by the Infosys ARC component.

alias

Synopsis: alias = text Description: An arbitrary alias name of the cluster, optional. Default: undefined Example: alias=Big Blue Cluster in Nowhere

hostname

Synopsis: hostname = fqdn Description: Set the FQDN of the frontend. Default: $VAR{[common]hostname} Example: hostname=myhost.org

interactive_contactstring

Synopsis: interactive_contactstring = url Description: The contact URL for interactive logins. Set this if the cluster supports some sort of grid-enabled interactive login (gsi-ssh). This option is multivalued. Default: undefined Example: interactive_contactstring=gsissh://frontend.cluster:2200

comment

Synopsis: comment = text Description: Free text field for additional comments on the cluster in a single line, no newline character is allowed! Default: undefined Example:


comment=This cluster is specially designed for XYZ applications: www.xyz.org

cluster_location

Synopsis: cluster_location = formatted_string Description: The geographical location of the cluster, preferably specified as a postal code with a two letter country prefix. Default: undefined Example: cluster_location=DK-2100

cluster_owner

Synopsis: cluster_owner = text Description: Indicates the owner of a resource; multiple entries can be used. This option is multivalued. Default: undefined Example: cluster_owner=World Grid Project cluster_owner=University of NeverLand

advertisedvo

Synopsis: advertisedvo = vo_name Description: (previously authorizedvo) This attribute is used to advertise which VOs are authorized on the cluster. Add only one VO for each advertisedvo entry. Multiple VOs in the same line will cause errors. These entries will be shown in all GLUE2 AccessPolicy and MappingPolicy objects, that is, they will apply for all Endpoints (Interfaces) and all Shares (currently queues). You can override the advertised VOs per queue. The information is also published in the NorduGrid schema.

Note: it is IMPORTANT to understand that this parameter is NOT enforcing any access control, it is just for information publishing!

This option is multivalued. Default: undefined Example: advertisedvo=atlas advertisedvo=community.nordugrid.org

Warning: CHANGE: RENAMED in 6.0.0.


clustersupport

Synopsis: clustersupport = email Description: This is the support email address of the resource. This option is multivalued. Default: undefined Example:

[email protected] [email protected]

homogeneity

Synopsis: homogeneity = True/False Description: Determines whether the cluster consists of identical NODES with respect to cputype, memory, installed software (opsys). The frontend does NOT need to be homogeneous with the nodes. In case of inhomogeneous nodes, try to arrange the nodes into homogeneous groups assigned to a queue and use queue-level attributes. False may trigger multiple GLUE2 ExecutionEnvironments to be published if applicable. Allowed values: True, False Default: True Example:

homogeneity=True

architecture

Synopsis: architecture = string Description: Sets the hardware architecture of the NODES. The architecture is defined as the output of uname -m (e.g. i686). Use this cluster attribute only if the NODES are homogeneous with respect to the architecture; otherwise the queue-level attribute may be used for inhomogeneous nodes. If the frontend's architecture agrees with that of the nodes, adotf (Automatically Determine On The Frontend) can be used to request automatic determination. Default: adotf Example:

architecture=adotf

opsys

Synopsis: opsys = formatted_string Description: This multivalued attribute is meant to describe the operating system of the computing NODES. Set it to the opsys distribution of the NODES and not the frontend! opsys can also be used to describe the kernel or libc version in case those differ from the originally shipped ones. The distribution name should be given as distroname-version.number, where spaces are not allowed. Kernel version should come in the form kernelname-version.number. If the NODES are inhomogeneous with respect to this attribute do NOT set it on cluster level; arrange your nodes into homogeneous groups assigned to a queue and use queue-level attributes. Setting opsys=adotf will result in Automatic Determination of the Operating System On The Frontend, which should only be used if the frontend has the same OS as the nodes. The adotf discovered values will be used to fill


GLUE2 OSName, OSVersion and OSFamily unless these values are explicitly defined for each queue. See the [queue:queuename] block for their usage. Note: Any custom value other than 'adotf' does NOT affect values in the GLUE2 schema. This option is multivalued. Default: adotf Example:

opsys=Linux-2.6.18 opsys=glibc-2.5.58 opsys=CentOS-5.6

nodecpu

Synopsis: nodecpu = formatted_string Description: This is the cputype of the homogeneous nodes. The string is constructed from /proc/cpuinfo as the value of model name, followed by @, followed by the value of cpu MHz. Do NOT set this attribute on cluster level if the NODES are inhomogeneous with respect to cputype; instead arrange the nodes into homogeneous groups assigned to a queue and use queue-level attributes. Setting nodecpu=adotf will result in Automatic Determination On The Frontend, which should only be used if the frontend has the same cputype as the homogeneous nodes. Default: adotf Example:

nodecpu=AMD Duron(tm) Processor@ 700 MHz

nodememory

Synopsis: nodememory = number Description: This is the amount of memory (specified in MB) on the node which can be guaranteed to be available for the application. Please note in most cases it is less than the physical memory installed in the nodes. Do NOT set this attribute on cluster level if the NODES are inhomogeneous with respect to their memories, instead arrange the nodes into homogeneous groups assigned to a queue and use queue-level attributes. Default: undefined Example:

nodememory=64000

middleware

Synopsis: middleware = string Description: This multivalued attribute shows the installed grid software on the cluster. Nordugrid-ARC is automatically set, no need to specify it. This option is multivalued. Default: undefined Example:

middleware=my software


nodeaccess

Synopsis: nodeaccess = inbound/outbound Description: Determines how the nodes can connect to the internet. Not setting anything means the nodes are sitting on a private isolated network. outbound access means the nodes can connect to the outside world, while inbound access means the nodes can be connected from outside. inbound & outbound access together means the nodes are sitting on a fully open network. This option is multivalued. Default: undefined Allowed values: inbound, outbound Example:

nodeaccess=inbound nodeaccess=outbound

localse

Synopsis: localse = url Description: This multivalued parameter tells the BROKER that certain URLs (and locations below that) should be considered locally available to the cluster. This option is multivalued. Default: undefined Example:

localse=gsiftp://my.storage/data1/ localse=gsiftp://my.storage/data2/

cpudistribution

Synopsis: cpudistribution = formatted_string Description: This is the CPU distribution over nodes given in the form ncpu:m, where n is the number of CPUs per machine and m is the number of such machines. Example: 1cpu:3,2cpu:4,4cpu:1 represents a cluster with 3 single-CPU machines, 4 dual-CPU machines and one machine with 4 CPUs. Default: undefined Example:

cpudistribution=1cpu:3,2cpu:4,4cpu:1

maxcputime

Synopsis: maxcputime = number Description: This is the maximum CPU time specified in seconds that the LRMS can allocate for the job. The default, if not defined, is that infoproviders get this value automatically from the LRMS. The purpose of this option is to tweak and override the discovered value, or to publish this value in case the LRMS module does not support automatic detection.


Default: undefined Example: maxcputime=300000

mincputime

Synopsis: mincputime = number Description: This is the minimum CPU time specified in seconds that the LRMS can allocate for the job. The default, if not defined, is that infoproviders get this value automatically from the LRMS. The purpose of this option is to tweak and override the discovered value, or to publish this value in case the LRMS module does not support automatic detection. Default: undefined Example: mincputime=1200

maxwalltime

Synopsis: maxwalltime = number Description: This is the maximum Wall time specified in seconds that the LRMS can allocate for the job. The default, if not defined, is that infoproviders get this value automatically from the LRMS. The purpose of this option is to tweak and override the discovered value, or to publish this value in case the LRMS module does not support automatic detection. Default: undefined Example: maxwalltime=600000

minwalltime

Synopsis: minwalltime = number Description: This is the minimum Wall time specified in seconds that the LRMS can allocate for the job. The default, if not defined, is that infoproviders get this value automatically from the LRMS. The purpose of this option is to tweak and override the discovered value, or to publish this value in case the LRMS module does not support automatic detection. Default: undefined Example: minwalltime=1800
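Pulling together several of the cluster-level options described above, an illustrative sketch of an [infosys/cluster] block (values taken from the examples in this section, not a recommendation) could be:

[infosys/cluster]
alias=Big Blue Cluster in Nowhere
cluster_location=DK-2100
cluster_owner=University of NeverLand
[email protected]
advertisedvo=atlas
homogeneity=True
architecture=adotf
opsys=CentOS-5.6
nodecpu=adotf
nodememory=64000
cpudistribution=1cpu:3,2cpu:4,4cpu:1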

[queue:name] block

Each grid-enabled queue on the cluster should be represented and described by a separate queue block. The queue_name should be used as a label in the block name. In case of fork, or other LRMSes with no queue names, just use any unique string. A queue can represent a PBS/LSF/SGE/SLURM/LL queue, an SGE pool, a Condor pool or a single machine in case of the 'fork' type of LRMS. This block describes the queue characteristics.
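As a minimal sketch only (the queue name gridlong and the values are illustrative assumptions, not recommendations), a queue block might look like:

[queue:gridlong]
comment=Queue for long running jobs
homogeneity=True
nodememory=512
defaultmemory=512
advertisedvo=atlas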


homogeneity

Synopsis: homogeneity = True/False Description: Determines whether the queue consists of identical NODES with respect to cputype, memory, installed software (opsys). In case of inhomogeneous nodes, try to arrange the nodes into homogeneous groups and assign them to a queue. Allowed values: True, False Default: $VAR{[infosys/cluster]homogeneity} Example:

homogeneity=True

comment

Synopsis: comment = text Description: A free-form text field for additional comments on the queue in a single line, no newline character is allowed! Default: undefined Example:

comment=This queue is nothing more than a condor pool

pbs_queue_node

Synopsis: pbs_queue_node = string Description: (previously queue_node_string) In PBS you can assign nodes to a queue (or a queue to nodes) by using the node property mark in PBS config. Essentially, pbs_queue_node value is used to construct nodes= string in PBS script, such as nodes=count:pbs_queue_node where count is taken from the job description (1 if not specified). This corresponds to setting the following parameter in PBS for this queue:

resources_default.neednodes= cpu_topology[:pbs_queue_node]

Setting the pbs_queue_node changes how the queue-totalcpus and user freecpus are determined for this queue. You shouldn't use this option unless you are sure that your PBS configuration makes use of the above configuration. Read the NorduGrid PBS instructions for more information: http://www.nordugrid.org/documents/pbs-config.html Default: undefined Example:

pbs_queue_node=gridlong_nodes pbs_queue_node=ppn=4:ib

Warning: CHANGE: RENAMED in 6.0.0.

sge_jobopts

Synopsis: sge_jobopts = string Description: Per-queue override of additional SGE options to be used when submitting jobs to SGE for this queue. Default: undefined Example: sge_jobopts=-P atlas -r yes

condor_requirements

Synopsis: condor_requirements = string Description: It may be defined for each Condor queue. Use this option to determine which nodes belong to the current queue. The value of condor_requirements must be a valid constraints string which is recognized by a condor_status -constraint ... command. It can reference pre-defined ClassAd attributes (like Memory, Opsys, Arch, HasJava, etc) but also custom ClassAd attributes. To define a custom attribute on a condor node, just add two lines like the ones below in the $(hostname).local config file on the node:

NORDUGRID_RESOURCE=TRUE STARTD_EXPRS = NORDUGRID_RESOURCE, $(STARTD_EXPRS)

A job submitted to this queue is allowed to run on any node which satisfies the condor_requirements constraint. If condor_requirements is not set, jobs will be allowed to run on any of the nodes in the pool. When configuring multiple queues, you can differentiate them based on memory size or disk space, for example. Default: $VAR{[lrms]condor_requirements} Example:

condor_requirements=(OpSys == "linux" && NORDUGRID_RESOURCE && Memory >= 1000 && Memory < 2000)

slurm_requirements

Synopsis: slurm_requirements = string Description: Use this option to specify extra SLURM-specific parameters. Default: undefined Example: slurm_requirements=memory on node >> 200

Warning: CHANGE: NEW in 6.0.0.

totalcpus

Synopsis: totalcpus = number Description: Manually sets the number of cpus assigned to the queue. There is no need to specify this parameter in case the queue_node_string (pbs_queue_node) method was used to assign nodes to the queue (in this case it is dynamically calculated and the static value is overwritten), or when the queue has access to the entire cluster (in this case the cluster-level totalcpus is the relevant parameter).


Default: undefined Example:

totalcpus=32

Queue-level configuration parameters: nodecpu, nodememory, architecture, opsys should be set if they are homogeneous over the nodes assigned to the queue AND they are different from the cluster-level value. Their meanings are described in the [infosys/cluster] block. Usage: this queue collects nodes with nodememory=512 while another queue has nodes with nodememory=256 -> don't set the cluster attributes but use the queue-level attributes. When the frontend's architecture or cputype agrees with the queue nodes, adotf (Automatically Determine On The Frontend) can be used to request automatic determination of architecture or nodecpu. For GLUE2, fine-tuning the configuration of the ExecutionEnvironments' OSName, OSVersion and OSFamily is possible with the dedicated options osname, osversion and osfamily, as in the sketch below.
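For instance (an illustrative sketch; the queue name and OS values are assumptions based on the osname/osversion/osfamily examples below), a queue whose nodes run a different OS than the frontend could set the GLUE2 ExecutionEnvironment OS fields directly:

[queue:gridlong]
osname=Ubuntu
osversion=12.04
osfamily=linux
nodememory=512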

nodecpu

Synopsis: nodecpu = formatted_string Description: see description at [infosys/cluster] block Default: $VAR{[infosys/cluster]nodecpu} Example:

nodecpu=AMD Duron(tm) Processor@ 700 MHz

nodememory

Synopsis: nodememory = number Description: see description at [infosys/cluster] block Default: $VAR{[infosys/cluster]nodememory} Example:

nodememory=512

defaultmemory

Synopsis: defaultmemory = number Description: The LRMS memory request of the job, to be set by the LRMS backend scripts if a user submits a job without specifying how much memory should be used. The order of precedence is: job description -> [lrms-defaultmemory] -> [queue-defaultmemory]. This is the amount of memory (specified in MB) that a job will request. Default: undefined Example:

defaultmemory=512

Warning: CHANGE: NEW in 6.0.0.

architecture

Synopsis: architecture = string Description: see description at [infosys/cluster] block Default: $VAR{[infosys/cluster]architecture} Example: architecture=adotf

opsys

Synopsis: opsys = formatted_string Description: see description at [infosys/cluster] block. If osname and osversion are present, the values in opsys are ignored. This option is multivalued. Default: $VAR{[infosys/cluster]opsys} Example: opsys=Linux-2.6.18 opsys=glibc-2.5.58

osname

Synopsis: osname = string Description: Only for GLUE2: overrides values defined in opsys for a single ExecutionEnvironment. Configuration of multiple ExecutionEnvironments for the same queue is not supported; create a different queue for that. Default: undefined Example: osname=Ubuntu

osversion

Synopsis: osversion = string Description: Only for GLUE2: overrides values defined in opsys for a single ExecutionEnvironment. Configuration of multiple ExecutionEnvironments for the same queue is not supported; create a different queue for that. Default: undefined Example: osversion=12.04

osfamily

Synopsis: osfamily = string Description: Only for GLUE2: overrides values defined in opsys for a single ExecutionEnvironment. Configuration of multiple ExecutionEnvironments for the same queue is not supported; create a different queue for that.


Default: undefined Example:

osfamily=linux

benchmark

Synopsis: benchmark = name value Description: Defines resource benchmark results for accounting and information publishing. The nodes in the same queue are assumed to be homogeneous with respect to the benchmark performance.

Note: in ARC < 6.4 this parameter is only used for information publishing.

In case multiple benchmarks are specified:
• The accounting subsystem will use only the FIRST defined benchmark.
• Infosys will publish all defined benchmark values.
The values represent per-core CPU performance. Note that APEL accounting services accept HEPSPEC or Si2k benchmark types only. This option is multivalued. Default: HEPSPEC 1.0 Example:

benchmark=HEPSPEC 12.26 benchmark=Si2k 3065

Warning: CHANGE: MODIFIED IN 6.4.0.

allowaccess

Synopsis: allowaccess = authgroup Description: (previously groupcfg) Defines that the specified authgroup members are authorized to submit jobs to this queue of the ARC-CE after the user has already been granted access to the CE via one of the interfaces. A related config option, denyaccess (see below), can be used to deny submission to the queue. Multiple allowaccess and denyaccess authorization statements are allowed within a configuration block. These statements are processed sequentially in the order they are specified in the config block. The processing stops on the first allowaccess or denyaccess statement matching the authgroup membership. If there are no authorization statements specified, then the queue is accessible by everyone already authorized. Default: undefined This option is multivalued. Example:

allowaccess=biousers allowaccess=atlasusers


denyaccess

Synopsis: denyaccess = authgroup Description: Defines that the specified authgroup members are NOT allowed to submit jobs to this queue of the ARC-CE, even though the user is already granted access to the CE via one of the interfaces. A related config option, allowaccess (see above), can be used to grant job submission to the queue. Multiple allowaccess and denyaccess authorization statements are allowed within a configuration block. These statements are processed sequentially in the order they are specified in the config block. The processing stops on the first allowaccess or denyaccess statement matching the authgroup membership. If there are no authorization statements specified, then the queue is accessible by everyone already authorized. Default: undefined This option is multivalued. Example:

denyaccess=blacklisted-for-the-queue
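As an illustrative sketch (the authgroup names are taken from the examples above and are assumed to be defined in corresponding [authgroup] blocks), a deny rule can be placed before the allow rules so that it is matched first:

[queue:gridlong]
denyaccess=blacklisted-for-the-queue
allowaccess=biousers
allowaccess=atlasusers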

advertisedvo

Synopsis: advertisedvo = vo_name Description: (previously authorizedvo) This attribute is used to advertise which VOs are authorized on the [queue:name] of the cluster. Add only one VO for each advertisedvo entry. Multiple VOs in the same line will cause errors. These entries will be shown in the MappingPolicy objects, that is, they will apply for the Shares that correspond to the queue. The information is also published in the NorduGrid schema.

Note: if you have also configured advertisedvo in the [infosys/cluster] block, the resulting advertised VOs per queue will override whatever is defined in the [infosys/cluster] block!

Note: it is IMPORTANT to understand that this parameter is NOT enforcing any access control, it is just for information publishing!

This option is multivalued. Default: $VAR{[infosys/cluster]advertisedvo} Example:

advertisedvo=atlas advertisedvo=community.nordugrid.org

Warning: CHANGE: RENAMED in 6.0.0.

maxslotsperjob

Synopsis: maxslotsperjob = number Description: This GLUE2 specific parameter configures the MaxSlotsPerJob value on a particular queue. This value is usually generated by LRMS infocollectors, but there are cases in which a system administrator might like to tweak it. Default is to publish what is returned by the LRMS, and if nothing is returned, NOT to publish the MaxSlotsPerJob attribute. If a system administrator sets the value here, that value will be published instead, regardless of what the LRMS returns. Each LRMS might have a different meaning for this value.


Default: undefined Example: maxslotsperjob=5

forcedefaultvoms

Synopsis: forcedefaultvoms = VOMS_FQAN Description: Specify the VOMS FQAN which the user will be assigned if his/her credentials contain no VOMS attributes. Default: $VAR{[arex]forcedefaultvoms} Example: forcedefaultvoms=/vo/group/subgroup

Warning: CHANGE: documented in [queue] as described in [arex].

maxcputime = number - This value overrides the one defined in the [infosys/cluster] block. See description in that block. Default: undefined Example: maxcputime=300000

mincputime

Synopsis: mincputime = number Description: This value overrides the one defined in the [infosys/cluster] block. See description in that block. Default: undefined Example: mincputime=1200

maxwalltime

Synopsis: maxwalltime = number Description: This value overrides the one defined in the [infosys/cluster] block. See description in that block. Default: undefined Example: maxwalltime=600000

minwalltime

Synopsis: minwalltime = number Description: This value overrides the one defined in the [infosys/cluster] block. See description in that block.


Default: undefined Example: minwalltime=1800

[datadelivery-service] block

This block configures and enables the data delivery service. This service is intended to off-load data staging from A-REX and is usually deployed on one or more separate machines. This service can also act as an independent data transfer service; in that case it would require an intelligent data manager that could replace A-REX's intelligence.

Warning: CHANGE: NEW block in 6.0.0.
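A sketch of a remote delivery setup might look like the following; the hostname is an assumption, the other values come from the examples below, and the commented deliveryservice URL on the A-REX side is only indicative (consult the [arex/data-staging] block reference for the exact form):

[datadelivery-service]
transfer_dir=/shared/arc/cache
transfer_dir=/shared/arc/session
hostname=dds.example.org
port=8443
allowed_ip=192.0.2.1

# on the A-REX host, [arex/data-staging] would then point to this service, e.g.
# deliveryservice=https://dds.example.org:8443/datadeliveryservice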

transfer_dir

Synopsis: *transfer_dir = path Description: (previously allowed_dir) The directory (or directories) on the DDS host in which the service is allowed to read and write. When DDS is used as a remote transfer service assisting A-REX, this is usually one or more cache and/or session directories shared as a common mount with A-REX. This option is multivalued. Default: undefined Example: transfer_dir=/shared/arc/cache transfer_dir=/shared/arc/session

Warning: CHANGE: NEW in 6.0.0.

hostname

Synopsis: hostname = FQDN Description: The hostname of the machine on which DDS service runs. Default: $EXEC{hostname -f} Example: hostname=localhost

Warning: CHANGE: NEW in 6.0.0.

port

Synopsis: port = port Description: Port on which service listens


Default: 443 Example: port=8443

Warning: CHANGE: NEW in 6.0.0.

pidfile

Synopsis: pidfile = path Description: pid file of the daemon Default: /run/arched-datadelivery-service.pid Example: pidfile=/run/arched-datadelivery-service.pid

Warning: CHANGE: NEW in 6.0.0.

logfile

Synopsis: logfile = path Description: log file of the daemon Default: /var/log/arc/datadelivery-service.log Example: logfile=/tmp/delivery.log

Warning: CHANGE: NEW in 6.0.0.

loglevel

Synopsis: loglevel = level Description: set loglevel of the data delivery service between 0 (FATAL) and 5 (DEBUG). Defaults to 3 (INFO). Allowed values: 0, 1, 2, 3, 4, 5 Default: 3 Example: loglevel=4

Warning: CHANGE: NEW in 6.0.0.


user

Synopsis: user = username Description: Overwrites the user under which the service runs. The default is the user starting the service. DDS is very limited if not run as root. Default: undefined Example:

user=ddsuser

Warning: CHANGE: NEW in 6.0.0.

secure

Synopsis: secure = yes/no Description: Set to no if the service should run without a host certificate. In this case the corresponding deliveryservice option in the [arex/data-staging] A-REX configuration block should use http rather than https URLs. Allowed values: yes, no Default: yes Example:

secure=no

Warning: CHANGE: NEW in 6.0.0.

allowed_ip

Synopsis: *allowed_ip = ip Description: IP address authorized to access the service. Normally this is the A-REX host IP. By default the delivery service listens on all available interfaces, so if both IPv4 and IPv6 are enabled on this and the A-REX host, remember to add both A-REX host IPs here. This option is multivalued. Default: undefined Example:

allowed_ip=192.0.2.1 allowed_ip=2001:db8:85a3::8a2e:370:7334

Warning: CHANGE: NEW in 6.0.0.


allowed_dn

Synopsis: allowed_dn = DN Description: DN authorized to access the service. This option restricts access to the specified DNs (of the users who submit jobs to A-REX). It is only effective if secure=yes. This option is multivalued. Default: undefined Example:

allowed_dn=/O=Grid/O=Big VO/CN=Main Boss

Warning: CHANGE: NEW in 6.0.0.

x509_host_key

Synopsis: x509_host_key = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_key} Example:

x509_host_key=/etc/grid-security/hostkey.pem

Warning: CHANGE: NEW in this block in 6.0.0.

x509_host_cert

Synopsis: x509_host_cert = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_cert} Example:

x509_host_cert=/etc/grid-security/hostcert.pem

Warning: CHANGE: NEW in this block in 6.0.0.

x509_cert_dir

Synopsis: x509_cert_dir = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_cert_dir} Example:


x509_cert_dir=/etc/grid-security/certificates

Warning: CHANGE: NEW in this block in 6.0.0.

[acix-scanner] block

The ARC Cache Index (ACIX) is a distributed system that maintains a catalog of locations of cached files stored in various A-REX caches. ACIX consists of two components, the Cache Scanner (on the CEs), and the Index Server. This config block enables and configures the cache scanner component of ACIX. The scanning component of ACIX is a separate service that runs alongside A-REX/DDS and all it needs from A-REX/DDS service is the location of the cache. The acix-scanner periodically scans the cache directories and composes a Bloom filter of A-REX cache content which can be pulled via its public interface. In the current deployment the ACIX index services are the main consumers of the collected information.

Warning: CHANGE: RENAMED block, RENAMED component in 6.0.0 (cache scanner, instead of cacheserver)
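An illustrative sketch of enabling the scanner with an explicitly configured cache location (the path is taken from the cachedir example below):

[acix-scanner]
cachedir=/scratch/cache
cachedump=yes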

cachedir

Synopsis: cachedir = cache_path Description: Specifies the cache directory to be scanned in case it is not set in the [arex/cache] block (e.g. the scanner is running on a different machine than A-REX). This option is multivalued. Default: $VAR{[arex/cache]cachedir} Example: cachedir=/scratch/cache cachedir=/fs1/cache drain

logfile

Synopsis: logfile = path Description: Log file location for the acix-scanner. Default: /var/log/arc/arc-acix-scanner.log Example: logfile=/tmp/arc-acix-scanner.log

Warning: CHANGE: MODIFIED default for the logfile.


hostname

Synopsis: hostname = string Description: Hostname on which the acix-scanner listens Default: $EXEC{hostname -f} Example:

hostname=myhost.org

port

Synopsis: port = port Description: Port on which the acix-scanner service listens Default: 5443 Example:

port=6000

cachedump

Synopsis: cachedump = yes/no Description: Whether to make a dump of the list of files on the cache at $TMP/ARC-ACIX/timestamp each time the acix-scanner runs. Default: no Allowed values: yes, no Example:

cachedump=yes

x509_host_key

Synopsis: x509_host_key = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_key} Example:

x509_host_key=/etc/grid-security/hostkey.pem

Warning: CHANGE: NEW in this block from 6.0.0.

x509_host_cert

Synopsis: x509_host_cert = path Description: Optional parameter to overwrite [common] block values.


Default: $VAR{[common]x509_host_cert} Example:

x509_host_cert=/etc/grid-security/hostcert.pem

Warning: CHANGE: NEW in this block from 6.0.0.

x509_cert_dir

Synopsis: x509_cert_dir = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_cert_dir} Example:

x509_cert_dir=/etc/grid-security/certificates

Warning: CHANGE: NEW in this block from 6.0.0.

[acix-index] block

The ARC Cache Index (ACIX) is a distributed system that maintains a catalog of locations of cached files stored in various A-REX caches. ACIX consists of two components, the Cache Scanner (on the CEs), and the Index Server. This config block enables and configures the index server component of ACIX. The index server component of ACIX collects cache content filters generated by a set of acix-scanners and maintains an aggregated view of distributed cache contents. The acix-index server is deployed separately and can be queried for the location of cached files. The service endpoint is https://hostname:6443/data/index and a query is performed by giving the URLs to check as comma-separated values to the url option, e.g.: index_service_endpoint?url=http://www.nordugrid.org:80/data/echo.sh,http://my.host/data1

cachescanner

Synopsis: *cachescanner = url Description: (previously cacheserver) ACIX cache scanners from which to pull information. This option is multivalued. Default: undefined Example:

cachescanner=https://some.host:5443/data/cache cachescanner=https://another.host:5443/data/cache

Warning: CHANGE: RENAMED in 6.0.0.


x509_host_key

Synopsis: x509_host_key = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_key} Example:

x509_host_key=/etc/grid-security/hostkey.pem

Warning: CHANGE: NEW in this block from 6.0.0.

x509_host_cert

Synopsis: x509_host_cert = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_cert} Example:

x509_host_cert=/etc/grid-security/hostcert.pem

Warning: CHANGE: NEW in this block from 6.0.0.

x509_cert_dir

Synopsis: x509_cert_dir = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_cert_dir} Example:

x509_cert_dir=/etc/grid-security/certificates

Warning: CHANGE: NEW in this block from 6.0.0.

[userlist:name] block

The [userlist:name] blocks are used to define userlists and configure how those are generated by the nordugridmap utility, including the optional user mapping information. The userlist is identified with the generated file that is stored in the outfile. Please note that the behaviour of the nordugridmap external utility can be further modified by the optional [nordugridmap] block (see next block). Note that the [userlist:name] block by itself does not affect authorization. In order to define auth rules, userlists can be referenced within the [authgroup] blocks by the userlist=name parameter. Also, the generated outfile can be used via the 'file' auth rule of the [authgroup] block. The order of this block in arc.conf may influence the authorization decision. This block must appear before the [arex] block.


Warning: CHANGE: RENAMED block in 6.0.0: blockname now better reflects the purpose of the config block. This block is not defining any VO!
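As an illustrative sketch (the VOMS URL and output path are based on the examples below; the [authgroup:atlas] block name is an assumption), a userlist generated by nordugridmap can be referenced from an authgroup like this:

[userlist:atlas-users]
source=vomss://lcg-voms.cern.ch:8443/voms/atlas
outfile=/etc/grid-security/lists/atlas-users

[authgroup:atlas]
userlist=atlas-users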

outfile

Synopsis: outfile = path Description: (previously file) The full path of the GENERATED file that contains the userlist (with optional mapping info). If the same file is specified as output for different [userlist:name] blocks, then nordugridmap will automatically merge entries following the order of the blocks. Default: /etc/grid-security/grid-mapfile Example:

outfile=/etc/grid-security/lists/atlas-users

Warning: CHANGE: RENAMED in 6.0.0.

source

Synopsis: *source = url Description: The URL of the VO database which is used to generate the userlist. nordugridmap will use this URL to automatically generate and keep up to date the userlist (mapfile) specified by the 'outfile' attribute. url is a multivalued attribute; several sources can be specified and all the users from those sources will be merged into the same file. The source URLs are processed in the given order. Currently supported URL types are:
http(s):// - URL to a plain text file. The file should contain a list of DNs.
voms(s):// - URL to a VOMS-Admin interface
file:// - local file (stand-alone or dynamically generated by nordugridmap). The file should contain a list of DNs with an optional mapped unixid: user DN [mapped user ID]. The result of optional mapped unixid processing depends on the mapuser_processing option settings.
userlist:// - reference to another [userlist/name] configuration block
nordugrid - add NorduGrid VO members
You can use either userlist:// or file:// entries to specify dependencies between [userlist/name] blocks, but using userlist:// is the recommended way. For each separate source URL it is possible to override some parameter values. You can use the following syntax to perform this:

"source=URL < parameter1=value1 parameter2=value2"

You can override the following parameters:
mapped_unixid - for http(s), voms(s), ldap and file URLs
cache_enable - for http(s), voms(s), ldap and file URLs
voms_method - for voms(s) URLs
mapuser_processing - for file URLs with mapped_unixid='' overridden (controls the mapped_unixid overriding behaviour for the URL)


This option is multivalued. Default: undefined Example:

source=vomss://voms.ndgf.org:8443/voms/nordugrid.org
source=vomss://lcg-voms.cern.ch:8443/voms/atlas?/atlas/Role=VO-Admin < mapped_unixid=atlasadmin
source=vomss://kuiken.nikhef.nl:8443/voms/gin.ggf.org < voms_method=get
source=http://www.nordugrid.org/developers.dn
source=file:///etc/grid-security/priviliged_users.dn
source=userlist://biousers
source=nordugrid

Warning: CHANGE: MODIFIED options in 6.0.0: edg-mkgridmap source option is removed (obsolete technology); vo source option is renamed as userlist.

mapped_unixid

Synopsis: mapped_unixid = unixid Description: The local UNIXID which is optionally used in the generated outfile by the nordugridmap utility. If any of the sources have already provided mapping information (file:// or userlist://), the behavior depends on 'mapuser_processing' from the [nordugridmap] block:
mapuser_processing = overwrite - ignore already provided mapping and apply mapped_unixid for all sources
mapuser_processing = keep - apply mapped_unixid only for sources that do not already have mapping information
If the mapped_unixid config parameter is not specified or has an empty value, then the behavior depends on the value of allow_empty_unixid from the [nordugridmap] block:
allow_empty_unixid = yes - the empty value will be used for mapped_unixid, which means that nordugridmap will generate only the list of DNs without mapping (consider using mapuser_processing = overwrite along with this option, or sources that do not provide previously defined mapping information)
allow_empty_unixid = no - nordugridmap will skip users without mapping information (if no mapping information is provided by the sources)
Default: nobody Example: mapped_unixid= mapped_unixid=gridtest

Warning: CHANGE: MODIFIED in 6.0.0. The empty value should work the same way as a missing parameter; the default is set to nobody.

voms_fqan_map

Synopsis: voms_fqan_map = fqan unixid Description: The local UNIXID which is used to map voms(s) sources with specific FQAN given.


Several voms_fqan_map entries can be specified for a [userlist/name] block. For each voms(s) source in the [userlist/name] block and every voms_fqan_map record, a separate source record will be automatically generated with mapped_unixid overwritten to the specified one. Sources are generated in the given voms_fqan_map order. The original voms(s) source URLs are processed LAST. This simplifies configuration, especially in redundancy cases when several VOMS servers are used for the same VO. This option is multivalued. Default: undefined Example:

voms_fqan_map=/atlas/Role=VO-Admin atlasadmin voms_fqan_map=/atlas/Role=production atlasprod

filter

Synopsis: filter = ACL string Description: An ACL filter for the nordugridmap utility. Multiple allow/deny statements are possible. The fetched DNs are filtered against the specified rules before they are added to the generated outfile. * can be used as a wildcard. You may run nordugridmap with the --test command line option to see how the filters you specified work. If at least one allow filter is specified, an implicit deny is used at the end of the ACL. If only deny filters are present, an implicit allow is used at the end. This option is multivalued. Default: undefined Example:

filter=deny *infn* filter=allow *NorduGrid*

[nordugridmap] block

This optional block is used to fine-tune the behavior of the nordugridmap utility - an ARC tool used to generate grid-mapfiles. Normal setups don't need to configure this block. Please refer to the [userlist/name] block above for information on how to specify sources for userlist generation. This section sets up general source-independent parameters.

x509_host_key

Synopsis: x509_host_key = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_key} Example:

x509_host_key=/etc/grid-security/hostkey.pem

Warning: CHANGE: RENAMED in 6.0.0.


x509_host_cert

Synopsis: x509_host_cert = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_host_cert} Example:

x509_host_cert=/etc/grid-security/hostcert.pem

Warning: CHANGE: RENAMED in 6.0.0.

x509_cert_dir

Synopsis: x509_cert_dir = path Description: Optional parameter to overwrite [common] block values. Default: $VAR{[common]x509_cert_dir} Example:

x509_cert_dir=/etc/grid-security/certificates

gridmap_owner

Synopsis: gridmap_owner = username Description: The owner of the generated mapfiles. Default: root Example:

gridmap_owner=root

gridmap_group

Synopsis: gridmap_group = groupname Description: The group of generated gridmapfiles. Default: root Example:

gridmap_group=root

gridmap_permissions

Synopsis: gridmap_permissions = filemode Description: The permissions of generated gridmapfiles. Default: 0600 Example:


gridmap_permissions=0600

log_to_file

Synopsis: log_to_file = yes/no Description: Controls whether the logging output of nordugridmap will be saved to a file. If the value is 'no', nordugridmap will write all information to STDERR. Allowed values: yes, no Default: yes Example:

log_to_file=no

logfile

Synopsis: logfile = path Description: Specifies the nordugridmap log file location when log_to_file is set to yes. Default: /var/log/arc/nordugridmap.log Example:

logfile=/var/log/arc/nordugridmap.log

cache_enable

Synopsis: cache_enable = yes/no Description: Controls whether caching of external sources will be used. Allowed values: yes, no Default: yes Example:

cache_enable=yes

cachedir

Synopsis: cachedir = path Description: Specifies the path where cached sources will be stored. Default: /var/spool/arc/gridmapcache/ Example:

cachedir=/var/spool/arc/gridmapcache/


cachetime

Synopsis: cachetime = seconds Description: Controls how long (in seconds) the cached information remains valid. Default is 259200 (3 days). Default: 259200 Example:

cachetime=259200

mapuser_processing

Synopsis: mapuser_processing = overwrite/keep Description: Controls the behavior of the [userlist:name] block's mapped_unixid parameter usage. Please see the 'mapped_unixid' description in the [userlist:name] block for details. Allowed values: keep, overwrite Default: keep Example:

mapuser_processing=keep

allow_empty_unixid

Synopsis: allow_empty_unixid = yes/no Description: Controls whether empty (or unspecified) ‘mapped_unixid’ [userlist:name] block option is allowed to be used. Please see ‘mapped_unixid’ description for details. Allowed values: yes, no Default: yes Example:

allow_empty_unixid=no

Warning: CHANGE: MODIFIED in 6.0.0.

voms_method

Synopsis: voms_method = soap/get Description: Controls how to get information from VOMS(S) sources. Valid values are:
soap - call the SOAP method directly using SOAP::Lite
get - use the old implementation that manually parses the XML response
Allowed values: soap, get Default: soap Example:

voms_method=soap

loglevel

Synopsis: loglevel = level Description: (previously debug) Controls the verbosity of nordugridmap output. Valid values are:
0 - FATAL - only critical fatal errors are shown
1 - ERROR - errors, including non-critical ones, are shown
2 - WARNING (default) - configuration errors that can be ignored
3 - INFO - processing information
4 - VERBOSE - a bit more processing information
5 - DEBUG - a lot of processing information
When a test run is requested (--test command line option of nordugridmap), loglevel is automatically set to 5 (DEBUG). Allowed values: 0, 1, 2, 3, 4, 5 Default: 2 Example: loglevel=4

Warning: CHANGE: RENAMED in 6.0.0.

fetch_timeout

Synopsis: fetch_timeout = seconds Description: Controls how long (in seconds) nordugridmap will wait for external source retrieval. Default: 15 Example: fetch_timeout=15

[custom:name] block

This optional block is for those who wish to include non-ARC configuration in arc.conf. Custom blocks will be ignored by ARC components including the configuration validator. Any non-ARC configuration which is not in a custom block will be flagged as an error by the validator and A-REX will not start.
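An illustrative sketch (the block name and keys are purely hypothetical and ignored by ARC):

[custom:mymonitoring]
probe_interval=60
probe_target=https://monitor.example.org/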

4.1.3 Configuration options removed in ARC 6.8.0

[arex/jura] block

urdelivery_keepfailed

Synopsis: urdelivery_keepfailed = days Description: (deprecated in 6.4.0) Specifies for how many days JURA will try to send a record to the destination accounting service before it gives up. Records not successfully sent after this number of days have expired will be deleted from the controldir/logs directory. The deleted records are nevertheless archived if archiving was turned on. Starting from 6.4, all records are stored in the accounting database without an expiration time, independently of the publishing process. Default: 30 Example:


urdelivery_keepfailed=30

Warning: CHANGE: DEPRECATED in 6.4.0

[arex/jura/sgas:targetname] block

legacy_fallback

Synopsis: legacy_fallback = yes/no Description: (introduced in 6.4.0) If set to yes then accounting record publishing to this target will be handled with the previous version of the code (6.0-6.3) that relies on A-REX job log files. Use this option only if you experience problems with the new JURA. Default: no Example:

legacy_fallback=no

Warning: CHANGE: NEW in 6.4.0.

[arex/jura/apel:targetname] block

use_ssl

Synopsis: use_ssl = yes/no Description: (deprecated in 6.4.0) Turns on/off ssl for the SSM communication with APEL server. Use only with ARC version < 6.4.0. In ARC 6.4+ targeturl protocol defines the usage of SSL. Allowed values: yes, no Default: no Example:

use_ssl=yes

Warning: CHANGE: DEPRECATED in 6.4.0.

benchmark_type

Synopsis: benchmark_type = type Description: (deprecated in 6.4.0) Type of benchmark (Si2k, HEPSPEC) to be reported in every UR. Starting from 6.4 no need to specify the benchmark values here as those will be taken from the [queue] block benchmark option. Default: undefined Example:


benchmark_type=HEPSPEC

Warning: CHANGE: DEPRECATED in 6.4.0.

benchmark_value

Synopsis: benchmark_value = number Description: (deprecated in 6.4.0) The value of the benchmark to be reported in every UR. Starting from 6.4 there is no need to specify the benchmark values here as those will be taken from the [queue] block benchmark option. Default: undefined Example:

benchmark_value=2.4

Warning: CHANGE: DEPRECATED in 6.4.0.

benchmark_description

Synopsis: benchmark_description = string Description: (deprecated in 6.4.0) Additional description for the benchmark to be reported in every UR. Default: undefined Example:

benchmark_description=some description for benchmark

Warning: CHANGE: DEPRECATED in 6.4.0.

legacy_fallback

Synopsis: legacy_fallback = yes/no Description: (introduced in 6.4.0) If set to yes then accounting record publishing to this target will be handled with the previous version of the code (6.0-6.3) that relies on A-REX job log files. Use this option only if you experience problems with the new JURA. Default: no Example:

legacy_fallback=no

Warning: CHANGE: NEW in 6.4.0.


[arex/jura/archiving] block

This block enables the archiving functionality of the pre-6.4 JURA implementation.

Note: starting from ARC 6.4.0, with the introduction of the next generation accounting subsystem, this block only takes effect if the legacy_fallback option is set to yes for a target. In 6.4 all accounting data is stored in the database, eliminating the need for any additional record archiving.

When enabled, the usage records generated by JURA are stored in the specified archivedir directory on disk. Stored records are then processed by the accounting archive manager, which stores the data in an SQLite accounting archive database, creates the new archive structure and maintains the aging of the accounting database. By default archiving is turned off.

Warning: CHANGE: DEPRECATED in 6.4.0.

archivedir

archivedir

Synopsis: archivedir = path
Description: Sets the directory path for the JURA archived usage records. Usage records generated by the JURA reporter are stored in this directory using the file naming convention usagerecord.<Type>.<...>, where Type is one of the supported record formats such as OGFUR or CAR.
Default: /var/spool/arc/jura/archive
Example: archivedir=/var/spool/arc/jura/archive

logfile

Synopsis: logfile = path
Description: The name of the accounting database manager logfile.
Default: /var/log/arc/jura-archive-manager.log
Example: logfile=/var/log/arc/jura-archive-manager.log

archivettl

Synopsis: archivettl = days
Description: The record time to live (TTL) parameter sets the number of days to keep the usage record files in the accounting database. If not specified, the files are kept forever.
Default: undefined
Example: archivettl=365


4.2 ARC CE Deployment and Operation

4.2.1 Try ARC6: towards distributed computing in a few minutes

Scared of distributed computing complexities? With ARC6 you can set up a Computing Element and try common distributed computing workflows in just a few minutes! ARC6 comes with a so-called zero configuration included and works out of the box without any configuration at all. https://www.youtu.be/wrE36NQM67c

Step 1. Enable NorduGrid ARC6 repos

Prepare your system to install via the NorduGrid Repositories.

Note: Alpha and release-candidate packages are in the testing repository, so please make sure it is enabled, e.g. on RHEL-based systems you can use yum --enablerepo=nordugrid-testing to enable it for one transaction or yum-config-manager --enable nordugrid-testing to enable it permanently.

If you want to test ARC6 including all latest developments, set up your repository to include the nightly builds following Using ARC packages from nightly builds instructions.

Step 2. Install A-REX

The ARC Resource-coupled EXecution service (A-REX) is a core component that manages authentication, authorization and the job life cycle. It is enough to have A-REX installed to have a minimal computing element:

[root ~]# yum -y install nordugrid-arc-arex

Step 3. Run A-REX

To start ARC services just run:

[root ~]# arcctl service start --as-configured

You can check if A-REX is running with:

[root ~]# arcctl service list
arc-acix-index            (Not installed, Disabled, Stopped)
arc-acix-scanner          (Not installed, Disabled, Stopped)
arc-arex                  (Installed, Enabled, Running)
arc-datadelivery-service  (Not installed, Disabled, Stopped)
arc-gridftpd              (Not installed, Disabled, Stopped)
arc-infosys-ldap          (Not installed, Disabled, Stopped)

Note: arcctl tool automates many ARC CE operations and is designed with bash-completion in mind. If you would like to use ARC in production it is advised to have completion enabled:

[root ~]# yum install bash-completion python-argcomplete
[root ~]# activate-global-python-argcomplete


Step 4. Generate user certificate and key for testing

Authentication of grid services and users relies heavily on cryptography and uses certificates/keys for each entity. ARC6 comes with a Test Certificate Authority on board that can easily issue test user certificates. The ARC6 zero configuration implements a default closed approach, defining a special authorization object called an authgroup. During the test-user certificate generation, arcctl test-ca will automatically add the issued certificate subject to the testCA.allowed-subjects file, transparently opening the job submission possibility to the test user. The testCA.allowed-subjects file can be found in your /etc/grid-security folder. No other subject will be able to submit to your system before you change the authgroup settings in arc.conf. You can test submission from the host running A-REX or from any other host in the network.

Testing from the host running A-REX

It is technically possible to submit jobs from the root account, however it is advised to use a dedicated regular user; here we assume that you do so. To generate a test certificate/key and install it to the standard location inside the local user's home directory, run:

[root ~]# arcctl test-ca usercert --install-user user01
User certificate and key are installed to default /home/user01/.globus location for user user01.

Testing from any other host

In order to submit jobs from any other host (not the one running A-REX) you need to transfer the (test) user certificate and the CA-files to this other host. On the A-REX host generate a user certificate/key:

[root ~]# arcctl test-ca usercert --export-tar
User certificate and key are exported to testcert-09160712.tar.gz.
To use it with arc* tools on the other machine, copy the tarball and run the following commands:
tar xzf testcert-09160712.tar.gz
source arc-test-certs/setenv.sh

Transfer the tarball to the client host and on the client host execute the commands suggested in the arcctl output:

[user ~]$ tar xzf /tmp/testcert-09160712.tar.gz
[user ~]$ source arc-test-certs/setenv.sh

Note: The zero configured A-REX comes with the EMI-ES and REST interfaces enabled. It runs on port 443, so make sure this port is not blocked by a firewall if you want to use it from another client host.

Step 5. Submit job and check it is running

Install ARC client tools on the client host:

[root ~]# yum -y install nordugrid-arc-client

To submit a job, or perform any other action towards the ARC server, you need a so-called proxy certificate, which is a Single Sign-On token for the distributed grid infrastructure. It is generated in the following way:


[user ~]$ arcproxy
Your identity: /DC=org/DC=nordugrid/DC=ARC/O=TestCA/CN=Test Cert 08272152
Proxy generation succeeded
Your proxy is valid until: 2018-08-28 09:54:24

You can start with an information query about your newly installed ARC computing element¹:

[user ~]$ arcinfo -c https://arc.example.org/arex
Computing service:
  Information endpoint: https://arc.example.org:443/arex
  Information endpoint: https://arc.example.org:443/arex
  Information endpoint: https://arc.example.org:443/arex
  Submission endpoint: https://arc.example.org:443/arex (status: ok, interface: org.ogf.glue.emies.activitycreation)
  Submission endpoint: https://arc.example.org:443/arex (status: ok, interface: org.ogf.bes)
  Submission endpoint: https://arc.example.org:443/arex (status: ok, interface: org.nordugrid.arcrest)

A simple job can be submitted with the arctest tool:

[user ~]$ arctest -J2 -c https://arc.example.org/arex
Submitting test-job 2:
&( executable = "/usr/bin/env" )( stdout = "stdout" )( stderr = "stdout" )( gmlog = "gmlog" )( jobname = "arctest2" )( clientxrsl = "&( executable = ""/usr/bin/env"" )( jobname = ""arctest2"" )( stdout = ""stdout"" )( join = ""yes"" )( gmlog = ""gmlog"" )" )
Client version: nordugrid-arc-20180822231219
Test submitted with jobid: https://arc.example.org:443/arex/NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm

The job status can be checked with the arcstat tool:

[user ~]$ arcstat https://arc.example.org:443/arex/NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm
Job: https://arc.example.org:443/arex/NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm
 Name: arctest2
 State: Running
Status of 1 jobs was queried, 1 jobs returned information

To fetch the job's stdout, run the arccat tool:

[user ~]$ arccat https://arc.example.org:443/arex/NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm
GRIDMAP=/dev/null
HOSTNAME=arc.zero
TMPDIR=/tmp
GLOBUS_LOCATION=/usr

Step 6. Play more with the ARC Computing Element

As an admin you might frequently need to extract information from the logs and directories that the ARC computing element uses. A brief list of the relevant paths can be obtained from:

1 The examples use arc.example.org as the domain name of the A-REX host.


[root ~]# arcctl config brief
ARC Storage Areas:
  Control directory: /var/spool/arc/jobstatus
  Session directories: /var/spool/arc/sessiondir
  Scratch directory on Worker Node: Not configured
  Additional user-defined RTE directories: Not configured
ARC Log Files:
  A-REX Service log: /var/log/arc/arex.log
  A-REX Jobs log: /var/log/arc/arex-jobs.log
  A-REX Helpers log: /var/log/arc/job.helper.errors
  A-REX WS Interface log: /var/log/arc/ws-interface.log
  Infosys Infoproviders log: /var/log/arc/infoprovider.log

To get information about and manage jobs on the A-REX server, arcctl job is useful. Operations include, but are not limited to:
• Listing jobs:

[root ~]# arcctl job list
NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm
zb0LDm7RfEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmDBFKDme1CYXm

[root ~]# arcctl job list --long
NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm  FINISHED  arctest2  /DC=org/DC=nordugrid/DC=ARC/O=TestCA/CN=Test Cert 08272152
zb0LDm7RfEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmDBFKDme1CYXm  FINISHED  arctest2  /DC=org/DC=nordugrid/DC=ARC/O=TestCA/CN=Test Cert 08272234

• Job general information:

[root ~]# arcctl job info NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm
Name     : arctest2
Owner    : /DC=org/DC=nordugrid/DC=ARC/O=TestCA/CN=Test Cert 08272152
State    : FINISHED
LRMS ID  : 16890
Modified : 2018-08-27 22:06:34

• Job log:

[root ~]# arcctl job log NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm
2018-08-27T22:03:34Z Job state change UNDEFINED -> ACCEPTED   Reason: (Re)Accepting new job
2018-08-27T22:03:34Z Job state change ACCEPTED -> PREPARING   Reason: Starting job processing
2018-08-27T22:03:34Z Job state change PREPARING -> SUBMIT     Reason: Pre-staging finished, passing job to LRMS
2018-08-27T22:03:36Z Job state change SUBMIT -> INLRMS        Reason: Job is passed to LRMS
2018-08-27T22:06:34Z Job state change INLRMS -> FINISHING     Reason: Job finished executing in LRMS
2018-08-27T22:06:34Z Job state change FINISHING -> FINISHED   Reason: Stage-out finished.



• A-REX logs that mention the job:

[root ~]# arcctl job log NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm --service
### /var/log/arc/arex.log:
[2018-08-27 22:03:34] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: State: ACCEPTED: parsing job description
[2018-08-27 22:03:34] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: State: ACCEPTED: moving to PREPARING
[2018-08-27 22:03:34] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: State: PREPARING from ACCEPTED
[2018-08-27 22:03:34] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: State: SUBMIT from PREPARING
[2018-08-27 22:03:34] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: state SUBMIT: starting child: /usr/share/arc/submit-fork-job
[2018-08-27 22:03:36] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: state SUBMIT: child exited with code 0
[2018-08-27 22:03:36] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: State: INLRMS from SUBMIT
[2018-08-27 22:06:34] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: Job finished
[2018-08-27 22:06:34] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: State: FINISHING from INLRMS
[2018-08-27 22:06:34] [Arc] [INFO] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: State: FINISHED from FINISHING
[2018-08-27 22:06:34] [Arc] [ERROR] [16298/4] NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm: Job monitoring stop requested with 1 active references
### /var/log/arc/ws-interface.log:
[2018-08-27 22:05:32] [Arc.A-REX] [INFO] [16298/42] GET: id NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm path stdout
[2018-08-27 22:05:49] [Arc.A-REX] [INFO] [16298/43] GET: id NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm path stdout

• Getting job attributes:

[root ~]# arcctl job attr NhlKDmsmeEtnPSAtDmVmuSEmABFKDmABFKDm2PJKDmBBFKDmxDyQbm jobname
arctest2

Now you are ready to Install production ARC6 Computing Element!

4.2.2 ARC Computing Element Installation and Configuration Guide

Prerequisites

Choosing the host

It is assumed that the ARC CE is installed on top of an existing Linux computing cluster. Many Linux distributions are supported. ARC also works well in a completely virtual computing cluster environment in a cloud.


ARC is non-intrusive towards existing systems. We suggest deploying the ARC CE on a dedicated (virtual) machine connected to the cluster network and filesystem. ARC software is very lightweight and does not require powerful machines to run, however if the ARC CE will perform data transfers the requirements are higher. As a minimum, a production CE with 4 cores and 8GB of RAM should be capable of handling up to 10,000 concurrent jobs without problems. One CE can easily handle the load of a single cluster, however multiple CEs may be deployed in parallel for redundancy.

Plan for storage areas

Several storage areas are necessary for job submission, execution and data storage. You should mount/export the following directories:
• session directory
• data staging cache directory (if planned)
• decide to what extent to use a scratch directory on the worker nodes that is NOT cross-mounted
The session directory (and the cache directory, if used) is typically a cross-mounted NFS share. Please note that in the typical setup, where A-REX runs as root, the NFS share needs to be exported with no_root_squash.
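As an illustration, the corresponding export on the NFS server might look like the following sketch (paths and host names are examples; adjust to your site):

# /etc/exports on the NFS server
/var/spool/arc/sessiondir   ce.example.org(rw,sync,no_root_squash)
/var/spool/arc/sessiondir   wn*.example.org(rw,sync,no_root_squash)
/grid/cache                 ce.example.org(rw,sync,no_root_squash)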

Local resource management system (LRMS)

Install and configure your LRMS (batch system). ARC supports a variety of LRMS back-ends:
• fork - fork jobs on the ARC CE host node, not a cluster. Targeted for testing and development, not for real production workloads.
• condor - uses an HTCondor-powered HTC resource
• slurm - for SLURM clusters
• pbs - any flavor of PBS batch system, including Torque and PBSPro
• pbspro - dedicated Altair PBS Professional backend (from the 6.1 release)
• ll - Load Leveler batch system
• lsf - Load Sharing Facility batch system
• sge - Oracle Grid Engine (formerly Sun Grid Engine)
• boinc - works as a gateway to BOINC volunteer computing resources
Start by checking if you are able to submit jobs to the chosen LRMS from the ARC CE host; see the sketch after this list. You may consider setting up dedicated queues to use with the ARC CE (e.g. per-VO queues). Please also NOTE that in some cases (depending on the LRMS) you need to share the batch system log directories with the ARC CE.
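For example, on a SLURM cluster such a check could be as simple as the following sketch (the partition name atlas and the returned node name are examples):

[root ~]# srun -p atlas hostname
wn01.example.org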

Configure OS accounts

Plan for local account(s) (or account pools) that will be used to execute jobs on the worker nodes. These accounts should also be available on the ARC CE node. Please note that ARC services are run as root on the ARC CE node and switch to the appropriate local account when processing job data staging and job execution. This process is called mapping.
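On a small test setup such accounts could simply be created locally, as in the sketch below (account names and count are examples; in production these accounts are usually provisioned on the CE and all worker nodes through your site's account management, e.g. LDAP or configuration management tools):

[root ~]# for i in $(seq -w 1 50); do useradd -m "gridjob$i"; done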


Installation

This section assumes you have already enabled the NorduGrid repositories for your package utility (yum/dnf/apt).

Note: If you are using RHEL-based operating systems, ARC can be installed directly from the EPEL repository. Please note that in EPEL-7 the nordugrid-arc-* packages deliver ARC 5. Use nordugrid-arc6-* to install ARC 6 from EPEL-7.

Install ARC CE core packages from repositories:

[root ~]# yum -y install nordugrid-arc-arex

or

[root ~]# apt-get install nordugrid-arc-arex

Any extra packages will be installed based on the ARC configuration file with the ARC Control Tool as described below. The full list of packages to install manually (especially additional plugins) can be found here. Grid security heavily relies on PKI and all actions require certificates/keys for the ARC CE as a service and for users:
• for testing purposes, the ARC Test-CA and a host certificate signed by the Test-CA are generated during A-REX installation.
• for production use please obtain a certificate signed by one of the IGTF accredited CAs and remove the Test-CA files with arcctl test-ca cleanup.
In production the ARC CE needs IGTF CA certificates deployed to authenticate users and other services, such as storage elements. To deploy IGTF CA certificates to the ARC CE host, run¹:

[root ~]# arcctl deploy igtf-ca classic

Configuration

Configuration of the ARC CE can be done by modifying the pre-shipped zero configuration available at /etc/arc.conf. The purpose of this zero configuration is to offer a minimalistic working computing element out of the box, right after package installation, with zero additional configuration needed. For production deployment you will need to customize the configuration in accordance with your actual setup and operations mode.

Note: ARC services must be restarted when changes have been made to arc.conf.

The ultimate information about available configuration options can be found in the ARC Configuration Reference Document, which is also available locally as /usr/share/doc/nordugrid-arc-*/arc.conf.reference. The most common configuration steps are explained below.

Configure authorization and mapping rules

Authorization rules define who can access the computing element (execute jobs, query info, etc). Mapping rules define which grid-users are mapped to which system accounts. Both authorization and mapping rules in ARC6 rely on the concept of authgroups. Each authgroup represents a set of users, whose identities are matched to configured rules.

1 Use --installrepo argument to enable repositories with IGTF CA certificates if ARC is not installed from the NorduGrid repos.


Once defined, authgroups can be applied to filter access to the CE per interface ([arex/ws/jobs], [gridftpd/jobs]) and/or per queue. The allowaccess and/or denyaccess options in the corresponding block define which authgroups are allowed to access the interface or submit to the queue. The [mapping] block is used to configure the rules that define how particular authgroup members are mapped to OS accounts. In the shipped zero configuration the [authgroup: zero] is defined and applied to the A-REX WS interface, the effect of which is to deny any access unless the user is listed in the testCA.allowed-subjects file. The mapping is configured with a map_to_user rule that assigns the same nobody account to everyone in the zero authgroup. A typical configuration looks like this:

[authgroup: atlas]
voms= atlas ***

[mapping]
map_to_pool= atlas /etc/grid-security/pool/atlas

[gridftpd/jobs]
allowaccess= atlas

[queue: qatlas]
allowaccess= atlas

Please read the Authorization, Mapping and Queue selection rules document to get familiar with all aspects of this important configuration step.

Provide LRMS-specific information

One more critical configuration step is to supply the ARC CE with relevant information regarding your LRMS specifics.

Specify your LRMS type

In the arc.conf there is a dedicated [lrms] block that defines the type of your LRMS, as well as several options related to the LRMS behaviour. For example, to instruct ARC to use SLURM, use the following configuration:

[lrms]
lrms= slurm
slurm_use_sacct= yes

Specify queues

In addition to specifying the LRMS itself, it is necessary to list all the queues that will be exposed via the ARC CE, by using [queue: name] blocks.

[queue: atlas]
comment= Queue for ATLAS jobs

More information about configuring particular LRMS to work with ARC can be found in Batch systems support document.


Configure A-REX Subsystems

The ARC Resource-coupled EXecution service (A-REX) is a core service handling execution and entire life cycle of compute jobs.

Enable job management interfaces

A-REX has several job management interfaces available. One can control which of them are enabled and exposed by configuring the corresponding blocks:
• WS interfaces (EMI-ES and ARC REST): [arex/ws/jobs]
• GridFTP: [gridftpd/jobs]
• Internal: install the nordugrid-arc-plugins-internal package to use this interface
A minimal sketch for the WS case is shown below.
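Enabling the WS job interface amounts to having the corresponding blocks present in arc.conf (a sketch relying on the default values of the block options):

[arex/ws]

[arex/ws/jobs]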

Enable data services

ARC has a built-in data transfer framework called DTR. It was designed for environments in which data transfer is not possible or not desirable on the worker nodes, such as HPC centres or sites without local storage. DTR relies on users submitting jobs with pre-defined input and output files. When A-REX receives a job, it takes care of downloading the specified input files to the job's session directory and then submits the job to the batch system. After the batch job finishes, A-REX takes care of uploading any output files to grid storage. Define the [arex/data-staging] block to enable data-staging capabilities. Data transfers can be scaled out using multi-host data staging.
DTR also includes a caching capability. If caching is enabled, A-REX downloads all input files to the cache and creates symlinks from the session directory for each file. If a job requests a file that is already cached, A-REX will not download it again, but simply link to the existing cache file. Define the [arex/cache] block to enable caching.
More detailed technical documentation on ARC data features and advanced features such as ACIX and CandyPond can be found in the data overview pages.
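A minimal sketch of enabling both capabilities, relying on default option values (the cache path is an example):

[arex/data-staging]

[arex/cache]
cachedir=/grid/cache01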

RunTime Environments

RunTime Environments can modify the job execution cycle and are used for advertising software or features offered by the computing facility. ARC ships several RTEs that are ready to be used; these are classified as system-defined. One can add one's own directories with so-called user-defined RTEs using the runtimedir configuration option in the [arex] block. In ARC6, both system- and user-defined directories are local to the ARC CE node and SHOULD NOT be shared to worker nodes (unlike in ARC 5). To use an installed RTE, one should additionally enable this RTE with the ARC Control Tool. For example, to enable the system-defined ENV/PROXY RTE, run:

[root ~]# arcctl rte enable ENV/PROXY
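If you keep your own user-defined RTE scripts, point the runtimedir option in the [arex] block at their directory, as in this sketch (the path is only an illustration):

[arex]
runtimedir=/etc/arc/rte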

More details on operating RunTime Environments can be found in RunTime Environments in ARC6.


Information system

The ARC CE information system aims to collect and publish information to be used by special clients for matchmaking and/or monitoring the state and stats of the resource. It is mandatory to configure the information system for production use cases, such as the WLCG computing infrastructure.

Defining general information

There are many information schemas and renderings of data available to comply with existing standards. Several blocks are used to define the published information, depending on the schemas:
• [infosys] - the most common block that enables internal information collection from the ARC CE host and LRMS.
• [infosys/cluster] - the common information about the whole cluster, including e.g. its capacity.
• [queue: name] - for heterogeneous clusters, most of the information in the [infosys/cluster] block can be re-defined on a per-queue basis.
• [infosys/glue2] - configures the GLUE2-specific values and enables the internal GLUE2 rendering.
• [infosys/ldap] - enables the LDAP/BDII dedicated services to publish information via the LDAP protocol.
• [infosys/glue2/ldap] - enables the GLUE2-schema LDAP rendering of the information.
• [infosys/nordugrid] - enables the LDAP rendering of the information according to the NorduGrid schema.
• [infosys/glue1] - configures the GLUE1.x-schema specific values and enables the LDAP rendering of GLUE1.x.
• [infosys/glue1/site-bdii] - enables and configures the GLUE1.x site-bdii functionality.
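For instance, a site that wants the internal GLUE2 rendering plus both GLUE2 and NorduGrid LDAP renderings might enable the following combination of blocks (a sketch relying on default option values; consult the reference for per-block options):

[infosys]

[infosys/cluster]

[infosys/glue2]

[infosys/ldap]

[infosys/glue2/ldap]

[infosys/nordugrid]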

Accounting

ARC CE has built-in functionality to measure jobs' resource usage metrics that can be used for analyses and for publishing to the SGAS and APEL centralized accounting services.
New in version 6.4: ARC 6.4 introduced the next generation accounting subsystem: A-REX stores complete job accounting data permanently in the local SQLite accounting database. The local accounting database is used as a powerful analysis instrument as part of the ARC Control Tool functionality and to generate standard-compliant usage records for publishing data to SGAS and APEL.
Deprecated since version 6.4: In the 6.0-6.3 releases the Job Usage Reporter of ARC (JURA) tool creates standard-compliant usage records from job usage information provided by the A-REX job log files, sends the records to remote accounting services and optionally archives the records for future analyses and republishing.
If you need to configure accounting follow the accounting guide.
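As an illustration only, publishing to an APEL broker involves an [arex/jura] block and a target block similar to the following sketch (the target name and URL are placeholders; additional target options, such as the site's GOCDB name for APEL, are described in the accounting guide):

[arex/jura]

[arex/jura/apel:EGI]
targeturl=https://msg.example.org:443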

Configure Firewall

Different ARC CE services open a set of ports that should be allowed in the firewall configuration. To generate iptables configuration based on arc.conf, run:

[root ~]# arcctl deploy iptables-config


Enable and Run Services

To enable and run all services as configured in arc.conf, run:

[root ~]# arcctl service enable --as-configured --now

Instead of using ARC Control Tool to manage ARC services, you can always use your OS native tools.
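For example, with systemd the A-REX service alone can be enabled and started directly (a sketch; the service names are listed in the ARC6 Services table):

[root ~]# systemctl enable --now arc-arex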

Test Basic Functionality

To test some basic job submission to the configured ARC CE, follow the instructions provided in the Try ARC6: towards distributed computing in a few minutes.

4.2.3 ARC5 to ARC6 Migration Guide

ARC 6 comes with major re-engineering of the code base, leading to improvements in the deployment, configuration, maintenance and scalability of the ARC CE. It introduces new features, while some legacy interfaces, components and features have been obsoleted and removed. Despite all the code changes, the supported ARC 6 CE interfaces are unchanged, as is the structure of the control directory, session directory, accounting logs, etc. This makes it possible to upgrade ARC5 to ARC6 without a complete re-installation "from scratch".

Note: Despite the fact that we haven’t observed any issues during normal package upgrades it is recommended to perform a clean install “from scratch” wherever possible in order to eliminate hidden bugs and ARC5 leftovers.

The following sections provide the necessary information about the major steps of an ARC5 to ARC6 migration.

Quick reference

Here is a quick 6-step checklist of what to do:
1. Rewrite your arc.conf: you can send your configuration to us and we will help to rework it for ARC6.
2. Learn the ARC6 package and service names.
3. Get familiar with the ARC Control Tool that will simplify your life as an ARC6 administrator.
4. Don't forget to enable RunTime Environments (RTE) with arcctl rte enable.
5. Be aware of the accounting subsystem changes.
6. Say "Good Bye!" to EGIIS, the ARC community is migrating over to a DNS-based ARCHERY system.
The details:

Configuration file

Because of the complete overhaul of the server-side ARC CE configuration, an existing ARC5 configuration can not be used with an ARC6 installation; it will immediately fail during the configuration validation process at ARC service startup. Therefore you must create a new ARC6 configuration from scratch.

Note: There is no automatic converter from an old ARC5 configuration file to the new ARC6 syntax. The main reason for this is that there are many conceptual changes rather than simple option and block renaming changes. We offer support in configuration rework via nordugrid-discuss mailing list or ARC6 Support Skype Channel. Just send your configuration to us!


When reworking the configuration take notice of these syntax changes:
• no more quotes in option values
• blocks now not only group options, but have an enable/disable functionality
• in block names keywords are separated by / to represent subblocks (if such exist), while block identifiers are separated by : from keywords

Note: It is advised to read the General configuration structure section of ARC configuration reference document before starting any rewrite.

There are also changes in the configuration approach in several areas. The major highlights are:
• The way authorization and user mapping is handled has been greatly reworked and improved. In particular, mapping via gridmap files is NO longer required nor recommended. If you still have gridmap files in your configuration, consider whether you really want to keep this approach in ARC6.
• JURA accounting is now configured within dedicated blocks.
• Infosys schemas are now also enabled/disabled with blocks. In particular, the default ARC5 NorduGrid LDAP schema should be explicitly enabled with the [infosys/nordugrid] block.

Warning: The ARC6 configuration also comes with consistent defaults for everything. If a block is enabled, all configuration options in this block that have no specific value in arc.conf get their default values. The configuration with all defaults included can be printed with arcctl config dump. Notice that any change in configuration requires a restart of all affected ARC services.

Note: Config validation is enforced during A-REX start or can be run manually with arcctl config validate. So don’t worry if you had misspelled something, the validator will catch it!

Operating ARC services

ARC6 also comes with a revised, consistent package and service naming scheme. We made the service and package names clearer and consistent with the configuration block names in arc.conf. Log file locations and the internal format have been reviewed to provide consistency across the ARC subsystems. Log file locations can be printed with arcctl config brief.

ARC Control Tool

To simplify common ARC CE operations, ARC6 introduces the ARC Control Tool - the single entry point to administer and work with various ARC and third-party components. The arcctl tool helps ARC site admins by wrapping typical complex operations and unifying ARC CE administration under a single utility. The arcctl tool was designed with BASH completion in mind. It completes subsystem names, arguments, jobIDs, certificate DNs, RTE names, etc.

Note: To get familiar with some of the arcctl functionality it is advised to follow the Try ARC6: towards distributed computing in a few minutes tutorial. Using the arcctl tool to administer an ARC6 site is recommended even for the most experienced ARC5 admins.


RunTime Environments

The new framework for RunTime Environments (RTE) has a redesigned architecture and largely extended functionality (default, installed and enabled RTEs, introduction of RTE parameters). The major conceptual and behavioural differences are:
• in ARC6, RTE directories are local to the ARC CE head node and SHOULD NOT be shared to worker nodes
• in ARC6 the RTE concept was extended by the introduction of RTE types such as default, installed, enabled, dummy, etc. An ordinary ARC5 RTE corresponds to an installed user-defined ARC6 RTE. Please read the documentation.
• all installed ARC5 RTEs (available in a directory) should be enabled explicitly (e.g. arcctl rte enable ENV/PROXY) to be used on an ARC6 CE.

Note: You can use wildcards to enable all RTEs you have in one shot: arcctl rte enable '*'

Accounting

In ARC 6.0, JURA accounting got a redesigned archive structure that introduces an archived-records database and comes with redesigned record republishing to accounting services, in addition to the already mentioned configuration changes. To make ARC 6.0-6.3 accounting compatible with the ARC5 JURA, an automatic archive conversion is done by an additional jura-archive-manager process run by A-REX.

Warning: The JURA archive conversion consumes CPU and I/O resources on your ARC CE. If you are upgrading and already have a lot of records in the archive, expect additional load during the conversion.

In ARC 6.4 the next generation accounting subsystem was introduced. Instead of archiving published records, it introduces a local SQLite database that stores A-REX accounting records for each job, for both on-site CE auditing and publishing.

Warning: If you want to keep old archived usage records from the ARC 5 release, it is advised to upgrade from ARC 5 to ARC 6.3 first, wait for the record conversion to complete, then update to ARC 6.4 or higher versions.

Retiring the EGIIS

All configuration blocks and services in ARC5 that correspond to registration to EGIIS are gone in ARC6. ARCHERY, the new DNS-based service endpoint catalogue for ARC, is replacing EGIIS and works top-to-bottom without explicit registration, i.e. nothing registration-related needs to be configured in the arc.conf of an ARC6 CE. If your CE had been previously registered in the country-level NorduGrid EGIIS, your ARC CE registration has already been migrated to the nordugrid.org ARCHERY instance. If you find your CE missing (e.g. in the nordugrid top-level monitor) please contact the country-level ARCHERY instance admins as described in this document.

4.2.4 ARC CE Deployment Scenarios

To be written/WIP deployment guides:


WLCG Deployment with Data Capabilities

Note: WIP document!

Prerequisites

First follow the installation and configuration guide: ARC 6 installation guide.

Install necessary packages

CA certificates

Install the CA certificates needed for WLCG (you have already installed the classic IGTF CA certificates in the prerequisite step):

[root ~]# arcctl deploy igtf-ca mics slcs

Install and set up fetch-crl

To keep your CAs' revocation lists up to date you need to install and set up the fetch-crl tool. This is required for a WLCG site. Install the package:

[root ~]# yum install fetch-crl

Enable and start fetch-crl:

[root ~]# systemctl enable fetch-crl-boot
[root ~]# systemctl enable fetch-crl-cron --now
[root ~]# systemctl start fetch-crl-boot
[root ~]# systemctl start fetch-crl-cron

For older pre-systemd distributions (e.g. RHEL 6) use:

[root ~]# chkconfig fetch-crl-boot on
[root ~]# chkconfig fetch-crl-cron on
[root ~]# service fetch-crl-cron start

If you want to edit the times at which fetch-crl-cron runs, have a look in the cron configuration file /etc/cron.d/fetch-crl. The default settings should be good to get you going. For more advanced options, please see the fetch-crl man pages.

Prepare for voms service signature authentication

If your arc.conf contains authgroups using voms like

[authgroup:atlas-jobs]
voms = atlas ****

Then you must install the corresponding VOMS directory in the following way:

arcctl deploy voms-lsc -e atlas

arcctl will search the EGI database for the VO in question and install the necessary files in the vomsdir. The default vomsdir is /etc/grid-security/vomsdir, and the VO folder in this example will be /etc/grid-security/vomsdir/atlas.

Configure ARC datastaging and cache

The data staging service is enabled by adding the [arex/data-staging] block to arc.conf. A minimal configuration with a logfile enabled could look like:

[arex/data-staging]
logfile=/var/log/arc/datastaging.log

See [arex/data-staging] for other options, especially preferredpattern and deliveryservice in case you have one or more remote delivery service machines set up. Example cache configuration:

[arex/cache]
cachedir=/grid/cache01
cachedir=/grid/cache02

[arex/cache/cleaner]
cachesize=90 80
cachelifetime=50d
calculatesize=filesystem

Runtime environments

[root ~]# arcctl rte enable ENV/PROXY

ATLAS requires a dummy ATLAS-SITE RTE; create and enable it like this:

[root ~]# arcctl rte enable --dummy APPS/HEP/ATLAS-SITE

Singularity from cvmfs on compute nodes

For Singularity to work from cvmfs on the compute nodes (which it must), you need to run the following on all compute nodes:

echo "user.max_user_namespaces = 15000" > /etc/sysctl.d/90-max_user_namespaces.conf
sysctl -p /etc/sysctl.d/90-max_user_namespaces.conf

Configure the site to work with ARC Control Tower (aCT)

For the ARC data-staging mode to work, the site needs to receive jobs from aCT instead of receiving pilot jobs directly from the job-provider (e.g. PanDA).

WLCG Deployment for pilot sites

ARC Deployment beyond the WLCG scope

4.2.5 Tuning tips for ARC and FAQ


Note: Work-in-progress!

CA certificates

The ENV/PROXY RTE by default makes a copy of the CA certificates in each job's session directory. Copying many small files to a shared file system is potentially expensive, so it is preferred to have the CA certificates locally installed on each worker node and to turn off the copying in ENV/PROXY with:

arcctl rte params-set ENV/PROXY COPY_CACERT_DIR No

The CA certificate packages can be found in the repositories for ARC (http://www.nordugrid.org/documents/arc6/common/repos/repository.html). You should also install the fetch-crl package and enable its services as you would on the ARC CE.

File-system tuning

If you have your session directories and/or cache directories on NFS, make sure you have increased the number of NFS kernel threads. The default is 8, which is too low. The RPCNFSDCOUNT variable should be increased to 32, or maybe even 64 or larger, depending on your system (number of worker nodes, number of cores on the NFS servers).
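On RHEL-style systems this is typically done in the NFS server's sysconfig file, as in this sketch (the file location and mechanism vary by distribution and NFS version):

# /etc/sysconfig/nfs
RPCNFSDCOUNT=32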

How to extract backtrace of core dump file

How to extract a backtrace of an ARC core dump file using gdb and save the output to a file:

gdb
(gdb) set logging file gdb.log
(gdb) set logging on
(gdb) thread apply all bt

Example, assuming ARC is installed in the default location:

gdb /usr/sbin/arched /var/log/arc/arccore/core.30547
(gdb) set logging file gdb.log
(gdb) set logging on
(gdb) thread apply all bt

How to extract backtraces using a-rex-backtrace-collect

a-rex-backtrace-collect processes the core file(s) collected in the ARC_LOGS_DIR/arccore folder and produces their backtraces. The backtrace(s) are stored in files with the .backtrace suffix. The ARC installation location can be adjusted using the ARC_LOCATION environment variable. The location of the configuration file can be specified using the ARC_CONFIG environment variable.

a-rex-backtrace-collect

4.2.6 Operating ARC CE Subsystems


ARC6 Packages

Table 4.1: List of ARC 6 binary packages

Package info | ARC 6 package name | Block name¹ | ARC 5 package name
Base package holding common files | nordugrid-arc | N/A | nordugrid-arc
ARC Hosting Environment Daemon | nordugrid-arc-hed | N/A | nordugrid-arc-hed
ARC Resource-coupled EXecution service (A-REX) | nordugrid-arc-arex | [arex] | nordugrid-arc-arex
ARC Candypond Service | part of nordugrid-arc-arex | [arex/ws/candypond] | nordugrid-arc-candypond
ARC GridFTP Server | nordugrid-arc-gridftpd | [gridftpd] | nordugrid-arc-gridftpd
ARC LDAP-based Information Services² | nordugrid-arc-infosys-ldap | [infosys/ldap] | nordugrid-arc-aris, nordugrid-arc-ldap-infosys
ARC Data Delivery Service (DDS) | nordugrid-arc-datadelivery-service | [datadelivery-service] | nordugrid-arc-datadelivery-service
ARC Cache Index (ACIX) - Core | nordugrid-arc-acix-core | N/A | nordugrid-arc-acix-core
ARC Cache Index (ACIX) - Scanner | nordugrid-arc-acix-scanner | [acix-scanner] | nordugrid-arc-acix-cache
ARC Cache Index (ACIX) - Index | nordugrid-arc-acix-index | [acix-index] | nordugrid-arc-acix-index
The nordugridmap tool | nordugrid-arc-nordugridmap | [nordugridmap] | nordugrid-arc-gridmap-utils
ARC development files | nordugrid-arc-devel | N/A | nordugrid-arc-devel
Python 2 bindings for ARC | python2-nordugrid-arc | N/A | python2-nordugrid-arc
Python 3 bindings for ARC | python3-nordugrid-arc | N/A | python3-nordugrid-arc
ARC command line clients | nordugrid-arc-client | own config | nordugrid-arc-client
ARC test tools | nordugrid-arc-test-utils | N/A | nordugrid-arc-misc-utils³
ARC LDAP monitor web application | nordugrid-arc-monitor | N/A | nordugrid-arc-ldap-monitor
ARC base plugins (MCCs and DMCs) | nordugrid-arc-plugins-needed | N/A | nordugrid-arc-plugins-needed
ARC Globus plugins | nordugrid-arc-plugins-globus | N/A | nordugrid-arc-plugins-globus
ARC xrootd plugins | nordugrid-arc-plugins-xrootd | N/A | nordugrid-arc-plugins-xrootd
ARC GFAL2 plugins⁴ | nordugrid-arc-plugins-gfal | N/A | nordugrid-arc-plugins-gfal
ARC S3 plugins | nordugrid-arc-plugins-s3 | N/A | nordugrid-arc-plugins-s3
ARC Internal plugin | nordugrid-arc-plugins-internal | N/A | N/A
ARCHERY administration tool | nordugrid-arc-archery-manage | N/A | N/A
A-REX Python LRMS backends | nordugrid-arc-python-lrms | N/A | N/A
ARC optional worker nodes components [6.2] | nordugrid-arc-wn | N/A | N/A

List of packages deprecated in ARC6:
• nordugrid-arc-ws-monitor
• nordugrid-arc-arcproxyalt
• nordugrid-arc-ca-utils
• nordugrid-arc-egiis
• nordugrid-arc-java

1 Block names are used by the arcctl service enable --as-configured command to fetch the necessary packages automatically based on the blocks configured in arc.conf.
2 The package defines the LDAP/BDII/Glue-Schema dependencies and contains wrappers to start all this LDAP world. Infoproviders are in the A-REX package.
3 The no longer relevant saml_assertion_init tool has been removed.
4 Support for specific protocols is provided by separate 3rd-party GFAL2 plugin packages.


ARC6 Services

Table 4.2: List of ARC 6 services

Block name¹ | ARC 6 service name | Main process | ARC 5 service name
[arex] | arc-arex | arched | a-rex
[arex/ws/candypond] | started by arc-arex | arched | arc-candypond
[gridftpd] | arc-gridftpd | gridftpd | gridftpd
[infosys/ldap] | arc-infosys-ldap | slapd, bdii-update | nordugrid-arc-aris
[datadelivery-service] | arc-datadelivery-service | arched | arc-datadelivery-service
[acix-scanner] | arc-acix-scanner | twistd | acix-cache
[acix-index] | arc-acix-index | twistd | acix-index

Authorization, Mapping and Queue selection rules

Overview

ARC CE authorization and mapping rules rely on the concept of authgroups (configured by [authgroup] blocks).

Fig. 4.1: General overview of authorization and mapping concepts in ARC6

During the connection establishment and authentication process on any of the ARC CE interfaces (e.g. job submission in the figure), authgroup matching happens. Each authgroup represents a set of users, whose identities are matched against the configured rules. The same user can match (belong to) several authgroups. If configured with the allowaccess and denyaccess options, per-interface authorization rules will be enforced. These rules define the list of authgroups that are granted or denied access to the particular ARC CE interface. Without per-interface authorization rules configured, access to the interface is granted to anyone who passes the authentication process.

1 Block names are used by the arcctl service start --as-configured command to start/stop the necessary services automatically based on the blocks configured in arc.conf


Warning: A user will be successfully authenticated only if the certificate and proxy certificate pass validation. In case of time synchronization problems, missing or invalid CA certificates and CRLs, missing VOMS LSC files, etc., the connection WILL NOT be established.

Mapping rules in ARC6 also rely on authgroup membership. The configured mapping rules are processed sequentially and define the OS account ID that will be assigned to authgroup members. Several mapping methods are available, depending on requirements. Notice that in ARC6 authorization rules can also be enforced per queue. This uses the same allowaccess and denyaccess option syntax as the per-interface configuration but can additionally restrict access to the particular queue. By default access to the queue is granted to all users authorized on the interface level.

Defining authgroups

Authgroups should be defined at the top of arc.conf using the [authgroup:groupname] block before they are referenced in other parts of the configuration. Each authgroup is a named object that will be referenced by its name during authorization and mapping rule configuration. There are no special restrictions on authgroup names except the absence of spaces, so you can even define a * authgroup to blow the mind of other arc.conf readers. Each config line in the [authgroup:groupname] block represents a matching rule; the rules are processed sequentially. When the matching criteria of a rule are satisfied by the user identity, the processing stops within this authgroup. Whether the user belongs to this authgroup or not is defined by the type of rule that was matched: there are positively and negatively matching rules. By default all rules are positive (the user IS a member of the authgroup when matched) unless prefixed with a - sign. Matching one of the authgroups does not mean that the same user is not processed for the next authgroup. All authgroups are evaluated, even if a user already has a match with one of the earlier processed authgroups.

Note: Notice that:
• authgroup blocks should be defined before referencing!
• authgroup rules within blocks are order-dependent!
• all authgroup blocks are evaluated!

The complete list of authgroup rules can be found in the ARC Configuration Reference document. Some examples are:

[authgroup: norduguest]
-file= /etc/grid-security/banned.dns
voms= nordugrid Guests **

[authgroup: admin]
subject= /O=Grid/O=Big VO/CN=Main Admin

[authgroup: lcas]
plugin= 10 /usr/libexec/arc/arc-lcas %D %P liblcas.so /usr/lib64 /etc/lcas/lcas.db

[authgroup: any]
authgroup= norduguest
authgroup= admin
authgroup= lcas

[authgroup: *]
all= yes

Applying authorization rules

You can enforce authorization restrictions on every interface. This includes:
• the EMI-ES and REST interfaces configured with the [arex/ws/jobs] block
• the GridFTP job submission interface configured with the [gridftpd/jobs] block
• the GridFTP trivial storage interface configured with the [gridftpd/filedir] block
In addition to interface-level authorization, queue-level authorization can be configured using the same configuration approach. The allowaccess configuration option defines that the specified authgroup members are authorized to access the ARC CE via this interface or to access the particular queue. The related config option denyaccess can in turn be used to reject access. Multiple allowaccess and denyaccess authorization statements are allowed within a configuration block. These statements are processed sequentially, in the order they are specified in the config block. The processing stops on the first allowaccess or denyaccess statement matching the authgroup membership. If no authorization statements are specified, then no additional restrictions are applied for authorizing user access and the interface or queue is open to everybody authenticated.

Note: A default-deny authorization approach can be configured using an empty authgroup.

Example: ARC6 zero configuration

In the shipped zero configuration the [authgroup: zero] is defined and applied to the A-REX WS interface. The effect of this configuration is to allow access to the CE only to the subjects stored in the testCA.allowed-subjects file. This file is empty by default and closes down CE access until subjects are added by arcctl test-ca usercert.

[authgroup:zero]
file= /etc/grid-security/testCA.allowed-subjects

[arex/ws/jobs]
allowaccess= zero

Example: subject-based authorization

To authorize users based on the certificate subject, the subject or file rules can be used. The file option supports both:
• a plain list of subjects (each line contains only a subject name),
• the grid-mapfile format, where the subject name is followed by the mapped account ID.
In both cases the subject name should be quoted if it contains spaces.


[authgroup: banana]
subject= /O=Grid/O=Bad Users/CN=The Worst

[authgroup: boss]
subject= /O=Grid/O=Big VO/CN=Main Boss

[authgroup: dnfromfile]
file= /etc/grid-security/local_users

[gridftpd/jobs]
denyaccess= banana
allowaccess= boss
allowaccess= dnfromfile

Example: VOMS-based authorization

To filter access based on VOMS certificate attributes, define one or more [authgroup] blocks using the voms keyword. To verify VO membership signatures, the ARC CE needs the so-called list of certificates (LSC) files, which can be installed by arcctl. Example configuration for the atlas, alice and swegrid.se VOs¹:
1. Deploy LSC files:

[root ~]# arcctl deploy voms-lsc atlas --egi-vo
[root ~]# arcctl deploy voms-lsc alice --egi-vo
[root ~]# arcctl deploy voms-lsc swegrid.se --voms vomss://voms.ndgf.org:8443

2. Create the authorization groups and apply access restrictions to the interfaces and/or queues in arc.conf:

[authgroup: atlas]
voms= atlas ***

[authgroup: alice]
voms= alice ***

[authgroup: swegrid]
voms= swegrid.se ***

[authgroup: all]
authgroup= atlas
authgroup= alice
authgroup= swegrid

[gridftpd/jobs]
allowaccess= all

[arex/ws/jobs]
allowaccess= all

[queue: qlhc]
allowaccess= alice
allowaccess= atlas

[queue: qswegrid]
allowaccess= swegrid

1 In this example and in what follows, a simplified configuration is shown. An actual configuration will in most cases include different authgroups for different VO groups and roles.


Configure mapping

Any grid user should be mapped to a local account to start processes and access files. The mapping rules configured in the [mapping] block define which grid users (specified by authgroup) are mapped to which system accounts (several mapping methods are available). Rules in the [mapping] block are processed in sequence, in line order of the configuration file (from top to bottom). There are two kinds of rules available:
• mapping rules (starting with map_) that define how the particular authgroup members are mapped,
• policy rules (starting with policy_) that modify the processing of the mapping rule sequence.
The default policy for mapping rule processing is:
• processing CONTINUES to the next rule if the identity of the user DOES NOT match the authgroup specified in the rule (can be redefined with the policy_on_nogroup option)
• processing STOPS if the identity of the user matches the authgroup specified in the mapping rule. Depending on whether this mapping rule returns a valid UNIX identity, the behaviour can be redefined with the policy_on_map and policy_on_nomap options.
The policy can be redefined at any point of the configuration sequence and affects all mapping rules defined after the policy rule.

Warning: If the mapping process STOPS and there is still no local UNIX identity identified, the account running A-REX will be used (typically root, unless redefined by the user option for a specific deployment case). When a grid identity is mapped to the root account, request processing fails implicitly!

Example: mapping to the same account

The map_to_user option allows mapping all authgroup members to the same account, specified as an argument. For example, in the shipped zero configuration all users that match the authgroup zero are mapped to the same nobody account (and nobody group), which works with local job forking:

[mapping]
map_to_user= zero nobody:nobody

Example: mapping to the accounts pool

The most secure and flexible way is to map authgroup members to account pools (the so-called map_to_pool method). It is recommended to use pool mapping when the resource is used by different communities. In this approach, every member of the specified authgroup is dynamically mapped to one of the available accounts in the configured pool. Available pool account names are stored one per line in the pool file inside the dedicated directory. Accounts from the pool are assigned using a leasing approach. All leased accounts are stored in other files placed in the same directory. They can be reassigned to other users after 10 days of inactivity. Example configuration for atlas:
1. Create the necessary number of accounts to be used on the ARC CE and the Worker Nodes of the cluster.
2. Define the ARC accounts pool:


[root ~]# mkdir -p /etc/grid-security/pool/atlas
[root ~]# for u in atlas{001..100}; do echo $u >> /etc/grid-security/pool/atlas/pool; done

3. Configure the mapping in arc.conf²:

[mapping]
map_to_pool= atlas /etc/grid-security/pool/atlas

Example: Legacy grid-mapfile based mapping

Warning: Legacy grid-mapfile based mapping is NOT recommended for the typical production loads.

In the grid-mapfile approach users are mapped to local accounts based on certificate DNs only. Mapping rules are stored line-by-line in the so-called grid-mapfile that describes which user is mapped to which account, for example:

"/O=Grid/O=NorduGrid/OU=uio.no/CN=Aleksandr Konstantinov" user1 "/O=Grid/O=NorduGrid/OU=hep.lu.se/CN=Oxana Smirnova" user2

In the simplest legacy case ARC can use the grid-mapfile for both authorization and mapping decisions. Example configuration for legacy grid-mapfile case:

[authgroup: legacy]
file= /etc/grid-security/grid-mapfile

[mapping]
map_with_file= legacy /etc/grid-security/grid-mapfile

Grid-mapfiles in arc.conf can also be referenced as [userlist] objects and can be generated regularly, keeping them up to date (e.g. from a VOMS database), with the nordugridmap utility, which can be used and configured via the [nordugridmap] block.

Note: You can find more information about moving from grid-mapfiles in the The life without gridmapfiles presentation.

Example: mapping with external LCMAPS rules

ARC can run an external plugin to map users, configured with the map_with_plugin option. To support several production loads, ARC ships with a built-in LCMAPS plugin included in the A-REX package:

[authgroup:all]
all= yes

[mapping]
map_with_plugin= all 30 /usr/libexec/arc/arc-lcmaps %D %P liblcmaps.so /usr/lib64 /etc/lcmaps/lcmaps-arc-argus.db arc

LCMAPS itself is a third-party tool that should be installed and configured separately, which is beyond the scope of this guide.

2 atlas is the name used in [authgroup: atlas]


Queue selection rules and queue configuration

Note: Behaviour new in ARC 6.11

Once a submitter is authorized to submit to one of the submission interfaces defined in arc.conf, the next step for ARC is to select a queue and check authorisation at queue level. The logic is as follows:
1. if a queue is defined in the job description xRSL:
• submit to that queue without modification, if authorised in that queue block
2. else if no queue is defined in the job description xRSL but a default queue is defined in arc.conf:
• substitute the default queue into the xRSL, if authorised in that queue block
3. else if no queue is defined in the job description xRSL, no default queue is defined in arc.conf, and the VO is authorised in one of the arc.conf queues*:
• substitute into the xRSL the first of the queues where the VO is authorised according to arc.conf
4. else:
• reject
* any VO is authorised in a queue that has no authorisation defined at all.
Note that this means you must ensure that the authgroups you wish to allow submission from are authorised in the queue you intend the authgroup to submit to. An example of an arc.conf setup that previously could result in the LRMS selecting the queue, but where submission today will be rejected, could be the following:

[authgroup:exp1]
...

[authgroup:exp2]
...

[mapping]
...

[queue:queue1]
allowaccess= exp1

[queue:queue2]
allowaccess= exp1

Let's look at the exp2 authgroup. According to this arc.conf we assume the authorisation and mapping are ok.
Old behaviour: If there is no queue information in the xRSL, the old behaviour would be to pass the job to the LRMS directly and let the LRMS select the queue. This means that any queue in the system, even a queue not defined in arc.conf, could be selected, e.g. a queue3 defined by the LRMS but not in arc.conf.
New behaviour: From ARC version 6.11 only queue1 and queue2 will be possible queues for exp2 (and any authgroup), as they are the ones defined in arc.conf. Therefore, in the example, none of the queues are allowed for exp2 and the job will be rejected.
A possible rewrite of this arc.conf example snippet, to allow exp2 to submit to queue2, could be:

[queue:queue1]
allowaccess= exp1

[queue:queue2]


This is an example of rule 3: no queue is defined in the job description xRSL, no default queue is defined in arc.conf, and the VO is authorised in (at least) one of the arc.conf queues. In this example all VOs are authorised to use queue2.
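For comparison, a job falling under rule 1 simply names the queue explicitly in its description. A minimal xRSL sketch (the executable name is hypothetical) could be:

&(executable="run.sh")
 (queue="queue2")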

Batch systems support

Overview

A-REX has to be interfaced to the LRMS in order to be able to submit jobs and query their information. A-REX supports several Local Resource Management Systems (LRMS), with which it interacts via several backend scripts.
A-REX assumes that the LRMS has one or more queues, where a queue is a set of (usually homogeneous) worker nodes grouped together. Different LRMSes have different concepts of queues (or have no queues at all). Nevertheless, in the A-REX configuration, the machines of the LRMS should be mapped to A-REX queues. The client-side job submission tools query the information system for possible places to submit the jobs, where each queue on a CE is represented as an execution target and treated separately.
Configuring A-REX to use one of these LRMS backends typically involves the following steps:
1. Sharing directories between A-REX, the LRMS frontend and its worker nodes. This might involve the setup of shared filesystems such as NFS or similar.
2. Configuring the [lrms] block and [queue] blocks in arc.conf to match the LRMS setup.
3. Configuring A-REX with respect to the shared scratch directories configuration.

Fig. 4.2: The LRMS frontend and the nodes sharing the session directory and the local users

General LRMS configuration

In the [lrms] block the name of the LRMS has to be specified with the lrms option.


The supported LRMSes are:
• fork - fork jobs on the ARC CE host node, not a cluster. Targeted for testing and development, not for real production workloads.
• condor - uses an HTCondor-powered HTC resource
• slurm - for SLURM clusters
• pbs - any flavour of PBS batch system, including Torque and PBSPro
• ll - LoadLeveler batch system
• lsf - Load Sharing Facility batch system
• sge - Oracle Grid Engine (formerly Sun Grid Engine)
• boinc - works as a gateway to BOINC volunteer computing resources
• slurmpy - new experimental SLURM backend written in Python (requires the nordugrid-arc-python-lrms package to be installed).
Each LRMS has its own specific configuration options that are prefixed with the LRMS name in the [lrms] block. Besides these specific options, the behaviour of the LRMS backend is affected by the storage areas and limits setup, in particular:
• tmpdir - defines the path to the directory for temporary files on the worker nodes
• shared_filesystem, scratchdir and shared_scratch - change the way A-REX stores the jobs' data during processing. More details can be found in the Job scratch area document.
• defaultmemory (in the [lrms] block and per-queue) - sets the memory limit values for jobs that have no explicit requirements in the job description.
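As an illustration of how these options combine, a minimal [lrms] block sketch is shown below; the values are placeholders rather than recommendations:

[lrms]
lrms= slurm
defaultmemory= 2048
tmpdir= /tmp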

Accounting considerations

A-REX has several approaches to collecting accounting information:
• using measurements of memory and CPU on the WNs (starting from the 6.2 release)
• using measurements from the GNU Time utility that wraps the job executable invocation inside the job script
• using data provided by the LRMS
Depending on the LRMS type in use, different kinds of information are available and/or missing in the LRMS accounting subsystem. It is recommended to use the cgroups or GNU Time methods to have reliable resource measurements in all cases. You can find more details in the Measuring accounting metrics of the job document.

Fork Backend

The Fork back-end is a simple back-end that interfaces to the local machine, i.e. there is no batch system underneath. It simply forks the job, hence the name. The back-end then uses standard POSIX commands (e.g. ps or kill) to manage the job. Fork is the default backend used in the ARC pre-shipped zero configuration.

Recommended batch system configuration

Since fork is a simple back-end and does not use any batch system, there is no specific configuration needed for the underlying system. It still requires a queue definition, and the queue should be named fork.


Example:

[lrms]
lrms= fork

[queue:fork]

Known limitations

Since Fork is not a batch system, many of the queue-specific attributes and detailed job information are not available. Support for the Fork batch system was introduced so that quick deployments and testing of the middleware are possible without deploying a real batch system, since fork is available on every UNIX box. The Fork back-end is not recommended for production use. By its nature, the back-end has many limitations; for example, it does not support parallel jobs.

Portable Batch System (PBS)

The Portable Batch System (PBS) is one of the most popular batch systems for small clusters. PBS comes in many flavours such as OpenPBS (unsupported), Terascale Open-Source Resource and QUEue Manager (TORQUE) and PBSPro (currently owned by Altair Engineering). ARC supports all the flavours and versions of PBS.

Recommended batch system configuration

PBS is a very powerful LRMS with dozens of configurable options. Server, queue and node attributes can be used to configure the cluster's behaviour. In order to correctly interface PBS to ARC (mainly the information provider scripts) there are a couple of configuration REQUIREMENTS asked to be implemented by the local system administrator:
1. The computing nodes MUST be declared as cluster nodes (job-exclusive); at the moment time-shared nodes are not supported by the ARC setup. If you intend to run more than one job on a single processor then you can use the virtual processor feature of PBS.
2. For each queue, one of the max_user_run or max_running attributes MUST be set and its value SHOULD BE IN AGREEMENT with the number of available resources (i.e. don't set max_running = 10 if there are only six (virtual) processors in the system). If both max_running and max_user_run are set then obviously max_user_run has to be less than or equal to max_running.
3. For the time being, do NOT set server limits like max_running; please use queue-based limits instead.
4. Avoid using the max_load and the ideal_load directives. The Node Manager (MOM) configuration file (/mom_priv/config) should not contain any max_load or ideal_load directives. PBS closes down a node (no jobs are allocated to it) when the load on the node reaches the max_load value. The max_load value is meant for controlling time-shared nodes. In case of job-exclusive nodes there is no need for setting these directives; moreover, incorrectly set values can close down a node.
5. Routing queues are now supported in a simple setup where a routing queue has a single queue behind it. This leverages MAUI work in most cases. Other setups (i.e. two or more execution queues behind a routing queue) cannot be used within ARC correctly.
6. PBS server logs SHOULD BE shared with the ARC CE to allow backend scripts to check the job status and collect information needed for accounting. The path to the logs on the ARC CE is defined with the pbs_log_path option.
Additional useful configuration hints:
• If possible, please use queue-based attributes instead of server-level ones.


• The acl_user_enable = True attribute may be used with the acl_users = user1,user2 attribute to enable user access control for the queue.
• It is advisable to set the max_queuable attribute in order to avoid a painfully long dead queue.
• Node properties from the /server_priv/nodes file together with the resources_default.neednodes attribute can be used to assign a queue to a certain type of node.
Checking the PBS configuration:
• The node definition can be checked with pbsnodes -a. All the nodes MUST have ntype=cluster.
• The required queue attributes can be checked with qstat -f -Q queuename. There MUST be a max_user_run or a max_running attribute listed with a REASONABLE value.
Example:

[lrms]
lrms= pbs
defaultmemory= 512
pbs_log_path= /net/bs/var/log//server_logs

[queue:grid_rt]
comment= Realtime queue for infrastructure testing
allowaccess= ops
advertisedvo= ops

[queue:alien]
comment= Dedicated queue for ALICE
allowaccess= alice
advertisedvo= alice
defaultmemory= 3500

Known limitations

Some of the limitations were already mentioned in the PBS deployment requirements above. The main shortcomings are the lack of support for general routing queues, the difficulty of treating overlapping queues, and the complexity of node string specifications for parallel jobs.

SLURM

SLURM is an open-source (GPL) resource manager designed for Linux clusters of all sizes. It is designed to operate in a heterogeneous cluster with up to 65,536 nodes. SLURM is actively being developed, distributed and supported by Lawrence Livermore National Laboratory, Hewlett-Packard, Bull, Cluster Resources and SiCortex.

Recommended batch system configuration

The backend should work with a normal installation using only SLURM or SLURM+MOAB/MAUI. Do not keep nodes with different amounts of memory in the same queue. For production use cases it is recommended to enable the slurm_use_sacct option. Example:

[lrms]
lrms=slurm
slurm_use_sacct=yes
defaultmemory=4096

[queue:normal]
comment=Queue for grid jobs
architecture=x86_64
totalcpus=1500

Using Python LRMS backend implementation

An experimental Python LRMS backend can be used for SLURM after installing the nordugrid-arc-python-lrms package. The Python backend is selected by specifying the slurmpy name in the lrms option. This backend supports the same set of options as the classic SLURM backend script, but additionally allows connection over SSH when the [lrms/ssh] block is enabled and configured.
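A minimal sketch of selecting the Python backend, reusing the SLURM options shown above (SSH-related options of the [lrms/ssh] block are omitted here):

[lrms]
lrms= slurmpy
slurm_use_sacct= yes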

Known limitations

If you have nodes with different amounts of memory in the same queue, this will lead to miscalculations. If SLURM is stopped, jobs on the resource will get canceled, not stalled. The SLURM backend has only been tested with SLURM 1.3; it should, however, work with 1.2 as well.

HTCondor

The HTCondor system, developed at the University of Wisconsin-Madison, was initially used to harness free CPU cycles of workstations. Over time it has evolved into a complex system with many grid-oriented features. Condor is available on a large variety of platforms.

Recommended batch system configuration

Install HTCondor on the A-REX node and configure it as a submit machine. Next, add the following to the node’s Condor configuration (or define CONDOR_IDS as an environment variable):

CONDOR_IDS= 0.0

CONDOR_IDS has to be 0.0, so that Condor will be run as root and can then access the Grid job's session directories (needed to extract various information from the job log). Make sure that no normal users are allowed to submit Condor jobs from this node. If normal user logins are not allowed on the A-REX machine, then nothing needs to be done. If for some reason users are allowed to log into the A-REX machine, simply don't allow them to execute the condor_submit program. This can be done by putting all local Unix users allocated to the grid in a single group, e.g. griduser, and then setting the file ownership and permissions on condor_submit like this:

[root ~]# chgrp griduser $condor_bin_path/condor_submit
[root ~]# chmod 750 $condor_bin_path/condor_submit

Example:

[lrms]
lrms= condor
defaultmemory= 2000

[queue:EL7]
comment= EL7 queue
defaultmemory= 3000
nodememory= 16384
condor_requirements= (Opsys == "linux") && (OpSysMajorVer == 7)


Known limitations

Only the Vanilla universe is supported. The MPI universe (for multi-CPU jobs) is not supported, and neither is the Java universe (for running Java executables). ARC can only send jobs to Linux machines in the Condor pool, therefore excluding Windows and other destinations.

LoadLeveler

LoadLeveler (LL), or Tivoli Workload Scheduler LoadLeveler in full, is a parallel job scheduling system developed by IBM.

Recommended batch system configuration

The back-end should work fine with a standard installation of LoadLeveler. For the back-end to report the correct memory usage and CPU time spent while running, LoadLeveler has to be set up to show this data in the llq command. Normally this is turned off for performance reasons. It is up to the cluster administrator to decide whether or not to publish this information; the back-end will work whether or not it is turned on.

Known limitations

There is at the moment no support for parallel jobs on the LoadLeveler back-end.

LSF

Load Sharing Facility (or simply LSF) is a commercial computer software job scheduler sold by Platform Computing. It can be used to execute batch jobs on networked Unix and Windows systems on many different architectures.

Recommended batch system configuration

Set up one or more LSF queues dedicated to access by grid users. All nodes in these queues should have a resource type which corresponds to that of the frontend and which is reported to the outside. The resource type needs to be set properly in the lsb.queues configuration file. Be aware that LSF distinguishes between 32 and 64 bit for Linux. For a homogeneous cluster, the type==any option is a convenient alternative. In lsb.queues set one of the following:

RES_REQ= type==X86_64
RES_REQ= type==any

See the -R option of the bsub command man page for more explanation. The lsf_profile_path option must be set to the filename of the LSF profile that the back-end should use.
Furthermore, it is very important to specify the correct architecture for a given queue in arc.conf. Because the architecture flag is rarely set in the xRSL file, the LSF back-end will automatically set the architecture to match the chosen queue. LSF's standard behaviour is to assume the same architecture as the frontend. This will fail, for instance, if the frontend is a 32-bit machine and all the cluster resources are 64-bit. If this is not done, the result will be jobs being rejected by LSF because LSF believes there are no useful resources available.
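Putting this together, a minimal configuration sketch could look like the following; the profile path and the queue name are placeholders that must be adapted to the local LSF installation:

[lrms]
lrms= lsf
lsf_profile_path= /usr/share/lsf/conf/profile.lsf

[queue:gridlong]
architecture= x86_64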


Known limitations

Parallel jobs have not been tested on the LSF back-end. The back-end does not at present support reporting a different number of free CPUs per user.

SGE

Sun Grid Engine (SGE, Codine) is an open-source batch system maintained by Sun (Oracle). It is supported on Linux and Solaris, in addition to numerous other systems.

Recommended batch system configuration

Set up one or more SGE queues for access by grid users. Queues can be shared by normal and grid users. If it is desired to set up more than one ARC queue, make sure that the corresponding SGE queues have no shared nodes among them. Otherwise the counts of free and occupied CPUs might be wrong. Only SGE versions 6 and above are supported. You must also make sure that the ARC CE can run qacct, as this is used to supply accounting information. Example:

[lrms]
lrms= sge
sge_root= /opt/n1ge6
sge_bin_path= /opt/n1ge6/bin/lx24-x86

[queue: long]
sge_jobopts= -P atlas -r yes

Known limitations

Multi-CPU support is not well tested. All users are shown with the same quotas in the information system, even if they are mapped to different local users. The requirement that one ARC queue maps to one SGE queue is too restrictive, as SGE's notion of a queue differs widely from ARC's definition. The flexibility available in SGE for defining policies is difficult to accurately translate into NorduGrid's information schema. The closest equivalent of nordugrid-queue-maxqueuable is a per-cluster limit in SGE, and the value of nordugrid-queue-localqueued is not well defined if pending jobs can have multiple destination queues.

BOINC

BOINC is an open-source software platform for computing using volunteered resources. Support for BOINC in ARC is currently at the development level, and using it may require editing the source code files to fit each specific project.

Recommended batch system configuration

The BOINC database can be local to the ARC CE or remote. Read-access is required from the ARC CE to check for finished jobs and gather information on available resources. The ARC CE must be able to run commands in the project’s bin/ directory. Project-specific variables can be set up in an RTE which must be used for each job. The following example shows the variables which must be defined to allow job submission to BOINC for the project “example” to work:


export PROJECT_ROOT="/home/boinc/project/example"    # project directory
export BOINC_APP="example"                           # app name
export WU_TEMPLATE="templates/example_IN"            # input file template
export RESULT_TEMPLATE="templates/example_OUT"       # output file template
export RTE_LOCATION="$PROJECT_ROOT/Input/RTE.tar.gz" # RTEs, see below

The last variable is a tarball of runtime environments required by the job.

Known limitations

The BOINC back-end was designed around projects that use virtualisation. The prototype implementation in the current ARC version may not be generic enough to suit all BOINC projects.
When preparing a BOINC job, the ARC CE copies a tarball of the session directory to the BOINC project download area. Once the job is completed and the output uploaded to the BOINC project upload area, a modified assimilator daemon must be used to copy the result back to the session directory so that it can be retrieved by ARC clients or uploaded to Grid storage by the ARC CE.

ARC Information System

TBD

ARC CE Data Staging and Caching

RunTime Environments in ARC6

Understanding RunTime Environments

The ARC Computing Element is a front-end to various heterogeneous resource providers. To run jobs on a particular resource provider there is always a set of software or workflow-specific paths, tools, libraries, environment variables or even dynamic content that should be recreated in the job context.
To provide a flexible way of tuning the job runtime environment, ARC enforces the concept of the RunTime Environment (RTE). ARC RunTime Environments (RTEs) provide two features:
Advertising - indicate the available environment to be requested by end-users
Modifying job environment - flexibly contextualize the job execution environment

Advertising RTEs

Advertising RTEs provide user interfaces to application software and other resources in a way that is independent of the details of the local installation of the application and computing platform (OS, hardware, etc.).
They address setups typically required by large research groups or user bases dealing with a common set of software. The actual implementation of a particular RTE may differ from site to site as necessary. However, it should be designed so that resource providers with different accounting, licence or other site-specific implementation details can advertise the same application interface (RTE) for all users. Despite possibly different parameters or implementation, the same software addressed by the same RTE name should be known to the community.
It is also supported to add RTE versioning at the end of the RTE name (after a dash). The RTE version will be used for resource matchmaking along with the RTE name. For example, to request ENV/PROXY and APPS/HEP/ATLAS with version 20.1.0.1 or greater in the xRSL job description:


(runTimeEnvironment="ENV/PROXY") (runTimeEnvironment>="APPS/HEP/ATLAS-20.1.0.1")

It is always up to the local system administrators to decide whether to install and enable a particular RTE or not.

Modifying job environment

The RTE content itself is a BASH script that can run arbitrary code during the job life cycle. The first argument of the RTE script indicates the so-called RTE stage. If the job description specifies additional arguments for the corresponding RTEs, those are appended starting at the second position.

Fig.: RTE stages in the job lifecycle on the ARC CE and the Worker Node: default RTEs and RTEs from the job description are embedded into the job script; Stage 0 runs on the ARC CE during job script generation and batch job submission, while Stages 1 and 2 run on the Worker Node before and after the job processing.

There are 3 stages of an RTE execution:
Stage 0: The RTE script is sourced before the creation of the job's LRMS submission script. In this case the script is run by A-REX on the frontend (ARC CE), before the job is sent to the LRMS. Some environment variables are defined at this stage and can be changed to influence the job's execution later. TODO: list of grami attributes as a dedicated technical note
Stage 1: The embedded RTE function runs before the main job processing on the Worker Node under the LRMS. Such a stage can prepare the environment for some third-party software package. The current directory in this case is the one which would be used for execution of the job. The variable $HOME also points to this directory.
Stage 2: The embedded RTE function runs after the main job processing on the Worker Node under the LRMS. The main purpose is to clean up possible changes done by Stage 1 (like removing temporary files).


You can use this template to start writing a custom RTE script that fulfils your needs.

RunTime Environment script template

You can start with the following template to write a custom RTE script:

#
# description: My RTE description ('# description:' is a keyword)
#
###########################
###   RTE Parameters    ###
###########################
# NOTE! RTE parameters is an advanced feature to keep the same code,
# but make it customizable for different clusters.
# Most probably you DO NOT need params for a custom site-specific RTE.
# RTE parameters require:
# 1. Parameters description headers ('# param:' is a keyword):
# param:PARAM_1:string:DEFAULT_VALUE:Description of parameter. Predefined 'string' type means it can have arbitrary string value. Default is 'DEFAULT_VALUE'.
# param:PARAM_2:v1,v2,v3:v3:Description of parameter. This parameter can be set to 'v1', 'v2' or 'v3' value only. Default is 'v3'.
# These headers are used by arcctl to operate parameters.
# Should be defined within the first 20 lines of the RTE script.
#
# 2. Definition of parameters default values for shell
PARAM_1="${PARAM_1:-DEFAULT_VALUE}"
PARAM_2="${PARAM_2:-v3}"

###########################
###      RTE Code       ###
###########################
if [ "x$1" = "x0" ]; then
    # RTE Stage 0
    # You can do something on the ARC host at this stage.
    # Here is the right place to modify joboption_* variables.
    # Note that this code runs under the mapped user account already!
    # No root privileges!
    true  # placeholder keeps the otherwise empty branch valid shell
elif [ "x$1" = "x1" ]; then
    # RTE Stage 1
    # Do something on the WN before job execution.
    true
elif [ "x$1" = "x2" ]; then
    # RTE Stage 2
    # Do something on the WN after job execution.
    true
fi

Operating RunTime Environments

Since the ARC6 release, operating RunTime Environments has changed significantly and relies on the ARC Control Tool:


Fig.: Operating RTEs with arcctl: system-, user- and community-defined RTEs in the control directory can be enabled (advertised by the information system), set as default and parametrized; enabled and default RTEs are embedded into the job script submitted to the LRMS.

Installing RTE scripts

There is a set of system-defined RTEs pre-installed with the ARC CE packages that aim to fulfil common workflows.
An ARC CE administrator can add additional RTE directories (so-called user-defined RTEs). These additional places should be specified in arc.conf using the runtimedir configuration option in the [arex] block. Custom RTE scripts can be developed using this template as a starting point.
The community-defined RTEs are additional RTEs created by research communities. These RTEs can be provisioned to the ARC CE from trusted registries (including required software bundles) with the ARC Control Tool.

Note: In ARC6 directories with RTE script are local to ARC CE and SHOULD NOT be shared with worker nodes

The RTE names used for advertising are implied by the directory structure, e.g. in ENV/PROXY the ENV is the directory inside the system RTEs location and PROXY is the name of the file.
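For illustration, assuming the user-defined RTE location /etc/arc/rte shown later in this section, a hypothetical script placed as follows would be advertised under the RTE name APPS/MYSITE/MYAPP-1.0.0:

[root ~]# ls /etc/arc/rte/APPS/MYSITE/
MYAPP-1.0.0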

Enabling RTEs

Installed RTEs should be enabled to be advertised and used during the job submission.

By name

To enable a particular RTE by name run the following command:

arcctl rte enable ENV/PROXY

By path

Especially if you have several RTEs with the same name installed, you can choose the exact one by specifying the filesystem path:

arcctl rte enable /usr/share/arc/rte/ENV/PROXY


Using wildcards

To enable several RTEs you can pass as many names as you want to the arcctl command. Additionally you can use glob (man 7 glob) wildcards in RTE names. The following command will enable all APPS/HEP/ATLAS RTEs for SLC6 builds:

arcctl rte enable APPS/HEP/ATLAS-*-X86_64-SLC6-*

Note: It is advised to use wildcards to enable all user-defined RTEs during ARC5 to ARC6 upgrade

Dummy RTEs

In case you need RTEs only for advertising, with no need for script content, you can create a dummy RTE for the specified name. The following command enables an APPS/MYAPP RTE with empty content:

arcctl rte enable APPS/MYAPP --dummy

An example of a dummy RTE could be APPS/ATLAS-SITE, used by the ATLAS experiment for sites to advertise that this indeed is an ATLAS-SITE.

Default RTEs

Default RTEs address workflows where advertising and a request in the job description are not needed, but modification of every submitted job (adjusting memory, setting LRMS scratch, etc.) is required on the resource provider. Installed RTEs can be selected for default inclusion into the job lifecycle with the following ARC Control Tool command:

arcctl rte default ENV/LRMS-SCRATCH

This will transparently add ENV/LRMS-SCRATCH to each job; it will be executed the same way as enabled RTEs. A default RTE does not need to be enabled to be executed, but it can be enabled if one wants to publish it in addition to executing it.

Note: You can use the same by-name, by-path and wildcard techniques as for enabling

RTE Parameters

To handle the heterogeneity of resource providers, some RTEs can be parametrized. For example, the system-defined ENV/PROXY RTE that transfers the delegated proxy certificate to the worker node can optionally transfer the CA certificate directories. This optional part is controlled by the COPY_CACERT_DIR parameter. To check whether an RTE has parameters, and their default values, run:

[root ~]# arcctl rte params-get ENV/PROXY
COPY_CACERT_DIR=Yes

You can also see the description and allowed values by adding the --long keyword. To set an RTE parameter value, the following command is used:


arcctl rte params-set ENV/PROXY COPY_CACERT_DIR No

List available RTEs and their status

To view the summary of all installed, enabled and default RTEs run:

[root ~]# arcctl rte list
APPS/HEP/ATLAS-20.8.0-X86_64-SLC6-GCC48-OPT (user, enabled)
APPS/HEP/ATLAS-20.8.1-X86_64-SLC6-GCC48-OPT (user, enabled)
APPS/HEP/ATLAS-20.8.2-X86_64-SLC6-GCC49-OPT (user, enabled)
ENV/LRMS-SCRATCH (system, default)
ENV/PROXY (system, masked, disabled)
ENV/PROXY (user, enabled)
ENV/RTE (system, disabled)
ENV/RUNTIME/ALIEN-2.17 (user, enabled)
VO-biomed-CVMFS (dummy, enabled)

The first tag describes the RTE origin (system, user or dummy). The following tags show the status. The special masked keyword indicates that the RTE name is used more than once and by-name operations will apply to another RTE script. For example, ENV/PROXY will be enabled from the user-defined location, as the system-defined one is masked. However, it is possible to enable a masked RTE by specifying its path.
Listing a particular kind of RTEs (e.g. enabled) is possible with the appropriate argument (see ARC Control Tool for all available options):

[root ~]# arcctl rte list --enabled
APPS/HEP/ATLAS-20.8.2-X86_64-SLC6-GCC49-OPT
ENV/PROXY
ENV/RUNTIME/ALIEN-2.17
VO-biomed-CVMFS

The long listing provides detailed pointers to RTE locations and descriptions:

[root ~]# arcctl rte list --long
System pre-defined RTEs in /usr/share/arc/rte:
        ENV/PROXY              # copy proxy certificate to the job session directory
        ENV/RTE                # copy RunTimeEnvironment scripts to the job session directory
        ENV/LRMS-SCRATCH       # enables the usage of local to WN scratch directory defined by LRMS
User-defined RTEs in /etc/arc/rte:
        ENV/RUNTIME/ALIEN-2.17 # RTE Description is Not Available
        ENV/PROXY              # RTE Description is Not Available
Enabled RTEs:
        ENV/RUNTIME/ALIEN-2.17 -> /etc/arc/rte/ENV/RUNTIME/ALIEN-2.17
        ENV/PROXY -> /etc/arc/rte/ENV/PROXY
Default RTEs:
        ENV/LRMS-SCRATCH -> /usr/share/arc/rte/ENV/LRMS-SCRATCH

View RTE content

Dumping the content of an RTE that will be embedded into the job script is possible with the cat action:


[root ~]# arcctl rte cat ENV/LRMS-SCRATCH
SCRATCH_VAR="LOCALTMP"
# description: enables the usage of local to WN scratch directory defined by LRMS
# param:SCRATCH_VAR:string:WORKDIR:Variable name that holds the path to job-specific WN scratch directory

SCRATCH_VAR="${SCRATCH_VAR:-WORKDIR}"
if [ "x$1" = "x0" ]; then
    RUNTIME_LOCAL_SCRATCH_DIR="\${${SCRATCH_VAR}}"
fi

Disable and Undefault RTEs

Enabled RTEs can be disabled by running:

arcctl rte disable ENV/PROXY

The similar operation for default RTEs is called undefault:

arcctl rte undefault ENV/LRMS-SCRATCH

Note: You can use the same by-name, by-path and wildcard techniques as for enabling

System-defined RunTime Environments shipped with ARC

ENV/PROXY

Exports delegated credentials (proxy certificate) to the job's session directory. Optionally copies the CA certificates directory from the ARC CE to the session directory. Sets X509_USER_PROXY, X509_USER_CERT and X509_CERT_DIR to make the files instantly available to client tools.
Parameters:
• COPY_CACERT_DIR = Yes/No - If set to Yes, the CA certificate directory will be copied to the session directory along with the proxy certificate. Default is Yes.
• USE_DELEGATION_DB = Yes/No - If set to Yes, the RTE will try to extract the proxy certificate from the A-REX delegation DB (works in a limited number of cases). Default is No.

ENV/RTE

Copies RunTime Environment scripts to the job session directory, for workloads that require the files themselves instead of embedding the RTEs into the job script. Designed to be used as a default RTE. Has no parameters.

ENV/LRMS-SCRATCH

Many resource providers use scratchdir to move files to the local worker node disk before running the job.


When the local scratch directory is created dynamically by the LRMS (e.g. in the job prologue) and then cleaned up automatically after the job completion, the ENV/LRMS-SCRATCH RTE is needed. The scratch place should be indicated by some environment variable that holds a path to such an LRMS-defined scratch directory.
This RTE is designed to be used as a default RTE to enable this optional functionality; a usage sketch follows the note below. Parameters:
• SCRATCH_VAR = name - Variable name that holds the path to the job-specific WN scratch directory. Default is WORKDIR.
• TMPDIR_LOCATION = path - Define the TMPDIR path on the WN. Variable names can be used as a part of the path, e.g. '$WORKDIR/tmp'.

Note: The ENV/LRMS-SCRATCH is not needed if the scratch directory used and created on the worker node is of type //. It is only needed if the folder should be of type /, i.e. a LRMS defined path is included.
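As a usage sketch, setting the parameter and making the RTE default with the arcctl actions described above could look like this (LOCALTMP is just the example value appearing elsewhere in this section):

[root ~]# arcctl rte params-set ENV/LRMS-SCRATCH SCRATCH_VAR LOCALTMP
[root ~]# arcctl rte default ENV/LRMS-SCRATCH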

ENV/CONDOR/DOCKER

The ARC HTCondor backend supports submission to the Docker universe. This RTE enables the feature on demand. The RTE can be used by end-users when enabled. The RTE argument defines the Docker image name to be used, e.g.:

(runtimeenvironment="ENV/DOCKER" "debian")

The RTE can also be used as a default RTE to enforce Docker universe submission for any job. In that case the Docker image should be set with the RTE parameter.
Parameters:
• DOCKER_IMAGE = name - Docker image to use for submitted jobs by default.
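A sketch of the default-RTE case, reusing the arcctl actions described earlier (the debian image name is just the example value used above):

[root ~]# arcctl rte params-set ENV/CONDOR/DOCKER DOCKER_IMAGE debian
[root ~]# arcctl rte default ENV/CONDOR/DOCKER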

ENV/SINGULARITY

A general-purpose RTE that allows running the submitted job script inside a defined Singularity container image. This RTE is designed to be used as both a default and an enabled RTE. When enabled, a user can specify the Singularity image as an RTE argument:

(runtimeenvironment="ENV/SINGULARITY" "mysoftwareimage.sif")

Parameters:
• SINGULARITY_IMAGE = images - Defines the Singularity images to be used with a job. It accepts comma-separated vo:path pairs, where vo defines the virtual organization whose image is located at path. The special default value for vo defines the image used by default (if no specific image for the VO is defined). The special NULL value for path skips Singularity usage.
• SINGULARITY_OPTIONS = options - Defines additional options for the singularity executable. In particular, additional storage areas (e.g. CVMFS, CA certificates) can be specified to be mounted here.
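As an illustration of the vo:path syntax (the image path and the atlas VO name are hypothetical), the parameter could be set as follows:

[root ~]# arcctl rte params-set ENV/SINGULARITY SINGULARITY_IMAGE "default:/images/base.sif,atlas:NULL"
[root ~]# arcctl rte default ENV/SINGULARITY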

ENV/CANDYPOND (experimental)

Makes the arccandypond tool available for use inside the job script on the Worker Nodes (including the necessary environment variables for its operation).


Note: The CandyPond service itself should be enabled (defining the [arex/ws/candypond] block) on ARC CE as well.

Parameters:
• CANDYPOND_URL = url - Redefines the URL of the CandyPond service (default is auto – the ARC CE URL used for job submission will be used automatically)

Working with community-defined RTEs

New in version 6.5.
Community-defined RTEs are RTEs that are created, managed and distributed by a particular community. ARC provides a software solution to automate community-defined RTE discovery and software environment provisioning for distributed computing e-Infrastructures.
Typically community-defined RTEs describe the software packages used for computations. The community itself is responsible for building, testing and verifying a particular version of the software packages and the RTE scripts that prepare such a runtime environment to be used on the computing cluster worker nodes behind the computing element.
In general, RunTime Environments in ARC6 are very flexible, so in addition to defining software packages such RTEs can be used to transfer data or communicate with community services to fetch/register some data. It is up to the community to define what they need.
This document describes how to work with community RTEs from the ARC CE admin point of view. To establish a community-defined RTEs registry refer to this guide.

1. Enable Community-defined RTEs support

Community RTEs support was added as a technology preview in ARC 6.5 and is turned off by default. If you need to support community RTEs deployment on the ARC CE, please install the nordugrid-arc-community-rtes package:

[root ~]# yum -y install nordugrid-arc-community-rtes

2. Establish trust chain with community

The trust chain between the community and the site admin is based on digital signatures. All community-defined RTEs are supposed to be signed using the OpenPGP standard for signatures. The technical implementation relies on the GNU Privacy Guard (GPG) software.
To add a new community to the trusted list run:

[root ~]# arcctl rte community add example.org
The imported community public key data is:
pub   2048R/AA56A775 2020-01-30 [expires: 2022-01-29]
      Key fingerprint = 3A47 F0D4 E406 D854 EDAA ADB5 8FD6 DD57 AA56 A775
uid   Example Computations Lab
sub   2048R/3F914B9D 2020-01-30 [expires: 2022-01-29]

Is the community key fingerprint correct? (yes/NO): yes

Warning: Check the community key fingerprint matches the one provided to you by community authorities!


Note: Alternatively you can pass the expected fingerprint value to the --fingerprint option

In the provided example, example.org is the name of the community to add, and at the same time it is interpreted as a domain name of an ARCHERY community-defined RTEs registry. If the ARCHERY domain name is different, add the --archery option.
It is also possible to establish trust with a community using an OpenPGP-compatible keyserver or a web-based RTEs registry1 as an alternative to ARCHERY.

3. Discover RTEs in the registry

Note: The examples below show that APPS/EXAMPLE.ORG/SIMULATION-3.0.1 is already deployed on the ARC CE. That is the result of the next step's execution.

You can list all available community-defined RTEs with the rte-list command:

[root ~]# arcctl rte community rte-list example.org
APPS/EXAMPLE.ORG/SIMULATION-3.0.1 (deployed, registry)
APPS/EXAMPLE.ORG/ANALYSIS-1.7.0 (registry)
ENV/EXAMPLE.ORG/SENDSTATS-1.0.0 (registry)

RTEs optionally provide a description string that can be viewed with the long listing:

[root ~]# arcctl rte community rte-list example.org --long
Community deployed RTEs:
        APPS/EXAMPLE.ORG/SIMULATION-3.0.1 # Example Simulation Software
Additional community RTEs available in the registry:
        APPS/EXAMPLE.ORG/ANALYSIS-1.7.0   # Example Analysis Software
        ENV/EXAMPLE.ORG/SENDSTATS-1.0.0   # Send stats to central example.org services

Before deployment it can be useful to look inside the RTE script. The rte-cat action will show you the content:

[root ~]# arcctl rte community rte-cat example.org APPS/EXAMPLE.ORG/ANALYSIS-1.7.0
# description: Example Analysis Software
# download: url:https://example.org/soft/analysis.sif checksum:md5:63490ad38190a6f172a9020c0c5615f4
if [ "x$1" = "x1" ]; then
    mkdir ${RUNTIME_JOB_DIR}/bin
    cat > ${RUNTIME_JOB_DIR}/bin/example-analysis <

4. Deploy community-defined RTE

Deploying a community-defined RTE from the registry requires nothing more than passing the RTE name to rte-deploy:

[root ~]# arcctl rte community rte-deploy example.org APPS/EXAMPLE.ORG/SIMULATION-3.0.1

1 TODO: Document these alternative options. For now you can follow --help to find some info


This command will:
• fetch the RTE script signed by the community
• verify the signature using the trusted community public keys (installed during step 2)
• deploy the RTE script itself to be used further with arcctl rte
• download the files specified in the community-defined RTE script to the community software location
• verify the checksum data of the downloaded files

Note: HINT: It is useful to increase the debug level to at least INFO during the deployment phase to monitor the progress.

5. Enable community-defined RTE

After deployment of community-defined RTEs, operating them can be done as usual, the same way as for the other RTE types. In particular you can list, enable or default RTEs, including those deployed from a community registry:

[root ~]# arcctl rte list
ENV/CANDYPOND (system, disabled)
ENV/PROXY (system, enabled)
ENV/RTE (system, disabled)
ENV/SINGULARITY (system, disabled)
APPS/EXAMPLE.ORG/SIMULATION-3.0.1 (community, disabled)

[root ~]# arcctl rte enable APPS/EXAMPLE.ORG/SIMULATION-3.0.1

Additional information and hints

This section provides information on how to customize community-defined RTEs operations.

Location of deployed community software

By default, the location for deployed community software is picked up automatically based on arc.conf, relying in particular on the session directory. You can discover and change the location with arcctl:

[root ~]# arcctl rte community config-get example.org
SOFTWARE_DIR=/shared/session/_software/example.org
SOFTWARE_SHARED=Yes

[root ~]# arcctl rte community config-set example.org SOFTWARE_DIR /opt/community/example.org

Operating without the registry

It is possible to deploy community-defined RTEs using the same arcctl automation without a registry. During the deployment phase you can provide the URL to the signed RTE file with the --url option:

[root ~]# arcctl rte community deploy example.org ENV/URLDEPLOYED-1.0.0 --url http://example.org/rte.signed


Or it can even be an RTE script without a signature, if you trust the content:

[root ~]# arcctl rte community deploy example.org APPS/SIM-DEVEL --url file:///home/example/dev/myrte.sh --insecure

Removing RTEs and communities

If you want to remove a deployed RTE or an entire community, there are rte-remove and remove actions respectively:

[root ~]# arcctl rte community rte-remove example.org APPS/EXAMPLE.ORG/SIMULATION-3.0.1
[2020-02-06 18:11:03,653] [ARCCTL.RunTimeEnvironment.Community] [ERROR] [32505] [Community RTE APPS/EXAMPLE.ORG/SIMULATION-3.0.1 is enabled. Please disable it first or use "--force" to disable and undefault automatically]
[root ~]# arcctl rte community rte-remove example.org APPS/EXAMPLE.ORG/SIMULATION-3.0.1 --force
[root ~]# arcctl rte community remove example.org

Measuring accounting metrics of the job

The ARC CE has built-in capabilities to collect information about per-job resource consumption. This includes both ARC CE resources (e.g. data transfers, software environments) and worker node resources (e.g. CPU and memory usage). The full list of attributes stored in the A-REX Accounting Records (AAR) can be found in this document.
A-REX can use different methods (described below) to measure memory and CPU usage on the worker nodes, depending on their availability in the particular deployment case.

Measuring memory and CPU usage on the WN with cgroups

New in version 6.2.
When recent versions of a GNU/Linux OS are used on the worker nodes, the most precise and transparent way to measure the whole job workload is to rely on the cgroups kernel subsystem. Any systemd-based Linux distribution relies heavily on cgroups and they are already used.

Note: Some older operating systems may require mounting the cgroups tree explicitly. For example in RHEL6 it can be easily done with libcgroup:

[root ~]# yum install libcgroup
[root ~]# service cgconfig start

The benefit of using cgroups is that everything will be accounted. Even if several payloads are executed (e.g. in pilot mode) or extra helper processes are spawned, the resource accounting will be accurate for all the workload done.

Enabling cgroups usage

To be able to use cgroups for accounting, ARC needs an extra tool installed on the worker nodes – arc-job-cgroup. Based on the tool's availability, the job script will or will NOT use cgroups for measuring accounting metrics automatically.


The arc-job-cgroup tool is available for the majority of OSes as a packaged binary built as a part of the ARC distribution (the nordugrid-arc-wn package). The easiest way to install it is therefore to use your package manager on the worker nodes, e.g.:

[root ~]# yum install nordugrid-arc-wn

If it is not possible to install the packaged version for some reason, it is easy to compile the pure C source code, which uses standard C library calls only:

[root ~]# wget https://source.coderefinery.org/nordugrid/arc/raw/master/src/wn/arc-job-cgroup.c
[root ~]# cc -o arc-job-cgroup arc-job-cgroup.c
[root ~]# mv arc-job-cgroup /usr/local/bin/
[root ~]# chmod 4755 /usr/local/bin/arc-job-cgroup

How ARC operates cgroups

The idea behind the LRMS-independent cgroup-based resource usage measurements in ARC is to:
• create child cgroups for the memory and cpuacct controllers
• put the job script process into the created cgroups (this will automatically catch all child processes)
• collect the accounting data at the end of the job script
• remove the child cgroups created at the beginning (moving all processes to the parent cgroup)
If cgroups are used in the kernel, the process already belongs to some cgroup. It can be either the root cgroup (used for all processes) or some dedicated cgroup created by an LRMS with cgroups support, a container management system, etc. All resources used by the child cgroup are accounted in the parent cgroup. Moreover, all parent-defined limits are inherited and enforced as well. So creating another child cgroup in the hierarchy is safe from all points of view.
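To see which parent cgroups a process already belongs to (the hierarchy under which the per-job child cgroups would be created), you can inspect /proc on the worker node; this is only an illustration of the hierarchy, not an ARC command:

# show the cgroup membership of the current shell (cgroups v1 layout)
[root ~]# cat /proc/self/cgroup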

Warning: Creating a child cgroup and putting a task into it requires root privileges. This is the reason behind the SUID bit for arc-job-cgroup. However, the code itself is as simple as mkdir. You can review these 333 lines to reassure any possible fears.

If the arc-job-cgroup tool is not available, the cgroups tree is not mounted, or there are any other issues with cgroups creation, the job script code falls back to the GNU time measurement method.

Measuring memory and CPU usage on the WN with GNU time

The GNU time utility is capable of measuring and displaying information about the resources used by the executable it runs. It is used as a part of the ARC-generated job script if found on the worker node.

Note: Changed in version 6.2. In case of successful cgroups usage, GNU time will NOT be used by job script.

Warning: GNU time is a separate binary, typically installed by a dedicated package. Do not mix it up with the built-in version of time in your shell (e.g. bash, zsh). Typically you can install it with e.g. yum install time or a similar package management command.


For a non-standard location of GNU time, the gnu_time configuration option can be used to define it. If the GNU time utility is not available, the job will run as-is and only LRMS-provided metrics will be accounted.
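A configuration sketch for a non-standard location; the path is hypothetical and it is assumed here that the gnu_time option lives in the [lrms] block (check the ARC Configuration Reference Document for the exact block):

[lrms]
gnu_time= /opt/gnu-time/bin/time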

Using LRMS-provided metrics

After the job has finished execution in the LRMS, the batch system backend scan script extracts accounting information about the job from the LRMS, either by executing command-line clients, parsing logs or using an API.
The exact data measurements and the method of data collection completely depend on the LRMS backend implementation and differ from one backend to another. Common metrics include LRMSStartTime and LRMSEndTime. There are also typically some memory and CPU usage metrics available.

Accounting Subsystem

New in version 6.4.

Warning: Information in this chapter is relevant only for 6.4+ ARC releases. If you are looking for information about the legacy accounting subsystem in the 6.0-6.3 ARC releases, please read Accounting with legacy JURA. Make sure you are reading the documentation that matches your ARC CE release version.

The next generation ARC Accounting Subsystem aims to improve scalability, eliminate the bottlenecks of the legacy implementation and provide a local on-site ARC CE accounting database that can be queried and analyzed by CE admins.

Overview

The figure shows an overview of the next generation accounting subsystem in ARC. More details can be found in ARC Next Generation Accounting Technical Details.
The central point of the ARC next generation Accounting Subsystem is a local SQLite accounting database that stores all the A-REX Accounting Record (AAR) information. An AAR defines all accounting information stored about a single ARC CE job.
The database is populated directly by A-REX based on per-job files inside the control directory. In particular, the .diag file is the main source of resource usage information, .statistics holds the data transfer measurements and .local is a general datastore for job properties like ID, owner, etc.
The record about the job is created in the accounting database as soon as the job enters the ARC CE. Then each state change of the job is recorded. Once the job has reached a terminal state (either successful completion or failure) all resource usage data is written and becomes available via local queries (arcctl accounting) or for reporting to SGAS/APEL (jura-ng).
The accounting subsystem is an integral part of A-REX and is always enabled.

Configuration

In the typical use case the accounting subsystem just works and does not require additional configuration. However, in some rare cases there are several things you can consider tuning.


Fig. 4.3: Next generation ARC accounting subsystem overview: creating AAR records, getting access to the data and publishing to external SGAS/APEL services

WLCG VOs

The recommended and typically used way to authorize WLCG VOs is to use the [authgroup] block voms configuration option that exactly matches attributes in the VOMS extension of the proxy certificate. If you have authorization rules configured this way, NOTHING needs to be configured.
In case you do not have at least one voms option in any of the defined [authgroup] blocks, A-REX will not trigger the VOMS attributes parsing, which consequently leads to no VO info in the accounting! To have VO info in the accounting in this particular case, you can add a standalone authgroup with a voms option that has no further usage in arc.conf:

[authgroup: parsevoms]
voms= * * * *

There is also the forcedefaultvoms configuration option (which can be defined on a per-queue basis) that defines the accounted VO for jobs that have no VOMS extension in the owner's certificate.

Enabling accounting records reporing

It is possible to send resource usage reports to centralized SGAS and/or APEL accounting databases.
Based on the local accounting database data, the Accounting Publishing Module is capable of generating:
• OGF Usage Record 1.0 (UR) XML format to be sent to SGAS LUTS (Logging and Usage Tracking Service)
• EMI Compute Accounting Record (CAR) v1.2 XML format for individual job records to be sent to APEL
• APEL Summaries (to reduce traffic) and APEL Sync messages for full-featured integration with APEL services


The regular publishing sequence is handled by the jura-ng helper tool and can be enabled with the [arex/jura] block. A-REX periodically runs jura-ng with a default and minimum hourly period. It can be increased with the urdelivery_frequency option. Furthermore, it can be increased per target, using the same option inside a particular target block.
JURA has a dedicated log file defined by the logfile option. Log rotation has been set up for the default /var/log/arc/jura.log location.
Accounting services for sending the records are configured with dedicated sub-blocks. You need to define a separate block with a unique targetname for every target server used.

Note: The target block name will be used by jura-ng to track the latest records sent to this target. Be aware that if you rename the block, the target will be handled as a new one. However, a targeturl change will not trigger new target handling and records will continue to be published using the latest recorded timestamp.

The AARs data will be reported to all of the defined destinations, unless the vofiler option is configured for some of them to filter records by VO name.
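A sketch of such per-target VO filtering, reusing the SGAS example target shown below; the VO name atlas is only a placeholder:

[arex/jura/sgas: NeIC]
targeturl= https://grid.uio.no:8001/logger
vofiler= atlas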

Warning: There were several issues in the codebase, and misunderstandings from the operational point of view, regarding how benchmarks are propagated to the central accounting services with ARC accounting. Please read About benchmarks and accounting publishing for more details.

Configuring reporting to SGAS

The SGAS sub-block enables and configures an SGAS accounting server as a target destination to which the ARC CE will send properly formatted OGF.98 Usage Record 1.0 (UR) XML records. The targeturl option is the only mandatory parameter to configure an SGAS target. In specific setup cases you can also apply VO filtering and set a prefix for local job IDs. Example:

[arex/jura/sgas: NeIC]
targeturl= https://grid.uio.no:8001/logger
urbatchsize= 80

Configuring reporting to APEL

The APEL sub-block enables and configures an APEL accounting server as a target destination to which ARC will send the data. The targeturl option defines the APEL broker URL to send records to.
Unlike the APEL native client, which relies on the Top-BDII infrastructure at runtime to find the brokers, ARC uses a dedicated target URL in the configuration. It is possible to look up the available targets with arcctl:

[root ~]# arcctl accounting apel-brokers --ssl
https://mq.cro-ngi.hr:6162/
https://broker-prod1.argo.grnet.gr:6162/
[root ~]# arcctl accounting apel-brokers
http://mq.cro-ngi.hr:6163/
http://broker-prod1.argo.grnet.gr:6163/

It is recommended to use an SSL target.
The apel_messages option allows you to choose between publishing per-job EMI Compute Accounting Record (CAR) XML records and publishing APEL Summaries. Sending summaries is the default and the behaviour recommended by APEL, as it saves resources and traffic.


APEL Sync records that synchronize the total job counters per month are always sent.
You also need the GOCDB name of the resource and the APEL topic to which the ARC CE will publish the accounting records.
For a correct production accounting setup it is recommended to specify the resource benchmarking results. ARC assumes that the nodes in the same queue are homogeneous with respect to the benchmark performance, and benchmark values are specified per queue. Example:

[arex/jura/apel: EGI]
targeturl= https://mq.cro-ngi.hr:6162
topic= /queue/global.accounting.cpu.central
gocdb_name= RAL-LCG2

[queue: grid]
benchmark= Si2k 2625

Lookup local accounting data

Data in the ARC CE accounting database can be viewed with the ARC Control Tool. The timeframe of interest and many other filters can be specified, e.g.:

[root ~]# arcctl accounting stats --filter-vo ops --start-from 2019-07-10
A-REX Accounting Statistics:
  Number of Jobs: 1317
  Execution timeframe: 2019-07-10 00:01:45 - 2019-07-26 12:48:13
  Total WallTime: 4:40:28
  Total CPUTime: 0:22:32 (including 0:00:00 of kernel time)
  Data staged in: 2.3M
  Data staged out: 9.7K

[root ~]# arcctl accounting job events UufLDmmmS5unf5481mks8bjnABFKDmABFKDmmSMKDmNBFKDm9nRnVo
2019-07-10 17:30:00  ACCEPTED
2019-07-10 17:30:00  PREPARING
2019-07-10 17:30:00  DTRDOWNLOADSTART
2019-07-10 17:30:08  SUBMIT
2019-07-10 17:30:08  DTRDOWNLOADEND
2019-07-10 17:30:11  INLRMS
2019-07-10 17:30:16  LRMSSTART
2019-07-10 17:30:26  LRMSEND
2019-07-10 17:30:47  FINISHING
2019-07-10 17:30:47  FINISHED
2019-07-14 15:01:16  DELETED

[root ~]# arcctl accounting job transfers UufLDmmmS5unf5481mks8bjnABFKDmABFKDmmSMKDmNBFKDm9nRnVo
Data transfers (downloads) performed during A-REX stage-in:
  http://download.nordugrid.org:80/packages/nordugrid-arc/releases/6.1.0/src/nordugrid-arc-6.1.0.tar.gz:
    Size: 5.2M
    Download timeframe: 2019-07-10 17:30:00 - 2019-07-10 17:30:04
No stage-out data transfers (uploads) performed by A-REX.

More queries examples can be found in this document.


Republishing records

When something goes wrong with the accounting services, network, etc., it may be necessary to republish local records again.
In the current implementation of the accounting subsystem, there is no difference between publishing and re-publishing. The same Accounting Publishing Module is used to generate the records to be sent to the target service for the defined timeframe.
Republishing is triggered by the ARC CE administrator using the ARC Control Tool. The most streamlined way is to republish data to a target that is already configured in arc.conf for regular publishing:

[root ~]# arcctl accounting republish -b 2019-06-01 -e 2019-07-01 -t EGI

However, it is also possible to define all target options from the command line, without a defined target in arc.conf:

[root ~]# arcctl accounting republish --start-from 2019-06-01 --end-till 2019-07-

˓→01 \ > --apel-url https://broker-prod1.argo.grnet.gr:6162 --gocdb-name "UA-KNU" \ > --apel-messages summaries --apel-topic /queue/global.accounting.cpu.central

Clean up of the /logs folder

With the accounting in ARC versions >= 6.4.0 archive files are no longer written to the /logs folder. It is recommended to manually wipe this directory.

Job scratch area

ARC allows configuring different approaches to manage the job scratch area during the job life cycle. In most cases this is achieved by generating a wrapper submission script for the batch system that carries out the tasks related to directory creation and data movement. The key elements are:

job session directory
    directory on the ARC CE where the job files are located.
job scratch directory
    directory on LRMS-managed worker nodes (WNs) where all I/O during computation is performed.
<jobid>
    a unique identifier for the job, assigned by ARC. Example: 10YLDmDRgrynVALY5mGJwcyoABFKDmABFKDmUvKKDmABFKDmG588Rm
job files
    • input files, i.e. those coming from the client and the data-staging framework that are needed for the actual job processing, copied inside a folder named after the <jobid>, which will also contain the job stdout and stderr files.
    • some metadata files used by ARC, mainly <jobid>.comment and <jobid>.diag

The job session directory is configured with the sessiondir configuration option. It is possible to configure several session directory root paths. A-REX will then select one of the available directories and append the <jobid> to the path. Example:


/nfs/sessiondir5/
    10YLDmDRgrynVALY5mGJwcyoABFKDmABFKDmUvKKDmABFKDmG588Rm/
        script.sh
        outfile
        stderr
        stdout
    10YLDmDRgrynVALY5mGJwcyoABFKDmABFKDmUvKKDmABFKDmG588Rm.comment
    10YLDmDRgrynVALY5mGJwcyoABFKDmABFKDmUvKKDmABFKDmG588Rm.diag
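To illustrate the multiple session directory root paths mentioned above, a hedged arc.conf sketch; the NFS paths are illustrative examples, not values taken from this guide:

[arex]
sessiondir = /nfs/sessiondir1
sessiondir = /nfs/sessiondir5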

There are several configuration options that affect the selection of the job scratch directory:

shared_filesystem
    defines if the job session directory is shared between ARC CE and WNs (by means of e.g. NFS). Sets the environment variable RUNTIME_NODE_SEES_FRONTEND.
scratchdir
    defines the path to the job scratch directory on the WN. Sets the environment variable RUNTIME_LOCAL_SCRATCH_DIR.
movetool
    defines what tool the job wrapper will use to move data from the session directory to the local WN scratch directory. Sets the environment variable RUNTIME_LOCAL_SCRATCH_MOVE_TOOL.
shared_scratch
    defines that the WN scratch directory can be accessed from the ARC CE (by means of e.g. NFS) using the configured path. Sets the environment variable RUNTIME_FRONTEND_SEES_NODE. Currently this legacy option is not used much, as it is uncommon that nodes share a single scratch directory. If you require such functionality please contact the ARC team.

Note: The described environment variables can be redefined dynamically by RunTime Environments. For example, ENV/LRMS-SCRATCH can be used to utilize a local scratch directory created dynamically by the LRMS.

Compute inside shared session directory

In the simplest case, where the job session directory is shared between the ARC CE and the WNs (e.g. via NFS) and accessible on the same path, the job scratch directory is the job session directory.

Fig. 4.4: Sessiondir is shared between ARC CE and WNs. No local scratchdir defined.

Configuration:

[arex]
shared_filesystem = yes

Variables set in the wrapper script:


RUNTIME_NODE_SEES_FRONTEND='yes'

Compute inside a local WN scratch directory

Session directory is shared, WN scratch directory is not shared

For I/O performance reasons it is possible to perform computations inside a local directory on the WN. In this case job scratch directory is created in the configured local directory and files from the job session directory are moved to job scratch before execution starts (a). Only files representing the job’s stdout and stderr are placed in the original job session directory and soft-linked in scratch (b). After the execution has completed all output files are moved to the job session directory (b) and are then available for clients to download (c). The move is performed by default using the mv command. The admin can change this behaviour using the movetool option. Configuration:

[lrms]
movetool = rsync -av

[arex]
shared_filesystem = yes
scratchdir = /mnt/scratch/arc

Variables set in the wrapper script:

RUNTIME_NODE_SEES_FRONTEND='yes'
RUNTIME_LOCAL_SCRATCH_DIR='/mnt/scratch/arc'
RUNTIME_LOCAL_SCRATCH_MOVE_TOOL='rsync -av'

Session directory is NOT shared, WN scratch directory is NOT shared

If the session directory is not shared, the data movement between the ARC CE and the WN can be done by one or a combination of these two:

• by means of the LRMS backend
  How to implement the data movement between the ARC CE and the WN depends on the particular batch system backend used. This is not described here, please refer to your batch system manuals. For example, in PBS this corresponds to the #PBS -W stagein options. In SLURM it could be custom prolog and epilog scripts.
• by a custom made RTE
  This topic is currently not part of this guide, but it is implemented in some ARC setups, so please ask for support from the community.

Two known use cases of setting up ARC to work with such a setup are described below.

Static path to WN scratch directory

In this use case the path of the scratch directory on every WN is a static path that does not change depending on the job.


Fig. 4.5: Sessiondir is shared between ARC CE and WNs, scratchdir is defined and available only on nodes.


ARC will create a wrapper script that expects all the job files to be on the WN in the path $RUNTIME_LOCAL_SCRATCH_DIR/<jobid>.

Warning: In this use case, when the job session directory is not shared (shared_filesystem = no), the job scratch directory MUST BE defined (scratchdir = <path>) to instruct the LRMS where to find the files. Otherwise job submission to the LRMS will fail.

After all input files are gathered in the job session directory on ARC CE, the LRMS or the custom RTE copies files to the job scratch directory on WN (Figure a)). The job performs all I/O using local job scratch directory. After execution all declared output files (including stdout and stderr) must be staged out to the job session directory on ARC CE by LRMS or custom RTE (Figure b)). When the output job files are available in the job session directory on ARC CE they are ready to be uploaded to external storage elements or be downloaded by the user (Figure c)). Configuration:

[arex]
shared_filesystem = no
scratchdir = /mnt/scratch/arc

Variables set in the wrapper script:

RUNTIME_NODE_SEES_FRONTEND=''
RUNTIME_LOCAL_SCRATCH_DIR='/mnt/scratch/arc'

Dynamic job path to the WN scratch directory

In this use case the LRMS generates a dynamic path for each submitted job. This path is not known to ARC at the creation time of the wrapper script, therefore it must be obtained in some way at runtime, after the job is submitted. A sysadmin may already have defined such a variable in their prolog/epilog scripts. The workflow is very similar to the one described in the picture of the static path use case, with the difference that the name of the LRMS-specific environment variable that contains the dynamic path can be defined and configured in a special RTE called ENV/LRMS-SCRATCH, so that it will be used at runtime to generate a path of the form <dynamic path>/<jobid>. The sysadmin can then create a custom RTE with LRMS-specific commands and make use of the variable above when referencing the dynamic directory. In this scenario the arc.conf variable scratchdir can be defined but it is not mandatory, as the LRMS may have a completely different base path for each node/job. However, if configured, the scratchdir path will be prefixed to the dynamic path, as in: <scratchdir>/<dynamic path>/<jobid>. Please refer to the documentation of the bundled ENV/LRMS-SCRATCH runtime environment for more details. Configuration:

[arex]
shared_filesystem = no

Variables set in the wrapper script:

RUNTIME_NODE_SEES_FRONTEND=''
RUNTIME_LOCAL_SCRATCH_DIR=''


Fig. 4.6: NEITHER sessiondir NOR WN scratchdir are shared between ARC CE and WNs.


Accounting with legacy JURA

WARNING: This component was deprecated in ARC 6.4 and completely removed in ARC 6.8!

Warning: Information in this chapter is relevant only for the 6.0-6.3 ARC releases. Starting from the ARC 6.4 release the Accounting Subsystem with a local accounting database is introduced. Make sure you are reading the documentation that matches your ARC CE release version.

The Job Usage Reporter of ARC (JURA) is a component capable of creating standard-compliant usage records from the job usage information provided by A-REX (Job Log files) and sending the records to remote accounting services. JURA is capable of creating two types of usage records from the job log files:
• Usage Record 1.0 (UR) XML format, to be sent to an SGAS LUTS (Logging and Usage Tracking Service)
• Compute Accounting Record (CAR) XML format, to be sent to APEL

Overview

The figure shows an overview of the accounting workflow in ARC. More details can be found in Legacy JURA Accounting Technical Details.

Fig. 4.7: Accounting records processing in ARC CE: creating, processing, publishing and archiving

When accounting is enabled, the sequence of events is the following:
1. A-REX writes the per-job accounting records to the Job Logs directory (the logs directory inside the controldir).
2. A-REX periodically runs JURA. The default and minimum period is hourly, but it can be increased with the urdelivery_frequency option.
3. JURA reads the job accounting information from the Job Logs and, depending on the targets configured in arc.conf, creates accounting records in the required format.
4. JURA publishes the accounting records to the configured accounting services.
5. If records archiving is enabled, JURA also puts the records into the Accounting Archive.
6. When the archiving functionality is used, A-REX also runs the jura-archive-manager process, which manages the archive structure and indexes the records in the local database. The local accounting database can then be used to look up statistics or to republish data.

Enabling accounting records reporting

JURA is a stand-alone binary application that is a part of the A-REX package and can be enabled with the [arex/jura] block. JURA has a dedicated log file as defined by the logfile option. Log rotation is set up for the default /var/log/arc/jura.log location. Accounting logs can also be viewed with arcctl:
Changed in version 6.4: an extra legacy argument is required to access the legacy accounting subsystem functionality. Use arcctl accounting legacy logs.

[root ~]# arcctl accounting logs
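For reference, a minimal hedged sketch of the [arex/jura] block described above; the log file path matches the default mentioned earlier and the delivery period value is illustrative:

[arex/jura]
logfile = /var/log/arc/jura.log
urdelivery_frequency = 3600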

Accounting services for sending the records are configured with dedicated sub-blocks. You need to define a separate block with a unique targetname for every target server used. The usage record of each job will be reported to all of the destinations, unless the vofilter option is configured for some of them to filter records by VO name.
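As an illustration, a hedged sketch of two target sub-blocks where the second one only receives records of a single VO via vofilter; the target names, hostnames and VO name are hypothetical:

[arex/jura/sgas: all_jobs]
targeturl = https://sgas1.example.org:8443/sgas

[arex/jura/sgas: atlas_only]
targeturl = https://sgas2.example.org:8443/sgas
vofilter = atlas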

Configuring reporting to SGAS

The SGAS sub-block enables and configures an SGAS accounting server as a target destination to which JURA will send properly formatted Usage Record 1.0 (UR) XML records. The targeturl option is the only mandatory parameter to configure an SGAS target. In specific setup cases you can also apply VO filtering and set a prefix for local job IDs. Example:

[arex/jura/sgas: NeIC]
targeturl = https://grid.uio.no:8001/logger
urbatchsize = 80

Configuring reporting to APEL

The APEL sub-block enables and configures an APEL accounting server as a target destination to which JURA will send properly formatted Compute Accounting Record (CAR) XML records. The targeturl option defines the APEL broker URL to send records to. Unlike the native APEL client, which relies on the Top-BDII infrastructure at runtime to find the brokers, ARC uses a dedicated target URL in the configuration. However, it is possible to look up the available targets with arcctl:

[root ~]# arcctl accounting apel-brokers
http://mq.cro-ngi.hr:6163/
http://broker-prod1.argo.grnet.gr:6163/
[root ~]# arcctl accounting apel-brokers --ssl
https://mq.cro-ngi.hr:6162/
https://broker-prod1.argo.grnet.gr:6162/

SSL or non-SSL connection should be selected accordingly with the use_ssl option. You also need the GOCDB name of the resource and the APEL topic to which JURA will publish the accounting records. For a correct production accounting setup it is recommended to specify resource benchmarking results. Example:


[arex/jura/apel: EGI]
targeturl = https://mq.cro-ngi.hr:6162
topic = /queue/global.accounting.cpu.central
gocdb_name = RAL-LCG2
benchmark_type = Si2k
benchmark_value = 2625.00
use_ssl = yes

Note: Publishing to APEL relies on the SSM framework, which has its own log file viewable with arcctl accounting logs --ssm.

Records archiving

The archiving functionality allows storing the generated usage records in a specified directory on disk. The [arex/jura/archiving] block enables this functionality. The directory path for the archived JURA usage records is defined with the archivedir option. Once archiving is enabled, JURA writes the generated usage records to individually named usagerecord[CAR].* files in this directory. If an A-REX job log file is processed repeatedly – for example because of temporary connection failures to the remote accounting service – and a respective usage record archive file already exists, then the usage record is not generated again. Instead, the content of the archive file is used without change (the creation time stamp is also retained).

Note: The records archive can grow very rapidly on heavily loaded sites with a huge amount of jobs coming in daily. To limit the archiving time, use the archivettl option.
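A hedged example combining both options; the path and the time-to-live value are illustrative, and the exact value semantics should be checked against the ARC Configuration Reference:

[arex/jura/archiving]
archivedir = /var/spool/arc/jura/archive
archivettl = 30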

To better organize the archived records and index the accounting data in the local database, the jura-archive-manager helper process is run periodically by A-REX when archiving is enabled. The archive manager:
• reads the resource usage data and stores it in an indexed SQLite database file accounting.db;
• moves files according to the directory structure YYYY-MM/DD/<jj>/<kkkkk>.{UR|CAR}, where <jj> is the first 2 characters of the job ID and <kkkkk> is the rest of the job ID.

Lookup local accounting data

Data in the local accounting database can be viewed with arcctl for each type of destination. The timeframe of interest and many other filters can be specified, e.g.:
Changed in version 6.3: an extra legacy argument is required to access the legacy accounting subsystem functionality. Use arcctl accounting legacy stats.

[root ~]# arcctl accounting stats -t apel -b 2018-11-01 -e 2018-11-30
Statistics for APEL jobs from 2018-11-01 00:00:02 till 2018-11-29 23:59:13:
  Number of jobs: 355168
  Total WallTime: 48146 days, 18:06:50
  Total CPUTime: 71927 days, 10:33:33

[root ~]# arcctl accounting stats -t apel -v
ops
belle
atlas
ops.ndgf.org

[root ~]# arcctl accounting stats -t apel --filter-vo belle
Statistics for APEL jobs from 2015-04-13 07:28:52 till 2019-04-02 00:25:57:
  Number of jobs: 869400
  Total WallTime: 216866 days, 3:33:33
  Total CPUTime: 193604 days, 19:40:58

Republishing records

When something goes wrong with the accounting services, the network, etc., it may be necessary to republish local records.

Warning: Republishing is ONLY possible when Records archiving is enabled and records still exist for the required time period.

The ARC Control Tool should be used to republish data. It is tightly integrated with the accounting archive database: arcctl will look up the needed records in the database for the defined republishing period, prepare them for republishing and then invoke JURA with the correct configuration settings.

Warning: It is strongly recommended to AVOID running republishing the ARC5 way, i.e. executing the jura command manually. Even if you manage to get it working, it can cause unpredictable results.

Define the timeframe of interest and run the command to republish the data:
Changed in version 6.4: an extra legacy argument is required to access the legacy accounting subsystem functionality. Use arcctl accounting legacy republish.

[root ~]# arcctl accounting republish -b 2018-12-01 -e 2018-12-31 -s https://sgas.grid.org.ua:8443/sgas
[root ~]# arcctl -d DEBUG accounting republish -b 2019-04-01 -e 2019-04-10 -a https://mq.cro-ngi.hr:6162

Next generation ARC accounting: arcctl example queries

General statistic queries examples

[root arc-dev]# arcctl accounting stats
A-REX Accounting Statistics:
  Number of Jobs: 238
  Execution timeframe: 2019-07-09 18:42:18 - 2019-07-12 15:16:56
  Total WallTime: 6 days, 1:35:37
  Total CPUTime: 4:19:45 (including 0:00:00 of kernel time)
  Data staged in: 62.1M
  Data staged out: 1.7K

[root arc-dev]# arcctl accounting stats -o brief
data-staged-in jobcount json walltime cputime data-staged-out jobids users wlcgvos

[root arc-dev]# arcctl accounting stats -o wlcgvos
moldyngrid
ops
matmoden

[root arc-dev]# arcctl accounting stats --filter-vo ops -o jobcount
222

[root arc-dev]# arcctl accounting stats --filter-vo ops --start-from 2019-07-11
A-REX Accounting Statistics:
  Number of Jobs: 131
  Execution timeframe: 2019-07-11 00:16:08 - 2019-07-12 15:33:26
  Total WallTime: 0:20:38
  Total CPUTime: 0:02:02 (including 0:00:00 of kernel time)
  Data staged in: 219.3K
  Data staged out: 1.0K

[root arc-dev]# arcctl accounting stats --filter-queue grid \
    --filter-user /DC\=org/DC\=ugrid/O\=people/O\=KNU/CN\=Andrii\ Salnikov
A-REX Accounting Statistics:
  Number of Jobs: 8
  Execution timeframe: 2019-07-09 18:42:18 - 2019-07-10 17:31:59
  Total WallTime: 0:01:20
  Total CPUTime: 0:00:00 (including 0:00:00 of kernel time)
  Data staged in: 61.7M
  Data staged out: 0

[root arc-dev]# arcctl accounting stats --filter-endpoint org.ogf.glue.emies.activitycreation \
    --end-till 2019-07-10 -o data-staged-out
75

[root arc-dev]# arcctl accounting stats --filter-state failed --filter-vo moldyngrid -o data-staged-in
21574172

[root arc-dev]# arcctl accounting stats --filter-extra rte ENV/PROXY -o json | jq .
{
  "stageout": 0,
  "rangeend": 1562938441,
  "count": 86,
  "cpukerneltime": 0,
  "users": [
    "/DC=EU/DC=EGI/C=HR/O=Robots/O=SRCE/CN=Robot:[email protected]",
    "/DC=EU/DC=EGI/C=GR/O=Robots/O=Greek Research and Technology Network/CN=Robot:[email protected]"
  ],
  "wlcgvos": [
    "ops"
  ],
  "cpuusertime": 0,
  "cputime": 0,
  "rangestart": 1562688395,
  "stagein": 398180,
  "walltime": 172
}

[root arc-dev]# arcctl accounting stats --filter-extra jobname "test 04"
A-REX Accounting Statistics:
  Number of Jobs: 4
  Execution timeframe: 2019-07-10 13:40:00 - 2019-07-10 17:31:59
  Total WallTime: 0:00:40
  Total CPUTime: 0:00:00 (including 0:00:00 of kernel time)
  Data staged in: 30.9M
  Data staged out: 0

[root arc-dev]# arcctl accounting stats --filter-extra jobname "test 04" -o jobids
g3AMDmCCP5unf5481mks8bjnABFKDmABFKDmN9IKDmIBFKDmnjU0ym
nEhMDmvqQ5unf5481mks8bjnABFKDmABFKDmN9IKDmZBFKDmtXEYGn
UufLDmmmS5unf5481mks8bjnABFKDmABFKDmmSMKDmNBFKDm9nRnVo
bdoLDmCnS5unf5481mks8bjnABFKDmABFKDmmSMKDmaBFKDmfKldDn

Job-specific accounting queries

[root arc-dev]# arcctl accounting job events bdoLDmCnS5unf5481mks8bjnABFKDmABFKDmmSMKDmaBFKDmfKldDn
2019-07-10 17:30:26  ACCEPTED
2019-07-10 17:30:26  PREPARING
2019-07-10 17:30:26  DTRDOWNLOADSTART
2019-07-10 17:30:27  SUBMIT
2019-07-10 17:30:27  DTRDOWNLOADEND
2019-07-10 17:30:29  INLRMS
2019-07-10 17:30:47  LRMSSTART
2019-07-10 17:30:57  LRMSEND
2019-07-10 17:31:59  FINISHING
2019-07-10 17:31:59  FINISHED

[root arc-dev]# arcctl accounting job transfers bdoLDmCnS5unf5481mks8bjnABFKDmABFKDmmSMKDmaBFKDmfKldDn
Data transfers (downloads) performed during A-REX stage-in:
  http://download.nordugrid.org:80/packages/nordugrid-arc/releases/6.1.0/src/nordugrid-arc-6.1.0.tar.gz (from cache):
    Size: 5.2M
    Download timeframe: 2019-07-10 17:30:26 - 2019-07-10 17:30:27
  http://download.nordugrid.org:80/packages/nordugrid-arc/releases/6.0.0/src/nordugrid-arc-6.0.0.tar.gz (from cache):
    Size: 5.1M
    Download timeframe: 2019-07-10 17:30:26 - 2019-07-10 17:30:27
No stage-out data transfers (uploads) performed by A-REX.

[root arc-dev]# arcctl accounting job info bdoLDmCnS5unf5481mks8bjnABFKDmABFKDmmSMKDmaBFKDmfKldDn
Job bdoLDmCnS5unf5481mks8bjnABFKDmABFKDmmSMKDmaBFKDmfKldDn accounting info:
==========================================================================
Job description:
  Job was submitted at 2019-07-10 17:30:26 via "org.ogf.glue.emies.activitycreation" interface using "https://arc.matmoden.kiev.ua:443/arex" endpoint.
  Job owned by "/DC=org/DC=ugrid/O=people/O=KNU/CN=Andrii Salnikov" as a member of "moldyngrid" WLCG VO.
  It was targeted to the "grid" queue with "381309.head.energrid.ipme.kiev.ua" LRMS ID.
  Job completed with exit code 0 at 2019-07-10 17:31:59.
  Following job properties are recorded:
    Localuser: prdmdg24
    Clienthost: 130.235.185.244:58458
    Lrms: pbs
    Nodenames: n2
    Jobname: test 04
Resource usage:
  Execution timeframe: 2019-07-10 17:30:26 - 2019-07-10 17:31:59
  Used WallTime: 10
  Used CPUTime: 0 (including 0 of kernel time)
  Used WN Scratch: 0
  Max physical memory: 528
  Max virtual memory: 0
  Used CPUs: 1 on 1 node(s)
  Data staged in: 0
  Data staged out: 0
Used RunTime Environments:
  There are no RTEs used by the job.
Auth token attributes provided:
  VOMS FQAN: /moldyngrid
  VOMS FQAN: /moldyngrid/Role=VO-Admin
  VOMS FQAN: /moldyngrid/Role=production
  VOMS FQAN: /moldyngrid/edu

About benchmarks and accounting publishing

After the changes in the accounting system in ARC 6.4.0 there were some issues related to missing benchmark values in the accounting records. Some of them are related to a bug that unfortunately snuck into the codebase; however, sites can have issues with benchmarks for other reasons too. This page aims to clarify how benchmarks are recorded and propagated, in what situations problems can occur, and how to fix them. If you see HEPSPEC 1.0 is being used in the jura.log, (some of) your job records are missing benchmark values. Follow the questions to clarify your case.

Which version of ARC do you have?

Depending on the ARC version, there are several issues related to benchmark values processing:
• ARC < 6.4.0:
  – a completely different accounting codebase is in use, so the information in this document is irrelevant. The general advice is to update to a recent version; bugs in the old codebase will not be fixed.
• ARC < 6.5.0:
  – bug with handling benchmark values in the publishing code
  – HTCondor backend with a non-shared filesystem has missing benchmarks
  – APEL summaries query performance degrades with an increasing amount of stored records
• ARC < 6.8.0:
  – HTCondor backend with a non-shared filesystem has missing benchmarks
  – APEL summaries query performance degrades with an increasing amount of stored records
• ARC >= 6.8.0:
  – all known benchmark-related issues are fixed
  – if you see HEPSPEC 1.0 is being used there is some valid reason for it, including misconfiguration

Note: It is important to understand that benchmark values are part of the job accounting record. Benchmark data in the job accounting record is defined at job start time and stored when the job has just finished. If you have jobs started before the ARC update or the configuration fix (depending on your case), you need to manually fix the already stored records. There is no way that changes will be applied retrospectively to already stored records automatically.


What are the reasons for a missing benchmark value in the job records?

There are several valid reasons why you will see the HEPSPEC 1.0 is being used message:
1. The job was started when ARC was at a version < 6.5.
2. The job was started when the [queue:name] block in arc.conf had no proper benchmark defined.
3. Permission or other issues (including the HTCondor backend bug with non-shared filesystems in ARC < 6.8.0) prevent the writing of .diag files on the worker nodes.
4. The job failed in the LRMS before the initial jobscript wrapper part was even executed (node failure, etc).
The last issue can happen, although very rarely, and there is nothing to do about it; such jobs have zero cputime, so the benchmark is really irrelevant. Nevertheless, to eliminate the log message that annoys admins and to avoid an additional type of summary record during publishing, ARC 6.8.0 introduced the benchmark option in the [lrms] block that will be used as a fallback if the benchmark metric is missing in the job data.

Warning: Again! The benchmark option in the [lrms] block has no influence on already stored records. It is used at storing time but NOT at publishing time.
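A hedged sketch of such a fallback; the value is illustrative and the exact value syntax should be verified against the ARC Configuration Reference Document for your release:

[lrms]
benchmark = HEPSPEC 12.1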

How to fix missing benchmark values manually?

Already stored accounting records that have no benchmark values can be fixed by issuing an SQLite query that adds a benchmark value. The following example assumes the controldir is the default /var/spool/arc/jobstatus:

[root ~]# sqlite3 /var/spool/arc/jobstatus/accounting/accounting.db "insert into
JobExtraInfo (RecordID, InfoKey, InfoValue) select distinct RecordID, 'benchmark',
'HEPSPEC:12.1' from JobExtraInfo where RecordID not in
(select RecordID from JobExtraInfo where InfoKey='benchmark');"

If you discover that some records use the default benchmark HEPSPEC 1.0 instead of the benchmark value you want in arc.conf (e.g. you added the benchmark values after the job start), you can update the values as well:

[root ~]# sqlite3 /var/spool/arc/jobstatus/accounting/accounting.db "update
JobExtraInfo set InfoValue = 'HEPSPEC:12.1' where InfoKey = 'benchmark'
and InfoValue = 'HEPSPEC:1.0';"

What should I know to avoid running into benchmark issues?

To understand how the HEPSPEC 1.0 is being used message occurs in the jura.log there are 3 points to understand:
1. JURA is only the publisher and it sends the data about the jobs stored in the local ARC accounting database. NO values from arc.conf (apart from where to publish records) are used during publishing.
2. Info about benchmarks is part of the job accounting data stored in the ARC local accounting database when the job is in the finishing state. Moreover, the static data, including the benchmark defined in arc.conf, are defined during the jobscript generation (job start time). Any update to arc.conf AFTER the job start HAS NO EFFECT on already stored records.
3. In the case of publishing to APEL, the default method is APEL summaries. This means that JURA will send (update) the total counters for the last 2 months of data, aggregated per VO, DN, Endpoint (including queue) and Benchmark! CONSEQUENTLY, if any single job within the 2 month timeframe is missing the benchmark data, this warning about using HEPSPEC 1.0 will be there!

Warning: For ARC < 6.8.0 the APEL summary query includes grouping by benchmark, which was out of the scope of the initial ARC accounting database design. The extra table join is harmful to performance on heavily loaded sites! The recommended mitigation to save ARC CE CPU cycles is to go back to individual usage record publishing with the apel_messages = urs option. In ARC 6.8.0 the APEL summary querying was improved and the performance hit is no longer significant, so you can use summaries on heavily loaded sites as well.

It is also important to understand the chain of benchmark propagation when troubleshooting issues:
1. The value of the benchmark defined in the [queue:name] block in arc.conf is written as-is to the .diag file on the frontend (controldir).
2. The .diag file from the control directory is copied next to the job's session directory and either shared to the worker node (shared sessiondir case) or moved by the LRMS. See more details about shared vs non-shared sessiondir in the Job scratch area document.
3. During job execution the jobscript writes data to the .diag on the worker node. This includes the benchmark, which can be redefined at runtime (e.g. by RunTime Environments in ARC6).
4. After job completion the .diag from the worker node is moved to the frontend's session directory if the sessiondir is not shared.
5. On the frontend the .diag from the session directory is merged with the .diag in the control directory and more information from the LRMS accounting is added to it.
6. A-REX parses the .diag in the control directory and stores the data in the database. From ARC 6.8.0, at this stage the default fallback benchmark from arc.conf is added to the data if it is missing in the .diag.

So, should I do something if I see “HEPSPEC 1.0 is being used” message?

If this is a rare single job that just failed in the LRMS before writing the accounting data, there is nothing to worry about. But if it annoys you, you can fix even single-job data manually as described above. Or, starting from ARC 6.8.0, you can define the fallback benchmark to avoid it completely. To identify how many jobs are missing benchmark data in the database, run the following query:

[root ~]# sqlite3 /var/spool/arc/jobstatus/accounting/accounting.db "select JobID
from AAR where RecordID not in
(select RecordID from JobExtraInfo where InfoKey='benchmark');"

This returns the list of job IDs with missing benchmark data. Then you can use:

[root ~]# arcctl accounting job info <jobid>

to find out what those jobs are. If there are many, then something is definitely going wrong and you should:
1. Check if you are facing the known issues if you are running ARC < 6.8.0. An ARC update + manual records fix will solve your problem in this case.
2. Check the arc.conf syntax with respect to benchmark. It should be defined in the [queue:name] block and use either HEPSPEC or si2k. A manual fix for already stored records is needed anyway.
3. Check that the .diag file contains the information, use arcctl to check the stored data, and check the A-REX logs for any hints.
4. Open a bugzilla ticket if nothing helps.


4.3 ARCHERY

The ARC Hierarchical Endpoints Registry (ARCHERY) is a novel DNS-based registry for e-Science infrastructures to store service endpoints and execution environment definitions. ARCHERY inherits all the benefits of the DNS infrastructure, since deployment, management and operation of the system fully rely on the DNS protocol. ARCHERY implements a minimalistic data model representing services and their endpoints within e-Infrastructures and embeds the service endpoint information directly into the DNS database. All the services available within an e-Infrastructure are indexed by means of an ARCHERY instance. The NorduGrid Collaboration, the organization behind the ARC software, operates a top-level ARCHERY hosted within the nordugrid.org DNS zone. Similar hierarchical service catalogues can be deployed by any organization or project under a dedicated DNS zone. Instructions on how to create such a registry are given here.

Note: Sites are NOT required to run any registration service on an ARC6 CE. Read this document to understand what needs to be done so that an ARC6 CE would appear in an ARCHERY registry.

Previously a different technology was used as a service registry, based on LDAP (see details here). The following documents cover the generic aspects of ARCHERY deployment and operation, including the very specific case of the NorduGrid infrastructure:

4.3.1 ARCHERY deployment for NorduGrid

The NorduGrid topology for ARCHERY

The root element of the hierarchical service registry tree is the nordugrid.org DNS zone hosted by Niels Bohr Institutet. The root zone holds references to:
• per-country registries in the dedicated sub-zones under <country>.archery.nordugrid.org domains, or
• project-based registries under project-owned specific DNS zones (e.g. archery.myproject.org)

Per-Country ARCHERY DNS zones of the NorduGrid

Services indexed by NorduGrid naturally belong to a country. Therefore the deployed NorduGrid ARCHERY topology contains a set of country-level DNS zones.


Fig. 4.8: The NorduGrid ARCHERY topology model

These per-country ARCHERY zones are maintained in a collaborative manner by different organizations responsible for the services of a specific country. A country-level DNS subzone was created for every country that previously ran an ARC CE (the services were taken from the monitor and GOCDB).

Note: Zones whose maintenance is not delegated yet are served by Niels Bohr Institutet along with the root nordugrid.org zone.

The summary table shows the country-level ARCHERY entry points together with the administrative contact details.


Country             Main entry point           Additional entry points   Administration contact
Denmark             dk.archery.nordugrid.org                             Petter Urkedal, [email protected]
Estonia             ee.archery.nordugrid.org                             Zone is not delegated yet
Finland             fi.archery.nordugrid.org                             Zone is not delegated yet
Hungary             hu.archery.nordugrid.org                             Zone is not delegated yet
Portugal and Spain  pt.archery.nordugrid.org                             Zone is not delegated yet
Lithuania           lt.archery.nordugrid.org                             Zone is not delegated yet
Norway              no.archery.nordugrid.org                             Zone is not delegated yet
Slovakia            sk.archery.nordugrid.org                             Zone is not delegated yet
Slovenia            si.archery.nordugrid.org                             Zone is not delegated yet
Sweden              se.archery.nordugrid.org                             Zone is not delegated yet
Switzerland         ch.archery.nordugrid.org                             Zone is not delegated yet
Ukraine             ua.archery.nordugrid.org   grid.org.ua               Andrii Salnikov, [email protected]
UK                  uk.archery.nordugrid.org                             Zone is not delegated yet

Contact us if you want to take over the administration of a country sub-zone.

Project-based ARCHERY Registries of NorduGrid

Another way to organize services is to group them by the hosting project. ARCHERY allows services to be affiliated with multiple projects. It is assumed that projects administer their own list of services and that this information is stored under a dedicated, project-owned ARCHERY DNS zone. The summary table collects the ARCHERY entry points and administration contact information for the projects indexed by NorduGrid.

Registry                  Entry point              Administration contact
NeIC Services for ATLAS   atlas.archery.ndgf.org   Dmytro Karpenko, [email protected]

ARCHERY DNS zone administration tasks

As an ARCHERY DNS zone administrator you are expected to carry out the following administration tasks:
• one-time initial DNS zone setup,
• defining, and keeping up to date, the ARCHERY instance services topology in the configuration file,
• operating regular runs of archery-manage to keep the DNS zone info up to date (e.g. via a cron job).


Registering site in the ARCHERY

In ARCHERY, sites are indexed top-to-bottom and the list of ARC CEs is maintained at the country and/or project level. As a site administrator you technically need to do nothing for site registration.

Warning: In contrast to the previous indexing system, ARCHERY does not require bottom-to-top registration.

The static list of CEs is managed by the ARCHERY administrator. In case your CE is not yet in the registry1, contact your country registry manager to find out the details about the established registration procedures. Contact information for the known ARCHERY managers can be found below: they will provide details of the organizational procedures (if any) upon request. You can also contact us in case of any questions.

4.3.2 Initial setup of ARCHERY instance

This document describes how to set up an ARCHERY instance in a DNS zone, including both the DNS aspects and the initial embedding of service endpoint information. The described procedures apply to both the NorduGrid deployment case and any other ARCHERY instance setup flavour, e.g. country-level, project/experiment, virtual organization, etc.

Note: Administrative access to the DNS server is required during the initial ARCHERY setup to create the DNS zone and configure remote access to this zone. Further operation of the ARCHERY instance DOES NOT require administrative access to the DNS.

Choose DNS zone name

An ARCHERY registry instance is accessible by a DNS name that is used as an entry point (e.g. to submit jobs to the infrastructure). This guide uses example.org as the organization-owned domain name that will contain a zone for the ARCHERY setup. Although it is possible to put records directly into the example.org zone, for manageability and security1 reasons it is strongly advised to have a dedicated DNS sub-zone configured for the ARCHERY records. In the ARCHERY deployment for NorduGrid the per-country sub-zone names are pre-defined. Any other setup can use an arbitrary sub-zone name. This guide uses the index.example.org DNS zone for the ARCHERY setup.

Generate transaction signature key to manage ARCHERY

ARCHERY implies the usage of Dynamic DNS (DDNS) updates to manage the data inside the DNS zone. The archery-manage tool, which is part of the NorduGrid ARC middleware, will do the DDNS updates for you as described below. DDNS eliminates typos, allows maintaining up-to-date information and makes it simple to operate ARCHERY without administrative access to the DNS server itself.
1 You can check the NorduGrid Monitor that takes the ARC CE list from ARCHERY.
1 The zone content will be updated dynamically. Setting up a dedicated sub-zone is the easiest way to isolate the ARCHERY data and completely eliminate the risk of changing other records that are not related to ARCHERY.


Use the following command to generate the key:

[user ~]$ dnssec-keygen -a HMAC-MD5 -b 256 -n USER archery

From the generated files you need the secret part, to be included in both the BIND and archery-manage configuration.

Define key in BIND

Note: Configuration examples in this guide are provided for BIND, however you can use any name server implementation configured in a similar way.

Create the /etc/named/archery.key file and put the generated secret key inside:

key archery_key {
    algorithm hmac-md5;
    secret "S0Me+SecRet+keYgener@tedwithdnssec==";
};

Include the key definition into /etc/named.conf using the following config line:

include "/etc/named/archery.key";

Keyfile for archery-manage

Create a file archery-manage.key and put the generated key in the following format:

archery_key:S0Me+SecRet+keYgener@tedwithdnssec==

Configure DNS zone for ARCHERY

It is generally required by the worldwide DNS infrastructure that at least one slave DNS server is configured for every DNS zone for reliability reasons. In this guide the following addresses will be used:
• Primary (master) DNS: ns1.example.org (192.0.2.100)
• Secondary (slave) DNS: ns2.example.org (192.0.2.200)

Define zone in BIND

Add zone definition to master DNS /etc/named.conf:

zone "index.example.org." IN {
    type master;
    file "master/index.example.org.db";
    notify yes;
    also-notify {
        # slave DNS IP address
        192.0.2.200;
    };
    allow-transfer {
        # slave DNS IP address
        192.0.2.200;
    };
    allow-update {
        key archery_key;
    };
};

Please observe the allow-update directive that authorizes DDNS update requests signed by the archery-manage key. The secondary DNS should be configured without any special options:

zone "index.example.org." IN {
    type slave;
    file "slave/index.example.org.db";
    masters {
        192.0.2.100;
    };
    allow-transfer {
        192.0.2.100;
    };
};

Create zonefile with basic zone info

A basic zonefile requires only the SOA record. It will be filled with data by archery-manage later. You can use the following zonefile template (the timers should be adjusted depending on the planned update frequency):

$ORIGIN example.org.
$TTL 3600
index   IN  SOA ns1.example.org. hostmaster.example.org. (
            2018082401 ; serial
            1200       ; refresh (20 minutes)
            180        ; retry (3 minutes)
            604800     ; expire (1 week)
            60         ; minimum (1 minute)
            )
        NS  ns1.example.org.
        NS  ns2.example.org.

Define records in parent zone

Note: If you set up a country-level index for the Nordugrid infrastructure, such records are defined in the parent nordugrid.org zone. Please provide your DNS setup information to us instead of following this section.

Define NS records2 to refer to the defined subzone:

$ORIGIN example.org.
# dedicated ARCHERY zone
index   NS  ns1.example.org.
index   NS  ns2.example.org.

To create an ARCHERY entry point in the parent zone you can:

2 If you plan to use different out-of-scope domain names in NS, don't forget to add glue A records.


• define a CNAME record to use example.org as an entry point:

$ORIGIN example.org.
# ARCHERY entry point
_archery    CNAME   _archery.index

• OR define a TXT resource record with ARCHERY data pointing to the group:

$ORIGIN example.org.
# ARCHERY entry point
_archery    TXT     "u=index.example.org t=archery.group"

The same technique can be used to define any other DNS alias for an entry point, even in a completely different domain (see the sketch below). CNAME is recommended if you are referring to only one ARCHERY group.
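For instance, a hedged sketch of such an alias defined in a hypothetical unrelated domain myproject.org, pointing to the entry point used in this guide:

$ORIGIN myproject.org.
# ARCHERY entry point alias in another domain
_archery    CNAME   _archery.index.example.org.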

Populate ARCHERY DNS zone with initial data

The archery-manage tool is a dedicated tool to manage information in the ARCHERY DNS zone. It is available as the nordugrid-arc-archery-manage package in the NorduGrid repositories and EPEL. The tool uses a configuration file in plain text or JSON format that defines the services topology. The configuration file syntax is very simple and described in detail in the operations guide. For initial data provisioning you should run archery-manage once, supplying the config, the DNS zone and the path to the transaction signature key generated in the previous steps, as shown in the sketch below. After the zone is populated with data, Operating ARCHERY instance comes down to keeping it up to date by running periodic updates.
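For example, using the zone name, master DNS IP and key file created in the previous steps (the JSON config path is illustrative; the same command-line pattern is shown in the operations guide below):

[user ~]$ archery-manage -s json:/etc/archery.json --ddns-update \
    --domain index.example.org --ddns-master-ip 192.0.2.100 \
    --ddns-tsig-keyfile archery-manage.key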

4.3.3 Operating ARCHERY instance

JSON topology configuration file for ARCHERY

New in version 6.5.
The JSON configuration file for archery-manage is the most flexible and customizable topology source for provisioning ARCHERY. The following sections explain the JSON syntax with example use-cases.

Simple ARC services group

The simplest JSON config that describes a single group of ARC services (matching the arcce-list plain-text source) can be written as follows. The arc-services keyword will trigger automatic endpoint discovery from the ARC information system.

{
  "arc-services": [
    "arc1.example.org",
    "arc2.example.org"
  ]
}

Defining nested groups and arbitrary services

The following JSON configures two subgroups: ARC and Storage.


The ARC subgroup contains 2 ARC CEs. The ARC CE endpoints will be discovered automatically by querying the information system. The Storage subgroup contains 2 manually defined services; the full service endpoint specification is included in the JSON file.

{
  "groups": [
    {
      "id": "ARC",
      "arc-services": [ "arc1.example.org", "arc2.example.org" ]
    },
    {
      "id": "Storage",
      "services": [
        {
          "id": "se1.example.org",
          "type": "DPM",
          "endpoints": [
            { "httpg://se1.example.org:8446/srm/managerv2": "SRM" }
          ]
        },
        {
          "id": "se2.example.org",
          "type": "dCache",
          "endpoints": [
            { "gsiftp://ccsrm.ihep.ac.cn:2811": "gsiftp" },
            { "httpg://ccsrm.ihep.ac.cn:8446/srm/managerv2": "SRM" },
            { "xroot://ccsrm.ihep.ac.cn:1094": "xroot" }
          ]
        }
      ]
    }
  ]
}

Integration with BDII

The archery-manage tool is able to fetch services and their endpoints from a Site-BDII. The external-source keyword in the JSON configuration allows injecting the discovered services into the desired branch of the ARCHERY registry tree.

{
  "groups": [
    {
      "id": "NGI_XX",
      "type": "ngi.type",
      "groups": [
        {
          "id": "XX-Site1",
          "type": "site.type",
          "external-source": {
            "sitebdii": "ldap://bdii.site1.example.org:2170/GLUE2DomainID=XX-Site1,o=glue"
          }
        },
        {
          "id": "XX-Site2",
          "type": "site.type",
          "external-source": {
            "sitebdii": "ldap://bdii.site2.example.org:2170/GLUE2DomainID=XX-Site2,o=glue"
          }
        }
      ]
    },
    {
      "id": "NGI_YY",
      "type": "ngi.type",
      "groups": [
        {
          "id": "YY-WLCG-Site",
          "type": "site.type",
          "external-source": {
            "sitebdii": "ldap://bdii.wlcgsite.example.org:2170/GLUE2DomainID=YY-WLCG-Site,o=glue",
            "filters": [ "vo:atlas,cms", "portscan" ]
          }
        }
      ]
    }
  ]
}

Please notice that for the YY-WLCG-Site site, during the information fetching from the Site-BDII, only services (and their endpoints) that match the defined filters will be added to the ARCHERY registry. In particular, these are the services whose published AccessPolicy allows their endpoints to be used by the atlas or cms VOs and which pass the network reachability test.

ARCHERY as a community trusted software registry

New in version 6.5.
The archery-manage tool is able to provision community trusted software registry objects into the ARCHERY zone. The JSON configuration should define the software and point to the RunTimeEnvironment scripts location:

{
  "software": {
    "rtes_dir": "/home/community/rtesroot"
  }
}

Read more details in this document.

Referencing existing ARCHERY objects

ARCHERY allows embedding overlapping trees into the DNS. For example, if a particular service or group is already represented in DNS, it can be "linked" into another ARCHERY tree. To specify such linking to existing objects, the external-archery-object keyword should be defined in the JSON config.

{
  "groups": [
    {
      "id": "Sweden",
      "type": "country",
      "arc-services": [
        "arc1.example.org",
        "arc2.example.org",
        "arc3.example.org"
      ]
    },
    { "external-archery-object": "dns://_archery.norway.nordugrid.org" },
    {
      "id": "Infrastructure Services",
      "services": [
        { "external-archery-object": "dns://voms.services.cern.ch" },
        {
          "id": "voms.ndgf.org",
          "type": "org.glite.voms",
          "endpoints": [
            { "voms://voms.ndgf.org:15015/nordugrid.org": "org.glite.voms" },
            { "https://voms.ndgf.org:8443/voms/nordugrid.org": "org.glite.voms-admin" }
          ]
        }
      ]
    }
  ]
}

The example JSON configuration above defines 3 subgroups in ARCHERY:
• Sweden ARC services defined with arc-services,
• a reference to the existing country-level ARCHERY deployment under the norway.nordugrid.org domain,
• a common Infrastructure Services group that includes a manual specification of the voms.ndgf.org service endpoints and an external VOMS service defined in another ARCHERY instance under the voms.services.cern.ch domain.

Custom DNS data in ARCHERY zone controlled by archery-manage

It is possible to add arbitrary records into the ARCHERY DNS zone. In particular, this is useful to handle subzone delegation by declaring the NS and A records in the same configuration file. Use the raw-dns keyword to define an array of the DNS records to be managed in the zone. Each record is represented by an object that has a name, a DNS record type and an rdata field that contains:
• a string for a single record,
• a list of strings for a set of records,
• null if this record should be removed from the DNS.

{
  "groups": [
    {
      "id": "si",
      "arc-service": [
        "meja.arnes.si",
        "jost.arnes.si"
      ]
    },
    { "external-archery-object": "dns://_archery.ua.archery.nordugrid.org" }
  ],
  "raw-dns": [
    {
      "name": "ua",
      "type": "NS",
      "rdata": [
        "ns1.ua.archery.nordugrid.org.",
        "ns2.ua.archery.nordugrid.org."
      ]
    },
    { "name": "ns1.ua", "type": "A", "rdata": "194.44.249.94" },
    { "name": "ns2.ua", "type": "A", "rdata": "194.44.249.10" }
  ]
}

The example JSON configuration above defines:
• 2 subgroups: one for Slovenia (si) and one delegated for Ukraine,
• an NS record and 2 glue A records to define the subzone delegation to a different server.

Managing ARCHERY data in DNS

ARCHERY reuses the existing DNS infrastructure services, thus eliminating the need to develop, deploy and operate a new set of custom dedicated services. To simplify the process of rendering ARCHERY records and injecting them into the DNS, ARC6 comes with the archery-manage information management tool.
The archery-manage tool has been designed to simplify common operations with ARCHERY, including the registry initial bootstrap, data migration from other service registries and keeping dynamic information up to date. It discovers service endpoints by querying the resources in the defined topology configuration and then, based on this information, generates DNS records suitable for ARCHERY operations.
Relying on the dynamic DNS updates feature, archery-manage has been designed to modify data in the DNS zone remotely, eliminating the need to interact with the DNS zone configuration itself after the initial setup. This approach also makes the setup and access rights delegation simple, fully separating the DNS hosting itself from the ARCHERY data management machine. An example of the DNS zone configuration can be found in the Initial setup of ARCHERY instance guide.

The archery-manage data processing

It is important to understand the archery-manage data processing chain to efficiently maintain the ARCHERY instance:

Step 1. Define e-Infrastructure topology

Topology data defines how services are grouped within the e-Infrastructure. It comes either from a configuration file or from other databases that hold such information (including another ARCHERY instance). Interaction with already established databases (e.g. GOCDB) simplifies the integration and/or migration process. A static list of ARC CE hostnames defined line-by-line in a plain text file is a trivial topology source that can form a group of computing elements representing a country or an organization.


Fig. 4.9: The archery-manage data processing chain

Step 2. Fetch service data

The topology database provides the pointers to information services that can be used to query the service data. During this step the archery-manage tool discovers available endpoints and fetches service information. For ARC CE hosts, the LDAP GLUE2 infosys is used to discover the available endpoints, with a fall-back to querying the legacy LDAP Nordugrid schema.

Step 3. Filter endpoints

The set of discovered endpoints is then passed to the filtering process. Based on the endpoint data (e.g. endpoint type) or additional testing (e.g. an endpoint network availability check), endpoints that do not pass the filters are excluded. Filters are extensible by design. The up-to-date list of currently supported filters can be obtained by passing -f help to archery-manage. An overview of the most commonly used filters can be found in the operations guide.

Step 4a. Incremental DDNS Update

The target automation use-case is to push the discovered data to the DNS database. This is done automatically with Dynamic DNS updates over the network. By comparing the data already available in the DNS with the discovered information, archery-manage constructs an incremental update that only applies the difference.

Step 4b. Output data

In addition to automatic updating of the DNS database, it is also possible to support the manual operation use-case. For this, the tool, following the same processing chain, can be used to print out endpoint or service lists with their types, or the ARCHERY DNS records that can be manually added to DNS zone configurations.
The first and main task of the administrator of an ARCHERY DNS zone is to keep the services topology up to date. The services topology is a static list of e-Infrastructure services (and optionally their grouping) defined in a simple configuration file (see below). The archery-manage tool then uses the topology configuration file, fetches the necessary information about the defined services and updates the data in the ARCHERY DNS zone.


As an ARCHERY DNS zone administrator you should establish regular updates (e.g. a cron job or similar) to keep the information up to date. The following sections provide details about these operational tasks.

Define infrastructure services topology

In the simplest case, when the topology is a flat list of ARC services, a text file with hostnames specified line-by-line can be used. More advanced topologies (e.g. grouping or non-ARC services) can be described in the JSON topology configuration file for ARCHERY.

Simple topology: flat list of ARC CEs

CE hostnames can be defined line-by-line in a plain text file:

ce01.example.org
ce02.example.org
arc6.example.org

Alternatively you can use JSON config syntax as well:

{
  "arc-services": [
    "ce01.example.org",
    "ce02.example.org",
    "arc6.example.org"
  ]
}

The archery-manage tool will automatically detect the ARC service endpoints using the information services on the ARC CE and push them to the DNS zone with an incremental DDNS update.

Hierarchical topology and non-ARC services

ARCHERY is NOT restricted to ARC services only and can hold any e-Infrastructure service endpoint information. To define such services, the JSON configuration file, as the most flexible source of topology definition for archery-manage, should be used. Please consult the JSON topology configuration file for ARCHERY document for examples of such topology configurations.

Run archery-manage to update DNS zone information

To populate the DNS zone with endpoint information based on the configured services topology you should run archery-manage:

[user ~]$ archery-manage -s arcce-list:ces.list --ddns-update \
    --domain index.example.org --ddns-master-ip 192.0.2.100 \
    --ddns-tsig-keyfile archery-manage.key

The -s option defines the type of configuration file to use and the path to the configuration file, separated by a colon. The arcce-list type corresponds to the plain text list of ARC services. The json type should be used for a JSON configuration file. The --ddns-tsig-keyfile option points to the location of the generated transaction signature key.


The options --domain and --ddns-master-ip define the dedicated ARCHERY DNS zone name and the master DNS server IP, respectively.

Note: Updates are performed over the network, so you can run archery-manage on any host. It SHOULD NOT be the DNS server itself.

The archery-manage tool writes logs to stderr indicating the status of each operation performed during the run, including fetching, filtering and DDNS updates. You can increase the log verbosity with the -d option. Consider adding filters to archery-manage:

Filter: Port connectivity

This filter checks the network connectivity to the endpoint TCP port and excludes endpoints that do not pass this test. It is advised to have the portscan filter enabled during regular operations.

[user ~]$ archery-manage -s arcce-list:ce.list -f portscan ...

Filter: Endpoint type

By default, any service endpoint types that are defined or discovered will be published. If you want to filter endpoints based on type, there is a dedicated filter to apply. For example, if you want only ARC REST endpoints in ARCHERY:

[user ~]$ archery-manage -s json:/etc/archery.json -f type:org.nordugrid.arcrest ...

Both ARC REST and LDAP GLUE2 endpoints:

[user ~]$ archery-manage -s json:/etc/archery.json -f type:org.nordugrid.arcrest,org.nordugrid.ldapglue2 ...

Filter: VO

Note: VO filtering is only available when service endpoints are automatically discovered based on information services (e.g. ARC services, Site-BDII, etc)

For project-based ARCHERY deployment it is also useful to filter endpoints based on VO access policy. Only endpoints that advertise specified VO support will be added1:

[user ~]$ archery-manage -s arcce-list:ce.list -f vo:exampleVO ...

Check the data is embedded to DNS

Once you have populated the ARCHERY DNS zone with data, you can query the ARCHERY data from the DNS to verify that the update is working. This can be done with manual DNS queries (see the sketch below) or a job submission.
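For example, a hedged manual check with dig, querying the entry-point TXT records of the zone used in this guide; the returned record content will reflect the groups and services you have pushed:

[user ~]$ dig +short TXT _archery.index.example.org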

1 Resource information (GLUE2PolicyRule in GLUE2 and nordugrid-cluster-acl in Nordugrid LDAP) will be used as a source of supported VOs.


Setup regular updates to ARCHERY

To keep the endpoint information up to date, set up a cron job (or systemd timer) to run archery-manage regularly. In combination with at least the port connectivity filter, this eliminates stale endpoints and keeps the information in the registry current.
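A minimal sketch of such a periodic update as a crontab entry, reusing the zone parameters from the examples above (the file locations and the 15-minute interval are assumptions to adjust for your setup):

*/15 * * * * archery-manage -s arcce-list:/etc/archery/ces.list -f portscan --ddns-update --domain index.example.org --ddns-master-ip 192.0.2.100 --ddns-tsig-keyfile /etc/archery/archery-manage.key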

ARCHERY operations hints

Optimize information fetching frequency

The set of available service endpoints itself is not subject to rapid change, but the endpoint availability status is. The automatic endpoint fetching done by archery-manage can therefore run less frequently than the actual endpoint availability scan, using the JSON configuration output:

1. Set up endpoint fetching into an expanded JSON config. Run archery-manage with the defined topology and desired filters but without the DDNS update. This job can run less frequently, e.g. once daily.

[user ~]$ archery-manage -s json:/etc/archery.json -f vo:exampleVO -o json > /etc/archery-expanded.json

2. Update the information in the ARCHERY DNS zone using the expanded JSON config. Use the obtained expanded JSON config for the actual ARCHERY DNS zone updates. This job should run more frequently (e.g. every 5 minutes), with the portscan filter to eliminate stale endpoints.

[user ~]$ archery-manage -s json:/etc/archery-expanded.json -f portscan \
            --ddns-update --ddns-tsig-keyfile archery-manage.key \
            --domain index.example.org --ddns-master-ip 192.0.2.100

Configure LDAP-monitor to use ARCHERY

The popular ARC CE monitoring tool, the LDAP monitor, can be used with ARCHERY out of the box. It supports fetching endpoints [2] from ARCHERY with the following configuration in settings.inc:

"example.org", ) ); ?>

4.3.4 Querying ARCHERY registry data

On the registry client level, ARCHERY benefits from distributed DNS caching at many levels, making the registry scalable.

[2] The Nordugrid monitor only works with org.nordugrid.ldapng data (LDAP GLUE2 support is experimental), so publishing the Nordugrid LDAP schema is mandatory to use the monitor.


Since a DNS client is an integral part of any operating system, obtaining and processing service endpoint information from ARCHERY comes down to parsing and interpreting the data obtained from the DNS in accordance with the ARCHERY data model rendering. There are several tools and options available to fetch ARCHERY data:

The archery-manage tool as registry client

Using ARCHERY as a topology source for archery-manage allows the tool's output capabilities to be used as a registry client. For example:

[user ~]$ archery-manage -s archery:moldyngrid.org -o arc-CEs
golowood.mao.kiev.ua
arc.imbg.org.ua
arc.univ.kiev.ua
uagrid.org.ua
grid.isma.kharkov.ua

[user ~]$ archery-manage -s archery:egi.grid.org.ua -o services
glue:sehn02.atlas.ualberta.ca/data : org.dcache.storage
glue:lcg-se1.sfu.computecanada.ca/data : org.dcache.storage
urn:ogf:ComputingService:lcg-ce1.sfu.computecanada.ca:arex : org.nordugrid.arex
urn:ogf:ComputingService:lcg-ce2.sfu.computecanada.ca:arex : org.nordugrid.arex
neutsrv1.triumf.ca_bdii-site_497425383 : bdii_site

[user ~]$ archery-manage -s archery:nordugrid.org -o endpoints --json | jq .
[
  {
    "t": "org.nordugrid.gridftpjob",
    "u": "gsiftp://deckard.dcsc.ku.dk:2811/jobs",
    "rr_data": "u=gsiftp://deckard.dcsc.ku.dk:2811/jobs t=org.nordugrid.gridftpjob"
  },
  {
    "t": "org.nordugrid.ldapglue2",
    "u": "ldap://deckard.dcsc.ku.dk:2135/o=glue",
    "rr_data": "u=ldap://deckard.dcsc.ku.dk:2135/o=glue t=org.nordugrid.ldapglue2"
  },

Any DNS client for manual data retrieval

Any DNS client or library can be used to obtain ARCHERY resource record sets. The rendering is designed to be simple, and the raw data is human-readable:

[user ~]$ host -t TXT _archery.nordugrid.org
_archery.nordugrid.org descriptive text "u=uk.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=ua.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=fi.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=lt.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=si.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=hu.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=ch.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=atlas.archery.ndgf.org t=archery.group"
_archery.nordugrid.org descriptive text "u=pt.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=se.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=no.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=sk.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=ee.archery.nordugrid.org t=archery.group"
_archery.nordugrid.org descriptive text "u=dk.archery.nordugrid.org t=archery.group"

[user ~]$ host -t TXT _archery.dk.archery.nordugrid.org
_archery.dk.archery.nordugrid.org descriptive text "u=dns://e06b294c0d._archery.dk.archery.nordugrid.org. t=archery.service"

[user ~]$ host -t TXT e06b294c0d._archery.dk.archery.nordugrid.org
e06b294c0d._archery.dk.archery.nordugrid.org descriptive text "o=service t=org.nordugrid.arex id=deckard.dcsc.ku.dk"
e06b294c0d._archery.dk.archery.nordugrid.org descriptive text "u=ldap://deckard.dcsc.ku.dk:2135/o=glue t=org.nordugrid.ldapglue2"
e06b294c0d._archery.dk.archery.nordugrid.org descriptive text "u=ldap://deckard.dcsc.ku.dk:2135/Mds-Vo-Name=local,o=grid t=org.nordugrid.ldapng"
e06b294c0d._archery.dk.archery.nordugrid.org descriptive text "u=gsiftp://deckard.dcsc.ku.dk:2811/jobs t=org.nordugrid.gridftpjob"

ARC Client/SDK endpoint retrieval plugin

The ARCHERY endpoint retrieval plugin for the ARC client/SDK is part of releases starting from ARC 5.4.4. It is installed within the core plugins set and can be used transparently with the -g option:

[manf@arc-client ~]$ arctest -J2 -g moldyngrid.org -d INFO
INFO: Configuration (/etc/arc/client.conf) loaded
INFO: Configuration (/home/manf/.arc/client.conf) loaded
INFO: Using proxy file: /home/manf/.globus/x509_user_proxy
INFO: Using certificate file: /home/manf/.globus/usercert.pem
INFO: Using key file: /home/manf/.globus/userkey.pem
INFO: Using CA certificate directory: /etc/grid-security/certificates
INFO: Broker Random loaded
INFO: Found service endpoint index.moldyngrid.org (type org.nordugrid.archery)
INFO: Status for service endpoint "dns://d9b05b0008._archery.index.moldyngrid.org." is set to inactive in ARCHERY. Skipping.
INFO: Status for service endpoint "dns://ac525b5fea._archery.index.moldyngrid.org." is set to inactive in ARCHERY. Skipping.
INFO: Found service endpoint dns://7c99fb635c._archery.index.moldyngrid.org. (type archery.service)
INFO: Found service endpoint dns://616f87be8e._archery.index.moldyngrid.org. (type archery.service)
INFO: Found service endpoint dns://46e36be5ec._archery.index.moldyngrid.org. (type archery.service)
INFO: Found service endpoint ldap://grid.isma.kharkov.ua:2135/Mds-Vo-Name=local,o=grid (type org.nordugrid.ldapng)
INFO: Found service endpoint https://grid.isma.kharkov.ua:60443/arex (type org.ogf.glue.emies.activitymanagement)
INFO: Found service endpoint gsiftp://arc6.univ.kiev.ua:2811/jobs (type org.nordugrid.gridftpjob)
INFO: Found service endpoint ldap://arc6.univ.kiev.ua:2135/o=glue (type org.nordugrid.ldapglue2)
INFO: Found service endpoint https://arc6.univ.kiev.ua:443/arex (type org.nordugrid.arcrest)
Submitting test-job 2:
&( executable = "/usr/bin/env" )( stdout = "stdout" )( stderr = "stdout" )( gmlog = "gmlog" )( jobname = "arctest2" )( clientxrsl = "&( executable = ""/usr/bin/env"" )( jobname = ""arctest2"" )( stdout = ""stdout"" )( join = ""yes"" )( gmlog = ""gmlog"" )" )
Client version: nordugrid-arc-6.2.0
Test submitted with jobid: gsiftp://arc6.univ.kiev.ua:2811/jobs/RHZKDmPeedvnjwO5upha6lOqABFKDmABFKDmIeLKDmABFKDmItfmym

NorduGrid monitor fetching ARCHERY data

The NorduGrid infrastructure monitoring web application (aka monitor) is able to visualize real-time ARC CE information obtained using ARCHERY service endpoint discovery. If you have your own monitor setup, you can easily configure ARCHERY as an information source.

A community instance of ARCHERY is capable of holding execution environment definitions (known as RunTime Environments). This turns ARCHERY into an all-in-one community registry that covers both the resources available to the community and the execution software environments verified and used by the community. The following document covers the usage of ARCHERY as a RunTime Environments registry:

4.3.5 ARCHERY as a community-defined RTEs registry

New in version 6.5.

Community-defined RTEs is a new concept that aims to automate software environment provisioning for distributed computing infrastructures. Community-defined RTEs are prepared and digitally signed by the community. All RTEs are then indexed in the registry to be discoverable. The ARC CE admin can deploy such RTEs with a single ARC Control Tool command after establishing a trust chain with the community. This document describes how to use ARCHERY as a Community-defined software environment registry.

1. Signing keys

The trust chain between the community and the site admin is based on digital signatures. All Community-defined RTEs are supposed to be signed using the OpenPGP standard for signatures. The technical implementation relies on the GNU Privacy Guard (GPG) software. If you already have a GPG keypair it can be used to sign community RTEs. If not, generate a new keypair with:

[user ~]$ gpg --gen-key

2. Preparing RTEs

A Community-defined RTE script is any script that follows the RunTime Environments in ARC6 design for both the naming scheme and the content. In many production cases community RTEs require additional files that need to be deployed on the ARC CE. Such files can be listed within the RTE script metadata (at the top of the file), using the download keyword, for example:


# download: url:https://example.org/mySW.sif checksum:sha256:1c0176d901301be17f86df3330f121d0ae603652eced38e8dbae413f75fad670

This will instruct arcctl to download the mySW.sif singularity image from https://example.org/, verify the file checksum and place it into the software deployment directory for this community. During job execution these downloaded files are available to the payload. The path to the directory containing these files is stored in the RUNTIME_JOB_SWDIR variable in the job context.

Note: If the RTE requires a complete software package bundle, it is advised to deploy a Singularity container or at least a single tarball on the ARC CE. Then write simple commands for RTE stage 1 to extract the files or define paths to the container image.
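As an illustration, a minimal sketch of a community RTE script for the hypothetical mySW.sif image above could look as follows; it assumes the usual RTE convention that the script receives the stage number as its first argument (see RunTime Environments in ARC6), and MYSW_IMAGE is an arbitrary variable name chosen for this example:

#!/bin/bash
# APPS/COMMUNITY/MYSW-1.0.0 -- hypothetical community-defined RTE (sketch only)
case "$1" in
  0)
    # stage 0: job description processing on the CE; nothing to adjust here
    ;;
  1)
    # stage 1: runs in the job context before the payload starts;
    # files listed with the "download" keyword end up in RUNTIME_JOB_SWDIR
    export MYSW_IMAGE="${RUNTIME_JOB_SWDIR}/mySW.sif"
    ;;
  2)
    # stage 2: runs after the payload finishes; no cleanup needed here
    ;;
esac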

3. Signing and publishing RTEs

The ARCHERY Manage Tool completely automates the RTE signing and publishing process.

Just do it

In case your RTEs are organized in the typical directory-based structure, all you need to do is add the following configuration to the JSON topology configuration file for ARCHERY:

{ "software":{ "rtes_dir": "/home/community/rtesroot" } }

This configuration will instruct archery-manage to:
• export the public key from the GPG database (default location) to be published
• index all RTEs available in the specified directory
• extract descriptions from RTE scripts, if available (to be published)
• sign all RTE scripts with GPG
• embed the public key, the RTEs index and the signed RTEs content into the ARCHERY zone
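With this in place, a publishing run can combine the community software description with the DDNS update shown earlier, for example (zone parameters and key file are the ones assumed in the previous examples):

[user ~]$ archery-manage -s json:/etc/archery.json --ddns-update \
            --domain index.example.org --ddns-master-ip 192.0.2.100 \
            --ddns-tsig-keyfile archery-manage.key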

Customize the process

GPG location and key

In case your GPG database is not in the standard location and/or you have several keypairs in the database use the following options to point archery-manage to the right place:

{ "software":{ "gpg_home": "/srv/cummunity-gpg", "gpg_keyid": "[email protected]", "rtes_dir": "/home/community/rtesroot" } }


Manually define public key

In case you are not going to sign RTEs on the same machine, you probably have to define the community public key manually. One way to do it is to provide a URL that points to the key location:

{ "software":{ "pubkey_url": "https://example.org/gpg.public.key", "rtes_dir": "/home/community/rtesroot" } }

Or you can put the base64-encoded key value [1] directly into the configuration:

{ "software":{ "pubkey":

˓→"mQENBF4y4ssBCADN9317J1KiEZSkDX2T00sCjtRLcL46XcN2llxHPFJm43ziG2GVqFHdoLQEhoN4ozvfNKxndTFkNTh6aP2CdG9ml0mzsEuHc1SYqrB+uSSOrVPz3LF95tjVJgJTFJQmY/

˓→hLD5/M12Iq4ayYdBBwLJFdZDg/PSclX3kRFS4CDDvVtlXfQ1Jc/XLaVOk7nFDdbTbFEvfenMgcen/

˓→cxmlj1GI7IplNoiFoBjLP72NwbPW/

˓→WdxGkPzHO+9ypW1WSYHNTL9442zXypcOYyfRhxqae5D+pSIEmptu7279Hr9u7OlhYBRQW4uGI7CTiuqWxBYUKVK1s0UyjB79EJRtzN5c+fgZABEBAAG0L01vbER5bkdyaWQgVmlydHVhbCBMYWIgPHN1cHBvcnRAbW9sZHluZ3JpZC5vcmc+iQFUBBMBCAA+FiEEOkfw1OQG2FTtqq21j9bdV6pWp3UFAl4y4ssCGwMFCQPCZwAFCwkIBwIGFQoJCAsCBBYCAwECHgECF4AACgkQj9bdV6pWp3Ve9AgAwnAiecFLWznKg8n9/

˓→VBE5jYENAA7uRe3hxeBqvlm0wmohBQ2QTDSPgwuZqp7M5kiK/

˓→FPf8q0QtyZGoYlCGUQc0dsBXyRjnHOsUiTKl0M9/

˓→PDc+u+Y2teR2XibdQ8ja8CedU09NT5Lbhx+MSv2azLSxLjB7zAVEzPCfxgcXX3G5zfNFsO4+5FoR1GsoGirEN9dkZAxGa71hMh6bEkqAqJrK+wpDPqGlB9mmwi2MsDnfd5E3N2rl5Z8rDDoXL5tdlk0kKN9iaycXqCEZU7gs1v+fSe8N/

˓→RBDV85aiE4RLeCZ+LQL5IND6ftezQuw3GfRzy8w5EycGCpfWoAr+D5WBhibkBDQReMuLLAQgAr9onAVIzORKjbCapUGVCiTsNpTNzKsevMiwxqVanyr3iZLpEgGXrP7yLjxvKY4ow2nZMr9AdqyPAA==

˓→", "rtes_dir": "/home/community/rtesroot" } }

Host signed RTEs instead of embedding into DNS

The RTE scripts are typically small enough to be embedded directly into the ARCHERY DNS zone. But if you want to publish them on the web instead, just use the signed_rtes_url option.

{ "software":{ "signed_rtes_url": "https://example.org/rtes/", "rtes_dir": "/home/community/rtesroot" } }

In this case ARCHERY will store only the references to signed RTE scripts in the https://example.org/rtes/ location. Signed RTE scripts will be saved locally in the signed directory [2] and NEED TO BE manually uploaded to the web hosting to be actually accessible in that location.

Manually define RTEs

Instead of organizing RTEs in the directory-like structure, they can be explicitly described in the configuration one by one. This allows more flexibility, as you can point to already existing and signed RTEs published anywhere. To manually describe community RTEs, define the rtes array in the configuration:


{ "software":{ "rtes":[ { "name": "APPS/COMMUNITY/SW1-1.0.0", "description": "Community SW1", "url": "https://example.org/rtes/SW1-1.0.0.signed", }, { "name": "APPS/COMMUNITY/SW2-1.7.0", "description": "Community SW2", "data":

˓→"owGbwMvMwMHYf+1u+Kqw5aWMaxmPJ3EEG+ka6xnoGcRZrbinrJCSWpxclFlQkpmfZ6Xgm5/

˓→jUpnnXpSZohBspBCcX15SnliUqqCRklmkyQVUm1+el5OfmKJQWpRjlVFSUlBspa+fm5+TUpmXDtSjl1+Url9UklqsX2ykV5yZppCckZqcXVyaa1WckWhkamZlmGxgaG6WYmlgaGxgmJRqaJ5mYZaSZmxsbJBmaGSYYpCYamZgbGZqlJqcmmJskWxOubdHvkVqmRzX9vn/

˓→FwA=", }, { "name": "APPS/COMMUNITY/SW3-0.2.1", "path": "/home/community/rtes/APPS/COMMUNITY/SW3-0.2.1" } ] } }

The array contains objects describing RTEs. Each object defines the name, optionally a description, and points to the content of the community-defined RTE script, using one of the following options:
• url - provide a URL to the signed RTE script
• data - provide the base64-encoded signed RTE script content
• path - local path to the plain (unsigned) RTE script location
To sign an RTE script without archery-manage, you can use the following command:

[console ~]# gpg --output rtescript.signed --sign rtescript.sh

It is possible to use both rtes array and rtes_dir simultaneously.
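On the ARC CE side, a site admin then establishes the trust chain and deploys the published RTEs with the ARC Control Tool (see the arcctl rte community reference below); the community and RTE names in this sketch are hypothetical:

[root ~]# arcctl rte community add exampleVO.example.org
[root ~]# arcctl rte community rte-deploy exampleVO.example.org APPS/COMMUNITY/SW1-1.0.0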

[1] Use gpg --export | base64 -w0 to get the value
[2] Can also be redefined with the additional signed_rtes_dir option

Note: If you are looking for more details behind the ARCHERY idea, read this paper.

4.4 ARC Admin Tools Reference

This section holds an online version of the command line options reference for ARC admin tools.

4.4.1 ARC Control Tool

NorduGrid ARC Control Tool

usage: arcctl [-h] [-c CONFIG] [-d {CRITICAL,ERROR,WARNING,INFO,DEBUG}] COMPONENT ...

Named Arguments

-c, --config config file location (default is /etc/arc.conf)


-d, --debug Possible choices: CRITICAL, ERROR, WARNING, INFO, DEBUG verbosity level (default is “WARNING”) Default: “WARNING”

ARC Components

COMPONENT Possible choices: deploy, test-ca, config, service, rte, accounting, job, cache, datastaging DESCRIPTION

Sub-commands: deploy

Third party components deployment arcctl deploy [-h] ACTION...

Deployment Actions

ACTION Possible choices: igtf-ca, vomses, voms-lsc, iptables-config DESCRIPTION

Sub-commands: igtf-ca

Deploy IGTF CA certificates arcctl deploy igtf-ca [-h] [-i {igtf,egi-trustanchors,nordugrid}] {classic,iota,mics,slcs} [{classic,iota,mics,slcs}...]

Positional Arguments

bundle Possible choices: classic, iota, mics, slcs IGTF CA bundle name

Named Arguments

-i, --installrepo Possible choices: igtf, egi-trustanchors, nordugrid Add specified repository that contains IGTF CA certificates

vomses

Deploy VOMS client configuration files arcctl deploy vomses [-h] (-v VOMS|-e) [-u] [-c] vo


Positional Arguments

vo VO Name

Named Arguments

-v, --voms VOMS-Admin URL
-e, --egi-vo Fetch information from EGI VOs database Default: False
-u, --user Install to user’s home instead of /etc Default: False
-c, --use-client-cert Use client certificate to contact VOMS-Admin Default: False

voms-lsc

Deploy VOMS server-side list-of-certificates files arcctl deploy voms-lsc [-h] (-v VOMS|-e) [--pythonssl] vo

Positional Arguments

vo VO Name

Named Arguments

-v, --voms VOMS-Admin URL
-e, --egi-vo Fetch information from EGI VOs database Default: False
--pythonssl Use Python SSL module to establish TLS connection (default is to call external OpenSSL binary) Default: False

iptables-config

Generate iptables config to allow ARC CE configured services arcctl deploy iptables-config [-h] [--any-state] [--multiport]

Named Arguments

--any-state Do not add '--state NEW' to filter configuration Default: False
--multiport Use one-line multiport filter instead of per-service entries Default: False

test-ca

ARC Test CA control arcctl test-ca [-h] [--ca-id CA_ID] [--ca-dir CA_DIR] ACTION...

Named Arguments

--ca-id Define CA ID to work with (default is to use hostname-based hash) --ca-dir Redefine path to CA files directory

Test CA Actions

ACTION Possible choices: init, cleanup, hostcert, usercert DESCRIPTION

Sub-commands: init

Generate self-signed TestCA files arcctl test-ca init [-h] [-d {md2,md4,md5,mdc2,sha1,sha224,sha256,sha384,sha512}] [-v VALIDITY] [-f]

Named Arguments

-d, --digest Possible choices: md2, md4, md5, mdc2, sha1, sha224, sha256, sha384, sha512 Digest to use (default is “sha256”) Default: “sha256”
-v, --validity Validity of certificate in days (default is 90) Default: 90
-f, --force Overwrite files if exist Default: False

cleanup

Cleanup TestCA files arcctl test-ca cleanup [-h]

hostcert

Generate and sign testing host certificate arcctl test-ca hostcert [-h] [-d {md2,md4,md5,mdc2,sha1,sha224,sha256,sha384,sha512}] [-v VALIDITY] [-n HOSTNAME] [-f] [-t]

Named Arguments

-d, --digest Possible choices: md2, md4, md5, mdc2, sha1, sha224, sha256, sha384, sha512 Digest to use (default is “sha256”) Default: “sha256”
-v, --validity Validity of certificate in days (default is 30) Default: 30
-n, --hostname Generate certificate for specified hostname instead of this host
-f, --force Overwrite files if exist Default: False
-t, --export-tar Export tar archive to use from another host Default: False

usercert

Generate and sign testing user certificate arcctl test-ca usercert [-h] [-d {md2,md4,md5,mdc2,sha1,sha224,sha256,sha384,sha512}] [-v VALIDITY] [-n USERNAME] [-i INSTALL_USER] [-t] [-f] [--no-auth]

Named Arguments

-d, --digest Possible choices: md2, md4, md5, mdc2, sha1, sha224, sha256, sha384, sha512 Digest to use (default is “sha256”) Default: “sha256”
-v, --validity Validity of certificate in days (default is 30) Default: 30
-n, --username Use specified username instead of automatically generated
-i, --install-user Install certificates to $HOME/.globus for specified user instead of workdir
-t, --export-tar Export tar archive to use from another host Default: False
-f, --force Overwrite files if exist Default: False


--no-auth Do not add user subject to allowed list Default: False

config

ARC CE configuration control arcctl config [-h] ACTION...

Config Actions

ACTION Possible choices: dump, get, describe, brief, verify DESCRIPTION

Sub-commands: dump

Dump ARC CE running configuration arcctl config dump [-h]

get

Print configuration option value arcctl config get [-h] block option

Positional Arguments

block Name of configuration block (without square brackets)
option Configuration option name

describe

Describe configuration option arcctl config describe [-h] [-r REFERENCE] block option

Positional Arguments

block Name of configuration block (without square brackets)
option Configuration option name


Named Arguments

-r, --reference Redefine arc.conf.reference location (default is “/usr/share/doc/nordugrid-arc//arc.conf.reference”) Default: “/usr/share/doc/nordugrid-arc//arc.conf.reference”

brief

Print configuration brief points arcctl config brief [-h] [-t {storage,logs}]

Named Arguments

-t, --type Possible choices: storage, logs Show brief only for provided options type

verify

Verify ARC CE configuration syntax arcctl config verify [-h]

service

ARC CE services control arcctl service [-h] ACTION...

Services Actions

ACTION Possible choices: enable, disable, start, restart, stop, list DESCRIPTION

Sub-commands: enable

Enable ARC CE services arcctl service enable [-h] [--now] (-a|-s SERVICE)

Named Arguments

--now Start the services just after enable Default: False


-a, --as-configured Use information from arc.conf to get services list Default: False
-s, --service Service name

disable

Disable ARC CE services arcctl service disable [-h] [--now] (-a|-s SERVICE)

Named Arguments

--now Stop the services just after disable Default: False
-a, --as-configured Use information from arc.conf to get services list Default: False
-s, --service Service name

start

Start ARC CE services arcctl service start [-h] (-a|-s SERVICE)

Named Arguments

-a, --as-configured Use information from arc.conf to get services list Default: False
-s, --service Service name

restart

Restart ARC CE services arcctl service restart [-h] (-a|-s SERVICE)

Named Arguments

-a, --as-configured Use information from arc.conf to get services list Default: False
-s, --service Service name

stop

Stop ARC CE services arcctl service stop [-h] (-a|-s SERVICE)

Named Arguments

-a, --as-configured Use information from arc.conf to get services list Default: False
-s, --service Service name

list

List ARC CE services and their states arcctl service list [-h] [-i|-e|-a]

Named Arguments

-i, --installed Show only installed services Default: False
-e, --enabled Show only enabled services Default: False
-a, --active Show only running services Default: False

rte

RunTime Environments arcctl rte [-h] ACTION...

RunTime Environments Actions

ACTION Possible choices: enable, disable, list, default, undefault, cat, params-get, params-set, params-unset, community DESCRIPTION

Sub-commands: enable

Enable RTE to be used by A-REX arcctl rte enable [-h] [-f] [-d] rte [rte...]


Positional Arguments

rte RTE name

Named Arguments

-f, --force Force RTE enabling Default: False
-d, --dummy Enable dummy RTE that does nothing but is published in the infosys Default: False

disable

Disable RTE to be used by A-REX arcctl rte disable [-h] rte [rte...]

Positional Arguments

rte RTE name

list

List RunTime Environments arcctl rte list [-h] [-l] [-e|-d|-a|-s|-u|-n|-c]

Named Arguments

-l, --long Detailed listing of RTEs Default: False -e, --enabled List enabled RTEs Default: False -d, --default List default RTEs Default: False -a, --available List available RTEs Default: False -s, --system List available system RTEs Default: False -u, --user List available user-defined RTEs Default: False -n, --dummy List dummy enabled RTEs Default: False


-c, --community List deployed community RTEs Default: False

default

Transparently use RTE for every A-REX job arcctl rte default [-h] [-f] rte [rte...]

Positional Arguments

rte RTE name

Named Arguments

-f, --force Force RTE enabling Default: False

undefault

Remove RTE from transparent A-REX usage arcctl rte undefault [-h] rte [rte...]

Positional Arguments

rte RTE name

cat

Print the content of RTE file arcctl rte cat [-h] rte

Positional Arguments

rte RTE name

params-get

List configurable RTE parameters arcctl rte params-get [-h] [-l] rte

Positional Arguments

rte RTE name


Named Arguments

-l, --long Detailed listing of parameters Default: False

params-set

Set configurable RTE parameter arcctl rte params-set [-h] rte parameter value

Positional Arguments

rte RTE name
parameter RTE parameter to configure
value RTE parameter value to set

params-unset

Use default value for RTE parameter arcctl rte params-unset [-h] rte parameter

Positional Arguments

rte RTE name
parameter RTE parameter to unset

community

Operating community-defined RunTimeEnvironments arcctl rte community [-h] ACTION...

Community RTE Actions

ACTION Possible choices: add, remove, list, config-get, config-set, rte-list, rte-cat, rte-deploy, rte-remove DESCRIPTION

Sub-commands: add

Add new trusted community to ARC CE


arcctl rte community add [-h] [-f FINGERPRINT] [-a ARCHERY|-u URL|--pubkey PUBKEY|--keyserver KEYSERVER] community

Positional Arguments

community Trusted community name

Named Arguments

-f, --fingerprint Fingerprint of the community key
-a, --archery Use ARCHERY domain name (this is the default with community name as a domain)
-u, --url Use JSON URL
--pubkey Manually defined location (URL) of the public key
--keyserver Manually defined location of PGP keyserver

remove

Remove trusted community from ARC CE arcctl rte community remove [-h] [-f] community

Positional Arguments

community Trusted community name

Named Arguments

-f, --force Disable and undefault all community RTEs automatically Default: False

list

List trusted communities arcctl rte community list [-h] [-l]

Named Arguments

-l, --long Print more information Default: False

config-get

Get config variables for trusted community arcctl rte community config-get [-h] [-l] community [option [option...]]

Positional Arguments

community Trusted community name option Configuration option name

Named Arguments

-l, --long Print more information Default: False

config-set

Set config variable for trusted community arcctl rte community config-set [-h] community option value

Positional Arguments

community Trusted community name
option Configuration option name
value Configuration option value

rte-list

List RTEs provided by community arcctl rte community rte-list [-h] [-l|-a|-d] community

Positional Arguments

community Trusted community name

Named Arguments

-l, --long Print more information Default: False -a, --available List RTEs available in the software registry Default: False -d, --deployed List deployed community RTEs Default: False

rte-cat

Print the content of RTEs provided by community arcctl rte community rte-cat [-h] community rtename

Positional Arguments

community Trusted community name
rtename RunTimeEnvironment name

rte-deploy

Deploy RTE provided by community arcctl rte community rte-deploy [-h] [-u URL] [-f] [--insecure] community rtename

Positional Arguments

community Trusted community name rtename RunTimeEnvironment name

Named Arguments

-u, --url Explicitly define URL to signed RTE file
-f, --force Force RTE files redeployment if they already exist Default: False
--insecure Do not validate community signature for URL-based deployment Default: False

rte-remove

Remove deployed community RTE arcctl rte community rte-remove [-h] [-f] community rtename

Positional Arguments

community Trusted community name rtename RunTimeEnvironment name

Named Arguments

-f, --force Disable and undefault RTE automatically Default: False

accounting

A-REX Accounting records management arcctl accounting [-h] ACTION...

Accounting Actions

ACTION Possible choices: stats, job, republish DESCRIPTION

Sub-commands: stats

Show A-REX AAR statistics arcctl accounting stats [-h] [-b END_FROM] [-e END_TILL] [-s START_FROM] [--filter-vo FILTER_VO] [--filter-user FILTER_USER] [--filter-state FILTER_STATE] [--filter-queue FILTER_QUEUE] [--filter-endpoint FILTER_ENDPOINT] [--filter-extra ATTRIBUTE VALUE] [-o {brief,jobcount,walltime,cputime,data-staged-in,data-staged-out,wlcgvos,users,jobids,json}]
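For example, an illustrative invocation summarizing walltime for a single VO over a given completion-time range (the dates and VO name are placeholders):

[root ~]# arcctl accounting stats -b 2021-01-01 -e 2021-02-01 --filter-vo exampleVO -o walltime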

Named Arguments

-b, --end-from Define the job completion time range beginning (YYYY-MM-DD [HH:mm[:ss]])
-e, --end-till Define the job completion time range end (YYYY-MM-DD [HH:mm[:ss]])
-s, --start-from Define the job start time constraint (YYYY-MM-DD [HH:mm[:ss]])
--filter-vo Account jobs owned by specified WLCG VO(s)
--filter-user Account jobs owned by specified user(s)
--filter-state Account jobs in the defined state(s)
--filter-queue Account jobs submitted to the defined queue(s)
--filter-endpoint Account jobs submitted via defined endpoint type(s)
--filter-extra Filter extra attributes (e.g. jobname, project, vomsfqan, rte, dtrurl, etc)
-o, --output Possible choices: brief, jobcount, walltime, cputime, data-staged-in, data-staged-out, wlcgvos, users, jobids, json Define what kind of stats you want to output (default is “brief”) Default: “brief”

job

Show job accounting data


arcctl accounting job [-h] ACTION...

Job Accounting Actions

ACTION Possible choices: info, events, transfers DESCRIPTION

Sub-commands: info

Show job accounting data arcctl accounting job info [-h] [-o {all,description,resources,rtes,authtokens,json}] jobid

Positional Arguments

jobid Job ID

Named Arguments

-o, --output Possible choices: all, description, resources, rtes, authtokens, json Define what kind of job information you want to output (default is “all”) Default: “all”

events

Show job event history arcctl accounting job events [-h] jobid

Positional Arguments

jobid Job ID

transfers

Show job data transfers statistics arcctl accounting job transfers [-h] jobid

Positional Arguments

jobid Job ID

republish

Republish accounting records to defined target arcctl accounting republish [-h] -b END_FROM -e END_TILL (-t TARGET_NAME|-a APEL_URL|-s SGAS_URL) [--apel-topic {gLite-APEL,/queue/global.accounting.test.cpu.central}] [--apel-messages {urs,summaries}] [--gocdb-name GOCDB_NAME] [--localid-prefix LOCALID_PREFIX] [--vofilter VOFILTER] [--urbatchsize URBATCHSIZE]

Named Arguments

-b, --end-from Define republishing timeframe start (YYYY-MM-DD [HH:mm[:ss]]) -e, --end-till Define republishing timeframe end (YYYY-MM-DD [HH:mm[:ss]]) -t, --target-name Specify configured accounting target name from arc.conf (e.g. neic_sgas). -a, --apel-url Specify APEL server URL (e.g. https://msg.argo.grnet.gr) -s, --sgas-url Specify SGAS server URL (e.g. https://grid.uio.no:8001/logger)

APEL

Options to be used when target is specified using --apel-url
--apel-topic Possible choices: gLite-APEL, /queue/global.accounting.test.cpu.central Define APEL topic (default is gLite-APEL)
--apel-messages Possible choices: urs, summaries Define APEL messages (default is summaries)
--gocdb-name (Re)define GOCDB site name

SGAS

Options to be used when target is specified using --sgas-url
--localid-prefix Define optional SGAS localid prefix

Other options

Works for both APEL and SGAS targets
--vofilter Republish only jobs owned by these VOs
--urbatchsize Size of records batch to be sent (default is 50 for SGAS, 500 for APEL)

job

A-REX Jobs arcctl job [-h] [-t CACHETTL] ACTION...


Named Arguments

-t, --cachettl GM-Jobs output caching validity in seconds (default is 30) Default: 30

Jobs Control Actions

ACTION Possible choices: list, script, log, info, stdout, stderr, attr, kill, killall, clean, cleanall, stats, accounting, datastaging DESCRIPTION

Sub-commands: list

List available A-REX jobs arcctl job list [-h] [-l] [-s {ACCEPTED,PREPARING,SUBMIT,INLRMS,FINISHING,FINISHED,DELETED,CANCELING}] [-o OWNER]

Named Arguments

-l, --long Detailed listing of jobs Default: False
-s, --state Possible choices: ACCEPTED, PREPARING, SUBMIT, INLRMS, FINISHING, FINISHED, DELETED, CANCELING Filter jobs by state
-o, --owner Filter jobs by owner

script

Display job script submitted to LRMS arcctl job script [-h] jobid

Positional Arguments

jobid Job ID

log

Display job log arcctl job log [-h] [-f] [-s] jobid


Positional Arguments

jobid Job ID

Named Arguments

-f, --follow Follow the job log output Default: False
-s, --service Show ARC CE logs containing the jobID instead of job log Default: False

info

Show job main info arcctl job info [-h] jobid

Positional Arguments

jobid Job ID

stdout

Show job executable stdout arcctl job stdout [-h] [-f] jobid

Positional Arguments

jobid Job ID

Named Arguments

-f, --follow Follow the job log output Default: False

stderr

Show job executable stderr arcctl job stderr [-h] [-f] jobid

Positional Arguments

jobid Job ID


Named Arguments

-f, --follow Follow the job log output Default: False

attr

Get job attribute arcctl job attr [-h] jobid [attr]

Positional Arguments

jobid Job ID
attr Attribute name

kill

Cancel job arcctl job kill [-h] jobid [jobid...]

Positional Arguments

jobid Job ID

killall

Cancel all jobs arcctl job killall [-h] [-s {ACCEPTED,PREPARING,SUBMIT,INLRMS,FINISHING,FINISHED,DELETED,CANCELING}] [-o OWNER]

Named Arguments

-s, --state Possible choices: ACCEPTED, PREPARING, SUBMIT, INLRMS, FINISHING, FINISHED, DELETED, CANCELING Filter jobs by state
-o, --owner Filter jobs by owner

clean

Clean job arcctl job clean [-h] jobid [jobid...]


Positional Arguments

jobid Job ID

cleanall

Clean all jobs arcctl job cleanall [-h] [-s {ACCEPTED,PREPARING,SUBMIT,INLRMS,FINISHING,FINISHED,DELETED,CANCELING}] [-o OWNER]

Named Arguments

-s, --state Possible choices: ACCEPTED, PREPARING, SUBMIT, INLRMS, FINISHING, FINISHED, DELETED, CANCELING Filter jobs by state
-o, --owner Filter jobs by owner

stats

Show jobs statistics arcctl job stats [-h] [-l] [-t|-d]

Named Arguments

-l, --long Detailed output of stats Default: False
-t, --total Show server total stats Default: False
-d, --data-staging Show server datastaging stats Default: False

accounting

Show job accounting data arcctl job accounting [-h] ACTION...

Job Accounting Actions

ACTION Possible choices: info, events, transfers DESCRIPTION


Sub-commands: info

Show job accounting data arcctl job accounting info [-h] [-o {all,description,resources,rtes,authtokens,json}] jobid

Positional Arguments

jobid Job ID

Named Arguments

-o, --output Possible choices: all, description, resources, rtes, authtokens, json Define what kind of job information you want to output (default is “all”) Default: “all”

events

Show job event history arcctl job accounting events [-h] jobid

Positional Arguments

jobid Job ID

transfers

Show job data transfers statistics arcctl job accounting transfers [-h] jobid

Positional Arguments

jobid Job ID

datastaging

Job Datastaging Information for jobs preparing or running. arcctl job datastaging [-h] ACTION...


Job Datastaging Menu

ACTION Possible choices: get-totaltime, get-details DESCRIPTION

Sub-commands: get-totaltime

Show the total time spent in the preparation stage for the selected job arcctl job datastaging get-totaltime [-h] jobid

Positional Arguments

jobid Job ID

get-details

Show details related to the files downloaded for the selected job arcctl job datastaging get-details [-h] jobid

Positional Arguments

jobid Job ID

cache

ARC A-REX Cache control arcctl cache [-h] ACTION...

A-REX Cache Actions

ACTION Possible choices: stats, list, is-cached DESCRIPTION

Sub-commands: stats

Show cache usage statistics arcctl cache stats [-h]

list

List cached URLs arcctl cache list [-h] [-l]

Named Arguments

-l, --long Output paths to cached files Default: False

is-cached

Check if the URL is already in the A-REX cache arcctl cache is-cached [-h] [-q] url

Positional Arguments

url URL to check

Named Arguments

-q, --quiet Do not output path to cached file Default: False

datastaging

DataStaging info arcctl datastaging [-h] ACTION...

DataStaging Control Actions

ACTION Possible choices: summary, job, dtr DESCRIPTION

Sub-commands: summary

Job Datastaging Summary Information for jobs preparing or running. arcctl datastaging summary [-h] ACTION...


Job Datastaging Summary Menu

ACTION Possible choices: jobs, files DESCRIPTION

Sub-commands: jobs

Show overview of the duration of datastaging for jobs active in the chosen (or default=1hr) timewindow arcctl datastaging summary jobs [-h] [-d DAYS] [-hr HOURS] [-m MINUTES] [-s SECONDS]

Named Arguments

-d, --days Modification time in days (default: 0 days) Default: 0 -hr, --hours Modification time in hours (default: 1 hour) Default: 1 -m, --minutes Modification time in minutes (default: 0 minutes) Default: 0 -s, --seconds Modification time in seconds (default: 0 seconds) Default: 0

files

Show the total number of files and the total file size downloaded in the chosen (or default=1hr) time window arcctl datastaging summary files [-h] [-d DAYS] [-hr HOURS] [-m MINUTES] [-s SECONDS]

Named Arguments

-d, --days Modification time in days (default: 0 days) Default: 0 -hr, --hours Modification time in hours (default: 1 hour) Default: 1 -m, --minutes Modification time in minutes (default: 0 minutes) Default: 0 -s, --seconds Modification time in seconds (default: 0 seconds) Default: 0

job

Job Datastaging Information for a preparing or running job. arcctl datastaging job [-h] ACTION...

Job Datastaging Menu

ACTION Possible choices: get-totaltime, get-details DESCRIPTION

Sub-commands: get-totaltime

Show the total time spent in the preparation stage for the selected job arcctl datastaging job get-totaltime [-h] jobid

Positional Arguments

jobid Job ID

get-details

Show details related to the files downloaded for the selected job arcctl datastaging job get-details [-h] jobid

Positional Arguments

jobid Job ID

dtr

Data-delivery transfer (DTR) information arcctl datastaging dtr [-h] ACTION...

DTR info menu

ACTION Possible choices: state DESCRIPTION


Sub-commands: state

Show summary of DTR state info arcctl datastaging dtr state [-h]

Positional Arguments

state Default: False

4.4.2 ARCHERY Manage Tool

The archery-manage tool is used to simplify common operations with ARCHERY, including initial registry bootstrap, integration with topology databases and keeping dynamic information up to date.

usage: archery-manage [-h] [-d {CRITICAL,ERROR,WARNING,INFO,DEBUG}] -s SOURCE [-f FILTER] [-o {_debug,zonefile,json,services,endpoints,arc-CEs}] [--json] [--output-all] [-u] [--domain DOMAIN] [--ddns-master-ip DDNS_MASTER_IP] [--ddns-tsig-keyfile DDNS_TSIG_KEYFILE] [--ddns-tsig-algorithm {HMAC-SHA512,HMAC-SHA256,HMAC-MD5,HMAC-SHA1,HMAC-SHA384,HMAC-SHA224}] [--ttl TTL] [--threads THREADS] [--timeout TIMEOUT]

Named Arguments

-d, --debug Possible choices: CRITICAL, ERROR, WARNING, INFO, DEBUG Default: “INFO”
-s, --source Services topology source (use ‘help’ value to print available sources)
-f, --filter Add endpoints filter (use ‘help’ value to print available filters)
-o, --output Possible choices: _debug, zonefile, json, services, endpoints, arc-CEs Write requested data to stdout
--json Change output format from plaintext to JSON Default: False
--output-all Output all services/endpoints including inactive (filters are still applied) Default: False
-u, --ddns-update Invoke DNS zone incremental DDNS update secured by TSIG key Default: False
--domain Domain name of the ARCHERY endpoint to use (required for DDNS update)
--ddns-master-ip Master DNS IP address (required for DDNS update)
--ddns-tsig-keyfile TSIG keyfile (required for DDNS update)
--ddns-tsig-algorithm Possible choices: HMAC-SHA512, HMAC-SHA256, HMAC-MD5, HMAC-SHA1, HMAC-SHA384, HMAC-SHA224 Cryptographic algorithm for TSIG Default: “HMAC-MD5”
--ttl DNS resource records TTL value to use (default is 3600) Default: 3600
--threads Number of threads to fetch information in parallel (default is 8) Default: 8
--timeout Per-source information fetching timeout (default is 10 seconds) Default: 10

4.4.3 ARC Configuration Parser

Nordugrid ARC configuration parser

usage: arcconfig-parser [-h] [--debug {CRITICAL,ERROR,WARNING,INFO,DEBUG}] [--load] [--save] [-r RUNCONFIG] [-c CONFIG] [-d DEFAULTS] [-b BLOCK] [-o OPTION] [-s] [-e {bash,json}] [-f EXPORT_FILTER]

Named Arguments

--debug Possible choices: CRITICAL, ERROR, WARNING, INFO, DEBUG verbosity level (default is “WARNING”) Default: “WARNING”

Runtime configuration

Work with runtime configuration that includes default values --load load ARC runtime configuration Default: False --save save ARC runtime configuration Default: False -r, --runconfig runtime config file location (default is “/run/arc/arc.runtime.conf”) Default: “/run/arc/arc.runtime.conf”

Configuration files

Initial ARC configuration files -c, --config config file location (default is “/etc/arc.conf”) Default: “/etc/arc.conf” -d, --defaults defaults file location (default is “/usr/share/arc/arc.parser.defaults”) Default: “/usr/share/arc/arc.parser.defaults”


Getting values

Get blocks and configuration option values -b, --block block name (can be specified several times) -o, --option option name -s, --subblocks match subblocks against supplied block name(s) Default: False -e, --export Possible choices: bash, json export configuration to the defined format -f, --export-filter limit bash export to specified options only

CHAPTER FIVE

TECHNICAL DOCUMENTS DESCRIBING ARC COMPONENTS

The following documents give a deep technical description of the various ARC components. If you are looking for architecture internals (how parts of ARC were designed), this is the section to follow.

5.1 ARC Data Services Technical Description

5.1.1 A-REX Data Cache technical description

Structure of the cache directory

Cached files are stored in sub-directories under the data directory in each main cache directory. Filenames are constructed from an SHA-1 hash of the URL of the file and split into subdirectories based on the two initial characters of the hash. In the extremely unlikely event of a collision between two URLs having the same SHA-1 hash, caching will not be used for the second file.

When multiple caches are used, a new cache file goes to a randomly selected cache, where each cache is weighted according to the size of the file system on which it is located. For example: if there are two caches of 1TB and 9TB then on average 10% of input files will go to the first cache and 90% will go to the second cache.

Some associated metadata including the corresponding URL and an expiry time, if available, are stored in a file with the same name as the cache file, with a .meta suffix. For example, with a cache directory /cache the file srm://srm.nordugrid.org/grid/atlas/file1:
• is mapped to /cache/data/37/b19accc950c37876a61d2de6e238d38c9e94c0,
• the file /cache/data/37/b19accc950c37876a61d2de6e238d38c9e94c0.meta contains the original URL and an expiry time if one is available.

At the start of a file download, the cache file is locked, so that it cannot be deleted and so that another download process cannot write the same file simultaneously. This is done by creating a file with the same name as the cache filename but with a .lock suffix. This file contains the process ID of the process and the hostname of the host holding the lock. If this file is present, another process cannot do anything with the cache file and must wait until the cache file is unlocked (i.e. the .lock file no longer exists). The lock is continually updated during the transfer, and is considered stale if 15 minutes have passed since the last update. These stale locks, caused for example by a download process exiting abnormally, will therefore automatically be cleaned up. Also, if the process corresponding to the process ID stored inside the lock is no longer running on the host specified in the lock, it is safe to assume that the lock file can be deleted.

If a file is requested which already exists in the cache (and is not locked), the cache file is not locked, but checks are done at the end of cache processing to ensure the file was not modified during the processing.
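A rough sketch of this mapping in shell, assuming the hash is taken over the URL string as written (ARC performs its own URL canonicalization internally, so the resulting hash may differ for some URLs):

URL="srm://srm.nordugrid.org/grid/atlas/file1"
HASH=$(printf '%s' "$URL" | sha1sum | cut -d' ' -f1)
echo "/cache/data/${HASH:0:2}/${HASH:2}"        # cache file location
echo "/cache/data/${HASH:0:2}/${HASH:2}.meta"   # associated metadata file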

How the cache works

If a job requests an input file which can be cached or is allowed to be cached, it is stored in the selected cache directory, then a hard link is created in a per-job directory, under the joblinks subdirectory of the main cache directory. Then depending on the configuration, either the hard-link is copied or soft-linked to the SD. The former option is advised if the cache is on a file system which will suffer poor performance from a large number of jobs reading files on it, or the file system containing the cache is not accessible from worker nodes. The latter option is the default option.

Files marked as executable in the job will be stored in the cache without executable permissions, but they will be copied to the SD and the appropriate permissions applied to the copy.

The per-job directory is only readable by the local user running the job, and the cache directory is readable only by the A-REX user. This means that the local user cannot access any other users’ cache files. It also means that cache files can be removed without needing to know whether they are in use by a currently running job. However, as deleting a file which has hard links does not free space on the disk, cache files are not deleted until all per-job hard links are deleted.

Warning: If a cache is mounted from an NFS server and the A-REX is run by the root user, the server must have the no_root_squash option set for the A-REX host in the /etc/exports file, otherwise the A-REX will not be able to create the required directories.

Note: When running A-REX under a non-privileged user account, all cache files will be owned and accessible by the same user, and therefore modifiable by running jobs. This is potentially dangerous and so caching should be used with caution in this case.

If the file system containing the cache is full and it is impossible to free any space, the download fails and is retried without using caching.

Before giving access to a file already in the cache, the A-REX contacts the initial file source to check if the user has read permission on the file. In order to prevent repeated checks on source files, this authentication information is cached for a limited time. On passing the check for a cached file, the user’s DN is stored in the .meta file, with an expiry time equivalent to the lifetime remaining for the user’s proxy certificate. This means that the permission check is not performed for this user for this file until this time is up (usually several hours).

File creation and validity times from the original source are also checked to make sure the cached file is fresh enough. If the modification time of the source is later than that of the cached file, the file will be downloaded again. The file will also be downloaded again if the modification date of the source is not available, as it is assumed the cache file is out of date. These checks are not performed if the DN is cached and is still valid.

The A-REX checks the cache periodically if it is configured to do automatic cleaning. If the used space on the file system containing the cache exceeds the high water-mark given in the configuration file it tries to remove the least-recently accessed files to reduce size to the low water-mark.

Cache cleaning

When the [arex/cache/cleaner] block is defined the cache is cleaned automatically periodically (every 5 minutes) by the A-REX to keep the size of each cache within the configured limits. Files are removed from the cache if the total size of the cache is greater than the configured limit. Files which are not locked are removed in order of access time, starting with the earliest, until the size is lower than the configured lower limit. If the lower limit cannot be reached (because too many files are locked, or other files outside the cache are taking up space on the file system), the cleaning will stop before the lower limit is reached.

Since the limits on cache size are given as a percentage of space used on the filesystem on which the cache is located, it is recommended that each cache has its own dedicated file system. If the cache shares space with other data on a file system, the option calculatesize=cachedir should be set in arc.conf so that the cache limits are applied on the size of the cache rather than the file system.

With large caches mounted over NFS and an A-REX heavily loaded with data transfer processes, cache cleaning can become slow, leading to caches filling up beyond their configured limits. For performance reasons it may be advantageous to disable cache cleaning by the A-REX, and run the cache-clean tool (usually /usr/libexec/arc/cache-clean) independently on the machine hosting the file system.
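As an illustration, a minimal cache configuration along these lines might look as follows; the cachedir path is an example and the full set of cleaner options (including the watermarks) should be checked against the ARC Configuration Reference Document:

[arex/cache]
# example cache location; ideally each cache sits on its own dedicated file system
cachedir = /cache

[arex/cache/cleaner]
# apply size limits to the cache directory itself rather than the whole file
# system, as recommended when the cache shares the file system with other data
calculatesize = cachedir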


Caches can be added to and removed from the configuration as required without affecting any cached data, but after changing the configuration file, the A-REX should be restarted. If a cache is to be removed and all data erased, it is recommended that the cache be put in a draining state until all currently running jobs possibly accessing files in this cache have finished. In this state the cache will not be used by any new jobs, but the hard links in the joblinks directory will be cleaned up as each job finishes. Once this directory is empty it is safe to delete the entire cache. Caches may also be marked as read-only, so that data cached there can be used by new jobs, but no new data will be written there. This feature is available from ARC version 6.7. Note that read-only caches are not cleaned by A-REX.

Exposing the Cache

Normally the ARC cache is internal to the CE and is not exposed to the outside. However it may be beneficial to allow reading cache files, if for example the file is lost from Grid storage or as a fallback when Grid storage is down. This can be done via HTTPS through the A-REX web services interface. Specifying the [arex/ws/cache] block opens remote read access to certain cache files for certain credential properties. When configured this allows cached files to be read from the A-REX WS endpoint, for example if the file gsiftp://my.host/file1 is cached at CE a-rex.host the file is accessible (if credentials allow) at:

https://a-rex.host/arex/cache/gsiftp://my.host/file1

Since remote reading can increase the load on A-REX, the number of concurrent requests should be limited. This can be done using the max_data_transfer_requests configuration option.

Indexing the Cache content

The ARC Cache Index (ACIX) provides a way to discover locations of cached files.

5.1.2 A-REX data transfer framework (DTR) technical description

This page describes the data staging framework for ARC, code-named DTR (Data Transfer Reloaded).

Overview

ARC’s Computing Element (A-REX) performs the task of data transfer for jobs before and after the jobs run. The requirements and the design steps for the data staging framework are described in DTR Design and Implementation Details. The framework is called DTR (Data Transfer Reloaded) and uses a three-layer architecture, shown in the figure below:

Fig. 5.1: DTR three-layer architecture

The Generator uses user input of tasks to construct a Data Transfer Request (also DTR) per file that needs to be transferred. These DTRs are sent to the Scheduler for processing. The Scheduler sends DTRs to the Pre-processor for anything that needs to be done up until the physical transfer takes place (e.g. cache check, resolve replicas) and then to Delivery for the transfer itself. Once the transfer has finished the Post-processor handles any post-transfer operations (e.g. register replicas, release requests). The number of slots available for each component is limited, so the Scheduler controls queues and decides when to allocate slots to specific DTRs, based on the prioritisation algorithm implemented. See DTR priority and shares system for more information.

This layered architecture allows any implementation of a particular component to be easily substituted for another, for example a GUI with which users can enter DTRs (Generator) or an external point-to-point file transfer service (Delivery).

Implementation

The middle and lower layers of the architecture (Scheduler, Processor and Delivery) are implemented as a separate library libarcdatastaging (in src/libs/data-staging in the ARC source tree). This library is included in the nordugrid-arc common libraries package. It depends on some other common ARC libraries and the DMC modules (which enable various data access protocols and are included in nordugrid-arc-plugins-* packages) but is independent of other components such as A-REX or ARC clients. A simple Generator is included in this library for testing purposes. A Generator for A-REX is implemented in src/services/a-rex/grid-manager/jobs/DTRGenerator.(h|cpp), which turns job descriptions into data transfer requests.

Configuration

Data staging is configured through the [arex/data-staging] block in arc.conf. Reasonable default values exist for all parameters, but the [arex/data-staging] block can be used to tune the parameters and also to enable multi-host data staging. A selection of parameters is shown below:


Parameter        Explanation                                                            Default Value
maxdelivery      Maximum delivery slots                                                 10
maxprocessor     Maximum processor slots per state                                      10
maxemergency     Maximum emergency slots for delivery and processor                     1
maxprepared      Maximum prepared files (for example pinned files using SRM)            200
sharetype        Transfer share scheme (dn, voms:vo, voms:group or voms:role)           None
definedshare     Defined share and priority                                             _default 50

Multi-host related parameters:
deliveryservice  URL of remote host which can perform data delivery                     None
localdelivery    Whether local delivery should also be done                             no
remotesizelimit  File size limit (in bytes) below which local transfer is always used   0
usehostcert      Whether the host certificate should be used in communication with      no
                 remote delivery services instead of the user's proxy

Descriptions of other data-staging parameters can be found in the [arex/data-staging] block reference. The multi-host parameters are explained in more detail in ARC Data Delivery Service Technical Description. Example:

[arex/data-staging]
maxdelivery=10
maxprocessor=20
maxemergency=2
maxprepared=50
sharetype=voms:role
definedshare=myvo:production 80
deliveryservice=https://spare.host:60003/datadeliveryservice
localdelivery=yes
remotesizelimit=1000000

Client-side priorities

To specify the priority of jobs on the client side, the priority element can be added to an XRSL job description, eg:

("priority"="80")

For a full explanation of how priorities work see DTR priority and shares system.

gm-jobs -s

The command “gm-jobs -s”, which shows transfer shares information, now shows this information at the per-file level rather than per-job. The number in “Preparing” is the number of DTRs in the TRANSFERRING state, i.e. doing physical transfer. Other DTR states count towards the “Pending” files. For example:


Preparing/Pending files    Transfer share
2/86                       atlas:null-download
3/32                       atlas:production-download

As before, per-job logging information is in the controldir/job.id.errors files, but A-REX can also be configured to log all DTR messages to a central log file in addition, through the logfile parameter.

Using DTR in third-party applications

ARC SDK Documentation gives examples on how to integrate DTR in third-party applications.

Supported Protocols

The following access and transfer protocols are supported. Note that third-party transfer is not supported.
• file
• HTTP(s/g)
• GridFTP
• SRM
• Xrootd
• LDAP
• Rucio
• S3
• ACIX
• RFIO/DCAP/LFC (through GFAL2 plugins)

Multi-host Data Staging

To increase overall bandwidth, multiple hosts can be used to perform the physical transfers. See ARC Data Delivery Service Technical Description for details.

Monitoring

In A-REX the state, priority and share of all DTRs are logged to the file controldir/dtr.state periodically (every second). This can then be used by the Gangliarc framework to show data staging information as ganglia metrics.

Advantages

DTR offers many advantages over the previous system, including:

High performance When a transfer finishes in Delivery, there is always another prepared and ready, so the network is always fully used. A file stuck in a pre-processing step does not block others preparing or affect any physical transfers running or queued. Cached files are processed instantly rather than waiting behind those needing to be transferred. Bulk calls are implemented for some operations of indexing catalogs and SRM protocols.

Fast All state is held in memory, which enables extremely fast queue processing. The system knows which files are writing to cache and so does not need to constantly poll the file system for lock files.

Clean When a DTR is cancelled mid-transfer, the destination file is deleted and all resources such as SRM pins and cache locks are cleaned up before returning the DTR to the Generator. On A-REX shutdown all DTRs can be cleanly cancelled in this way.

Fault tolerance The state of the system is frequently dumped to a file, so in the event of a crash or power cut this file can be read to recover the state of ongoing transfers. Transfers stopped mid-way are automatically restarted after cleaning up the half-finished attempt.

Intelligence Error handling has vastly improved, so that temporary errors caused by network glitches, timeouts, busy remote services etc. are retried transparently.

Prioritisation Both the server admins and users have control over which data transfers have which priority.

Monitoring Admins can see at a glance the state of the system, and using a standard framework like Ganglia means admins can monitor ARC in the same way as the rest of their system.

Scalable An arbitrary number of extra hosts can be easily added to the system to scale up the bandwidth available. The system has been tested with up to tens of thousands of concurrent DTRs.

Configurable The system can run with no configuration changes, or many detailed options can be tweaked.

Generic flexible framework The framework is not specific to ARC's Computing Element (A-REX) and can be used by any generic data transfer application.

Open Issues

• Provide a way for the infosys to obtain DTR status information
  – First basic implementation: when DTR changes state write current state to .input or .output file
• Decide whether or not to cancel all DTRs in a job when one fails
  – Current logic: if downloading, cancel all DTRs in job; if uploading, don't cancel any
  – Should be configurable by user - also the EMI execution service interface allows specifying per-file what to do in case of error
• Priorities: more sophisticated algorithms for handling priorities
• Advanced features such as pausing and resuming transfers

Related Documents

DTR Design and Implementation Details

This page documents the requirements and design stages of the new data staging framework which took place around mid-2010.

Issues with previous implementation

1. Queueing happens per job - that makes it impossible to use a potentially more effective processing order.
2. Slow data transfers block those which could be done faster.
3. Big data transfers block small ones.
4. Jobs waiting for already cached (or to be cached) files are blocked by other jobs in the queue.
5. Special features of sophisticated protocols are not taken into account - like SRM's "try later".
6. No priorities aka flexible queues.
7. No support for different credentials for different files.
8. No bandwidth handling.
9. No handling of files with different access latency (eg tape vs disk).
10. No mechanism to choose a preferred replica for the LFC (catalog) inputs, for example: if replicas are in ndgf, swegrid, signet, unige, try with ndgf first.

Task Summary

The initial task was NOT to solve all these issues. The task was to create a framework which could be extended to solve them later, or to find/adopt such a framework.

Requirements

1. Effective usage of bandwidth. Whenever any transfer is paused for any reason (tape stage in, retry later) for an estimated time, another transfer should use the available bandwidth.
2. Transfer negotiation (with protocols such as SRM) should be independent of physical data transfer.
3. Each transfer should be capable of using its own credentials.
4. Transfers should be capable of pausing (temporary cancel) and resuming (if the protocol allows).
5. Automatic/dynamic redistribution of bandwidth is needed to allow short transfers to pass through even while big transfers are taking the whole bandwidth.
6. Transfer from multiple alternative locations.
7. Cache checks should happen independently of data transfer to avoid locks.
8. Jobs where all files are cached should be processed immediately.
9. Better description of file source/destination than just a URL (options are difficult to handle, something nicer is needed).
10. Priorities at different levels: among user groups, inside groups. Any other levels? 3 possible levels: among VOs, users/roles inside a VO, inside a user identity.
11. Ability for users to set the relative priority of their own jobs, both before submission and while the job is in the queue.

Security Requirements

• It must be built into the design that no process has higher privileges than necessary
• Elevated privileges are required for:
  – Access to cache. Cache files are only read/writeable by the root user so they cannot be modified by jobs
  – Access to session and control directories. Access to these directories should be performed under the uid of the job owner. The current method of running as root and chown'ing must not be used.
• Elevated privileges are not required for any other parts of the system such as scheduling

Performance Requirements

• Must scale higher than current highest workloads
• Must be able to handle up to 10000 active jobs (between ACCEPTED and FINISHED)
• Must be able to handle up to 1000 active physical transfers whilst ensuring all available bandwidth is used


• Must be able to handle transfers which have to wait for several hours before the physical file is ready to transfer

Possible solution to URL options problem

There has been a long standing problem with the format and syntax used to express URLs and associated metadata and options. While not directly related to data staging, it will be addressed as part of the data staging work.

Architecture Proposal

• 3 layers:
  1. Higher layer processing data staging requirements of jobs, collecting requested attributes/properties, resolving priorities (flattening them), managing credentials.
  2. Middle layer schedules individual transfers, suspends and resumes them, distributes bandwidth, etc.
  3. Lower level handles individual transfer protocol, communicates with middle layer to acquire, release and pre-allocate resources (mostly bandwidth), caches connections (if possible).
• Any layer can be outsourced to an external service, for example gLite FTS
• Basic file staging stages:
  1. Identify transfer options
  2. Check cache
  3. Evaluate authorization (may require bandwidth)
  4. Resolve location (meta-URL, if needed, may require bandwidth)
  5. If needed repeat all steps from beginning
  6. Do transfer
  7. Post-processing (eg register replicas, release cache locks)

Requirements for component interfaces (based on protocol descriptions and architecture)

DTR Description

DTR stands for Data Transfer Request. This is the structure that contains several fields that fully describe the file transfer to be performed. One DTR is generated by the generator for each file transfer. A detailed description and state diagrams of DTRs can be found in the Detailed description of DTRs.

DTR Generator to DTR Scheduler

• Push DTR from Generator to Scheduler:
  – DTR contains description of a single entity to be transferred - uploaded or downloaded. That includes endpoint, transfer parameters and limits, credentials - possibly multiple, etc.
  – Multiple DTRs may be affiliated together. Possible reasons and uses:
    * Belong to same job
    * Belong to bunch of jobs which user indicated as preferably processed together
    * Belong to same VO and assigned priorities to be applied within group
    * Failure of one DTR in group may cancel processing of other DTRs (not sure, may be implemented in Generator)
  – DTR may have assigned priority levels. Probably related to groups.
• Receive DTR from Scheduler to Generator:
  – Returned DTR indicates outcome of processing, either positive or negative. In the last case it includes a description of encountered problems and the level of severity.
• Cancel DTR in Scheduler
• Modify DTR properties in Scheduler. Possible usage:
  – Manipulate priorities

Fig. 5.2: Functional components and layers.

Fig. 5.3: Example of communication session between DTR Generator and DTR Scheduler.

DTR Scheduler to DTR Preprocessor

• Push DTR from Scheduler to Preprocessor
  – Because DTR preprocessing is supposed to take a short time, it may include a processing timeout
• Receive DTR from Preprocessor to Scheduler
  – Returned DTR indicates outcome of processing
    * Positive
      · DTR comes with information needed for further processing either in Preprocessor or Delivery unit
      · DTR may contain multiple/alternative additional endpoints
      · Probably such DTR may be presented in tree-like diagram
    * Failure - includes description of encountered problems and level of severity
    * Delayed processing
      · Includes retry time and possible margins.
      · Scheduler must ensure this DTR will go back to Preprocessor within specified time margins.
• Cancel DTR in Preprocessor

Fig. 5.4: Example of communication session between DTR Scheduler and DTR preprocessor.

DTR Scheduler to DTR Delivery

• Push DTR from Scheduler to Delivery
  – DTR may have bandwidth assigned
  – DTR may have timeout-related parameters assigned - minimal transfer rate, maximal inactivity timeout, etc.
• Cancel DTR in Delivery
• Suspend DTR in Delivery - should the DTR leave Delivery or should it stay there?
• Receive DTR from Delivery to Scheduler. Returned DTR indicates outcome of processing
  – Positive
  – Partially positive (partial data delivered)
  – Redirection
  – Failure - includes description of encountered problems and level of severity
• Get information about bandwidth currently used by Delivery
• Modify assigned bandwidth
  – May be used to free some bandwidth for urgent transfers

Component Workflows

Generator

The Generator is an integral part of the a-rex process. Internally it performs 4 tasks in the following order:
• Makes DTRs out of existing job descriptions, also assigning priorities and grouping information, and makes them available to the Scheduler.
• Communicates immediate requests like DTR cancel or suspend to the Scheduler.
• Monitors DTR states as reported by the Scheduler (and possibly by other modules) in order to provide feedback to client tools of A-REX asking for job state.
• Receives finished/failed DTRs from the Scheduler and initiates job processing continuation. Note: This part may be merged with the previous one.

Scheduler

“Queues” are queues of DTRs waiting to enter pre/post-processing or delivery. They are kept internal to the scheduler. The scheduler is the only place with complete knowledge of the system and the only place where priorities are controlled and error conditions are handled. When an event is received and involves sending the DTR to a queue, the DTR is put at a certain position in the queue. Depending on the DTR priority, other DTRs in the queue may be moved, paused or stopped to allow higher priority DTRs to proceed quicker or consume more resources. On receiving an event, the relevant queue is examined and action taken (e.g. if a delivery finished, start a new one).

Reactions to new events

For simplicity error conditions are not included in the workflow here but are described separately in the Detailed description of DTRs. They are examined by the scheduler, which will decide the workflow - do any necessary post-processing, decide whether to retry (immediately or after some delay) or report back to the generator that the DTR definitively failed. DTR state transitions are also described in more detail and with diagrams in the Detailed description of DTRs.

Fig. 5.5: Example of communication session between DTR Scheduler and DTR Delivery.

• New DTR from generator
  – if cacheable or meta-protocol:
    * add to pre-processor queue (for cache check, replica resolution etc)
  – else if ready to be delivered (base protocol for src and dest):
    * add to delivery queue
• DTR returned from pre-processor
  – if cached:
    * send to post-processor to be linked
  – else if need more pre-processing:
    * add to pre-processing queue
  – else:
    * add to delivery queue
• DTR returned from delivery
  – if post-processing required (index registration, cache linking, release request):
    * put in post-processor queue
  – else:
    * return to generator
• DTR returned from post-processor:
  – Return to generator
• DTR cancel notification from generator
  – if before pre-processing:
    * return to generator
  – else if in post-processing:
    * wait until finished and then send back to clean up
  – else:
    * cancel immediately and add to post-processing queue for clean up
• DTR modify notification from generator
  – change request immediately, and modify queue if appropriate
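As a purely illustrative sketch (hypothetical names, not the actual libarcdatastaging classes), the routing decisions above can be summarised as:

// Simplified, hypothetical types used only for illustration.
enum class Queue { PreProcessor, Delivery, PostProcessor, Generator };

struct Dtr {
  bool cacheable;        // destination can be taken from / written into the cache
  bool meta_source;      // source is a meta-protocol (index catalog, SRM, ...)
  bool cached;           // pre-processor found the file already in the cache
  bool needs_more_prep;  // e.g. a resolved replica is itself a meta-protocol
  bool needs_postproc;   // registration, cache linking or request release pending
};

// Where does a DTR that just arrived from the Generator go?
Queue on_new_dtr(const Dtr& d) {
  if (d.cacheable || d.meta_source) return Queue::PreProcessor;
  return Queue::Delivery;  // plain source and destination: transfer directly
}

// Where does a DTR returned by the pre-processor go?
Queue on_preprocessed(const Dtr& d) {
  if (d.cached) return Queue::PostProcessor;          // just link from the cache
  if (d.needs_more_prep) return Queue::PreProcessor;  // resolve/prepare further
  return Queue::Delivery;
}

// Where does a DTR returned by Delivery go?
Queue on_delivered(const Dtr& d) {
  return d.needs_postproc ? Queue::PostProcessor : Queue::Generator;
}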

Processor

The processor is divided into two logical parts: the pre-processor and the post-processor. Either part is invoked by the scheduler as a process/thread and is given the DTR to process. Therefore, the pre- or post-processor can be a straightforward function, performing the following steps:

The pre-processor:
• check endpoint for its presence in cache
  – if successful, mark DTR as BYPASS_TRANSFER, return the DTR to the scheduler
  – if file is not in cache, construct cache endpoint for DTR destination and return to scheduler
• resolve the replicas of the file, if needed
  – return the DTR with a failure if no locations have been found
  – return the list of replicas found to scheduler
  – Note: The pre-processor doesn't care if the resolved locations represent meta-protocols themselves; it's the scheduler's job to determine that and possibly send this DTR for pre-processing once again.
• query an endpoint
  – supply information on size, checksum, creation date, access latency as available according to protocol
• If an asynchronous request needs to be performed, for example SRM prepareToGet
  – Start request, mark DTR as STAGING_PREPARING_WAIT, return to the scheduler. An estimated wait time may be set by the pre-processor from information supplied by the remote service.
• If a polling request arrives from the scheduler
  – Check state of asynchronous request and report back to scheduler success, wait more, or error.
• If a cancellation request arrives from the scheduler
  – interrupt the operation, mark the DTR as PREPROCESSING_CANCELLED, return to the scheduler
• If the pre-processor hits the timeout while performing these tasks
  – interrupt the operation, mark the DTR as PREPROCESSING_NOT_FINISHED, return to the scheduler.

Possible features of the pre-processor in the future:
• for each of the resolved locations:
  – check options one by one (we still have to define transfer options, below is an example), mark in the list to which extent it satisfies the option
    * can the location provide a required bandwidth
    * other options, specified for the user
    * keep processing all the options even if some of them are not satisfied
  – the scheduler may later review the options and start the processing again
  – request the file size (if possible) and compute estimated transfer time, mark it in the list for this location
• if there are locations in the list that satisfied all the checks
  – return the DTR as TRANSFER_READY to the scheduler with the list of these locations and their estimated transfer times
• if there are no locations in the list that satisfied all the checks
  – return the DTR with the least severe encountered failure (from most severe to least severe: Location is dead - Location is not authorized - Location doesn't satisfy the options), so the scheduler can either drop the DTR (in case of dead/non-authorized storages) or review transfer options and try again (in case of unsatisfied options)

The post-processor:
• release stage requests;
• register a replica in an index service;
• release cache locks;
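A minimal sketch of one such pre-processing pass is given below (hypothetical, heavily simplified types and logic; not the actual A-REX code):

#include <string>

// Hypothetical DTR view as seen by the pre-processor.
struct Dtr {
  std::string source, destination;
  bool in_cache = false;          // result of the cache check
  bool replicas_resolved = false; // catalog lookup already done
  bool replica_queried = false;   // size/checksum/latency already queried
  bool staging_started = false;   // asynchronous request (e.g. SRM) submitted
};

enum class PreOutcome { BYPASS_TRANSFER, RESOLVED, REPLICA_QUERIED,
                        STAGING_PREPARING_WAIT, STAGED_PREPARED };

// Perform the next outstanding short step and hand the DTR straight back
// to the scheduler with the outcome.
PreOutcome preprocess_step(Dtr& dtr) {
  if (dtr.in_cache)                      // cache hit: no transfer needed at all
    return PreOutcome::BYPASS_TRANSFER;
  if (!dtr.replicas_resolved) {          // e.g. resolve replicas in a catalog
    dtr.replicas_resolved = true;
    return PreOutcome::RESOLVED;
  }
  if (!dtr.replica_queried) {            // query size, checksum, access latency
    dtr.replica_queried = true;
    return PreOutcome::REPLICA_QUERIED;
  }
  if (!dtr.staging_started) {            // e.g. an asynchronous SRM prepareToGet
    dtr.staging_started = true;
    return PreOutcome::STAGING_PREPARING_WAIT;  // scheduler polls again later
  }
  return PreOutcome::STAGED_PREPARED;    // request ready, DTR can go to Delivery
}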

Delivery

Delivery is the component where the transfer orders are handled. The Delivery process listens to the Scheduler to receive DTRs that are ready for transfer, as well as other events such as decreasing the bandwidth or suspending and cancelling ongoing transfers. The Delivery process reports the status of the DTRs and events to the Scheduler periodically.

• Transfer request delivery
  – Pick up a DTR from the delivery queue.
  – Check the source and destination, set the bandwidth, timeout, etc.
  – Start transferring by placing the received data in a buffer.
  – Periodically report the status of the DTR, such as transferred bytes and used bandwidth, to the Scheduler.
  – When the data transfer finishes, do a checksum
    * if the checksum is correct return a SUCCESS status to the Scheduler
    * else retry the transfer again
• High priority transfer request event
  – Change the DTR to use max bandwidth
• Job suspension
  – Suspend a transfer request
  – Keep the transferred data information in the delivery process
  – Release the used bandwidth
  – Report the status of the suspended DTR to the Scheduler
• Transfer cancellation
  – Stop transferring
  – Release any resources in use
  – Clean up the buffer
• Status report to Scheduler
  – Periodically collect the information of the DTRs in progress.
  – Calculate used bandwidth, transferred data bytes, status
  – Report the status to the Scheduler

Protocol Interfaces

Current Interface

The following interface is defined for each protocol through the DataPoint class:
• StartReading - Start reading data from the source
• StopReading - Stop reading data from the source
• StartWriting - Start writing data to destination
• StopWriting - Stop writing data to destination
• Resolve - Find physical replicas from an indexing service
• Check - Verify that the current credentials have access
• ListFiles - Find metadata
• Remove - Delete
• (Pre)Register and (Pre)Unregister - add and delete in indexing service
These do not fulfil all the requirements of the DTR interfaces described above.

New Interface

The main limitation of the current interface is that it does not efficiently handle protocols such as SRM which involve asynchronous preparation steps. In the new framework, having to wait for a file to be prepared should not block other activities. Therefore the proposal is to split protocols, as now, into meta and direct protocols, but to introduce a third class of stageable protocols and appropriate extra methods to handle them. A stageable protocol could also be a meta protocol or a direct protocol. Extra methods are also needed to handle pausing and cancellation of transfers.

For Meta Protocols (eg LFC, RLS)

• Resolve - Resolve replicas in an indexing service
• (Pre)Register and (Pre)Unregister - add and delete in indexing service


For Stageable Protocols (eg SRM, Chelonia)

• PrepareReading - Prepare the storage service for reading. This may involve preparation of Transport URLs (TURLs) which should then be used for physical data transfer, or operations like reading data from tape to disk. If the protocol's implementation is asynchronous then this method may return a status that tells the caller to wait and poll later. The caller should call this method again after some period of time (the remote service may provide an estimate of the preparation time).
• PrepareWriting - Prepare the storage service for writing. Works similarly to PrepareReading.
• FinishReading - Release or abort requests made during PrepareReading, usually called after physical transfer has completed.
• FinishWriting - Release or abort requests made during PrepareWriting, usually called after physical transfer has completed.
• Suspend - Pause current preparation of transfer
• Resume - Resume suspended preparation of transfer (Note: depending on the protocol, suspend and resume may be implemented as stop current request and start new request)
• Cancel - Stop preparation of transfer

For Direct Protocols (eg FTP, HTTP)

• StartReading - Start reading data from the source
• StopReading - Stop reading data from the source
• StartWriting - Start writing data to destination
• StopWriting - Stop writing data to destination
• Modify - Change parameters of transfer such as bandwidth limits
• Suspend - Pause current transfer
• Resume - Resume suspended transfer
• Cancel - Stop current transfer

For All Protocols

• Check - Verify that the current credentials have access
• List - Find metadata
• Remove - Delete
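A sketch of how this split could look as abstract C++ interfaces is shown below (class and method names are illustrative only; the actual implementation is built around the DataPoint class hierarchy of the ARC data library):

// Operations common to all protocols.
class ProtocolCommon {
public:
  virtual bool Check() = 0;   // verify that the current credentials have access
  virtual bool List() = 0;    // find metadata
  virtual bool Remove() = 0;  // delete
  virtual ~ProtocolCommon() {}
};

// Meta protocols (e.g. LFC, RLS): resolve and (un)register replicas.
class MetaProtocol : public virtual ProtocolCommon {
public:
  virtual bool Resolve() = 0;
  virtual bool Register() = 0;
  virtual bool Unregister() = 0;
};

// Stageable protocols (e.g. SRM): asynchronous preparation before transfer.
// Prepare* may return a "wait and poll later" status for asynchronous services.
class StageableProtocol : public virtual ProtocolCommon {
public:
  virtual bool PrepareReading() = 0;
  virtual bool PrepareWriting() = 0;
  virtual bool FinishReading() = 0;   // release or abort the read request
  virtual bool FinishWriting() = 0;   // release or abort the write request
  virtual bool Suspend() = 0;
  virtual bool Resume() = 0;
  virtual bool Cancel() = 0;
};

// Direct protocols (e.g. FTP, HTTP): the physical transfer itself.
class DirectProtocol : public virtual ProtocolCommon {
public:
  virtual bool StartReading() = 0;
  virtual bool StopReading() = 0;
  virtual bool StartWriting() = 0;
  virtual bool StopWriting() = 0;
  virtual bool Modify() = 0;          // change e.g. bandwidth limits
  virtual bool Suspend() = 0;
  virtual bool Resume() = 0;
  virtual bool Cancel() = 0;
};

Virtual inheritance is used here so that, as the text above allows, a single implementation could be both a stageable and a meta or direct protocol.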

Existing ARC Code

We should aim to re-use as much as possible of the existing ARC code. The arc1 code base will be used for all developments. Code for job description handling and interfacing with the LRMS can remain unchanged; all that concerns us is the code handling the PREPARING and FINISHING states. Job state handling is done in states.cpp. This code can remain largely unchanged, but in the ActJobPreparing/Finishing methods the job enters the new system at the upper layer. The lower level code for data transfer for each protocol is handled in the DMCs. These can largely remain unchanged except for the extra methods in the meta-protocols above. Caching code can remain unchanged.


PREPARING/FINISHING State Semantics

The semantics of these states may need to be changed - at the moment PREPARING/FINISHING means that the job is transferring data, and PENDING in those states means a job cannot enter the state due to some limit. In our new system there is less of a distinction between jobs waiting and jobs transferring data; also, some files within the job may be transferring while others are waiting. Once the job enters the upper layer of the new system it will be in a staging state even though it may have to wait a long time before any data is transferred.

Processes and Threads

In the current architecture a persistent A-REX thread is spawned by HED. A new data staging process (downloader/uploader) is forked for each job to stage its input or output files. Within each process a thread is created per file. The status of each thread is reported back to the loader process through callbacks and the process exit code tells the A-REX the status of all transfers. There are some problems with this approach:
• The A-REX knows nothing about what happens inside the loader until the process exits and cannot communicate with the loader
• It is not possible to change the uid of each thread individually so they all run under the uid of the downloader/uploader. The uid of the downloader/uploader depends on configuration. If possible the uid of the mapped user is used. But if there is a cache shared among users that uid will be root or the uid of A-REX.

To solve this last problem, processes in the delivery layer writing to or from the session dir must run under the locally mapped uid. Writing to the cache must always be done as root and so cache and non-cache downloads must be done in different processes. This leads to the conclusion that the delivery layer must be separate processes from the scheduling layer. Then to solve the first problem there needs to be a method of communication between the delivery and scheduling layers, which must be two way, so that the scheduler can modify on-going transfers and the transfer can report its status back to the scheduler.

Notes on running processes under the mapping uid:
• It is only the open() operation which needs to be run under a special uid. If we follow the convention that all open() operations in an executable are called through a single wrapper function and put a global lock around it, then we can have filesystem access under a selected uid inside a multi-threaded application. Unfortunately for NFS the process needs to maintain the uid throughout the whole transfer.
• This may introduce a performance issue if the open() operation takes too long, as in the case of a remote file system.
• open() may be called by an external library which we have no control over
• according to 'man open', "UID mapping is performed by the server upon read and write requests" and hence the suggested approach will fail on NFS.

Operations carried out by the pre- and post-processor require access to the filesystem in the following steps:
• Cache preparation - locking the cache file and checking its existence. This must be done as root.
• Cache finalisation - copying or linking the cached file to the session directory. The hard link to the per-job dir must be done as root but the soft linking or copying to the session dir must be done as the mapped user.
  – The last is rather "must" than "may" to make it work on NFS with root_squash=on. The situation may exist where the cache is on NFS and then the process copying the file must switch uid while accessing the file and its copy.
• Access to proxy - the proxy is needed when contacting secure remote services. The proxy in the control dir is owned by the mapped user. Therefore we have to either:
  – Make a copy of the proxy owned by root - this does not fit our security requirements above
    * Note1: This is how it is done in the current implementation. There is no security hole here because this is the same proxy which was obtained by A-REX and written to the control directory under the root uid. So this proxy already belonged to root and making it belong to root again makes little difference.
    * Note2: Because as a rule the proxy is stored on a local file system it is always accessible by root. Copying of the proxy was needed in the current implementation due to a limitation of the Globus libraries - those were able to accept a proxy only from a file and checked that the proxy belonged to the current uid. Because the ARC data library allows assigning credentials to communication directly (at least it should, but it may not be implemented for some protocols, needs checking) and because the proxy is readable by root, such a trick is not needed anymore.
  – Use processes rather than threads for the pre- and post-processor, changing the uid of the process to the mapped user
    * This approach is most probably not needed for this purpose but may be very much desirable for fighting NFS.
  – As suggested above, use a global open() function to open with the mapped id then switch back to root - this suffers the same problems mentioned above
    * There is no need to do that for the proxy because root can always open files belonging to other users

The generator will be a thin layer between the main A-REX process which moves jobs from state to state and the scheduler which processes DTRs - when a job enters the PREPARING or FINISHING state the generator will create DTRs and send them to the scheduler. For the scheduler to be efficient it should run as a separate process (or thread). Each pre-processing step such as resolving replicas in a catalog or a cache permission check should be fast (< few seconds) but should be run asynchronously in order not to block the scheduler if something bad happens.

In summary we have the following processes corresponding to the three layers:
• Persistent main A-REX process (thread of HED) and persistent generator thread
• Persistent scheduler process (or thread) with temporary threads for pre-processing
• Temporary delivery processes created per file transfer

Using processes vs threads should add no more CPU load, however more memory is required. For increasing performance it should be possible to reuse processes in a way similar to how threads are reused.

Implementation Choices

The current system uses files in the control directory for all communication. We may need something more sophisticated for this more complex system, either internal to the new system or also to replace the current files-in-control-dir method. Possibilities:
• Files
• Sockets
• Database
• RPC
• Libevent - http://en.wikipedia.org/wiki/Libevent
• Pipes
• Message passing
  – DBus - http://www.freedesktop.org/wiki/Software/dbus
  – ActiveMQ - http://activemq.apache.org/
• ...

Suggestion 1

• Use a persistent object - file, database record - for storing DTRs. Each object includes:
  – Description of DTR
  – Owner of DTR
  – Last requested action
  – Current state inside owner (or maybe inside every module).
• Each DTR has its own ID - may be just a file name or record key
• Keep simple communication channels between modules - like 2 pipes for example
  – Whenever a DTR is changed its ID is sent over the communication channel to the module which is supposed to react
  – As a backup, modules can scan modification timestamps of objects periodically

Pros:
• Simple
• Persistency

Cons:
• Monitoring of ever changing state - like bandwidth usage - would require constant modification of files/records.
  – This problem could be solved by providing information which needs no persistency through the communication channel. But that would make communication more complex.
  – Another possible solution is to mmap (should work well with files) objects and flush them to the persistent store only if the persistent part of the information is modified.

Suggestion 2 (used in current implementation)

• DTR objects can be rather complicated, so keep them only in memory
• In case of process failure all DTRs are reconstructed from control files into initial NEW state
• DTRs are passed as objects between threads
  – Separate threads run for A-REX (including the Generator), Scheduler, and Delivery components
  – Delivery thread starts new processes for each transfer
    * These communicate simple status messages (“heartbeats”) through pipes to main delivery thread
• Communication between threads through callbacks

Pros:
• Simplifies development - no need for complex persistency layer or serialisation code
• Fast communication through thread callbacks

Cons:
• Having lots of threads increases risks of deadlocks and race conditions
• No persistency
• No way to communicate from scheduler to transfer processes
  – If the only communication required is to pause/resume/cancel a transfer it can be done through signals eg SIGSTOP/SIGCONT/SIGTERM

Implementation idea for Suggestion 2

• Generator, Scheduler, Delivery processes are singletons and run as persistent threads. The Processor is a singleton.
  – Note: C++-wise it would probably be more correct to use static methods instead of singletons. With such an approach there would be no need to handle singletons outside of the class itself, hence simplifying the interface. Although inside the class there may be a singleton.
• A-REX initiates a request to the data staging system by calling Generator::receiveJob()
  – Generator creates DTRs and after they finish it modifies the job state directly
  – A-REX can query job status from Scheduler via Generator
• Generator communicates with the scheduler by calling the Scheduler::receive_dtr method
  – There is also a Scheduler::cancel_dtrs method which is called by the Generator to cancel DTRs
• Scheduler has an instance of the DTRList class, which is the storage for all the DTRs in the system.
• Scheduler communicates (passes a DTR) to any other process by calling the DTR::push() method
• Pre-, post-processor and delivery communicate with the scheduler by simply changing the status of the DTR and changing the owner to "SCHEDULER". The Scheduler will pick up DTRs with changed statuses automatically during its next iteration.
• When a DTR goes from the scheduler to the pre- or post-processor, it calls the processDTR() method of the singleton processor within push(). The processor then spawns a thread and returns.
• When a DTR goes from the scheduler to delivery, within push() it calls the processDTR() method of the singleton delivery. The delivery then spawns a process and returns.
  – The delivery singleton receives messages through pipes from the transfer processes and reports information in the DTR object.
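This interaction can be sketched as follows (a simplified, hypothetical rendering; the real classes in src/libs/data-staging carry much more state, locking and error handling):

// Illustrative sketch of the singleton/push pattern described above.
enum class Owner { GENERATOR, SCHEDULER, PRE_PROCESSOR, POST_PROCESSOR, DELIVERY };

class DTR;  // forward declaration

class Processor {                 // pre- and post-processing
public:
  static Processor& instance() { static Processor p; return p; }
  void processDTR(DTR&) { /* spawn a short-lived thread, then return */ }
};

class Delivery {                  // physical transfers
public:
  static Delivery& instance() { static Delivery d; return d; }
  void processDTR(DTR&) { /* fork a transfer process, then return */ }
};

class DTR {
public:
  Owner owner = Owner::GENERATOR;
  // push() hands the DTR to its next owner; the pre/post-processor and
  // delivery give it back simply by setting the owner back to SCHEDULER,
  // which the scheduler notices on its next iteration.
  void push(Owner next) {
    owner = next;
    if (next == Owner::PRE_PROCESSOR || next == Owner::POST_PROCESSOR)
      Processor::instance().processDTR(*this);
    else if (next == Owner::DELIVERY)
      Delivery::instance().processDTR(*this);
  }
};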

Detailed description of DTRs

DTR stands for Data Transfer Request. This is the structure that contains several fields that fully describe the file transfer to be performed. One DTR is generated by the generator for each file transfer.

Fields of the DTR

More or less required:
• DTR ID
• source endpoint
• destination endpoint
  – for source and destination, a list of metadata such as file size, checksum, creation date etc
  – for source and destination (if applicable) a list of replicas
  – for source and destination (if applicable) current replica
  – for source and destination (if applicable) TURL or delivery-level URL used for transfer
  – for source and destination (if applicable) request ID (in the case of asynchronous requests to remote storage services)
• credentials
• cache information
  – if the file is cacheable, the filename in cache
  – cache directories configuration
  – caching state (already in cache, cache currently locked etc)
• local user information (uid/gid)
• Job ID this transfer belongs to
• priority of the transfer - a number set by the generator which flattens priorities
• transfer share this DTR belongs to
• sub-share the DTR belongs to - may be set by the Generator
• tries left
• flags to handle properties and strategies when dealing with index servers
  – flag to say whether DTR is replicating inside the same logical filename
  – flag to say whether DTR should force registration to an existing logical filename, if the source is different
• mapping info - mapping information of local files to which remote files may be mapped to in the configuration (copyurl/linkurl)
• status of the DTR
• error status
  – type of error
  – location of error
  – text description of error detail
• number of bytes transferred/offset
• timing properties
  – timeout - time which DTR is allowed to remain in current state
  – creation time
  – last modification time
  – process time - wait until this time to do further processing
• cancel (set to true if request is to be cancelled)
• bulk operation flags to combine several DTRs in a bulk request
• delivery endpoint, whether Delivery is to be carried out by a local process or remote service
• current owner - who is in charge for this DTR right now
• logger object, so each DTR can have its own log
• lock, since DTRs can be modified by several processes, for avoiding writing collisions

Possible:
• affiliation (if we use the affiliation of multiple DTRs, see right below).
• history of states

Multiple DTRs may be affiliated together. Possible reasons and uses:
• Belong to same job
• Belong to bunch of jobs which user indicated as preferably processed together
• Belong to same VO and assigned priorities to be applied within group
• Failure of one DTR in group may cancel processing of other DTRs (should be implemented in Generator)
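A heavily trimmed sketch of a structure carrying such fields might look like the following (illustrative selection of fields and types only; the real DTR class has many more members and accessors):

#include <ctime>
#include <mutex>
#include <string>
#include <sys/types.h>

struct DtrSketch {
  std::string id;                          // DTR ID
  std::string source, destination;         // endpoints (URLs)
  std::string job_id;                      // job this transfer belongs to
  std::string transfer_share;              // share (and sub-share) it belongs to
  int priority = 25;                       // flattened priority set by the Generator
  int tries_left = 0;
  bool cacheable = false;
  std::string cache_file;                  // filename in cache, if cacheable
  uid_t uid = 0;                           // local user the job is mapped to
  gid_t gid = 0;
  unsigned long long bytes_transferred = 0;
  std::string status;                      // current DTR status code
  std::string error;                       // error type/location/description
  std::time_t created = 0;                 // creation time
  std::time_t modified = 0;                // last modification time
  std::time_t process_time = 0;            // wait until this time before acting
  bool cancel_requested = false;
  std::mutex lock;                         // DTRs are touched by several threads
};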

State transitions of DTR

All possible states of a DTR, with arrows indicating the normal flow of DTRs between states. Each state is explained in detail below. Error conditions are not included here but are shown in another diagram further down.


Fig. 5.6: DTR State Diagram


Status codes

The following table describes all non-error status codes, and also the action taken in the event of a cancellation request being received while in that state. In general if all of the data transfer has been completed before receiving a cancellation request, the destination file is not deleted. The main reason for this is to preserve cache files, as the user may wish to run the same job soon after cancelling it.

Table 5.1: Statuses of the DTR (status code - description - action on cancel)

Statuses set by the generator:
• NEW - The DTR has just been built by the generator. On cancel: return to generator.
• CANCEL - A request has been made to cancel the DTR. On cancel: n/a.

Statuses set by the scheduler:
• CHECK_CACHE - The DTR destination is cacheable and the cache should be checked for the file's existence. On cancel: return to generator.
• RESOLVE - The DTR source is a meta-protocol and should be resolved. On cancel: set to PROCESS_CACHE to remove any cache locks.
• QUERY_REPLICA - The DTR source should be queried to check existence, file size, checksum etc. On cancel: set to REGISTER_REPLICA to remove pre-registered destination.
• PRE_CLEAN - The destination in the DTR should be deleted before writing. On cancel: set to REGISTER_REPLICA to remove pre-registered destination.
• STAGE_PREPARE_SOURCE - The DTR source is a meta-protocol which must be prepared or staged. On cancel: set to REGISTER_REPLICA to remove pre-registered destination.
• STAGE_PREPARE_DESTINATION - The DTR destination is a meta-protocol which must be prepared or staged. On cancel: set to REGISTER_REPLICA to remove pre-registered destination.
• TRANSFER_WAIT - The DTR is ready to be sent to delivery but must wait due to transfer limits or priority settings. On cancel: set to RELEASE_REQUEST.
• TRANSFER - The DTR should be transferred immediately. On cancel: set to RELEASE_REQUEST.
• RELEASE_REQUEST - The DTR transfer has finished and any requests made on remote storage should be released. On cancel: abort request and delete destination, set to REGISTER_REPLICA.
• REGISTER_REPLICA - The DTR destination is a meta-protocol and the new replica should be registered. On cancel: delete destination and set to PROCESS_CACHE.
• PROCESS_CACHE - The DTR destination is cacheable and the cached file should be unlocked and linked/copied to the session dir. On cancel: delete cache file.
• DONE - The DTR completed successfully. On cancel: do nothing.
• CANCELLED - The DTR has been cancelled successfully. On cancel: n/a.
• ERROR - An error occurred with the DTR. On cancel: do nothing.

Statuses set by the pre-processor:
• CHECKING_CACHE - The pre-processor is checking the cache. On cancel: wait until complete, then set to CACHE_CHECKED; the scheduler will then set to PROCESS_CACHE.
• CACHE_WAIT - The cache file is locked and the scheduler should wait before trying to obtain the lock. On cancel: scheduler will return to generator.
• CACHE_CHECKED - The cache check is complete. On cancel: scheduler will set to PROCESS_CACHE.
• RESOLVING - The pre-processor is resolving replicas. On cancel: wait until complete, then set to RESOLVED; the scheduler will then set to REGISTER_REPLICA.
• RESOLVED - The replica resolution is complete. On cancel: scheduler will set to REGISTER_REPLICA.
• QUERYING_REPLICA - The pre-processor is querying a replica. On cancel: wait until complete, then set to REPLICA_QUERIED; the scheduler will then set to REGISTER_REPLICA.
• REPLICA_QUERIED - The replica querying is complete. On cancel: scheduler will set to REGISTER_REPLICA.
• PRE_CLEANING - The pre-processor is deleting the destination file. On cancel: wait until complete, then set to PRE_CLEANED; the scheduler will set to REGISTER_REPLICA.
• PRE_CLEANED - The destination file has been deleted. On cancel: the scheduler will set to REGISTER_REPLICA.
• STAGING_PREPARING - The pre-processor is making a staging or preparing request. On cancel: wait until complete, then the scheduler will set to RELEASE_REQUEST so it can be aborted.
• STAGING_PREPARING_WAIT - The staging or preparing request is not ready and the scheduler should wait before polling the status of the request. On cancel: scheduler will set to RELEASE_REQUEST so it can be aborted.
• STAGED_PREPARED - The staging or preparing request is complete. On cancel: scheduler will set to RELEASE_REQUEST so it can be aborted.

Statuses set by the delivery:
• TRANSFERRING - The transfer of the DTR is on-going. On cancel: stop transfer and set to RELEASE_REQUEST; Delivery will delete the incomplete file and the request will be aborted.
• TRANSFERRED - The transfer completed successfully. On cancel: scheduler will abort the request.

Statuses set by the post-processor:
• RELEASING_REQUEST - The post-processor is releasing a stage or prepare request. On cancel: wait until finished, then set to REGISTER_REPLICA to unregister the file.
• REQUEST_RELEASED - The release of the stage or prepare request is complete. On cancel: set to REGISTER_REPLICA to unregister the file.
• REGISTERING_REPLICA - The post-processor is registering a replica in an index service. On cancel: continue as normal.
• REPLICA_REGISTERED - Replica registration is complete. On cancel: continue as normal.
• PROCESSING_CACHE - The post-processor is releasing locks and copying/linking the cached file to the session dir. On cancel: continue as normal.
• CACHE_PROCESSED - Cache processing is complete. On cancel: continue as normal.


Error Conditions of DTRs

The following diagram shows possible error conditions and actions taken. For simplicity and because all error handling logic takes place within the scheduler, the pre- and post-processor and the delivery layers are not shown. Errors are categorised into the following types:

Error type - Explanation - Retryable? - Action:

• INTERNAL_LOGIC_ERROR - Internal error in data staging logic. Not retryable. Action: stop processing and report back to generator.
• INTERNAL_PROCESS_ERROR - Internal error like losing contact with an external process. Retryable. Action: clean if necessary and retry.
• SELF_REPLICATION_ERROR - Attempt to copy a file to itself. Not retryable. Action: return to generator.
• CACHE_ERROR - A problem occurred in cache handling. Retryable. Action: retry without caching.
• TEMPORARY_REMOTE_ERROR - Error such as connection timeout on remote service. Retryable. Action: retry with an increasing back-off.
• PERMANENT_REMOTE_ERROR - Error such as file not existing, permission denied etc on remote service. Not retryable. Action: follow cancellation steps and return failed DTR to generator.
• LOCAL_FILE_ERROR - Error with a local file. Not retryable. Action: follow cancellation steps and return to generator.
• TRANSFER_SPEED_ERROR - Transfer rate was below specified limits. Retryable. Action: retry transfer. If all retries fail, report back to generator - it will make the decision on whether to cancel other related DTRs. (Future work: make decision on whether other transfers caused the slow transfer and whether cancelling others would help or should be done.)
• STAGING_TIMEOUT_ERROR - The staging process took too long. Not retryable. Action: try a different replica - if none available, cancel and report back to generator.

Methods of DTRs

DTR::push (DTR, receiver) – pass the DTR from one process to another, e.g. DTR::push (dtr, preprocessor)

Implementation

Within the Data Staging framework there is a global list of DTRs. Pointers to the DTRs are passed around between components, which can modify them directly and push them between each other.

DTR priority and shares system

Here we describe the priority and shares system for the new data staging framework. During the design stage there were several ideas taken from other research in the field and the first implementation of the transfer shares model in ARC.


Fig. 5.7: DTR Error State Diagram


Ideas behind priorities and fair-share in data staging

The initial idea was to give every DTR that comes into the system a fixed priority, then sort the queue according to priorities and launch the first N DTRs with the highest priorities. This scheme also allows easy incorporation of pre-emption: if a job with higher priority appears, it simply pushes other DTRs out of the front of the queue, and during the next scheduler loop we can start these DTRs and suspend the pushed ones.

However, this idea can potentially lead to the situation that demanded the implementation of transfer shares in ARC. If a user or VO with the highest priority submits a bunch of jobs at once, all the others will be blocked, because DTRs from this bunch will occupy the front of the queue for a long time. The idea of transfer shares comes in handy here. The available transfer slots should be shared among different VOs/Users, so nobody is blocked. VOs/Users with higher priority get more transfer slots than the others. However, strict limits on the number of slots per share are not flexible enough - if the transfer pattern changes then strict limits could cause problems, squeezing lots of users/jobs into one share with a few slots and blocking others. The User or VO must also be able to decide the relative priority of jobs within its own share.

Current Implementation

The ideas above led to the creation of two configurable properties: user-defined job priority and server-defined share priority. Users may define a priority for their jobs in the job description ("priority" attribute in xrsl), and this is a measure of the priority given to this job within the share it gets assigned to when submitted. On the server side, it is possible to define a share type, and priorities of certain shares. The share priority is used to determine the number of transfer slots to assign to the share, taking into account which shares are currently active (active meaning the share has at least one DTR in the system).

When the Scheduler receives a new DTR, it is placed into a transfer share, which is defined by a User DN, VO, Group inside a VO or role inside a VO, as it was in previous versions of ARC. Currently it is only possible to use one sharing criterion in the configuration, i.e. it is not possible to share simultaneously by User and by VO.

Priority is defined as a number between 1 and 100 inclusive - a higher number is a higher priority. In the A-REX configuration it is possible to specify a base priority for certain shares. If the DTR doesn't belong to any of these specified shares, it is placed in a "_default" share with the default base priority (50). The scheduler sets the priority of the DTR to the base priority of the share multiplied by the user priority from the job description (default 50) divided by 100; therefore the default priority of a DTR is 25. In this system the priority set in the job description effectively defines a percentage of the base priority. Thus service administrators can set maximum priority limits for certain shares, but users or VOs have full control of their jobs' priority within the share.

While revising the Delivery and Processor queues, the scheduler separates DTRs according to the shares they belong to. Inside every share DTRs are sorted according to their priorities. Then the scheduler determines the number of transfer slots that every active share can grab. The number is determined dynamically depending on the priorities of active shares. Each share receives the number of slots which corresponds to the weight of its priority in the summed priority of all active shares. After the number of slots for each share is determined, the scheduler just launches the N[i] highest priority DTRs in each share, where N[i] is the number of transfer slots for the i-th share. The reason for weighting the DTR priority by the share priority is for occasions when the Scheduler considers the entire queue of DTRs, for example when allowing the highest priority DTRs to pass certain limits.

Example: there are two active shares, one with base priority 60, the other 40. The summed priority is 100 (60 + 40). The first share has a weight of 60%, the second 40%. So the first will grab 60% of configured transfer slots, and the second 40%. If the system is configured with 5 Delivery slots, then the first share will take 3 slots and the second 2 slots. The 3 highest priority DTRs from the first share and the 2 highest priority DTRs from the second share will be assigned to those slots.
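The arithmetic can be summarised with a small illustrative helper (not the actual scheduler code; the handling of rounding and leftover slots may differ in practice):

#include <map>
#include <string>

// Effective DTR priority: share base priority scaled by the job's "priority"
// attribute (both in the range 1-100; job priority defaults to 50).
int dtr_priority(int share_base_priority, int job_priority = 50) {
  return share_base_priority * job_priority / 100;   // default: 50*50/100 = 25
}

// Transfer slots per active share, proportional to the share priorities.
std::map<std::string, int> slots_per_share(
    const std::map<std::string, int>& active_share_priority, int total_slots) {
  int sum = 0;
  for (const auto& s : active_share_priority) sum += s.second;
  std::map<std::string, int> slots;
  for (const auto& s : active_share_priority)
    slots[s.first] = total_slots * s.second / sum;    // e.g. 5*60/100=3, 5*40/100=2
  return slots;
}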

Emergency Shares

To avoid the situation where a fixed limit of slots is used up by slow transfers and a new high priority transfer has to wait for a slot, we have "emergency" transfer slots. If there are transfers in the queue from a particular share, but all slots are filled with transfers from other shares, one emergency slot can be assigned to this share to allow transfers to start immediately. The share may use an emergency slot until any other transfer finishes, at which point the emergency slot becomes a regular slot and a new transfer does not start from the queue.

Sub-shares

The Generator can assign DTRs to “sub-shares” to give a higher granularity than the standard criteria when assigning transfer slots. Sub-shares are treated as separate shares. In A-REX, different sub-shares are assigned to downloads and uploads, and in this case emergency transfer slots prove useful for preventing jobs from being unable to finish because all transfer slots are taken by downloads. If this happens then emergency slots can be used for uploads.

Potential Problems

• Within a share, high priority jobs block low priority jobs. Thus if there is a constant stream of high priority jobs in a share, then some low priority jobs in the same share may never run. Possible solutions:
  – Increasing the priority as the time spent in the queue increases (returning to previous priority after leaving the queue). This is currently implemented as increasing the priority by 1 every 5 minutes after the DTR's timeout has passed.
  – Changing simple highest-priority-first to a random algorithm where higher priorities are weighted higher
  – Making a higher granularity of shares by splitting each priority or priority range into its own share - this is probably too complicated

A-REX Configuration

The configuration varies depending on the ARC version. In the examples below VO roles are used to assign shares: the atlas slow-prod role is assigned a low priority share and the atlas validation role is assigned a higher priority share.

[arex/data-staging]
sharepolicy=voms:role
sharepriority=atlas:slow-prod 20
sharepriority=atlas:validation 80

If both shares are active and there are 10 slots available, then DTRs in the slow-prod share will get 2 slots and those in the validation share get 8 slots, and so the jobs in the validation share will have a higher throughput (assuming similar numbers of files and file sizes in each type of job).

Example

A user wants their job to be high (but not top) priority and specifies ("priority" = "80") in the job description. The user has a VOMS proxy with no role defined and submits the job to a site with the above configuration. The job is assigned to the default share and its DTRs have priority 40 (50 x 80 / 100). The user then creates a VOMS proxy with the ATLAS validation role and submits another job with the same priority to the same site. This time the job goes to the configured atlas:validation share and the DTRs have priority 64 (80 x 80 / 100). Note that the priority of a DTR only affects its position within a share and does not influence the distribution of slots between shares.

Dynamically modifying priorities

From version 6.2 ARC supports dynamically adjusting the priorities of active DTRs, for example to push certain transfers to the front of the queue.


This can be done by creating a file in the same location as the dtr.state file (by default the control dir) with the name dtr.state.prio. This contains lines of DTR ID and priority, eg:

c3e12bfa-ba9a-4a45-be73-0b3b2a206e1c 90
ff2d5329-5440-49aa-83cb-72daeb5e3e59 95

This file is read by the DTR system before it sorts the queue by priority and it changes any DTRs found in the file to the given priority. Then it renames the file to dtr.state.prio.read.

5.1.3 ARC Data Delivery Service Technical Description

This page describes how to set up DTR over multiple hosts.

Introduction

In DTR the transfers can be split over multiple hosts to scale up the data transfer throughput. Remote hosts only perform simple point-to-point transfers between physical endpoints and all the logic is kept in the main A-REX host. This has many advantages:
• Keeping all the high-level logic in one place allows intelligent load balancing.
• The remote hosts do not do submission to the batch system and so all LRMS configuration only needs to be done in one place.
• A problem with a remote host does not mean a lost job - the transfer can simply be retried on another host.
• The setup and configuration of a remote host is very simple.

The datadelivery service has been developed within HED and can be deployed on one or many remote hosts. Once a DTR has passed all the pre-processing steps and is ready for transfer, the Scheduler can send it to a remote datadelivery service to execute the transfer. The Scheduler will then poll the request until it has completed. The Scheduler randomly spreads the total number of transfers (up to the configured limit and depending on the configured access rights of each service) between all the staging hosts. The architecture is shown in the figure below. The datadelivery service on the remote host acts as a very simple Scheduler which sends all transfer requests it receives straight to Delivery.

Installation

No extra components are necessary on the A-REX host. For the remote staging hosts, the package nordugrid-arc-datadelivery-service can be found in the usual EPEL or NorduGrid repositories, or is built automatically when building the source tree. This package should be installed on each remote host. It depends on some core ARC packages and the usual external packages that an ARC installation requires. Note that it is not necessary to install A-REX on the staging hosts.
IMPORTANT: If support for transfer protocols that are not included in the nordugrid-arc-plugins-base package (e.g. GridFTP or Xrootd) is required, then the corresponding plugins packages must also be installed.
CA certificates are required on each remote host to authenticate connections with storage services. Depending on the local setup, each host may also require a host certificate (more info below).
To start or stop the service (as superuser):

systemctl start|stop arc-datadelivery-service

The log file for the service can be found at /var/log/arc/datadelivery-service.log (configurable - see below). If logrotate is running, this log will be rotated every day.


Fig. 5.8: Multi-host datastaging


Configuration

Remote Hosts

The datadelivery service uses the [datadelivery-service] section of the arc.conf configuration file. By default the datadelivery service runs with TLS enabled and a host certificate is required for each host. The path to the host credentials may be specified by the usual x509 options. Host certificates are not a strict requirement and it is possible to run without TLS using the secure=no option. This means that A-REX can no longer verify the authenticity of the remote hosts and so the decision to run with or without host certificates should be based on local site policy. The following configuration options are supported:

x509_host_key: Path to host key (default: /etc/grid-security/hostkey.pem)
x509_host_cert: Path to host cert (default: /etc/grid-security/hostcert.pem)
x509_cert_dir: Path to CA certificates (default: /etc/grid-security/certificates)
hostname: Hostname of service host (default: localhost)
port: Port on which service runs (default: 443)
pidfile: pid file (default: /var/run/arched-datadelivery-service.pid)
logfile: Log file (default: /var/log/arc/datadelivery-service.log)
loglevel: Logging level, 0 (FATAL) to 5 (DEBUG) (default: 2 (WARNING))
user: User under which service runs; should only be changed in special cases (default: root)
secure: Set to "no" if the service should run without a host certificate (default: yes)
allowed_ip: IP address authorized to access service, can be specified multiple times (no default, must be specified)
allowed_dn: DN authorized to access service, can be specified multiple times (no default)
transfer_dir: Path the service is allowed to read/write to, can be specified multiple times (no default, must be specified)

At least one allowed_ip and at least one transfer_dir are the only mandatory parameters, but it can be useful to change interface, port and loglevel from the default.
Since the service can copy files to and from the service host, it is dangerous to allow open access to any clients. Usually allowed_ip is set to the IP address of the A-REX host since this is the only host which should have access to the service. The service can be further locked down by specifying authorized DNs so that only certain users are allowed to have their jobs' files staged by the service. Note that DN filtering is not possible if secure=no is specified.
It is also dangerous to allow requests which can copy to or from anywhere on the host filesystem, so filesystem access is restricted through the transfer_dir parameter, which specifies the path(s) that requests are allowed to use. The service is able to read and write to any path matching transfer_dir. transfer_dir(s) should be specified which match every cache and session dir. In some situations it may be desirable to set for example one cache per remote host where the cache is local to the host. A-REX checks on start-up which dirs are accessible by which remote hosts and uses that info to direct the DTRs to the right hosts.
Configuration example:

[datadelivery-service]
loglevel=3
hostname=delivery.host.1
port=60002
allowed_ip=1.2.3.4
transfer_dir=/var/arc/session
transfer_dir=/var/arc/cache

If the site has multiple caches like /var/arc/cache-01, /var/arc/cache-02 etc, then this configuration will allow the delivery service to write to all of them.
It is vital that the cache and session file systems are mounted on the same path on the A-REX and each remote host, as the system assumes that a transfer to the cache or session directory can be done using the same path on all hosts. It is also vital that user accounts are in sync across all hosts, so that a user represented by a uid on the A-REX host maps to the same account on all the remote hosts. No mapping of DN to local user is done on the remote hosts, and so no mapping infrastructure like gridmap files is required. The local user id of the user owning the transfer is passed in the request to the service and the transfer process is executed under that uid (except when writing to cache, when the root account is used).
The service is started and stopped by the arc-datadelivery-service systemd unit:

systemctl start arc-datadelivery-service

A-REX Host

The configuration for setting up A-REX to use remote datadelivery services uses options in the [arex/data-staging] section of the regular arc.conf.

deliveryservice: URL of a remote host which can perform data delivery. Hostname and port must match those specified in the datadelivery service configuration. (default: none)
localdelivery: Whether local delivery should also be done (default: no)
remotesizelimit: File size limit (in bytes) below which local transfer is always used (default: 0)
usehostcert: Whether the host certificate should be used in communication with remote datadelivery services instead of the user's proxy (default: no)

Example:

[arex/data-staging]
deliveryservice=https://delivery.host.1:60002/datadeliveryservice
deliveryservice=https://delivery.host.2:60002/datadeliveryservice
localdelivery=yes
remotesizelimit=1000000

Multiple remote datadelivery services can be specified, and the Scheduler will randomly divide DTRs between those services where the transfer is allowed (according to the transfer_dir configuration on each service). The presence of a deliveryservice option turns off the regular "local" delivery on the A-REX host so only the remote services will be used. If it is desired to also do transfers on the A-REX host, localdelivery = yes must be used. If no remote services are specified then local delivery is always enabled. If the datadelivery service is running with secure=no then https should be replaced by http in the deliveryservice URLs.
Normally the credentials of the user who submitted the job are used for communication between A-REX and remote datadelivery services, however for extra security it is possible to use the A-REX host certificate for this instead, by specifying usehostcert = yes. In this case the host cert is only used for establishing the secure connection with the remote service; it is still the user's credentials which are delegated and used for the transfer. The host cert must also be usable as a client certificate, in other words it must have the X509 extension "X509v3 Extended Key Usage: TLS Web Client Authentication". This option has no effect if the datadelivery service is running with secure=no.


Communication with remote services involves some degree of overhead such as the SSL handshake, delegating credentials etc, and when transferring small files this overhead can become a significant fraction of the transfer time. Therefore it is possible to specify a file size limit (in bytes) with remotesizelimit. Any files smaller than this limit will use local transfer, even if local transfer is disabled through localdelivery = no.

Deployment Scenarios

Several ways of deploying multi-host data staging are possible, using any number of remote hosts. Two examples are shown here.

Shared Storage

Fig. 5.9: DDS deployment: shared storage

The cache and session directories are on a storage system such as Lustre or GPFS, which is mounted on all hosts. All hosts can access the cache and session directory and so DTRs will be split randomly between them. No transfer is done on the A-REX host.

Local Caches

Each remote host has its own local disk as a cache and these caches are mounted on the A-REX host. DTRs will be sent to the host corresponding to the cache chosen for the DTR so that all cache transfers are to local disks.


Fig. 5.10: DDS deployment: local disk for cache on DDS nodes

The session directory is not available on the remote hosts so all uploads and non-cache downloads will run on the A-REX host.

Security

The remote datadelivery service may require a host certificate (see above) and must allow incoming connections from the A-REX host, but no other incoming connections are required. It must have outbound connectivity to the world to perform the transfers. As explained above, access is normally restricted to the A-REX host and transfers are restricted to the cache and session directories.
For each transfer A-REX delegates the credentials of the user who submitted the job to the remote service, which creates a temporary file containing the credentials. This file is deleted when the transfer finishes. Creating a file should become unnecessary once pure in-memory credentials are fully supported by ARC and all transfer protocols.
The transfer process itself is executed under the uid of the session directory owner, unless the transfer is to cache, in which case it is executed by root. It is therefore important that user accounts are synchronised across all hosts. The file systems with the cache and session directories must be mounted on the same paths with the same user access rights on all hosts.

Proxies

HED services do not support legacy proxies such as those generated by default by voms-proxy-init. In order to use remote datadelivery services, the jobs must be submitted to A-REX using RFC proxies, which can be generated by arcproxy or giving the option -rfc to voms-proxy-init. If a legacy proxy is used, local transfer will be used even if it is disabled in the configuration.


Monitoring Remote Hosts

When the first DTR is received by the Scheduler, it pings all the configured remote datadelivery services to check that they are running and to get the list of allowed directories from each one. If a service is unreachable it will not be used. If all services are unreachable then local delivery will be used even if it is turned off in the configuration. The allowed directories information is used to direct DTRs to services where the transfer is allowed. This procedure is repeated every 5 minutes (for the first DTR received after the 5 minute limit), and only the successful services are used until the next check (unless none are successful, in which case the check is done for every DTR until one succeeds). This means that configuration changes in the remote hosts will be picked up automatically after some time. However any changes in A-REX’s configuration, such as adding a new remote host, require an A-REX restart to be effective. If remote datadelivery services are enabled, the number of DTRs assigned to each one can be seen on separate plots in Gangliarc.

5.1.4 The ARC Cache Index (ACIX)

The ARC Cache Index (ACIX) is a catalog of locations of cached files. It consists of two components: the ACIX Scanner, which runs on the computing resource, and the ACIX Index, which indexes the cache locations retrieved from the ACIX Scanners. These components can be found respectively in the packages nordugrid-arc-acix-scanner and nordugrid-arc-acix-index. They both depend on a third package, nordugrid-arc-acix-core.

ACIX Scanner

The ACIX Scanner periodically scans the A-REX cache and constructs a Bloom filter of the cache content. This filter is a way of representing the cache content in an extremely compressed format, which allows fast query of any element of the filter and efficient upload of the content to an index server. This type of compression however has the possibility of giving false positives, i.e. a certain file may appear to be present in the cache according to the filter when it is not. The ACIX Scanner runs in an HTTPS server and the filter is accessible at the endpoint https://hostname:5443/data/cache. It scans the caches specified in the A-REX arc.conf. It does not require any configuration, but some options can be changed, and it is important to make sure the ACIX Scanner port (default 5443) is open in the firewall.

ACIX Index

The ACIX Index server runs independently of the ACIX Scanner and A-REX, but can be deployed on the same host as both of them. It is configured with a list of ACIX Scanners and periodically pulls the cache filter from each one. It runs within an HTTPS server through which users can query the cached locations of files. Configuration uses the regular arc.conf file in the [acix-index] block. Here ACIX Scanners are specified by the cachescanner option. For example:

[acix-index]
cachescanner=https://my.host:5443/data/cache
cachescanner=https://another.host:5443/data/cache

The ACIX Index server can be queried at the endpoint https://hostname:6443/data/index, where the list of URLs to check is given as comma-separated values of the "url" option, e.g.:

https://hostname:6443/data/index?url=http://www.nordugrid.org:80/data/echo.sh,http://my.host/data1

A JSON-formatted response is returned, consisting of a dictionary mapping each URL to a list of locations. If remote access to cache is configured as described above then the location will be the endpoint at which to access the cached file, for example https://a-rex.host/a-rex/cache. If not then simply the hostname will be returned.
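For illustration, the index can also be queried programmatically. The sketch below uses only the Python standard library; the index hostname and the checked URLs are placeholders, it assumes the index server certificate is trusted by the system CA store, and the parsing simply follows the response format described above.

# Query an ACIX Index for cached locations of a set of URLs (sketch).
import json
import urllib.parse
import urllib.request

index = "https://cacheindex.example.org:6443/data/index"   # placeholder host
urls = [
    "http://www.nordugrid.org:80/data/echo.sh",
    "http://my.host/data1",
]

# The "url" option takes the URLs as comma-separated values.
query = index + "?url=" + urllib.parse.quote(",".join(urls), safe=":/,")
with urllib.request.urlopen(query) as response:
    locations = json.load(response)   # dict: URL -> list of cache locations

for url, caches in locations.items():
    print(url, "->", caches if caches else "not cached")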


Using ACIX with A-REX Data Staging

ACIX can be used as a fallback mechanism for A-REX downloads of input files required by jobs by specifying use_remote_acix in the [arex/data-staging] block of arc.conf, e.g.:

[arex/data-staging]
use_remote_acix=https://cacheindex.ndgf.org:6443/data/index

If a download from the primary source fails, A-REX can try any cached locations provided by ACIX, provided the cache is exposed at those locations. In some cases it may even be preferred to download from a close cache rather than from Grid storage; this can be configured using the preferredpattern configuration option, which tells A-REX in which order to try to download replicas of a file.

Using ACIX for ARC client brokering

ACIX can also be used for data-based brokering for ARC jobs. An ACIX-based broker plugin written in Python comes packaged with the ARC client tools (in $ARC_LOCATION/share/arc/examples/PythonBroker/ACIXBroker.py) and can be used for example with:

[user ~]$ arcsub -b PythonBroker:ACIXBroker.ACIXBroker:https://cacheindex.ndgf.org:6443/data/index

Target sites for job submission are ranked in order of how many input files required by the job are cached there. See the comments inside this Python file for more information.

Deployment use-case

Fig. 5.11: ACIX deployment scenario, with one global ACIX Index and a local ACIX Index for CE 1a and CE 1b.

Fig. 5.11 shows an example ACIX setup. Each CE runs an ACIX Scanner and there is a central ACIX Index server which pulls content from all CEs. In addition there is one site with two CEs, CE 1a and CE 1b.


In order to do data-based brokering on just those two sites (and ease the load on the global ACIX Index server), a local ACIX Index is running which pulls content from only these two sites. In such a setup it may be desirable to prefer downloading data from the cache on CE 1a to CE 1b and vice versa, so those CEs could be configured with the local ACIX Index server as the use_remote_acix and each other's hostname first in preferredpattern.

5.1.5 CandyPond technical description

This page describes technical details of CandyPond, which stands for “Cache and deliver your pilot on-demand data”.

Description and Purpose

The ARC caching system automatically saves to local disk job input files for use with future jobs. The cache is completely internal to the computing element and cannot be accessed or manipulated from the outside. CandyPond exposes various operations of the cache to the outside and can be useful in a pilot job model where input data for jobs is not known until the job is running on the worker node. When the pilot picks its payload it can contact CandyPond to gain access to a file that is already cached, or if it is not cached ask CandyPond to download the file to cache.

Installation and Configuration

CandyPond is an integral part of A-REX and is available as part of the nordugrid-arc-arex package. It is enabled in A-REX by adding the block [arex/ws/candypond] to arc.conf. The [arex], [arex/ws] and [arex/data-staging] blocks are also required.

Runtime Environment Configuration

A runtime environment ENV/CANDYPOND exists to provide a convenient Python module, arccandypond, to the job running on the worker node. It can be used either as a command line interface or as a Python API. Note that the ENV/PROXY runtime environment is also needed in order to have access to the proxy on the worker node. ENV/CANDYPOND will automatically detect the correct URL of the CandyPond service, but if it is desired to use a different URL then it can be set with:

arcctl rte params-set ENV/CANDYPOND CANDYPOND_URL

Command Line Interface

arccandypond get can be used in place of whatever usual command the job would use to download input data. This command asks CandyPond to download the URL to cache if not already present, and to link it to the file specified in the job's working directory. arccandypond check can be used to check if the given URL already exists in the cache. It will exit with 0 if the file is present, 1 if not, or 2 if an error occurred.
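For illustration, a Python job payload could drive the command line interface and rely on the documented exit codes. This is only a sketch; the input URL and target file name are placeholders, and it assumes the ENV/CANDYPOND runtime environment has put arccandypond on the PATH.

# Sketch of a job payload that fetches an input file through CandyPond.
import subprocess
import sys

url = "http://www.nordugrid.org:80/data/run.sh"   # placeholder input URL
target = "run.sh"                                 # name in the working directory

# Documented exit codes of "arccandypond check":
# 0 = file is in the cache, 1 = not in the cache, 2 = error.
rc = subprocess.call(["arccandypond", "check", url])
if rc in (0, 1):
    # "get" downloads the URL to cache if needed and links it into the
    # job's working directory under the given name.
    subprocess.check_call(["arccandypond", "get", url, target])
else:
    sys.exit("arccandypond check reported an error")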

Python API

The job can import the module and use the CacheLink and CacheCheck methods to perform the equivalent of get and check commands.


Example Use Case

In this example a job is submitted which uses arccandypond to download input data to cache and have it available to the job. The xrsl file defines the required runtime environments. Note that no input files are specified.

$ cat candypond.xrsl
&
("executable" = "candypond.sh")
("runtimeenvironment" = "ENV/CANDYPOND")
("runtimeenvironment" = "ENV/PROXY")
("jobname" = "candypond_test" )
("walltime" = "3600" )
("cputime" = "3600" )
("stderr" = "stderr")
("stdout" = "stdout")
("gmlog" = "gmlog")
("outputfiles" = ("stdout" "") ("stderr" "") ("gmlog" "") )

The job script uses candypond to download the input file to cache and link to the job’s working directory:

$ cat candypond.sh
#!/bin/sh
arccandypond get http://www.nordugrid.org:80/data/run.sh run.sh
ls -lrt
echo
cat run.sh

Submit the job:

$ arcsub candypond.xrsl
Job submitted with jobid: https://...

Check the output:

$ arccat https://...
{'http://www.nordugrid.org:80/data/run.sh': ('0', 'Success')}
total 28
-rwx------ 1 dcameron dcameron   257 Apr 10 20:23 candypond.sh
drwx------ 4 dcameron dcameron  4096 Apr 10 20:23 arc
-rw------- 1 dcameron dcameron 10662 Apr 10 20:23 user.proxy
-rw------- 1 dcameron dcameron     0 Apr 10 20:23 stderr
lrwxrwxrwx 1 dcameron dcameron    89 Apr 10 20:23 run.sh -> /opt/var/arc/cache/joblinks/eOGODmV3WYunPSAtDmVmuSEmABFKDmABFKDmJSFKDmGBFKDm5lgA5m/run.sh
-rw------- 1 dcameron dcameron    62 Apr 10 20:23 stdout

#!/bin/sh
GCC=`which g++ 2>/dev/null`
echo $GCC
if [ -z $GCC ]; then
  echo "Could not find the g++-compiler!"
  exit 0
fi
make
chmod 755 prime
./prime $1


Note the symbolic link to the cache.

Issues and Notes

• Calls to arccandypond get may block for a long time if the file needs to be downloaded to cache and A-REX is already busy with data staging, or if the file is very large. A timeout option will be added in the future.
• CandyPond (like the A-REX web service interface it is part of) does not accept legacy proxies. This type of proxy is created by default with older versions of grid/voms-proxy-init, but an RFC-compliant proxy can be generated using the -rfc option.
• CandyPond links files to the session dir. If a scratch directory is used for executing the job, the cache files are moved there from the session directory. This requires that the scratch dir is accessible from the CandyPond host, so it cannot be used in situations where the scratch directory can only be accessed by the underlying LRMS.

5.2 ARC Next Generation Accounting Technical Details

New in version 6.4.

Warning: Information in this chapter is relevant only for 6.4+ ARC releases. If you are looking for information about the technical details of the legacy accounting subsystem in the 6.0-6.3 ARC releases, please read Legacy JURA Accounting Technical Details. Make sure you are reading the documentation that matches your ARC CE release version.

General accounting configuration and operation flows are described in Accounting Subsystem. This section contains more technical details about the implementation of each component of the accounting subsystem.

5.2.1 Job accounting information processing workflow

Collecting the accounting information

The A-REX Accounting subsystem is part of the core A-REX functionality starting from the 6.4 release. Its main functionality is to write data to the local SQLite accounting database on every job state change. The data sources of the accounting data are the per-job files in the control directory:
• The .local file contains general information associated with the job. All IDs, ownership and authtokenattributes are taken from this file. The data in .local are written and updated by the A-REX JobControl modules.
• The .statistics file is a dedicated file written by the DTR data transfer framework that contains data transfer measurements.
• The .diag file is written by the LRMS job script during the job execution on the worker node. It contains, but is not limited to, resource usage and worker node environment data.
The local SQLite accounting database contains all the A-REX Accounting Record (AAR) data for every ARC CE job. The initial record about the job is created based on the first ACCEPTED job event. The ID, ownership and submission time are recorded during this step and the accounting job status is marked as in-progress. Any subsequent job event triggers event data recording in the database, allowing data staging time, LRMS enqueueing time, etc. to be tracked.


Fig. 5.12: Technical details of ARC CE accounting workflow: information collection, AAR creation, querying and publishing

When the FINISHED job event occurs (execution is completed) the A-REX Accounting subsystem updates all AAR metrics in the database, storing resource usage, endtime, etc. Such a state is indicated by the status={completed|failed|aborted}.

Using the local accounting database

Using the accounting data for statistics lookup and/or publishing to external services is accomplished via the arc.control Python modules.
The AccountingDBSQLite module is responsible for handling all low-level database operations and hides the SQL queries behind the API needed by the other workflows. In particular, the accounting subsystem of the ARC Control Tool provides a command line interface to the typical queries that can get you the accounting data in a flexible manner.
Records publishing is carried out by the AccountingPublishing Python module that includes:
• classes for generating usage records in OGF.98 UR, EMI CAR 1.2, APEL Summaries and APEL Sync formats
• classes that handle POST-ing the records to an SGAS server
• classes that trigger the APEL SSM usage for publishing data to APEL
• general wrapping classes to handle regular publishing and republishing of the data
Both the arcctl accounting republish tool and the jura-ng tool (run regularly by A-REX) use the same AccountingPublishing Python module.
The regular publishing process stores the endtime of the last published record in the dedicated Publishing database. The next round of regular publishing queries the stored time and retrieves the records since then.


5.2.2 Accounting data publishing details

Reporting to SGAS

SGAS has a simple custom web service interface loosely based on WS-ResourceProperties. The AccountingPublishing Python module uses the insertion method of this interface to report URs directly, using the Python httplib library with SSL context wrapping. To increase communication efficiency the AccountingPublishing SGASSender class sends URs in batches. SGAS accepts a batch of URs in a single request. The batch is an XML element called UsageRecords, containing elements representing URs. The maximal number of URs in a batch can be set with the urbatchsize configuration parameter of the SGAS target.

Reporting to APEL

APEL uses the SSM framework for communication. The AccountingPublishing APELSender class implements integration with the SSM Python libraries developed by APEL. ARC ships a minimal set of SSM libraries along with the A-REX binary packages to allow SSM usage. If SSM binary packages from APEL are available for your OS (e.g. EL6), you can install these packages and they will automatically be used instead of those shipped with ARC. Communicating messages with SSM relies on the directory queue handled by the dirq.QueueSimple Python module. The APELSender class has a set of enqueue_* methods that put the generated records into the SSM directory queue in /accounting/ssm. The send method then triggers the SSM workflow, which takes the enqueued messages and sends them to the APEL broker. Reporting to APEL also works with sending records in batches. The default urbatchsize value is set to 1000 according to APEL recommendations.

Republishing

Republishing simply triggers the same AccountingPublishing classes for the defined timeframe that comes from the command line. All records are regenerated from accounting database data and sent to the target.

5.2.3 Security

The accounting directory /accounting is by default accessible only by the user running A-REX (root in most cases). All usage records are submitted using the X.509 credentials specified by the x509_ set of configuration options in arc.conf. No proxies are used for communication with accounting services. The only access restriction made by an SGAS service is matching the Distinguished Name of the client (in this context the ARC CE) with a set of trusted DNs. When access is granted, policies are then applied by SGAS, allowing either publishing and/or querying rights. Clients with publishing rights can insert any UR, regardless of content. By default, querying rights only allow retrieving URs pertaining to jobs submitted by the querying entity. Publishing records to APEL requires that the glite-APEL endpoint is defined for the grid site in the GOCDB. The ARC CE certificate DN should be added to the glite-APEL endpoint.


5.2.4 Third-party accounting queries

The ARC Control Tool accounting stats interface is powerful enough to get custom information from the accounting database, as shown in the examples. However, if you want to get a specific report or integrate the ARC accounting database with third-party software, you can of course use SQLite directly. The SQLite database file location is /accounting/accounting.db. It is worth being aware of the ARC Accounting Database Schema when developing third-party queries.
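For illustration, a third-party report could be produced with the sqlite3 module from the Python standard library. The sketch below is not based on the actual schema: the database path assumes a default control directory location, and the table and column names are illustrative placeholders that must be replaced according to the ARC Accounting Database Schema.

# Sketch of a read-only query against the accounting database. The path
# assumes a default control directory; table and column names are
# illustrative only and must be taken from the ARC Accounting Database Schema.
import sqlite3
import time

db = sqlite3.connect("/var/spool/arc/jobstatus/accounting/accounting.db")
db.row_factory = sqlite3.Row

# Hypothetical query: jobs that reached a terminal state in the last day,
# assuming EndTime is stored as a Unix timestamp.
cutoff = int(time.time()) - 86400
rows = db.execute(
    "SELECT JobID, Status, EndTime FROM AAR WHERE EndTime > ?", (cutoff,)
)
for row in rows:
    print(row["JobID"], row["Status"], row["EndTime"])
db.close()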

5.2.5 Definition of the A-REX Accounting Record including attribute mappings to SGAS and APEL

ARC CE measures and collects a lot of accounting information, including but not limited to the data required by the common aggregated accounting services SGAS and APEL. All accounting information stored about a job is defined by what we call the A-REX Accounting Record (AAR). AARs have a representation inside the local accounting database according to the schema, and representations inside the A-REX and Python modules. Local stats are generated based on the stored AAR information and provide a way for on-site CE operations analyses. The following tables include a flat list of the properties (NOT the database rendering) included in the AAR:

Table 5.2: Attributes used in current implementation

A-REX Accounting Record (AAR) | SGAS OGF-UR | APEL CAR | Content description
jobid | JobIdentity.GlobalJobId; RecordIdentity is composed of jobid and hostname taken from the endpointurl | JobIdentity.GlobalJobId; RecordIdentity is composed of jobid and hostname taken from the endpointurl | The globally unique jobid assigned by A-REX
localid | JobIdentity.LocalJobId | JobIdentity.LocalJobId | LRMS job ID
jobname | JobName | JobName | User specified job name
endpointurl | MachineName | MachineName, SubmitHost, Site | The A-REX job submission endpoint URL used for this job
endpointtype | not used | not used | The A-REX job submission endpoint type used for this job
lrms | not used | Infrastructure (used as a part of it) | The LRMS behind A-REX
queue | Queue | Queue | The name of the LRMS queue of the job
nodename | Host | Host | WN name(s) as given by LRMS, separated by ":"
clienthost | SubmitHost (port removed) | not used | Client connection socket from the client to A-REX
usersn | UserIdentity.GlobalUserName | UserIdentity.GlobalUserName | The global user identity, at the moment the SN from the certificate
localuser | UserIdentity.LocalUserId | UserIdentity.LocalUserId | The mapped local userid
authtokenattributes | UserIdentity.VO and child structures | UserIdentity.Group and UserIdentity.GroupAttribute | The attributes of the auth token (VOMS FQANs in the current implementation)
projectname | ProjectName | UserIdentity.GroupAttribute | User-defined name of the project the job belongs to
status | Status | Status | The terminal state of an A-REX job: aborted, failed, completed
exitcode | not used | ExitStatus | The exit code of the payload in the LRMS
submissiontime | StartTime | StartTime | The timestamp of job acceptance at A-REX
endtime | EndTime | EndTime | The timestamp when the job reached the terminal state in A-REX
nodecount | NodeCount | NodeCount | Number of allocated worker nodes
inputfile | FileTransfers | not used | Details of downloaded input file: url, size, transfer start, transfer end, downloaded from cache
outputfile | FileTransfers | not used | Details of uploaded output file: url, size, transfer start, transfer end
usedmemory | Memory | Memory | Maximum virtual memory used by the job
usedmaxresident | Memory | Memory | Maximum resident memory used by the job
usedaverageresident | Memory | Memory | To be dropped from the AAR schema
usedwalltime | WallDuration | WallDuration | The measured clock time elapsed during the execution of the job in the LRMS, no matter on how many cores, processors or nodes the user job ran
usedcputime | CpuDuration | CpuDuration (with type all) | The total CPU time consumed by the job; if the job ran on many cores/processors/nodes, all separate consumptions shall be aggregated in this value
usedusercputime | CpuDuration (with type user, but should not be there) | CpuDuration (with type user) | The user part of the usedcputime
usedkernelcputime | CpuDuration (with type system, but should not be there) | CpuDuration (with type system) | The kernel part of the usedcputime
cores | Processors | Processors | The number of cores allocated to the job
usedscratchspace | StorageUsageBlock | | The used size of the scratch dir at the termination of the job in the LRMS
systemsoftware | | | The type and version of the system software (i.e. opsys, glibc, compiler, or the entire container wrapping the system software)
wninstance | ServiceLevel | | Coarse-grain characterization tag for the WorkerNode, e.g. BigMemory or t2.micro (aka Amazon instance type)
RTEs | | | List of used RTEs, including default ones as well
data-stagein-volume | Network class has something similar | | The total volume of downloaded job input data in GBs
data-stagein-time | | | The time spent by the DTR system to download input data for the job
data-stageout-volume | Network class has something similar | | The total volume of uploaded job output data in GBs
data-stageout-time | | | The time spent by the DTR system to upload output data of the job
lrms-submission-time | | | The timestamp when the job was handed over to the LRMS
lrmstarttime | | | The timestamp when the payload starts in the LRMS
lrmsendtime | | | The timestamp when the payload completed in the LRMS
benchmark | Benchmark | ServiceLevel | The type and the corresponding benchmark value of the assigned WN

Table 5.3: NOT USED SGAS or APEL attributes

SGAS OGF-UR | APEL CAR
ProcessID |
Charge | Charge
Swap | Swap

5.3 ARC CE REST interface specification


Note: The current interface version is 1.0

Warning: This is a new REST Interface specification whose implementation is under development! An earlier version of an ARC CE REST Interface implementation, called the "REST technical preview" and released with the ARC 6.x series, does not correspond to this specification.

5.3.1 The REST API endpoint

The various functionalities of the service are accessible through HTTP(S) URLs built upon the following pattern:

<service endpoint>/rest/<version>/<functionality>

• <service endpoint> represents the mounting point of the service and may look like https://arc.example.org:443/arex.
• <version> is a two-part number separated by a dot. The current version is 1.0.
• <functionality> is one of the keywords defined below.
Further on, the part <service endpoint>/rest/<version> is referred to as the base URL. All parts of the URL to the right of the hostname are case-sensitive.
Depending on the Accept header in the HTTP request (Accept: application/json, Accept: text/xml or Accept: application/xml), information in the response is rendered in either JSON or XML format. If not specified it defaults to text/html and the output is compatible with an ordinary web browser.
In the HTTP response headers the HTTP Status-Code (RFC7231) indicates the status of the overall request (e.g. 403 corresponds to forbidden). For the operations that support multiple (bulk) requests per single API call, in addition to the Status-Code in the HTTP header, per-request Status-Codes are returned. They are included as part of the response array in the HTTP body using the same RFC7231 values, following the syntax defined below.

5.3.2 Description of functionalities and operations

Requesting supported versions

GET /rest

Operations:
• GET - returns the list of supported REST API versions
• POST, PUT, DELETE - not supported

Example response. The XML response is like:

<versions>
  <version>1.0</version>
  <version>1.1</version>
  <version>1.2</version>
</versions>

The JSON is:

{version: [ "1.0", "1.1", "1.2" ]}

or


{version: "1.0"}

Obtaining CE resource information

GET /info[?schema=glue2]

Operations:
• GET - retrieve generic information about cluster properties. It accepts the optional schema parameter with the following values: glue2, crr. By default the information is served as a GLUE2 document. The CRR rendering will be added in future ARC releases. XML or JSON is returned according to the request headers.
• HEAD - supported
• PUT, POST, DELETE - not supported.

Example QUERY:

GET https://host.domain.org:443/arex/rest/1.0/info?schema=glue2 HTTP/1.1
Accept: application/xml

The XML response is:

xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="https://raw.github.com/OGF-GLUE/XSD/master/schema/GLUE2.xsd"> Validity="10800"> urn:ad:UNDEFINEDVALUE UNDEFINEDVALUE false urn:ogf:ComputingService:arc.zero:arex data.transfer.cepush.srm executionmanagement.jobmanager ... output omitted ...

Operating jobs

GET /jobs[?state=<state1>[,<state2>[...]]]
POST /jobs?action=new
POST /jobs?action={info|status|kill|clean|restart}

Operations:
• GET - get the list of jobs
• HEAD - supported
• POST - job submission and management
• PUT, DELETE - not supported


Get list of jobs

GET /jobs retrieves the list of jobs belonging to the authenticated user as application/xml or application/json. The returned document contains a list of job IDs. It accepts the optional state parameter. When defined, the returned document contains only jobs in the requested state(s). Example QUERY:

GET https://host.domain.org:443/arex/rest/1.0/jobs HTTP/1.1
Accept: application/xml

The XML response is:

<jobs>
  <job>
    <id>1234567890abcdef</id>
  </job>
  <job>
    <id>fedcba0987654321</id>
  </job>
</jobs>

The JSON is:

{ "job":[ {"id":"1234567890abcdef"}, {"id":"fedcba0987654321"} ] }

Job submission (create a new job)

POST /jobs?action=new initiates the creation of a new job instance or of multiple jobs. The request body contains the job description(s) in one of the supported formats: ADL as Content-type: application/xml or xRSL as Content-type: application/rsl.
To pass multiple job descriptions of the same type in the document body:
• ADL descriptions are enclosed in an <ActivityDescriptions> element
• xRSL uses + to merge multiple jobs.
The response contains the 201 code. The response body contains an array of elements corresponding to the sequence of job descriptions in the request, in the same order. The elements of the array in the response contain:
• status-code: a 3-digit integer result code of the attempt to understand and satisfy the request (according to RFC7231)
• reason: a short textual description of the Status-Code
• id: job UUID or None if not assigned (unsuccessful submission)
• state: the job state according to the state model or None if not available (unsuccessful submission)
The XML response is:

<jobs>
  <job>
    <status-code>201</status-code>
    <reason>Created</reason>
    <id>1234567890abcdef</id>
    <state>ACCEPTING</state>
  </job>
  <job>
    <status-code>500</status-code>
    <reason>Requested RTE is missing</reason>
  </job>
</jobs>

The JSON is:

{ "job":[ { "status-code":"201", "reason":"Created", "id":"1234567890abcdef", "state":"ACCEPTING" }, { "status-code":"500", "reason":"Requested RTE is missing", } ] }

Jobs management

POST /jobs?action={info|status|kill|clean|restart} - job management operations supporting arrays of jobs. The request body contains the list of jobids as JSON/XML (e.g. the output of GET /jobs can be reused). Example of the body in XML:

<jobs>
  <job>
    <id>1234567890abcdef</id>
  </job>
  <job>
    <id>fedcba0987654321</id>
  </job>
</jobs>

And in JSON:

{ "job":[ {"id":"1234567890abcdef"}, {"id":"fedcba0987654321"} ] }

The response depends on the requested action:
Job info: POST /jobs?action=info retrieves full information about the job(s) as a GLUE2 activity information XML document, or in JSON format. The response contains the 201 code. The response body contains an array of elements corresponding to the job IDs in the request. The elements of the array in the response contain:


• status-code: a 3-digit integer result code of the attempt to understand and satisfy the request (according to RFC7231); 200 is the only positive response
• reason: a short textual description of the Status-Code
• id: job UUID
• info_document: GLUE2 activity information about the job, or an empty document if not available (request not satisfiable)

Job status: POST /jobs?action=status retrieves information about the current state of the job(s). The response body contains an array of elements corresponding to the job IDs in the request. The elements of the array in the response contain:
• status-code: a 3-digit integer result code of the attempt to understand and satisfy the request (according to RFC7231); 200 is the only positive response
• reason: a short textual description of the Status-Code
• id: job UUID
• state: the job state according to the state model, or None if not available

Killing jobs: POST /jobs?action=kill sends a request to kill the job(s). The response body contains an array of elements corresponding to the job IDs in the request. The elements of the array in the response contain:
• status-code: a 3-digit integer result code of the attempt to understand and satisfy the request (according to RFC7231). The response code 202 indicates that the request is queued for later execution and is the only positive response.
• reason: a short textual description of the Status-Code
• id: job UUID

Clean job files: POST /jobs?action=clean sends a request to clean the job(s) files. The response body contains an array of elements corresponding to the job IDs in the request. The elements of the array in the response contain:
• status-code: a 3-digit integer result code of the attempt to understand and satisfy the request (according to RFC7231). The response code 202 indicates that the request is queued for later execution and is the only positive response.
• reason: a short textual description of the Status-Code
• id: job UUID

Restart job: POST /jobs?action=restart sends a request to restart the job(s). The response body contains an array of elements corresponding to the job IDs in the request. The elements of the array in the response contain:
• status-code: a 3-digit integer result code of the attempt to understand and satisfy the request (according to RFC7231). The response code 202 indicates that the request is queued for later execution.
• reason: a short textual description of the Status-Code
• id: job UUID

Job delegations: POST /jobs?action=delegations retrieves the list of delegations associated with the job. The response body contains an array of elements corresponding to the job IDs in the request. The elements of the array in the response contain:
• status-code: a 3-digit integer result code of the attempt to understand and satisfy the request (according to RFC7231); 200 is the only positive response
• reason: a short textual description of the Status-Code


• id: job UUID
• delegation_id: an array of assigned delegation IDs

File operations

Files belonging to a specific job are operated on using URLs under /jobs/<job-id>/.

Working with session directory

GET /jobs/<job-id>/session/<path>
DELETE /jobs/<job-id>/session/<path>
PUT /jobs/<job-id>/session/<path>

Operations:
• GET, HEAD, PUT, DELETE - supported for files stored in the job's session directory and perform the usual actions.
• GET, HEAD - for directories retrieves the list of stored files (consider WebDAV for format)
• DELETE - for directories removes the whole directory
• PUT - for directories not supported.
• POST - not supported.

Delegation functionality

GET /delegations
POST /delegations?action=new

Operations:
• GET - retrieves the list of delegations belonging to the authenticated user
• HEAD - supported
• POST - create a new delegation
• PUT, DELETE - not supported

POST /delegations/<delegation-id>?action=get,renew,delete
PUT /delegations/<delegation-id>

Operations:
• GET, HEAD - not supported
• POST - manage the particular delegation ID
• PUT - store the delegation public part for the particular delegation ID

Get list of delegations

GET /delegations - retrieves the list of delegations belonging to the authenticated user.

QUERY:

GET https://host.domain.org:443/arex/rest/1.0/delegations HTTP/1.1
Accept: application/xml

The XML response is:


<delegations>
  <delegation>
    <id>1234567890abcdef</id>
  </delegation>
  <delegation>
    <id>fedcba0987654321</id>
  </delegation>
</delegations>

The JSON formatted response (make consistent across specification):

{ delegation: [ { "id":"1234567890abcdef"}, { "id":"fedcba0987654321"} ] }

New delegation

Delegation is a 2-step process. Step 1 generates a pair of private/public keys on the server side and communicates an X.509 certificate request to the client. The client signs the public key and stores the delegated certificate to finish the delegation procedure.
Step 1: POST /delegations?action=new starts a new delegation process. The response is 201 and contains a certificate request of application/x-pem-file type and the URL of the delegation, with the assigned delegation id, in the Location HTTP header.
Step 2: PUT /delegations/<delegation-id> stores the public part. The request body contains the signed certificate (Content-type: application/x-pem-file). The response is 200 on success.
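For illustration, the two HTTP steps could look as follows with the third-party Python requests library. This is a sketch only: the host, proxy path and CA directory are placeholders, and the actual signing of the certificate request (creation of the proxy certificate) is not shown.

# Sketch of the two delegation steps over HTTP.
import requests   # third-party: pip install requests
from urllib.parse import urljoin


def sign_csr(csr_pem, proxy_path):
    # Placeholder: produce the signed (proxy) certificate from the request
    # using the user's credentials, e.g. with an external tool or a crypto
    # library. Not shown here.
    raise NotImplementedError


base = "https://host.domain.org:443/arex/rest/1.0"   # base URL, placeholder
proxy = "/tmp/x509up_u1000"                          # user proxy file, placeholder

session = requests.Session()
session.cert = (proxy, proxy)
session.verify = "/etc/grid-security/certificates"

# Step 1: the server generates a key pair and returns a certificate request.
r = session.post(base + "/delegations?action=new")
assert r.status_code == 201
csr_pem = r.text                                      # application/x-pem-file
delegation_url = urljoin(base + "/", r.headers["Location"])

# Step 2: sign the request and store the public part under the delegation URL.
signed_pem = sign_csr(csr_pem, proxy)
r = session.put(delegation_url, data=signed_pem,
                headers={"Content-Type": "application/x-pem-file"})
assert r.status_code == 200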

Delegations management

Delegations are managed one by one. The same delegation ID can be re-used for multiple jobs (submitted separately or in batch). The delegation ID to be used in the job context must be explicitly specified as part of the job description, in a description-language-defined way (e.g. DelegationID in ADL).
POST /delegations/<delegation-id>?action=get,renew,delete is used to manage a delegation. The request body is empty and the operation is defined by the action value. The response is structured depending on the action:
Get delegation: POST /delegations/<delegation-id>?action=get returns the public part of the stored delegation as application/x-pem-file.
Renew delegation: POST /delegations/<delegation-id>?action=renew initiates renewal of the delegation. The response is 200 with a certificate request of application/x-pem-file type.
Delete delegation: POST /delegations/<delegation-id>?action=delete removes the delegation. The response is 200 with no body expected.

A-REX control directory files access for debugging purposes

GET /jobs/<job-id>/diagnose/<file type>

Operations:


• GET - return the content of the file in the A-REX control directory for the requested job ID
• HEAD - supported
• POST, PUT, DELETE - not supported
The <file type> matches the controldir file suffix and can be one of the following:
• failed
• local
• errors
• description
• diag
• comment
• status
• acl
• xml
• input
• output
• input_status
• output_status
• statistics



5.3.3 REST Interface Job States

Table 5.4: State identifiers used with ARC REST API

ACCEPTING (A-REX internal state: ACCEPTED)
    This is the initial job state. The job has reached the cluster, a session directory was created, the submission client can optionally upload files to the sessiondir. The job waits to be detected by A-REX; the job processing on the CE hasn't started yet.

ACCEPTED (A-REX internal state: PENDING:ACCEPTED)
    In the ACCEPTED state the newly created job has been detected by A-REX but can't go to the next state due to an internal A-REX limit. The submission client can optionally upload files to the sessiondir.

PREPARING (A-REX internal state: PREPARING)
    The job is undergoing the data stage-in process, input data is being gathered into the session directory (via external downloads or making cached copies available). During this state the submission client still can upload files to the session directory. This is an I/O heavy job state.

PREPARED (A-REX internal state: PENDING:PREPARING)
    The job successfully completed the data stage-in process and is being held waiting in A-REX's internal queue before it can be passed over to the batch system.

SUBMITTING (A-REX internal state: SUBMIT)
    The job environment (via using RTEs) and the job batch submission script are being prepared, to be followed by the submission to the batch system via the available batch submission client interface.

QUEUING (A-REX internal state: INLRMS)
    The job is under the control of the local batch system and is "queuing in the batch system", waiting for a node/available slot.

RUNNING (A-REX internal state: INLRMS)
    The job is under the control of the local batch system and is "running in the batch system", executing on an allocated node under the control of the batch system.

HELD (A-REX internal state: INLRMS)
    The job is under the control of the local batch system and is being put on hold or being suspended; for some reason the job is in a "pending state" of the batch system.

EXITINGLRMS (A-REX internal state: INLRMS)
    The job is under the control of the local batch system and is finishing its execution on the worker node; the job is "exiting" from the batch system either because the job is completed or because it was terminated.

OTHER (A-REX internal state: INLRMS)
    The job is under the control of the local batch system and is in some "other" native batch system state which can not be mapped to any of the previously described batch system states.

EXECUTED (A-REX internal state: PENDING:INLRMS)
    The job has successfully completed in the batch system. The job is waiting to be picked up by A-REX for further processing or waiting for an available data stage-out slot.

FINISHING (A-REX internal state: FINISHING)
    The job is undergoing the data stage-out process, A-REX is moving output data to the specified output file locations, the session directory is being cleaned up. Note that failed or terminated jobs can also undergo the FINISHING state. This is an I/O heavy job state.

FINISHED (A-REX internal state: FINISHED)
    Successful completion of the job on the cluster. The job has finished ALL its activity on the cluster AND no errors occurred during the job's lifetime.

FAILED (A-REX internal state: FINISHED)
    Unsuccessful completion of the job. The job failed during one of the processing stages. The job has finished ALL its activity on the cluster and some problems occurred during the lifetime of the job.

KILLING (A-REX internal state: CANCELLING)
    The job was requested to be terminated by an authorized user and as a result it is being killed. A-REX is terminating any active process related to the job, e.g. it interacts with the LRMS by running the job-cancel script or stops data staging processes. Once the job has finished ALL its activity on the cluster it will be moved to the KILLED state.

KILLED (A-REX internal state: FINISHED)
    The job was terminated as a result of an authorized user request. The job has finished ALL its activity on the cluster.

WIPED (A-REX internal state: DELETED)
    The generated results of jobs are kept available in the session directory on the cluster for a while after the job reaches its final state (FINISHED, FAILED or KILLED). Later, the job's session directory and most of the job related data are going to be deleted from the cluster when an expiration time is exceeded. Jobs with an expired session directory lifetime are "deleted" from the cluster in the sense that only a minimal set of info is kept about such a job, and their state is changed to WIPED.

5.3.4 Status of This Document

This document provides the normative specification for the ARC REST Interface version 1.0. Note that during the testing of the implementation the exact rendering of the responses (especially JSON) might be adjusted.
In order to complete the finalization of this specification, the following actions need to be carried out:
• provide more detailed request/response examples
• DelegationID should be added to xRSL. A per-URL delegation ID can be specified as a URL option; a job-wide delegation id requires a new delegationId option.
This specification was designed according to the requirements listed below:
1. Support for versioning: via URL paths like https://arc.zero:443/arex/rest/1.0/jobs
2. Usable with simple tools (wget, curl)
3. Friendly to common HTTP REST frameworks
4. Interactive access to session directory content
5. Machine readable error/result codes/messages
6. No drastic changes to information representation and jobs handling
7. Support for different response formats: xml, json
Plans for functionality extension post version 1.0:
1. More effective bulk operations: with HTTP v2; this will require HTTP v2 development for HED, so this feature is postponed to later versions
2. Resource information functionality: consider filtering through URL options, consider supporting references (relative URLs) to underlying resources
3. Scalability for many jobs and delegations: consider filtering through URL options
4. Jobs: consider a way to provide a list of all jobs per site or per VO to special monitoring agents
5. Add a hold action for jobs management once it is implemented
6. For sessiondir access add PATCH for files to modify parts of files. The body format needs to be defined; all files are treated as binary, and currently only non-standard PUT with ranges is supported.

5.4 ARCHERY data model and DNS records rendering

ARCHERY implies a minimalistic data model that targets mostly static information to address the distributed e-Infrastructure resource grouping, software environment provisioning and service discovery needs. The ARCHERY objects and their relations are embedded into the DNS infrastructure as dedicated Resource Record sets according to the rendering described below.

5.4.1 ARCHERY objects

To represent the distributed e-infrastructure concept the ARCHERY data model defines three types of objects: the Endpoint Object, the Service Object and the Group Object. To represent the community software environments in the e-infrastructure two additional types of objects are defined in the ARCHERY data model: the Software Object and the RTE Object.


Fig. 5.13: ARCHERY e-Infrastructure topology objects, their attributes and relations in the data model.

Fig. 5.14: ARCHERY software environment objects, their attributes and relations in the data model.


Endpoint object

The Endpoint object is used to capture information about a network location that can be used to access specific service functionality, including accessing information within the ARCHERY registry service itself. For the latter, special endpoint types were defined (see below). The Endpoint object is described with the following set of attributes:
• Endpoint URL - defines the network location by which the service functionality is accessible;
• Endpoint Type - contains the value from the Endpoint type enumeration defined by the e-Infrastructure operators [1]. In addition to ordinary service endpoint types, two special ARCHERY endpoint types, archery.group and archery.service, have been introduced for accessing the ARCHERY Group and Service objects within the registry. Other special types include archery.software and archery.rte, which are used to access the ARCHERY Software and RTE objects respectively when a community software registry is embedded into the DNS.
• Endpoint State (optional) - a Boolean value that indicates the endpoint availability. In case of an ARCHERY service endpoint, if the endpoint state is false (unavailable) it is still used to represent the e-Infrastructure topology, but should not be used during service endpoint discovery. A missing attribute is interpreted as true, i.e. the endpoint is available.

Service object

The Service object represents an e-Infrastructure service, like a Computing Element, Storage Element, etc. The Service object is described with the following set of attributes:
• Service ID - holds the service identifier as an arbitrary string. For example the ID can be derived from the hostname;
• Service Type - contains the value from the Service type enumeration defined by the e-Infrastructure operators;
• Service Endpoints - an array of Endpoint objects associated with the Service.

Group object

The Group object is used to organize other objects such as ARCHERY Service or Group objects. The grouping was introduced to the data model so that infrastructure topologies (i.e. hierarchies or federations) can be represented inside the registry in a flexible way. The Group object is described with the following set of attributes:
• Group ID (optional) - holds the group identifier as an arbitrary string.
• Group Type (optional) - defines the grouping type based on organizational structure; example values could be: Site, Country, Tier, etc.
• ARCHERY Endpoints - an array of Endpoint objects of the specially defined ARCHERY endpoint types (archery.group, archery.service or archery.software). These Endpoint objects describe the optional state and URL of the ARCHERY objects inside the registry.

Software object

Software objects are used to represent community-defined RTE registry instances. The Software object provides the community public key data and enumerates the available RTEs. It is described with the following set of attributes:
• Registry ID (optional) - holds the software registry identifier as an arbitrary string.
• PubKey Endpoint - an Endpoint object that contains the URL of the public key location. It can point to external URLs or to raw data inside the DNS. The technical implementation uses the gpg.pubkey and gpg.pubkey.base64 endpoint types depending on whether Base64 encoding is used.
• RTEs Endpoints - an array of Endpoint objects of the special ARCHERY endpoint type archery.rte. These Endpoint objects describe the optional state and URL of the ARCHERY RTE objects inside the registry.

1 The org.ogf.glue.emies.activitycreation, org.nordugrid.ldapglue2 and org.nordugrid.gridftpd are examples of endpoint types used in the NorduGrid infrastructure.

RTE Object

The RTE Object is used to represent a particular community-defined RTE script. It encapsulates the RTE ID (according to the naming and versioning convention), an optional RTE description, and contains Endpoints to signed RTE scripts. It is described with the following set of attributes:
• RTE ID - holds the RTE identifier according to the naming and versioning scheme
• RTE Description (optional) - an arbitrary string that contains the human-readable description of the RTE
• RTE Data Endpoints - an array of Endpoint objects that point to the signed RTE data. They can point to external URLs or to raw data inside the DNS. The technical implementation uses the gpg.signed and gpg.signed.base64 endpoint types depending on whether Base64 encoding is used.

Entry point group object

Following the data model, the service endpoint discovery process can start from the entry point Group Object and proceed by recursively contacting all the ARCHERY Endpoints. The Software objects can be attached to any parent Group Object and discovered using the same recursive traversal of the e-Infrastructure topology. ARCHERY object attributes allow taking into account group, service and/or endpoint types as well as the availability status during the recursive discovery process, so that only a subset of objects is queried.
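Since ARCHERY objects are plain DNS TXT resource records (see the rendering below), a single discovery step can be sketched with standard DNS tools. The registry name and the record content in this example are hypothetical:

[user ~]$ dig +short TXT _archery.example.org
"o=group"
"u=ce1._archery.example.org t=archery.service s=1"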

5.4.2 DNS Resource Records rendering

Rendering ARCHERY objects

ARCHERY objects are rendered using TXT resource records (RRs). These TXT RRs contain space-separated key=value pairs where the keys correspond to the data model object attributes. Boolean object attributes have values specified as 0 or 1.

The Endpoint Object is rendered with a single TXT RR where the Endpoint Object attributes (Endpoint URL, Endpoint Type and the optional Endpoint State) are represented by the u, t and s keys, in this specific order, within the space-separated key-value pairs.

The Service Object of the ARCHERY data model is rendered by a dedicated RRSet identified by a unique domain name that can be used to access this object inside the registry. The service RRSet is composed of a single service object identity RR and one RR for every Endpoint object associated with the service. The service object identity RR has the format of o=service followed by t=<service type> and id=<service ID>:

TXT "o=service t=<service type> id=<service ID>"
TXT "u=<endpoint URL> t=<endpoint type> [s={0|1}]"
TXT ...
TXT "u=<endpoint URL> t=<endpoint type> [s={0|1}]"

The Group Object is rendered by a dedicated RRSet identified by a unique domain name that can be used to access this object inside the registry. The Group RRSet is composed of a single Group object identity RR and one RR for every ARCHERY Endpoint object pointing to other Service or Group objects that are part of this specific Group. The Group object identity RR has the format of o=group followed by an optional t=<group type> and an optional id=<group ID>:

TXT "o=group [t=<group type>] [id=<group ID>]"
TXT "u=<RRSet DNS name> t=archery.{group|service} [s={0|1}]"
TXT ...
TXT "u=<RRSet DNS name> t=archery.{group|service} [s={0|1}]"

The Software Object of the ARCHERY data model is rendered by a dedicated RRSet identified by a unique domain name that can be used to access this object inside the registry. The software object RRSet is composed of a single object identity RR, the PubKey Endpoint object RR and one RR for every RTEs Endpoint object associated with the registry. The software object identity RR has the format of o=software followed by an optional id=<registry ID>:

TXT "o=software [id=<registry ID>]"
TXT "u=<public key URL or data> t=gpg.pubkey[.base64]"
TXT "u=<RRSet DNS name> t=archery.rte [s={0|1}]"
TXT ...
TXT "u=<RRSet DNS name> t=archery.rte [s={0|1}]"

The RTE Object of the ARCHERY data model is rendered by a dedicated RRSet identified by a unique domain name that can be used to access this object inside the registry. The RTE object RRSet is composed of a single object identity RR and one RR for every RTE Data Endpoint object pointing to a signed RTE data location. The RTE object identity RR has the format of o=rte followed by id=<RTE ID> and an optional d=<description>. The description should be encoded to eliminate spaces; using URL encoding is advised. Typically there is only one RTE Data Endpoint object that points to either the DNS or an external URL. Several RTE Data Endpoints imply external URL redundancy:

TXT "o=rte id=<RTE ID> [d=<description>]"
TXT "u=<signed RTE URL or data> t=gpg.signed[.base64] [s={0|1}]"

Embedding ARCHERY object relations

In the registry rendering the entry point Group Object is distinguished by a predefined RRSet DNS name starting with the mandatory _archery prefix. Any other object may have an arbitrary RRSet name in any DNS zone. The exact RRSet naming scheme does not affect the service discovery process, but should be consistent and transparent from the operational point of view. The child-parent Endpoint Object to Service Object relation is implicitly defined by means of grouping within the same RRSet. The same applies to the “RTE Data Endpoint Object to RTE Object” and “PubKey Endpoint Object to Software Object” relations. The ARCHERY data model allows grouping of Service, Group and Software Objects into Group objects via the intermediate Endpoint objects. This grouping in the DNS rendering is implemented by using the dedicated ARCHERY Endpoint types that are contained as RR text inside the Group object's RRSet. The same applies to the “RTEs Endpoint Object to Software Object” relation. The Group Object to Group Object, Group Object to Service Object, Group Object to Software Object and Software Object to RTE Object relations all rest upon the unique DNS name of the RRSet of the pointed object.
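For illustration only, a tiny registry following this rendering could be embedded into a DNS zone as sketched below; all domain names, service types and URLs here are hypothetical:

_archery.example.org.        IN TXT "o=group"
_archery.example.org.        IN TXT "u=site1._archery.example.org t=archery.group s=1"
site1._archery.example.org.  IN TXT "o=group t=Site id=SITE1"
site1._archery.example.org.  IN TXT "u=ce1._archery.example.org t=archery.service s=1"
ce1._archery.example.org.    IN TXT "o=service t=org.nordugrid.arex id=ce1.example.org"
ce1._archery.example.org.    IN TXT "u=https://ce1.example.org:443/arex t=org.ogf.glue.emies.activitycreation s=1"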

5.5 Old Relevant Technical Documents

Note: Many of the technical documents exist for ARC5 only. Those that are verified to be relevant for ARC6 are listed below.


5.5.1 Hosting Environment of the Advanced Resource Connector middleware

Document gives a deep technical description of the HED service container.


5.5.2 A Client Library for ARC

Document describes from a technical viewpoint the plugin-based client library of ARC.

5.6 Legacy JURA Accounting Technical Details

WARNING: This component was deprecated in ARC 6.4 and completely removed in ARC 6.8!

Warning: Information in this chapter is relevant only for the 6.0-6.3 ARC releases. Starting from the ARC 6.4 release the next generation accounting subsystem with a local accounting database is used instead. Make sure you are reading the documentation that matches your ARC CE release version.

General accounting configuration and operation flows are described in Accounting with legacy JURA. This section contains more technical details about the implementation of each component of the accounting subsystem.

5.6.1 Records processing and publishing

AREX Accounting Records (AAR job log files)

The A-REX Accounting Records (AAR) are job log files generated by A-REX. AAR is the only source of accounting information for JURA. AARs are written by A-REX based on the job data available, including the .diag files that the backend scripts create based on batch system data and/or GNU time utility measurements. These job log files reside under the <control_dir>/logs directory. The name of an AAR job log file consists of the ID of the job and a random string to avoid collisions between multiple job log files for the same job: <job ID>.<random>. The AAR job log file consists of name=value lines, where the value is either job-related resource consumption data or static info like the name, ID or proxy certificate. A-REX generates at least two job log files for each job: one at the time of job submission, another one after the job finishes, and possibly others at various job events. Please note that JURA makes use of only one of the A-REX generated files belonging to the same job: the one that corresponds to the FINISHED job event (such a state is indicated by status={completed|failed|aborted}).

Fig. 5.15: ARC CE accounting: records creation, processing and publishing

JURA initial AAR processing

A-REX periodically runs jura, which loops over the available A-REX job log records in <control_dir>/logs. JURA opens all the files and processes only those that correspond to a FINISHED job state. JURA converts AARs to per-job, per-destination extended AARs that also contain the target information from arc.conf. One extended AAR is generated per accounting target. The extended AAR job log files are named <job ID>.<random>_<target>, where the first part is taken from the original AAR. The original <job ID>.<random> AAR file is deleted once the per-destination extended AAR logs have been created by JURA.

Note: JURA as part of the initial processing deletes all files corresponding to non-finished job states.

JURA publishing loop

The JURA publishing subsystem loops over the extended AAR logs in the A-REX <control_dir>/logs directory. JURA generates records in the Usage Record (UR) format proposed by the Open Grid Forum (OGF) for SGAS, or in the Compute Accounting Record (CAR) XML format for APEL. The extended AAR job log file is deleted once the record is successfully submitted, thus preventing multiple insertions of the same usage record. If submission to a destination fails, the extended AAR log files are kept, so another attempt is made upon a subsequent run of JURA. This mechanism is repeated until the expiration time passes, at which point the next execution of JURA removes the file without processing. Please note that the JURA publishing loop is backward compatible with the ARC 5 implementation.

Reporting to SGAS

SGAS has a simple custom web service interface loosely based on WS-ResourceProperties. JURA uses the insertion method of this interface to report URs directly, using the ARC HTTP client implementation. The corresponding processed extended AAR job log files are deleted after receiving a non-fault response from the service. To increase communication efficiency JURA sends URs in batches; SGAS accepts a batch of URs in a single request. The batch is an XML element called UsageRecords, containing elements representing the URs. The process of handling batches is the following: JURA does not send all usage records immediately after generation, but instead collects them in a batch until reaching the maximal number of records or until running out of job log files. The maximal number of URs in a batch can be set with the urbatchsize configuration parameter of the SGAS target.
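As an illustration, such a target could be described in arc.conf roughly as below; the block naming follows the legacy JURA configuration, while the target name, URL and batch size are made-up examples:

[arex/jura/sgas:mysgas]
targeturl = https://sgas.example.org:8443/sgas
urbatchsize = 80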

Reporting to APEL

APEL uses the SSM framework for communication. JURA sends records to APEL by invoking a helper ssmsend process that uses the SSM Python libraries developed by APEL. ARC ships a minimal set of SSM libraries along with the A-REX binary packages to allow SSM usage. If SSM binary packages from APEL are available for your OS (e.g. EL6), you can install these packages and they will automatically be used instead of those shipped with ARC. JURA prepares the messages to be sent by ssmsend and puts them into the SSM outgoing directory located in /var/spool/arc/ssm/<destination host>/outgoing/00000000/. Generated messages are XML-based CAR records. Reporting to APEL also works by sending records in batches. The default urbatchsize value is set to 1000 according to the APEL recommendations.

5.6.2 Accounting archive

After records are archived by jura, a dedicated jura-archive-manager process manages the archive layout and the archive database.

5.6.3 Republishing process

Republishing is a complex workflow that involves:
• querying the archive database to find out which subset of records should be republished
• exporting the necessary records from the archive structure to a temporary flat directory
• triggering jura with a corresponding configuration to grab records from the temporary directory and send them to the target
• cleanup after republishing
This chain is controlled from the arcctl code.


Fig. 5.16: ARC CE accounting: records archiving

Fig. 5.17: ARC CE accounting: records republishing


5.6.4 Security

The JURA executable runs with the same user privileges as the A-REX. The owner of a job log file is the local user mapped for the submitter entity of the corresponding job. Since these files contain confidential data, A-REX restricts access to them, allowing only read access for the job owner; thus when JURA is executed by A-REX it is allowed to read and delete the job log files. All usage records are submitted using the X.509 credentials specified by the x509_* set of configuration options in arc.conf. No proxies are used for communication with accounting services. The only access restriction made by an SGAS service is matching the Distinguished Name of the client (in this context JURA) with a set of trusted DNs. When access is granted, policies are then applied by SGAS, allowing either publishing and/or querying rights. Clients with publishing rights can insert any UR, regardless of content. By default, querying rights only allow retrieving URs pertaining to jobs submitted by the querying entity.
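For example, the host credentials referred to above are typically configured with the x509_* options in arc.conf; the paths below are the usual defaults and are shown only for illustration:

[common]
x509_host_key = /etc/grid-security/hostkey.pem
x509_host_cert = /etc/grid-security/hostcert.pem
x509_cert_dir = /etc/grid-security/certificates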

5.6.5 Implementation and API

JURA, as part of the ARC software stack, is written in C++ and utilizes the functionality provided by the ARC libraries, including secure HTTPS communication provided by the ARC pluggable TLS and HTTP modules. The modular design is also present in the usage reporting part of the JURA code, making it possible to extend JURA's support of accounting services. To create a JURA module one should simply write a C++ class which inherits from the abstract Arc::Destination class, and it must extend the two methods:
• static Arc::Destination* Arc::Destination::createDestination(Arc::JobLogFile&)
• void Arc::Destination::report(Arc::JobLogFile&)
The static createDestination method should initialize an object of the specialized class, using the configuration options specified in the passed Arc::JobLogFile object, and the memory allocated by the method should be freed by the caller. The report method should then carry out the transfer of the UR, represented by the JobLogFile object, to the accounting service. The JURA archive manager is written in Python and shares the classes for managing ARC components with the ARC Control Tool.

5.6.6 Limitations

The following list describes some issues which limit the functionality of JURA:
• The current implementation of JURA and A-REX supports only one expiration time for all the reporting destinations. Even though the configuration enables the specification of different expiration values per reporting destination, this is not taken into account by the system; the last value is used as the common expiration time value.
• It is not possible to use different credentials per destination.
• If you are updating from ARC5 with an old jura accounting archive already containing records, the conversion process to index the archive structure will be initiated and will cause serious system load until finished. To avoid old archive conversion, you can move the records away before the update.
• Some optional UR properties are not supported.
• Memory can be reported incorrectly with buggy GNU “time” results.

5.7 ARC Accounting Database Schema

CHAPTER SIX

DOCUMENTATION FOR DEVELOPERS

If you are looking for internal development details of ARC (like how some things are coded) this part of the documentation is for you. It is mainly intended for those who want to contribute to the project development, advanced troubleshooters, or those who are just interested.

6.1 Implementation Details for Developers

6.1.1 General arc.conf python configuration parser

Parsing configuration

Initial configuration parsing

The latest version of arcconfig-parser is designed to operate with a defaults file that holds default values for all possible parameters. At the time of initial parsing the following chain is executed:
• All blocks and options are parsed from arc.conf
• For all blocks defined in arc.conf, missing options are added from the defaults file
• Special constructs in values are substituted (see Special constructs can be used as values)
Parameters that are optional by design and have no default value (specified with the not set value) are not included in the parsed configuration.

Runtime configuration

The configuration that includes both arc.conf and the defaults config is called the runtime configuration. In some cases it is useful to save and load the runtime configuration:
• to supply C++ services (a-rex, gridftpd) with a configuration that includes the defaults from a common place
• for repetitive operations on the config, to eliminate full-chain processing on each invocation
To save the runtime configuration to the default location (/var/run/arc/):

arcconfig-parser --save

To save the runtime configuration to a specified location:

arcconfig-parser --save -r /var/run/arc/arex.arc.conf

To load runtime configuration instead of full-chain processing and e.g. get the value of x509_host_key in [common] block:


arcconfig-parser --load -b common -o x509_host_key

Special constructs can be used as values

The defaults file includes references to other config parts to be consistent with the implied arc.conf structure. The most obvious example: if x509_host_key is not found in e.g. the [arex/jura] block, it should be taken from [common]. The config parser follows this logic (especially with respect to defaults) and uses special constructs to accomplish this behaviour.

Command substitutions

Configuration option values can contain the construct $EXEC{<command>} that is substituted with the stdout of <command>. For example:

hostname=$EXEC{hostname -f}

Option values substitutions

The construct $VAR{[block]option} can be used to substitute the value of another option. If the option is in the same block as the referencing option, the block name can be omitted: $VAR{option}. For example:

x509_host_key=$VAR{[common]x509_host_key}
bdii_update_cmd=$VAR{bdii_location}/sbin/bdii-update

Evaluation of simple code

For a limited number of cases arc.conf default values rely on arithmetic operations. For this purpose the $EVAL{string} special construct has been introduced. For example:

bdii_read_timeout=$EVAL{$VAR{bdii_provider_timeout} + $VAR{[arex]infoproviders_timelimit} + $VAR{[arex]wakeupperiod}}

Getting the configuration values

If the --option argument is passed to arcconfig-parser, the parser returns the value of the specified option to stdout. Without --option, arcconfig-parser can be used to operate with configuration blocks:
• check block existence (the exit code is used to indicate the status of the check)
• return the list of subblocks
With the --export option arcconfig-parser allows exporting the config in the following formats:
• json - returns the entire configuration or a subset of blocks as-is in JSON to stdout


• bash - for the [common] block or a specified configuration subset, returns CONFIG_option_name=value pairs to stdout. Block names ARE NOT included in the exports and option value precedence follows the order of the passed blocks. If automatic subblock expansion is used with the bash export, for every block in the sequence its subblocks are processed first (in the order defined in arc.conf). It is possible to filter the options that will be exported with the additional --filter option that can be specified several times.


Common configuration parsing sequence

[Figure: common configuration parsing sequence. arcconfig-parser (1) parses the defined blocks from /etc/arc.conf, (2) adds defaults for the defined blocks from /usr/share/arc/parser.defaults (built at buildtime from arc.conf.reference, the developers' entry point for defaults and substitution syntax, and shipped with the binary distribution), and (3) evaluates substitutions, producing the runtime configuration. Startup scripts dump this config to /var/run/arc/arc.conf, export it as JSON or BASH, get values, define ENV variables and start a-rex.]

Options reference

The full list of available options of arcconfig-parser is available instantly using the --help option. An online version is available here.


Examples

Get value of option in block:

# arcconfig-parser --block infosys --option logfile
/var/log/arc/infoprovider.log

Get the value of an option in blocks in the order they are specified (e.g. if not found in [gridftpd], look in the [common] block1):

# arcconfig-parser --block gridftpd --block common --option x509_host_key
/etc/grid-security/hostkey.pem

Export entire configuration to JSON2:

# arcconfig-parser --export json

Export [infosys] block options to JSON (for Perl):

# arcconfig-parser --block infosys --export json
{"infosys":{"loglevel":"5"},...

Export [infosys] block and all its subblock options to JSON:

# arcconfig-parser --block infosys --subblocks --export json
{"infosys/glue2/ldap":{"showactivities":"no"},...

Export for BASH (compatible with config representation in shell-based LRMS backends):

# arcconfig-parser --block infosys --block arex --block common --export bash
CONFIG_controldir="/var/spool/arc/jobstatus"
CONFIG_defaultttl="1210000"
CONFIG_delegationdb="sqlite"
CONFIG_hostname="ce01.example.org"
CONFIG_maxrerun="5"
CONFIG_maxjobs="10000 -1"
CONFIG_runtimedir="/home/grid/arc/runtime"
CONFIG_sessiondir="__array__"  # <= NEW define for multivalued values that indicates indexed vars
CONFIG_sessiondir_0="/mnt/scratch/grid/arc/session"
CONFIG_sessiondir_1="/home/grid/arc/session drain"
...

Export for BASH with exported options filtering:

# arcconfig-parser -b common -f hostname -f x509_cert_dir -e bash
CONFIG_hostname="ce01.example.org"
CONFIG_x509_cert_dir="/etc/grid-security/certificates"

Using BASH export:

# eval "$( arcconfig-parser --block infosys --block arex --block common --export

˓→bash )" # echo "$CONFIG_gridmap"

Check block(s) exists ([common/perflog] does not exists in the example):

# arcconfig-parser --block common/perflog --block arex # echo $? 1

1 Block dependencies are now implied by the defaults file, so in most cases it is enough to specify only the block in question.
2 HINT: use arcconfig-parser --export json | jq . to view a highlighted JSON structure in the shell.


List block subblocks:

# arcconfig-parser --block infosys --subblocks
infosys
infosys/ldap
infosys/nordugrid
infosys/glue2
infosys/glue2/ldap
infosys/cluster

Using the parser as a Python module:

from arc.utils import config

# initial parsing with defaults
config.parse_arc_conf('/etc/arc.conf', '/usr/share/arc/arc.parser.defaults')

# get parsed dictionary and list of blocks in the arc.conf order
>>> confdict = config.get_config_dict()
>>> confblocks = config.get_config_blocks()

# get list of all [queue] subblocks sorted by name
>>> sb = config.get_subblocks(['queue'], is_sorted=True)
>>> sb
['queue:grid', 'queue:grid_rt']

# get value of 'x509_host_key' from [arex] block and then from [common] if not found in [arex]
>>> a = config.get_value('x509_host_key', ['arex', 'common'])
>>> a
'/etc/grid-security/hostkey.pem'

# get value of 'allowactivedata' option from [gridftpd] block
>>> b = config.get_value('allowactivedata', 'gridftpd')
>>> b
'yes'

# get value of 'allowactivedata' option from [gridftpd] block (always return list)
>>> c = config.get_value('allowactivedata', 'gridftpd', force_list=True)
>>> c
['yes']

# get value of 'allowactivedata' option from [gridftpd] block (return boolean value)
>>> d = config.get_value('allowactivedata', 'gridftpd', bool_yesno=True)
>>> d
True

6.1.2 LRMS shell-backends overview for developers

CONFIG variables used in the LRMS shell-backends (the # prefix marks options that are used in the *_common scripts and are not unique to the particular backend):

lrms_common.sh:

    $CONFIG_runtimedir                 [arex]
    $CONFIG_shared_scratch             [arex]
    $CONFIG_shared_filesystem          [arex]
    $CONFIG_scratchdir                 [arex]
    $CONFIG_gnu_time                   [lrms]
    $CONFIG_nodename                   [lrms]
    $CONFIG_enable_perflog_reporting   [common]
    $CONFIG_perflogdir                 [common]

submit_common.sh:

    $CONFIG_defaultmemory   [queue] [lrms]
    $CONFIG_hostname        [common]
    $CONFIG_controldir      [arex]

lrms=boinc:

    $CONFIG_boinc_app_id    [lrms]
    $CONFIG_boinc_db_host   [lrms]
    $CONFIG_boinc_db_port   [lrms]
    $CONFIG_boinc_db_user   [lrms]
    $CONFIG_boinc_db_pass   [lrms]
    $CONFIG_boinc_db_name   [lrms]

lrms=condor:

    # $CONFIG_enable_perflog_reporting   [common] not in reference
    # $CONFIG_perflogdir                 [common] not in reference
    # $CONFIG_controldir                 [arex] (for perflog)
    $CONFIG_condor_requirements   [queue] [lrms]
    $CONFIG_condor_rank           [lrms]
    # $CONFIG_shared_filesystem   [arex]
    $CONFIG_condor_bin_path       [lrms]
    $CONFIG_condor_config         [lrms]

lrms=fork: no variables

lrms=ll:

    # $CONFIG_enable_perflog_reporting   [common] not in reference
    # $CONFIG_perflogdir                 [common] not in reference
    # $CONFIG_controldir                 [arex] (for perflog)
    $CONFIG_ll_bin_path               [lrms]
    $CONFIG_ll_consumable_resources   [lrms]
    $CONFIG_ll_parallel_single_jobs   *not in reference
    # $CONFIG_scratchdir   [arex]

lrms=lsf:

    # $CONFIG_enable_perflog_reporting   [common] not in reference
    # $CONFIG_perflogdir                 [common] not in reference
    # $CONFIG_controldir                 [arex] (for perflog)
    $CONFIG_lsf_architecture   [lrms]
    $CONFIG_lsf_bin_path       [lrms]

lrms=pbs:

    # $CONFIG_enable_perflog_reporting   [common] not in reference
    # $CONFIG_perflogdir                 [common] not in reference
    # $CONFIG_controldir                 [arex] (for perflog)
    $CONFIG_pbs_queue_node   [queue]
    $CONFIG_pbs_bin_path     [lrms]
    $CONFIG_nodememory       [queue] ([infosys/cluster] parser substitution fallback only)
    $CONFIG_pbs_log_path     [lrms]
    # $CONFIG_shared_filesystem   [arex]

lrms=sge:

    # $CONFIG_enable_perflog_reporting   [common] not in reference
    # $CONFIG_perflogdir                 [common] not in reference
    # $CONFIG_controldir                 [arex] (for perflog)
    $CONFIG_sge_root           [lrms]
    $CONFIG_sge_cell           [lrms]
    $CONFIG_sge_qmaster_port   [lrms]
    $CONFIG_sge_execd_port     [lrms]
    $CONFIG_sge_bin_path       [lrms]
    $CONFIG_sge_jobopts        [queue] [lrms]
    # $CONFIG_scratchdir   [arex]

lrms=slurm:

    # $CONFIG_enable_perflog_reporting   [common] not in reference
    # $CONFIG_perflogdir                 [common] not in reference
    # $CONFIG_controldir                 [arex] (for perflog)
    $CONFIG_slurm_wakeupperiod   [lrms]
    $CONFIG_slurm_use_sacct      [lrms]
    $CONFIG_slurm_bin_path       [lrms]
    # $CONFIG_shared_filesystem   [arex]

Call graph


Submitting jobs

[Figure: call graph for job submission. a-rex invokes submit_LRMS_job.sh using arc.conf and the grami file; the script defines joboption_lrms, sources lrms_common.sh (parse_arc_conf(), parse_grami(), init_lrms_env(), packaging paths) and submit_common.sh (RTEs, moving files, I/O redirection, defining user ENV, memory requirements, common_init), sources configure_LRMS_env.sh to set LRMS-specific ENV/functions, and finally performs the LRMS-specific submit to the LRMS.]


Scanning jobs

[Figure: call graph for job scanning. a-rex invokes scan_LRMS_job.sh; the script defines joboption_lrms, sources lrms_common.sh (parse_arc_conf(), parse_grami(), init_lrms_env(), packaging paths) and scan_common.sh (common_init(), timestamp conversion, owner UID, read/write diag, save commentfile), sources configure_LRMS_env.sh to set LRMS-specific ENV/functions, and performs the LRMS-specific scan against the LRMS and the controldir.]


Canceling jobs

[Figure: call graph for job cancellation. a-rex invokes cancel_LRMS_job.sh using arc.conf and the grami file; the script defines joboption_lrms, sources lrms_common.sh (parse_arc_conf(), parse_grami(), init_lrms_env(), packaging paths) and cancel_common.sh (common_init()), sources configure_LRMS_env.sh to set LRMS-specific ENV/functions, and performs the LRMS-specific cancel against the LRMS.]

Changes in ARC6 memory limits processing:

The current logic of memory limits processing:
• nodememory - advertises memory for matchmaking: the maximum memory on the nodes (in the [infosys/cluster] block or per-queue)
• defaultmemory - enforced during submission if no memory limit is specified in the job description (in the [lrms] block or per-queue)
The ARC6 logic is no enforcement = no limit1.
Backends behaviour with no memory enforcement limit:
• boinc - set to a hardcoded 2GB
• condor - no enforcement
• fork - no memory handling at all
• ll - no enforcement
• lsf - no enforcement
• pbs - no enforcement2
• sge - no enforcement
• slurm - no enforcement

1 ARC5 logic was no enforcement = max node memory, or 1GB if nodememory is not published (and not used for matchmaking)
2 exclusivenode is memory-based and the nodememory value is used in this case
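As an illustration of the two options above, an arc.conf snippet could look like this; the values are arbitrary examples, not recommendations:

[infosys/cluster]
# advertised for matchmaking: max memory available on the nodes
nodememory = 16384

[lrms]
# enforced at submission when the job description specifies no memory limit
defaultmemory = 2048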


6.2 Contributing to Documentation

NorduGrid ARC6 documentation is mainly written in reStructuredText and built with Sphinx into HTML pages, LaTeX (for printable PDF versions) and ePub. A recent source tree (master branch) build is available instantly via Coderefinery GitLab Pages and deployed to nordugrid.org on a nightly basis. Everyone is welcome to contribute. For trivial fixes just click on the “Edit in GitLab” button and commit the changes. You need to have an account in the Coderefinery GitLab and be a member of the project to be allowed to edit. Contact us if you want to contribute, but are not a member yet.

6.2.1 Commiting and reviewing changes

Contribution is possible by direct push to the repo; there is no need for merge requests if you are fixing typos or other content that does not require review!

Note: In the rendered HTML version there is a link at the bottom of each page labeled Edit page source on GitLab. Using this link you can edit the page source in your web browser without the need to check out the source tree.

In case you want to add something new that requires the review process:
• create a new branch (in the same repository, no need to fork)
• open a merge request to the master branch
• in-line and general discussion of the particular contribution happens inside the merge request.

6.2.2 Documentation structure

As reflected in the main index page, the documentation is logically divided into the following groups:
• Documentation for Infrastructure Users
• Documentation for Infrastructure Admins
• Documentation for Developers
• Technical Documents Describing ARC Components
The admins documentation is the most developed part for ARC6. We aimed to have dedicated documents for each particular topic and then bind them in the following way:
• The introductory Try ARC6: towards distributed computing in a few minutes contains a zero-configuration hands-on instruction to help new users and admins get started with ARC CE.
• The main ARC Computing Element Installation and Configuration Guide contains the general installation and configuration flow for a production Computing Element deployment. The guide itself should be brief enough and holds only the main examples. All detailed instructions for each particular subsystem configuration should be written in a dedicated document that is linked from this guide.
• The ARC Configuration Reference Document is automatically rendered from the text version in the source code tree. It is targeted for web search and is the ultimate configuration options description we have.
• All other dedicated documents go to the Operating ARC CE Subsystems section in case someone wants to access them directly instead of following the installation guide links.


6.2.3 Source tree directory structure

Sources of documents in rST format are placed into the source directory that contains the following structure:
• _static - logo and CSS files1
• _templates - Sphinx HTML theme layout tuning1
• _extensions - custom Sphinx extensions developed for ARC1
• common - directory used to store documentation common for all ARC customers, including admins, users and developers. These documents are hidden in the main TOC tree; instead they are linked from various other documents when needed.
  – repos - this directory holds repository configuration instructions for the release, nightlies, etc. The main index page has repo pointers, as do Try ARC6 and the Install Guide.
  – changelog - documents that describe changes between ARC versions.
• admins - directory used to store Documentation for Infrastructure Admins. The main documents that represent the entry points to ARC Computing Element installation and configuration reside directly in this directory.
  – details - this subdirectory is designed to hold detailed configuration instructions for ARC CE subsystems, like RunTime Environments in ARC6 that describes all aspects of RTEs in depth. These documents are intentionally moved deeper in the main TOC tree to prevent flooding of the first levels. If you can find a chapter in the main install guide that can point to the in-depth document in question, this is the right place to store it.
  – commands - place for the automatically generated command line option reference for ARC tools. At the time of writing the Python tools are represented there.
  – archery - for documents not related to the Computing Element another subdirectory should be created. At the time of writing only ARCHERY documentation of that kind is available.
• developers - directory used to store Documentation for Developers. As long as we don't have many documents of this kind, everything that contains implementation details (how stuff is coded, which variables are used, etc.) should go there.
• tech - directory used to store Technical Documents Describing ARC Components. Now it holds only the index document that contains references to the available PDFs that are verified to be applicable to the ARC6 release. To add more, use the template inside the index file.
• users - directory used to store Documentation for Infrastructure Users. If a document is user-oriented, place it here.
• sdk - place for Doxygen SDK documentation integration into the Sphinx build1
• testing - directory used to store the documents for the ARC 6 Testing Area.
• wip - “work in progress” area hidden in the TOC tree. You can use it for incomplete documents that should not be publicly advertised yet, but will still be built and available via the /wip/ URL in the produced HTML files.

Index files

Each directory in the source tree contains an index.rst file that is used to link other documents in the same TOC tree. The upper-level index.rst file contains references to the index.rst files in the subdirectories, which in turn contain pointers to the other documents.

Note: When you add a new document, add a reference to the index.rst in the same directory

Just follow the index.rst chain (look for the toctree keyword) starting from the source directory to get familiar with the TOC linking structure.
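A minimal sketch of such a toctree block (the listed document names are placeholders):

.. toctree::
   :maxdepth: 2

   admins/index
   users/index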

1 Do not touch unless you are modifying the Sphinx build itself. Nothing there affects the documentation writing process.


Storing images

When you need to add images to your document you should upload the image file itself and refer to it from the .rst. ARC6 documentation structure implies that image files are stored inside the images subdirectory in the document location. Then referencing is done by relative path, e.g:

.. figure:: images/shared_sessiondir_yes.svg

6.2.4 Building the docs

The top directory in the source tree contains the build.sh script that:
• checks out the ARC source code tree and configures it with packaging-like paths
• builds the Doxygen SDK documentation
• converts arc.conf.reference to rST
• copies documentation parts from the ARC source code tree (for the developers section)
• prepares the automatically generated documentation for CLI commands
• builds HTML
• builds PDF (with LaTeX)
• builds ePub
On commit, the GitLab CI is configured to automatically invoke build.sh to produce the rendered documentation. The documentation archive is available as CI job artifacts and for the master branch it is deployed to GitLab Pages. To build the docs on your local machine you should have at least Sphinx installed. Additionally you should be able to configure the ARC source tree and the ARC Python command dependencies for the auto-generated parts. The complete and up-to-date list of dependencies is defined for the CI build and can be found in .gitlab-ci.yml. The PDF build with LaTeX is the heaviest part from both the time and the additional packages perspective. For local debugging of the HTML rendering it is recommended to use the html script argument that skips PDF and ePub:

[user@localhost doc]$ ./build.sh html

Point your browser to file:///path/to/doc/build/html/index.html to view the HTML rendering locally.

6.2.5 Writing Documentation in reStructuredText

reStructuredText (reST) is the default plaintext markup language used by Sphinx. It is possible to render other markups, but for consistency and better cross-referencing the NorduGrid ARC6 documentation is written solely in reStructuredText.

General Syntax

The reStructuredText markup specification is well documented in several sources and was designed to be simple and readable in plain text. Common text editors (including vim and emacs) recognize reST markup and provide syntax highlighting out of the box. Start with the reStructuredText Primer in the Sphinx docs. The complete reST markup syntax can be found on Docutils, starting with the Quick reStructuredText document. Sphinx also uses interpreted text roles to insert semantic markup (cross-referencing, etc.) into documents. To get familiar read this document.


Just study the markup by following the documentation and reading the already written documents right here.

Code snippets

There are different options for representing code, starting with simple literal blocks identified with ::. It is advised to use the code-block:: directive to enable syntax-highlighted rendering:

.. code-block:: ini

   [gridftpd/jobs]
   allowaccess = staticdn dnfromfile

In this example, the ini tag represents the syntax highlighting lexer identifier for Pygments. Look for available lexers to serve your needs. The most common for our docs are: console, bash, ini and cfg.

References

Referencing other parts of the documentation is necessary to achieve usability. The typical referencing cases are:

Using custom label

Create a label just above any paragraph header. The following markup creates the my_label label:

.. _my_label:

My Heading
==========

Referring to a label is possible from any other document with: read the :ref:`my_label`

In this case the paragraph heading will be used as the hyperlink text. If you want some custom hyperlink text use the following syntax: read this :ref:`text <my_label>`

Referring arc.conf.reference

The autogenerated reST rendering of arc.conf.reference already contains labels for all configuration options and blocks that can be used. The label name has the following structure:
• reference_<block>[_<subblock>...]_<option>

In the ``arc.conf`` there is a dedicated :ref:`[lrms] <reference_lrms>` block that
defines the type of your LRMS. *Job session directory* is configured with the
:ref:`reference_arex_sessiondir` configuration option.


Referring bugz

Custom Sphinx plugin for ARC documentation introduces new reST roles that can be used to mention bugs in NorduGrid Bugzilla or GitLab Issues:

More details can be found in :bugz:`3802` and :issue:`57`.

Referencing docs

Referencing the whole document is similar to using labels, but instead of a label name the document name (filename without extension) is used with the :doc: keyword. Example 1: refer to try_arc6.rst in the same source tree directory (relative path). Use the document header as the hyperlink text:

:doc:`try_arc6`

Example 2: Refer to the repository.rst by absolute path (starting from sources top directory). Use the custom hyperlink text:

:doc:`NorduGrid Repositories </path/to/repository>`

Adding notes

To highlight statement visually use notes and warnings:

.. note:: Zero configured A-REX comes with EMI-ES and REST interfaces enabled.

.. warning:: This information is valid for releases of ARC 6 starting from 6.0.

Adding images

In the ARC6 documentation images should be included with a caption. This is accomplished with the figure keyword. After storing the image inside the images subdirectory, include it in the reST document as:

.. figure:: images/shared_sessiondir_yes.svg
   :align: center
   :alt: Sessiondir is shared between ARC CE and WNs

   Sessiondir is shared between ARC CE and WNs. No local scratchdir defined.

The Sphinx build has been configured with an image format autoconversion feature, so you can use any image format, including vector graphics.

Graphviz

If you want to illustrate some process or structure that can be shown as a graph, consider using the built-in Graphviz functionality. Simple usage examples can be found in the admins/details/rtes.rst document within the ARC documentation. Give it a try, it is easy.
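A minimal sketch of the Sphinx graphviz directive; the graph content here is only an illustration:

.. graphviz::

   digraph parsing {
      "arc.conf" -> "arcconfig-parser" -> "runtime configuration";
   }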


Converting from other sources

Converting the old documentation parts from LaTeX or Mediawiki markdown to reST is unfortunately not a magical process that does everything automatically. However, you can get a good start with the pandoc tool that does the conversion. E.g. to convert a LaTeX source simply run:

[user ~]$ pandoc -f latex -t rst acix.tex > acix.rst

The amount of effort needed to edit the resulting .rst file and fix formatting issues completely depends on the source itself (e.g. how the console output was formatted in the original document). Images and references in most cases should be fixed separately after pandoc.


CHAPTER SEVEN

DOCUMENTATION FOR INFRASTRUCTURE USERS

This part of the documentation is targeted at distributed computing infrastructure users that use either the clients or the SDK to run jobs and handle data transfers.

7.1 Installing ARC Client Tools

Warning: Information in this document is fully applicable to ARC 6.5.0 and above only!

7.1.1 Step 1. Enable NorduGrid ARC6 repos

Prepare your system to install via the NorduGrid Repositories. If you need the latest client tools that are not yet released, configure the nightly builds repository following the Using ARC packages from nightly builds instructions.

Note: On several operating systems you can install ARC without the NorduGrid Repositories (e.g. using EPEL for CentOS/RHEL). But even on such systems you will benefit from the NorduGrid Repositories as they contain IGTF certificates and get updated faster on new releases.

7.1.2 Step 2. Install packages

Client tools

The main client tools package is nordugrid-arc-client. Install it using your OS package manager1:

[root ~]# yum -y install nordugrid-arc-client

This package brings the necessary plugins as dependencies and allows you to submit jobs and perform data transfers in the ARC ecosystem, e.g.:
• submit jobs via the EMI-ES interface, defined by XRSL or ADL descriptions
• query the ARCHERY registry and the LDAP information system
• support local and HTTP(S) data transfers
• support Rucio, SRM and ACIX
If you need more features, additional plugins are available (see below).

1 Examples are shown for YUM-based systems. You can use APT or any other package manager in a similar way.


ARCCTL

The ARC Control Tool is a one-stop shop for sysadmins and users running ARC6 that automates many operations. On the client side it allows you to easily deploy CA certificates and VOMS configuration:

[root ~]# yum -y install nordugrid-arc-arcctl

Warning: The ARC Control Tool is available for stand-alone usage without ARC CE since version 6.5. For earlier ARC versions CA certificates and VOMS configuration should be installed manually. You can find some hints in the old client installation instructions.

Additional plugins

You can consider installing additional plugins for extra functionality:
• Job submission:
  – arcrest - submit jobs via the ARC REST interface
  – gridftpjob - submit jobs via the GRIDFTPJOB interface (installs gridftp as a dependency) [globus]
  – internal - submit jobs via the local filesystem Internal interface (requires A-REX on the same host)
• Data transfers:
  – gridftp - support for gridftp data transfers [globus]
  – xrootd - support for the xroot protocol
  – s3 - support for the s3 protocol
  – gfal - adds support for numerous data transfer protocols and file catalogues via installed GFAL2 plugins

[root ~]# yum -y install nordugrid-arc-plugins-<plugin name>

7.1.3 Step 3. Setting up credentials

Currently user authentication in e-Science distributed computing networks (grid) heavily relies on cryptography and uses personal X.509 certificates/keys to identify entities and a set of dedicated Certification Authorities (CAs). In the most common workflow you are also required to be a member of some Virtual Organization (VO) to get access to infrastructure resources. On the technical level the VO membership is currently handled by infrastructure VOMS services.

CA certificates bundle

CA certificates are used to verify entities (e.g. users as well as compute and storage services) and, for infrastructure security reasons, should be installed on the client host as well. In the distributed grid environment a set of dedicated CAs is used as a part of the IGTF. The IGTF CA certificates bundle can be easily installed with the ARC Control Tool2:

[root ~]# arcctl deploy igtf-ca classic

2 In case ARC is not installed from the NorduGrid repositories, use --installrepo argument to enable third-party repositories with IGTF CA certificates


Personal X.509 certificate

For production infrastructure usage you should obtain a personal certificate signed by one of the IGTF accredited CAs. Typically there is at least one IGTF accredited CA in each country, which you can find on the map. Organizational and technical procedures vary from CA to CA, so you should read the instructions on the chosen CA web-site. Once you get your certificate, install it on your client host and ensure the permissions are set correctly. For historical reasons the default location for the certificate and key is ~/.globus unless redefined in the client configuration file:

[user@client ~]$ ls -l ~/.globus/
total 12
-rw-r--r-- 1 user user 6353 Oct  1 11:55 usercert.pem
-r-------- 1 user user 1854 Oct  1 11:55 userkey.pem
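If the permissions need adjusting, they can be set for example as follows (assuming the default location shown above):

[user@client ~]$ chmod 644 ~/.globus/usercert.pem
[user@client ~]$ chmod 400 ~/.globus/userkey.pem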

Note: For testing and development purposes it is possible to use a local testing CA. The ARC Control Tool provides built-in test CA capabilities that allow you to bootstrap your own test CA and generate host and user certificates.
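Assuming the arcctl test CA module is available on your installation, bootstrapping a test CA and a user certificate could look roughly like this (check arcctl test-ca --help for the exact subcommands and options):

[root ~]# arcctl test-ca init
[root ~]# arcctl test-ca usercert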

Virtual Organization membership

In most cases you have some implicit VO affiliation via your workplace. If not, you can search for VOs using e.g. the EGI VO(s) search tool. Every Virtual Organisation (VO) has its own procedures and policies. Please contact the VO support team for membership instructions. To prove that you are a member of a particular VO you need to obtain a special token from the VOMS server as a part of your authentication process. The arcproxy tool will get this token automatically, but requires the VOMS server endpoint details to be configured. With the ARC Control Tool you only need to know the VO name (for VOs in the EGI database) or the VOMS URL3:

[root ~]# arcctl deploy vomses --egi-vo atlas
[root ~]# arcctl deploy vomses --voms https://voms.ndgf.org:8443 --use-client-cert nordugrid.org

7.1.4 Step 4. Try it out

When all is set, you can try to submit a job to the grid infrastructure.

Create proxy certificate

To submit a job, or perform any other action you need a so-called proxy-certificate which is a Single Sign-On token for distributed grid-infrastructure. It is generated in the following way:

[user ~]$ arcproxy
Your identity: /DC=org/DC=AccreditatedCA/O=people/CN=John Smith
Proxy generation succeeded
Your proxy is valid until: 2019-11-28 02:06:57

If you need to include your VO membership confirmation token (attribute certificate), specify the VO name as well:

3 Most VOMS-Admin services by default prohibit access without a client certificate, even to the configuration page. If this is your case, the --use-client-cert option will instruct arcctl to use your personal certificate to establish the connection.


[user ~]$ arcproxy -S area51
Your identity: /DC=org/DC=AccreditatedCA/O=people/CN=John Smith
Contacting VOMS server (named area51): voms.example.org on port: 15001
Proxy generation succeeded
Your proxy is valid until: 2019-11-28 02:08:27

Submit test job

The ARC client tools include the arctest utility that comes with several test jobs on board, so you can try to submit a job without the need to write a job description. You can use the NorduGrid top-level ARCHERY service registry (nordugrid.org) to find available CEs automatically4:

[user ~]$ arctest -J2 --registry nordugrid.org
Submitting test-job 2:
&( executable = "/usr/bin/env" )( stdout = "stdout" )( stderr = "stdout" )( gmlog = "gmlog" )( jobname = "arctest2" )( clientxrsl = "&( executable = ""/usr/bin/env"" )( jobname = ""arctest2"" )( stdout = ""stdout"" )( join = ""yes"" )( gmlog = ""gmlog"" )" )
Client version: nordugrid-arc-6.5.0
Test submitted with jobid: https://arc.example.org:443/arex/oIlKDmiOCuvnjwO5upha6lOqABFKDmABFKDmEFHKDmPBFKDmUYtvNo
Computing service: KNU ARC

The job status can then be checked with the arcstat tool:

[user ~]$ arcstat https://arc.example.org:443/arex/oIlKDmiOCuvnjwO5upha6lOqABFKDmABFKDmEFHKDmPBFKDmUYtvNo
Job: https://arc.example.org:443/arex/oIlKDmiOCuvnjwO5upha6lOqABFKDmABFKDmEFHKDmPBFKDmUYtvNo
 Name: arctest2
 State: Running

Status of 1 jobs was queried, 1 jobs returned information

To fetch the job's stdout run the arccat tool:

[user ~]$ arccat https://arc.example.org:443/arex/oIlKDmiOCuvnjwO5upha6lOqABFKDmABFKDmEFHKDmPBFKDmUYtvNo
GRIDMAP=/dev/null
HOSTNAME=arc.example.org
TMPDIR=/tmp

7.2 Overview of ARC client tools

This document contains a general overview of the available ARC client tools. For more detailed information about possible tool usage options, please refer to the --help messages and the corresponding man pages.

4 HINT: you can add -d INFO to get more details of the CE selection process


7.2.1 Submission endpoint types

Table 7.1: The submission types in short, long and legacy style

short        long                       legacy
emies        org.nordugrid.emies        org.ogf.glue.emies.activitycreation
gridftpjob   org.nordugrid.gridftpjob   org.nordugrid.gridftpjob
arcrest      org.nordugrid.arcrest      N/A
internal     org.nordugrid.internal     org.nordugrid.internal

Example of use:

arcsub -T emies -C some-ce-endpoint

7.2.2 Credentials

arcproxy

The arcproxy command creates a proxy certificate from a key/certificate pair which can then be used to access resources. It is an all-in-one tool for all kinds of operations with proxy certificates, including VOMS extension generation and communication with MyProxy services.

[user ~]$ arcproxy -S myVO
Enter pass phrase for private key:
Your identity: /DC=org/DC=AccreditatedCA/O=people/CN=John Smith
Contacting VOMS server (named myVO): voms.example.org on port: 15004
Proxy generation succeeded
Your proxy is valid until: 2020-01-22 22:50:44

[user ~]$ arcproxy -I
Subject: /DC=org/DC=AccreditatedCA/O=people/CN=John Smith/CN=2123720174
Issuer: /DC=org/DC=AccreditatedCA/O=people/CN=John Smith
Identity: /DC=org/DC=AccreditatedCA/O=people/CN=John Smith
Time left for proxy: 11 hours 56 minutes 45 seconds
Proxy path: /tmp/x509up_u1000
Proxy type: X.509 Proxy Certificate Profile RFC compliant impersonation proxy - RFC inheritAll proxy
Proxy key length: 2048
Proxy signature: sha256
====== AC extension information for VO myVO ======
VO        : myVO
subject   : /DC=org/DC=AccreditatedCA/O=people/CN=John Smith
issuer    : /DC=org/DC=AccreditatedCA/O=hosts/CN=voms.example.org
uri       : voms.example.org:15004
attribute : /myVO/Role=NULL/Capability=NULL
Time left for AC: 11 hours 56 minutes 47 seconds

[user ~]$ arcproxy -L myproxy.example.org -M put -S myVO -U myproxyuser
Enter pass phrase for private key:
Your identity: /DC=org/DC=AccreditatedCA/O=people/CN=John Smith
Contacting VOMS server (named myVO): voms.example.org on port: 15004
Proxy generation succeeded
Your proxy is valid until: 2020-01-29 10:51:52
Enter pass phrase for MyProxy server (new):
Verifying - Enter pass phrase for MyProxy server (new):
Succeeded to put a proxy onto MyProxy server

arcrenew

The arcrenew command is used for renewing the proxy of jobs that have been submitted to computing resources.

[user ~]$ arcrenew https://arc.example.org:443/arex/7LqLDmKESEwnf5481mks8bjnABFKDmABFKDmhNFKDmHGFKDm9BpmKn
Jobs processed: 1, renewed: 1

7.2.3 Job submission and management

arcsub

The arcsub command is used for submitting jobs to computing resources.

Note: Submission endpoint selection options are changed in ARC 6.5.0 release

[user ~]$ arcsub -T emies -C arc.example.org myJob.xrsl
Job submitted with jobid: https://arc.example.org:443/arex/5SwNDmmwREwnf5481mks8bjnABFKDmABFKDmhNFKDmKFFKDmNPeLgn

arcstat

The arcstat command is used for obtaining the status of jobs that have been submitted to computing resources.

[user ~]$ arcstat https://arc.example.org:443/arex/5SwNDmmwREwnf5481mks8bjnABFKDmABFKDmhNFKDmKFFKDmNPeLgn
Job: https://arc.example.org:443/arex/5SwNDmmwREwnf5481mks8bjnABFKDmABFKDmhNFKDmKFFKDmNPeLgn
 Name: myJob
 State: Running

Status of 1 jobs was queried, 1 jobs returned information

arccat

The arccat command performs the cat command on the stdout, stderr or A-REX error log of the job. It can also show a specified file from the job's session directory.

[user ~]$ arccat https://arc.example.org:443/arex/5SwNDmmwREwnf5481mks8bjnABFKDmABFKDmhNFKDmKFFKDmNPeLgn
SHELL=/bin/bash
HISTSIZE=1000

[user ~]$ arccat -l https://arc.example.org:443/arex/5SwNDmmwREwnf5481mks8bjnABFKDmABFKDmhNFKDmKFFKDmNPeLgn
2020-01-22T09:57:45Z Job state change UNDEFINED -> ACCEPTED Reason: (Re)Accepting new job
2020-01-22T09:57:45Z Job state change ACCEPTED -> PREPARING Reason: Starting job processing
2020-01-22T09:57:45Z Job state change PREPARING -> SUBMIT Reason: Pre-staging finished, passing job to LRMS
----- starting submit_pbs_job -----
PBS jobname: myJob

arckill

The arckill command is used to kill running jobs.

[user ~]$ arckill https://arc.example.org:443/arex/wOOKDmT2REwnf5481mks8bjnABFKDmABFKDmhNFKDmWFFKDm1FSSRn
Jobs processed: 1, successfully killed: 1, successfully cleaned: 1

arcclean

The arcclean command removes a job from the computing resource.

[user ~]$ arckill --keep https://arc.example.org:443/arex/UcBLDm32REwnf5481mks8bjnABFKDmABFKDmhNFKDmiFFKDm0zk4un
Jobs processed: 1, successfully killed: 1

[user ~]$ arcclean https://arc.example.org:443/arex/UcBLDm32REwnf5481mks8bjnABFKDmABFKDmhNFKDmiFFKDm0zk4un
Jobs processed: 1, deleted: 1

arcresub

The arcresub command allows resubmitting a job to the same or other compute resources.

[user ~]$ arcresub --same https://arc.example.org:443/arex/5SwNDmmwREwnf5481mks8bjnABFKDmABFKDmhNFKDmKFFKDmNPeLgn
Job submitted with jobid: https://arc.example.org:443/arex/KSeNDmz5REwnf5481mks8bjnABFKDmABFKDmhNFKDmsFFKDmp4MPWn

arcresume

The arcresume command is used for resuming a job that was submitted to compute resources and then subse- quently failed. The job will be resumed at the last ok state reported by the cluster.

[user ~]$ arcresume https://arc.example.org:443/arex/vIGMDm58REwnf5481mks8bjnABFKDmABFKDmhNFKDm6FFKDmOBYWDm
Jobs processed: 1, resumed: 1

arcget

The arcget command is used for retrieving the results from a job.

[user ~]$ arcget --keep https://arc.example.org:443/arex/5SwNDmmwREwnf5481mks8bjnABFKDmABFKDmhNFKDmKFFKDmNPeLgn
Results stored at: 5SwNDmmwREwnf5481mks8bjnABFKDmABFKDmhNFKDmKFFKDmNPeLgn
Jobs processed: 1, successfully retrieved: 1

arctest

The arctest command is used for testing resources. It is able to:
• submit several predefined test jobs
• print info about installed user- and CA-certificates


Note: Submission endpoint selection options changed in the ARC 6.5.0 release.

[user ~]$ arctest -J2 -C arc.example.org
Job submitted with jobid: https://arc.example.org:443/arex/rRVLDmSBSEwnf5481mks8bjnABFKDmABFKDmhNFKDm8FFKDmGoZf3m

[user ~]$ arctest -E
Certificate information:
 Certificate: /home/john/.globus/usercert.pem
 Subject name: /DC=org/DC=AccreditatedCA/O=people/CN=John Smith
 Valid until: 2020-04-21 18:48:15

Proxy certificate information:
 Proxy: /tmp/x509up_u1000
 Proxy-subject: /DC=org/DC=AccreditatedCA/O=people/CN=John Smith/CN=2123720174
 Valid for: 11 hours 37 minutes 33 seconds

Certificate issuer: /DC=org/DC=AccreditatedCA

CA-certificates installed:
 /DC=org/DC=AccreditatedCA

7.2.4 Data manipulation

arcls

The arcls command is used for listing files in storage elements and replica catalogs.

[user ~]$ arcls srm://se.example.org/dpm/example.org/home/myVO/
data.1
data.2

[user ~]$ arcls -lL rucio://rucio-lb-prod.cern.ch/replicas/mc15_13TeV/EVNT.19785567._000729.pool.root.1
EVNT.19785567._000729.pool.root.1 file 167468788 (n/a) adler32:f7332aeb (n/a)
 root://xrootd.lcg.triumf.ca:1094//atlas/atlasdatadisk/rucio/mc15_13TeV/ee/3b/EVNT.19785567._000729.pool.root.1
 srm://srm.ndgf.org:8443/srm/managerv2?SFN=/atlas/disk/atlasdatadisk/rucio/mc15_13TeV/ee/3b/EVNT.19785567._000729.pool.root.1
 gsiftp://gridftp.echo.stfc.ac.uk:2811/atlas:datadisk/rucio/mc15_13TeV/ee/3b/EVNT.19785567._000729.pool.root.1

arccp

The arccp command copies files to, from and between storage services.

[user ~]$ arccp /mnt/data/data.123 srm://se.example.org/dpm/example.org/home/myVO/data.123

[user ~]$ arccp rucio://rucio-lb-prod.cern.ch/replicas/mc15_13TeV/EVNT.19785567._000729.pool.root.1 /tmp/

arcmkdir

The arcmkdir command creates directories on storage elements and catalogs.

[user ~]$ arcmkdir srm://se.example.org/dpm/example.org/home/myVO/myData

arcrename

The arcrename command renames files on storage elements.

[user ~]$ arcrename srm://se.example.org/dpm/example.org/home/myVO/data.123 srm://se.example.org/dpm/example.org/home/myVO/data.124

arcrm

The arcrm command deletes files on storage elements.

[user ~]$ arcrm srm://se.example.org/dpm/example.org/home/myVO/data.124

7.2.5 Information services

arcinfo

The arcinfo command is used for obtaining the status of computing resources and detailed information about a resource according to the data published in the information system. The summary output includes the information endpoint type starting from ARC 6.5.0.

[user ~]$ arcinfo arc.example.org
Computing service: (production)
 Information endpoint: ldap://arc.example.org:2135/Mds-Vo-Name=local,o=grid (org.nordugrid.ldapng)
 Information endpoint: ldap://arc.example.org:2135/o=glue (org.nordugrid.ldapglue2)
 Information endpoint: https://arc.example.org:443/arex (org.nordugrid.arcrest)
 Information endpoint: https://arc.example.org:443/arex (org.ogf.glue.emies.resourceinfo)
 Submission endpoint: https://arc.example.org:443/arex (status: ok, interface: org.nordugrid.arcrest)
 Submission endpoint: https://arc.example.org:443/arex (status: ok, interface: org.ogf.glue.emies.activitycreation)
 Submission endpoint: gsiftp://arc.example.org:2811/jobs (status: ok, interface: org.nordugrid.gridftpjob)

[user ~]$ arcinfo -l arc.example.org | grep Implementation
 Implementation name: nordugrid-arc-6.5.0

arcsync

The arcsync command synchronizes your local job database with the information at the given computing element(s).

[user ~]$ arcsync -C arc.example.org
Synchronizing the local list of active jobs with the information in the
information system can result in some inconsistencies. Very recently submitted
jobs might not yet be present, whereas jobs very recently scheduled for
deletion can still be present.
Are you sure you want to synchronize your local job list? [y/n] y
Total number of new jobs found: 7

7.3 How to submit the job?

Warning: Information in this document is fully applicable to ARC 6.5.0 and above only!

The heterogeneous nature of distributed computing requires a formal job description in order to match the resources and execution environment of a particular CE to the needs of the job. Based on this job description, the client tools then select a submission endpoint and pass the job to the chosen CE for processing.

7.3.1 Submission sequence

Make sure you have the client tools installed and your credentials configured first. Then follow the sequence below.

Generate proxy-certificate

To submit a job, or to perform any other action towards an ARC server, you need a so-called proxy certificate, which is a Single Sign-On token for the distributed grid infrastructure. It is generated using the arcproxy tool.

Note: In most production cases you need to add VO affiliation information to the proxy certificate with the -S option.

[user ~]$ arcproxy -S myVO
Your identity: /DC=org/DC=nordugrid/DC=ARC/O=TestCA/CN=My Cert
Contacting VOMS server (named myVO): voms.myvo.example.org on port: 15001
Proxy generation succeeded
Your proxy is valid until: 2020-02-29 06:31:54

Create job description

ARC6 supports the Extended Resource Specification Language (xRSL) and the EMI Activity Description Language (ADL) [1] for specifying job descriptions. Following the chosen syntax, you write an xRSL or ADL file describing your job files and resource requirements. For example, the xRSL job description for the arctest -J 2 test job is:

&( executable= "/usr/bin/env" )( jobname = "arctest2" )( stdout = "stdout" )(

˓→join = "yes" )( gmlog = "gmlog" )

The same description in ADL:

[1] See section 9.3 of the EMI-ES document for the complete ADL specification.

<ActivityDescription xmlns="http://www.eu-emi.eu/es/2010/12/adl"
    xmlns:emiestypes="http://www.eu-emi.eu/es/2010/12/types"
    xmlns:nordugrid-adl="http://www.nordugrid.org/es/2011/12/nordugrid-adl">
  <ActivityIdentification>
    <Name>arctest2</Name>
  </ActivityIdentification>
  <Application>
    <Executable>
      <Path>/usr/bin/env</Path>
    </Executable>
    <Output>stdout</Output>
    <Error>stdout</Error>
    <LoggingDirectory>gmlog</LoggingDirectory>
  </Application>
</ActivityDescription>

Run arcsub

The arcsub tool is responsible for job submission. This includes selecting (matchmaking) resources based on the job description and sending the job description and all job files to the selected submission endpoint. In the simplest case you can use the top-level NorduGrid ARCHERY registry to find a CE in the entire e-Infrastructure that fits your job and allows you to use its resources:

[user ~]$ arcsub --registry nordugrid.org myjobdescription.xrsl
Job submitted with jobid: https://arc.example.org:443/arex/R7pMDmXI8Dwnf5481mks8bjnABFKDmABFKDmhNFKDmVEFKDmQ0taDn

To specify the CEs you are aiming to submit jobs to more precisely, please read the next section for details.

7.3.2 Specify CEs for job submission

The arcsub command accepts a set of options that allow selecting the target CE for job submission. These CEs can be passed manually (using the -C, --computing-element=ce option) or fetched from registries (-Y, --registry=registry). The submission endpoint type (the actual job submission interface) and the information endpoint type (the interface used to query information for matchmaking and brokering) can be defined as well, if you want to use some interface explicitly. The following is a detailed description of how the options work together to select the submission endpoint, including examples of usage.

Warning: It is not possible to mix the old set of ARC5 options for CE selection with the ARC6 options described below.

Computing Element

The Computing Element option (-C, --computing-element=CE) is used to specify the computing element name or a complete endpoint URL directly. Supported options:
• CE name (given as the FQDN of the cluster frontend)
• URL of an info or jobsubmission endpoint on the CE


The URL is one of the valid info or jobsubmission URLs of the computing element. Using a URL is needed when the services are running on non-default ports or paths. Except for the case when info-endpoint-type is NONE, the URL is interpreted as an info-endpoint URL. In case of --info-endpoint-type=NONE, the URL represents the jobsubmission endpoint URL.
If only --computing-element is specified, without any preferred info-endpoint-type or submission-endpoint-type, then arcsub will fetch resource information from all the available info endpoints in parallel AND select from the discovered and supported (via available plugins) jobsubmission interfaces.
Examples:

[user ~]$ arcsub --computing-element my.cluster.org

will trigger a submission to a randomly selected jobsubmission-endpoint-type advertised via the local infosys and supported by installed submission plugins.

[user ~]$ arcsub --computing-element ldap://my.cluster.org:389/o=glue

will trigger a submission to an available jobsubmission endpoint discovered from the info interface running on the LDAP port 389.

[user ~]$ arcsub --submission-endpoint-type emies --computing-element https://my.cluster.org:60000/emies

will trigger a submission to the emies jobsubmission endpoint after the infosys discovery has completed against the specified emies info endpoint URL. The URL above is interpreted as an info URL.

[user ~]$ arcsub --submission-endpoint-type emies --info-endpoint-type NONE --computing-element https://my.cluster.org:60000/emies

will trigger a submission to the emies jobsubmission endpoint specified via the given URL. Info discovery is turned off. The URL is interpreted as a jobsubmission endpoint URL.

Registry

The -Y, --registry=registry option is used to specify service endpoint URL registries. It takes FQDNs of ARCHERY or EGIIS servers with an optional specification of the protocol. Supported options:
• ARCHERY: arcsub --registry nordugrid.org or arcsub --registry dns://nordugrid.org
• EGIIS: arcsub --registry index1.nordugrid.org or arcsub --registry ldap://index1.nordugrid.org:2135/mds-vo-name=ATLAS,o=grid
Support for the EMIR and BDII service catalogues is REMOVED and the corresponding plugins are not part of ARC6.
In case only a short hostname is given with --registry, without specifying the full protocol URL (ldap or dns), arcsub will attempt to contact both ldap://hostname and dns://hostname.
Examples using --registry together with --info-endpoint-type and --submission-endpoint-type:

[user ~]$ arcsub --submission-endpoint-type emies --registry dns://nordugrid.org

will fetch all information endpoints of emies info type (matching the emies submission type) from the registry, query these endpoints for available emies submission endpoints and choose one of them (via brokering, default is random) to submit the job.


[user ~]$ arcsub --info-endpoint-type ldap.nordugrid --registry dns://nordugrid.org

will fetch all information endpoints of the specified ldap.nordugrid type from the registry, query them for available submission endpoints and try to submit to one of them, prioritizing the matching gridftpjob submission endpoint type first.

[user ~]$ arcsub --info-endpoint-type ldap.glue2 --submission-endpoint-type emies --registry dns://nordugrid.org

will fetch all information endpoints of the specified ldap.glue2 type from the registry, query them for available submission endpoints of the specified emies type and choose one of them (via brokering, default is random) to submit the job.

[user ~]$ arcsub --info-endpoint-type NONE --submission-endpoint-type emies --registry dns://nordugrid.org

will fetch all SUBMISSION endpoints of the specified emies type from the registry and select one of them to directly submit the job, without querying info endpoints. There is no other information to do matchmaking or prioritization in this case, so the random broker will be used to select one of the available submission endpoints.

Info endpoint type

Use the -Q, --info-endpoint-type=type option for choosing the cluster/resource information retrieval interface type. Supported options:

• org.nordugrid.internal
• org.nordugrid.ldap.nordugrid
• org.nordugrid.ldap.glue2
• org.nordugrid.emies
• org.nordugrid.arcrest
• NONE

The special value NONE will disable the local infosys queries during job submission. Support for BES, WSRFGLUE2 and GLUE1 is REMOVED and the corresponding plugins are deleted from ARC6.

Note: org.nordugrid prefix can be omitted, e.g. both org.nordugrid.emies and emies should be accepted.

In case NO submission-endpoint-type is specified and ONLY the info-endpoint-type is given, the arcsub client will first try to use a predefined submission-endpoint-type following the rules given below.

Table 7.2: The submission types to try corresponding to a given info-endpoint-type

info-endpoint-type    submission-endpoint-type
emies                 emies
arcrest               arcrest
internal              internal
ldap.nordugrid        gridftpjob
ldap.glue2            first emies then gridftpjob
NONE                  first emies then gridftpjob

The arcsub command will guess the job submission endpoint URLs based on the default values corresponding to a specific submission-endpoint-type. Examples:


[user ~]$ arcsub --info-endpoint-type emies --computing-element my.cluster.org

will trigger a jobsubmission where cluster info is obtained from the emies infosource and jobsubmission is done via emies using a guessed submission URL of https://my.cluster.org:443/arex.

[user ~]$ arcsub --info-endpoint-type NONE --computing-element my.cluster.org

will trigger a jobsubmission where cluster info is NOT collected, infosys is completely ignored and the client tries to submit via the emies submission-endpoint-type, again guessing the jobsubmission endpoint URL (https://my.cluster.org:443/arex). In case the emies submission fails, the gridftpjob endpoint will be tried.

[user ~]$ arcsub --info-endpoint-type org.nordugrid.ldap.glue2 --computing-element my.glue2.cluster

will trigger a jobsubmission where cluster info containing jobsubmission URLs is obtained from the GLUE2 LDAP rendering infosource (guessed as ldap://my.glue2.cluster:2135/o=glue) and jobsubmission is first tried via the emies job interface, then via gridftpjob (in case emies failed), using URLs obtained from the infosys query.

[user ~]$ arcsub --info-endpoint-type org.nordugrid.ldap.glue2 --submission-endpoint-type gridftpjob --computing-element my.glue2.cluster

will trigger a jobsubmission where cluster info containing the jobsubmission URLs is obtained from the GLUE2 LDAP rendering infosource and jobsubmission is performed via the gridftpjob channel.

Submission endpoint type

Use -T, --submission-endpoint-type=type for choosing the jobsubmission endpoint type. Supported options:

• org.nordugrid.internal
• org.nordugrid.emies
• org.nordugrid.gridftpjob
• org.nordugrid.arcrest

Note: org.nordugrid prefix can be omitted, e.g. both org.nordugrid.emies and emies should be accepted.

Support for the BES, CREAM, ARC1, unicore and ARC0 (now called gridftpjob) submission interface types is REMOVED and the corresponding plugins are deleted from ARC6.
The ARC client behavior in case ONLY the submission-endpoint-type is specified and NO info-endpoint-type is given: the client USES ONLY the following matching info interfaces:

Table 7.3: The info-endpoint-type corresponding to a given submission-endpoint-type

submission-endpoint-type    info-endpoint-type
emies                       emies
arcrest                     arcrest
internal                    internal
gridftpjob                  both ldap.glue2 and ldap.nordugrid

Examples:

[user ~]$ arcsub --submission-endpoint-type emies --computing-element my.cluster.org

will trigger a jobsubmission where cluster info is obtained from the emies infosource (via the guessed info endpoint https://my.cluster.org:443/arex) and jobsubmission is done via the emies jobsubmission interface.

[user ~]$ arcsub --submission-endpoint-type emies --info-endpoint-type org.nordugrid.glue2 --computing-element my.emies.cluster.org

will trigger a jobsubmission process where cluster info is obtained from the glue2 LDAP info endpoint at the guessed URL (ldap://my.cluster.org:2135/o=glue) and the job is submitted via emies.

[user ~]$ arcsub --info-endpoint-type NONE --submission-endpoint-type gridftpjob --computing-element my.cluster.org

will trigger a direct jobsubmission via the gridftpjob submission interface without any query to the local LDAP or any other infosys. The jobsubmission URL (gsiftp://my.cluster.org:2811/jobs) is guessed using defaults.

7.4 How to work with data?

The ARC libraries are very flexible in terms of supported data transfer protocols. They are designed to be extendable via a set of pluggable Data Management Components (DMCs). The available protocols depend on the DMCs installed on the system (look for Additional plugins in particular).

Note: Notice the special plugin that integrates GFAL2 plugins for additional data transfer protocols on top of the native ARC DMCs. For example, to add support for the legacy LFC file catalogue protocol to ARC you have to install nordugrid-arc-plugins-gfal and gfal2-plugin-lfc.
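As an illustration, installing these packages on a RHEL-family system could look like the following (package manager and repository setup assumed; adjust to your distribution):

[root ~]# yum install nordugrid-arc-plugins-gfal gfal2-plugin-lfc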

This applies both to client tools and ARC Data Staging.

7.4.1 Data transfer URLs

File locations in ARC can be specified both as local file names and as Internet standard Uniform Resource Locators (URLs). There are also some additional URL options that can be used. Depending on the installed ARC components, some or all of the following transfer protocols and metadata services are supported:

Table 7.4: List of main supported protocols

Protocol  Description
ftp       ordinary File Transfer Protocol (FTP)
gsiftp    GridFTP, the Globus-enhanced FTP protocol with security, encryption, etc., developed by The Globus Alliance
http      ordinary Hyper-Text Transfer Protocol (HTTP) with PUT and GET methods using multiple streams
https     HTTP with SSL
httpg     HTTP with Globus GSI
dav       WebDAV
davs      WebDAV with SSL
ldap      ordinary Lightweight Data Access Protocol (LDAP)
srm       Storage Resource Manager (SRM) service
root      Xrootd protocol
rucio     Rucio – a data management system used by ATLAS and other scientific experiments
acix      ARC Cache Index
s3        Amazon S3
file      local to the host file name with a full path


A URL can be used in a standard form, i.e.

protocol://[host[:port]]/file

Or, to enhance the performance or take advantage of various features, it can have additional options:

protocol://[host[:port]][;option[;option[...]]]/file[:metadataoption[:metadataoption[...]]]
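For example, a GridFTP source that uses a couple of the URL options described below (hypothetical host and file) could be written as:

gsiftp://se.example.org:2811;threads=4;secure=yes/data/input.dat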

For a metadata service URL, construction is the following:

protocol://[url[|url[...]]@]host[:port][;option[;option[...]]]/lfn[:metadataoption[:metadataoption[...]]]

where the nested URL(s) are physical replicas. Options are passed on to all replicas, but if it is desired to use the same option with a different value for all replicas, the option can be specified as a common option using the following syntax:

protocol://[;commonoption[;commonoption]|][url[|url[...]]@]host[:port][;option[;option[...]]]/lfn[:metadataoption[:metadataoption[...]]]

In user-level tools, URLs may be expressed using this syntax, or there may be simpler ways to construct complex URLs. In particular, command line tools such as arccp and the xRSL language provide methods to express URLs and options in a simpler way. For the SRM service, the syntax is:

srm://host[:port][;options]/[service_path?SFN=]file[:metadataoptions]
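As an illustration (hypothetical host and file), an SRM URL with an explicit service path and checksum metadata options could look like this:

srm://srm.example.org:8443/srm/managerv2?SFN=/data/file1:checksumtype=adler32:checksumvalue=12345678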

Versions 1.1 and 2.2 of the SRM protocol are supported. The default service_path is srm/managerv2 when the server supports v2.2, srm/managerv1 otherwise. For Rucio the following URL is used to look up replicas of the given scope and name:

rucio://rucio-lb-prod.cern.ch/replicas/scope/name

The Rucio authorisation URL can be specified with the environment variable $RUCIO_AUTH_URL. The Rucio account to use can be specified either through the rucioaccount URL option or the $RUCIO_ACCOUNT environment variable. If neither is specified, the account is taken from the VOMS nickname attribute. For ACIX the URLs look like:

acix://cacheindex.ndgf.org:6443/data/index?url=http://host.org/file1

S3 authentication is done through keys which must be set by the environment variables $S3_ACCESS_KEY and $S3_SECRET_KEY. The URL components are:

host[:port]     Hostname or IP address [and port] of a server
lfn             Logical File Name
url             URL of the file as registered in an indexing service
service_path    End-point path of the web service
file            File name with full path
option          URL option
commonoption    URL option for all replicas
metadataoption  Metadata option

The following URL options are supported:


threads=  specifies the number of parallel streams to be used by GridFTP or HTTP(s,g); default value is 1, maximal value is 10
exec=yes|no  means the file should be treated as executable
preserve=yes|no  specifies if the file must be uploaded to this destination even if job processing failed (default is no)
cache=yes|no|renew|copy|check|invariant  indicates whether the file should be cached; the default for input files in A-REX is yes. renew forces a download of the file, even if the cached copy is still valid. copy forces the cached file to be copied (rather than linked) to the session directory; this is useful if for example the file is to be modified. check forces a check of the permission and modification time against the original source. invariant disables checking the original source modification time.
readonly=yes|no  for transfers to file:// destinations, specifies whether the file should be read-only (unmodifiable) or not; default is yes
secure=yes|no  indicates whether the GridFTP data channel should be encrypted; default is no
blocksize=  specifies the size of chunks/blocks/buffers used in GridFTP or HTTP(s,g) transactions; default is protocol dependent
checksum=cksum|md5|adler32|no  specifies the algorithm for the checksum to be computed (for transfer verification or provided to the indexing server). This is overridden by any metadata options specified (see below). If this option is not provided, the default for the protocol is used. checksum=no disables checksum calculation.
overwrite=yes|no  makes the software try (or not) to overwrite existing file(s); if yes, the tool will try to remove any information/content associated with the specified URL before writing to the destination.
protocol=gsi|gssapi|ssl|tls|ssl3  distinguishes between different kinds of HTTPS/HTTPG and SRM protocols. Here gssapi stands for the HTTPG implementation using only GSSAPI functions to wrap data and gsi uses additional headers as implemented in Globus IO. The ssl and tls options stand for the usual HTTPS and are specifically usable only if used with the SRM protocol. The ssl3 option is mostly the same as the ssl one but uses SSLv3 handshakes while establishing HTTPS connections. The default is gssapi for SRM connections, tls for HTTPS and gssapi for HTTPG. In the case of SRM, if the default fails, gsi is tried.
spacetoken=  specifies a space token to be used for uploads to SRM storage elements supporting SRM version 2.2 or higher
autodir=yes|no  specifies whether, before writing to the specified location, the software should try to create all directories mentioned in the specified URL. Currently this applies to FTP and GridFTP only. The default value for these protocols is yes
tcpnodelay=yes|no  controls the use of the TCP_NODELAY socket option (which disables Nagle's algorithm). Applies to HTTP(S) only. Default is no (supported only in arcls and other arc* tools)
transferprotocol=protocols  specifies transfer protocols for meta-URLs such as SRM. Multiple protocols can be specified as a comma-separated list in order of preference.
rucioaccount=account  specifies the Rucio account to use when authenticating with Rucio.
httpputpartial=yes|no  while storing a file on an HTTP(S) server, the software will try to send it in chunks/parts. If the server reports an error for the partial PUT command, the software will fall back to transferring the file in a single piece. This behavior is non-standard and not all servers report errors properly. Hence the default is a safer no.
httpgetpartial=yes|no  while retrieving a file from an HTTP(S) server, the software will try to read it in chunks/parts. If the server does not support the partial GET command, it usually ignores requests for a partial transfer range and the file is transferred in one piece. Default is yes.
failureallowed=yes|no  if set to yes for a job input or output file, then a failure to transfer this file will not cause a failure of the job. Default is no.
relativeuri=yes|no  if set to yes, HTTP operations will use the path instead of the full URL. Default is no.
accesslatency=disk|tape  filter replicas returned from an index service based on their access latency (available from version 6.12)

Local files are referred to by specifying either a location relative to the job submission working directory, or by an absolute path (the one that starts with /), preceded with a file:// prefix. URLs also support metadata options which can be used for registering additional metadata attributes or querying the service using metadata attributes. These options are specified at the end of the LFN and consist of name and value pairs separated by colons. The following attributes are supported:

checksumtype   Type of checksum. Supported values are cksum (default), md5 and adler32
checksumvalue  The checksum of the file

The checksum attributes may also be used to validate files that were uploaded to remote storage. Examples of URLs are:
• http://grid.domain.org/dir/script.sh
• gsiftp://grid.domain.org:2811;threads=10;secure=yes/dir/input_12378.dat
• ldap://grid.domain.org:389/lc=collection1,rc=Nordugrid,dc=nordugrid,dc=org
• file:///home/auser/griddir/steer.cra
• srm://srm.domain.org/griddir/user/file1:checksumtype=adler32:checksumvalue=12345678 [1]
• srm://srm.domain.org;transferprotocol=https/data/file2 [2]
• rucio://rucio-lb-prod.cern.ch/replicas/user.grid/data.root

7.4.2 Stage-in during submission

During job submission to a computing element, data can be moved from the client machine along with the job description. The inputFiles directive of the job description automatically activates this kind of data movement if the source of the data is a local path or empty (""). An empty source value means that the input file is taken from the current working directory on the submission machine. For example:

(inputFiles=("data1""/mnt/data/analyses/data11.22.33") ("data2"""))

During the job submission process the /mnt/data/analyses/data11.22.33 file will be uploaded to the job session directory on the CE as data1. The data2 file from the current working directory will be uploaded as data2.

7.4.3 Stage-in on ARC CE

Instead of copying data from the submission machine, the ARC CE can download it from available storage services specified by a URL in the inputFiles source value. In this case all the power of A-REX Data Caching can be used as well. For example:

(inputfiles="data.root.1""srm://srm.mystorage.example.org/atlas/disk/data11_

˓→7TeV/data.root.1")

[1] This is a destination URL. The file will be copied to srm.domain.org at the path griddir/user/file1 and the checksum will be compared to what is reported by the SRM service after the transfer.
[2] This is a source or destination URL. When getting a transport URL from SRM, the HTTPS transfer protocol will be requested.


During job submission the data WILL NOT be uploaded. Instead, A-REX will analyze the job description and start a Stage-In process. During Stage-In the data.root.1 file will be downloaded from the provided SRM URL.

7.4.4 Manual data movement with arc* tools

There is a set of data management tools that can be used to manipulate data manually, outside the job processing context. arcls, arccp, arcrm and arcmkdir work similarly to the classic Unix file management commands but accept local or remote URLs. Again, any URL supported by the installed ARC data management plugins can be passed to the tools as an argument.

[user ~]$ arccp srm://srm.mystorage.example.org/atlas/disk/data11_7TeV/data.root.1 /mnt/data/data.root.1

[user ~]$ arcls http://download.nordugrid.org/repos/6
centos/
debian/
fedora/
ubuntu/

7.5 Job Description Language (xRSL)

In distributed computing, jobs are submitted to a large number of very different computing resources, which are often widely distributed geographically. In order to abstract from the heterogeneous nature of these resources, a high-level job description language is necessary. This document introduces the xRSL job description language.

Note: The EMI Activity Description Language (ADL) is also supported by ARC (refer to section 9.3 of EMI-ES document for complete ADL specification)

The diversity of resources implies special requirements for a proper description of a job, introducing many new options as opposed to a conventional high-performance computing center use case. The NorduGrid project introduced the extended Resource Specification Language (xRSL). xRSL adopts the general syntax of the RSL language developed by the Globus Alliance, but extends and redesigns some attributes and introduces two levels of job option specification:
• User-side xRSL, i.e., the set of attributes specified by a user in a job-specific file. This file is interpreted by a client and, after the necessary modifications, is passed to the ARC computing service, A-REX.
• Server-side xRSL, i.e., the set of attributes pre-processed by a client, and ready to be interpreted by the A-REX. Effectively, this is an internal job representation of ARC.
A user only has to know the user-side part, and utilize it to describe the grid jobs. A-REX, however, uses slightly different notations, supplied by the client tools; therefore developers of such tools must take care of converting user-submitted xRSL to the internal job description (server-side xRSL). In what follows, a description of the xRSL is given, using the following notations:

<xxxx>       parameter to be substituted with a corresponding string or a number
[xxxx]       optional parameter
xxx|yyy|zzz  list of possible values of a parameter
-"-          same as above


7.5.1 xRSL syntax and rules

A job is described by means of xRSL attributes, which can either be passed via the command line or, more conveniently, be collected in a so-called xRSL file (*.xrsl). Such a file contains a plain list of attribute-value pairs and boolean operators & (for AND) and | (for OR). Attribute names are case-insensitive. If the attributes are specified from the command line, the entire description must be enclosed either in single or in double quotes. Single-quote enclosure is completely analogous to xRSL file usage, while double quotes imply standard shell expansion of the enclosed string. This has implications when strings contain local shell variables: they will not be expanded unless the task description is entered from the command line and is enclosed in double quotes.
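A minimal, purely illustrative xRSL file (hypothetical file names; it uses only attributes described later in this section) could look like:

&(executable="myscript.sh")
 (inputFiles=("myscript.sh" ""))
 (stdout="out.txt")
 (stderr="err.txt")
 (jobName="example_job")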

Attribute value

An attribute-value pair is a key element of specification. It consists of an expression that assigns one or more values to an attribute, and is enclosed in round brackets:

(attribute="value")

For multi-valued attributes:

(attribute="value1""value2")

List of values

Certain attributes do not have assigned value; instead, they contain a list of values that should be put in proper correspondence with each other:

(attribute=("value1""value2")("value3""value4"))

In the example above, value1 and value3 are put in correspondence to value2 and value4 respectively, according to the context of the attribute.

Quoting

Values should be enclosed in quotes if they contain blank spaces or special characters. The special characters are:

+&|()=<>!"'^#$

To quote a string containing special characters, you can use either single or double quotes. If your string, however, contains both such quotes, you can define any character as your own delimiter by preceding it with the caret (^) character: attribute=^*My "good" value^* makes use of a caret-escaped asterisk as a delimiter.

Implicit conjunction

An xRSL job description starts with an ampersand (&), to indicate implicit conjunction of all the attributes:

&(attribute1=value1)(attribute2="value 2")...

Whenever a disjunct-request of two or more attributes is needed, the following construction can be used:

(|(attribute="value1")(attribute="value2")...)


Note: Only a few selected attributes (indicated further in the document) can be requested by the user multiple times, like in the disjunct request example above. Most attributes must be unique, i.e., appear only once in the job description document.

Operators

In expressions, the following relational operators are allowed, in general:

= !=><>=<=

However, most attributes can only be used with the equality operator =. For a few attributes (as indicated in the document), some other operators can be used in the client-side job description as well.

Blank spaces

The xRSL attributes can be written in a single string, or split across lines arbitrarily; blank spaces between and inside (attribute="value") relations are ignored.

Comments

Commented lines should start with (* and be closed with *):

(*attribute="value1"*)

Comments can not be nested.

Multiple jobs in one description

Multiple job descriptions in one file can be specified via the multi-request operator +, which should precede the job descriptions:

+(&(...))(&(...))(&(...))

7.5.2 User-side xRSL attributes

The following attributes can be specified in a user’s xRSL script. Some have to be modified by the client tool before being passed to the A-REX. Attribute names are case-insensitive, although assigned values may well be case-sensitive, if they represent file names, environment variables etc.

Note: It is possible to use unsupported attributes in the job description. The standard ARC client submission commands (arcsub and arcresub) must be used with the command line option -U in order to accept unknown attributes. Without this command line option, the client tool will consider the job description invalid if it contains unsupported attributes.


executable

Unique: yes
Operators: =
User xRSL: (executable=<string>)
A-REX xRSL: -"-
Example: (executable="local_to_job.exe")

The executable to be submitted as a main task to a Local Resource Management System (LRMS).
• string - file name (including path), local to the computing element (CE)
The executable is a file that has to be executed as the main process of the task. It could be either a pre-compiled binary, or a script. Users may transfer their own executables, or use the ones known to be already installed on the remote system (CE).
If an executable has to be transferred to the destination site (CE) from some source, it has to be specified in the inputFiles list. If it is not specified in inputFiles, the source is expected to be local to the user (client) and will be added as such to the inputFiles list by the ARC Client.
If the file name starts with a leading slash (/), it is considered to be the full path to the executable at the destination site (CE); otherwise the location of the file is relative to the session directory (where job input and files are stored).

If the xRSL string is entered from the command line and is enclosed in double quotes, standard shell expansion of variables takes place. That is, if the file name contains an environment variable ($...), the value of this variable is resolved locally, but if the name itself is also enclosed in double quotes, it will be resolved at the remote computing element:
(executable=$ROOT_DIR/myprog.exe) – $ROOT_DIR is resolved locally (will cause errors if the path does not exist at the execution machine)
(executable="$ROOT_DIR/myprog.exe") – $ROOT_DIR will be resolved remotely

arguments

Unique: yes Operators: = User xRSL: (arguments= [string] ... ) A-REX xRSL: (arguments= [string] ... ) Example: (arguments="10000" $(ATLAS)/input.dat)

List of the arguments for the executable.
• string - an argument
• executable - the executable to be run by LRMS, taken by the ARC Client from the user-specified executable attribute


inputFiles

Unique: yes
Operators: =
User xRSL: (inputFiles=(<filename> <source> [option] ... ) ... )
A-REX xRSL: (inputFiles=(<filename> <URL>) (<filename> [size][.checksum]) ... )
Example: (inputFiles=("local_to_job" "gsiftp://se1.lu.se/p1/remote.1" "threads=5") ("local_to_job.dat" "/scratch/local_to_me.dat") ("same_name_as_in_my_current_dir" ""))

List of files to be copied to the computing element before job execution.
• filename - destination file name, local to the computing element and always relative to the session directory
• source - source of the file (remote URL, or a path local to the submission node). If void ("", use the quotes!), the input file is taken from the submission directory.
• option - URL options for source. The ARC Client converts source and any options given here to a URL with the syntax described here.
• URL - URL of the file
• size - file size in bytes
• checksum - file checksum (as returned by cksum)
If the inputFiles list does not contain the standard input file (as specified by stdin) and/or the executable file (as specified by executable), an ARC client must append these files to the list. If the source is a URL, any options given by option are added to it, then it is passed by the ARC Client to the A-REX as shown in the example above. A-REX recognizes all URLs except file:///.
Internally, the client must forward the (filename source [option] ...) request to the execution service without changes, unless source is a local path, void ("") or file:///. In case source is a local path, void ("") or file:///, the client must extract the file size and checksum, and substitute the source string with [size][.checksum]. In the unlikely case when it is impossible to extract the file size, the string must be substituted by a void one ("").

Note: Please note that the inputFiles attribute is not meant to operate with directories, for reasons of access control and checksum verifications. You must specify a (filename source) pair for each file.


executables

Unique: yes
Operators: =
User xRSL: (executables=<string> [string] ...)
A-REX xRSL: -"-
Example: (executables="myscript.sh" "myjob.exe")

List of files from the inputFiles set, which will be given executable permissions.
• string - file name, local to the computing element and relative to the session directory
If the executable file (as specified in executable and if relative to the session directory) is not in the executables list, it will be added to the list by the ARC Client.

cache

Unique: yes Operators: = User xRSL: (cache="yes"|"no") A-REX xRSL: -"- Example: (cache="yes")

Specifies whether input files specified in the inputFiles should be placed by default in the cache or not. This affects all input files, even those described by executables. If not specified, default value is yes.

Note: Cached files can not be modified by jobs by default. If your job has to modify input files, please use the (readonly=no) URL option for those files. This option does not affect whether or not the file is cached.
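For example, an input file that the job needs to modify in place could be declared with the readonly=no URL option like this (hypothetical storage URL):

(inputFiles=("mydata" "gsiftp://se.example.org:2811/myVO/mydata" "readonly=no"))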

outputFiles

Unique: yes
Operators: =
User xRSL: (outputFiles=(<string> <URL> [option] ... ) ... )
A-REX xRSL: (outputFiles=(<string> <URL>) ... )
Example: (outputFiles=("local_to_job.dat" "gsiftp://se1.uio.no/stored.dat") ("local_to_job_dir/" ""))

List of files to be retrieved by the user or uploaded by the A-REX and optionally indexed (registered) in a data indexing service.
• string - file name, local to the Computing Element (CE). If this string ends with a slash (/) and the URL is empty, the entire directory will be kept at the execution site. If however this string ends with a slash (/) but the URL is a remote location, the contents of the directory are transferred to the destination.
• URL - destination URL of the remote file; if void ("", use the quotes!), the file is kept for manual retrieval. Note that this can not be a local file:// URL.


• option - URL options for the destination URL. See URL Options for possible values. When the destination is an indexing service, a physical file location may be specified by the additional option location. This option can be given multiple times. The CE will attempt to upload the file to the specified locations in the order they are given until one succeeds. Options specified after a location option only affect that location.
Before passing to the A-REX, the ARC Client adds to the URL any options and locations given here, using the syntax described in URL Options. If the list does not contain the standard output, standard error file names and the A-REX log-files directory name (as specified by stdout, stderr and gmlog), the ARC Client appends these items to the outputFiles list.
If the URL is not specified (void, "", use the quotes!), files will be kept on the CE and should be downloaded by the user via the ARC Client. If the specified file name ends with /, the entire directory is kept. A convenient way to keep the entire job directory at the remote site for manual retrieval is to specify (outputfiles=("/" "")).
In some cases, the list of output files may only be known after the job has completed. ARC allows a user to specify a list of output files dynamically in a file or files in the session directory as part of their job. The file(s) containing the output file information can be defined in the xRSL script as the path to the file relative to the session directory preceded by '@'. The format of these files is lines of 2 values separated by a space. The first value contains the name of the output file relative to the session directory and the second value is a URL to which the file will be uploaded.

Example: (outputFiles=("@output.files" "") output.files is generated by the user and contains file1 gsiftp://grid.domain.org/file1 file2 gsiftp://grid.domain.org/file2

After the job completes, the file output.files in the session directory will be read and any files described within will be uploaded to the given URLs.

cpuTime

Unique: yes Operators: = User xRSL: (cpuTime=

Maximal CPU time request for the job. For a multi-processor job, this is a sum over all requested processors. • time - time (in minutes if no unit is specified) • tttt - time converted by the ARC Client from time to seconds.

The client converts time specified in the user-side xRSL file to seconds. If no time unit is specified, the client assumes the time is given in minutes. Otherwise, a text format is accepted, i.e., any of the following will be interpreted properly (make sure to enclose such strings in quotes!):
1 week
3 days
2 days, 12 hours
1 hour, 30 minutes
36 hours
9 days
240 minutes
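For instance, a four-hour CPU time request (illustrative value only) could be written either way:

(cpuTime="240")
(cpuTime="4 hours")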

If both cpuTime and wallTime are specified, the ARC Client converts them both. cpuTime can not be specified together with gridTime or benchmarks.

Note: This attribute should be used to direct the job to a system with sufficient CPU resources, typically, a batch queue with the sufficient upper time limit. Jobs exceeding this maximum most likely will be terminated by remote systems! If time limits are not specified, the limit is not set and jobs can run as long as the system settings allow (note that in this case you can not avoid queues with too short time limits).

wallTime

Unique: yes Operators: = User xRSL: (wallTime=

Maximal wall clock time request for the job. • time - time (in minutes if no unit is specified) • tttt - time converted by the ARC Client to seconds

The client converts time specified in the user-side xRSL file to seconds. If no time unit is specified, the client assumes the time is given in minutes. Otherwise, a text format is accepted, i.e., any of the following will be interpreted properly (make sure to enclose such strings in quotes!):
1 week
3 days
2 days, 12 hours
1 hour, 30 minutes
36 hours
9 days
240 minutes

If both cpuTime and wallTime are specified, the ARC Client converts them both. wallTime can not be specified together with gridTime or benchmarks. If only wallTime is specified, but not cpuTime, the corresponding cpuTime value is evaluated by the ARC Client and added to the job description.

Note: This attribute should be used to direct the job to a system with sufficient CPU resources, typically, a batch queue with the sufficient upper time limit. Jobs exceeding this maximum most likely will be terminated by remote systems! If time limits are not specified, the limit is not set and jobs can run as long as the system settings allow (note that in this case you can not avoid queues with too short time limits).


gridTime

Unique: yes Operators: = User xRSL: (gridTime=

Maximal CPU time request for the job scaled to the 2.8 GHz Intel Pentium 4 processor. • time - time (in minutes if no unit is specified) The attribute is completely analogous to cpuTime, except that it will be recalculated to the actual CPU time request for each queue, depending on the published processor clock speed. gridTime can not be specified together with cpuTime or wallTime. If only gridTime is specified, but not cpuTime, the corresponding cpuTime value is evaluated by the ARC Client and added to the job description.

benchmarks

Unique: yes Operators: = User xRSL: (benchmarks=(

Evaluate a job's cpuTime based on benchmark values.
• string - benchmark name
• value - benchmark value of reference machine
• time - the cpuTime the job requires on the reference machine
benchmarks can not be specified together with cpuTime or wallTime. If only benchmarks is specified, but not cpuTime, the corresponding cpuTime value is evaluated by the ARC Client and added to the job description.

memory

Unique: yes Operators: = User xRSL: (memory=) A-REX xRSL: -"- Example: (memory>="500")

Memory required for the job, per count for parallel jobs. • integer - size (Mbytes)

Note: Similarly to cpuTime, this attribute should be used to direct a job to a resource with a sufficient capacity. Jobs exceeding this memory limit will most likely be terminated by the remote system.

disk

Unique: no Operators: = != > < >= <= User xRSL: (disk=) A-REX xRSL: none Example: (disk="500")

Disk space required for the job. • integer - disk space, Mbytes

Note: This attribute is used at the job submission time to find a system with sufficient disk space. However, it does not guarantee that this space will be available at the end of the job, as most known systems do not allow for disk space allocation. Eventually, a remote system can terminate a job that exceeds the requested disk space.

runTimeEnvironment

Unique: no Operators: = != > < >= <= User (runTimeEnvironment= [argument] ... xRSL: )(runTimeEnvironment= [argument] ...) A-REX only = is allowed xRSL: Example: (runTimeEnvironment>="APPS/HEP/ATLAS-10.0.1" "find" "HIGGS")

Required runtime environment.
• string - environment name
The site to submit the job to will be chosen by the ARC Client among those advertising the specified runtime environments. Before starting the job, the A-REX will set up environment variables and paths according to those requested. Runtime environment names are defined by Virtual Organizations, and tend to be organized in name spaces.

To request several environments, repeat the attribute string: (runTimeEnvironment="ENV1")(runTimeEnvironment="ENV2") etc.

To make a disjunct-request, use a boolean expression: (|(runTimeEnvironment="env1")(runTimeEnvironment="env2")).

You can use >= or <= operators: job will be submitted to any suitable site that satisfies such requirements, and among the available at the sites runtime environments, the highest version satisfying a requirement will be requested in the pre-processed xRSL script. Runtime environment string interpretation is case-insensitive. If a runtime environment string consists of a name and a version number, a partial specification is possible: it is sufficient to request only the name and use > or >= operators to select the highest version. The optional arguments can be supplied as additional strings after runtime environment string. On the server side they are passed to software responsible for preparing corresponding infrastructure.


middleware

Unique: no Operators: = != > < >= <= User xRSL: (middleware=) A-REX xRSL: only = is allowed Example: (middleware="nordugrid-arc-0.5.99")

Required middleware version. Make sure to specify full name and version number.

string Grid middleware name.

The site to submit the job to will be chosen by the ARC Client among those advertising specified middleware. Usage is identical to that of the runTimeEnvironment. Use the >= operator to request a version “equal or higher”.

opsys

Unique: no Operators: = != > < >= <= User xRSL: (opsys=) A-REX xRSL: only = is allowed Example: (opsys="FC3")

Required operating system.

string Operating system name and version.

The site to submit the job to will be chosen by the ARC Client among those advertising the specified operating system. Usage is identical to that of runTimeEnvironment and middleware. Use the >= operator to request a version "equal or higher".

stdin

Unique: yes Operators: = User xRSL: (stdin=) A-REX xRSL: -"- Example: (stdin="myinput.dat")

The standard input file.

string file name, local to the computing element.

The standard input file should be listed in the inputFiles attribute; otherwise it will be forced to that list by the ARC Client.


stdout

Unique: yes Operators: = User xRSL: (stdout=) A-REX xRSL: -"- Example: (stdout="myoutput.txt")

The standard output file.

string file name, local to the computing element and relative to the session directory.

The standard output file should be listed in the outputFiles attribute; otherwise it will be forced to that list by the ARC Client. If the standard output is not defined, ARC Client assigns a name.

stderr

Unique: yes Operators: = User xRSL: (stderr=) A-REX xRSL: -"- Example: (stderr="myjob.err")

The standard error file.

string file name, local to the computing element and relative to the session directory.

The standard error file should be listed as an outputFiles attribute; otherwise it will be forced to that list by the ARC Client. If the standard error is not defined, the ARC Client assigns a name. If join is specified with value yes, the ARC Client adds stderr to the pre-processed xRSL script with the same value as stdout.

join

Unique: yes Operators: = User xRSL: (join="yes"|"no") A-REX xRSL: none Example: (join="yes")

If yes, joins the stderr and stdout files into the stdout one. Default is no.

gmlog

Unique: yes Operators: = User xRSL: (gmlog=) A-REX xRSL: -"- Example: (gmlog="myjob.log")

A name of the directory containing grid-specific diagnostics per job.


string a directory, local to the computing element and relative to the session directory

This directory is kept in the session directory to be available for retrieval (the ARC Client forces it into the list of outputFiles).

jobName

Unique: yes Operators: = User xRSL: (jobName=) A-REX xRSL: -"- Example: (jobName="My Job nr. 1")

User-specified job name.

string job name

This name is meant for convenience of the user. It can be used to select the job while using the ARC Client. It is also available through the Information System.

ftpThreads

Unique: yes Operators: = User xRSL: (ftpThreads=) A-REX xRSL: -"- Example: (ftpThreads="4")

Defines how many parallel streams will be used by the A-REX during gsiftp and http(s|g) transfers of files.

integer a number from 1 to 10

If not specified, parallelism is not used.

acl

Unique: no Operators: = User xRSL: (acl=) A-REX xRSL: -"- Example: (acl=" ")


Makes use of GACL rules to list users who are allowed to access and control the job in addition to the job's owner. Access and control levels are specified per user. The any-user tag refers to any user authorized at the execution cluster. To get more information about GACL please refer to http://www.gridsite.org.

xml a GACL-compliant XML string defining access control list

Following job control levels can be specified via acl:

write  allows to modify contents of job data (job directory) and control job flow (cancel, clean, etc.)
read   allows to read content of job data (contents of job directory)
list   allows to list files available for the job (contents of job directory)
admin  allows to do everything – full equivalence to job ownership

queue

Unique: yes Operators: = != User xRSL: (queue=) A-REX xRSL: only = is allowed Example: (queue="pclong")

The name of the remote batch queue. Use only when you are sure that the queue by this name does exist.

string known queue name

While users are not expected to specify queue in job descriptions, this attribute must be present in the Server-side xRSL. In fact, this is primarily an internal attribute, added to the job description by client tools after resource discovery and matchmaking. Still, users can specify this attribute to explicitly force job submission to a queue: when specified explicitly by the user, this value will not be overwritten by the ARC Client, and an attempt will be made to submit the job to the specified queue. If for some reason (e.g. due to a client tool error) queue is absent from the Server-side xRSL, A-REX on the selected cluster will attempt to submit the job to the default queue if such is specified in the A-REX configuration.

startTime

Unique: yes Operators: = User xRSL: (startTime=

Time to start job processing by the Grid Manager, such as e.g. start downloading input files.

time  time string, YYYY-MM-DD hh:mm:ss
tttt  time string, YYYYMMDDhhmmss[Z] (converted by the ARC Client from time)

Actual job processing on a worker node starts depending on local scheduling mechanisms, but not sooner than startTime.

lifeTime

Unique: yes Operators: = User xRSL: (lifeTime=

Maximal time to keep job files (the session directory) on the gatekeeper upon job completion.

time  time (in minutes if no unit is specified)
tttt  time (seconds, converted by the ARC Client from time)

Typical life time is 1 day (24 hours). The specified life time can not exceed local settings.

notify

Unique: yes Operators: = User xRSL: (notify= [string] ... ) A-REX xRSL: -"- Example: (notify="be [email protected]")

Request e-mail notifications on job status change.

string  string of the format: [b][q][f][e][c][d] user1@domain1 [user2@domain2] ...

Here flags indicating the job status are:

b  begin (PREPARING)
q  queued (INLRMS)
f  finalizing (FINISHING)
e  end (FINISHED)
c  cancellation (CANCELLED)
d  deleted (DELETED)
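As an illustration (hypothetical e-mail address), a request for notifications at the beginning, on queuing and at the end of the job could look like:

(notify="bqe user@example.org")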

When no notification flags are specified, the default value of eb will be used, i.e., notifications will be sent at the job's beginning and at its end. No more than 3 e-mail addresses per status change are accepted.

rerun

Unique: yes Operators: = User xRSL: (rerun=) A-REX xRSL: -"- Example: (rerun="2")

Number of reruns (if a system failure occurs).


integer an integer number

If not specified, the default is 0. The default maximal allowed value is 5. The job may be rerun after a failure in any state for which rerunning makes sense. To initiate a rerun the user has to use the arcresume command.

architecture

Unique: no Operators: = != User xRSL: (architecture=) A-REX xRSL: none Example: (architecture="i686")

Request a specific architecture.

string architecture (e.g., as produced by uname -a)

nodeAccess

Unique: yes Operators: = User xRSL: (nodeAccess="inbound"|"outbound") A-REX xRSL: none Example: (nodeAccess="inbound")

Request cluster nodes with inbound or outbound IP connectivity. If both are needed, a conjunct request should be specified.

dryRun

Unique: yes Operators: = User xRSL: (dryRun="yes"|"no") A-REX xRSL: -"- Example: (dryRun="yes")

If yes, do a dry-run: the job description is sent to the optimal destination and input files are transferred, but no actual job submission to the LRMS is made. Typically used for xRSL and communication validation.

rsl_substitution

Unique: no Operators: = User xRSL: (rsl_substitution=( )) A-REX xRSL: -"- Example: (rsl_substitution=("ATLAS" "/opt/atlas"))

Defines string1 to be substituted with string2 for internal RSL use.


string1 - new internal RSL variable
string2 - any string, e.g., an existing combination of variables or a path

Use this attribute to define variables that simplify xRSL editing, e.g. when the same path is used in several values, typically in inputFiles. Only one pair per substitution is allowed. To request several substitutions, concatenate such requests. Bear in mind that a substitution must be defined prior to the actual use of the new variable string1.

After the substitution is defined, it should be used in a way similar to shell variables in scripts: enclosed in round brackets, preceded with a dollar sign, without quotes: (inputfiles=("myfile" $(ATLAS)/data/somefile))

Unlike the environment attribute, the rsl_substitution definition is only used by the client and is only valid inside the xRSL script. It cannot be used to define an environment or shell variable at the execution site.

environment

Unique: no Operators: = User xRSL: (environment=( ) [( )] ... ) A-REX xRSL: -"- Example: (environment=("ATLSRC" "/opt/atlas/src") ("ALISRC" "/opt/alice/src"))

Defines execution shell environment variables.

VAR - new variable name
string - any string, e.g., an existing combination of variables or a path

Use this to define variables at an execution site. Unlike the rsl_substitution attribute, it cannot be used to define variables on the client side.

count

Unique: yes Operators: = User xRSL: (count=) A-REX xRSL: -"- Example: (count="4")

Specifies the number of sub-jobs to be submitted for parallel tasks.

countpernode

Unique: yes Operators: = User xRSL: (countpernode=) A-REX xRSL: -"- Example: (countpernode="2")


Specifies the number of sub-jobs per node to be submitted for parallel tasks. Note: The count attribute must be specified when this attribute is specified; for example, (count="8")(countpernode="2") spreads eight sub-jobs over four nodes, two per node.

exclusiveexecution

Unique: yes Operators: = User xRSL: (exclusiveexecution="yes"|"no") A-REX xRSL: -"- Example: (exclusiveexecution="yes")

Specifies whether the node should be allocated for exclusive use by the job.

jobreport

Unique: yes Operators: = User xRSL: (jobreport=) A-REX xRSL: -"- Example: (jobreport="https://grid.uio.no:8001/logger")

Specifies a URL of an accounting service to which reports about the job are sent. The default is set up in the cluster configuration.

URL URL

It is up to users to make sure the requested accounting service accepts reports from the set of clusters they intend to use.

credentialserver

Unique: yes Operators: = User xRSL: (credentialserver=) A-REX xRSL: -"- Example: (credentialserver="myproxy://myproxy.nordugrid.org;username=user")

Specifies a URL which the Grid Manager may contact to renew/extend the delegated proxy of the job. Only MyProxy servers are supported.

URL URL of MyProxy server

It is up to the user to make sure the specified MyProxy server will accept requests from the Grid Manager to renew expired credentials. The URL may contain the options username and credname to specify the user name and credentials name which the Grid Manager should pass to the MyProxy server. If username is not specified, the DN of the user credentials is used instead.


priority

Unique: yes Operators: = User xRSL: (priority=) A-REX xRSL: -"- Example: (priority="80")

Specifies priority given to this job during staging of input and output files when the new data staging framework is used by A-REX. Values are limited to between 1 (lowest priority) and 100 (highest priority). Default if this attribute is not specified is 50.

7.5.3 Server-side attributes

Note: It is strongly advised to avoid specifying the following attributes in the client xRSL.

The following attributes are a part of the internal ARC job representation, and must be provided by ARC client tools and passed to A-REX. Developers of new ARC client tools and utilities must make sure these attributes are added to the user job description before it is submitted to A-REX.

sstdin

A-REX xRSL: (sstdin=) Example: (sstdin="myinput.dat")

Internal attribute for the standard input. Can also be spelled stdinput. Only needed for GRAM compatibility, not used by ARC as such.
• filename - standard input file name

action

A-REX xRSL: (action="request"|"cancel"|"clean"|"renew"|"restart") Example: (action="request")

Action to be taken by the gatekeeper: submit the job, cancel job execution, clear the results of the job (also cancels the job), renew the proxy of the job, or restart the job from a previous failed state.

savestate

A-REX xRSL: (savestate="yes"|"no") Example: (savestate="yes")

If yes, input RSL is stored in a temporary file at the gatekeeper. Must be always set as yes in the current implementation. Only needed for GRAM compatibility, not used by ARC as such.

lrmstype

A-REX xRSL: (lrmstype=) Example: (lrmstype="pbs")

LRMS type, indicating which submission script is to be invoked.
• string - LRMS type

hostName

A-REX xRSL: (hostname=) Example: (hostName="grid.quark.lu.se")

Name (e.g. as returned by the Linux hostname command) of the client machine from which the submission was made.
• string - client host name, as passed by the ARC client

jobid

A-REX xRSL: (jobid=) Example: (jobid="grid.quark.lu.se:2119/jobmanager-ng/157111017133827")

Unique job identification string, needed for cancellation and clean-up.
• string - global job ID
It can also be provided during submission of the job and should be unique to a computing element (cluster).

clientxrsl

A-REX xRSL: (clientxrsl=) Example: (clientxrsl="&(executable=/bin/echo)(arguments=boo)")

Job description xRSL string as submitted by the user, before being pre-processed by the client.
• string - original xRSL description submitted by the user
This attribute is added by the User Interface during pre-processing, and is used for job re-submission in order to repeat brokering and matchmaking.

clientsoftware

A-REX xRSL: (clientsoftware=) Example: (clientsoftware="nordugrid-arc-0.5.39")

Version of the ARC client used to submit the job.
• string - version string
This attribute is added by the User Interface during pre-processing.


7.5.4 Examples

User-side xRSL script

&
(* test run: if "yes", only submits RSL without actual job start *)
(dryRun="no")
(* some local variables defined for further convenience *)
(rsl_substitution=("TOPDIR" "/home/johndoe"))
(rsl_substitution=("NGTEST" $(TOPDIR)/ngtest))
(rsl_substitution=("BIGFILE" "/scratch/johndoe/100mb.tmp"))
(* some environment variables, to be used by the job *)
(environment=("ATLAS" "/opt/atlas") ("CERN" "/cern"))
(* the main executable file to be staged in and submitted to the PBS *)
(executable="checkall.sh")
(* the arguments for the executable above *)
(arguments="pal")
(* files to be staged in before the execution *)
(inputFiles = ("be_kaons" "")
  ("file1" gsiftp://grid.uio.no$(TOPDIR)/remfile.txt)
  ("bigfile.dat" $(BIGFILE) )
)
(* files to be given executable permissions after staging in *)
(executables="be_kaons")
(* files to be staged out after the execution *)
(outputFiles= ("file1" "gsiftp://grid.tsl.uu.se/tmp/file1.tmp")
  ("100mb.tmp" "rls://rls.nordugrid.org:39281/test/bigfile")
  ("be_kaons.hbook" gsiftp://ce1.grid.org$(NGTEST)/kaons.hbook)
)
(* user-specified job name *)
(jobName="NGtest")
(* standard input file *)
(stdin="myinput.dat")
(* standard output file *)
(stdout="myoutput.dat")
(* standard error file *)
(stderr="myerror.dat")
(* A-REX logs directory name *)
(gmlog="gmlog")
(* flag whether to merge stdout and stderr *)
(join="no")
(* request e-mail notification on status change *)
(notify="bqfe [email protected] [email protected]")
(* maximal CPU time required for the job, minutes for PBS *)
(CpuTime="60")
(* maximal time for the session directory to exist on the remote node, days *)
(lifeTime="7")
(* memory required for the job, per count, Mbytes *)
(Memory="200")
(* wall time to start job processing *)
(startTime="2002-04-28 17:15:00")
(* disk space required for the job, Mbytes *)
(Disk="500")
(* required architecture of the execution node *)
(architecture="i686")
(* required run-time environment *)
(runTimeEnvironment="APPS/HEP/Atlas-1.1")
(* number of re-runs, in case of a system failure *)
(rerun="2")


Server-side xRSL script

Note that a client tool must do matchmaking and modify the job document correspondingly before submitting it to the matching resource. Specifically, a client tool has to:
• expand all the rsl_substitution values
• add double quotes to all strings
• insert the queue attribute in case it is missing
• make sure every logical or comparison operator is expanded and replaced with a deterministic = statement
• streamline blank spaces

Note: Comment lines are removed from the Server-side xRSL! Below they are shown to explain details.

&
(* saves RSL in a temporary file if "yes" *)
("savestate"="yes")
(* job submission to be performed if action is "request" *)
("action"="request")
(* submission host name *)
("hostname"="ce1.grid.org")
(* client software version *)
("clientsoftware"="nordugrid-arc-0.6.0.3")
(* walltime value added by the client, in seconds *)
("walltime"="3600")
(* test run: if "yes", only submits RSL without actual job start *)
("dryRun"="no")
(* some local variables defined for further convenience *)
("rsl_substitution"=("TOPDIR""/home/johndoe"))
("rsl_substitution"=("NGTEST""/home/johndoe/ngtest"))
("rsl_substitution"=("BIGFILE""/scratch/johndoe/100mb.tmp"))
(* some environment variables, to be used by the job *)
("environment"=("ATLAS""/opt/atlas")("CERN""/cern"))
(* executable *)
("executable"="checkall.sh")
(* arguments *)
("arguments"="pal")
(* files to be staged in before the execution *)
("inputfiles"=("checkall.sh""279320")
  ("myinput.dat""39806")
  ("be_kaons""8807")
  ("file1""gsiftp://grid.uio.no/home/johndoe/remfile.txt")
  ("bigfile.dat""104857600")
)
(* files to be given executable permissions after staging in *)
("executables"="checkall.sh""be_kaons")
(* files to be staged out after the execution *)
("outputfiles"=("file1""gsiftp://grid.tsl.uu.se/tmp/file1.tmp")
  ("100mb.tmp""rls://rls.nordugrid.org:39281/test/bigfile")
  ("be_kaons.hbook""gsiftp://ce1.grid.org/home/johndoe/ngtest/kaons.hbook")
  ("myoutput.dat""")
  ("myerror.dat""")
)
(* user-specified job name *)
("jobName"="NGtest")
(* standard input file *)
("stdin"="myinput.dat")
(* standard output file *)
("stdout"="myoutput.dat")
(* standard error file *)
("stderr"="myerror.dat")
(* flag whether to merge stdout and stderr *)
("join"="no")
(* request e-mail notification on status change *)
("notify"="bqfe [email protected] [email protected]")
(* specific queue to submit the job *)
("queue"="atlas")
(* CPU time required for the job, converted into seconds *)
("cputime"="3600")
(* maximal time for the session directory to exist on the remote node, seconds *)
("lifetime"="604800")
(* memory required for the job, per count, Mbytes *)
("memory"="200")
(* wall time to start job processing *)
("startTime"="20020128171500")
(* disk space required for the job, Mbytes *)
("disk"="500")
(* required architecture of the execution node *)
("architecture"="i686")
(* required run-time environment *)
("runtimeenvironment"="APPS/HEP/Atlas-1.1")
(* number of re-runs, in case of a system failure *)
("rerun"="2")
(* original client xRSL with expanded string substitutions; shortened here *)
("clientxrsl"="&(""dryrun""=""no"" )(... )(""rerun""=""2"")")

7.6 ARC Client Config Reference

Some default values used by client tools can be redefined in the client configuration file located in ~/.arc/client.conf. All currently supported client configuration options should be defined in the [common] block.

Warning: Other blocks (that were previously used to define target options in ARC 5) are deprecated starting from the ARC 6.5.0 release! You can still use such blocks with legacy submission endpoint selection options for backward compatibility, but they are completely ignored when the new ARC 6 options are set.
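For illustration, a minimal ~/.arc/client.conf combining several of the options described below could look as follows; the paths and VO names are placeholders, not defaults:

[common]
# client credentials
certificatepath=/home/user/credentials/cert.pem
keypath=/home/user/credentials/key.pem
proxypath=/home/user/credentials/proxy.pem
# default VOMS attributes for arcproxy
defaultvoms=atlas:/atlas/Role=pilot
# brokering and timeouts
brokername=FastestQueue
timeout=60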

7.6.1 certificatepath

Synopsis: certificatepath = path Description: Specify the location of client certificate file. Environmental variable X509_USER_CERT redefines this value. Default: $HOME/.globus/usercert.pem Example: certificatepath=/home/user/credentials/cert.pem

7.6.2 keypath

Synopsis: keypath = path


Description: Specify the location of client secret key file. Environmental variable X509_USER_KEY redefines this value. Default: $HOME/.globus/userkey.pem Example:

keypath=/home/user/credentials/key.pem

7.6.3 cacertificatesdirectory

Synopsis: cacertificatesdirectory = path Description: Specify the location of CA certificates directory Environmental variable X509_CERT_DIR redefines this value. Default: /etc/grid-security/certificates Example:

cacertificatesdirectory=/home/user/.globus/certificates

7.6.4 proxypath

Synopsis: proxypath = path Description: Specify the location of proxy certificate (both for generation and usage) Environmental variable X509_USER_PROXY redefines this value. Default: /tmp/x509up_u${UID} Example:

proxypath=/home/user/credentials/proxy.pem

7.6.5 vomsespath

Synopsis: vomsespath = path Description: Path to a file or directory that holds the client VOMS configuration to generate proxy certificates. Environmental variables X509_VOMS_FILE and X509_VOMSES redefine this value. If missing, arcproxy will search for vomses in the following locations:
• ~/.arc/vomses
• ~/.voms/vomses
• /etc/vomses
• /etc/grid-security/vomses
Default: undefined Example:

vomsespath=/home/user/credentials/vomses


7.6.6 defaultvoms

Synopsis: defaultvoms = vo[:command] Description: Default value for the --voms (-S) arcproxy option that is used to define the VO and optionally the FQANs used during proxy certificate generation. This option is multivalued. Default: undefined Example: defaultvoms=atlas:/atlas/Role=pilot defaultvoms=nordugrid.org:all defaultvoms=ops.ndgf.org

7.6.7 rejectdiscovery

Synopsis: rejectdiscovery = service Description: Specify the FQDNs or URLs of the services that should be rejected during the service discovery process by CLI tools (arcsub, arctest, arcresub). This option is multivalued. Default: undefined Example: rejectdiscovery=bad.service.org rejectdiscovery=bad2.service.org

7.6.8 rejectmanagement

Synopsis: rejectmanagement = service Description: Specify the FQDNs or URLs of the CEs that should be skipped during job management (e.g. arcstat, arckill). This option is multivalued. Default: undefined Example: rejectmanagement=bad3.service.org rejectmanagement=bad4.service.org

7.6.9 brokername

Synopsis: brokername = broker Description: Specify the broker used in resource discovery. The full list of installed brokers can be obtained running arcsub -P Default: Random Example: brokername=FastestQueue


7.6.10 brokerarguments

Synopsis: brokerarguments = args Description: Specify broker arguments (if applicable to specified broker) Default: undefined Example: brokername=PythonBroker brokerarguments=ACIXBroker.ACIXBroker:https://cacheindex.ndgf.org:6443/data/index

7.6.11 timeout

Synopsis: timeout = seconds Description: Amount of time to wait for a service to respond before considering it dead. Default: 20 Example: timeout=60

7.6.12 joblist

Synopsis: joblist = path Description: Path to the jobs database that holds all extra data about submitted jobs to be used during further job management Default: $HOME/.arc/jobs.dat Example: joblist=/home/user/arcjobs.dat

7.6.13 joblisttype

Synopsis: joblisttype = type Description: Type of the backend used for jobs database.

Warning: It is strongly advised to keep the default SQLITE backend type.

Allowed values: SQLITE, BDB, XML Default: SQLITE Example: joblisttype=XML

7.7 ARC SDK Documentation

CHAPTER EIGHT

ARC MISCELLANEOUS PAGES

These pages contain information that does not fit in the formal ARC documentation. This can be, for instance, more dynamic content, like overviews of testing campaigns related to releases, or how-to pages.

8.1 ARC support for OIDC

8.1.1 Support level

Currently the support for OIDC tokens in ARC is at a technology preview level. Only tokens conforming to the WLCG profile have been tested. The current validation is not strict: the token is parsed and the signature is checked if present, but no additional requirements are imposed. Tokens are only accepted for client authentication during job submission through the EMI-ES interface.

8.1.2 Obtaining and using tokens

The suggested way for obtaining a token is through the oidc-agent utility - https://indigo-dc.gitbook.io/oidc-agent/. Install it following the instructions for your Linux distribution. Point your browser to https://wlcg.cloud.cnaf.infn.it/ and create an account.
Start the oidc-agent. It will print a few lines of shell commands. Copy them to the command line and execute them. This will set up environment variables for the other oidc-* commands.
Run oidc-gen. It will guide you through the steps to register the OIDC client and create a profile for the oidc-agent. When asked to select the issuer, select https://wlcg.cloud.cnaf.infn.it/. When asked about the scope, write openid profile wlcg. You only need to run oidc-gen once. Next time when you use the oidc-agent, you can load an already created profile with the 'oidc-add NAME_YOU_CHOOSE' command.
When the oidc-gen command tells you "To continue and approve the registered client visit the following URL in a Browser of your choice:", point your browser (which must be running on the same machine as the oidc tools) to the address given.
Obtain the token and store it in the ARC_OTOKEN variable: export ARC_OTOKEN=`oidc-token NAME_YOU_CHOOSE`

If the oidc tools were installed on a different machine than your ARC client, then first obtain the token on the oidc tool machine: oidc-token NAME_YOU_CHOOSE

Then copy the token string and on your ARC client machine do: export ARC_OTOKEN=


Now submit the job to an ARC CE with arcsub through the EMI-ES interface. For that, use the option '-S org.ogf.glue.emies.activitycreation' (or, if you are using ARC client >= 6.5.0, '-T emies'). The token stored in the ARC_OTOKEN variable will be used instead of the X.509 certificate for authenticating the user to the ARC CE server. Data staging currently will still use the X.509 proxy credentials. Note: You can use any other method for obtaining a WLCG compliant OIDC token. Just store it in the ARC_OTOKEN variable before calling arcsub.
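Put together, a minimal token-based submission session could look like the sketch below. The profile name wlcg-test, the CE name ce.example.org and the job description file job.xrsl are placeholders; the oidc-* commands come from the oidc-agent package described above, and the arcsub options follow the text:

[user ~]$ eval $(oidc-agent)                      # start the agent and export its environment variables
[user ~]$ oidc-gen wlcg-test                      # one-time client registration, choose the WLCG IAM issuer
[user ~]$ oidc-add wlcg-test                      # in later sessions, load the existing profile
[user ~]$ export ARC_OTOKEN=$(oidc-token wlcg-test)
[user ~]$ arcsub -C ce.example.org -T emies job.xrsl   # ARC client >= 6.5.0; older clients: -S org.ogf.glue.emies.activitycreation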

8.1.3 Configuring authorization on server

Token processing is enabled by the presence of the [authtokens] configuration block. The user can be authorized on the server by adding a dedicated command to the authgroup block: authtokens=subject issuer audience scope group

The specified parameters must match those in the provided token. Parameters can be '*' to match any value, for example authtokens=e83eec5a-e2e3-43c6-bb67-df8f5ec3e8d0 https://wlcg.cloud.cnaf.infn.it/ * * * matches a user with subject e83eec5a-e2e3-43c6-bb67-df8f5ec3e8d0 in a token issued by https://wlcg.cloud.cnaf.infn.it/.

Note: Until handling of authtokens is integrated with arcproxy you will have to find the subject of the token using a tool like e.g. https://jwt.io/ . Alternatively you can install flaat (https://pypi.org/project/flaat/).

User mapping to a local account is implemented using a simulated X.509 user subject. The subject provided by an OIDC token is unique only in the scope of the identity provider. To generate a globally unique user identifier, the issuer and the subject are concatenated as "issuer/subject" to provide an identifier suitable for user mapping. For example, a user with subject e83eec5a-e2e3-43c6-bb67-df8f5ec3e8d0 in the token issued by https://wlcg.cloud.cnaf.infn.it/ is represented by the simulated identifier https://wlcg.cloud.cnaf.infn.it//e83eec5a-e2e3-43c6-bb67-df8f5ec3e8d0
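As a configuration sketch only (block and option names should be verified against the ARC Configuration Reference Document), authorizing and mapping such a token user could look like the snippet below; the authgroup name tokenusers and the local account nobody are placeholders chosen for illustration:

[authtokens]
# enables token processing in A-REX

[authgroup:tokenusers]
# subject issuer audience scope group, '*' matches any value
authtokens=e83eec5a-e2e3-43c6-bb67-df8f5ec3e8d0 https://wlcg.cloud.cnaf.infn.it/ * * *

[mapping]
# map members of the authgroup to a local account
map_to_user=tokenusers nobody:nobody

[arex/ws/jobs]
# allow the authgroup to use the EMI-ES job submission interface
allowaccess=tokenusers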

8.2 About the Nordugrid ARC Releases

This page collects general information concerning the releases of the Nordugrid ARC software. An ARC release is defined as the ARC source package AND the corresponding binary packages on the supported platforms.

8.2.1 Release categories

The source package is identified by its major and minor version number. Due to historical reasons a "bugfix" number is also included, but for scheduled releases (see below) this is always 0. As of March 2018 we do not distinguish between bugfix and minor releases for scheduled releases. The third digit is bumped only in the case of an emergency release (see below). An example of a release number is NorduGrid ARC 6.1.0, where 6 refers to the major release series and 1 to the minor. This is a scheduled release since the last digit is 0. An ARC release 6.1.1 or 6.1.2 signifies an emergency release since the last digit is different from 0.
The properties of a major release are:
• may break backward compatibility


• introduces new components, features
• obsoletes components
• has longer term planning
• 3-6 months preparation
• can include alpha, beta, rcx test releases
• bumps the major number in the version number
• the contents mainly follow a high level ARC development roadmap
The properties of a minor release are:
• includes bugfixes
• can include new features and/or enhancements
• can include new components
• does not include anything that is backward incompatible
• the release bumps the minor number in the version number
The Nordugrid ARC releases can come in three types: scheduled, emergency and binary update of a release.
Scheduled releases:
• Minor releases are scheduled regularly, monthly or bi-monthly
• Major releases are scheduled irregularly as needed, with frequency ranging from one to three years.
• All developments that at the time of the release are already merged into master are included in the scheduled release - this means that backward incompatible changes are kept in separate development branches, and only merged before a major release is imminent.
Emergency releases:
• is an unplanned urgent release to fix a security issue or a critical bug
• maximum two weeks preparation
• only development needed to fix the issue is included into the release in order to not delay the release or introduce new bugs
• a separate branch is created from the point on master of the last release, and the release tag is created on this branch
• bumps the third digit in the ARC release number by one: 6.1.0 -> 6.1.1.
Binary update of a release:
• is sometimes needed due to a change in the external dependencies and ARC releases have to be rebuilt
• this "semi release" does not affect the ARC source code, no re-tagging takes place, only new binaries are built
• can happen for both major, minor or emergency releases

8.2.2 Testing

Testing needs to be done on several Linux distributions, as ARC depends on a lot of software that has different versions and file structures on different distributions. The level of importance of the different Linux distributions is evaluated as follows:
1. RHEL/CentOS/Scientific Linux version 7.x, architecture x86_64, is The Most Important version.
2. RHEL 6, latest Ubuntu LTS, latest Ubuntu, any architecture


3. Fedora and other Ubuntu versions
4. The rest.
Some OSes are more used server-side (RHEL) and some client-side (Fedora). The testing of Nordugrid ARC is performed in levels, where levels 1 and 2 are always performed, and level 3 is performed on a best-effort or as-needed basis.

Level 1

All supported platforms are built on a nightly basis and are publicly available. In addition, ARC is built and deployed for the most important platforms on the GitLab CI platform. The deployment includes a basic functional test. Both the supported platforms and the tests are subject to ongoing development. Currently the build and deploy are performed on each commit to master.

Level 2

The release candidate is deployed on volunteer sites (minimally one) and on the testing infrastructure available at the time. In particular, the functionality affected by the development in the current release should be tested. Each test site tracks its contribution to the test in the GitLab Wiki.

Level 3

If necessary and possible, larger testing campaigns are organized using the release candidate in question (involving users and sysadmins outside the testing team). This campaign will follow a plan organized centrally by the Nordugrid ARC team. The sites report back on any issues seen, and may be asked to perform tests and give a simple report back on the test results. The tests and their results are recorded on dedicated pages in the Testing area of the ARC documentation.

8.2.3 Release notes

Release notes are published on http://www.nordugrid.org/arc/releases/ and are distributed in the following channels:
• News section at http://www.nordugrid.org
• RSS feed: http://www.nordugrid.org/news.xml
• RSS feed: http://www.nordugrid.org/arc/release-news.xml
• LinkedIn NorduGrid page: http://www.linkedin.com/company/nordugrid
The following Web pages are updated upon a release:
• ARC Web page: http://www.nordugrid.org/arc/
• ARC releases: http://www.nordugrid.org/arc/releases/ (updated automatically when release notes are uploaded)
• NorduGrid repositories, in case a new one is created: the repository document within this documentation and http://download.nordugrid.org/repos-6.html
• Wikipedia article: http://en.wikipedia.org/wiki/Advanced_Resource_Connector

8.3 Security Operations

This page collects general information about security operations.


8.3.1 Finding jobs submitted by DN

In case it is necessary to find all the jobs belonging to a specific user, the following command may be run by the local CE admin:

[root ~]# arcctl job list --owner

8.3.2 Killing jobs submitted by DN

In case you need to kill all jobs owned by a specific user, identified by a certain certificate DN:

[root ~]# arcctl job killall --owner

8.3.3 Removing jobs submitted by DN

Warning: Do not do this before any needed forensics have been performed

In case you want to remove all files related to jobs run by a specific user:

[root ~]# arcctl job cleanall --owner
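For example, with a hypothetical certificate DN the three commands above could be invoked as follows (the DN is quoted so it is passed as a single argument):

[root ~]# arcctl job list --owner "/DC=org/DC=example/CN=Jane Doe"
[root ~]# arcctl job killall --owner "/DC=org/DC=example/CN=Jane Doe"
[root ~]# arcctl job cleanall --owner "/DC=org/DC=example/CN=Jane Doe"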

8.4 ARC 6 Testing Area

Note: This is the home for all ARC6 related testing activities, infrastructures and plans. Please add text, links, anything here

8.4.1 Sites with ARC 6 alpha or release candidate installed

Early installers of ARC 6 - Testing sites


Table 8.1: Testers (Site; OS; First installed; Current version; Mode; Issues reported)
• Oslo Abel (ce01, ce02, ce03) (T1); Centos 6; June 2018; Nightlies 20190529020717; Production; BUGZ-3785, BUGZ-3737, BUGZ-3736, BUGZ-3781, BUGZ-3782, BUGZ-3780, BUGZ-3779, BUGZ-3735, BUGZ-3785, BUGZ-3802, BUGZ-3796.
• David; Testing; BUGZ-3786, BUGZ-3787, BUGZ-3784, BUGZ-3797, BUGZ-3800, BUGZ-3799, BUGZ-3798, BUGZ-3789, BUGZ-3777, BUGZ-3771, BUGZ-3770, BUGZ-3755, BUGZ-3725, BUGZ-3714, BUGZ-3723, BUGZ-3724.
• Linkoping Bluegrass; October 2018; ARC RC 6; Production
• Copenhagen T3; Centos 7; End of February; Nightlies: 20190507020715; Production; Issue #57
• Slovenia - SIGNET; Gentoo; March 2019; ARC RC 6; Production
• Slovenia - SIGNET_NSC; Fedora 28; March 2019; Nightly installation equiv to RC6; Test
• Ukraine - KNU; Centos 7; June 2018; ARC RC 5 (6?); Production

List of bugs reported for ARC 6

Bugzilla search Issues in GitLab

8.4.2 Release testing

• ARC-6.2.0 • ARC-6.3.0


8.5 Release management

8.5.1 People involved in preparing for a release

Table 8.2: People (Name: Expertise/responsibility)
• Anders Waananen: Packaging, autotools, admin access to Bugzilla and nordugrid.org (package downloads http://download.nordugrid.org/repos/6/)
• Balazs Konya: Overall coordination
• Mattias Ellert: Packaging and builds, EPEL and Debian uploads (Globus)
• Maiken Pedersen: Release manager
• Oxana Smirnova: Bugzilla, Web site, documentation

8.5.2 Release workflow

• Decide on the release date and timeline at the Nordugrid technical coordination weekly meeting
  – The aim is a monthly scheduled minor release
  – The timeline contains the code-freeze date. From this date on no new merge requests will, as a rule, be added to master.
  – Together with the developers a set of minimal tests is defined for the current release in planning.
  – Clarify if any documentation changes are needed.
  – The release manager enters the test descriptions on the GitLab wiki test page: https://source.coderefinery.org/nordugrid/arc/wikis/home
• The release manager sends out a heads-up email announcing the release in preparation and its timeline right after the decision of the release is made
  – to: nordugrid-discuss
  – Inform that documentation updates should be ready by the code-freeze date
• The day after code-freeze, the agreed test sites should install the nightly build from the nightly build repository: http://download.nordugrid.org/builds/index.php?pkgname=nordugrid-arc&type=master and report on the tests and any issues encountered
  – If there are any issues the release date might be postponed.

* If a postponement is necessary a new timeline is drawn
* TO-DO: Should a new email go out to nordugrid-discuss or will that just be noise?
  – If no issues are found the release is ready for the next step
• After the definite code-freeze the translations are run
  – The release manager gives a green light to run translations
  – Translations are run. Responsible: Oxana Smirnova
• The release manager creates a tag on master
  – Informs the build manager about the commit hash either on skype or by email to the nordugrid-core mail list
• The builds are created and repositories populated. Responsible: Anders Waananen
• The list of supported platforms is checked for updates. Responsible: Anders Waananen


• Once the source tarballs are available in the nordugrid repo, the build in the Fedora build system and for Debian is started. Responsible: Mattias Ellert.
  – The builds should not be made available before the release is announced. TO-DO: Is this possible?
• Prepare the release announcement (see more below). Responsible: Maiken Pedersen
• Notify the release manager that the builds are ready to be pushed to the repo. Responsible: Anders Waananen
• Notify the responsible in order to publish the release announcement on the web. Responsible: Maiken Pedersen
  – Publish the release announcement. Responsible: Oxana Smirnova
• Notify the responsible in order to push the packages to the nordugrid repo. Responsible: Maiken Pedersen
  – Push packages to the nordugrid repo. Responsible: Anders Waananen
• Notify the responsible in order to push builds to the Fedora and Debian repos. Responsible: Maiken Pedersen
  – Push packages to the Fedora and Debian repos. Responsible: Mattias Ellert
• Ensure there are consistent binary packages in the agreed repositories and linked from the release page. Responsible: Maiken Pedersen
• Ensure there is a corresponding release version in Bugzilla. Responsible: Anders Waananen

8.5.3 The release announcement should contain

• Most important changes/highlights
• List of fixed bugs
• List of new features (if relevant)
• List of backward incompatible changes (if relevant - only for major releases)
• Link to documentation in addition to installation links
• Link to the GitLab ARC repo
• Getting-in-touch information

8.6 Changelogs/list of bugs

8.6.1 Main changes in ARC 6 compared to ARC 5

There are many improvements and changes in the new ARC release, and with these we hope to have greatly enhanced the ease of setting up, configuring and managing an ARC production site, and to have improved the reliability and scalability of the ARC CE by the internal restructuring ARC has undergone.

Note: Despite all the new features and code changes, the supported ARC 6 CE interfaces are unchanged - the latest ARC 5 clients are compatible with an ARC 6 CE and vice versa.

• Complete overhaul of server-side ARC CE configuration:
  – Configuration has been completely reworked, cleaned up, regrouped and made more streamlined.
  – In particular, the way authorization and user mapping is handled has been greatly improved. Worth mentioning is the support of higher granularity vo-based authorization, and that mapping via gridmap files is no longer required nor recommended.
  – ARC now comes with a so-called zero configuration - a preconfigured minimal ARC setup automatically installed with ARC, including test-CA and test-host certificate for immediate test submission.


  – Default values throughout ARC have had a big tidy up and are now handled consistently through the ARC runtime configuration.
  – Configuration blocks are now used to enable (turn on) or disable a functionality, a service or an interface. For example, the NorduGrid schema is no longer published unless it is enabled via the corresponding configuration block.
  – Validation of the ARC configuration is enforced by A-REX startup scripts. If you have configuration errors then A-REX will not start, and you will be pointed to the error.
• Scalability and manageability improvements:
  – The internal job-loop in A-Rex has been re-engineered to be event-driven
  – xrootd plugin has been re-written to improve data transfer performance
  – Consistent handling of server-side logging and improved logfile structure
  – Reworked startup scripts
  – Streamlined package name scheme, where e.g. ldap services have been separated out
• The RTE framework has got a redesigned architecture and a largely extended functionality (default, installed and enabled RTEs, introduction of RTE parameters)
• A new framework for RunTimeEnvironments (RTE):
  – The RTE framework has gotten a redesigned architecture and largely extended functionality (default, installed, enabled RTEs, introduction of RTE parameters).

Note: RTEs must in ARC 6 be explicitly enabled through the new arcctl tool as a separate step after installation.

– There are several system RTEs installed together with ARC which you can enable on demand. These are:

* ENV/CANDYPOND: makes ARC Candypond (“Cache and deliver your pilot on-demand data”) client available on the Worker Nodes and ready to be used

* ENV/CONDOR/DOCKER: enables submission to Docker universe in HTCondor backend
* ENV/LRMS-SCRATCH: enables the usage of local WN scratch directory defined by LRMS

* ENV/PROXY: copies proxy certificate to the job session directory
* ENV/RTE: copies RunTimeEnvironment scripts to the job session directory
• ARCCTL, a new server-side management and control tool for sysadmins, has been developed.
  – arcctl is meant to be the central one-stop-shop tool for administrating an ARC CE
  – With this tool you can handle RTEs, the jobs, accounting, ARC services and many other things related to an ARC CE.
  – arcctl also offers help with integrating 3rd party services and components with an ARC CE, such as handling CAs, VOMS, etc.
• JURA Accounting subsystem improvements:
  – Accounting configuration, archive and operations have been restructured to improve typical operations.
  – Improved republishing via arcctl.
  – Better integration with APEL SSM.
  – The archive structure relies on a database for republishing and gathering stats locally.
• ARCHERY, the new DNS-based service endpoint catalogue for ARC


  – archery-manage package to populate the ARCHERY registry with ARC CE endpoint info
  – Official top-level DNS registry under the nordugrid.org domain
  – ARCHERY-compatible clients to submit and monitor jobs (arc cli, ldap monitor)
  – ARCHERY replaces the OBSOLETED EGIIS service of the ARC 5 release series.
• TECHNOLOGY PREVIEW components:
  – The REST interface - enabled together with the A-Rex Web-Service.
  – The INTERNAL interface - a special interface aimed for restrictive HPC sites, to be used with a local installation of the ARC Control Tower.
  – Candypond - "Cache and deliver your pilot on-demand data" service to bring the power of ARC Cache to the pilot job universe.
  – Experimental Python LRMS for SLURM with SSH access support.
• The ARC client and SDK:
  – The client has undergone a major internal cleanup; a large number of submission, target retrieval and job description plugins got removed as a result of OBSOLETING third-party interfaces and technologies in ARC.
  – New plugins for ARCHERY and the REST and INTERNAL interfaces were implemented.
  – The arcstat cli of the ARC6 client and the underlying SDK now handles walltime as (walltime)*(number of cores) for multicore jobs.

REMOVED and obsoleted components

• A large number of configuration options of arc.conf got DELETED during the server-side configuration rework. A detailed list of those options is available in the arc.conf.DELETED file.
• The separate gangliarc component is discontinued; instead we now offer ganglia integrated into AREX.
• Nordugrid no longer distributes a source code bundle including documentation, gangliarc and nagios.
• CREAM related plugins and server-side components.
• UNICORE related plugins and server-side components.
• All the windows & solaris related ARC code and build.
• JAVA bindings for ARC SDK.
• Support for BES, WSRF and other non-EMIES WS flavours as job management interfaces.
• Support for JDL, ARCJSDL, RSL job description dialects: ARC keeps only XRSL and ADL as supported job description languages.
• EMIR indexing service including ARC CE registration to EMIR and ARC SDK plugins.
• Server-side EGIIS, including ARC CE registration to EGIIS. BUT: keep EGIIS plugins in the client.
• GLUE1 support from the ARC client SDK; server-side support for GLUE1 temporarily kept but labelled as OBSOLETE.
• ARC SDK support to obtain information from top-BDII.
• Publishing nordugrid-authuser objects in the NorduGrid LDAP schema

List of bugs fixed since ARC 5

A rather long (but not exhaustive) list of bugs fixed since ARC 5 can be found here: http://www.nordugrid.org/arc/arc6/common/changelog/bugs-6.0.0.html


8.6.2 List of bugs fixed since ARC 5.4.4

Table 8.3: bugs_600 (Bug ID: Summary)
BUGZ-895: ARC Administrative tools
BUGZ-1421: No log of backend script problems
BUGZ-3034: arcstat -s is case-sensitive
BUGZ-3106: arcproxy could be more expressive when it finds a problem
BUGZ-3360: SGE and LL backends fail to report correct node OS/system information in GLUE2 infosystem
BUGZ-3384: Support for per-queue authorisation configuration and publishing
BUGZ-3451: Configure number of cpus manually
BUGZ-3476: Crashes in multiple ARC components
BUGZ-3545: Patch for Correct Cores Parsing
BUGZ-3557: undetected job submission in case of heavy filesystem load
BUGZ-3565: Allow setting default VO in ~/.arc/client.conf to be used in arcproxy
BUGZ-3566: Implement RTEs processing without shared directory
BUGZ-3569: Exporting CPU/Wall time limits to Glue2/BDII
BUGZ-3570: Arcsub used 100GB memory
BUGZ-3584: JURA: create one log file per job, not per submission
BUGZ-3624: Data delivery service can only listen to one network interface.
BUGZ-3626: Force GLUE2ComputingManagerTotalLogicalCPUs to be totalcpus when this value is defined in arc.conf
BUGZ-3632: arcproxy fails in Ubuntu 16.04, 16.10 and recent Debian systems
BUGZ-3637: arcget with multiple jobs crashes
BUGZ-3643: The watchdog crashes
BUGZ-3662: arcsub crashes
BUGZ-3667: JSON output for arcstat
BUGZ-3674: settings in client.conf ignored
BUGZ-3675: Problems retrieving jobs with arcget 1
BUGZ-3676: Problems retrieving jobs with arcget 2 (first byte missing)
BUGZ-3677: arcproxy fails retrieving attributes from voms
BUGZ-3682: Better error message when DN not in gridmap file
BUGZ-3690: Warnings about missing information on deleted jobs
BUGZ-3695: Slowness with arccp and the xrootd protocol
BUGZ-3700: ARC1ClusterInfo.pm uses netstat
BUGZ-3702: Cannot use arc data commands without certificates
BUGZ-3707: Seg fault Triolith - related or not to the data-staging problems on Triolith
BUGZ-3713: Malformed jobs.dat entries
BUGZ-3722: Xenial repo for the (nordugrid) ARC source is not correct
BUGZ-3756: package update failed to restart A-REX
BUGZ-3772: Can't use different credentials within one submission process when jobs require user input files
BUGZ-3773: Enabling arex-ganglia breaks controldir access
BUGZ-3778: arcctl not working if arc code configured with --disable-swig flag and installed with make install
BUGZ-3788: Poor performance with arccp and HTTPS
BUGZ-3812: A-REX hangs inside XRootd after fork

8.7 Using ARC packages from nightly builds

Recent development versions of ARC are available as nightly build packages for many Linux distributions. To use these nightlies smoothly for installation and updates, the repository should be added to your system.


8.7.1 Enabling nightlies repo for RHEL-based distributions

Setup a CRON job (e.g. /etc/cron.daily/update-arc-nightly-time.sh) to fetch the latest nightlies date:

#!/bin/bash
curl -s http://builds.nordugrid.org/nightlies/nordugrid-arc/master/ | sed -n 's/^.*\(.*\)\/<\/a>.*$/\1/p' | sort | tail -1 > /etc/yum/vars/arcnightly

Run the CRON script once manually to have the arcnightly variable initialized. Also, make sure the script has executable permissions. To add the nightlies repository to your RHEL-based system, create a /etc/yum.repos.d/nordugrid-nightly.repo file with the following content [1]:

[nordugrid-nightly]
name=Nordugrid ARC Master Nightly Builds - $basearch
baseurl=http://builds.nordugrid.org/nightlies/nordugrid-arc/master/$arcnightly/centos/el7/$basearch
enabled=1
gpgcheck=0

Check if it works running yum (or dnf), e.g.:

[root ~]# yum makecache
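With the repository in place, packages can then be installed straight from the nightlies, for example (the package selection depends on your setup):

[root ~]# yum install nordugrid-arc-arex nordugrid-arc-client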

8.7.2 Enabling nightlies repo for Debian-based distributions

Nightlies for Debian/Ubuntu are available as standalone packages without repository index files generated. The suggested approach is to:
• download packages locally
• create the necessary repository package index
• use the repository on the local filesystem
To accomplish this, install the necessary tools for making repository files:

[root ~]# apt-get -y install dpkg-dev

Regularly fetch latest nightly packages and create the repository index with the following daily CRON job:

#!/bin/bash

arcrelease='ubuntu/16.04/amd64'
latestdate=$( wget -q -O - http://builds.nordugrid.org/nightlies/nordugrid-arc/master/ | sed -n 's/^.*\(.*\)\/<\/a>.*$/\1/p' | sort | tail -1 )
repodir=/srv/nordugrid-nightlies

rm -rf $repodir; mkdir -p $repodir; cd $repodir

wget -q -r -nH --cut-dirs=8 --no-parent -l1 -A "*.deb,*.ddeb" http://builds.nordugrid.org/nightlies/nordugrid-arc/master/$latestdate/$arcrelease/

dpkg-scanpackages . /dev/null 2>/dev/null | gzip -9c > Packages.gz

[1] Note that you should modify the OS release version to match your case. For Fedora releases, replace centos/el with fedora/.

Create the local repository file /etc/apt/sources.list.d/nordugrid-nightlies.list and add the contents:


deb file:/srv/nordugrid-nightlies ./

Check it works running apt, e.g:

[root ~]# apt-get update

8.8 NorduGrid repository information for ARC 6

The NorduGrid ARC packages are available through YUM and APT repositories for several systems. We have release-based repositories that you can follow. This will keep your install to a particular release of NorduGrid ARC with only minor and bug-fixing updates. You can also choose to follow the repository "latest" which will always point to the latest stable release. For each distribution there are 3 channels (repositories) available:
• base - Base packages (mandatory)
• updates - Updates to the base release (strongly recommended)
• testing - Packages almost ready to go into the updates repository (optional). The alpha, beta and release candidate releases can be found here.

8.8.1 Repository security

The NorduGrid RPM packages and DEB repositories are signed, and in order for the repository tools APT and YUM to verify them you must install the NorduGrid GPG key: For rpm based distributions like Enterprise Linux and Fedora:

[root ~]# rpm --import http://download.nordugrid.org/RPM-GPG-KEY-nordugrid-6

For Ubuntu distributions with sudo:

[user ~]$ wget -q http://download.nordugrid.org/DEB-GPG-KEY-nordugrid-6.asc -O- | sudo apt-key add -

For Debian without sudo:

[root ~]# wget -q http://download.nordugrid.org/DEB-GPG-KEY-nordugrid-6.asc -O- | apt-key add -

8.8.2 Repository configuration - Red Hat Enterprise Linux

The NorduGrid ARC repositories for RedHat Enterprise Linux / CentOS packaging utility yum or dnf can be configured through:

/etc/yum.repos.d/nordugrid.repo

The repository configuration can be set up automatically by means of installing nordugrid-release package or creating the configuration file manually.

Install nordugrid-release package with YUM/DNF

The easiest way to configure YUM to use the NorduGrid repository for Red Hat Enterprise Linux, CentOS and similar distributions is to install the nordugrid-release package which can be found in the NorduGrid package repository for the appropriate RHEL/EPEL release. Example packages are shown below for x86_64 architectures, they also exist for i386 when applicable. In that case exchange the x86_64 in the links below with i386.


CentOS: EL8 EL7 EL6
Fedora: 34 33 32 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12
Install with yum (CentOS) or dnf (Fedora), here shown for CentOS:

[root ~]# yum install

This creates the appropriate repo files in /etc/yum.repos.d/.

Manual YUM repository setup - NorduGrid repository

For manual YUM repository setup, create a file /etc/yum.repos.d/nordugrid.repo with the following contents (here using CentOS as example; if you are on Fedora, replace centos with fedora). If you are installing an alpha, beta or release candidate, please set the nordugrid-testing repository to enabled=1.

[nordugrid]
name=NorduGrid - $basearch - base
baseurl=http://download.nordugrid.org/repos/6/centos/$releasever/$basearch/base
enabled=1
gpgcheck=1
gpgkey=http://download.nordugrid.org/RPM-GPG-KEY-nordugrid-6

[nordugrid-updates]
name=NorduGrid - $basearch - updates
baseurl=http://download.nordugrid.org/repos/6/centos/$releasever/$basearch/updates
enabled=1
gpgcheck=1
gpgkey=http://download.nordugrid.org/RPM-GPG-KEY-nordugrid-6

[nordugrid-testing]
name=NorduGrid - $basearch - testing
baseurl=http://download.nordugrid.org/repos/6/centos/$releasever/$basearch/testing
enabled=0
gpgcheck=1
gpgkey=http://download.nordugrid.org/RPM-GPG-KEY-nordugrid-6

Check if it works running yum (or dnf), e.g.:

[root ~]# yum makecache

Install required packages

Once the NorduGrid repositories are configured, install the packages with:

[root~]# yum install

If you are installing an alpha, beta or release candidate, you must install by enabling the nordugrid-testing repo.

[root~]# yum install --enablerepo nordugrid-testing

If you are on a recent version of Fedora, use dnf instead of yum. Please refer to the ARC Computing Element Installation and Configuration Guide for package selection and configuration. Note that the NorduGrid repositories for RedHat Enterprise Linux/CentOS depend on the EPEL repositories, which must also be part of the YUM configuration.
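On CentOS the EPEL configuration can typically be added with the epel-release package, for example:

[root ~]# yum install epel-release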


Warning: On CentOS 8 you should additionally enable the PowerTools repo:
[root ~]# yum -y install dnf-utils
[root ~]# yum config-manager --set-enabled PowerTools

8.8.3 Repository configuration - Debian and Ubuntu

The NorduGrid ARC repositories for Debian and Ubuntu packaging utility APT can be configured through:

/etc/apt/sources.list or when supported through a repo specific file:

/etc/apt/sources.list.d/nordugrid.list

The configurations for the various APT-based distributions can be found in the following sections. To enable a specific repository, remove the "#" from the beginning of the line, before the "deb", as shown for the Base Channel. The repository configuration can be set up automatically by means of installing the nordugrid-release package or by creating the configuration file manually.

Install nordugrid-release package for Debian/Ubuntu through dpkg

The examples below give you the link for the most recent Debian/Ubuntu releases. Packages are shown below for the amd64 architecture. Replace amd64 with i386 if required for your architecture.
Debian: 11 10 9 8 7
Ubuntu: 21.04 20.10 20.04 19.10 19.04 18.10 18.04 17.10 17.04 16.10 16.04 15.10 15.04 14.10 14.04
Install the source file with dpkg, example shown for Debian 10:

[root ~]# wget -q https://download.nordugrid.org/packages/nordugrid-release/releases/6/debian/10/amd64/nordugrid-release_6~bpo10+1_all.deb
[root ~]# dpkg -i nordugrid-release_6~bpo10+1_all.deb

For a different version of Debian or Ubuntu, change the version names appropriately.

Manual APT repository setup - NorduGrid repository

For manual APT repository setup for Debian, the APT sources file should contain the following (here shown for Debian 10 buster):

# Base channel - must be enabled
deb http://download.nordugrid.org/repos/6/debian/ buster main
deb-src http://download.nordugrid.org/repos/6/debian/ buster main

# Updates to the base release - should be enabled
deb http://download.nordugrid.org/repos/6/debian/ buster-updates main
deb-src http://download.nordugrid.org/repos/6/debian/ buster-updates main

# Scheduled package updates - optional
#deb http://download.nordugrid.org/repos/6/debian/ buster-experimental main
#deb-src http://download.nordugrid.org/repos/6/debian/ buster-experimental main

For manual APT repository setup for Ubuntu, the APT sources file should contain the following (here shown for Ubuntu 20.04 focal):


# Base channel - must be enabled
deb http://download.nordugrid.org/repos/6/ubuntu/ focal main
deb-src http://download.nordugrid.org/repos/6/ubuntu/ focal main

# Updates to the base release - should be enabled
deb http://download.nordugrid.org/repos/6/ubuntu/ focal-updates main
deb-src http://download.nordugrid.org/repos/6/ubuntu/ focal-updates main

# Scheduled package updates - optional
#deb http://download.nordugrid.org/repos/6/ubuntu/ focal-experimental main
#deb-src http://download.nordugrid.org/repos/6/ubuntu/ focal-experimental main

For a different release version, change the version name accordingly.
For Debian:
• 11: bullseye
• 10: buster
• 9: stretch
• 8: jessie
• 7: wheezy
For Ubuntu:
• 21.04: hirsute
• 20.10: groovy
• 20.04: focal
• 19.10: eoan
• 19.04: disco
• 18.10: cosmic
• 18.04: bionic
• 17.10: artful
• 17.04: zesty
• 16.10: yakkety
• 16.04: xenial
• 15.10: wily
• 15.04: vivid
• 14.10: utopic
• 14.04: trusty
• 13.10: saucy
• 13.04: raring
• 12.10: quantal
• 12.04: precise

Install required packages

Run the following command to update the packages database:

370 Chapter 8. ARC Miscellaneous Pages NorduGrid ARC 6 Information, Release ARC6

[root~]# apt-get update

Install the packages with (showing example for nordugrid-arc-arex):

[root~]# apt-get install nordugrid-arc-arex

Please refer to the ARC Computing Element Installation and Configuration Guide for package selection and configuration.

8.9 Work-in-progress Docs

Hidden area that holds unfinished documents to be built and made available in the doc tree, but not yet linked to the right place.

8.9.1 INTERNAL interface - ARC 6

NB! WIP [TODO] Fix references, now in tex style ref to arc-ce sys admin guide
The INTERNAL submission interface is aimed at restrictive HPC sites. When ARC runs in the internal mode, it should do so alongside a local instance of the ARC Control Tower (aCT). aCT pulls jobs from the central job server (like PaNDA for ATLAS) and feeds ARC with new jobs internally. Since there is no external access, there is no need for a web service, gridftp server or ldap, as the purpose of these components is to facilitate external access. Nor is there a need for a host certificate on such a machine. The only service running on ARC is A-REX. Therefore a stripped-down version of aCT and ARC-CE can be used, which is beneficial for installation, configuration and maintenance. Furthermore, in the INTERNAL mode ARC should be installed as a normal user. No user mapping (See Section myref{sub:access_control}) is needed in this case, as the default behaviour in ARC 6 is to map the user submitting the job to the A-REX user.

Implementation overview

The INTERNAL submission plugin, which is part of the ARC client, interacts with the parent plugin classes using the same API as the other plugins, such as the gridftp plugin or the emi-es plugin. However, the INTERNAL plugin interacts directly with the A-REX memory and methods, and is therefore integrated as part of the A-REX service which belongs to the ARC-CE code base. Consequently, both the ARC client and the ARC CE must be installed on the same machine for the INTERNAL submission plugin interface to function. All interaction between the client and A-REX happens directly via files in the controldir or via A-REX memory.

Actions

Retrieving Service Information

As a site running in the INTERNAL mode is not accessible from the outside, any service information retrieval can only be done from within the site. You may inspect the service information as usual by issuing the arcinfo command. The information is extracted by direct access to the info.xml file in the controldir. The sstat method of the INTERNALClient reads the info.xml and outputs information in xml format to the client. An example of the output of arcinfo for localhost:

[root ~]# arcinfo -c localhost
Computing service:
  Information endpoint: file://localhost
  Submission endpoint: file://localhost (status: ok, interface: org.nordugrid.internal)


When arcinfo is called, the INTERNAL submission interface extracts the site information by directly accessing the info.xml file in the controldir. The INTERNAL plugin reads the info.xml and outputs information in xml-format to the client, which in turn displays it to the user.

Job submission

When a job is submitted via the INTERNAL submission interface, the plugin creates an A-REX job object, which in turn takes care of creating all necessary files (like for instance the ARC job description) and folders (sessiondir) for the job, in addition to creating a job ID. The INTERNAL plugin then places any input files local to the client in the newly created sessiondir. Remaining remote input files are downloaded by the DTR (See Section myref{sub:datastaging}). Once these files are present in the controldir, A-REX adds the job to its joblist and takes over the handling of the job from there.
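A submission through the INTERNAL interface is done with the regular client tools on the CE host itself. The sketch below assumes a job description file job.xrsl and an ARC client >= 6.5.0; with older clients the equivalent would be -c localhost -S org.nordugrid.internal:

[user ~]$ arcsub -C localhost -T internal job.xrsl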

Accessing Information About Job

Job information requested by calling arcstat is extracted from a combination of information stored in A-REX memory (job state) and the job.ID.local file in the controldir (session, stage-in and stage-out directories).
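For example, the status of all jobs known to the local client job list can be shown with the standard client command (shown here only as a usage sketch):

[user ~]$ arcstat -a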

Controlling Execution Of Job

Killing, cleaning and resubmitting jobs is initiated by directly calling the existing ARexJob methods: Kill(), Clean(), Resume(). These methods all place files in the controldir that the grid-manager acts upon, such as the job.jobid.clean mark or the job.jobid.cancel mark.
Delegation Interface
[TODO]


