DEPLOYMENT GUIDE PIPELINE PILOT SERVER 2016

Copyright Notice

©2015 Dassault Systèmes. All rights reserved. 3DEXPERIENCE, the Compass icon and the 3DS logo, CATIA, SOLIDWORKS, ENOVIA, DELMIA, SIMULIA, GEOVIA, EXALEAD, 3D VIA, BIOVIA and NETVIBES are commercial trademarks or registered trademarks of Dassault Systèmes or its subsidiaries in the U.S. and/or other countries. All other trademarks are owned by their respective owners. Use of any Dassault Systèmes or its subsidiaries' trademarks is subject to their express written approval.

Acknowledgments and References

To print photographs or files of computational results (figures and/or data) obtained using BIOVIA software, acknowledge the source in an appropriate format. For example: "Computational results obtained using software programs from Dassault Systèmes BIOVIA. The ab initio calculations were performed with the DMol3 program, and graphical displays generated with Discovery Studio Collection."

BIOVIA may grant permission to republish or reprint its copyrighted materials. Requests should be submitted to BIOVIA Support, either through electronic mail to [email protected] or in writing to:

BIOVIA Support 5005 Wateridge Vista Drive, San Diego, CA 92121 USA


Contents

About This Document
    Getting Help
Chapter 1: Overview
    Deployment Options
        Standalone server
        Web Farm
        Clustering
        Distributed grid computing
    Connectivity
        Ports
        Port forwarding
        Reverse proxy
        Web Port
        Endpoints
        Folders
        Software development tools
Chapter 2: Architecture
    Apache
        Considerations for deployments on Microsoft Windows
        Considerations for deployments on Linux
        Best practice for Apache configuration
    Tomcat
    Pipeline Pilot Manager
    BIOVIA Query Service
        Query Service RESTful API
    MongoDB
Chapter 3: Hardware Guidance
    Processor Guidance
    Memory Guidance
    Operating System Guidance
    Server Virtualization
    Optimizing Grid Hardware
Chapter 4: Planning Your Server Deployment
    Web Farm
    Parallel Computing on Distributed Deployments
    Clustering
        Clustering requirements
        Managing nodes and delegating jobs
        Planning for your cluster deployment
    Grid Engines
        Grid Engine requirements
    Server Performance Tuning
        Usage profiles
        Job Modes
        Job Pools
        Built-in job pools
        Job pools and impersonation
    Tuning Server Settings
        Admin Portal job settings
        Job pool configuration
Chapter 5: Security
    Authentication
        Authentication examples
        Impersonation
    Authorization
        Securing protocols and components
        System application permissions
    Secure Deployment
        Firewall
        SSL termination
        Secure connections
        Reducing attack surface and endpoints
        Secure server settings
Chapter 6: Planning for Operation
    Backup procedures
    Restore procedures
    Keeping Your Server Healthy
        Managing versions
        Job folders
        Upload folder
    Managing Your Log Files
        Log of job execution events
        Log of user authentication events
        Error and diagnostic messages
    Managing BIOVIA Catalog Indexes
        Catalog architecture
        Indexer Configuration and Scheduling
        Scheduling the Indexer
        Disk usage
        Backup and recovery
    Suggested Processes and Policies
        Publication of protocols
        XMLDB folder structure, access rights, and roles
        Publication of results
        Data longevity and size
        Packaging and regressions
Appendix A: Application Deployment Guidance for BIOVIA Insight 2016
    Architecture
    Performance Considerations
    Deployments
        Small deployment
        Medium deployment
        Large deployment

About This Document

This deployment guide provides technical details on what Pipeline Pilot 2016 is, how it can be deployed, and information to use for planning purposes. It was written with the IT Architect, IT Administrator, and Pipeline Pilot administrator personas in mind.

IMPORTANT! See the Pipeline Pilot Server Install Guide and the Hub Deployment Guide for instructions on deploying those products.

Getting Help

An updated version of the System Requirements might be available at the BIOVIA Community site. To locate the updated System Requirements for this product:
1. Go to the BIOVIA Community site: https://community.3dsbiovia.com
2. Search for, locate, and download the documentation for this version of Pipeline Pilot.

If you have any questions, contact BIOVIA Customer Support:
■ Support for BIOVIA products is available at [email protected] or [email protected] (for our customers in Japan).
■ Support on the web: https://community.3dsbiovia.com


Chapter 1: Overview

Enterprise-based deployment offers increased security, manageability, availability, and scalability. It allows you to include additional servers in your network configuration to improve performance and reduce single points of failure throughout the system. Pipeline Pilot deployments also include additional features such as the ability to run in load-balanced and reverse-proxy environments, clustering support, and integration with batch queuing systems or grid engines.

An enterprise deployment isolates specific functionality onto separate servers instead of running all services on the same server. This isolation enables different servers to use their resources for more specific purposes, such as running applications, providing database access capabilities, and processing jobs and data.

An enterprise-scale deployment means different things to different users. Choosing the best fit for your environment depends on what you want to do. If you want to deploy a set of applications to a large number of users with high availability, load balancing is a good fit for your organization. If you are supporting a relatively small number of users doing computationally intense work that can be parallelized, clustering is a good solution. If you need to support a larger group of users doing computationally intensive work and want to manage compute resources across a large number of servers, grid integration is the best choice. Many of these options can be combined to scale the platform to fit your requirements.

Pipeline Pilot supports a number of authentication mechanisms and integrates with most enterprise-scale user Authentication, Authorization, and Accounting (AAA) systems and user management systems. User-level impersonation, which allows job processes to run with the privileges of the authenticated user, is also supported. In some environments, user-level impersonation is desirable for data management, security, and other concerns. Impersonation also requires user accounts to have a minimum level of system access, which can be a security concern. While impersonation is supported across all configurations, it is more applicable in some configurations and less so in others. For load-balanced configurations or high-availability applications, impersonation is not recommended because it requires account-level access to the server. For grid integration, impersonation is very useful since grid engines also impersonate. The use of impersonation depends on your policy and security infrastructure.

Deployment Options

There are several options for deploying Pipeline Pilot:
■ Workstation
■ Standalone server
■ Web Farm
■ Clustering
■ Distributed grid computing

Standalone server

The Pipeline Pilot server can be configured to run as a standalone server for a single user or workgroup. In this mode, a single server handles all requests. A standalone server can be configured for single-port operation. This deployment covers most team collaboration cases, and a sufficiently powerful single server (many processing cores, large memory, and/or large and fast I/O) can also be used for heavy computing.

Pipeline Pilot Standalone server


Web Farm

A Web Farm uses load balancing to distribute incoming HTTP requests across a group of servers. This provides high availability and scalability to handle high concurrent usage. This deployment can be used to deliver services for web applications and web services that are developed for and delivered on Pipeline Pilot.

To meet the high-availability requirements within a Web Farm, each Pipeline Pilot server has an independent copy of the XMLDB, which contains the server configuration. To provide a stateless environment for web applications and services, all of the web servers use a common, shared job folder location.

A Web Farm deployment using load balancing requires additional administrative involvement to:
■ Manage the configuration of the servers to ensure consistent and manageable deployments by using configuration import and export.
■ Deploy protocols, components, and applications using package-based publication rather than publishing with Pro Client.

Notes:
■ Instructions for configuring your web farm for Pipeline Pilot are available in the Pipeline Pilot Server Installation Guide, which is available in the documentation zip file on the Download Center.
■ Load balancing is supported on both Windows and Linux. However, due to the differences in paths, only one type of operating system can be deployed in a load-balanced Web Farm.

Pipeline Pilot Web Farm

Clustering

Note: Clustering is only supported on Linux.

A cluster is a set of loosely connected computers that work together to function like a single system. Each computer in the cluster is known as a node and runs its own operating system. Clusters are usually deployed to improve performance and availability over that of a single computer, while typically being much more cost-effective than single computers of comparable speed or availability.

Clustering allows you to scale your server to take advantage of multiple servers for distributing jobs. The primary server, or head node, acts as a dispatcher to the other servers in the cluster. Each node in the cluster includes a Pipeline Pilot server and shares the configuration with the head node. A key element in clustering is shared storage, typically using the Network File System (NFS).

Clustering modes of operation

Clustering has two basic modes of operation:
■ Private mode – The client connects to the head node and all jobs are distributed to the runner nodes by the head node on a per-job basis. These requests are proxied to the runner nodes by the head node. The client does not connect to the runner nodes.
■ Public mode – The client initially logs into the head node and is assigned a new runner node for each job. The client contacts the runner nodes directly.

Note: In both modes, jobs do not normally run on the head node or primary server.

Pipeline Pilot Private Cluster


Pipeline Pilot Public Cluster

Distributed grid computing

Grid computing is a type of distributed computing where CPU resources are shared across a computer network, allowing all connected machines to operate as a computing cluster. You can deploy distributed grid computing technologies such as Univa Grid Engine (UGE), PBS Professional®, and IBM® Platform LSF® on the platform.

With distributed grid computing, you can:
■ Access extra computing power to work more efficiently
■ Queue long-running jobs
■ Efficiently distribute protocols over cluster nodes
■ Parallelize a protocol and distribute subsets of the data efficiently over cluster nodes
■ Take advantage of improved capabilities for computer hardware administration and performance tuning
■ Share scheduling with other HPC applications

With grid technology, a large number of jobs can be farmed out to a computing cluster, providing efficient use of enterprise computing resources. You can also queue specific long-running protocols, optimizing the turnaround for critical short-running protocols. Overall, this allows a group of clients to run more protocols with maximum efficiency for each individual job.

Pipeline Pilot 2016 Grid Integration


Connectivity

Pipeline Pilot provides a number of methods to directly communicate with the server.

Ports

Pipeline Pilot has two common public ports that it listens on and three private ports for inter-process communication. These ports are defined as part of the initial installation and, if required, can be modified after installation. By default, the following ports are configured for Pipeline Pilot:
■ HTTP: 9944
■ HTTPS: 9943
■ Tomcat Shutdown Port: 9945
■ Tomcat HTTP Port: 9946
■ Derby Port: 9947

The HTTP and HTTPS port numbers are usually defined during installation. The other three are automatically selected during installation. However, you can explicitly define all port settings. All use of the Tomcat-specific ports is internal to the server node, so they should be blocked externally to increase protection. More details are available in the Admin Portal Help Center at: Admin > Setup and Configuration > Server Configurations > Reconfiguring Ports

For Internet-based deployments, Pipeline Pilot supports front-end devices such as firewalls, web application firewalls, proxy servers, and load balancers.

Port forwarding

Port forwarding is used when the server is placed behind a firewall. Port forwarding is the process of intercepting traffic bound for a certain IP/port combination and redirecting it to a different IP and/or port. This redirection can be accomplished by an application running on the destination host, or it can be performed by intermediate hardware, such as a router or firewall.

Reverse proxy

A reverse proxy server is a proxy on the back end of the browser-to-web-server chain. Instead of sitting between the browser client and the internet, it sits between the web server and the internet. It acts as an intermediary for servers that need to offload the handling of the initial HTTP request. A reverse proxy can lower the server workload by terminating SSL connections (which offloads SSL processing from the primary servers), caching common HTML, and responding to or discarding invalid HTTP requests. A reverse proxy also provides security by blocking known attacks against web applications. A whitelist can also be configured on the reverse proxy to restrict access to certain areas of the back-end web application. Placing the reverse proxy in a DMZ adds an additional layer of security by not allowing direct access to the web server from the Internet and only allowing network connectivity from the reverse proxy's internal interface. The chapter on Security provides more details on this subject.

Note: Reverse proxies and port-forwarding firewalls are supported on both Windows and Linux. However, due to the differences in paths, only one type of operating system can be deployed across the enterprise.

Pipeline Pilot Reverse Proxy Port Forwarding
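If you want to verify the port layout described in the Ports section above after installation, a quick reachability check from a client machine can help. The following is a minimal sketch and not part of the product; the host name is hypothetical, and the port numbers are the installation defaults (adjust them if you changed the ports during installation).

```python
import socket

HOST = "pps-server.example.com"  # hypothetical host name; replace with your server

# Default Pipeline Pilot ports (see the Ports section above).
PUBLIC_PORTS = {"HTTP": 9944, "HTTPS": 9943}
PRIVATE_PORTS = {"Tomcat Shutdown": 9945, "Tomcat HTTP": 9946, "Derby": 9947}

def is_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

if __name__ == "__main__":
    # Public ports should be reachable from client machines.
    for name, port in PUBLIC_PORTS.items():
        print(f"{name} ({port}): {'open' if is_open(HOST, port) else 'unreachable'}")
    # Tomcat/Derby ports are internal to the server node and should normally
    # be blocked from outside the server.
    for name, port in PRIVATE_PORTS.items():
        status = "open (consider blocking externally)" if is_open(HOST, port) else "blocked or unreachable"
        print(f"{name} ({port}): {status}")
```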

Web Port

Note: Internet Explorer blocks cookies from a server if the server name contains characters other than letters (A-Z), digits (0-9), the minus sign (-), and the period (.). Servers named using a character outside this set, such as an underscore ("_"), may not work with Pipeline Pilot.

Endpoints

Pipeline Pilot provides a number of endpoints that can execute jobs, perform administration, find documentation, make web service calls, and so on. The following endpoint list details their ideal usage. Additional details can be found in the Admin Portal Help Center, which is part of the product after it is installed on a given server.

Guidance: Use RESTful services when working with Pipeline Pilot to gain greater flexibility and control when running protocols deployed on Pipeline Pilot.

<server> is a reference to the URI location of the server, typically http://<server>:<port>. <ServerInstall> is a reference to the installation path of Pipeline Pilot on a server.

Endpoint – Location – Usage
Server Home Page – / – Landing page
Web Port – /webport/ – Used to run protocols from a browser
Admin Portal – /admin/ – Server configuration and administration
Client Install – /clientinstall/ – Used by end users to download the Pro Client
Install DS Builder – /dsbuilder – Download Data Source Builder (related to the Query Service)
Install Draw – /draw – Download BIOVIA Draw
Help Center – /help/docs/ – Documentation for users, administrators, and developers
Command Line [1] – <ServerInstall>/bin/RunProtocol.exe (Windows), <ServerInstall>/linux_bin/runprotocol (Linux) – Not really an endpoint; a client useful for automation (for example, scheduled tasks)
RESTful Launch Job [2] – /auth/launchjob – Access to protocols as RESTful web services
SOAP Polling [3] – /scitegic/service/async – SOAP access to run asynchronous protocol jobs with SOAP-based authentication
SOAP Blocking Authenticated [3] – /auth – SOAP access to run synchronous protocol jobs with HTTP-based authentication
SOAP Blocking Anonymous [3] – /auth/anon/ – SOAP access to run synchronous protocol jobs with anonymous access
SOAP Blocking [3] – /wsse – SOAP access to run synchronous protocol jobs with WS-Security authentication
Query Service – /auth/queryservice
XMLDB endpoint – /scitegic/xmldb

References – The following references are available in the Admin Portal Help Center at: Developers > Client-side Integration
[1] Command-line Protocols: RunProtocol.exe User Guide
[2] RESTful Web Service Guide
[3] SOAP Web Services Guide

When connecting to Pipeline Pilot using a URI, the following diagram shows the pattern of usage within the URI.
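As a rough illustration of the RESTful Launch Job endpoint listed above, the sketch below posts a request to /auth/launchjob using HTTP Basic authentication. The server URI, credentials, and especially the request parameter name are assumptions for illustration only; consult the RESTful Web Service Guide (reference [2] above) for the actual request format and response handling.

```python
import requests  # third-party HTTP client (pip install requests)

SERVER = "https://pps-server.example.com:9943"  # hypothetical server URI (default HTTPS port)
AUTH = ("pp_user", "pp_password")               # hypothetical credentials

# Illustrative only: the real parameter names and payload format are
# documented in the RESTful Web Service Guide referenced above.
payload = {"protocol": "Protocols/Examples/My Protocol"}  # assumed parameter name

# If your server uses a self-signed certificate, point verify= at your CA bundle.
response = requests.post(f"{SERVER}/auth/launchjob", data=payload, auth=AUTH)
print(response.status_code)
print(response.text)  # typically identifies the job created on the server
```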



Folders

Pipeline Pilot makes extensive use of file system folders to store system configuration and data. While Pipeline Pilot works out of the box, the administrator should explicitly define the policy and security settings for these locations. The following table describes the default folders, the global property that protocol authors use to reference a given folder (if applicable), and their associated usage.

Note: <ServerInstall> refers to the path to the server installation; the locations below are relative to it.

Location – Global Property – Usage
/apps – Applications deployed via a Pipeline Pilot package. Also a web location that is accessible via a URI.
/bin – Core libraries and utility programs
/config – Configuration files for some Pipeline Pilot-related services
/database/solr – Used by the Catalog indexing service
/database/Insight – Used by Insight
/database/GEMS – Used by the GEMS database in Derby
/install – Installation support files
/licensing – Licensing files
/logs – Installation and usage logs for a given deployment
/public – SharedPublicDir
/public/appcatalog – Used by the Catalog indexing service
/public/chartingservice – Used to store temporary files created while interacting with charts generated using the Reporting collection
/public/server – Manages details on other defined servers
/public/users – UserDir – Stores protocols and data related to a given user
/temp – TempDir – Temporary location with multiple files
/web – Location of the web server configuration and parent folder of publicly accessible web content folders, such as /apps and /webport
/web/jobs/<user>/<jobid> – JobDir – Location of the job folder, which is dynamically created at runtime for a specific user and job
/xmldb – The XMLDB directory stores protocols and components specific to the server. Some of these are public (the Components and Protocols subfolders), and some are owned by specific users (the Users subfolders). Besides having a backup strategy for this data, you can relocate the directory to a more appropriate location for these valuable resources.

Software development tools

While many interactions with Pipeline Pilot can be done using the web services listed in the previous section, additional software development tools are available to further abstract the interactions with Pipeline Pilot. Java, .NET, and JavaScript can talk to dedicated APIs, provided as part of the client-side SDKs, which abstract the internal web services layer.

Guidance: BIOVIA recommends that you utilize the web service methods within Pipeline Pilot rather than the SDK for greater performance and control when interacting with Pipeline Pilot.

<ServerInstall> is a reference to the installation path of Pipeline Pilot on a server.

SDK – Location
.NET Client SDK – <ServerInstall>/apps/scitegic/clientsdkdotnet
Java Client SDK – <ServerInstall>/apps/scitegic/clientsdkjava
JavaScript SDK – <ServerInstall>/apps/scitegic/clientsdkjavascript


Chapter 2: Architecture

Pipeline Pilot provides several ways to publish and access HTTP-based web services. Both SOAP and RESTful web services can be accessed through various client channels. For example, Pipeline Pilot provides language-specific client SDKs, protocol-specific SOAP services, and RESTful web services. Pipeline Pilot uses HTTP-based web services to provide modular, portable communication between the server and the various client types. These web services run on an Apache 2 HTTP server, supported on both Windows and Linux platforms.

Pipeline Pilot also includes an embedded Apache Tomcat server (http://tomcat.apache.org/). The Pipeline Pilot installer automatically installs, configures, and starts the Tomcat service for you. Tomcat must be installed and running to support core services such as task scheduling, fast chart updates, and the Query Service.

Apache

The Apache 2 HTTP Server (Apache) acts as a gateway to all back-end web services. Many web services are deployed directly within Apache's module-based system. Module-based services support requests such as user logins, locating all available web services, navigating file systems on the server, accessing the XMLDB, and running protocols. However, Apache also acts as a front-end web proxy to services hosted in other containers such as CGIs, J2EE web applications, and protocol-based services running within data flow servers.

The value of Apache is that it provides a single point of access and control for all incoming requests. Apache handles authentication and authorization for each client request, confirming the identity of the requesting user and their permission to access the back-end service or resource. Apache then routes incoming requests to the correct service container and balances these resources across all client connections. Apache can balance requests across multiple server nodes in load-balanced and clustered configurations. The Apache web server acts as a front-end filter to all Tomcat requests and validates the security of all requests through the Pipeline Pilot security model.

Considerations for deployments on Microsoft Windows

On Windows, Apache is installed as a Windows service that restarts automatically upon reboot. Windows services run, by default, as the Local System user, who has limited network and domain access rights (for example, the Local System user cannot access UNC paths on many networks). Although this setup is appropriate for many services, BIOVIA recommends that you run the service under another user who has administrative rights on the server machine. If you are not planning to use impersonation for running protocols, ensure that the Apache user has appropriate network rights for performing all the tasks that the user protocols require.

Considerations for deployments on Linux

On Linux, Apache is optionally configured to start by a boot script that is executed when the machine reboots. If you are not using impersonation, all resource access rights for running protocols are based on the Apache user account specified during installation. Ensure that the Apache user has appropriate rights for carrying out all the tasks that the user protocols require (such as access to mounted drives).

Best practice for Apache configuration

If you have a large server, it is reasonable to increase the thread count so you do not have to deal with lock-ups or blocking job timeouts. If you increase the thread count, you can also increase the blocking job timeout.

Tomcat

Pipeline Pilot is tested and tuned to work with specific applications and services. BIOVIA recommends that you do not deploy custom or third-party applications to the Tomcat server that is embedded on the platform. More information on the Tomcat server can be found in the Admin Portal Help Center at: Admin > Introduction > Web Services Overview

Pipeline Pilot Manager

Since Pipeline Pilot leverages both Apache and Tomcat, the BIOVIA Pipeline Pilot 2016 Manager ensures that both of these services are running properly. More information on the Pipeline Pilot Manager can be found in the Admin Portal Help Center at: Admin > Pipeline Pilot Services > Pipeline Pilot Manager > Managing Services with Pipeline Pilot

BIOVIA Query Service

Traditional scientific database systems load data into individual schemas, each system having its own client application. As the availability of external data grows, or companies merge systems, the number of database systems and platforms multiplies, and more systems are introduced to handle the increase in data. The complexity of the infrastructure increases, and data becomes difficult to manage and access across multiple systems and platforms.

The Query Service:
■ Solves the problem of searching across multiple data sources by mapping the metadata of each data source to descriptive field aliases. Queries are built using these field aliases, and can be mapped onto any data source for which a corresponding mapping is provided. This method eliminates the need to migrate data or to modify database schemas, and provides the flexibility to modify data sources.
■ Uses the Unified Query Language (UQL) for queries. UQL has a similar syntax to the widely used Structured Query Language (SQL). An XML query format is also available.
■ Is compatible with Oracle databases.


■ Is installed with Pipeline Pilot. Within Pipeline Pilot, the Query Service is managed by the Apache Tomcat servlet container.

More information on the Query Service can be found in the Admin Portal Help Center at: Admin > Reference Guides > Collections > Platform > Administration Guide: Query Service

Query Service RESTful API

The Query Service exposes a RESTful API that client applications can use to perform the following tasks:
■ Get the metadata of a data source to determine what data can be queried
■ Execute cataloged (that is, stored) and ad hoc queries
■ Combine recordsets (that is, sets of full records) or search results

The Query Service can provide responses in various formats, including XML (the default format), XDfile, SDfile, JavaScript Object Notation (JSON), and several character-delimited formats. For more details about available response formats, see the Query Service Developer's Guide. More information on the BIOVIA Query Service can be found in the Admin Portal Help Center at: Developers > Collections > Platform > Developer's Guide: Query Service

MongoDB

MongoDB is an open source document-oriented database system that is part of the NoSQL family of database systems. Some BIOVIA applications, such as BIOVIA Insight, utilize MongoDB. As part of Pipeline Pilot, a set of prototype components is available that can take advantage of the MongoDB capabilities. These prototype components are available in the folder Components/Data Access and Manipulation/Utilities/Prototypes/MongoDB.

MongoDB can be configured to work with Pipeline Pilot in the following ways:
■ Enterprise and medium-size deployments – For enterprise and medium-size deployments with over five simultaneous users, MongoDB must be installed on a separate server from Pipeline Pilot. This is because MongoDB uses large amounts of RAM and thus competes for system resources with other Pipeline Pilot processes.
■ Small or single-user deployments – For small deployments with 1-5 users on servers with sufficient RAM (8 GB or more), MongoDB can be installed on the same server as Pipeline Pilot.

MongoDB binaries are located in <ServerInstall>/apps/scitegic/mongodb/bin/<platform>/bin. More information on the MongoDB deployment can be found in the Admin Portal Help Center at: Admin > Server Maintenance > MongoDB > Installing and Deploying MongoDB
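When validating a separate MongoDB host for a medium or enterprise deployment, a quick connectivity check from the Pipeline Pilot server can be useful. The sketch below uses the standard pymongo driver; the host name and timeout are placeholders chosen for illustration, not values defined by Pipeline Pilot.

```python
from pymongo import MongoClient  # pip install pymongo

# Hypothetical dedicated MongoDB host, as recommended for deployments
# with more than five simultaneous users.
client = MongoClient("mongodb://mongo-host.example.com:27017/",
                     serverSelectionTimeoutMS=5000)

try:
    # 'ping' is a lightweight command that confirms the server is reachable.
    client.admin.command("ping")
    print("MongoDB is reachable, version:", client.server_info()["version"])
except Exception as exc:
    print("MongoDB is not reachable:", exc)
```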

Chapter 3: Hardware Guidance

While planning for your new deployment of, or upgrade to, Pipeline Pilot, there are areas around hardware selection and configuration that should be considered. While each deployment is different and depends on your specific business and technical requirements, you can use the following details as guidance.

Processor Guidance

As noted in the System Requirements, Pipeline Pilot requires an x64 processing architecture (Intel or AMD based). The system requirements specify 1 processing core per 2 simultaneous jobs. This standard guidance is based on using Pipeline Pilot as a collaboration server. For more information, see the Pipeline Pilot System Requirements document.

Processors that support hyper-threading can be leveraged by Pipeline Pilot. Operating systems make the hyper-threaded cores appear as regular cores, and Pipeline Pilot uses them as if they were physical cores. BIOVIA has not seen any degradation of performance with hyper-threading enabled; hyper-threading is supported.

Memory Guidance

Pipeline Pilot is architected to be extremely efficient with memory usage. As data moves along the data pipeline (as defined by a protocol), Pipeline Pilot works with a single data record at a time as it passes it along, which keeps memory usage extremely efficient. However, certain protocol designs and component usage can impact memory usage and performance. Examples of BIOVIA products and collections that are typically memory intensive are:
■ Next Generation Sequencing collection
■ BIOVIA Discovery Studio
■ BIOVIA Materials Studio
■ Imaging

The following component types within Pipeline Pilot also have the ability to consume large amounts of memory, depending on how much data is flowing through the pipeline:
■ Aggregators (such as Sort, Group, or Merge Data)
■ Readers

Operating System Guidance

Pipeline Pilot works on both Windows and Linux, and many protocols and components also work on both operating systems. However, a few capabilities are limited to one operating system. Some of these restrictions are due to the underlying technologies being utilized, such as the Microsoft .NET Framework, which is not available on Linux. The limitations are:
■ Windows only: VB Script (on server), .NET, Python, ISIS
■ Linux only: Cluster, Grid, and NGS deployments


Server Virtualization

Server virtualization can offer many benefits, such as consolidating systems, workloads, and operating environments. Scenarios in which hosting Pipeline Pilot on a virtualized server might be desirable include:
■ Deploying test/development servers and installing new Pipeline Pilot versions without jeopardizing production workloads
■ Deploying Pipeline Pilot on multiple operating systems using one physical server

Note: Some BIOVIA products that utilize Pipeline Pilot as the core platform, such as Discovery Studio and Materials Studio, are not currently supported in virtualized environments.

When creating a virtual server for deploying Pipeline Pilot, the following must be considered:
■ The virtualization hypervisor must be fully supported by its vendor.
■ The virtual server must meet or exceed the specifications stated in the Pipeline Pilot System Requirements document.
■ The underlying physical hardware must have sufficient resources (that is, CPU, RAM, etc.) to manage the virtual server as well as any other processes, such as the virtualization hypervisor, that are also running on it.
■ For a scenario where multiple virtual servers are managed through the same virtualization hypervisor, physical server resources must be statically allocated to the virtual server(s) hosting Pipeline Pilot and any associated database instances. This should include:
■ Dedicated physical cores for the virtual server
■ Dedicated physical memory for that virtual server
■ A dedicated separate network adapter for that virtual server
■ A dedicated separate disk volume or RAID array for that virtual server
■ A dedicated separate network adapter for network attached storage (for example, iSCSI) or a high-performance I/O connection (for example, Fibre Channel)

Optimizing Grid Hardware

You can optimize your grid hardware configuration to process jobs on the grid. To alleviate memory load from the head node on the server, consider any available node on the grid as a potential head node for the protocol job. Your runner node should have the same privileges as the head node.

Chapter 4: Planning Your Server Deployment

Note: For details about hardware and software requirements for the platform, see the Pipeline Pilot System Requirements document.

Users access Pipeline Pilot with a variety of applications, including Pipeline Pilot Web Port and the Pipeline Pilot Professional Client (Pro Client). This is the typical case for Pipeline Pilot deployed as a personal productivity or work group server. The goal of performance tuning is to balance the running time of protocol jobs versus allowing multiple users to effectively access server resources. Other items to consider as part of your planning:


The implementation of Windows versus Linux

Folder locations and impact on performance

Pool usage and performance tuning (see Tuning Server Settings for more information)

Apache impersonation setting (this might require a new local or domain user account)

Security settings for users connecting via Kerberos, Impersonation, File, etc.

Access to operating system resources: file permissions, ODBC settings (for more details, see the Pipeline Pilot Server Install Guide)

Web Farm

Web requests tend to have very short run times with high frequency, and they commonly access the same resources multiple times over. In this profile, most users access server resources through deployed web applications such as Database Search, Biological or Chemical Registration, or through the web services API. Pro Client use is infrequent and perhaps not permitted by security policy.
■ Performance tuning in this case focuses on maximizing server throughput in terms of requests per second.
■ With all of the availability and concurrency advantages that come with a load-balanced web farm, there are some disadvantages, as noted below:
■ Authoring directly against a Web Farm with Pipeline Pilot is not supported.
■ Running model-building protocols within a Web Farm is not supported because they write into the XMLDB. However, you can run a model-building protocol on a development or staging server and deploy the resulting model using a package to the Web Farm nodes.
■ Parallel subprotocols typically are NOT distributed to multiple nodes as with cluster/grid deployments.


Other items to consider as part of your planning:


Requires shared storage.

All web server nodes must be the same operating system.

All web server nodes must have the same folder structure and configuration.

Applications, protocols, and components need to use Packaging to be distributed across the nodes.

Windows-based deployment requires disabling directory caching on the shared volume.

Protocols should use the global property SharedPublicDir when referencing the shared public folder.

Parallel Computing on Distributed Deployments

Most server access comes from clients such as Discovery Studio, Pro Client, or batch jobs. In this case, a few jobs typically run for a long time and many tasks may queue. Parallel subprotocols are often used to reduce the running time of jobs by parallelizing tasks. The scheduling of jobs is often delegated to an external scheduler system. Performance tuning focuses on maximizing server performance in terms of data-record or process throughput.

With parallel computing, you can efficiently distribute individual jobs to cluster nodes, making it possible to tackle extremely large data sets that were previously off limits, enhancing the client parallel processing capabilities. You can run any subprotocol in parallel on a specified set of remote servers and specify data record batch size, remote hosts, and number of simultaneous jobs. Optionally, you can run these parallel subprotocols under a distributed grid computing engine, giving you more options for job management and load balancing. You can also run normal protocols under the grid engine.

Note: Distributed grid computing is only supported on Linux.


Grid example protocol for Pipeline Pilot. The protocol "User and Host" has two implementations that allow the protocol to run on a grid. Server is set to "localhost" and "Run on Grid" is set to true.

Clustering

When Pro Client requests to run a protocol, the running protocol and its resulting data are known as a "job". Clustering is a technique that provides scalability for jobs by efficiently using available network resources. Jobs requested by clients connected to a given server are delegated to other computational nodes. The cluster is defined by the set of nodes available for delegation.

Clustering capabilities are available for Pipeline Pilot server products that run on the Linux platform. In this configuration, a single Pipeline Pilot installation on a shared file system is accessed from multiple server machines. The servers share the same installation file system and the same protocol database (XMLDB). The only unique aspect of each server node is the specific set of jobs assigned to it for execution.

Notes:
■ Clustering is only supported on Linux.
■ The configuration for grid engine integration is virtually identical to the configuration for clustering. This guide provides details for each.


Clustering requirements

A Pipeline Pilot cluster consists of a set of computing nodes that share a mounted file system upon which you install the Pipeline Pilot software. Server clustering requires:


One node is designated as the head node. All other nodes are treated as running nodes. When configured this way, all clients for the cluster connect to the head node, while end users do not require any knowledge of the other nodes.

All nodes must have access to the shared file system where you install Pipeline Pilot.

The path to the Pipeline Pilot installation must be identical for all nodes. (For example, the path /mnt/shared/scitegic is valid for all nodes, head and running.)

All client machines running Pro Client software have HTTP access to the head node. If client machines do not have access to the running nodes, you can set a Private Cluster option. (For more information go to Admin Portal Help Center: Admin > Setup and Configuration > Server Deployments > Clustering).

To request web services, all nodes have HTTP access to the head node, and the head node has HTTP access to all running nodes.

A local (non-NFS) directory is required on each node with write access for lock file management. The default path is /var/tmp. The installer lets you specify a location that must exist on all nodes with RWX (Read, Write, Execute) permissions for all users.

Each server node is installed with the operating system packages as specified in the system requirements for Linux servers. (See the Pipeline Pilot System Requirements document.)

Managing nodes and delegating jobs

It is necessary to maintain a list of the nodes in the cluster, and this list is stored in the Admin Portal. The list is used to delegate jobs according to a clustering algorithm. Two delegation methods are available in the Admin Portal:
■ a simple round-robin technique
■ one that attempts to maintain a level number of jobs running on each node

Ensure that an instance of the Pipeline Pilot software is running on each node in the cluster. If one node is out of action at any point, the head node selects a different node, so the node list does not need to be maintained on a minute-by-minute basis.

Planning for your cluster deployment

When planning a server cluster, follow these guidelines:
■ Ensure that you have a set of designated servers that allow HTTP access to each other. Select one node as the head node, so it can serve as the point of contact for Pro Clients. The primary node requires HTTPS access from client machines.
■ Ensure that all compute servers have identical access to a shared file system accessible from the same path on all servers.
■ In a cluster of any significant size, the head node should not be included in the list of job-running nodes, because it cannot delegate a job to itself. The head node supports various services for the cluster as a whole, including secure login, job delegation, file system browsing, and XMLDB sharing. The execution of computing-intensive jobs on the primary node can compromise the performance of the cluster as a whole and outweigh the advantage of including an extra computational node for job execution. With a small cluster, the balance favors making this node play its part both as a job runner and as a head node.
■ The system hardware and software requirements for each node in the cluster are the same as for any server. (See the Pipeline Pilot System Requirements document.)

Grid Engines

A grid installation is virtually identical to a cluster installation. The only real difference is that instead of Pipeline Pilot running on the cluster nodes, the grid engine software manages the cluster nodes. Clusters are well suited to mixed use cases (some ad hoc development mixed with computation), whereas grid usage is ideal for large-scale resource management. This is due to the performance differences with short-running jobs.

Grid Engine requirements

See the Pipeline Pilot System Requirements for a list of supported grid software.

Notes:
■ Install Pipeline Pilot on a Linux system that can support your distributed grid computing system. There are no additional hardware and software requirements beyond what is specified for a normal installation (see the Pipeline Pilot System Requirements document).
■ The Admin Portal includes grid engine support settings, including the type, path, and default queue name. It also provides a list of queue names for a client to use at run time, but this does not affect the operation of the server in any way.

Server Performance Tuning

As a general platform for scientific informatics, Pipeline Pilot supports a wide variety of use cases. Proper performance tuning depends heavily on the typical usage your server must handle.

Usage profiles

■ General Profile – Users access Pipeline Pilot with a variety of applications including Pro Client and Web Port. This is typical for personal productivity or work group settings. The goal of performance tuning is to balance the running time of protocol jobs versus allowing many users to use the system effectively.


■ Web Profile – Pipeline Pilot is more frequently or exclusively accessed by web applications and services. Web requests usually have short running times but occur frequently and commonly access the same resources, like database connections. Example web applications include Database Search, Biological or Chemical Registration, or SDK clients. Direct access to Pipeline Pilot through the server is infrequent and usually only permitted for administration. Performance tuning in this case focuses on maximizing server throughput in terms of requests per second.
■ HPC Profile – Most access to Pipeline Pilot comes from clients like Discovery Studio, Pro Client, or batch jobs. In this case, a few jobs typically run for a long time and many tasks can enter the queue. Parallel subprotocols often reduce the running time of jobs by parallelizing tasks across several processes. Performance tuning focuses on maximizing server performance in terms of data-record or process throughput.

Job Modes

Pipeline Pilot has two basic job modes for running jobs:
■ Polling – The client connects to the head node, creates a job, launches the job, and then polls for status. Polling is synonymous with asynchronous execution, which is the default mode for all jobs launched from the Pro Client.
■ Blocking – The client connects to the server and makes a request. The server responds with the final job status and results only when the job completes or times out. In this case, the server "blocks" until the job execution is finished. Blocking is synonymous with synchronous execution. Clients, such as Web Port or SDK clients, can choose to launch jobs in blocking mode by making the appropriate calls. Blocking jobs are the same as typical web server requests.

The following table summarizes each profile in terms of the user load and job types:

General Profile
Client Examples & Usage: Pro Client, Web Port
Job Types: Mostly polling jobs
Concurrent Users: < 10
Job Running Time: < 30 seconds

Web Profile
Client Examples & Usage: Database Search, Web Port, Biological Registration, Chemical Registration, SDK Clients
Job Types: Mostly blocking jobs
Concurrent Users: 10 or more
Job Running Time: < 5 seconds

HPC Profile
Client Examples & Usage: Pro Client, Discovery Studio, Batch Jobs
Job Types: Mostly polling jobs
Concurrent Users: < 5
Job Running Time: Typically long running


Job Pools

Pipeline Pilot can reuse the scisvr process daemons in named pools. Job pools apply to both polling and blocking jobs, but their performance impact is much clearer for blocking jobs with fast running times. While Pipeline Pilot manages these named pools for you, background information is helpful to understand performance tuning.

A process diagram of Pipeline Pilot server job pools

Pooled process daemons can cache resources to reduce latency and increase server scalability. Named pools help associate process daemons with expensive resources like the Java Virtual Machine, database connections, and even metadata. Pipeline Pilot automatically expands and contracts process pools to accommodate the load and to keep the server operating at peak efficiency. Job pools start out with a minimum size and expand as needed to handle more concurrent job requests to each named pool. Conversely, process pools also contract under several possible scenarios:
■ A process daemon consumes too much memory
■ A process daemon idles for too long
■ The server's total physical memory becomes low

Built-in job pools

Each server can have many different job pools set up and active at the same time. Some job pools run with pre-defined configurations, such as:
■ Warm-up Pool – The "warm-up" pool prepares daemon processes to handle incoming requests. Each process handles exactly one request and then exits. This decreases the latency for jobs that do not specify a specific job pool, such as jobs launched from the Pipeline Pilot Pro Client application.
■ Keep-warm Pool – The "keep-warm" pool runs scheduled tasks. The sole purpose of these tasks is to keep the server binaries loaded into operating system memory and page cache. This improves process start-up times on systems that see long idle periods or support a mixture of different programs.
■ Default Pool – This pool simply defines the default settings for named pools that might be created at runtime.

Job pools and impersonation

When Pipeline Pilot runs with authentication but not impersonation, pooled scisvr processes run under the Apache web server user account. In this case, Pipeline Pilot efficiently shares job pools across multiple client users. Impersonation is useful for client-user authorization in the general and HPC profiles. However, impersonation adversely affects scalability in the web profile. With impersonation enabled, each pooled process runs under the identity of the client user. The net effect is that the pool settings apply on a per-user basis. Job pools cannot scale as well under impersonation because unique client users must have their own unique processes. This reduces the effectiveness of resource caching and can consume extra system resources, like memory.

Tips:
■ If possible, turn off impersonation when you deploy Pipeline Pilot in a web profile environment.
■ If you must run with impersonation on, divide the pool configuration recommendations given below by the number of expected concurrent users. For example, if a guideline suggests 16 persistent daemons and you expect 8 concurrent users, start with about 2 daemons per pool. If you do not know the number of concurrent users, start with a pool size of 1 - 2 and adjust upward if needed.
■ The currently running job pools are listed at: http://<server>/scitegic/managepools?action=debug

Effect of impersonation on job pools
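The pool-status URL mentioned in the tips above can also be queried programmatically, for example to watch pool growth while load testing. The following is a minimal sketch; the server name and credentials are hypothetical, it assumes the endpoint accepts the same authentication as other server endpoints, and the response format is not specified here, so the output is simply printed as text.

```python
import requests  # pip install requests

SERVER = "http://pps-server.example.com:9944"  # hypothetical server and default HTTP port
AUTH = ("pp_admin", "pp_password")             # hypothetical credentials

# Lists the currently running job pools (see the tip above).
response = requests.get(f"{SERVER}/scitegic/managepools",
                        params={"action": "debug"},
                        auth=AUTH)
response.raise_for_status()
print(response.text)
```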


Tuning Server Settings

Several server parameters are available for tuning on Pipeline Pilot servers. You can find a simple set of tuning parameters described in the Admin Portal Help Center at: Admin > Setup and Configuration > Jobs Settings. In most cases, the default settings are sufficient. You can also perform more advanced tuning by editing the pool configuration files in the Apache web service configuration.

Admin Portal job settings

The job settings defined in the Admin Portal act as global defaults for all daemon pools and warm-up processes. The default settings installed out of the box are tuned for a general profile server running on a system with eight logical cores. Tune these settings to fit your server and your target usage profile.

Running Job Limit
Description: Maximum number of polling jobs that can simultaneously run on the server. Excess jobs are placed in a job queue. The default setting is 10 jobs. To disable the limit on running jobs, set to 0 or less.
General Profile: Set this value between 1 - 2 times the number of logical cores.
Web Profile: Set this value to 2 - 4 times the number of logical cores.
HPC Profile: Set this value between 1 - 2 times the number of logical cores.

Blocking Job Timeout
Description: Length of time (in seconds) before blocking jobs time out and return an error. The default is 10 seconds.
General Profile: Use the default value (10).
Web Profile: Set this value between 30 and 60 seconds. Web clients will receive an error message when jobs time out.
HPC Profile: Use the default value (10).

Job Priority Switching
Description: When enabled, job processes are downgraded to a lower priority after 10 seconds. This feature is enabled by default. Normally, this setting does not affect performance significantly.
General Profile: Use the default value (On).
Web Profile: Use the default value (On).
HPC Profile: Use the default value (On).

Maximum Number of Simultaneous Parallel Processing Subprotocol Jobs
Description: Maximum number of simultaneous jobs that can be spawned to support each individual parallel processing subprotocol. When the server hosts a parallel subprotocol operation, this setting is used by the running protocol to handle x-number of subprotocol jobs for a specific subprotocol. The default number of jobs is 4. To disable this feature, set the value to 0.
General Profile: Use the default value (4) or less to provide more even load balancing across users.
Web Profile: Set this value to 0 or 1 to provide better load balancing across users.
HPC Profile: Use the default value (4) or more depending on your specific needs.

Maximum Number of Persistent Daemons per Job Pool
Description: Maximum number of persistent daemons associated with a particular job pool. A single job or a set of related jobs can use daemons from the same job pool. After a pooled job finishes running, a daemon remains resident in memory for a configurable period of time waiting for new job requests. Running a job in a daemon is much faster because it skips initialization of many internal data structures. When the server is using a high amount of memory, no new daemons are created. The default maximum number of persistent daemons per job pool is 16. No daemons are created when the setting is set to 0.
General Profile: Set this value between 1 - 2 times the number of logical cores.
Web Profile: Set this value to 2 - 4 times the number of logical cores.
HPC Profile: Set this value to 0. HPC servers do not usually need job pools. Avoiding job pools will also free system resources and allow job scheduling software to manage these systems more effectively.

Persistent Daemon Timeout (seconds)
Description: Maximum time (in seconds) that an idle daemon will remain resident in memory before it is shut down. The default timeout is 300 seconds (5 minutes).
General Profile: Use the default value (300).
Web Profile: Use the default value (300).
HPC Profile: Use the default value (300).

Job Readiness Refresh Rate (seconds)
Description: To keep the server warm during periods of inactivity, a single protocol runs periodically to keep system files in memory and improve performance when starting new jobs. This manifests as an additional scisvr process. The configured time, in seconds, sets the amount of time between each protocol execution. The default is 300 seconds (5 minutes). To disable this feature, set the value to 0.
General Profile: Use the default value (300).
Web Profile: Use the default value (300).
HPC Profile: Set to 0 to disable this feature.

Pre-started Daemons for Non-Pooled Jobs
Description: When set to a number > 0, a pre-initialized daemon is used to run jobs that are not assigned to any job pool.
General Profile: Set to 2 or more.
Web Profile: Set to 2 or less.
HPC Profile: Set to 0.


Note: These guidelines refer to the number of logical cores. This value is normally equal to the number of physical cores available on your server. If hyper-threading is enabled, the number of logical cores is double the number of physical cores. For example, a quad-core processor with hyper-threading presents eight logical cores. The platform's multi-process architecture works well with hyper-threading.

Job pool configuration

In most cases, the settings available in the Admin Portal are sufficient to configure and tune job pools on your server. However, more detailed configuration options are available through specific configuration files. The main SciSvrPolConfig.xml file, found in <ServerInstall>/apps/scitegic/core/xml/Objects/, provides the default server pool configuration. These settings can be customized by editing them in the Admin Portal or by adding pool-specific configuration files in your package's xml/Objects folder or in the global xml/Objects directory. The Job pool settings list below provides details for each configuration option.
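For orientation, the following is a hedged reconstruction of what the default pool configuration might look like. The element names are assumptions that simply mirror the setting names in the list below; the values shown are the shipped defaults described there (Format 1, provenance scitegic/core, up to 16 persistent daemons, a queue depth of 32, and so on). Check SciSvrPolConfig.xml on your own server for the exact markup.

```xml
<!-- Sketch only: element names are assumed from the setting names listed below. -->
<PoolConfiguration>
  <Format>1</Format>
  <PackageProvenance>scitegic/core</PackageProvenance>
  <StartServers>0</StartServers>
  <MinimumSpareServers>0</MinimumSpareServers>
  <MaximumSpareServers>-1</MaximumSpareServers>
  <MaximumSpareServersTrimTime>5</MaximumSpareServersTrimTime>
  <MaximumServers>16</MaximumServers>
  <MaximumQueueDepth>32</MaximumQueueDepth>
  <MaximumRequestsPerServer>-1</MaximumRequestsPerServer>
  <IdleTimeout>300</IdleTimeout>
  <WarmupProtocol></WarmupProtocol>
  <RerunWarmupProtocol>0</RerunWarmupProtocol>
  <MemoryThreshold>80</MemoryThreshold>
  <IndividualUsageThreshold>15</IndividualUsageThreshold>
</PoolConfiguration>
```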

Job pool settings Setting Description

Format A flag used to version the job pool configuration details. Always set this to 1 unless otherwise directed.

Package Provenance The name of the package that defines the job pool. Set this to the name of your package or to "scitegic/core" if the pool settings are defined in the global XMLDB location.

Start Servers The number of scisvr processes to launch for the pool when the Pipeline Pilot server starts up (default is 0).

Minimum Spare Servers The minimum number of processes to keep in the pool when the pool is contracting (default to 0).

Maximum Spare Servers The maximum number of scisvr process that should be available at one time (default is -1). The default value of -1 indicates that the process available state is not checked and the pool will increase to its maximum total size and contract only when idle processes timeout.

Maximum Spare Servers Trim Time The number of seconds to wait before pruning scisvr processes down to the Maximum Spare Servers.

Pipeline Pilot: Deployment Guide | Page 33 Setting Description

Maximum Servers The total maximum number of scisvr processes that can exist in the job pool at any point in time. This value should be equal to or greater than the value specified in Maximum Spare Servers.

Maximum Queue Depth The total number of requests to hold in the queued state while waiting for scisvr processes to handle the requests. Pipeline Pilot will reject additional job requests when the queue reaches its maximum depth. A value of -1 indicates that any jobs in excess of Maximum Servers should still spawn new scisvr processes to handle the request. A value of -2 indicates that the pool should use the default maximum (default is 256).

Maximum Requests Per Server Each scisvr process will handle up to this many requests before exiting. A value of -1 indicates there is no limit.

Idle Timeout The number of seconds a scisvr process may be idle before it becomes available for pruning. A value of -1 indicates no idle timeout.

Warm-up Protocol The fully qualified path or GUID of a protocol to run when a new scisvr process starts and is not immediately assigned a user-submitted job. Use warm-up protocols to pre-load binaries and components, or to create connections that improve response time for user jobs.

Rerun Warm-up Protocol The number of seconds to wait before rerunning the warm-up protocol. This keeps resources loaded in memory and ready to serve requests.

Memory Threshold Pipeline Pilot will pool scisvr processes until the host operating system exceeds this overall physical memory threshold. Pipeline Pilot will prune pooled scisvr processes to prevent or reduce swapping to virtual memory, improving overall system performance and throughput.

Individual Usage Threshold The maximum percentage of physical memory that any individual process can consume before pruning.
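The pool configuration files under /xml/Objects are XML documents. The exact element names used by SciSvrPolConfig.xml are not reproduced here, so the following is only a minimal sketch, written as a small Python script, that assembles an illustrative pool configuration from the default values listed above. The element names are hypothetical placeholders, and the mapping of defaults to settings simply follows the order of the table, with the warm-up protocol assumed to be empty by default.

    # Illustrative sketch only: the element names below are hypothetical placeholders,
    # not the actual schema of SciSvrPolConfig.xml. The values mirror the defaults
    # listed above, in the order of the settings table.
    import xml.etree.ElementTree as ET

    defaults = {
        "Format": "1",
        "PackageProvenance": "scitegic/core",
        "StartServers": "0",
        "MinimumSpareServers": "0",
        "MaximumSpareServers": "-1",
        "MaximumSpareServersTrimTime": "5",
        "MaximumServers": "16",
        "MaximumQueueDepth": "32",
        "MaximumRequestsPerServer": "-1",
        "IdleTimeout": "300",
        "WarmupProtocol": "",          # assumed empty by default
        "RerunWarmupProtocol": "0",
        "MemoryThreshold": "80",
        "IndividualUsageThreshold": "15",
    }

    pool = ET.Element("JobPool")       # hypothetical root element
    for name, value in defaults.items():
        ET.SubElement(pool, name).text = value

    # Write the sketch to a local file for inspection.
    ET.ElementTree(pool).write("example_pool_config.xml", encoding="utf-8", xml_declaration=True)

Run this with a standard Python 3 interpreter and inspect example_pool_config.xml to see the shape of the settings; consult the product documentation for the real schema before creating a pool configuration file of your own.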


Chapter 5: Security

IMPORTANT! It is highly recommended that you review the recommendations for a secure deployment outlined in Secure Deployment.

Authentication
The following authentication mechanisms are configurable within the Admin Portal.
■ Anonymous: Anonymous access allows an unidentified user to execute protocols using a generic account, the credentials of which are stored in the Admin Portal. The generic account is generally a valid domain account, and this is required when using impersonation (see Impersonation). With Anonymous Access enabled, a user who is not otherwise authenticated by a previous login operation is not prompted for credentials. Instead, the anonymous user name and password are used automatically. This option can be disabled within the Admin Portal. Enabling anonymous access requires that the user name and password of the generic account be entered in the Admin Portal. For more information, go to the following topic in the Admin Portal Help Center: Admin > Security > Managing Authentication
■ File: File-based authentication is an internal method for defining user credentials. It provides a way to give someone a user credential without having that person authenticated by the other methods.
■ Local: For a standalone server or development computer, an IT administrator might decide to use only local computer user accounts to define permissions for that deployment. This is not ideal within an enterprise deployment and makes overall administration increasingly challenging as servers are added. However, local authentication might be suitable for smaller deployments or test servers where administration is not a priority.
■ Domain: Domain authentication authenticates against a set of user credentials not maintained by Pipeline Pilot (for example, a corporate directory). This feature can be used in conjunction with a Pipeline Pilot User List and supports options such as Any User Name, Local, and Domain. A common scenario is domain authentication under Windows using Active Directory. This can be achieved on a deployment where Pipeline Pilot is installed on Windows or by using PAM on Linux. On a Windows-based deployment, Kerberos and NTLM (SPNEGO) impersonation and delegation are also supported.
■ SAML web single sign-on: Pipeline Pilot can act as a service provider that federates with an identity provider based on the SAML 2.0 Web Browser SSO Profile specification. For more information, see the SAML Web SSO Administration Guide.

Authentication examples
■ Windows server with domain authentication and anonymous access: A server might expose some functionality that requires authentication (for example, protocol development or Web Port use), but also require anonymous access. The anonymous access can be used to run protocols from other web pages, such as those that update reports with real-time data. For this use, the Security > Authentication page in the Admin Portal is used both to enable domain authentication and to enter the credentials for anonymous access.

■ Windows server with domain authentication and impersonation: A Windows server in which protocol developers and users need access to restricted network resources generally requires domain authentication and impersonation. If the impersonation must cascade to child processes of the protocol, then impersonation must be set to Full. Otherwise, Restricted impersonation will suffice. In this scenario users must have, at a minimum, the Logon as a Batch Job permission on the server. If anonymous access is enabled, the credentials must be a valid Windows domain account.
■ Windows server with domain and file authentication: An administrator might need to extend access beyond the set of domain accounts. One way of doing this is to enable File authentication in addition to domain authentication. The server administrator enters the usernames, creates the passwords, and makes updates directly in the Admin Portal. This can be used to create a set of administrator accounts that have unique passwords for the Admin Portal. If, however, the additional users need to run protocols, then impersonation must be turned off.
■ High security Windows server with domain authentication: A deployment requiring the highest level of security leaves both file authentication and anonymous access disabled, since these provide alternatives to domain authentication. These deployments generally avoid impersonation as well, since impersonation requires a (modest) level of user access to the server machine itself. This setup also requires that SSL be used for authentication, which is another option in the Admin Portal. Finally, the highest level of security deploys (generally as packages) only protocols developed and validated on other servers, which is required on load-balanced installations.

Impersonation
Although setting up impersonation is optional, it is recommended. Whenever a job is executed, it runs under the context of a given identity. A number of options are available to control which identity a job should run under, including running jobs within the context of the user that submitted the request. This is typically referred to as impersonation. Under impersonation, jobs are run as if the user launched them from the command line, with only the privileges granted to the user account. This allows users and groups to keep their data private at the operating system level. Impersonation assumes that users have a system account on the servers that run Pipeline Pilot and are able to log in to access their data. For some applications, this might not be desirable (for example, a load-balanced application that does not require users to have a login ID). If impersonation is not used, protocols run by users have the same access rights as the server process. Impersonation is recommended for use with grid engines and clustering. Load-balanced configurations or public deployments might want to avoid using impersonation.

Note: Kerberos and NTLM (SPNEGO) based authentication is not supported on servers behind a reverse proxy (including a load balancer).

Authorization
Securing protocols and components
Protocols and components can be secured to limit access and/or use to specific users or groups. In addition to protecting against unauthorized use, this can provide users with a streamlined set of choices, removing from view functionality that applies to other teams, projects, or job definitions. This is typically done using the Access Rights page in the Admin Portal.
There are other ways to secure protocols and components. One way to limit use of Web Port protocols is to create custom Web Port instances for each set of users. Protocols might also include security checks within their set of pipelines. Putting protocols and components in a package can secure them against unwanted use and/or development. Finally, components whose implementation contains sensitive information can have developer access restricted, allowing other developers (and end users) to run the component without having read or write access to the implementation details.


IMPORTANT: You should explicitly define your security policies as part of your deployment and ongoing operations, and be mindful of these policies during migrations.

System application permissions
Pipeline Pilot provides a robust method of permission management that is used within Pipeline Pilot itself and also enables custom groups and permission management for other applications developed by BIOVIA, BIOVIA Partners, or internally within your organization. The following default system permissions are available:

■ Platform/Administration/Logon: Log into the Admin Portal and use administration components
■ Platform/PipelinePilot/Logon: Log into the client
■ Platform/PipelinePilot/Administer: Perform administration functions in the client
■ Platform/RunProtocol: Use the RunProtocol command line tool
■ Platform/WebPort/Logon: Log into Web Port
■ Platform/Logon: Connect to the Pipeline Pilot server (needed to create a session and perform any function)

Additional information on Groups and Permissions can be found in the Admin Help Center at: Security > Groups and Permissions > Groups and Permissions Overview

Secure Deployment
Secure deployment, whether on a corporate network or over the Internet, requires proper security configuration. Pipeline Pilot deployments can be Internet accessible without requiring a reverse proxy or virtual private network. However, doing so requires some protection from the Internet by using third-party networking, a firewall, and additional security precautions on the server itself. The following are recommendations for an IT architect to consider for an Internet-based deployment, but they should be validated by, and comply with, the security experts and policies within your organization.

Firewall
Pipeline Pilot should always be deployed behind a stateful firewall and expose only the Apache http and https ports, which are required to communicate with Pipeline Pilot.

Pipeline Pilot Internet Deployed Web Farm


SSL termination
Firewalls, reverse proxies, and load balancers can provide SSL termination, which offloads encryption and decryption from the primary server and provides a single certificate entity for a set of servers. Placing the endpoint device in a DMZ adds an additional layer of security by not allowing direct access to the web server from the Internet and only allowing network connectivity from the reverse proxy internal interface. SSL termination is fully supported and its use is recommended.

Secure connections
The Pipeline Pilot server supports secure connectivity using SSL. For performance and security reasons, you might want to leverage a networking device to handle the SSL termination. Pipeline Pilot supports unencrypted HTTP-only connectivity (configured by the administrator within the Admin Portal) so that communication between the SSL termination device and the Pipeline Pilot server uses HTTP. This setup is ideal for performance, but it requires that the network between these two devices is physically and logically secure to prevent "man in the middle" attacks and packet capturing. Unencrypted HTTP connectivity between the endpoint and the server also allows visibility for troubleshooting and security monitoring.
Pipeline Pilot ships with a self-signed certificate for secure communication between the server and any client connecting through SSL. This causes some clients, typically web browsers, to prompt the user to confirm that they want to connect to a non-trusted source. It is recommended that you obtain and install a trusted SSL certificate as part of your production deployment. While not required, this will eliminate the warnings seen by Web Port users in the Internet Explorer 8 browser. If the Pipeline Pilot Bridge for SharePoint (SharePoint Bridge) has been installed, it is also necessary to obtain a valid SSL certificate for the Pipeline Pilot server and to import the public key certificate into the SharePoint server's trust store.

Reducing attack surface and endpoints
When deploying an Internet-accessible Pipeline Pilot solution, it is important to reduce the attack surface and the endpoints that are exposed by Pipeline Pilot. Many networking devices (firewalls, proxies, load balancers, web application firewalls) provide features such as white-listing that allow only certain URLs to pass through the device and on to the server. This can be used to prevent outside requests from reaching sensitive locations on the server (for example, the Admin Portal at /admin). Endpoint devices can also be deployed to protect the Pipeline Pilot server from known web application attacks such as XSS and directory traversal.

Secure server settings
Pipeline Pilot provides a number of configuration settings that reduce the server's vulnerability to common attacks. Use the following recommendations. These settings are available on the Setup > Server Configuration page in the Admin Portal.
Notes:
■ Some of the default values will need to be changed.
■ This is not a comprehensive list of the Server Configuration settings. It represents the settings that have the most effect on security.

Setting: File Browsing
Default: Restricted
Recommendation: Leave Restricted. Users can only browse within the scitegic/public directory tree. When set to User folder only, users can only browse within their own user folder. Note that Impersonation, as set from the security configuration pages, can also control access to folders by individual users.

Setting: Unrestricted File Download
Default: [empty]
Recommendation: Leave empty. This ensures all file download requests are authenticated.

Setting: Job Directory Access
Default: Unrestricted
Recommendation: Change to Restricted. This ensures only the user that ran a job can access the result files.

Setting: Expiration of Single Sign-on Credentials
Default: 1d
Recommendation: Change to 0, which disables cookie-based single sign-on, or change to 30-60 minutes, which is recommended for web application usage.

Setting: Retain Session Cookie beyond Web Browser Session
Default: Yes
Recommendation: Change to No. Session cookies are then removed when the browser closes, so users must provide their credentials again when opening a new browser session.

Setting: Enable Server Info
Default: Yes
Recommendation: Change to No. This ensures sensitive information cannot be accessed from outside the Admin Portal.

Setting: Server Port Usage
Default: Default
Recommendation: Change to SSL Only. This ensures all traffic is passed over a secure SSL/TLS connection and session cookies or security tokens cannot be intercepted and misused.
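As a spot check of the white-listing guidance above, a quick probe from a host outside the protected network can confirm that sensitive endpoints such as /admin are rejected at the edge. This is a minimal sketch; the host name and port are placeholders for your own deployment.

    # Minimal sketch: confirm from an external host that the Admin Portal path is blocked.
    # "pipelinepilot.example.com" and 443 are placeholders for your own server and port.
    import urllib.error
    import urllib.request

    url = "https://pipelinepilot.example.com:443/admin"
    try:
        with urllib.request.urlopen(url, timeout=10) as response:
            print("WARNING: /admin responded with HTTP", response.status)
    except urllib.error.HTTPError as err:
        # A 403/404 returned by the edge device indicates the path is being filtered.
        print("/admin request rejected with HTTP", err.code)
    except urllib.error.URLError as err:
        # Connection refused, a timeout, or a TLS validation failure all land here.
        print("/admin is not reachable:", err.reason)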


Chapter 6: Planning for Operation
This section provides guidance on what an administrator should plan for after Pipeline Pilot is deployed for development or production.

Backup procedures
Backing up a server is a standard practice within IT infrastructure services. Within a Pipeline Pilot deployment, the following areas require backup:
■ XMLDB, using the Admin Portal, the DBUtil command line tool, or a standard file-system backup method pointing to the XMLDB folder. See the Admin Portal Help Center at: Admin > Server Maintenance > XMLDBs > Backing Up an XMLDB
■ Folders, as defined previously in this document, using a standard backup method on the file system:
■ /public/users
■ /logs/usage
■ /xmldb/Users
■ /xmldb/Components
■ /xmldb/Protocols
■ /xmldb/Objects
■ /apps
■ /web/apps
■ Configuration, using Import/Export within the Admin Portal. In the Admin Portal Help Center, see Admin > Server Maintenance > Exporting and Importing Server Configurations
■ Server configuration changes should be very infrequent.
■ As changes are made to the server, you should periodically export the configuration and put the export in a location that is both backed up and available to "undo" any changes previously made.

Restore procedures
While backup procedures are important, testing your restore process is critical. In the Admin Portal Help Center, see: Admin > Server Maintenance > XMLDBs > Restoring an XMLDB
Keep the following points in mind when planning a restore:
■ XMLDB: Backup files are stored in a folder specified on your Admin Portal Settings page and use the extension .xmldb.zip. The backup is a compressed copy of the entire XMLDB folder. Restoring converts all files back into the XML file formats saved on the server for your components and protocols. When you restore XML files from a backup copy, all changes to protocols and components since your last backup are overwritten. Use the backup that contains the latest versions to minimize the amount of rework.
■ Database Drivers: JDBC and ODBC drivers and configuration
■ Files and Folders: The folders and files that were defined previously in this document

Note: Do not restore /xmldb/Objects to a different server; its files can contain server-, path-, and port-specific details.
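The file-system part of the backup can be scripted with standard tools. The following is a minimal sketch, assuming placeholder installation and destination paths and that the folders listed above are relative to the installation root; it copies those folders into a dated backup location and does not replace the XMLDB backup or the configuration export steps.

    # Minimal sketch: copy the backup-relevant folders to a dated destination.
    # INSTALL_ROOT and BACKUP_ROOT are placeholders; adjust them to your deployment.
    import shutil
    from datetime import date
    from pathlib import Path

    INSTALL_ROOT = Path("/opt/BIOVIA/PPS")                  # placeholder installation root
    BACKUP_ROOT = Path("/backups/pipelinepilot") / date.today().isoformat()

    FOLDERS = [
        "public/users",
        "logs/usage",
        "xmldb/Users",
        "xmldb/Components",
        "xmldb/Protocols",
        "xmldb/Objects",
        "apps",
        "web/apps",
    ]

    for rel in FOLDERS:
        src = INSTALL_ROOT / rel
        dst = BACKUP_ROOT / rel
        if src.exists():
            shutil.copytree(src, dst, dirs_exist_ok=True)
            print("backed up", src)
        else:
            print("skipping missing folder", src)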

Keeping Your Server Healthy
Depending on your use of Pipeline Pilot within development or production, keep the following areas in mind.

Managing versions
When an author using the Pro Client saves a protocol or component that they are working on, Pipeline Pilot makes this a new "version" to ensure the author can revert to a previously saved point. The impact is that this can bloat the size of the XMLDB, since each version is stored as a snapshot. Within the Admin Portal, you can purge versions and decide which versions to keep (for example, the last 6 versions). This practice and policy should be well communicated to the authors using the server. For more information, in the Admin Portal Help Center, see Admin > Server Maintenance > XMLDBs > Cleaning Up an XMLDB

Job folders
Each time a protocol is executed, a job folder is created in the user directory. That folder can contain temporary data and results data, depending on how the protocol author configured the protocol. Depending on the usage and the number of users that run protocols on the server, this can result in growing consumption of the disk volume. To manage job folders:
■ Purge Job Folder: It is recommended that you purge job folders on a regular basis (for example, weekly) based on a given threshold (for example, keep only job folders that have been updated within the past two weeks). See the sketch at the end of this section.
■ Job Folder Location: It is recommended that you configure the job folder to a different location that supports high read/write performance, such as a RAID striped volume or other high-performance storage.

Upload folder
Within the Admin Portal, there is a folder setting that specifies a directory for storing files uploaded to the server. This is an optional directory that administrators can configure if necessary. If an Upload directory is not defined (and the setting is left blank), User folders are used for this purpose: when users upload content to Pipeline Pilot, the default is to store the content within that user's User folder. Uploaded data is typically short lived and can consume a large amount of disk space on your production volumes. Redirecting the uploaded content to a different location lets you periodically purge older or unneeded data and keep it separate from other user data. More information can be found in the Admin Portal Help Center at: Admin > Setup and Configuration > Folder Locations > Upload Folder
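The job folder purge policy described above can be implemented as a scheduled script. A minimal sketch, assuming a placeholder jobs directory and a two-week retention threshold:

    # Minimal sketch: remove job folders that have not been updated within the retention window.
    # JOBS_ROOT is a placeholder; point it at the job folder location configured for your server.
    import shutil
    import time
    from pathlib import Path

    JOBS_ROOT = Path("/data/pipelinepilot/jobs")   # placeholder job folder location
    RETENTION_DAYS = 14
    cutoff = time.time() - RETENTION_DAYS * 24 * 60 * 60

    for job_dir in JOBS_ROOT.iterdir():
        if job_dir.is_dir() and job_dir.stat().st_mtime < cutoff:
            shutil.rmtree(job_dir)
            print("purged", job_dir)

Schedule a script like this weekly (for example, with cron or Windows Task Scheduler) and communicate the retention policy to users beforehand.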


Managing Your Log Files
The following logs provide useful information about your deployment.

Log of job execution events
This log is maintained in the file /logs/usage/PipelinePilot.log. This log:
■ Is a structured, delimited file with one line per completed job
■ Includes information on the name of the protocol executed, the time and duration, the client user, and which packages were employed
For more information, see the Admin Portal Help Center at: Admin > Reports > Completed Jobs

Log of user authentication events
This log is maintained in the file /logs/usage/access.log. This log:
■ Is a structured, delimited file with one line per authentication event
■ Includes initial user login events and revalidation of user credentials
■ Includes information on the event time, the login name, the client program type, the authentication result (0=failed, 1=success, 2=reset), and the client IP address

Error and diagnostic messages
There are a number of files used for this purpose, all under the /logs/messages folder. There are three categories of file:
■ scitegicerror_NNN.log
■ javaserver_NNN.log
■ Files in the java subfolder

scitegicerror_NNN.log
NNN identifies a process name. Possible names are:
■ cgiadmin
■ cgifile
■ cgicache
■ cgihelp
■ scisvr
■ runprotocol
■ regression
■ dbutil
■ client
■ pkgutil
■ apache
■ aepmanager
The format of these files is a time stamp, followed by a process and thread id, a severity indicator, and then a message, which can be a single entry or a multi-line entry such as a stack trace. Some messages might contain new lines, which flow over multiple lines when viewed in a text editor. Messages can be errors, warnings, or trace (informational) in nature.

javaserver_NNN.log
NNN identifies a java application name. Possible names are:
■ catalog
■ derby
■ platform
■ securityfilter
■ tomcat_shared
Each file follows a standard java log file format with a time stamp and a message, although the formatting details can vary.

java subfolder
The java subfolder contains server error dump files, named jvm_error_.log.
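As an example of working with the usage logs described above, the following sketch tallies failed logins per user from the authentication log. The tab delimiter and the field positions are assumptions made for illustration; check a few lines of your own access.log before relying on them.

    # Minimal sketch: count failed authentication events per user from access.log.
    # The delimiter and field positions (time, login name, client type, result, IP)
    # are assumptions; inspect your own log format before using this.
    from collections import Counter
    from pathlib import Path

    LOG_FILE = Path("/opt/BIOVIA/PPS/logs/usage/access.log")   # placeholder installation root
    FAILED = "0"   # per the guide: 0=failed, 1=success, 2=reset

    failures = Counter()
    for line in LOG_FILE.read_text(encoding="utf-8", errors="replace").splitlines():
        fields = line.split("\t")
        if len(fields) >= 5 and fields[3] == FAILED:   # assumed position of the result field
            failures[fields[1]] += 1                   # assumed position of the login name

    for user, count in failures.most_common(10):
        print(user, count)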


Managing BIOVIA Catalog Indexes
You can use the Catalog Search feature to explore protocol and component database contents and usage patterns. For example, when preparing for an upgrade, you can identify the most or least frequently used protocols and find protocols that are authored by specific users. For more information, see the Admin Portal Help Center at: Admin > Reports > Catalog Search
As a general server administration tool, you can query for references to specific servers, authors, and protocol parameters. Catalog Search supports constructing ad hoc queries that result in a set of protocol names with associated protocol and user information. You can search the current server or other servers that are indexed as part of the Catalog, including pre-9.0 servers.

Catalog architecture
The Catalog Search service uses the Apache Solr 4.0 search engine to perform full-text and field-based searches over components and protocols. Catalog Search is typically set up on one Master Node. A scheduled task on the Master Node triggers the indexing protocol process.


The indexer protocol extracts information from the local file system. The extracted information includes the content of components, protocols, access rights, and usage log information. This data is organized into documents that describe each component or protocol and is then added to the search index through Solr.

Local versus Remote indexing
Catalog Search indexes the local XMLDB by default. You can add one or more remote XMLDBs to the Catalog Search index by configuring the Server Registry in the Admin Portal. Catalog Search indexes remote XMLDBs by running a protocol on the Pipeline Pilot node.

Security considerations
End users search the catalog index through a variety of client applications, such as Pipeline Pilot, Web Port, and the Admin Portal. However, all clients access the search index through a central web service hosted on the Master Node. The common search service ensures that all clients support the same search syntax, return the same result sets, and check security consistently. The search service applies the following security checks:
■ The logged-on user is authenticated and has a valid session token.
■ The logged-on user has permission to either search or update the catalog index.
■ The logged-on user has permission to read the components and protocols in the result set.
Catalog Search automatically removes any components and protocols in the result set that the user is not permitted to read. For example, users can never see protocols stored on another user's tab. However, administrators can find it helpful to search across user tabs. In this case, administrators with the Platform | Catalog | SeeAllIndex permission are permitted to search across all components and protocols in the index. For more information, see the Admin Portal Help Center at:
■ Setup and Configuration > Catalog Settings
■ Admin > Reports > Catalog Search

Indexer Configuration and Scheduling
The catalog indexer provides many configuration options for you to tune. For example, you can select which top-level folders to index, which servers to index, and whether to perform full or partial rebuilds.

Indexing folder types
You can choose which folders to index on the Admin Portal page, including Protocols, Components, and User Folders. It is common for the Users folder to consume the most space and take up the most indexing time. Consider disabling user folder indexing when you:
■ have a very large XMLDB with many user protocols
■ have low available disk space in the Pipeline Pilot installation folder
■ need to update the index more frequently

Partial rebuild
As of Pipeline Pilot 9.1, the Partial Rebuild option performs an incremental update by comparing the contents of the local XMLDB against data stored in the master index. Partial rebuilds capture all protocols that were created, updated, deleted, or run for all selected folders. This process is much faster and normally completes in a few minutes for typical usage.

Full rebuild
If no index exists for a given server, the indexer automatically performs a full build of the index. Occasionally you need to rebuild the catalog index. You can manually rebuild the index by selecting "Rebuild Index" on the "Admin Portal > Catalog Settings" page and then selecting "Update Index Now". A full build or rebuild of the index can take several hours for a large XMLDB (>10,000 protocols). Consider performing a full rebuild of the catalog index when you:
■ install many new packages
■ update the server to a new version
■ schedule frequent partial rebuilds and notice query performance degrading


Scheduling the Indexer
The index picks up recent changes the next time the index job runs. Therefore, you need to balance update frequency against the indexer overhead. Partial rebuilds typically take less than 1 minute to complete on small to medium sized databases. This means you can update the index several times per day. However, you should update less frequently if you have a large XMLDB. Once per day during off-peak hours is sufficient in most cases.
Note: You should not typically configure the indexer scheduled task to rebuild the index unless you have a small XMLDB (<5,000 protocols).

Disk usage
Catalog Search stores index files in the Pipeline Pilot server's installation folder, at /database/solr/appcatalog. This location is not configurable. The data size varies depending on the number, size, and complexity of the protocols and on the folder types selected on the Catalog Settings page. With the standard Pipeline Pilot installation, the hard disk needs at most 10 MB per 1000 protocols. Larger databases can consume upwards of 10 GB of disk space.

Backup and recovery
The index database is not backed up automatically unless you automatically back up the entire Pipeline Pilot installation folder. However, the index files can easily be regenerated by performing a full rebuild after recovering the XMLDB contents.

Suggested Processes and Policies

Publication of protocols
Protocols are typically considered published when made available within the Protocols or Components area on the production server. This can be done either by using the client or by deploying them as part of a package. For Web Port publication of interactive forms and scheduled tasks, the best practice is to have moderate controls with defined ownership and documentation.
There are standard validation tools to ensure protocols meet specific standards. You can extend them to address deployment-specific needs such as adherence to corporate coding standards, or to prevent publication of a protocol that lacks access to a key resource. Validation is typically invoked manually by the developer from the Pro Client, but it can be configured to run automatically for a given server when a protocol is moved or saved. For more information, see the Admin Portal Help Center at: Admin > Reports > Validation Reports
Authors building protocols on development servers should manually run the validation to ensure they are conforming to corporate policies and best practices. BIOVIA strongly recommends that validation be run automatically on any saved protocol on production servers. While validation does not fully replace peer review, it helps ensure that published protocols are not missing key properties. The standard validation protocol looks for DSNs that are referenced but not defined for your server, hard-coded file paths that are not found on the current server, and so on.

XMLDB folder structure, access rights, and roles
As described previously, having a core shared folder structure is important for the following reasons:
■ Defining ownership for development, so that one protocol can refer to another by a path that is known and common to all servers
■ Shared settings for access rights
A proposed structure is described in the Admin Portal Help Center at: Admin > Security > XMLDB Access Rights > XMLDB Access Rights
For all but the most locked-down application needs, project owners have access rights set to give full read-write access for their project folders on all servers. Access rights settings for other users are defined by the project owner and implemented by the server administrator. End-user access is further defined by the assigned role. For example, an end user is typically assigned the WEBPORT role to allow Web Port login on the production server, but is denied this login on the development server to prevent bypassing the publication process. After login, the folder access rights settings determine which protocols the user can view from the interface.

Publication of results
Results can be published almost anywhere in most formats. Common uses include updating a database or producing a static report, such as a PDF document. One feature of the Interactive Reporting collection is the ability to build complex, dynamic HTML-based reports, which typically use functionality that ties them to the server. Since job results folders are intended to be temporary storage locations, a common request by developers is for access to a location to which they can publish interactive reports available to end users via a hyperlink.
The preferred location to write results that you want to keep long term, with URLs that you can share with others, is the $(SciTegicRoot)/../web/apps folder on your server. An alias on the Pipeline Pilot Apache web server named webapps points to this web/apps directory, and you can create and reference subfolders to help keep your files organized. For example, an HTML page written or copied to $(SciTegicRoot)/../web/apps/myReport/report1.html can be accessed by all users at a URL such as http://<server>:<port>/webapps/myReport/report1.html (a short publishing sketch appears at the end of this section). While webapps is a convenient alias, it is hard-coded to reside on the server collocated with your Pipeline Pilot server installation. You can create a corporate-specific alias to point to an alternate location. However, if left unchecked, these reports can be written to any location specified by the protocol, and there is no requirement that the report owner or the report's longevity be defined.

Data longevity and size
Some users have historically experienced disk space usage pressures. This can be mitigated by implementing policies, processes, and resource suggestions that help limit consumption of space. When a protocol is run, a job folder is created; it has a defined lifetime that depends on the invoking client. Since interfaces like Web Port do not do explicit automatic job folder cleanup, some sites institute regular job folder cleanup as a scheduled task and notify their users of this policy. For more information, see the Admin Portal Help Center at: Admin > Status and Monitoring > Job Management > Job Folder Maintenance
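The publishing step for the webapps location described above amounts to copying the finished report into web/apps and handing out the URL. A minimal sketch, with placeholder paths and server name:

    # Minimal sketch: copy a finished report into web/apps and print its shareable URL.
    # INSTALL_ROOT, the report path, and the server name are placeholders.
    import shutil
    from pathlib import Path

    INSTALL_ROOT = Path("/opt/BIOVIA/PPS")          # placeholder installation root
    WEBAPPS = INSTALL_ROOT / "web" / "apps"         # folder served under the webapps alias

    report = Path("/tmp/report1.html")              # placeholder report produced by a job
    target_dir = WEBAPPS / "myReport"
    target_dir.mkdir(parents=True, exist_ok=True)
    shutil.copy2(report, target_dir / report.name)

    print("https://pipelinepilot.example.com/webapps/myReport/" + report.name)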


Packaging and regressions
Stricter publication controls and testing requirements should be imposed for protocols whose mission is larger than a simple deployment as a scheduled task or through Web Port, such as a protocol being used as a versioned component in a critical deployment of a larger system. Regression tests and/or manual test plans should be expected for any published protocol, but they should be mandatory for projects that have this type of larger scope or are of a business-critical nature. For more information on regression testing, see the Admin Portal Help Center at: Developers > Deployment Guides > Component Development Regression Test Guide
For more information on packaging, see: Developers > Deployment Guides > Application Packaging Guide
