Working with EC2 Instances
Total Page:16
File Type:pdf, Size:1020Kb
PRODUCT DOCUMENTATION Pivotal™ Greenplum Database® Version 4.3 Installation Guide Rev: A31 © 2018 Pivotal Software, Inc. Copyright Installation Guide Notice Copyright Privacy Policy | Terms of Use Copyright © 2018 Pivotal Software, Inc. All rights reserved. Pivotal Software, Inc. believes the information in this publication is accurate as of its publication date. The information is subject to change without notice. THE INFORMATION IN THIS PUBLICATION IS PROVIDED "AS IS." PIVOTAL SOFTWARE, INC. ("Pivotal") MAKES NO REPRESENTATIONS OR WARRANTIES OF ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Use, copying, and distribution of any Pivotal software described in this publication requires an applicable software license. All trademarks used herein are the property of Pivotal or their respective owners. Revised November 2018 (4.3.30.3) 2 Contents Installation Guide Contents Chapter 1: Preface.......................................................................................6 About This Guide................................................................................................................................ 7 About the Greenplum Database Documentation Set..........................................................................8 Document Conventions....................................................................................................................... 9 Command Syntax Conventions................................................................................................ 9 Getting Support.........................................................................................................................9 Chapter 2: Introduction to Greenplum.................................................... 10 The Greenplum Master..................................................................................................................... 11 Master Redundancy................................................................................................................11 The Segments................................................................................................................................... 12 Segment Redundancy............................................................................................................ 12 Example Segment Host Hardware Stack...............................................................................13 Example Segment Disk Layout.............................................................................................. 14 The Interconnect................................................................................................................................16 Interconnect Redundancy.......................................................................................................16 Network Interface Configuration............................................................................................. 16 Switch Configuration...............................................................................................................17 ETL Hosts for Data Loading............................................................................................................. 19 Greenplum Performance Monitoring................................................................................................. 21 Chapter 3: Estimating Storage Capacity.................................................22 Calculating Usable Disk Capacity..................................................................................................... 23 Calculating User Data Size...............................................................................................................24 Calculating Space Requirements for Metadata and Logs................................................................ 25 Chapter 4: Configuring Your Systems and Installing Greenplum.........26 System Requirements....................................................................................................................... 27 Setting the Greenplum Recommended OS Parameters...................................................................29 Linux System Settings............................................................................................................29 Running the Greenplum Installer...................................................................................................... 34 Installing and Configuring Greenplum on all Hosts.......................................................................... 36 About gpadmin........................................................................................................................36 Confirming Your Installation................................................................................................... 37 Installing Oracle Compatibility Functions.......................................................................................... 38 Installing Greenplum Database Extensions...................................................................................... 39 Creating the Data Storage Areas......................................................................................................40 Creating a Data Storage Area on the Master Host................................................................40 Creating Data Storage Areas on Segment Hosts.................................................................. 40 Synchronizing System Clocks........................................................................................................... 42 Enabling iptables............................................................................................................................... 43 Example iptables Rules..........................................................................................................43 Amazon EC2 Configuration (Amazon Web Services).......................................................................46 About Amazon EC2................................................................................................................46 Working with EC2 Instances.................................................................................................. 47 About Amazon Machine Image (AMI).................................................................................... 49 3 Contents Installation Guide About Amazon Elastic IP Addresses..................................................................................... 49 Notes.......................................................................................................................................49 References..............................................................................................................................50 Next Steps......................................................................................................................................... 51 Chapter 5: Installing the Data Science Packages.................................. 52 Python Data Science Module Package............................................................................................ 53 Python Data Science Modules............................................................................................... 53 Installing the Python Data Science Module Package............................................................ 53 Uninstalling the Python Data Science Module Package........................................................ 54 R Data Science Library Package......................................................................................................55 R Data Science Libraries....................................................................................................... 55 Installing the R Data Science Library Package......................................................................56 Uninstalling the R Data Science Library Package................................................................. 56 Chapter 6: Validating Your Systems....................................................... 58 Validating OS Settings...................................................................................................................... 59 Validating Hardware Performance.................................................................................................... 60 Validating Network Performance............................................................................................ 60 Validating Disk I/O and Memory Bandwidth..................................................................................... 62 Chapter 7: Configuring Localization Settings........................................ 63 About Locale Support in Greenplum Database................................................................................ 64 Locale Behavior...................................................................................................................... 65 Troubleshooting Locales.........................................................................................................65 Character Set Support...................................................................................................................... 66 Setting the Character Set..................................................................................................................69 Character Set Conversion Between Server and Client.....................................................................70