Update on activities at LBNL

*Val Hendrix: [email protected], Doug Benjamin (2): [email protected], Roberto Vitillo (1): [email protected]
(1) Lawrence Berkeley National Lab, (2) Duke University

7/12/12 ADC RnD

Overview

● Goal
● Progress
● Plans

Goal

● Use Case: A site should be able to easily deploy a new analysis cluster (batch, PROOF) on commercial or private cloud resources. https://twiki.cern.ch/twiki/bin/view/Atlas/CloudcomputingRnD#Use_Cases
● Solution: Elastic Data Analysis Cluster (E-DAC), a fully configured data analysis cluster that is elastic and deployable on multiple clouds.
● We have chosen Scalr Open Source as the cluster management tool for delivering E-DAC.

Progress

● Scalr 3.5 release
● Scalr + Openstack
● ScalrEDAC

Progress: Scalr 3.5 Released

● Scalr 3.5 was released June 13th
  ○ RAID over EBS: increased performance for EBS on EC2
  ○ Two scalarizr repositories: time and control your updates
  ○ Scripts & Roles attachment: create your custom images
  ○ Role versioning: improved control
● This release fixed a series of issues that were impeding the development of the Condor Cluster roles
  ○ Custom metric scaling
  ○ Scalarizr agent bug: this blocked the Condor Cluster configuration scripts from running successfully
● Openstack support was promised for this release, but unfortunately it is still not available

Progress: Scalr + Openstack

● Scalr does not officially support Openstack yet (although they are trying)
  “We haven't gotten it to work (Essex) on our own servers, TryStack lacks Swift and Volumes, and every cluster that the community has lent us has been disappointingly buggy or incomplete.”
  Sebastian Stadil, on the scalr-discuss group
● Scalr has a Eucalyptus interface; will that work for Openstack?
  ○ There’s more to adding support for a cloud than a simple API. Differences include signing, bundling images, creating manifests…
  ○ For now, the best course for the open source community is to work with Scalr and grant them temporary access so they can test and make it work
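
To illustrate why "signing" alone can differ between clouds, here is a minimal sketch of EC2-style query signing (Signature Version 2 style: an HMAC-SHA256 over a canonical request string). It assumes that "signing" on this slide refers to API request signing; it is not taken from Scalr's code, and the function name, host, and parameters are hypothetical. A cloud whose native API authenticates differently (for example with a token rather than a per-request signature) would need a separate code path.

```python
import base64
import hashlib
import hmac
from urllib.parse import quote


def sign_query_v2(secret_key, method, host, path, params):
    """Return a base64 HMAC-SHA256 signature in the EC2 Signature Version 2 style.

    Hypothetical helper for illustration only; not Scalr's implementation.
    """
    # Canonical query string: parameters sorted by name, percent-encoded
    # with only the RFC 3986 unreserved characters left as-is.
    canonical_query = "&".join(
        f"{quote(k, safe='-_.~')}={quote(str(v), safe='-_.~')}"
        for k, v in sorted(params.items())
    )
    # String to sign: method, lowercase host, path, canonical query.
    string_to_sign = "\n".join([method.upper(), host.lower(), path, canonical_query])
    digest = hmac.new(
        secret_key.encode(), string_to_sign.encode(), hashlib.sha256
    ).digest()
    return base64.b64encode(digest).decode()
```

The same request signed with a different secret, or with a differently canonicalized query, yields a different signature, which is one reason an EC2-compatible endpoint and a cloud's native API are not interchangeable from the client's point of view.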

Progress: Scalr + Openstack

● Fresno Openstack
  ○ I initially tried to use Openstack through the Scalr Eucalyptus interface, but I never got past importing Scalr roles (VMs) due to volume creation issues.
  ○ Fresno is still working on their Openstack deployment. Stay tuned.
  ○ In parallel, they are exploring Cloudstack (see forthcoming slides)
● BNL Openstack
  ○ Worked with BNL folks to figure out our requirements
  ○ So far we have obtained Openstack credentials and a BNL account ID, and have requested a tenant account
  ○ BNL VMs are in a NATed network, which presents a problem for a Scalr service outside of BNL. Possible solutions are:
    ■ Run Scalr inside BNL as a VM
    ■ Run Scalr outside of BNL, with a brokered connection between the Scalr machine and the launched instances
    ■ Since the launched VMs can pull data from outside the network, have the launched instances pull the contextualization from Scalr rather than have Scalr push it

Progress: ScalrEDAC (Scalr Elastic Data Analysis Cluster)

Subversion project, scalredac, available to all CERN users for installing and configuring Scalr to run ATLAS data analysis clusters on EC2.

Progress: ScalrEDAC (Scalr Elastic Data Analysis Cluster)

● Boxgrinder appliance definitions for building and uploading ATLAS VM images
  ○ Scalr VM (can be used for a virtual instance of Scalr)
  ○ ATLAS T3 VM (VM with ATLAS services installed and ready to be configured by Scalr)
● Installation script to configure Scalr Open Source for ATLAS
  ○ Currently supports CentOS 6.2, SLC 5.8
  ○ Installs Scalr 3.5
  ○ Adds Scalr Roles (VMs) for creating Server Farms (Clusters):
    ■ Condor Cluster: CondorHead, CondorWorker, CondorInteractive
    ■ Local XrootD Cluster: XrootDRedirector, XrootDDataNode
● Detailed installation instructions on twiki.cern.ch at Main.ScalrEDAC
  ○ Currently supports deploying Condor and local XrootD clusters on EC2

Plans

● Scalr + Cloudstack
  ○ Scalr supports Cloudstack
  ○ Investigate use of Scalr+Cloudstack+VPN
  ○ Cloudstack Providers
    ■ ANL Cloudstack: Doug B will be setting up a Cloudstack deployment at ANL that LBNL will have access to
    ■ CSU Fresno Cloudstack: Fresno has a Cloudstack test cloud. We will work with Fresno to get Scalr working on this test cloud

Plans

● Scalr + puppet integration
  ○ We have put in a request to Scalr, Inc to integrate puppet with Scalr
● ScalrEDAC
  ○ Custom scaling metrics for Condor workers
  ○ Streamline installation
  ○ More detailed use cases needed
  ○ Feedback?
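
As a rough idea of what a custom scaling metric for Condor workers could compute, the sketch below derives two numbers from the queue: idle jobs per worker, and a bounded target worker count. This is an assumption about the shape of such a metric, not the ScalrEDAC implementation; the function names, bounds, and slots-per-worker default are hypothetical. The job status codes follow HTCondor's JobStatus attribute (1 = Idle, 2 = Running), which could be collected with something like `condor_q -format "%d\n" JobStatus`.

```python
import math

# HTCondor JobStatus codes used here: 1 = Idle, 2 = Running.
IDLE = 1
RUNNING = 2


def idle_jobs_per_worker(job_statuses, worker_count):
    """Return idle jobs per worker; a hypothetical value to report as a metric."""
    idle = sum(1 for status in job_statuses if status == IDLE)
    # Treat an empty pool as one worker to avoid dividing by zero.
    return idle / max(worker_count, 1)


def desired_workers(job_statuses, slots_per_worker=1, min_workers=1, max_workers=10):
    """Return a worker count giving every idle or running job a slot, within bounds."""
    active = sum(1 for status in job_statuses if status in (IDLE, RUNNING))
    needed = math.ceil(active / slots_per_worker)
    return max(min_workers, min(max_workers, needed))
```

For example, `desired_workers([1, 1, 2, 4], slots_per_worker=2)` counts three active jobs (the completed job with status 4 is ignored) and asks for two workers. Keeping the calculation separate from the queue query makes it easy to test without a running Condor pool.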

Conclusion

● ScalrEDAC, a Scalr installation for ATLAS data analysis clusters, has been made available to CERN users in a CERN svn repo.
● Openstack+Scalr is stalled until we can work out the issues with the Scalr team
● The Cloudstack+Scalr investigation will be underway this summer

Backup