Analysis Reproducibility with REANA on Kubernetes

Analysis Reproducibility with REANA on Kubernetes

Analysis Reproducibility with REANA on Kubernetes Diego Rodriguez on behalf of the REANA team 1 Reproducibility? https://www.nature.com/news/1-500-scientists-lift-the-lid-on-reproducibility-1.19970 2 Long-term value of knowledge? https://twitter.com/PKoppenburg/status/1301813341460066304 3 It is harder than you think 4 The four questions where is the data? EOS, local hard drive… where is the code? GitLab, local copy... where is the environment? my own computer, lxplus... where is the workflow? I remember it, bash script, readme... 5 ATLAS and CMS containerised environments https://hub.docker.com/r/atlas/analysisbase/tags https://gitlab.cern.ch/cms-cloud/cmssw-docker/container_registry 6 A recipe for reproducibility 陋 + + 菱 + ‍‍ = data + code + environment + workflow = ♻ 7 Overview Different storage backends Multiple compute backends: Kubernetes, HTCondor, Slurm Different workflow languages (from simple pipelines to complex map-reduce operations) 8 Integrations with other systems Private GitLab images Possibility to access private EOS resources with Kerberos Support for VOMS-proxy certificates 9 Demo Demonstrate REANA UI and CLI Walk-through simple physics model REANA CLI - analysis end-user view GitLab integration Kubernetes/OpenStack - admin view Live architecture overview Kubernetes cluster/admin view 10 A declarative approach version: 0.6.0 inputs: files: - code/gendata.C - code/fitdata.C parameters: events: 20000 data: results/data.root plot: results/plot.png workflow: type: serial specification: steps: - name: gendata environment: 'reanahub/reana-env-root6:6.18.04' compute_backend: htcondorcern commands: - mkdir -p results && root -b -q 'code/gendata.C(${events},"${data}")' - name: fitdata environment: 'reanahub/reana-env-root6:6.18.04' compute_backend: htcondorcern commands: - root -b -q 'code/fitdata.C("${data}","${plot}")' outputs: files: - results/plot.png 11 Scaling out: Beyond Standard Model searches 12 Pilot examples 13 @CERN Install on premises $ helm repo add reanahub \ https://reanahub.github.io/reana $ helm repo update $ helm install --devel reana reanahub/reana Helm repository at http://reanahub.github.io/reana, more docs https://reana.cern.ch/ at http://docs.reana.io/development/deploying-at-scale/ 14 Get in touch! @CERN → https://reana.cern.ch/ Documentation → http://docs.reana.io/ Forum https://forum.reana.io/ Mattermost@CERN https://mattermost.web.cern.ch/it-dep/channels/reana REANA@Gitter https://gitter.im/reanahub/reana REANA@Github https://github.com/reanahub/reana 15 Q&A.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    16 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us