BLAST and Bioinformatics Applications on Purdue's DiaGrid

BLAST and Bioinformatics Applications on Purdue’s DiaGrid
May 3, 2012
Brian Raub
Purdue University
[email protected]
Condor Week 2012

Where were we?
• Over 37 kilocores across campus
  – Three community clusters (Steele, Coates, Rossmann)
  – Two “ownerless” clusters (Radon, Miner)
  – CMS Tier-2 cluster
  – Other small clusters
  – Instructional labs and academic departments

… and what about now?
• Nearly 50 kilocores across campus!
  – Two new community clusters
    • Hansen – Dell nodes w/ four 12-core AMD Opteron 6176 processors
    • Carter – HP nodes w/ two 8-core Intel Xeon-E5 processors (Sandy Bridge)
  – Carter ranks 54th in the latest Top500.org list for fastest supercomputers
  – Carter is the nation’s fastest campus supercomputer

DiaGrid?
• A large, high-throughput, distributed computing system
• Using Condor to manage jobs and resources
• Purdue leading a partnership of 10 campuses and institutions
  – University of Wisconsin, Notre Dame and Indiana University to name a few
• Including all Purdue (and other campus) clusters, lab computers, department computers, desktops, totaling 60,000+ cores

Ok, cool… Now what?

Basic Local Alignment Search Tool
• Comparing nucleotide or protein sequences
  – String and substring pattern matching
• National Center for Biotechnology Information (NCBI)

Why remake something?
• Input file size limitations (5MB, 10MB, etc.)
• Number of sequences for comparison
• Timeliness
• Ease of use

BLAST and DiaGrid
• BLAST is highly parallelizable
  – No one sequence result depends on another (GREAT!!!)
  – Split input file with trusty friend AWK
  – Build a Condor DAG to maintain all jobs
    • Never more than 1500 individual jobs

BLAST and DiaGrid
[Figure: Input File, Results]

BLASTer

Big Benefits? We think so!
• Rick Westerman
  – Bioinformatics Specialist at the Purdue University Genomics Facility

Development Hurdles
• DiaGrid disk quota per user
  – Default 1GB -> NOT ENOUGH SPACE!!!
• Condor job failure
  – Set retry flag (We use 20 to be safe)
• Need more features!

To the Future!

BLASTer Plans
• Custom Databases
  – Nearly all researchers want this feature
  – Concern: Database permissions
• More output viewing options
  – Integrated HTML viewer
  – Blast2Go
• Better file management

DiaGrid Plans
• R (programming language) statistical computing
  – Landscape Ecology & Biodiversity Department
• Cryo-Electron Microscopy Tools (Cryo-EM)
  – Single particle reconstruction (EMAN2 and similar tools)
  – Department of Biological Sciences

Questions?
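
The split-and-DAG workflow from the "BLAST and DiaGrid" slide could look roughly like the sketch below. It is an illustration under stated assumptions, not the BLASTer implementation: the slides split the input with AWK, while this sketch uses Python; the submit file name `blast.sub`, the node names `blast_N`, the chunk file names, and the `chunk` variable are all hypothetical; and it assumes the "retry flag" corresponds to DAGMan's `RETRY` command, which the slides do not confirm. Only the 1500-job cap and the retry count of 20 come from the talk.

```python
import math
from typing import List

MAX_JOBS = 1500  # cap on individual jobs, from the slides
RETRIES = 20     # retry count, from the slides


def read_fasta_records(text: str) -> List[str]:
    """Return each FASTA record (header plus sequence lines) as one string."""
    records: List[str] = []
    current: List[str] = []
    for line in text.splitlines():
        # A new '>' header closes the record collected so far.
        if line.startswith(">") and current:
            records.append("\n".join(current) + "\n")
            current = []
        if line.strip():
            current.append(line)
    if current:
        records.append("\n".join(current) + "\n")
    return records


def split_records(records: List[str], max_jobs: int = MAX_JOBS) -> List[str]:
    """Group whole records into at most max_jobs chunks of near-equal size."""
    if not records:
        return []
    per_chunk = math.ceil(len(records) / max_jobs)
    return [
        "".join(records[i:i + per_chunk])
        for i in range(0, len(records), per_chunk)
    ]


def build_dag(num_chunks: int, submit_file: str = "blast.sub",
              retries: int = RETRIES) -> str:
    """Emit DAGMan text with one independent node per chunk.

    No PARENT/CHILD lines are written, because no sequence result
    depends on another.
    """
    lines: List[str] = []
    for i in range(num_chunks):
        node = f"blast_{i}"  # hypothetical node name
        lines.append(f"JOB {node} {submit_file}")
        lines.append(f'VARS {node} chunk="chunk_{i}.fasta"')
        lines.append(f"RETRY {node} {retries}")
    return "\n".join(lines) + "\n"
```

Because records are never split across chunks, each job can be run and retried on its own, and concatenating the per-chunk outputs in node order would reproduce the ordering of the original input.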
