Supercomputers: Queue and Job Management HORT 59000 Lecture 6 Instructor: Kranthi Varala Client/Server Architecture

Total Page:16

File Type:pdf, Size:1020Kb

Supercomputers: Queue and Job Management HORT 59000 Lecture 6 Instructor: Kranthi Varala Client/Server Architecture Supercomputers: Queue and Job management HORT 59000 Lecture 6 Instructor: Kranthi Varala Client/Server architecture User1 User2 Server (UNIX/ Web/ Database etc..) User3 User4 When to use supercomputers • Need to run hundreds to thousands of similar jobs. • Need to run a few large jobs quickly. • Tasks can be divided into smaller portions and run in parallel. Parallelization • Refers to the ability of dividing a large task into smaller parts that can all be run in parallel. • E.g., Correlation matrix of 10,000 genes. • Can be divided into 10,000 jobs where each job works on one gene. • Need to balance increase in computation power with increase in need to communicate. Parallelization: Ideal vs Real By Raul654 (Own work) [CC BY-SA 3.0 (https://creativecommons.org/licenses/by-sa/3.0)], via Wikimedia Commons Supercomputer architecture Storage Compute Node1 Home Compute Node2 Compute Node3 Storage Head Node Compute Node4 Scratch Compute Node5 Compute Node6 Storage Compute Node7 Archive Compute Node8 Supercomputer architecture Compute Node1 Compute Node2 Compute Node3 Storage Head Node Compute Node4 Archive Compute Node5 Compute Node6 Compute Node7 Storage Storage Compute Node8 Home Scratch Terminology • Cluster: Complex of multiple “nodes” + connecting hardware + storage • Node: Individual computer in the cluster with its own processors and memory. • Core: Individual processor on a node. Head Node • Specialized node handles user accounts, logins, communication with the internet and network storage. • Built to handle lots of user interactions and to run the job management system. • NOT optimized to run compute intensive jobs. • Typically interacts with archival storage. Compute Node • Multiple identical nodes heavy computational load and large memory jobs. • NOT optimized to support user interaction. • Typically has access to home and scratch space only. Homogeneous vs. Heteregeneous nodes • Homogeneous : All compute nodes share identical software and hardware. • Heterogeneous: Different kinds of compute nodes with same software but large differences in hardware. • Heterogenous clusters support multiple job types such as compute intensive vs. memory intensive vs. I/O intensive jobs Job Management • User submits “jobs” to the cluster with specific requests for cores, memory, walltime etc. • Jobs once submitted will be allocated to the nodes by the job management software. • Each job runs creates its own login/shell and runs within that. • Eg., PBS, SGE, SLURM etc. PBS Job Management • Portable Batch System (PBS) is a very common management system. Used on all Purdue RCAC clusters. • Each “job” is essentially a shell script that has a series oF commands. • Jobs are allocated based on resources available to the user. PBS resources • Nodes: No. of nodes requested for the job. • Cores: No. of cores requested for the job. Total number cannot exceed the sum of cores available on all nodes requested. • Memory: Total memory required for the job (estimate). • Wall time: Expected run time for the job. • NOTE: job will be terminated if it exceeds time requested and/or runs out of memory. PBS queues • Queues are how PBS manages the job submission. • Each queue has a set of properties: No. and/or types of nodes available to it, max. run time, user access permissions, priorities etc. • Each job is submitted by user to specific queue. • When a node that fits user requirement becomes available the job in the queue is run. PBS queues • Multiple queues may be available to you. • Get a list of queues by using PBS command qlist • qstat –a will list the status of the queue Scholar queues • Three queues available. • Our jobs go to scholar queue by default. PBS environment variables • Job environment variables are specific to the job and only exist while the job is running. • PBS_JOBID : ID assigned to the current job. • PBS_O_WORKDIR : Directory from which the job was submitted. Example PBS job #!/bin/sh –l #PBS -l nodes=1:ppn=24 #PBS -l walltime=24:00:00 #PBS -l naccesspolicy=singleuser #PBS -q kvarala cd /scratch/brown/kvarala/EvoNet mpirun -np 24 /home/kvarala/bin/examl-AVX -t RAxML_parsimonyTree.StartingTree -m PSR -s EvoNet.m10.binary -n T1 >ExaML.log Example SGE job #!/bin/bash #$ -N run_blastp #$ -cwd #$ -pe smp 8 #$ -l h_vmem=4G blastp –a 8 –I infile.fasta –d database.db –o outfile.blast Example SLURM job #!/bin/bash #SBATCH --nodes 1 #SBATCH --tasks-per-node 12 #SBATCH -t 4:00:00 #SBATCH --mem 8GB cd /scratch/kv15/subCluster makeblastdb -dbtype 'prot' -in Gymno.Family1 -title GYMFAM1 -out GYMFAM1 -parse_seqids blastp -db GYMFAM1 -query Gymno.Family1 -outfmt 6 -evalue 0.01 - num_threads 12 -out GYMFAM1.blastout Modules • Pre-installed software that is loaded when needed. • E.g., module load blastall • module list avail shows the list of all modules available. • Only load the modules you need to run the current job..
Recommended publications
  • Analysis of Server-Smartphone Application Communication Patterns
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Aaltodoc Publication Archive Aalto University School of Science Degree Programme in Computer Science and Engineering Péter Somogyi Analysis of server-smartphone application communication patterns Master’s Thesis Budapest, June 15, 2014 Supervisors: Professor Jukka Nurminen, Aalto University Professor Tamás Kozsik, Eötvös Loránd University Instructor: Máté Szalay-Bekő, M.Sc. Ph.D. Aalto University School of Science ABSTRACT OF THE Degree programme in Computer Science and MASTER’S THESIS Engineering Author: Péter Somogyi Title: Analysis of server-smartphone application communication patterns Number of pages: 83 Date: June 15, 2014 Language: English Professorship: Data Communication Code: T-110 Software Supervisor: Professor Jukka Nurminen, Aalto University Professor Tamás Kozsik, Eötvös Loránd University Instructor: Máté Szalay-Bekő, M.Sc. Ph.D. Abstract: The spread of smartphone devices, Internet of Things technologies and the popularity of web-services require real-time and always on applications. The aim of this thesis is to identify a suitable communication technology for server and smartphone communication which fulfills the main requirements for transferring real- time data to the handheld devices. For the analysis I selected 3 popular communication technologies that can be used on mobile devices as well as from commonly used browsers. These are client polling, long polling and HTML5 WebSocket. For the assessment I developed an Android application that receives real-time sensor data from a WildFly application server using the aforementioned technologies. Industry specific requirements were selected in order to verify the usability of this communication forms. The first one covers the message size which is relevant because most smartphone users have limited data plan.
    [Show full text]
  • Implementation of Embedded Web Server Based on ARM11 and Linux Using Raspberry PI
    International Journal of Recent Technology and Engineering (IJRTE) ISSN: 2277-3878, Volume-3 Issue-3, July 2014 Implementation of Embedded Web Server Based on ARM11 and Linux using Raspberry PI Girish Birajdar Abstract— As ARM processor based web servers not uses III. HARDWARE USED computer directly, it helps a lot in reduction of cost. In this We will use different hardware to implement this embedded project our aim is to implement an Embedded Web Server (EWS) based on ARM11 processor and Linux operating system using web server, which are described in this section. Raspberry Pi. it will provide a powerful networking solution with 1. Raspberry Pi : The Raspberry Pi is low cost ARM wide range of application areas over internet. We will run web based palm-size computer. The Raspberry Pi has server on an embedded system having limited resources to serve microprocessor ARM1176JZF-S which is a member of embedded web page to a web browser. ARM11 family and has ARMv6 architecture. It is build Index Terms— Embedded Web Server, Raspberry Pi, ARM, around a BCM2835 broadcom processor. ARM processor Ethernet etc. operates at 700 MHz & it has 512 MB RAM. It consumes 5V electricity at 1A current due to which power I. INTRODUCTION consumption of raspberry pi is less. It has many peripherals such as USB port, 10/100 ethernet, GPIO, HDMI & With evolution of World-Wide Web (WWW), its composite video outputs and SD card slot.SD card slot is application areas are increasing day by day. Web access used to connect the SD card which consist of raspberry linux functionality can be embedded in a low cost device which operating system.
    [Show full text]
  • Data Management for Portable Media Players
    Data Management for Portable Media Players Table of Contents Introduction..............................................................................................2 The New Role of Database........................................................................3 Design Considerations.................................................................................3 Hardware Limitations...............................................................................3 Value of a Lightweight Relational Database.................................................4 Why Choose a Database...........................................................................5 An ITTIA Solution—ITTIA DB........................................................................6 Tailoring ITTIA DB for a Specific Device......................................................6 Tables and Indexes..................................................................................6 Transactions and Recovery.......................................................................7 Simplifying the Software Development Process............................................7 Flexible Deployment................................................................................7 Working with ITTIA Toward a Successful Deployment....................................8 Conclusion................................................................................................8 Copyright © 2009 ITTIA, L.L.C. Introduction Portable media players have evolved significantly in the decade that has
    [Show full text]
  • Blockchain Database for a Cyber Security Learning System
    Session ETD 475 Blockchain Database for a Cyber Security Learning System Sophia Armstrong Department of Computer Science, College of Engineering and Technology East Carolina University Te-Shun Chou Department of Technology Systems, College of Engineering and Technology East Carolina University John Jones College of Engineering and Technology East Carolina University Abstract Our cyber security learning system involves an interactive environment for students to practice executing different attack and defense techniques relating to cyber security concepts. We intend to use a blockchain database to secure data from this learning system. The data being secured are students’ scores accumulated by successful attacks or defends from the other students’ implementations. As more professionals are departing from traditional relational databases, the enthusiasm around distributed ledger databases is growing, specifically blockchain. With many available platforms applying blockchain structures, it is important to understand how this emerging technology is being used, with the goal of utilizing this technology for our learning system. In order to successfully secure the data and ensure it is tamper resistant, an investigation of blockchain technology use cases must be conducted. In addition, this paper defined the primary characteristics of the emerging distributed ledgers or blockchain technology, to ensure we effectively harness this technology to secure our data. Moreover, we explored using a blockchain database for our data. 1. Introduction New buzz words are constantly surfacing in the ever evolving field of computer science, so it is critical to distinguish the difference between temporary fads and new evolutionary technology. Blockchain is one of the newest and most developmental technologies currently drawing interest.
    [Show full text]
  • An Introduction to Cloud Databases a Guide for Administrators
    Compliments of An Introduction to Cloud Databases A Guide for Administrators Wendy Neu, Vlad Vlasceanu, Andy Oram & Sam Alapati REPORT Break free from old guard databases AWS provides the broadest selection of purpose-built databases allowing you to save, grow, and innovate faster Enterprise scale at 3-5x the performance 14+ database engines 1/10th the cost of vs popular alternatives - more than any other commercial databases provider Learn more: aws.amazon.com/databases An Introduction to Cloud Databases A Guide for Administrators Wendy Neu, Vlad Vlasceanu, Andy Oram, and Sam Alapati Beijing Boston Farnham Sebastopol Tokyo An Introduction to Cloud Databases by Wendy A. Neu, Vlad Vlasceanu, Andy Oram, and Sam Alapati Copyright © 2019 O’Reilly Media Inc. All rights reserved. Printed in the United States of America. Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472. O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are also available for most titles (http://oreilly.com). For more infor‐ mation, contact our corporate/institutional sales department: 800-998-9938 or [email protected]. Development Editor: Jeff Bleiel Interior Designer: David Futato Acquisitions Editor: Jonathan Hassell Cover Designer: Karen Montgomery Production Editor: Katherine Tozer Illustrator: Rebecca Demarest Copyeditor: Octal Publishing, LLC September 2019: First Edition Revision History for the First Edition 2019-08-19: First Release The O’Reilly logo is a registered trademark of O’Reilly Media, Inc. An Introduction to Cloud Databases, the cover image, and related trade dress are trademarks of O’Reilly Media, Inc. The views expressed in this work are those of the authors, and do not represent the publisher’s views.
    [Show full text]
  • Middleware-Based Database Replication: the Gaps Between Theory and Practice
    Appears in Proceedings of the ACM SIGMOD Conference, Vancouver, Canada (June 2008) Middleware-based Database Replication: The Gaps Between Theory and Practice Emmanuel Cecchet George Candea Anastasia Ailamaki EPFL EPFL & Aster Data Systems EPFL & Carnegie Mellon University Lausanne, Switzerland Lausanne, Switzerland Lausanne, Switzerland [email protected] [email protected] [email protected] ABSTRACT There exist replication “solutions” for every major DBMS, from Oracle RAC™, Streams™ and DataGuard™ to Slony-I for The need for high availability and performance in data Postgres, MySQL replication and cluster, and everything in- management systems has been fueling a long running interest in between. The naïve observer may conclude that such variety of database replication from both academia and industry. However, replication systems indicates a solved problem; the reality, academic groups often attack replication problems in isolation, however, is the exact opposite. Replication still falls short of overlooking the need for completeness in their solutions, while customer expectations, which explains the continued interest in developing new approaches, resulting in a dazzling variety of commercial teams take a holistic approach that often misses offerings. opportunities for fundamental innovation. This has created over time a gap between academic research and industrial practice. Even the “simple” cases are challenging at large scale. We deployed a replication system for a large travel ticket brokering This paper aims to characterize the gap along three axes: system at a Fortune-500 company faced with a workload where performance, availability, and administration. We build on our 95% of transactions were read-only. Still, the 5% write workload own experience developing and deploying replication systems in resulted in thousands of update requests per second, which commercial and academic settings, as well as on a large body of implied that a system using 2-phase-commit, or any other form of prior related work.
    [Show full text]
  • Database Technology for Bioinformatics from Information Retrieval to Knowledge Systems
    Database Technology for Bioinformatics From Information Retrieval to Knowledge Systems Luis M. Rocha Complex Systems Modeling CCS3 - Modeling, Algorithms, and Informatics Los Alamos National Laboratory, MS B256 Los Alamos, NM 87545 [email protected] or [email protected] 1 Molecular Biology Databases 3 Bibliographic databases On-line journals and bibliographic citations – MEDLINE (1971, www.nlm.nih.gov) 3 Factual databases Repositories of Experimental data associated with published articles and that can be used for computerized analysis – Nucleic acid sequences: GenBank (1982, www.ncbi.nlm.nih.gov), EMBL (1982, www.ebi.ac.uk), DDBJ (1984, www.ddbj.nig.ac.jp) – Amino acid sequences: PIR (1968, www-nbrf.georgetown.edu), PRF (1979, www.prf.op.jp), SWISS-PROT (1986, www.expasy.ch) – 3D molecular structure: PDB (1971, www.rcsb.org), CSD (1965, www.ccdc.cam.ac.uk) Lack standardization of data contents 3 Knowledge Bases Intended for automatic inference rather than simple retrieval – Motif libraries: PROSITE (1988, www.expasy.ch/sprot/prosite.html) – Molecular Classifications: SCOP (1994, www.mrc-lmb.cam.ac.uk) – Biochemical Pathways: KEGG (1995, www.genome.ad.jp/kegg) Difference between knowledge and data (semiosis and syntax)?? 2 Growth of sequence and 3D Structure databases Number of Entries 3 Database Technology and Bioinformatics 3 Databases Computerized collection of data for Information Retrieval Shared by many users Stored records are organized with a predefined set of data items (attributes) Managed by a computer program: the database
    [Show full text]
  • Microcomputers: NQS PUBLICATIONS Introduction to Features and Uses
    of Commerce Computer Science National Bureau and Technology of Standards NBS Special Publication 500-110 Microcomputers: NQS PUBLICATIONS Introduction to Features and Uses QO IGf) .U57 500-110 NATIONAL BUREAU OF STANDARDS The National Bureau of Standards' was established by an act ot Congress on March 3, 1901. The Bureau's overall goal is to strengthen and advance the Nation's science and technology and facilitate their effective application for public benefit. To this end, the Bureau conducts research and provides; (1) a basis for the Nation's physical measurement system, (2) scientific and technological services for industry and government, (3) a technical basis for equity in trade, and (4) technical services to promote public safety. The Bureau's technical work is per- formed by the National Measurement Laboratory, the National Engineering Laboratory, and the Institute for Computer Sciences and Technology. THE NATIONAL MEASUREMENT LABORATORY provides the national system of physical and chemical and materials measurement; coordinates the system with measurement systems of other nations and furnishes essential services leading to accurate and uniform physical and chemical measurement throughout the Nation's scientific community, industry, and commerce; conducts materials research leading to improved methods of measurement, standards, and data on the properties of materials needed by industry, commerce, educational institutions, and Government; provides advisory and research services to other Government agencies; develops, produces, and
    [Show full text]
  • Usb Laptop Console 2
    QUICK SETUP GUIDE KVT100A-FT USB LAPTOP CONSOLE 2 24/7 TECHNICAL SUPPORT AT 1.877.877.2269 OR VISIT BLACKBOX.COM PACKAGE CONTENTS: The USB Laptop Console 2 is an adapter which enables you to control any computer with USB ports and a VGA video output, using just your • Second-Gen USB Laptop Console laptop. (Computer systems with DVI, HDMI or DisplayPort video will • (1x) 6 ft.(1.8m) high speed USB-A to require a DVI to VGA, HDMI to VGA or DisplayPort to VGA adapter.) USB-miniB cable The USB Laptop Console 2 streams real-time video over high-speed • (1x) flash drive with software and USB, from the host computer (server) to the application software you Users Manual (PDF) will install on your laptop. At the same time, the software emulates • Quick setup guide (printed) the server’s USB keyboard and mouse. A window on your laptop acts as a console and gives you easy and complete control over the server. It works even if the server is in the BIOS, or is otherwise damaged. No software is ever installed on the server. The USB Laptop Console 2 can be hot-plugged into a working system at any time. 1.877.877.2269 BLACKBOX.COM NEED HELP? QUICK SETUP GUIDE LEAVE THE TECH TO US KVT100A-FT LIVE 24/7 TECHNICAL SUPPORT USB LAPTOP CONSOLE 2 1.877.877.2269 Software Installation: Before installing the software in a Windows or Apple computer, please ensure the USB Laptop Console 2 is DISCONNECTED. Cancel any “Add new hardware” dialog screens as these can interfere with the installation process.
    [Show full text]
  • TIBCO Spotfire® Analyst Portable Software Release 10.4 2
    TIBCO Spotfire® Analyst Portable Software Release 10.4 2 Important Information SOME TIBCO SOFTWARE EMBEDS OR BUNDLES OTHER TIBCO SOFTWARE. USE OF SUCH EMBEDDED OR BUNDLED TIBCO SOFTWARE IS SOLELY TO ENABLE THE FUNCTIONALITY (OR PROVIDE LIMITED ADD-ON FUNCTIONALITY) OF THE LICENSED TIBCO SOFTWARE. THE EMBEDDED OR BUNDLED SOFTWARE IS NOT LICENSED TO BE USED OR ACCESSED BY ANY OTHER TIBCO SOFTWARE OR FOR ANY OTHER PURPOSE. USE OF TIBCO SOFTWARE AND THIS DOCUMENT IS SUBJECT TO THE TERMS AND CONDITIONS OF A LICENSE AGREEMENT FOUND IN EITHER A SEPARATELY EXECUTED SOFTWARE LICENSE AGREEMENT, OR, IF THERE IS NO SUCH SEPARATE AGREEMENT, THE CLICKWRAP END USER LICENSE AGREEMENT WHICH IS DISPLAYED DURING DOWNLOAD OR INSTALLATION OF THE SOFTWARE (AND WHICH IS DUPLICATED IN THE LICENSE FILE) OR IF THERE IS NO SUCH SOFTWARE LICENSE AGREEMENT OR CLICKWRAP END USER LICENSE AGREEMENT, THE LICENSE(S) LOCATED IN THE “LICENSE” FILE(S) OF THE SOFTWARE. USE OF THIS DOCUMENT IS SUBJECT TO THOSE TERMS AND CONDITIONS, AND YOUR USE HEREOF SHALL CONSTITUTE ACCEPTANCE OF AND AN AGREEMENT TO BE BOUND BY THE SAME. ANY SOFTWARE ITEM IDENTIFIED AS THIRD PARTY LIBRARY IS AVAILABLE UNDER SEPARATE SOFTWARE LICENSE TERMS AND IS NOT PART OF A TIBCO PRODUCT. AS SUCH, THESE SOFTWARE ITEMS ARE NOT COVERED BY THE TERMS OF YOUR AGREEMENT WITH TIBCO, INCLUDING ANY TERMS CONCERNING SUPPORT, MAINTENANCE, WARRANTIES, AND INDEMNITIES. DOWNLOAD AND USE OF THESE ITEMS IS SOLELY AT YOUR OWN DISCRETION AND SUBJECT TO THE LICENSE TERMS APPLICABLE TO THEM. BY PROCEEDING TO DOWNLOAD, INSTALL OR USE ANY OF THESE ITEMS, YOU ACKNOWLEDGE THE FOREGOING DISTINCTIONS BETWEEN THESE ITEMS AND TIBCO PRODUCTS.
    [Show full text]
  • What Is a Database? Differences Between the Internet and Library
    What is a Database? Library databases are mostly full-text material (in their entirety) and summaries or descriptions of articles. They are collections of articles from newspapers, magazines and journals and electronic reference sources. Databases are selected for the quality and variety of resources they offer and are accessed using the Internet. Your library pays for you to have access to a number of relevant databases. You support this with your tuition, so get the most out of your money! You can access them from home or school via the Library Webpage or use the link below. http://www.mxcc.commnet.edu/Content/Find_Articles.asp Two short videos on the benefits of using library databases: http://www.youtube.com/watch?v=VUp1P-ubOIc http://youtu.be/Q2GMtIuaNzU Differences Between the Internet and Library Databases The Internet Library Databases Examples Google, Yahoo, Bing LexisNexis, Literary Reference Center or Health and Wellness Resource Center Review process None – anyone can add Checked for accuracy by content to the Web. publishers. Chosen by your college’s library. Includes “peer-reviewed” scholarly articles. Reliability Unknown Very No quality control mechanisms! Content Anything, from pictures of a Scholarly journal articles, Book person’s pets to personal reviews, Research papers, (usually not researched and Conference papers, and other unsubstantiated) opinions on scholarly information gun control, abortion, etc. How often updated Unknown/varies. Regularly – daily, quarterly monthly Cost “Free” but some of the info Library has paid for you to you may need for your access these databases. assignment requires a fee. Organization Very little or no organization Very organized Availability Websites come and go.
    [Show full text]
  • Embedded ATMEL HTTP Server
    Embedded ATMEL HTTP Server A Design Project Report Presented to the Engineering Division of the Graduate School of Cornell University in Partial Fulfillment of the Requirements for the Degree of Masters of Engineering (Electrical) by Tzeming Tan, Jeremy Project Advisor: Dr. Bruce R. Land Degree date: May 2004 Abstract Master of Electrical and Computer Engineering Program Cornell University Design Project Report Project Title: Embedded ATMEL HTTP Server Author: Tzeming Tan, Jeremy Abstract: The objective of this project was to design and build an embedded HTTP server using a microcontroller chip. The webserver required the implementation of the interface with Ethernet as well as several internet protocols such as TCP/IP and ARP. This embedded web server is able to serve small, static web pages as well as perform certain useful laboratory lab functions such as displaying the current temperature read by the microcontroller from a thermometer, on the webpage. While the capabilities of the embedded webserver are no where near that of a regular server computer, its small size and relatively low cost makes it more practical for some applications. The web server was built, tested to work, and a temperature reporting feature added to it. Report Approved by Project Advisor: ___________________________________ Date: __________ ii Executive Summary The internet is a versatile, convenient and efficient means of communication in the 21st century. Protocols such as TCP/IP, UDP, DHCP and ICMP form the backbone of internet communications a large bulk of which consists of Hyper Text Transfer Protocol (HTTP) traffic for the World Wide Web. A HTTP or web server is a server process running at a web site which sends out web pages in response to HTTP requests from remote browsers.
    [Show full text]