Quantian As an Environment for Distributed Statistical Computing

Total Page:16

File Type:pdf, Size:1020Kb

Quantian As an Environment for Distributed Statistical Computing Quantian for distributed statistical computing Dirk Eddelbuettel Background Quantian as an environment for Introduction Timeline distributed statistical computing Quantian Motivation Content Distributed Computing Dirk Eddelbuettel Overview Preparation Beowulf Debian Project openMosix [email protected] R Examples Snow SnowFT papply DSC 2005 – Directions in Statistical Computing 2005 Others University of Washington, Seattle, August 13-14, 2005 Summary Quantian for distributed statistical computing Outline Dirk Eddelbuettel Background Introduction Timeline 1 Background Quantian Motivation Content 2 Quantian Distributed Computing Overview Preparation 3 Distributed Computing Beowulf openMosix R Examples Snow 4 R Examples SnowFT papply Others Summary 5 Summary Quantian for distributed statistical computing What is Quantian? Dirk A live-dvd for numbers geeks Eddelbuettel Background Introduction Timeline • Quantian is a directly bootable and self-configuring Quantian Motivation Linux sytem that runs from a compressed dvd image. Content • Quantian offers zero-configuration cluster computing Distributed Computing using openMosix, including ’openMosix Terminalserver’ Overview Preparation via PXE. Beowulf openMosix • Quantian contains over 6gb of software, including an R Examples Snow additional 4gb of ’quantitative’ software: scientific, SnowFT papply numerical, statistical, engineering, ... Others • Summary Quantian also contains editors, programming languages, complete latex support, two ’office’ suites, networking tools and more. Quantian for distributed statistical computing Quantian lineage Dirk Knoppix, clusterKnoppix, Debian Eddelbuettel Background Introduction Timeline • Quantian is based on clusterKnoppix, which extends Quantian Motivation Knoppix with an openMosix-enabled kernel and Content applications, kernel modules and security patches. Distributed Computing • ClusterKnoppix extends Knoppix, an impressive ’linux Overview Preparation on a cdrom’ system which puts 2.1gb of software onto a Beowulf openMosix cdrom along with auto-detection and configuration (but R Examples Snow Knopppix followed Quantian and switched to 4gb dvds). SnowFT papply • Knoppix is based on Debian, a Linux distribution Others containing over 8000 source packages available for 12 Summary architectures (such as i386, alpha, ia64, amd64, sparc or s390) produced by hundreds of individuals. Quantian for distributed statistical computing Timeline Dirk As provided by the releases Eddelbuettel Background Introduction Timeline • 0.1 (March 2003): Initial version at DSC 2003. Quantian Motivation • 0.2 (May 2003): Now based on Knoppix 3.2. Content Distributed • 0.3 (June 2003): Switched to using clusterKnoppix Computing Overview which added openMosix clustering support. Preparation Beowulf • 0.3.9.* (Sep. 2003): Updated clusterKnoppix. openMosix R Examples • 0.4.9.* (Oct. 2003 to Mar. 2004): Based on Knoppix 3.3. Snow SnowFT • 0.5.9.* (June to Sep. 2004): Based on Knoppix 3.4, first papply Others ’kitchen sink’ versions > 1gb for bootable DVDs. Summary • 0.6.9.* (Oct. to Dec. 2004): Based on Knoppix 3.6, size increased to 2.0gb. Quantian for distributed statistical computing Motivation Dirk Major modes of use Eddelbuettel Background Introduction • Computing clusters to speed up embarrassingly Timeline parallel tasks. Quantian Motivation • Computer labs by enabling temporary use of a Content Distributed computing environment booted off a dvd, and/or Computing netbooting. Overview Preparation Beowulf • Students / co-workers as distributing DVDs enables openMosix work in identical environments with minimal R Examples Snow administration. SnowFT papply • Convenience of not having to chase down new Others Summary software releases, and to configure and installing it. • Easier installation of a ’normal’ workstation by booting off Quantian, and / or installing to hard disk getting a head start with 6gb of configured software. Quantian for distributed statistical computing What is included? Dirk Broken down by field Eddelbuettel Background Introduction Timeline • Statistics: GNU R (plus essentially all of CRAN and Quantian Motivation BioConductor; Ggobi, ESS), Xlispstat, Gretl, PSPP. Content • Bioinformatics: BioConductor, BioPython, BioPerl and Distributed Computing tools like emboss and blast2. Overview Preparation • Beowulf Mathematics: Six computer algebra systems, matrix openMosix languages Octave (with add-on packages), Yorick and R Examples Snow Scilab, TeXmacs front-end. SnowFT papply • Physics: CERN tools (Cernlib, Geant, PAW/PAW++), Others Scientific / Numeric Python, GNU GSL libraries. Summary • Visualization and graphics: OpenDX, Mayavi, Ggobi, Gnuplot, Grace, Gri, plotutils, xfig. Quantian for distributed statistical computing What is included? Dirk Broken down by application area Eddelbuettel Background Introduction • Programming languages: C, C++, Fortran, Java, Perl, Timeline Quantian Python, PHP, Ruby, Lua, Tcl, Awk, A+. Motivation Content • Editors: XEmacs, Vim, jed, joe, kate, nedit, zile. Distributed • Computing Scientific Publishing: Extended LaTeX support with Overview several frontends (xemacs, kile, lyx) and extensions. Preparation Beowulf openMosix • Office software: OpenOffice.org, KOffice, Gnumeric, R Examples and tools like the Gimp. Snow SnowFT • Finance: Software from the Rmetrics project and the papply Others QuantLib libraries. Summary • Networking: ethereal, portmap, netcat, ethercap, bittorent, nmap, squid plus wireless tools and drivers. • General tools: Apache, MySQL, PHP, and more. Quantian for distributed statistical computing How to use many computers Dirk Conceptual overview Eddelbuettel Background Introduction Timeline Quantian • ’sneaker net’: physically (or virtually via ssh) running Motivation Content from machine to machine, launching jobs and collecting Distributed results. Computing Overview • ’Beowulf’ clusters using ’MPI/PVM/...’ require explicitly Preparation Beowulf parallel code (though there are some R wrappers, more openMosix below). R Examples Snow SnowFT • openMosix forms a ’single system image’ computer and papply Others does require explicitly parallel code. Summary • Other approaches such as Condor or OSCAR which we won’t cover here. Quantian for distributed statistical computing Setup for PVM and MPI Dirk Should go into next Quantian revision Eddelbuettel Background Introduction Timeline • PVM and MPI ’do not know’ they are running inside. Quantian • They want to talk to other hosts by ssh. Motivation Content • PVM/MPI require distinct hostnames for all machines. Distributed Computing • Setup for ssh, LAM and PVM: Overview Preparation $ cp -ax /root/.ssh ~knoppix Beowulf openMosix $ chown -R knoppix.knoppix ~knoppix/.ssh R Examples $ ifconfig # note $IP Snow $ hostname Quantian$IP SnowFT papply $ vi /etc/hosts # define local hosts Others $ scp /etc/hosts to_all_local_hosts Summary $ vi /tmp/clusterhosts # add them $ lamboot /tmp/clusterhosts $ echo “conf” | pvm /tmp/clusterhosts Quantian for distributed statistical computing Distributed computing: Beowulf Dirk Eddelbuettel Background • Beowulf clusters use message-passing interfaces such Introduction Timeline as LAM/MPI or PVM to communicate across nodes. Quantian • This may require a sizable amount of new programming Motivation Content and explicitly parallel coding. ’Hard’ Distributed Computing • Quantian includes several Beowulf tools and libraries: Overview Preparation • LAM MPI libraries and run-time; Beowulf • Mpich MPI libraries and run-time; openMosix • R Examples Pvm (Parallel Virtual Machine) libraries and run-time; Snow • Sprng (Scalable Parallel Random Number Generator); SnowFT papply • as well as documentation as examples for these. Others Summary • Contrast: openMosix takes existing programs and moves them around nodes in the cluster to achieve optimal load across all nodes in the cluster – no alteration to algorithms, or new programming. Quantian for distributed statistical computing Cluster computing: openMosix Dirk Eddelbuettel Background • Easiest way to distribute computing load, esp. for Introduction Timeline ’embarrassingly parallel’ tasks, as the kernel schedules Quantian tasks across the cluster. Motivation Content • Since release 0.3, Quantian contains a kernel with the Distributed Computing openMosix patch as well as a set of openMosix utilities. Overview Preparation • As a result, “instant cluster computing” is possible Beowulf based on a single dvd or iso image: openMosix R Examples 1 boot one master instance from the dvd or hard disk, Snow 2 enable ’openmosixterminalserver’ from the menu, SnowFT papply 3 boot 1, 2, ... ’slave’ nodes via PXE protocol (available in Others most recent computers) from master, and Summary 4 enjoy openMosix on the cluster. • Big advantage: Identical software configuration, library versions, ... throughout the cluster. Quantian for distributed statistical computing Cluster computing: openMosix Dirk Eddelbuettel (cont.) Background Introduction Timeline • clusterKnoppix autoconfigures and autodiscovers the Quantian nodes. and enables (root) ssh access between them. Motivation Content • openMosix is ideal for stand-alone programs such as Distributed Computing ’old fashioned’ C++ or Fortran apps that ’just run’. Overview Preparation • In general, any program without shared memory, or Beowulf openMosix threads, will migrate though I/O may bring jobs back to R Examples Snow the front node. SnowFT papply • Ian Latter’s CHAOS projects addresses some of the Others security aspects by overlaying a VPN allowing for Summary private
Recommended publications
  • Instalación Y Configuración De Un Cluster De Alta Disponibilidad Con Reparto De Carga
    UNIVERSIDAD POLITÉCNICA DE VALENCIA Máster en Ingeniería de Computadores INSTALACIÓN Y CONFIGURACIÓN DE UN CLUSTER DE ALTA DISPONIBILIDAD CON REPARTO DE CARGA SERVIDOR WEB Y MAQUINAS VIRTUALES Alumno: Lenin Alcántara Roa. Director: Pedro López Rodríguez. Febrero 2014 Febrero de 2014 2 Universidad Politécnica de Valencia Febrero de 2014 ÍNDICE 1. INTRODUCCIÓN 5 1.1. Objetivos 6 1.2. Motivación 6 1.3. Resumen 6 2. ESTADO DEL ARTE 7 2.1. ¿Qué es un Cluster? 7 2.2. Clustering de Alta Disponibilidad con Linux 15 2.3. Sistemas Operativos 17 3. ENTORNO TECNOLOGICO 28 3.1. Programación Bash 28 3.2. Servidor DNS 29 3.3. Servidor NFS 29 3.4. Servidor DHCP 30 3.5. Servidor PXE 32 3.6. Servicio dnsmasq 34 3.7. Servicio NIS 35 3.8. Condor 36 3.9. MPI 37 3.10. Almacenamiento RAID 38 3.11. Servicio LVS 42 3.12. Alta Disponibilidad: Corosync, Pacemaker y ldirectord 43 3.13. Virtualización con Linux 44 4. DESCRIPCIÓN DE LA SOLUCIÓN 47 4.1. Configuración del Cluster 48 4.2. Instalación del Sistema Operativo en el Cluster 50 4.3. Administración del Sistema 59 4.4. Almacenamiento 65 4.5. Equilibrado de Carga 66 4.6. Alta Disponibilidad 68 4.7. Sistema de Máquinas Virtuales 70 5. PRUEBAS 73 5.1. Servidor Web 73 5.1.1. Reparto de Carga 73 5.1.2. Alta Disponibilidad 77 5.1.3. Evaluación del Servidor Web 80 5.2. Sistema de Máquinas Virtuales 84 6. CONCLUSIONES 89 6.1. Trabajo Futuro 90 7. BIBLIOGRAFÍA 91 Universidad Politécnica de Valencia 3 Febrero de 2014 4 Universidad Politécnica de Valencia Febrero de 2014 1.
    [Show full text]
  • 24 Bringing Order to Chaos: Barrier-Enabled I/O Stack for Flash Storage
    Bringing Order to Chaos: Barrier-Enabled I/O Stack for Flash Storage YOUJIP WON and JOONTAEK OH, Hanyang University, Korea JAEMIN JUNG, Texas A&M University, USA GYEONGYEOL CHOI and SEONGBAE SON, Hanyang University, Korea JOOYOUNG HWANG and SANGYEUN CHO, Samsung Electronics, Korea This work is dedicated to eliminating the overhead required for guaranteeing the storage order in the modern IO stack. The existing block device adopts a prohibitively expensive approach in ensuring the storage order among write requests: interleaving the write requests with Transfer-and-Flush. For exploiting the cache bar- rier command for flash storage, we overhaul the IO scheduler, the dispatch module, and the filesystem sothat these layers are orchestrated to preserve the ordering condition imposed by the application with which the associated data blocks are made durable. The key ingredients of Barrier-Enabled IO stack are Epoch-based IO scheduling, Order-Preserving Dispatch,andDual-Mode Journaling. Barrier-enabled IO stack can control the storage order without Transfer-and-Flush overhead. We implement the barrier-enabled IO stack in server as well as in mobile platforms. SQLite performance increases by 270% and 75%, in server and in smartphone, respectively. In a server storage, BarrierFS brings as much as by 43× andby73× performance gain in MySQL and SQLite, respectively, against EXT4 via relaxing the durability of a transaction. CCS Concepts: • Software and its engineering → File systems management; Additional Key Words and Phrases: Filesystem, storage, block device, linux ACM Reference format: Youjip Won, Joontaek Oh, Jaemin Jung, Gyeongyeol Choi, Seongbae Son, Jooyoung Hwang, and Sangyeun Cho. 2018. Bringing Order to Chaos: Barrier-Enabled I/O Stack for Flash Storage.
    [Show full text]
  • Emacs Speaks Statistics (ESS): a Multi-Platform, Multi-Package Intelligent Environment for Statistical Analysis
    Emacs Speaks Statistics (ESS): A multi-platform, multi-package intelligent environment for statistical analysis A.J. Rossini Richard M. Heiberger Rodney A. Sparapani Martin Machler¨ Kurt Hornik ∗ Date: 2003/10/22 17:34:04 Revision: 1.255 Abstract Computer programming is an important component of statistics research and data analysis. This skill is necessary for using sophisticated statistical packages as well as for writing custom software for data analysis. Emacs Speaks Statistics (ESS) provides an intelligent and consistent interface between the user and software. ESS interfaces with SAS, S-PLUS, R, and other statistics packages under the Unix, Microsoft Windows, and Apple Mac operating systems. ESS extends the Emacs text editor and uses its many features to streamline the creation and use of statistical software. ESS understands the syntax for each data analysis language it works with and provides consistent display and editing features across packages. ESS assists in the interactive or batch execution by the statistics packages of statements written in their languages. Some statistics packages can be run as a subprocess of Emacs, allowing the user to work directly from the editor and thereby retain a consistent and constant look- and-feel. We discuss how ESS works and how it increases statistical programming efficiency. Keywords: Data Analysis, Programming, S, SAS, S-PLUS, R, XLISPSTAT,STATA, BUGS, Open Source Software, Cross-platform User Interface. ∗A.J. Rossini is Research Assistant Professor in the Department of Biostatistics, University of Washington and Joint Assis- tant Member at the Fred Hutchinson Cancer Research Center, Seattle, WA, USA mailto:[email protected]; Richard M.
    [Show full text]
  • Ein Wilder Ritt Distributionen
    09/2016 Besichtigungstour zu den skurrilsten Linux-Distributionen Titelthema Ein wilder Ritt Distributionen 28 Seit den frühen 90ern schießen die Linux-Distributionen wie Pilze aus dem Boden. Das Linux-Magazin blickt zurück auf ein paar besonders erstaunliche oder schräge Exemplare. Kristian Kißling www.linux-magazin.de © Antonio Oquias, 123RF Oquias, © Antonio Auch wenn die Syntax anderes vermu- samer Linux-Distributionen aufzustellen, Basis für Evil Entity denkt (Grün!), liegt ten lässt, steht der Name des klassischen denn in den zweieinhalb Jahrzehnten falsch. Tatsächlich basierte Evil Entity auf Linux-Tools »awk« nicht für Awkward kreuzte eine Menge von ihnen unseren Slackware und setzte auf einen eher düs- (zu Deutsch etwa „tolpatschig“), sondern Weg. Während einige davon noch putz- ter anmutenden Enlightenment-Desktop für die Namen seiner Autoren, nämlich munter in die Zukunft blicken, ist bei an- (Abbildung 3). Alfred Aho, Peter Weinberger und Brian deren nicht recht klar, welche Zielgruppe Als näher am Leben erwies sich der Fo- Kernighan. Kryptische Namen zu geben sie anpeilen oder ob sie überhaupt noch kus der Distribution, der auf dem Ab- sei eine lange etablierte Unix-Tradition, am Leben sind. spielen von Multimedia-Dateien lag – sie heißt es auf einer Seite des Debian-Wiki wollten doch nur Filme schauen. [1], die sich mit den Namen traditioneller Linux für Zombies Linux-Tools beschäftigt. Je kaputter, desto besser Denn, steht dort weiter, häufig halten Apropos untot: Die passende Linux- Entwickler die Namen ihrer Tools für Distribution für Zombies ließ sich recht Auch Void Linux [4], der Name steht selbsterklärend oder sie glauben, dass einfach ermitteln. Sie heißt Undead Linux je nach Übersetzung für „gleichgültig“ sie die User ohnehin nicht interessieren.
    [Show full text]
  • Clustering with Openmosix
    Clustering with openMosix Maurizio Davini (Department of Physics and INFN Pisa) Presented by Enrico Mazzoni (INFN Pisa) Introduction • What is openMosix? – Single-System Image – Preemptive Process Migration – The openMosix File System (MFS) • Application Fields • openMosix vs Beowulf • The people behind openMosix • The openMosix GNU project • Fork of openMosix code 12/06/2003 HTASC 2 The openMosix Project MileStones • Born early 80s on PDP-11/70. One full PDP and disk-less PDP, therefore process migration idea. • First implementation on BSD/pdp as MS.c thesis. • VAX 11/780 implementation (different word size, different memory architecture) • Motorola / VME bus implementation as Ph.D. thesis in 1993 for under contract from IDF (Israeli Defence Forces) • 1994 BSDi version • GNU and Linux since 1997 • Contributed dozens of patches to the standard Linux kernel • Split Mosix / openMosix November 2001 • Mosix standard in Linux 2.5? 12/06/2003 HTASC 3 What is openMOSIX • Linux kernel extension (2.4.20) for clustering • Single System Image - like an SMP, for: – No need to modify applications – Adaptive resource management to dynamic load characteristics (CPU intensive, RAM intensive, I/O etc.) – Linear scalability (unlike SMP) 12/06/2003 HTASC 4 A two tier technology 1. Information gathering and dissemination – Support scalable configurations by probabilistic dissemination algorithms – Same overhead for 16 nodes or 2056 nodes 2. Pre-emptive process migration that can migrate any process, anywhere, anytime - transparently – Supervised by adaptive
    [Show full text]
  • XLISP-STAT a Statistical Environment Based on the XLISP Language (Version 2.0)
    I XLISP-STAT A Statistical Environment Based on the XLISP Language (Version 2.0) by Luke Tierney l.5i1 University of Minnesota School of Statistics Technical Report Number 528 July 1989 Contents Preface .. 3 1 Starting and Finishing 6 2 Introduction to Basics 8 2.1 Data ........ 8 2.2 The Listener and the Evaluator . 8 3 Elementary Statistical Operations 11 3.1 First Steps ......... 11 3.2 Summary Statistics and Plots 12 3.3 Two Dimensional Plots 16 3.4 Plotting Functions ..... 19 4 More on Generating and Modifying Data 20 4.1 Generating Random Data . 20 4.2 Generating Systematic Data . 20 4.3 Forming Subsets and .Deleting Cases 21 4.4 Combining Several Lists 22 4.5 Modifying Data . 22 5 Some Useful Shortcuts 24 5.1 Getting Help . 24 5.2 Listing and Undefining Variables .. 26 5.3 More on the XLISP-STAT Listener .. 26 5 .4 Loading Files . 28 5.5 Saving Your Work ..... 28 5.6 The XLISP-STAT Editor 29 5.7 Reading Data Files .. 29 5.8 User Initialization File 29 6 More Elaborate Plots 30 6.1 Spinning Plots . ..... 30 6.2 Scatterplot Matrices • It ••••• 32 6.3 Interacting with Individual Plots 35 6.4 Linked Plots ....... 35 6.5 Modifying a Scatter Plot . 36 6.6 Dynamic Simulations . 39 7 Regression 42 8 Defining Your Own Functions and Methods 47 8.1 Defining Functions .... 47 8.2 Anonymous Functions .. 48 8.3 Some Dynamic Simulations 48 8.4 Defining Methods . 51 8.5 Plot Methods . 52 9 Matrices and Arrays 53 10 Nonlinear Regression 54 1 11 One Way ANOVA 57 12 Maximization and Maximum Likeliho~d Estimation 58 13 Approximate Bayesian Computations 61 A XLISP-STAT on UNIX Systems 68 A.1 XLISP-STAT Under the X11 Window System.
    [Show full text]
  • "Graph" Program
    Physics 305 Introduction to Computational Physics Fall 2010 The graph program Unfortunately, graphics programs are not as universal as we would like. These notes discuss one simple graphics program, unimaginatively called “graph”. While it is not a part of the standard Linux setup, it is universally available (and free) from the GNU software project. In fact, it is written and maintained by Robert Maier in the UA Math department. More importantly for you, it is already installed on faraday. Before you try out “graph”, make a couple of data files to practice on. These should be just two columns of number, where each line is an (x, y) pair. For example, file “data1” might look like: 0.0 0.123 0.5 0.2245 0.7 0.3145 1.1 0.224 Note that the x values don’t have to be evenly spaced. graph can make several kinds of output. The two that are important to us are X-windows output, which makes plots appear on the screen in front of you, and PostScript output, which you can email as part of your homework assignment. (You can, and should, also view the PostScript on your screen by using the “gv” (short for “ghostview”) command. Now try it out. Suppose “data1” is the name of one our your practice data files. Type graph -T X data1 WATCH OUT. Upper and lower case matters. Spacing and punctuation matter. “-T” and “X” must be upper case, and there must be a space between them. This should produce a graph of your data file on your screen.
    [Show full text]
  • The Openmosix Resource Sharing Algorithms Are Designed to Respond On-Line to Variations in the Resource Usage Among the Nodes
    Institute of Fundamental Technological Research, Polish Academy of Sciences, Department of Mechanics and Physics of Fluids Documentation on Linux clustering with openMosix Version 1.0.1 Supervisor of the project: Tomasz Kowalewski Administration: Piotr Matejek Technical performers: Ihor Trots & Vasyl Kovalchuk Cluster’s web page: http://fluid.ippt.gov.pl/mosix/ Warsaw, 2004 Table of contents: 1 OpenMosix …………………………………………………….. 3 1.1 Installing openMosix ……………………………………….. 4 1.2 Configuration steps openMosix …………………………… 8 1.3 OpenMosix in action ……………………………………….. 10 1.4 OpenMosixview …………………………………………….. 11 2 Masquerading and IP-tables ………………………………... 14 2.1 Applying iptables and Patch-o-Matic kernel patches 15 2.2 Configuring IP Masquerade on Linux 2.4.x Kernels …. 22 3 Parallel Virtual Machine PVM ………………………………. 29 3.1 Building and Installing ……………………………………… 29 3.2 Running PVM Programs .................................................. 32 4 Message Passing Interface MPI ………………………….… 34 4.1 Configuring, Making and Installing .................................. 34 4.2 Programming Tools ......................................................... 38 4.3 Some details ……………………………………….……….. 41 5 References …………………………………………………….. 46 2 1 OpenMosix Introduction OpenMosix is a kernel extension for single-system image clustering. Clustering technologies allow two or more Linux systems to combine their computing resources so that they can work cooperatively rather than in isolation. OpenMosix is a tool for a Unix-like kernel, such as Linux, consisting of adaptive resource sharing algorithms. It allows multiple uniprocessors and symmetric multiprocessors (SMP nodes) running the same kernel to work in close cooperation. The openMosix resource sharing algorithms are designed to respond on-line to variations in the resource usage among the nodes. This is achieved by migrating processes from one node to another, preemptively and transparently, for load-balancing and to prevent thrashing due to memory swapping.
    [Show full text]
  • The Bioinformatics Playground
    Gearing for bioinformatics Gearing for bioinformatics Bela Tiwari and Dawn Field explore the tools and facilities that ioinformatics’ is a buzz word that is Projects with enough funding are able to hire users will depend on the system, how they will becoming increasingly audible in the dedicated system administrators to provide access it, etc. Live CD or DVD distributions may can be used by the budding open source bioinformatician ‘BLinux world. Fast, economical, sustainable bioinformatics computing systems, be good for an individual and for demonstration flexible, and extensible computing power is but many of us are not that lucky and have to purposes, but they are probably not the right making Linux increasingly attractive to scientists go it alone. choice for the provision of tools to a whole in many areas of research, including biology. To add to the challenge, much bioinformatics department. More generally, the open source movement has software is written by academics, and while greatly benefited biological research; the most there are some very good, well tested packages LIVE DISTRIBUTIONS publicised project being the publicly funded out there, there are also many that were Live Linux distributions are a relatively new effort to sequence and make freely available the intended to answer a particular question, on a phenomenon and offer some big advantages. human genome. Less well publicised is the huge particular machine, for a particular group. Such You don’t have to install anything to run them. amount of biological data that can be freely packages were often not built with portability, Just slot the CD or DVD into the drive and boot accessed.
    [Show full text]
  • Enabling Virtualization Technologies for Enhanced Cloud Computing
    New Jersey Institute of Technology Digital Commons @ NJIT Dissertations Electronic Theses and Dissertations Fall 1-31-2015 Enabling virtualization technologies for enhanced cloud computing Kashifuddin Qazi New Jersey Institute of Technology Follow this and additional works at: https://digitalcommons.njit.edu/dissertations Part of the Computer Sciences Commons Recommended Citation Qazi, Kashifuddin, "Enabling virtualization technologies for enhanced cloud computing" (2015). Dissertations. 106. https://digitalcommons.njit.edu/dissertations/106 This Dissertation is brought to you for free and open access by the Electronic Theses and Dissertations at Digital Commons @ NJIT. It has been accepted for inclusion in Dissertations by an authorized administrator of Digital Commons @ NJIT. For more information, please contact [email protected]. Copyright Warning & Restrictions The copyright law of the United States (Title 17, United States Code) governs the making of photocopies or other reproductions of copyrighted material. Under certain conditions specified in the law, libraries and archives are authorized to furnish a photocopy or other reproduction. One of these specified conditions is that the photocopy or reproduction is not to be “used for any purpose other than private study, scholarship, or research.” If a, user makes a request for, or later uses, a photocopy or reproduction for purposes in excess of “fair use” that user may be liable for copyright infringement, This institution reserves the right to refuse to accept a copying order
    [Show full text]
  • GNU Libredwg for Version 0.12.4, 30 December 2020
    GNU LibreDWG for version 0.12.4, 30 December 2020 GNU LibreDWG Developers and Thien-Thi Nguyen This manual is for GNU LibreDWG (version 0.12.4, 30 December 2020). Copyright c 2010-2020 Free Software Foundation, Inc. Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.3 or any later version published by the Free Software Foundation; with no Invariant Sections, with no Front-Cover Texts, and with no Back-Cover Texts. A copy of the license is included in the section entitled \GNU Free Documentation License". i Table of Contents 1 Overview ::::::::::::::::::::::::::::::::::::::::: 1 1.1 API/ABI version ::::::::::::::::::::::::::::::::::::::::::::::: 1 1.2 Coverage ::::::::::::::::::::::::::::::::::::::::::::::::::::::: 1 1.3 Related projects :::::::::::::::::::::::::::::::::::::::::::::::: 3 2 Usage ::::::::::::::::::::::::::::::::::::::::::::: 5 3 Types::::::::::::::::::::::::::::::::::::::::::::: 6 4 Objects ::::::::::::::::::::::::::::::::::::::::::: 8 4.1 HEADER :::::::::::::::::::::::::::::::::::::::::::::::::::::: 8 4.2 ENTITIES :::::::::::::::::::::::::::::::::::::::::::::::::::: 22 4.3 OBJECTS :::::::::::::::::::::::::::::::::::::::::::::::::::: 92 5 Sections:::::::::::::::::::::::::::::::::::::::: 259 5.1 HEADER Section :::::::::::::::::::::::::::::::::::::::::::: 259 5.2 OBJECTS Section ::::::::::::::::::::::::::::::::::::::::::: 259 5.3 CLASSES Section :::::::::::::::::::::::::::::::::::::::::::: 259 5.4 HANDLES Section :::::::::::::::::::::::::::::::::::::::::::
    [Show full text]
  • Linux Distributions for Bioinformatics
    EMBnet. news 15.3 REVIEWS 35 run as a virtual machine, in parallel with the host Linux distributions for operating system, a feature which has the ad- vantage of giving occasional users or students bioinformatics: the possibility to use their usual environments while becoming familiar with a new operating an update system. This is reflected in the Linux distributions for bioinformatics that we have reviewed in this article. A trend that has been noticed in respect with the review we made in 2007 is the tenden- cy to use Ubuntu as base distribution which is in fact replacing Knoppix and to provide the bio- informatics bench environment also as a virtual machine which can be run inside the popular VMWare environment in parallel with the host op- erating system. While reviewing the distributions in this article we have paid particular attention to 1 Antonia Rana and Fabrizio Foscarini their user friendliness and ease of use. Joint Research Centre, European Commission Bio-Linux Bio-Linux [4], developed and distributed by the NERC Environmental Bioinformatics Centre, has Introduction evolved since our review in 2007, its home page The article provides an updated view on the has also changed. Its developers describe it as world of Linux distributions tailored for bioinformat- “…a fully featured, powerful, configurable and ics analysis. The main driver for producing these easy to maintain bioinformatics workstation ” and distributions is to provide an easy-to-use, user in fact it is rich with applications and documen- friendly environment for non IT specialised users tation. In its current versions, 5.0, the most nota- without strong requirements on the knowledge ble new features are the possibility to boot it from of the technology.
    [Show full text]