Workflow and Pipelining in Cheminformatics
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Istls Information Services to Life Science Internet Bioinformatics Resources Josef Maier [E-Mail: [email protected]] Last Checked August, 17Th, 2011
IStLS Information Services to Life Science Internet Bioinformatics Resources Josef Maier [e-mail: [email protected]] Last checked August, 17th, 2011 IStLS Bioinformatics Resources http://www.istls.de/bioinfolinks.php Courses and lectures Bioinformatics - Online Courses and Tutorials http://www.bioinformatik.de/cgi-bin/browse/Catalog/Research_and_Education/Online_Courses_and_Tutorials/ EMBRACE Network of Excellence http://www.embracegrid.info/page.php EMBNet Quick Guides http://www.embnet.org/node/64 EMBNet Courses http://www.embnet.org/ Sequence Analysis with distributed Resources http://bibiserv.techfak.uni-bielefeld.de/sadr/ Tutorial Protein Structures (EXPASY) SwissModel http://swissmodel.expasy.org/course/course-index.htm CMBI Courses for protein structure http://swift.cmbi.ru.nl/teach/courses/index.html 2Can Support Portal - Bioinformatics educational resource http://www.ebi.ac.uk/2can Bioconductor Workshops http://www.bioconductor.org/workshops/ CBS Bioinformatics Courses http://www.cbs.dtu.dk/courses.php The European School In Bioinformatics (Biosapiens) http://www.biosapiens.info/page.php?page=esb Institutes Centers Networks Bioinformatics Institutes Germany WSI Wilhelm-Schickard-Institut für Informatik - Universitaet Tuebingen http://www.uni-tuebingen.de/en/faculties/faculty-of-science/departments/computer-science/department.html WSI Huson - Algorithms in Bioinformatics http://www-ab.informatik.uni-tuebingen.de/welcome.html WSI Prof. Zell - Computer Architecture http://www.ra.cs.uni-tuebingen.de/ WSI Kohlbacher - Div. for Simulation -
Open Data, Open Source, and Open Standards in Chemistry: the Blue Obelisk Five Years On" Journal of Cheminformatics Vol
Oral Roberts University Digital Showcase College of Science and Engineering Faculty College of Science and Engineering Research and Scholarship 10-14-2011 Open Data, Open Source, and Open Standards in Chemistry: The lueB Obelisk five years on Andrew Lang Noel M. O'Boyle Rajarshi Guha National Institutes of Health Egon Willighagen Maastricht University Samuel Adams See next page for additional authors Follow this and additional works at: http://digitalshowcase.oru.edu/cose_pub Part of the Chemistry Commons Recommended Citation Andrew Lang, Noel M O'Boyle, Rajarshi Guha, Egon Willighagen, et al.. "Open Data, Open Source, and Open Standards in Chemistry: The Blue Obelisk five years on" Journal of Cheminformatics Vol. 3 Iss. 37 (2011) Available at: http://works.bepress.com/andrew-sid-lang/ 19/ This Article is brought to you for free and open access by the College of Science and Engineering at Digital Showcase. It has been accepted for inclusion in College of Science and Engineering Faculty Research and Scholarship by an authorized administrator of Digital Showcase. For more information, please contact [email protected]. Authors Andrew Lang, Noel M. O'Boyle, Rajarshi Guha, Egon Willighagen, Samuel Adams, Jonathan Alvarsson, Jean- Claude Bradley, Igor Filippov, Robert M. Hanson, Marcus D. Hanwell, Geoffrey R. Hutchison, Craig A. James, Nina Jeliazkova, Karol M. Langner, David C. Lonie, Daniel M. Lowe, Jerome Pansanel, Dmitry Pavlov, Ola Spjuth, Christoph Steinbeck, Adam L. Tenderholt, Kevin J. Theisen, and Peter Murray-Rust This article is available at Digital Showcase: http://digitalshowcase.oru.edu/cose_pub/34 Oral Roberts University From the SelectedWorks of Andrew Lang October 14, 2011 Open Data, Open Source, and Open Standards in Chemistry: The Blue Obelisk five years on Andrew Lang Noel M O'Boyle Rajarshi Guha, National Institutes of Health Egon Willighagen, Maastricht University Samuel Adams, et al. -
Molecular Structure Input on the Web Peter Ertl
Ertl Journal of Cheminformatics 2010, 2:1 http://www.jcheminf.com/content/2/1/1 REVIEW Open Access Molecular structure input on the web Peter Ertl Abstract A molecule editor, that is program for input and editing of molecules, is an indispensable part of every cheminfor- matics or molecular processing system. This review focuses on a special type of molecule editors, namely those that are used for molecule structure input on the web. Scientific computing is now moving more and more in the direction of web services and cloud computing, with servers scattered all around the Internet. Thus a web browser has become the universal scientific user interface, and a tool to edit molecules directly within the web browser is essential. The review covers a history of web-based structure input, starting with simple text entry boxes and early molecule editors based on clickable maps, before moving to the current situation dominated by Java applets. One typical example - the popular JME Molecule Editor - will be described in more detail. Modern Ajax server-side molecule editors are also presented. And finally, the possible future direction of web-based molecule editing, based on tech- nologies like JavaScript and Flash, is discussed. Introduction this trend and input of molecular structures directly A program for the input and editing of molecules is an within a web browser is therefore of utmost importance. indispensable part of every cheminformatics or molecu- In this overview a history of entering molecules into lar processing system. Such a program is known as a web applications will be covered, starting from simple molecule editor, molecular editor or structure sketcher. -
Spoken Tutorial Project, IIT Bombay Brochure for Chemistry Department
Spoken Tutorial Project, IIT Bombay Brochure for Chemistry Department Name of FOSS Applications Employability GChemPaint GChemPaint is an editor for 2Dchem- GChemPaint is currently being developed ical structures with a multiple docu- as part of The Chemistry Development ment interface. Kit, and a Standard Widget Tool kit- based GChemPaint application is being developed, as part of Bioclipse. Jmol Jmol applet is used to explore the Jmol is a free, open source molecule viewer structure of molecules. Jmol applet is for students, educators, and researchers used to depict X-ray structures in chemistry and biochemistry. It is cross- platform, running on Windows, Mac OS X, and Linux/Unix systems. For PG Students LaTeX Document markup language and Value addition to academic Skills set. preparation system for Tex typesetting Essential for International paper presentation and scientific journals. For PG student for their project work Scilab Scientific Computation package for Value addition in technical problem numerical computations solving via use of computational methods for engineering problems, Applicable in Chemical, ECE, Electrical, Electronics, Civil, Mechanical, Mathematics etc. For PG student who are taking Physical Chemistry Avogadro Avogadro is a free and open source, Research and Development in Chemistry, advanced molecule editor and Pharmacist and University lecturers. visualizer designed for cross-platform use in computational chemistry, molecular modeling, material science, bioinformatics, etc. Spoken Tutorial Project, IIT Bombay Brochure for Commerce and Commerce IT Name of FOSS Applications / Employability LibreOffice – Writer, Calc, Writing letters, documents, creating spreadsheets, tables, Impress making presentations, desktop publishing LibreOffice – Base, Draw, Managing databases, Drawing, doing simple Mathematical Math operations For Commerce IT Students Drupal Drupal is a free and open source content management system (CMS). -
Microsoft Office Sharepoint Server Custom Application Development: Document Workflow Management Project
Microsoft Office SharePoint Server Custom Application Development: Document Workflow Management Project Author: Eric Charran, Microsoft Corporation Date published: February 2009 Applies to: Microsoft® Office SharePoint® Server 2007 VSTS Rangers: This content was created in a VSTS Rangers project. “Our mission is to accelerate the adoption of Microsoft® Visual Studio® Team System by delivering out-of-band solutions for missing features or guidance. We work closely with members of Microsoft Services and MVP communities to make sure that their solutions address real-world blockers.” Bijan Javidi, VSTS Ranger Lead Summary: This white paper reviews the successful Microsoft Consulting Services (MCS) design, construction, and deployment of a custom Microsoft® Office SharePoint® Server 2007 application to a mid-market enterprise customer. This documentation will serve to educate and familiarize customers who are planning to undertake similar custom Office SharePoint Server 2007 application projects within their own organizations. The guidance and experiences come directly from a real-world field implementation and contain patterns and practices in addition to descriptions of the problem, business challenge, the solution's technical structure, and the staffing model that was used to implement the application. The audience for this document includes customers who are interested in conducting a software development life cycle for a custom application that will be built on the Microsoft SharePoint Products and Technologies platform. This document not only discusses the project's goals and vision from inception, but also follows the project through staffing by identifying successful team models. The solution's physical design and best practice considerations are documented and provide background and context for the presented lessons learned. -
3 Workflow Systems
Large Scale Data Handling in Biology Workflow Systems 3 Workflow Systems Within the last few years a large number of tools and softwares dealing with different computational problems related to HCS have been developed. Incorporating third party or new tools into existing frameworks needs a flexible, modular and customizable workflow framework. Workflow (Pipeline) systems could become crucial for enabling HCS researchers doing large scale experiments to deal with this data explosion. The workflow is termed abstract in that it is not yet fully functional but the actual components are in place and in the requisite order. In general, workflow systems concentrate on the creation of abstract process workflows to which data can be applied when the design process is complete. In contrast, workflow systems in the life sciences domain are often based on a data-flow model, due to the data-centric and data-driven nature of many scientific analyses. A comprehensive understanding of biological phenomena can be achieved only through the integration of all available biological information and different data analysis tools and applications. In general, an ideal workflow system in HCS can integrate nearly all standard tools and software. For example, for an HCS using small molecules, the workflow system must be able to integrate different image processing software and data mining toolkits with flexibility. The possibility that any single software covers all possible domains and data models is nearly zero. No one vendor or source can provide all the tools needed by HCS informatics. So it is suggested that one uses specialized tools from specialized sources. Also not all softwares components can be integrated with all workflow systems. -
Workflow Definition, Benefits, and Preparation
4 Country View Road Malvern, PA 19355 800.223.7036 610.647.5930 S www.sct.com Workflow definition, benefits, and preparation An SCT White Paper EWHT-005 (02/03) SCT, Banner, PowerCAMPUS, and the circle P logo are registered trademarks; and the SCT logo, SCT Plus, SCT Matrix, SCT Campus Pipeline, and Luminis are trademarks, of Systems & Computer Technology Corporation or one or more of its wholly owned subsidiaries. All other products and company names referenced herein are used for identification purposes only and may be trademarks of their respective owners. Copyright © 2003 Systems & Computer Technology Corporation. All rights reserved. Workflow (cont.) What is Workflow? Most colleges and universities today practice workflow of some kind, either manually or with software applications. Admitting a student, for example, requires more than one person (and more than one department) to find the prospect, review the application, compile the financial aid package, and send the offer letter. The process of passing information along and responding to it is a type of workflow. However, more often than not, the process grinds to a halt when one of those papers languishes in a participant’s in-box. A software-based workflow management system would speed this process—and hundreds of others like it—and elevate its quality by automating, simplifying, measuring, directing, and managing the flow of information from department to department across the enterprise. There are many different definitions of workflow. For consistency, SCT follows the definition put forth by the respected Workflow Management Coalition (WfMC): The coalition also states that a true workflow management system must have the following criteria: • Define, create, and manage the execution of workflows with software • Run on one or more workflow engines • Interpret the process definition • Interact with workflow participants • Invoke IT tools and applications where required In addition, workflow solutions comprise data, people, tools, and activities. -
BIOVIA Pipeline Pilot System Requirements
SYSTEM REQUIREMENTS PIPELINE PILOT 2020 Copyright Notice ©2019 Dassault Systèmes. All rights reserved. 3DEXPERIENCE, the Compass icon and the 3DS logo, CATIA, SOLIDWORKS, ENOVIA, DELMIA, SIMULIA, GEOVIA, EXALEAD, 3DVIA, 3DSWYM, BIOVIA, NETVIBES, IFWE and 3DEXCITE, are commercial trademarks or registered trademarks of Dassault Systèmes, a French "société européenne" (Versailles Commercial Register # B 322 306 440), or its subsidiaries in the U.S. and/or other countries. All other trademarks are owned by their respective owners. Use of any Dassault Systèmes or its subsidiaries trademarks is subject to their express written approval. Acknowledgments and References To print photographs or files of computational results (figures and/or data) obtained by using Dassault Systèmes software, acknowledge the source in an appropriate format. For example: "Computational results were obtained by using Dassault Systèmes BIOVIA software programs. Pipeline Pilot Server was used to perform the calculations and to generate the graphical results." Dassault Systèmes may grant permission to republish or reprint its copyrighted materials. Requests should be submitted to Dassault Systèmes Customer Support, either by visiting https://www.3ds.com/support/ and clicking Call us or Submit a request, or by writing to: Dassault Systèmes Customer Support 10, Rue Marcel Dassault 78140 Vélizy-Villacoublay FRANCE Contents About This Document 1 Definitions 1 Additional Information 1 Dassault Systèmes Support Resources 1 Pipeline Pilot Server Requirements 2 Minimum Hardware -
Introduction to Workflows April 6, 2020
Tutorial An Introduction to Workflows April 6, 2020 Sample to Insight QIAGEN Aarhus Silkeborgvej 2 Prismet 8000 Aarhus C Denmark digitalinsights.qiagen.com [email protected] An Introduction to Workflows 2 Tutorial An Introduction to Workflows A workflow consists of a series of connected tools where the output of one tool is used as input for another tool, making it possible to analyze many samples using a standardized pipeline. This tutorial covers: • Setting up a workflow that maps reads to a reference sequence, detects variants in the data relative to the reference, and filters away variants that are also present in a database of common variants • Launching a workflow in batch mode to analyze multiple samples • Workflow management, including creating a workflow installer, for installing the workflow on your own or another CLC Workbench, or on a CLC Server, where all or a selected group of server users could make use of it Steps where action should be taken are numbered. Prerequisites: We recommend using CLC Genomics Workbench 20.0 or higher when working through this tutorial. Some functionality referred to is not available in earlier versions. Download and import data for the tutorial The example data for this tutorial includes two files of sequencing reads and several reference tracks: the human mitochondrial genome from the hg18 build, NC_0O1807 (Genome), corre- sponding CDS and Gene tracks, and a chrMdbSNPCommon track containing the dbSNP common variants for this mitochondrial sequence. 1. Download the example data from http://resources.qiagenbioinformatics.com/ testdata/chrM-tutorial-data.zip. 2. Start the CLC Genomics Workbench. -
Integration Collection for Pipeline Pilot
QUICK START GUIDE PROTOCOL DEVELOPMENT INTEGRATION COLLECTION 2017 Copyright Notice ©2016 Dassault Systèmes. All rights reserved. 3DEXPERIENCE, the Compass icon and the 3DS logo, CATIA, SOLIDWORKS, ENOVIA, DELMIA, SIMULIA, GEOVIA, EXALEAD, 3D VIA, BIOVIA and NETVIBES are commercial trademarks or registered trademarks of Dassault Systèmes or its subsidiaries in the U.S. and/or other countries. All other trademarks are owned by their respective owners. Use of any Dassault Systèmes or its subsidiaries trademarks is subject to their express written approval. Acknowledgments and References To print photographs or files of computational results (figures and/or data) obtained using BIOVIA software, acknowledge the source in an appropriate format. For example: "Computational results obtained using software programs from Dassault Systèmes BIOVIA. The ab initio calculations were performed with the DMol3 program, and graphical displays generated with Pipeline Pilot." BIOVIA may grant permission to republish or reprint its copyrighted materials. Requests should be submitted to BIOVIA Support, either through electronic mail to [email protected], or in writing to: BIOVIA Support 5005 Wateridge Vista Drive, San Diego, CA 92121 USA Contents Chapter 1: Introduction 1 Run Program 17 Architectural Overview 1 Run Program on Remote Host 18 Extending Functionality 2 SOAP 18 Application Integration Components 3 Chapter 4: Language-Based Components 19 Command-line Integration 3 Perl 19 Run Program Components 3 Java 20 FTP 3 .NET 20 SSH 3 Windows Script Host Components 20 SCP 4 Python 21 Telnet 4 VBScript 21 Language-based Integration 4 PilotScript 21 Java Component Development 4 About the PilotScript Language 21 .NET Component Development 4 PilotScript vs. -
A Adapting Scientific Workflow Structures Using Multi-Objective
A Adapting Scientific Workflow Structures Using Multi-Objective Optimisation Strategies IRFAN HABIB, Centre for Complex Cooperative Systems, FET, University of the West of England, Bristol, UK ASHIQ ANJUM, School of Computing and Mathematics, University of Derby, Derby, UK RICHARD MCCLATCHEY, Centre for Complex Cooperative Systems, FET, University of the West of England, Bristol, UK OMER RANA, School of Computer Science & Informatics, Cardiff University, UK Scientific workflows have become the primary mechanism for conducting analyses on distributed comput- ing infrastructures such as grids and clouds. In recent years, the focus of optimisation within scientific workflows has primarily been on computational tasks and workflow makespan. However, as workflow-based analysis becomes ever more data intensive, data optimisation is becoming a prime concern. Moreover, scien- tific workflows can scale along several dimensions: (i) number of computational tasks, (ii) heterogeneity of computational resources, and the (iii) size and type (static vs. streamed) of data involved. Adapting workflow structure in response to these scalability challenges remains an important research objective. Understand- ing how a workflow graph can be restructured in an automated manner (through task merge for instance), to address constraints of a particular execution environment is explored in this work, using a multi-objective evolutionary approach. Our approach attempts to adapt the workflow structure to achieve both compute and data optimisation. The question of when to terminate the evolutionary search in order to conserve compu- tations is tackled with a novel termination criterion. The results presented in this paper, demonstrate the feasibility of the termination criterion and demonstrate that significant optimisation can be achieved with a multi-objective approach. -
How to Describe Workflow Information Systems to Support Business Process
How to Describe Workflow Information Systems to Support Business Process Josefina Guerrero García, Jean Vanderdonckt, Christophe Lemaige, Juan M. González Calleros 1Université catholique de Louvain, Louvain School of Management Place des Doyens, 1 – B-1348 Louvain-la-Neuve (Belgium) Abstract ticipant to another for action, according to a set of pro- cedural rules” [26]. Workflow Information Systems (WIS) cover the application of information technology This paper addresses a methodology for developing to business problems. Its primary characteristic is the the various user interfaces (UI) of a workflow informa- automation of processes involving combinations of tion system (WIS), which are advocated to automate human activities with information technology applica- business processes, following a model-centric ap- tions. Owing to the fact that the users of a IS interact proach based on the requirements and processes of the with it through its user interfaces (UIs) in the pursuit organization. The methodology applies to: 1) integrate of organizational goals, flexibility in creating them is human and machines based activities, in particular therefore important. We will explore a systematic way those involving interaction with IT applications and to define UIs for a WIS. The workflow model defines tools, 2) to identify how tasks are structured, who per- what processes and tasks need to be fulfilled and their form them, what their relative order is, how they are possible ordering; hence the workflow model is a offered or assigned, and how tasks are being tracked. ‘framework’ for creating task model, hence the task For this purpose, workflow is recursively decomposed model is suitable for developing UIs.