Getting Stuff Done

Total Page:16

File Type:pdf, Size:1020Kb

Getting Stuff Done Getting Stuff Done Software Synergy with Fiji / ImageJ2 and KNIME Max-Planck Institute, Dresden Florian Jug - [email protected] Outline A. What do we want to get done? (And who is we?) B. Software synergy with Fiji, IJ2, and KNIME C. Examples D. Remarks and opinions A What to get done… A Who wants stuff done? • You • Your boss • Your institute / university • Your scientific community A Viewpoint: single person A Viewpoint: single person • do awesome stuff of interest • do it right! • get it done • avoid unnecessary overhead A Research group level A Research group level • do lots of awesome stuff of interest • synergy between group members • keep knowledge alive! • make starting easy • documentation • teaching • open exchange of ideas A Research group level The Myers Lab A Research group level Genome Sequencing Development of a new assembler (The Dresden Azzembler „DAZZLER“). First assembler that fully leverages on the properties of the new BacBio reads. We run two new PacBio machines at CRTD/TUD. Two ongoing projects: • Axolotl (with Tanaka Lab) • Planaria (with Rink Lab) A Research group level Microscope Development So far we developed two light sheet microscopes (SPIM) at MPI-CBG. • The “Betzig Scope” is since month in part-time production mode for our “costumers” and creates stunning datasets. • Our second scope, the „X-Wing“ is almost completed and will be turned on soon. It was developed to resolve entire organisms at single cell resolution. A Research group level Image and Data Analysis Idea: intensive collaboration of biologists, physicists, mathema- ticians, and computer scientists. Three example projects: • Development of specialised segmentation methods. • Building an adaptive C. Elegans atlas for fully automatic cell annotation. • Lineage tracing (tracking) and analysis of developing organisms. A Institute level Mission statement of the MPI-CBG We maintain our fundamental scientific question: how do cells form tissues? More generally we are interested in how the morphology of cells and tissues emerge from the interactions between individual molecules and cells. Key questions we are interested in are the molecular principles underlying morphogenesis, such as regulation of size and shape. Importantly, we want our research programs to span multiple scales of magnitude. We will continue working on size and shape at any of the scales or organizational levels from molecular assemblies, to organelles, cells, tissues, organs and organisms. New RGLs are selected on the basis of their fit to the overall multi-scale research of the Institute, to maintain some proportion between the different scales covered. Ultimately, our goal is to take a multi-scale approach to link tissue level organizational principles to molecular activity. Animal model systems: We are prepared to consider any animal system that amplifies a particular piece of biology and offers particular opportunities and/or is easy to study. Technology: The institute will develop new technologies necessary to realize these goals. A Institute level • foster collaborations among groups • share knowledge and tools • provide facilities - to ‘facilitate’ productivity! A Institute level (Dresden) Software Synergy — Vision • A new lab member comes in, easily understands the framework, has lots of useful software components available, and starts rockin' on their research project right away. Image and Data Processing What is Software Synergy? If you benefit from work other did in the past and future projects benefit from your current work. How can it be enhanced? Social Aspect: “One has to know what was done in the past, where to find it, and how to use it.” Technological Aspects: “Ideally, modules should be interoperable (one can work on output of other one).” Status Quo Main technologies used at MPI-CBG: • Amira (C++) • Fiji / ImageJ (Java) And all this on Windows, • KNIME (Java) Mac, and • PLUK Linux. • ITK / VTK / OpenCV / Vigra + Don’t forget • Python the cluster!!! • Matlab Typical Image Processing Workflow Image Data Storage Visualization Processing Analysis Some HDD FIJI R Excel Fileserver CellProfiler Weka MatLab OMERO ImageJ2 MatLab Matplotlib … MatLab Python GnuPlot OpenCV Excel JFreeChart SciKit-Image Prism … … … A Image Processing Workflow Automation of Analysis Own Weka & Fileserver Algorithm & R Excel CellProfiler run somewhere on many images A Image Processing Workflow Later in time… ? B Software Synergy What’s FIJI? What’s ImageJ2? http://imagej.github.io/presentations/2015-09-03-imagej2-and-fiji/ What’s KNIME? What’s KNIME? www.knime.org What’s KNIME Image Processing? Example Workflow in KNIME KNIME incluDes e.g. Python, Matlab, R ImageJ Integration 32 ImageJ2 Integration ImageJ2 in KNIME • Automatic noDe generation • ADD own ImageJ2 Plugins via installer • Every ImageJ2 Plugin IS a KNIME noDe! 33 ImageJ2 Integration Ultimate Goal: full bi-directionality ImageJ2 Integration Ultimate Goal: full bi-directionality C Example projects C Example projects “Provide Existing” C Epithelia Cell Segment. C Epithelia Cell Segment. C Epithelia Cell Segment. C Epithelia Cell Segment. C Epithelia Cell Segment. C Epithelia Cell Segment. C Segmentation Pipeline Christian Dietz Univ. Konstanz Andreas Graumann Univ. Konstanz Tobias Pietzsch MPI-CBG, Dresden C Example projects “Avoid Reinventing” C The Mother Machine The Mother Machine MotherMachine data by Eric van Nimwegen’s Lab, Basel The Mother Machine Finding The Solution Optimal Joint Segmentation and Tracking of Escherichia Coli in the Mother Machine BAMBI@MICCAI 2014 Tracking by Assignment Facilitates Data Curation (Best Paper Award) IMIC@MICCAI 2014 MINIMIZE C V FG · FG SUCH THAT (t) hi t T, P (H(t)): a 1, (1) 8 2 8 2 P i i P az Γ }|(h(t)) { X2 i2Xr i t T, h H(t) : a a =0, (2) 8 2 8 2 i − i a Γ (h) a Γ (h) i2Xr i2Xl t T, h H(t) : H(t) a(t) + a H(t) . (3) 8 2 8 2 | |· ?i | | a H (h) 2X" Fully Automatic Result Fully Automatic Tracking Results Data Curation …someone has to fix remaining mistakes… Example Example Example The Interactive MotherMachine C Example projects “Give Back” C Clear Volume Loic Martin Ulrik Nicola Royer Weigert Günther Maghelli Open-source live visualization and processing for light sheet microscopy ClearVolume – Open-source live 3D visualization for light sheet microscopy. Loic A. Royer, Martin Weigert, Ulrik Günther, Nicola Maghelli, Florian Jug, Ivo F. Sbalzarini, Eugene W. Myers Nature Methods 12, 480–481 (2015) The SPIM/DSLM playground The SPIM/DSLM playground Live visualization | GPUs to the rescue Volumetric Data GPU OpenCL, CUDA Drosophila Brain (Tzumin Lee) Volume Rendering Isosurfaces e.g. ~ 50ms for 400MB Dataset on a modest card Fibonacci multi pass for big volumes naive successive Fibonacci Fibonacci multi pass for big volumes naive successive Fibonacci Fibonacci multi pass for big volumes Drosophila Brain (Tzumin Lee), 3 Channel, 2GB Open / flexible framework Framework / language bindings Open / flexible framework D Remarks and Opinions D Remarks and Opinions • BUT • This whole SW landscape can be confusing • We need to make it easy be part of it • informative web resources • teaching how to contribute • It should be in the interest of research groups, institutes, and communities to foster contribution! D Remarks and Opinions • How can I contribute? • Use stuff that exists + give credit. • Make your own contributions available. • And if you can: hire developers, get funding. Thanks! Myers Lab @ MPI-CBG (Dresden) Tobias Pietzsch Stephan Saalfeld Christian Dietz Jean-Yves Tinevez Michael Berthold Carsten Rother MPI-CBG, Dresden Janelia Univ. Konstanz Institute Pasteur Univ. Konstanz CVLD, TU Dresden.
Recommended publications
  • Text Mining Course for KNIME Analytics Platform
    Text Mining Course for KNIME Analytics Platform KNIME AG Copyright © 2018 KNIME AG Table of Contents 1. The Open Analytics Platform 2. The Text Processing Extension 3. Importing Text 4. Enrichment 5. Preprocessing 6. Transformation 7. Classification 8. Visualization 9. Clustering 10. Supplementary Workflows Licensed under a Creative Commons Attribution- ® Copyright © 2018 KNIME AG 2 Noncommercial-Share Alike license 1 https://creativecommons.org/licenses/by-nc-sa/4.0/ Overview KNIME Analytics Platform Licensed under a Creative Commons Attribution- ® Copyright © 2018 KNIME AG 3 Noncommercial-Share Alike license 1 https://creativecommons.org/licenses/by-nc-sa/4.0/ What is KNIME Analytics Platform? • A tool for data analysis, manipulation, visualization, and reporting • Based on the graphical programming paradigm • Provides a diverse array of extensions: • Text Mining • Network Mining • Cheminformatics • Many integrations, such as Java, R, Python, Weka, H2O, etc. Licensed under a Creative Commons Attribution- ® Copyright © 2018 KNIME AG 4 Noncommercial-Share Alike license 2 https://creativecommons.org/licenses/by-nc-sa/4.0/ Visual KNIME Workflows NODES perform tasks on data Not Configured Configured Outputs Inputs Executed Status Error Nodes are combined to create WORKFLOWS Licensed under a Creative Commons Attribution- ® Copyright © 2018 KNIME AG 5 Noncommercial-Share Alike license 3 https://creativecommons.org/licenses/by-nc-sa/4.0/ Data Access • Databases • MySQL, MS SQL Server, PostgreSQL • any JDBC (Oracle, DB2, …) • Files • CSV, txt
    [Show full text]
  • Bioimage Analysis Tools
    Bioimage Analysis Tools Kota Miura, Sébastien Tosi, Christoph Möhl, Chong Zhang, Perrine Paul-Gilloteaux, Ulrike Schulze, Simon Norrelykke, Christian Tischer, Thomas Pengo To cite this version: Kota Miura, Sébastien Tosi, Christoph Möhl, Chong Zhang, Perrine Paul-Gilloteaux, et al.. Bioimage Analysis Tools. Kota Miura. Bioimage Data Analysis, Wiley-VCH, 2016, 978-3-527-80092-6. hal- 02910986 HAL Id: hal-02910986 https://hal.archives-ouvertes.fr/hal-02910986 Submitted on 3 Aug 2020 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. 2 Bioimage Analysis Tools 1 2 3 4 5 6 Kota Miura, Sébastien Tosi, Christoph Möhl, Chong Zhang, Perrine Pau/-Gilloteaux, - Ulrike Schulze,7 Simon F. Nerrelykke,8 Christian Tischer,9 and Thomas Penqo'" 1 European Molecular Biology Laboratory, Meyerhofstraße 1, 69117 Heidelberg, Germany National Institute of Basic Biology, Okazaki, 444-8585, Japan 2/nstitute for Research in Biomedicine ORB Barcelona), Advanced Digital Microscopy, Parc Científic de Barcelona, dBaldiri Reixac 1 O, 08028 Barcelona, Spain 3German Center of Neurodegenerative
    [Show full text]
  • Imagej2-Allow the Users to Use Directly Use/Update Imagej2 Plugins Inside KNIME As Well As Recording and Running KNIME Workflows in Imagej2
    The KNIME Image Processing Extension for Biomedical Image Analysis Andries Zijlstra (Vanderbilt University Medical Center The need for image processing in medicine Kevin Eliceiri (University of Wisconsin-Madison) KNIME Image Processing and ImageJ Ecosystem [email protected] [email protected] The need for precision oncology 36% of newly diagnosed cancers, and 10% of all cancer deaths in men Out of every 100 men... 16 will be diagnosed with prostate cancer in their lifetime In reality, up to 80 will have prostate cancer by age 70 And 3 will die from it. But which 3 ? In the meantime, we The goal: Diagnose patients that have over-treat many aggressive disease through Precision Medicine patients Objectives of Approach to Modern Medicine Precision Medicine • Measure many things (data density) • Improved outcome through • Make very accurate measurements (fidelity) personalized/precision medicine • Consider multiple perspectives (differential) • Reduced expense/resource allocation through • Achieve confidence in the diagnosis improved diagnosis, prognosis, treatment • Match patients with a treatment they are most • Maximize quality of life by “targeted” therapy likely to respond to. Objectives of Approach to Modern Medicine Precision Medicine • Measure many things (data density) • Improved outcome through • Make very accurate measurements (fidelity) personalized/precision medicine • Consider multiple perspectives (differential) • Reduced expense/resource allocation through • Achieve confidence in the diagnosis improved diagnosis,
    [Show full text]
  • KNIME Workbench Guide
    KNIME Workbench Guide KNIME AG, Zurich, Switzerland Version 4.4 (last updated on 2021-06-08) Table of Contents Workspaces . 1 KNIME Workbench . 2 Welcome page . 4 Workflow editor & nodes . 5 KNIME Explorer . 13 Workflow Coach . 35 Node repository . 37 KNIME Hub view . 38 Description. 40 Node Monitor. 40 Outline. 41 Console. 41 Customizing the KNIME Workbench . 42 Reset and logging . 42 Show heap status . 42 Configuring KNIME Analytics Platform . 43 Preferences . 43 Setting up knime.ini. 47 KNIME runtime options . 49 KNIME tables . 55 Data table . 55 Column types. 56 Sorting . 59 Column rendering . 59 Table storage. 61 KNIME Workbench Guide This guide describes the first steps to take after starting KNIME Analytics Platform and points you to the resources available in the KNIME Workbench for building workflows. It also explains how to customize the workbench and configure KNIME Analytics Platform to best suit specific needs. In the last part of this guide we introduce data tables. Workspaces When you start KNIME Analytics Platform, the KNIME Analytics Platform launcher window appears and you are asked to define the KNIME workspace, as shown in Figure 1. The KNIME workspace is a folder on the local computer to store KNIME workflows, node settings, and data produced by the workflow. Figure 1. KNIME Analytics Platform launcher The workflows and data stored in the workspace are available through the KNIME Explorer in the upper left corner of the KNIME Workbench. © 2021 KNIME AG. All rights reserved. 1 KNIME Workbench Guide KNIME Workbench After selecting a workspace for the current project, click Launch. The KNIME Analytics Platform user interface - the KNIME Workbench - opens.
    [Show full text]
  • Data Analytics with Knime
    DATA ANALYTICS WITH KNIME v.3.4.0 QUALIFICATIONS & EXPERIENCE ▶ 38 years of providing professional services to state and local taxing officials ▶ TMA works exclusively with government partners WHO ▶ TMA is composed of 150+ WE ARE employees in five main offices across the United States Tax Management Associates is a professional services firm that has ▶ Our main focus is on revenue served the interests of state and local enhancement services for state government since 1979. and local jurisdictions and property tax compliance efforts KNIME POWERED CUSTOM ANALYTICS ▶ TMA is a proud KNIME Trusted Consulting Partner. Visit: www.knime.org/knime-trusted-partners ▶ Successful analytics solutions: ○ Fraud Detection (Michigan Department of Treasury) ○ Entity Discovery (multiple counties) ○ Data Aggregation (Louisiana State Tax Commission) KNIME POWERED CUSTOM ANALYTICS ▶ KNIME is an open source data toolkit ▶ Active development community and core team ▶ GUI based with scripting integration ○ Easy adoption, integration, and training ▶ Data ingestion, transformation, analytics, and reporting FEATURES & TERMINOLOGY KNIME WORKBENCH TAX MANAGEMENT ASSOCIATES, INC. KNIME WORKFLOW TAX MANAGEMENT ASSOCIATES, INC. KNIME NODES TAX MANAGEMENT ASSOCIATES, INC. DATA TYPES & SOURCES DATA AGNOSTIC ▶ Flat Files ▶ Shapefiles ▶ Xls/x Reader ▶ HTTP Requests ▶ Fixed Width ▶ RSS Feeds ▶ Text Files ▶ Custom API’s/Curl ▶ Image Files ▶ Standard API’s ▶ XML ▶ JSON TAX MANAGEMENT ASSOCIATES, INC. KNIME DATA NODES TAX MANAGEMENT ASSOCIATES, INC. DATABASE AGNOSTIC ▶ Microsoft SQL ▶ Oracle ▶ MySQL ▶ IBM DB2 ▶ Postgres ▶ Hadoop ▶ SQLite ▶ Any JDBC driver TAX MANAGEMENT ASSOCIATES, INC. KNIME DATABASE NODES TAX MANAGEMENT ASSOCIATES, INC. CORE DATA ANALYTICS FEATURES KNIME DATA ANALYTICS LIFECYCLE Read Data Extract, Data Analytics Reporting or Predictive Read Transform, and/or Load (ETL) Analysis Injection Data Read Data TAX MANAGEMENT ASSOCIATES, INC.
    [Show full text]
  • Downloaded from the Cellprofiler Site [31] to Provide a Starting Point for New Analyses
    Open Access Software2006CarpenteretVolume al. 7, Issue 10, Article R100 CellProfiler: image analysis software for identifying and quantifying comment cell phenotypes Anne E Carpenter*, Thouis R Jones*†, Michael R Lamprecht*, Colin Clarke*†, In Han Kang†, Ola Friman‡, David A Guertin*, Joo Han Chang*, Robert A Lindquist*, Jason Moffat*, Polina Golland† and David M Sabatini*§ reviews Addresses: *Whitehead Institute for Biomedical Research, Cambridge, MA 02142, USA. †Computer Sciences and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02142, USA. ‡Department of Radiology, Brigham and Women's Hospital, Boston, MA 02115, USA. §Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02142, USA. Correspondence: David M Sabatini. Email: [email protected] Published: 31 October 2006 Received: 15 September 2006 Accepted: 31 October 2006 reports Genome Biology 2006, 7:R100 (doi:10.1186/gb-2006-7-10-r100) The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2006/7/10/R100 © 2006 Carpenter et al.; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. deposited research Cell<p>CellProfiler, image analysis the software first free, open-source system for flexible and high-throughput cell image analysis is described.</p> Abstract Biologists can now prepare and image thousands of samples per day using automation, enabling chemical screens and functional genomics (for example, using RNA interference). Here we describe the first free, open-source system designed for flexible, high-throughput cell image analysis, research refereed CellProfiler.
    [Show full text]
  • Cellprofiler: Image Analysis Software for Identifying and Quantifying Cell Phenotypes
    CellProfiler: image analysis software for identifying and quantifying cell phenotypes The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation Genome Biology. 2006 Oct 31;7(10):R100 As Published http://dx.doi.org/10.1186/gb-2006-7-10-r100 Publisher BioMed Central Ltd Version Final published version Citable link http://hdl.handle.net/1721.1/58762 Terms of Use Creative Commons Attribution Detailed Terms http://creativecommons.org/licenses/by/2.0 1 2 3 Table of Contents Getting Started: MaskImage . 96 Introduction . 6 Morph . 110 Installation . 7 OverlayOutlines . 111 Getting Started with CellProfiler . 9 PlaceAdjacent . 112 RescaleIntensity . 115 Resize . 116 Help: Rotate . 118 BatchProcessing . 11 Smooth . 122 Colormaps. .16 Subtract . 125 DefaultImageFolder . 17 SubtractBackground . 126 DefaultOutputFolder . 18 Tile.............................................127 DeveloperInfo . 19 FastMode . 28 MatlabCrash . 29 Object Processing modules: OutputFilename . 30 ClassifyObjects . 41 PixelSize . 31 ClassifyObjectsByTwoMeasurements . 42 Preferences. .32 ConvertToImage . 45 SkipErrors . 33 Exclude . 65 TechDiagnosis . 34 ExpandOrShrink . 66 FilterByObjectMeasurement . 71 IdentifyObjectsInGrid . 76 File Processing modules: IdentifyPrimAutomatic . 77 CreateBatchFiles . 51 IdentifyPrimManual . 83 ExportToDatabase . 68 IdentifySecondary . 84 ExportToExcel. .70 IdentifyTertiarySubregion. .88 LoadImages . 90 Relate . 113 LoadSingleImage . 94 LoadText. .95 RenameOrRenumberFiles .
    [Show full text]
  • Sheffield HPC Documentation
    Sheffield HPC Documentation Release November 14, 2016 Contents 1 Research Computing Team 3 2 Research Software Engineering Team5 i ii Sheffield HPC Documentation, Release The current High Performance Computing (HPC) system at Sheffield, is the Iceberg cluster. A new system, ShARC (Sheffield Advanced Research Computer), is currently under development. It is not yet ready for use. Contents 1 Sheffield HPC Documentation, Release 2 Contents CHAPTER 1 Research Computing Team The research computing team are the team responsible for the iceberg service, as well as all other aspects of research computing. If you require support with iceberg, training or software for your workstations, the research computing team would be happy to help. Take a look at the Research Computing website or email research-it@sheffield.ac.uk. 3 Sheffield HPC Documentation, Release 4 Chapter 1. Research Computing Team CHAPTER 2 Research Software Engineering Team The Sheffield Research Software Engineering Team is an academically led group that collaborates closely with CiCS. They can assist with code optimisation, training and all aspects of High Performance Computing including GPU computing along with local, national, regional and cloud computing services. Take a look at the Research Software Engineering website or email rse@sheffield.ac.uk 2.1 Using the HPC Systems 2.1.1 Getting Started If you have not used a High Performance Computing (HPC) cluster, Linux or even a command line before this is the place to start. This guide will get you set up using iceberg in the easiest way that fits your requirements. Getting an Account Before you can start using iceberg you need to register for an account.
    [Show full text]
  • Mathematica Document
    Mathematica Project: Exploratory Data Analysis on ‘Data Scientists’ A big picture view of the state of data scientists and machine learning engineers. ����� ���� ��������� ��� ������ ���� ������ ������ ���� ������/ ������ � ���������� ���� ��� ������ ��� ���������������� �������� ������/ ����� ��������� ��� ���� ���������������� ����� ��������������� ��������� � ������(�������� ���������� ���������) ������ ��������� ����� ������� �������� ����� ������� ��� ������ ����������(���� �������) ��������� ����� ���� ������ ����� (���������� �������) ����������(���������� ������� ���������� ��� ���� ���� �����/ ��� �������������� � ����� ���� �� �������� � ��� ����/���������� ��������������� ������� ������������� ��� ���������� ����� �����(���� �������) ����������� ����� / ����� ��� ������ ��������������� ���������� ����������/�++ ������/������������/����/������ ���� ������� ����� ������� ������� ����������������� ������� ������� ����/����/�������/��/��� ����������(�����/����-�������� ��������) ������������ In this Mathematica project, we will explore the capabilities of Mathematica to better understand the state of data science enthusiasts. The dataset consisting of more than 10,000 rows is obtained from Kaggle, which is a result of ‘Kaggle Survey 2017’. We will explore various capabilities of Mathematica in Data Analysis and Data Visualizations. Further, we will utilize Machine Learning techniques to train models and Classify features with several algorithms, such as Nearest Neighbors, Random Forest. Dataset : https : // www.kaggle.com/kaggle/kaggle
    [Show full text]
  • Direct Submission Or Co-Submission Direct Submission
    Z-Matrix template-based substitution approach Title for enumeration of 3D molecular structures Authors Wanutcha Lorpaiboon and Taweetham Limpanuparb* Science Division, Mahidol University International College, Affiliations Mahidol University, Salaya, Nakhon Pathom 73170, Thailand Corresponding Author’s email address [email protected] • Chemical structures • Education Keywords • Molecular generator • Structure generator • Z-matrix Direct Submission or Co-Submission Direct Submission ABSTRACT The exhaustive enumeration of 3D chemical structures based on Z-matrix templates has recently been used in the quantum chemical investigation of constitutional isomers, diastereomers and 5 rotamers. This simple yet powerful initial structure generation approach can apply beyond the investigation of compounds of identical formula by quantum chemical methods. This paper aims to provide a short description of the overall concept followed by a practical tutorial to the approach. • The four steps required for Z-matrix template-based substitution are template construction, generation of tuples for substitution sites, removal of duplicate tuples and 10 substitution on the template. • The generated tuples can be used to create chemical identifiers to query compound properties from chemical databases. • All of these steps are demonstrated in this paper by common model compounds and are very straightforward for an undergraduate audience to reproduce. A comparison of the 15 approach in this tutorial and other options is also discussed. SPECIFICATIONS TABLE Subject Area Chemistry More specific subject area Cheminformatics Method name Z-matrix template-based substitution Name and reference of original method N/A Source codes are available as supplementary information in this Resource availability paper. 2 of 10 Method details 20 1. Introduction Initial structures (Z-matrix or Cartesian coordinate) are important starting points for the in silico investigation of chemical species.
    [Show full text]
  • Role of Materials Data Science and Informatics in Accelerated Materials Innovation Surya R
    Role of materials data science and informatics in accelerated materials innovation Surya R. Kalidindi , David B. Brough , Shengyen Li , Ahmet Cecen , Aleksandr L. Blekh , Faical Yannick P. Congo , and Carelyn Campbell The goal of the Materials Genome Initiative is to substantially reduce the time and cost of materials design and deployment. Achieving this goal requires taking advantage of the recent advances in data and information sciences. This critical need has impelled the emergence of a new discipline, called materials data science and informatics. This emerging new discipline not only has to address the core scientifi c/technological challenges related to datafi cation of materials science and engineering, but also, a number of equally important challenges around data-driven transformation of the current culture, practices, and workfl ows employed for materials innovation. A comprehensive effort that addresses both of these aspects in a synergistic manner is likely to succeed in realizing the vision of scaled-up materials innovation. Key toolsets needed for the successful adoption of materials data science and informatics in materials innovation are identifi ed and discussed in this article. Prototypical examples of emerging novel toolsets and their functionality are described along with select case studies. Introduction goal of reducing the time and cost of materials development Materials innovation initiatives and deployment by 50%. 1 Essential to achieving this goal is A number of US-based, 1 – 3 as well as international, 4 , 5 efforts are the development and deployment of a supporting infrastruc- now focused on accelerated deployment of advanced materials ture that integrates a wide range of data, experimental, and in commercial products.
    [Show full text]
  • Titel Untertitel
    KNIME Image Processing Nycomed Chair for Bioinformatics and Information Mining Department of Computer and Information Science Konstanz University, Germany Why Image Processing with KNIME? KNIME UGM 2013 2 The “Zoo” of Image Processing Tools Development Processing UI Handling ImgLib2 ImageJ OMERO OpenCV ImageJ2 BioFormats MatLab Fiji … NumPy CellProfiler VTK Ilastik VIGRA CellCognition … Icy Photoshop … = Single, individual, case specific, incompatible solutions KNIME UGM 2013 3 The “Zoo” of Image Processing Tools Development Processing UI Handling ImgLib2 ImageJ OMERO OpenCV ImageJ2 BioFormats MatLab Fiji … NumPy CellProfiler VTK Ilastik VIGRA CellCognition … Icy Photoshop … → Integration! KNIME UGM 2013 4 KNIME as integration platform KNIME UGM 2013 5 Integration: What and How? KNIME UGM 2013 6 Integration ImgLib2 • Developed at MPI-CBG Dresden • Generic framework for data (image) processing algoritms and data-structures • Generic design of algorithms for n-dimensional images and labelings • http://fiji.sc/wiki/index.php/ImgLib2 → KNIME: used as image representation (within the data cells); basis for algorithms KNIME UGM 2013 7 Integration ImageJ/Fiji • Popular, highly interactive image processing tool • Huge base of available plugins • Fiji: Extension of ImageJ1 with plugin-update mechanism and plugins • http://rsb.info.nih.gov/ij/ & http://fiji.sc/ → KNIME: ImageJ Macro Node KNIME UGM 2013 8 Integration ImageJ2 • Next-generation version of ImageJ • Complete re-design of ImageJ while maintaining backwards compatibility • Based on ImgLib2
    [Show full text]