Human Experts Vs. Machines in Taxa Recognition

Total Page:16

File Type:pdf, Size:1020Kb

Human Experts Vs. Machines in Taxa Recognition 1 Human experts vs. machines in taxa recognition Johanna Arje¨ ∗z∗∗, Jenni Raitoharju∗, Alexandros Iosifidisk, Ville Tirronenx, Kristian Meissnery, Moncef Gabbouj∗, Serkan Kiranyaz{, Salme Karkk¨ ainen¨ z zDepartment of Mathematics and Statistics, University of Jyvaskyl¨ a,¨ Jyvaskyl¨ a,¨ Finland ∗Department of Computing Sciences, Tampere University, Tampere, Finland kDepartment of Engineering, Aarhus University, Aarhus, Denmark {Department of Electrical Engineering, Qatar University, Doha, Qatar xFaculty of Information Technology, University of Jyvaskyl¨ a,¨ Jyvaskyl¨ a,¨ Finland yProgramme for Environmental Information, Finnish Environment Institute, Jyvaskyl¨ a,¨ Finland ∗∗Corresponding author: johanna.arje@jyu.fi, Department of Mathematics and Statistics, P.O. Box 35 (MaD), FI-40014 University of Jyvaskyla Abstract—The step of expert taxa recognition currently slows down the response time of many bioassessments. Shifting Sample to quicker and cheaper state-of-the-art machine learning ap- proaches is still met with expert scepticism towards the ability and logic of machines. In our study, we investigate both the differences in accuracy and in the identification logic of taxo- Identification nomic experts and machines. We propose a systematic approach utilizing deep Convolutional Neural Nets with the transfer learning paradigm and extensively evaluate it over a multi-pose taxonomic dataset with hierarchical labels specifically created Indices for this comparison. We also study the prediction accuracy on different ranks of taxonomic hierarchy in detail. Our results revealed that human experts using actual specimens yield the lowest classification error (CE = 6:1%). However, a much Ecological assessment faster, automated approach using deep Convolutional Neural Nets comes close to human accuracy (CE = 11:4%). Contrary to previous findings in the literature, we find that for machines Figure 1: A schematic of the biomonitoring process. following a typical flat classification approach commonly used in machine learning performs better than forcing machines to adopt a hierarchical, local per parent node approach used by human While a growing body of work has used different genetic taxonomic experts. Finally, we publicly share our unique dataset tools (e.g. Elbrecht et al., 2017; Zimmermann et al., 2015) for to serve as a public benchmark dataset in this field. species identification, these methods are not yet standardized Keywords: hierarchical classification; taxonomy; convolutional neural networks; taxonomic expert; multi-image data; biomoni- or capable of producing reliable abundance data currently toring required in, e.g., Water framework directive. While we have also worked on genetic approaches and acknowledge the great promise that genetic taxa identification methods hold I. INTRODUCTION (e.g. Hering et al., 2018), we will not explore them here but alternatively examine the suitability of machine learning arXiv:1708.06899v4 [stat.ML] 17 May 2019 Due to its inherent slowness, traditional manual identifi- techniques on image data for routine taxa identification. cation has long been a bottleneck in bioassessments (Fig. Many studies on automatic classification of biological image 1). The growing demand for biological monitoring and the data have been published during the past decade. Yousef declining funding and number of taxonomic experts is forcing Kalafi et al. (2018) have done an extensive review on auto- ecologists to search for alternatives for the cost-intensive and matic species identification and automated imaging systems. time consuming manual identification of monitoring samples Classification methods for aquatic macroinvertebrates have (Borja and Elliott, 2013; Nygard˚ et al., 2016). Identification of been proposed in several studies (e.g. Culverhouse et al., taxonomic groups in biomonitoring of, e.g., aquatic environ- 2006; Lytle et al., 2010; Kiranyaz et al., 2011; Arje¨ et al., ments often involves a large number of samples, specimens in 2013; Joutsijoki et al., 2014; Raitoharju et al., 2018). The a sample, and the number of taxonomic groups to identify. For most popular classification methods used for identification example, even in relatively species-poor regions like Finland, of biological image data, such as insects, are deep neural the calculation of Water framework directive related indices networks and support vector machines (Kho et al., 2017). often involves hundreds of individual specimens from 118-349 Despite the potential of computational, as well as DNA lotic diatom taxa and 44-113 lotic benthic macroinvertebrate methods for taxa identification, some taxonomists continue to taxa (Arje¨ et al., 2016). object the shift from manual to novel identification methods 2 (Kelly et al., 2015; Leese et al., 2018). Often biologists of hierarchy and multi-label structure improved classification that take a cursory look at automated identification tend to results when compared to single-label cases. Babbar et al. mistrust computational methods because they observe that (2016) performed a theoretical study on the difference between a classifier is unable to separate two specimens which to flat and hierarchical classification and found that for well- them are clearly different to the human eye. Similarly, experts balanced data flat classifiers should be preferred, whereas are baffled when the same classifier is able to discriminate hierarchical classifiers are a better for unbalanced data. between two specimens from low-resolution images while they Automatic classification of benthic macroinvertebrates, as as taxonomic experts cannot. This mismatch in the ability well as plankton, has received increasing attention in recent of computers to identify taxa observed for single cases is years. However, most of the previous studies have focused on often mistakenly extrapolated into an overall unreliability of single-image data (see e.g. Arje¨ et al., 2010; Kiranyaz et al., algorithms. But how different truly is both the logic used and 2011; Arje¨ et al., 2013; Joutsijoki et al., 2014; Uusitalo et al., the overall accuracy of taxonomic experts and algorithms? 2016; Lee et al., 2016; Arje¨ et al., 2017) and have not taken Only few studies assess the accuracy of human experts the inherent hierarchical structure of the data into account. In and automatic classifiers, and their consequences on aquatic single-image data studies, the posture of the specimens can biomonitoring. In a study on human accuracy, Haase et al. have substantial impact on the classification. Besides Lytle (2010) reported on the audit of macroinvertebrate samples et al. (2010), an imaging system producing multiple-image from an EU Water Framework Directive monitoring program. data is presented in Raitoharju et al. (2018). In this paper, They found a great discrepancy between the experts deter- we present a comparison of taxonomic experts and automatic mining the true taxonomic classes and the audited laboratory classification methods on a benthic macroinvertebrate data that workers. Contrastingly, in a study on the effect of mistakes incorporates information on the taxonomic resolution. We test made in automated taxa identification on biological indices, flat classifiers, local per level classifiers, and hierarchical top- Arje¨ et al. (2017) found a relatively small impact. Literature down classification, i.e., local classification per parent node, on direct human versus machine comparisons in classification and perform the automatic classification using convolutional tasks in an aquatic biomonitoring context is equally scant neural networks (CNNs) and support vector machines (SVMs). and ambiguous. Culverhouse et al. (2003) compared human The results are compared with the results of a proficiency test and machine identification of six phytoplankton species using organized for human taxonomic experts and with a test where images and noted a similar average performance for both the taxonomic experts used the same images as the automatic experts and a computer algorithm. In Lytle et al. (2010), au- classifiers. The comparisons evaluate traditional single level tomatic classifiers outperformed 26 humans (a mix of experts accuracy and additionally use a novel variant of an accuracy and amateurs) when distinguishing between two stonefly taxa. measure that accounts for the hierarchical structure of the data. Given these contrasting results, we feel it is necessary to simultaneously examine the effect of taxonomic hierarchy and II. THEORY of using human logical pathways for human and computer- based identification. A. Hierarchy in classification Taxonomic experts identify specimens based on a predefined Silla and Freitas (2011) unified the concepts of methods taxonomic resolution while automatic classifiers operate on used in hierarchical classification problems, and in this section the information of taxonomic rank used in the training data. we follow their terminology. There are different ways for accounting for data hierarchy, Human experts base visual identification of, e.g., inverte- such as taxonomy, in classification. Hierarchical classification brate taxa on rules defined in the International commission on is widely investigated in the current literature. Silla and Freitas zoological nomenclature (1999). Therefore, human experts can (2011) sought to describe and unify the concepts of methods be thought of as hierarchical, local per parent node classifiers used in hierarchical classification problems from different (see Fig. 2c) that first identify the order of the specimen, domains. Using the existing literature,
Recommended publications
  • Mnemonic Memory Taxonomy
    MNEMONIC MEMORY TAXONOMY Overview: In this lesson, students determine proper classification of organisms according to taxonomic levels, explore characteristics that determine classification, and create methods to recall ordered taxonomic terminology. Objectives: The student will: • describe the use and function of a taxonomy, specifically to order and classify living organisms; and • identify and list taxonomic levels of biological classification. Targeted Alaska Grade Level Expectations: Science [7] SC2.2 The student demonstrates an understanding of the structure, function, behavior, development, life cycles, and diversity of living organisms by identifying the seven levels of classification of organisms. [7] SA1.1 The student demonstrates an understanding of the processes of science by asking questions, predicting, observing, describing, measuring, classifying, makign generalizations, inferring, and communicating. Vocabulary: animalia— one of six kingdoms, including most living things that are able to move and digest food internally plantae –one of six kingdoms, including living things that generally manufacture their own food through the use of photosynthesis fungi – one of six kingdoms, mostly living things that are nonmobile and assist in the decomposition process, including yeasts, molds, and mushrooms binomial nomenclature – a scientific naming system that gives a unique name (“scientific name”) to each species, using organisms’ genus and species as its two parts (e.g., “Homo sapiens” for humans) class – a taxonomic rank below
    [Show full text]
  • Botanical Nomenclature: Concept, History of Botanical Nomenclature
    Module – 15; Content writer: AvishekBhattacharjee Module 15: Botanical Nomenclature: Concept, history of botanical nomenclature (local and scientific) and its advantages, formation of code. Content writer: Dr.AvishekBhattacharjee, Central National Herbarium, Botanical Survey of India, P.O. – B. Garden, Howrah – 711 103. Module – 15; Content writer: AvishekBhattacharjee Botanical Nomenclature:Concept – A name is a handle by which a mental image is passed. Names are just labels we use to ensure we are understood when we communicate. Nomenclature is a mechanism for unambiguous communication about the elements of taxonomy. Botanical Nomenclature, i.e. naming of plants is that part of plant systematics dealing with application of scientific names to plants according to some set rules. It is related to, but distinct from taxonomy. A botanical name is a unique identifier to which information of a taxon can be attached, thus enabling the movement of data across languages, scientific disciplines, and electronic retrieval systems. A plant’s name permits ready summarization of information content of the taxon in a nested framework. A systemofnamingplantsforscientificcommunicationmustbe international inscope,andmustprovideconsistencyintheapplicationof names.Itmustalsobeacceptedbymost,ifnotall,membersofthe scientific community. These criteria led, almost inevitably, to International Botanical Congresses (IBCs) being the venue at which agreement on a system of scientific nomenclature for plants was sought. The IBCs led to publication of different ‘Codes’ which embodied the rules and regulations of botanical nomenclature and the decisions taken during these Congresses. Advantages ofBotanical Nomenclature: Though a common name may be much easier to remember, there are several good reasons to use botanical names for plant identification. Common names are not unique to a specific plant.
    [Show full text]
  • Coastal Marine Habitats Harbor Novel Early-Diverging Fungal Diversity
    Fungal Ecology 25 (2017) 1e13 Contents lists available at ScienceDirect Fungal Ecology journal homepage: www.elsevier.com/locate/funeco Coastal marine habitats harbor novel early-diverging fungal diversity * Kathryn T. Picard Department of Biology, Duke University, Durham, NC, 27708, USA article info abstract Article history: Despite nearly a century of study, the diversity of marine fungi remains poorly understood. Historical Received 12 September 2016 surveys utilizing microscopy or culture-dependent methods suggest that marine fungi are relatively Received in revised form species-poor, predominantly Dikarya, and localized to coastal habitats. However, the use of high- 20 October 2016 throughput sequencing technologies to characterize microbial communities has challenged traditional Accepted 27 October 2016 concepts of fungal diversity by revealing novel phylotypes from both terrestrial and aquatic habitats. Available online 23 November 2016 Here, I used ion semiconductor sequencing (Ion Torrent) of the ribosomal large subunit (LSU/28S) to Corresponding Editor: Felix Barlocher€ explore fungal diversity from water and sediment samples collected from four habitats in coastal North Carolina. The dominant taxa observed were Ascomycota and Chytridiomycota, though all fungal phyla Keywords: were represented. Diversity was highest in sand flats and wetland sediments, though benthic sediments Marine fungi harbored the highest proportion of novel sequences. Most sequences assigned to early-diverging fungal Ion torrent groups could not be assigned
    [Show full text]
  • The Empire of Life Needs a Proper Name by Guy Lane & Andrew Buckwell
    The Empire of Life Needs a Proper Name by Guy Lane & Andrew Buckwell The Empire of Life Needs a Proper Name 9 May, 2020. The biggest living organism on Planet Earth is the Living Planet itself, but she doesn’t have a proper name. Guy Lane & Andrew Buckwell ascribe a taxonomic classification to Gaia. THE PROCESS of human reproduction goes through a number of distinct phases. First, a young man meets a young woman and they learn each other’s names. They realise their lives would be better in a partnership. They come to an agreement about how to live together, and then they make a baby human. A similar process takes place when our Living Planet seeks to reproduce. Let us explain. The proper name for a human is Homo sapiens, and the science of properly naming living things is called taxonomy. In 1758, Carl Linneus instituted the practice of binomial nomenclature. This is a two-part name, written in Latin that describes how the organism looks or behaves. The binomial for Homo sapiens translates (perhaps inappropriately) to “wise people”. All species are so named as part of their ‘taxonomic rank’ that has a number of levels: Domain – Kingdom – Phylum – Class – Order – Family – Genus – Species. For a human, the taxonomic rank has the following entries: Prokaryote – Animalia – Chordata – Mammalia – Primate – Hominidae – Homo – Sapiens. The different types of living things can be represented in a Tree of Life. The picture below shows the Tree of Life extended to the level of Kingdom. 1 The Empire of Life Needs a Proper Name by Guy Lane & Andrew Buckwell The Phylogenetic tree (Image via Wikipedia) Of note, there is one important life form that has yet to be given a binomial or ascribed a taxonomic rank.
    [Show full text]
  • In Memory of Tomáš Soldán (9 November 1951 – 13 August 2018)
    Published December 27, 2018 Klapalekiana, 54: 303–323, 2018 ISSN 1210-6100 Vzpomínka na Tomáše Soldána (9. listopadu 1951 – 13. srpna 2018) In memory of Tomáš Soldán (9 November 1951 – 13 August 2018) V pondělí 13. srpna 2018 odešla výrazná osobnost české entomologie, profesor Tomáš Soldán. Zemřel po krátké nemoci ve věku nedožitých 67 let. Tomáše Soldána zajímala zoologie, respekti- ve entomologie, již od dětství. Po absolvování Gymnázia Botičská směřovaly jeho kroky na Přírodovědeckou fakultu Univerzity Kar- lovy. V té době zde působili vynikající učitelé a entomologové, jako Karel Hůrka, Milan Chvála nebo Pavel Štys. Možná právě jejich přístup k vedení studentů a skvělé přednášky Pavla Štyse o morfologii hmyzu ovlivnily i bu- doucí směřování Tomáše Soldána. Po ukončení studia a absolvování povinné vojenské služby nastoupil Tomáš v létě 1975 na Oddělení morfologie hmyzu Entomolo- gického ústavu ČSAV, který v té době sídlil ve Viničné ulici v Praze. O čtyři roky později se přestěhoval i s rodinou do Českých Budě- Obr. 1. Tomáš Soldán u řeky Křemelná nedaleko jovic, kam z Prahy postupně přešly některé zaniklé osady Starý Brunst v roce 2007 (fotografie biologické ústavy Akademie věd. Podílel Světlana Zahrádková). se na přípravě nových laboratoří, pracoven Fig. 1. Tomáš Soldán at the Křemelná River close to a chovů a v roce 1985 i na organizaci stěhování the former settlement Starý Brunst in 2007 (photograph Světlana Zahrádková). Entomologického ústavu do jeho současných prostor v Branišovské ulici. Tomáš byl silně spjatý s životem a děním na Entomologickém ústavu, na začátku devadesátých let jej i krátce řídil, později po mnoho let vedl Laboratoř ekologie vodního hmyzu a zaměstnán zde byl až do konce svého života.
    [Show full text]
  • Molecular Data and Phylogeny of Family Smilacaceae
    Pak. J. Bot., Special Issue (S.I. Ali Festschrift) 42: 111-116, 2010. MOLECULAR DATA AND PHYLOGENY OF FAMILY SMILACACEAE ZABTA K. SHINWARI1 AND SHEHLA SHINWARI2 1Dept. of Biotechnology, Quaid-i-Azam University-Islamabad 2Dept. of Botany, PMAS Arid Agriculture University, Rawalpindi Abstract Family Smilacaceae’s higher order taxonomy remained disputed for many years. It was treated as an order “Smilacales” and was also placed under Liliales by several taxonomists. Even some considered as part of family Liliacaeae. In present paper, we investigated the family’s higher order phylogeny and also compared its rbcL gene sequence data with related taxa to elucidate its phylogeny. The data suggests that its family stature is beyond dispute because of its advanced karyotype, woody climbing habit and DNA sequence data. The data suggest that Smilacaceae may be a sister group of order Liliales and it forms a clear clade with the order. Introduction The Family Smilacaceae was first recognized by Vent. However, its taxonomic status remained controversial. Smilacaceae is a small family with only 3 genera and about 320 species. (Mabberley 2008, Conran,1998) In Pakistan it is represented by one genus and 4 species including a cultivated species (Ghazanfar, 1977). This family is mainly distributed in tropics but extending to temperate regions of both the hemispheres. Bantham and Hooker, (1883); Krause (1930) considered the genera of Smilacaceae under tribe Polygonatae of the family Liliaceae But majority of the present day taxonomists treat them under an independent family Smilacaceae because of its petiolar sheath tendrils, 1-chambered anther cells, sessile stigmas and mostly climbing habit (Fig.
    [Show full text]
  • Bibliographia Trichopterorum
    Entry numbers checked/adjusted: 23/10/12 Bibliographia Trichopterorum Volume 4 1991-2000 (Preliminary) ©Andrew P.Nimmo 106-29 Ave NW, EDMONTON, Alberta, Canada T6J 4H6 e-mail: carita.nybom@welho.net [As at 25/3/14] 2 LITERATURE CITATIONS [*indicates that I have a copy of the paper in question] 0001 Anon. 1993. Studies on the structure and function of river ecosystems of the Far East, 2. Rep. on work supported by Japan Soc. Promot. Sci. 1992. 82 pp. TN. 0002 * . 1994. Gunter Brückerman. 19.12.1960 12.2.1994. Braueria 21:7. [Photo only]. 0003 . 1994. New kind of fly discovered in Man.[itoba]. Eco Briefs, Edmonton Journal. Sept. 4. 0004 . 1997. Caddis biodiversity. Weta 20:40-41. ZRan 134-03000625 & 00002404. 0005 . 1997. Rote Liste gefahrdeter Tiere und Pflanzen des Burgenlandes. BFB-Ber. 87: 1-33. ZRan 135-02001470. 0006 1998. Floods have their benefits. Current Sci., Weekly Reader Corp. 84(1):12. 0007 . 1999. Short reports. Taxa new to Finland, new provincial records and deletions from the fauna of Finland. Ent. Fenn. 10:1-5. ZRan 136-02000496. 0008 . 2000. Entomology report. Sandnats 22(3):10-12, 20. ZRan 137-09000211. 0009 . 2000. Short reports. Ent. Fenn. 11:1-4. ZRan 136-03000823. 0010 * . 2000. Nattsländor - Trichoptera. pp 285-296. In: Rödlistade arter i Sverige 2000. The 2000 Red List of Swedish species. ed. U.Gärdenfors. ArtDatabanken, SLU, Uppsala. ISBN 91 88506 23 1 0011 Aagaard, K., J.O.Solem, T.Nost, & O.Hanssen. 1997. The macrobenthos of the pristine stre- am, Skiftesaa, Haeylandet, Norway. Hydrobiologia 348:81-94.
    [Show full text]
  • Ensembl Fungi Evolutionary Analysis (Gene Trees)
    Demo: Ensembl Fungi gene trees and homologues Let’s look at the homologues of Saccharomyces cerevisiae YPR140W aka Taz1. This gene is involved in stress response and conserved across different taxonomic domains. Search for the gene and go to the Gene tab. Click on Fungal compara: Gene tree, which will display the current gene in the context of a phylogenetic tree used to determine orthologues and paralogues. Funnels indicate collapsed nodes. Click on a node (coloured square) to get a pop-up. We can then see what type of node this is, some statistics and options to expand or export the sub-tree. There are some quick filtering options below the image, where you can add paralogues, and quickly expand or collapse nodes. You can download the tree in a variety of formats. From the pop-up above you can click to export the sub-tree. Alternatively, click on the download icon in the bar at the top of the image to get a pop- up where you can choose your format. You can preview this file before you download. We can look at homologues in the Orthologues and Paralogues pages, which can be accessed from the left-hand menu. If there are no orthologues or paralogues, then the name will be greyed out. Click on Orthologues to see the orthologues available. Scroll to the bottom of the page to see a list of the species that do not have any orthologues with taz1 in Saccharomyces cerevisiae… there’s a lot! Saccharomyces cerevisiae is part of Pan-compara, which compares a subset of fungal species with species from other taxa, such as plants, bacteria and vertebrates.
    [Show full text]
  • Botanical Classification and Nomenclature an Introduction —
    Botanical classification and nomenclature an introduction — Marc S.M. Sosef Jérôme Degreef Henry Engledow Pierre Meerts Botanical classification and nomenclature an introduction — Marc S.M. Sosef Jérôme Degreef Henry Engledow Pierre Meerts by Marc S.M. Sosef1, Jérôme Degreef1,2, Henry Engledow1 & Pierre Meerts3 1 Meise Botanic Garden, Nieuwelaan 38, B-1860 Meise, Belgium 2 Service Général de l’Enseignement supérieur et de la Recherche scientifique, Fédération Wallonie-Bruxelles, Rue A. Lavallée 1, B-1080 Brussels, Belgium 3 Herbarium et bibliothèque de botanique africaine, Université Libre de Bruxelles, Av. F.D. Roosevelt 50, CP 265, B-1050 Brussels, Belgium Copyright © 2020 Meise Botanic Garden, Nieuwelaan 38, 1860 Meise, Belgium. Printed in Belgium by Gewadrupo, Arendonk. This publication is published and distributed in Open Access under the Creative Commons Attribution 4.0 International license (CC-BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original work is properly cited. A PDF file of this publication can be ordered, free of charge (send an email to webshop@plantentuinmeise.be), or downloaded from the webshop of Meise Botanic Garden at http://shopbotanicgarden.weezbe.com. DOI: 10.5281/zenodo.3706707 CIP Royal Library Albert I, Brussels Botanical classification and nomenclature, an introduction. Marc S.M. Sosef, Jérôme Degreef, Henry Engledow & Pierre Meerts - Meise, Meise Botanic Garden, 2020. - 72 p.; ill.; 22 x 15 cm. ISBN 9789492663207 Subject: Botany D/2020/0325/002 Content Introduction . 5 1. The history of classification . 9 1.1 Theophrastus to the Middle Ages . 11 1.2 Renaissance, Pre-Linnean period . 13 1.3 Linnaeus and the Linnaeans .
    [Show full text]
  • Order of Classification Biology
    Order Of Classification Biology weirdly.Spicy Albrecht Pestiferous bootstraps, Yance hissometimes wuss preface enthrall unhumanising his quirt essentially fulgently. and Effervescible retransferred and so vizierial undersea! Kam phlebotomize her palazzo carpetbagging or syllabize This allowed biologist to seek new methods of. Gorilla is a genus. And, they say, the prospect of being able to address these questions marks a new chapter in the life sciences. They have also detailed the characteristics by which organisms are clustered into those groups. Linnaean system to how does not photosynthesize, it is a file containing information to two distinct type of potential group of classification is the. It is classification order, an evolutionary relationships. Taxonomic hierarchy categories are also arise as Linnaean hierarchy and they were introduced by Linnaeus. Nature itself does coach fit into nice little groups. In this classification file pdf linnaean taxonomy depends upon and that is as observed above has a particular species. Make sure to order of classification biology aqa specification on interconnected biogeochemical processes in tamil and the basis for cat classifications might be necessary to improve chances of many ranks have? Glosario de mulder k, classification systems or drag and classifications were a simple. And that DNA, for the novel part, replicates in a very, tedious, very, original way. The many changes in our understanding of evolutionary relationships among animals have resulted in confusing and conflicting relationships among animal groups defined using ranks. Is classification order, and classifications were grouped according to biology of their right here for centuries when words. Kingdom classification order, you are prokaryotes and classifications are not always tidy and knowledge.
    [Show full text]
  • A Taxonomic Information Model for Botanical Databases: the IOPI Model
    TAXON 46 ϑ MAY 1997 283 A taxonomic information model for botanical databases: the IOPI Model Walter G. Berendsohn1 Summary Berendsohn, W. G.: A taxonomic information model for botanical databases: the IOPI Model. ϑ Taxon 46: 283-309. 1997. ϑ ISSN 0040-0262. A comprehensive information model for the recording of taxonomic data from literature and other sources is presented, which was devised for the Global Plant Checklist database project of the International Organisation of Plant Information (IOPI). The model is based on an approach using hierarchical decomposition of data areas into atomic data elements and ϑ in parallel ϑ abstraction into an entity relationship model. It encompasses taxa of all ranks, nothotaxa and hybrid formulae, "unnamed taxa", cultivars, full synonymy, misapplied names, basionyms, nomenclatural data, and differing taxonomic concepts (potential taxa) as well as alternative taxonomies to any extent desired. The model was developed together with related models using a CASE (Computer Aided Software Engineering) tool. It can help designers of biological information systems to avoid the widely made error of over-simplification of taxonomic data and the resulting loss in data accuracy and quality. Introduction Data models. ϑ A model is "a representation of something" (Homby, 1974). In the technical sense, a model is the medium to record the structure of an object in a more or less abstract way, following pre-defined and documented rules. The objecti- ve of applying modelling techniques is either to describe and document the structure of an existing object, or to prescribe the structure of one to be created. In both cases, the model can be used to test (physically, or, in most cases, intellectually) the functi- on of the object and to document it, for example for future maintenance.
    [Show full text]
  • Quantitatively Partitioning of Microbial Function Among Taxonomic Ranks Across the Tree of Life
    University of Tennessee, Knoxville TRACE: Tennessee Research and Creative Exchange Earth and Planetary Sciences Publications and Other Works Earth and Planetary Sciences 2019 Quantitatively Partitioning of microbial function among taxonomic ranks across the tree of life Taylor M. Royalty University of Tennessee, Knoxville Andrew D. Steen University of Tennessee, Knoxville, asteen1@utk.edu Follow this and additional works at: https://trace.tennessee.edu/utk_eartpubs Recommended Citation Royalty, Taylor M. and Steen, Andrew D., "Quantitatively Partitioning of microbial function among taxonomic ranks across the tree of life" (2019). Earth and Planetary Sciences Publications and Other Works. https://trace.tennessee.edu/utk_eartpubs/8 This Article is brought to you for free and open access by the Earth and Planetary Sciences at TRACE: Tennessee Research and Creative Exchange. It has been accepted for inclusion in Earth and Planetary Sciences Publications and Other Works by an authorized administrator of TRACE: Tennessee Research and Creative Exchange. For more information, please contact trace@utk.edu. RESEARCH ARTICLE Applied and Environmental Science Quantitatively Partitioning Microbial Genomic Traits among Taxonomic Ranks across the Microbial Tree of Life Downloaded from Taylor M. Royalty,a Andrew D. Steena,b aDepartment of Earth and Planetary Sciences, University of Tennessee, Knoxville, Tennessee, USA bDepartment of Microbiology, University of Tennessee, Knoxville, Tennessee, USA ABSTRACT Widely used microbial taxonomies, such as the
    [Show full text]