ARTICLE 143 ology, ology, Fungi 1 : promises and : promises internal transcribed spacer molecular barcoding molecular sequence data next generation sequencing nomenclature Key words: biodiversity ecologically cryptic environmental sequencing algorithm evolutionary placement high throughput sequencing IMC1 IMA FUNGUS · 9(1): 143–166 (2018) 143–166 · 9(1): FUNGUS IMA
Fungi Fungi Code. Fungi, given
in situ hybridization (FISH), or (4) the type sequence 2 Code "
and David L. Hawksworth and David 1 There is urgent need for a formal nomenclature of sequence-based, voucherless a formal nomenclature of sequence-based, There is urgent need for never aimed at placing restrictions on the nature of characters chosen for taxonomy, and of characters chosen for taxonomy, Code never aimed at placing restrictions on the nature ^ \ \ Department of Life Sciences, The Natural History Museum, Cromwell Road, London SW7 5BD, UK; and Comparative Plant and Fungal Bi London SW7 5BD, UK; and Comparative Plant Natural History Museum, Cromwell Road, The Department of Life Sciences, Botanischer Garten und Botanisches Museum, Freie Universität Berlin, Königin-Luise-Strasse 6–8, 14195 Berlin, Germany; corresponding Berlin, Königin-Luise-Strasse 6–8, 14195 Botanisches Museum, Freie Universität Botanischer Garten und Submitted: 2 July 2017; Accepted: 15 May 2018; Published: 22 May 2018. Article info: Submitted: 2 July 2017; author e-mail: [email protected] Abstract: Royal Botanic Gardens, Kew, Surrey TW9 3DS, UK; Jilin Agricultural University, Changchun, Jilin Province,130118 China Changchun, Jilin Province,130118 Agricultural University, TW9 3DS, UK; Jilin Surrey Kew, Royal Botanic Gardens, the requirement for physical types is now becoming a constraint on the advancement of science. We a constraint on the advancement of science. We the requirement for physical types is now becoming nomenclature and provide potential solutionselaborate on the promises and pitfalls of sequence-based of sequence-based taxa, which by default lack Types to major concerns of the mycological community. in four alternative ways: (1) the underlying samplea physical specimen or culture, could be designated Q R * [\] ¡ \ that environmental sequencing has accumulated more than one billion fungal ITS reads in the Sequence has accumulated more than one billion that environmental sequencing These unnamed in GenBank. 1,000 times as many as fungal ITS sequences Archive, about Read to 3.8 million to 140,000 currently accepted and 2.2 gap between 115,000 could help to bridge the X " The Code and the latter would be the most straightforwarditself. Only (4) would require changes to the " ] manageable if a stringent analytical approach is applied. Conceptual errors such as homoplasy, is applied. Conceptual errors such as homoplasy, manageable if a stringent analytical approach apply to all molecular and horizontal gene transfer, intragenomic variation, gene duplication, hybridization, X as phylogenetic species delimitation has workedpotential impact of these phenomena is manageable, is the likelihoodThe most serious shortcoming of sequence-based nomenclature satisfactorily in Fungi. " The probability of inadvertently same lineage. markers to delimit species within the or by using different names available is between 20.4 % and 1.5 % dependingestablishing sequence-based species that have compares favourably to a historical error rateThis on the number of globally predicted fungal species. this rate could be reduced to practically zero by addingof about 30 % based on physical types, and " . markers, sequence-based nomenclature should be limited to a single marker, preferably the fungal markers, sequence-based nomenclature should be limited to a single marker, at accurateITS barcoding marker; this is possible since sequence-based nomenclature does not aim of whetherspecies delimitation but at naming lineages to generate a reference database, independent that argue these lineages represent species, closely related species complexes, or infraspecies. We must insteadclustering methods are inappropriate for sequence-based nomenclature; this approach recognitionuse phylogenetic methods based on multiple alignments, combined with quantitative species outline strategies to obtain higher-level phylogenies for ITS-based, voucherless species, methods. We species delimitation methods, and temporal banding. We including phylogenetic binning, ‘hijacking’ conclude that voucherless, sequence-based nomenclature is not a threat to specimen and culture-based closing the gap between but a complementary approach capable of substantially fungal taxonomy, work and high skill levels. an approach that requires careful known and predicted fungal diversity, © 2018 International Mycological Association © 2018 International Mycological conditions: distribute and transmit the work, under the following are free to share - to copy, You Attribution: 2 1 Robert Lücking Formal description of sequence-based voucherless voucherless of sequence-based description Formal Non-commercial: No derivative works: if you get Any of the above conditions can be waived For any reuse or distribution, you must make clear to others the license terms of this work, which can be found at http://creativecommons.org/licenses/by-nc-nd/3.0/legalcode. Nothing in this license impairs or restricts the author’s moral rights. permission from the copyright holder. pitfalls, and how to resolve them and how to pitfalls, doi:10.5598/imafungus.2018.09.01.09 VOLUME 9 · NO. 1 Lücking and Hawksworth
BACKGROUND
Fungal taxonomy and systematics based on DNA sequencing commenced about three decades ago (Kurtzman 1985, Michelmore & Hulbert 1987, Gouy & Li 1989, Hendriks et al.
ARTICLE 1989, White et al. 1990, Bruns et al. 1991, Bowman et al. 1992). Large-scale analyses reshaped our understanding " | " et al. 2002, Lutzoni et al. 2004, Spatafora 2005, Blackwell et al. 2006, James et al. 2006, Hibbett et al. 2007, Schoch et al. * | = et al. 2014, Spatafora et al. 2016). Subsequently, focus shifted towards species delimitation (e.g. Taylor et al. 2000, Pringle et al. 2005, Geml et al. 2006, Bensch et al. 2010, Lombard et al. 2010, Lumbsch & Leavitt 2011, Leavitt et al. 2011, Nagy et al. 2012, Moncada Fig. 1. As of 2017, the fungal ITS universe in GenBank and in the et al. 2014, Quaedvlieg et al. 2014, O’Donnell et al. 2015, Sequence Read Archive roughly compare to the sizes of Earth Del-Prado et al. 2016, Lücking et al. 2016a, Hawksworth versus Jupiter. & Lücking 2017). An important step was the community- wide adoption of the nuclear ITS as universal barcoding marker for Fungi (Pryce et al. 2003, Rossman 2007, Seifert A substantial proportion of the approximately 1 million 2008, 2009, Eberhardt 2010, Begerow et al. 2010, Vrålstad ' F 2011, Schoch et al. 2012; Hibbett & Taylor 2013). Finally, labeled or represents unrecognized contaminants (Harris next-generation, high throughput sequencing (NGS, HTS) 2003, Vilgalys 2003, Nilsson et al. 2005, 2006, 2012, 2014, opened a new dimension to molecular assessment of Meier 2008, Bidartondo et al. 2008, Lücking et al. 2012, fungal diversity in environmental samples (e.g. Ronaghi & Kõljalg et al. 2013). Sixty percent of these correspond to Elahi 2002, O’Brien et al. 2005, Sogin et al. 2006, Geml et ‘uncultured’ Fungi " al. 2008, Taylor et al. 2008, Buée et al. 2009, Amend et al. 2010, Lumini et al. 2010, Hibbett et al. 2011, Unterseher only a portion of all currently accepted Fungi, about 35 000 out et al. 2011, McGuire et al. 2012, Hibbett & Taylor 2013, of 120 000 (C. Schoch, pers. comm., Hawksworth & Lücking Tedersoo et al. 2014, 2017). * $ The structured query