Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. itiuin aesuyo h e Lettuce. Sea Fort Antoine the of study case and diversity a taxa distribution: impacts repositories and misidentifications public ongoing from highlights sequences barcode of reanalysis Exhaustive Perrineau eea ye,oecnann ois setlk”seis(itoai ois ldscmol nw as known commonly blades foliose (distromatic species of “sheet-like” morphology frequently foliose been The have containing 25 even unpublished). about one only and 2020; types, and brackish rare Guiry general ocean, or & uncommon worlds are (Guiry the taxa reported these in of ubiquitous Many 400 species taxonomically. Some recognised macroalgal environments. green freshwater of in group diverse genetically a genus the of Species Introduction 1. representation improved an delivers and misannotations, highlights sequences barcode of of of library absence a the of aggregation and by species, clarification some for isolation graphical pertaining entries to the misannotated of often 21% and that of rare estimate all case We extreme with the database. results In NCBI our another the misannotated. compared are in and misidentification species foliose species species, Atlanticto of distinct East degree seven North large found the barcoding a We from demonstrate traditional strains on sequencing. foliose based generation distromatic identification next individual worldwide species or 295 making barcoding of banks, distribution traditional species gene by of the and Species investigated herbaria We coastlines. including many hazardous. blighting repositories, blooms public near-shore in of source tified well-known a is and distribution ( Lettuce Sea Abstract 2020 24, November 7 6 5 4 3 2 1 Visser de Azevedo Costa Isabel nvriyo ot nedsilnr eteo aieadEvrnetlResearch Environmental and Research Marine & of University Centre Wageningen Interdisciplinary Porto of University Aveiro de Universidade Aveiro of Science University Marine for Association Scottish Roscoff de Biologique Station Galway NUI Ulva .laetevirens U. pce iest n itiuin hsapoc ol eesl dpe oohrtaxa. other to adapted easily be could approach This distribution. and diversity species Ulva laspp Ulva 7 3 laspp. Ulva pce,typically species, oan Beniers Johanna , hlpKerrison Philip , naayi ftegoa itiuino eitrdsmlsfo ois pce loidctspsil geo- possible indicates also species foliose from samples registered of distribution global the of analysis An . 1 acsMcHale Marcus , eune rsn nteNB aaaefrtetrebarcodes three the for database NCBI the in present sequences lohca,Uvls laee sa motn clgcladeooia niy ihaworldwide a with entity, economical and ecological important an is ) , , ; Ulva .laetevirens U. 6 sblSousa-Pinto Isabel , h yeadnm-rniggnso h lohca,UvlsadUvca,are Ulvaceae, and Ulvales Ulvophyceae, the of genus name-bringing and type the .fenestrata U. 3 7 lsbt aCosta da Elisabete , erc Jansen Henrice , 1 Furthermore, . ei Cascella Kevin , nadto,3%of 30% addition, In . Ulva pce aebe ecie fwihaot9 r currently are 90 about which of described been have species .lactuca U. 6 .lactuca U. .armoricana U. lieGachon Claire , 1 7 ihe Guiry Michael , .rigida U. 2 4 hlpePotin Philippe , rmNrhr uoe loehr xasietaxonomic exhaustive Altogether, . Northern from iad Calado Ricardo , 5 fteetisaeernosylble pcmn of specimens labelled erroneously are entries the of 65% , and nre r misannotated, are entries .scandinavica U. Ulva 3 di a e Werf der van Adrie , 1 rbc n oa Sulpice Ronan and , pce a egopdit two into grouped be can species 2 L, Marie-Mathilde , 4 ai Domingues Maria , tuf n h T1 u results Our ITS1. the and A rsn sbigsynonymous being as present Ulva .rigida U. r rqetymisiden- frequently are en relatively being 7 Willem , 1 5 , Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. h ula 5 RArpas(T1,a ela h chloroplastic in the available as for for are 2010) well sequences study Kucera, partial as this obtain (ITS1), to in repeats combinations used primers rRNA different 45S three nuclear using the amplified powder was 2018). of DNA al., mg extracted sequencing ˜5 et The (Fort and Sanger in II), and TissueLyser described amplification (QIAGEN protocol DNA mill magnetic-beads ball 2.2 the a –80 using using at extraction, powder stored DNA fine and for a thoroughly nitrogen used to were liquid ground thalli in arrival, dried, flash-frozen On mm freeze immediately were boxes. ˜50 were insulated tubes a cold The and in (Micronic). seawater to artificial in sent available with and are seawater washed 2017 coordinates local June GPS with and between filled species Netherlands origin, bags of the country and strains, of Kingdom S1 list United The the 2019. September , and , foliose (), from Brittany Ireland, thalli individual collected We appropriate Foliose of 2.1 certain list between methods a adjustment and accessions, nomenclature Materials a NCBI and misannotated 2. sequences species, for all foliose between readjustment large misannotations propose for possible and vouchers and foliose phylogeneticreference database, relationships distromatic large-scale NCBI phylogenetic of sequences for the the the of in primer in view misannotations a our detailed of as extent a from the provide 2020), delimitation highlight to al., species is et study and (Fort this data strains sequencing all 110 next-generation of analysis another the containing used study and previous Atlantic, confidence names. ( with East species sequence barcoding North a several DNA the to with species employed entries a NCBI we assign matches to Here, closely difficult interest be of can it sequence characterised, the been Steinhagen, to when date 2008; Guiry, belong to & not actually Stengel, has entries Rindi, 2019), McIvor, Weinberger, Loughnane, & 2017; al., Karez, the et of (Biancarosa of number Atlantic significant East misidentification a North the findings, et those to (Hughey Given by leading work. attributed highlighted of erroneous, was species holotype the species the be sequenced foliose that fenestrata authors can distromatic is the in NCBI case where the example such 2019), the recent for al., in used most in risk and A present The species individual. sequence same individual. investigated the matching sequenced to the the belonging of as to with The species considered sequences the 2010). are NCBI Typically, of sequence Sherwood, repositories, (NCBI). classification query in & Guiry, Information available the Shipley, Fort, Biotechnology publicly with Kurihara, for sequences compared 1998; O’Kelly, Center with identity al., 2018; National compared al., et then the et are as (Coat such barcodes Miladi (ITS1)] those 2019; 1 from al., obtained Spacer et sequences Transcribed Fort Internal 2018; the Sulpice, common commonly & most [most as the repeats such rRNA for markers even chloroplast using specimens, often to of names barcoding species DNA Kamermans, attribute & Draisma, to Malta, necessary 2010; is Mathieson, species. the response barcoding & in in particularly Neefus, plasticity DNA phenotypic species, Nettleton, formerly such 1999), between (Hofmann, to differences tubes taxa Due morphological foliose (monostromatic 2015). subtle distromatic al., relatively thalli et and (Wichard factors, tubular factors environmental biotic partially to and or abiotic both tubular on based with another as and recognized Lettuce”), “Sea the oa f15sriswr olce o hssuy ncleto,smlswr lcdi clip-seal in placed were samples collection, On study. this for collected were strains 185 of total A . otl urct n icvrdasrosmsplcto fnmsi usqetpublished subsequent in names of misapplication serious a discovered and Ruprecht, & Postels Ulva Enteromorpha Ulva rbc eune o h he omnbroe rsn nteNB aaae h anga of goal main The database. NCBI the in present barcodes common three the for sequences apecleto n N extraction DNA and collection sample Ulva and L p.rle nteapicto n eunigo pcfi oii h eoe most genome, the in loci specific of sequencing and amplification the on relies spp. tuf eea.Peoyi lsiiybtentblradfloemrhtpscnbe can morphotypes foliose and tubular between plasticity Phenotypic genera). ,rsetvl.TeIS rmr eedsge rmtedtstotie in obtained dataset the from designed were primers ITS1 The respectively. A, al S2 Table rbc rbc L, Ulva and L tuf n rgnt rm(eshe l,20)ad(anes& (Saunders and 2009) al., et (Heesch from originate and , 2 lafenestrata Ulva ic fboascletdadpae nsrwcp tubes caps screw in placed and collected biomass of piece niiul ihatal area thalli a with individuals ,IS)o 8 tan fdsrmtcfoliose distromatic of strains 185 on ITS1) A, tuf lalactuca Ulva 2 A lalactuca Ulva , u loncermressc sprso h 45S the of parts as such markers nuclear also but niiul eotdi h ieauei the in literature the in reported individuals ic h xeto ianttdNCBI misannotated of extent the Since . ines swl stehltp of holotype the as well as Linnaeus, rbc and L tuf > acds h primers The barcodes. A 00mm 1000 ° .Te,samples Then, C. Ulva 2 Ulva n3 ie in sites 34 in pce.We species. Ulva species. > Table from Ulva 99% Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. nerahn ovrec ausblw00.Bosrpvle eecmue sn h –smti tbe” “–bs-metric the using computed were values 2010) Bootstrap Stamatakis, 0.03. & likelihood below Moret, automatically values Maximum Bininda-Emonds, stopped convergence Alipour, appropriate. was (Pattengale, reaching most Bootstrapping “–all” Test once trees). the the Bootstopping using bootstrap deemed MRE-based 2019) then a was Stamatakis, distribution inferences, & using I) Gamma likelihood Morel, + Flouri, + maximum on Darriba, Posada, (20 G Reversible (Kozlov, based option & Time + RAxML-NG determined using Doallo, General (GTR obtained was Taboada, alignments, sites were alignments invariants three (Darriba, trees three all of 2 For the Proportion jModeltest 2004). of + using Buckley, each score & for ITS1, Posada Criterion) the model 2012; Information for evolutionary (Akaike analyses best AIC phylogenetic the their MCMC First, Bayesian and datasets. likelihood A maximum differences. relevant both due genetically were used to We sequences not the modified and of were length), end analyses data sequencing and Phylogenetic missing (or beginning to 2.4 primers the due different at sequences nucleotides of the missing use of The the some bases. to 1,062 in unknown) bp), positions (i.e., (270 missing “n” sequences ITS1, of into 1,245 for presence containing bp) the alignments (801 by sequences final introduced 1,320 yielded and steps bp) ( filtering (1,231 barcodes The sequences sequence. organellar in the while only between species), but methods between filtering varies using different removed length two then of were use alignments studies. and The different trimmed informative 70). in retain the -seqoverlap primers to in different (trimal as (for bases of alignment unknown 50 use the -Seqoverlap 50% the of (Capella-Guti´errez, Silla- Trimal than ends to Trimal 3’ due more using nucleotide and containing positions removed sequences 5’ missing Sequences the then large the a trimmed avoiding we of effectively thereby amplify primers, step nucleotides, 60% might different This than study 0.09). of more each -gt use (Trimal Because i) the for present. in to 0.4 gaps absent -gt due Gabald´on,Mart´ınez, 2009) numerous & were barcodes the that the to of positions due for portion alignment, settings different ITS1 1,321 default slightly and the the 1,312 for using 1,726, yielded method 2019) filtering Yamada, This & sequences. “ Rozewicki, corresponding containing ITS1, their not extract for to sequences sequences 2009) all al., et 1,432 selecting (Li 2020), by Faidx al, “ dataset, (containing et information the Fort species contain from and not study did that this entries NCBI including 1,114 total and in total) (1,975 in (1,732 sequences “ sequences ITS follows: 1,679 as yielded were keywords strategy “ search sequences, The in 2020]. ITS rbc summarised in al, for available interest, et transcribed” [Fort are of internal ITS, study used taxa AND previous for software other our NCBI the any from therein. download the in sequences misannotation to in used links species sequences be available and of all could codes evidence that line any Command detect pipeline . and analysis species, an to tubular designed attributed We sequences partially all and consider tubular to including aimed analysis phylogenetic analyses Our barcode. 4,000 phylogenetic each at for for centrifuged primer compilation were forward Dataset Pellets the min. 2.3 using 30 sequencing for Sanger ice for on (Germany) 4 incubated GmbH at and min acetate 30 95 ammonium for at 7.5M denaturation of of 72 volume cycles 0.1 at 35 extension used and protocol s PCR The 25 (Bioline). in mix performed was 0.65 amplification DNA, PCR 2020). al., et (Fort eune,ad“ and sequences, L tuf )adIS a eas h T1ainetcnan asta r ilgclyrlvn teITS1 (the relevant biologically are that gaps contains alignment ITS1 the because was ITS1 and A) μ f2 mlfradadrvrepies 9.25 primers, reverse and forward pmol 20 of L ° ,adwse wc ih7%EO.Fnly C mlcn eesn oLCGenomics LGC to sent were amplicons PCR Finally, EtOH. 75% with twice washed and C, rbc Ulva ° o 0s C rdcswr rcpttduig25vlmso 0%EO and EtOH 100% of volumes 2.5 using precipitated were products PCR s. 30 for C and L ogns]ADtf gn]ADpatd[le] for [filter]” plastid AND [gene] tufa AND [organism] rbc and L tuf tuf ,rsetvl.Sqecswr hnainduigMFT(Katoh, MAFFT using aligned then were Sequences respectively. A, rbc tuf rbc eune 133sqecsi total). in sequences (1,393 sequences A and L ) n oeta 0 nnw ae o h T1alignment ITS1 the for bases unknown 70% than more and A), and L rbc Ulva and L tuf tuf 3 ogns]AND [organism] oigsqecsgnrlyd o ayi length, in vary not do generally sequences coding A ,adi)i oeta 1 ftesqecsfrITS1 for sequences the of 91% than more in ii) and A, tuf μ rbc ecinvlm otiig1 containing volume reaction L a f13 of (as A rbc μ fmlQwtrad12.5 and water miliQ of L lasp Ulva and L and L lasp Ulva tuf ° rbc o 0s neln t60 at annealing s, 30 for C ntertte n sn Samtools using and title, their in ” Ulva th tuf ,respectively A, sogns)wr hnremoved then were organism) as ” gn]ADpatd[le] for [filter]” plastid AND [gene] L fJl 00,i diint the to addition in 2020), July of ,adteieaieFFT-NS-i iterative the and A, pce nteNB database, NCBI the in species tuf ieS1 File eune.Ti search This sequences. A . h ’ad3 gaps 3’ and 5’ The μ edownloaded We . fMTqRed MyTaq of L μ Ulva fundiluted of L rbc [organism] and L ° o 30 for C i.1 Fig. rbc rbc tuf L L g Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. h 8 ape eune nti td rgntn rmteNrhEs tatcbln osvndistinct seven to belong Atlantic East North the as from identified originating samples study 19 this with in clades, entries sequenced and samples probabilities), 185 (including The tree analysis MCMC for Bayesian names the species support), bootstrap (including tree ohnoi, U. in laetevirens listed namely names species, laetevirens species foliose Ulva the distromatic used obligatory we to , synonymous, belong ( be ten presence 19-28) interval these, to the (confidence Of revealed found sequences analysis been two GMYC have than more names sequences. containing clades 1,245 24 containing of tree phylogenetic Likelihood Maximum from sequences L common the most used three We the for all study, of this Analysis in 3.1 sequenced Atlantic all East analysed North in and the used recovered from barcodes we strains created, additional we 185 as pipeline analysis the Using of windows Results creating 3. areas, publication, geographical name, similar from 2011). specimen, entries (South, vouchers, NCBI) merge Rworldmap of the to list degrees in km complete two in sequenced 12,100 available of The of barcode is multiple address attribution one in kept. affiliated species than grouped was the proposed more entry and used where with one coordinates we coordinates GPS specimens only publication, GPS and (i.e., or retrieve removed coordinates specimens to were contain searched Duplicated not manually authors. did were voucher the origin the of Where location available. and/or coordinates GPS ing and Chihara foliose [[?] distromatic G.Alongi & main G.Furnari eleven in the entries NCBI to virens the ( belonging of scripts as all python analysis of Areschoug, custom name our using publication in recovered and assigned were name specimen datasets coordinates, three GPS the origin, of country foliose The distromatic of a distribution with Species 2.5 2020), ( Figtree al., & (Fujisawa ( using R et Inkscape visualised in Fort packages in were score Splits annotated 2019; 50 trees and ESS and Rncl an All al., the BEAST, with 2013). using et in confirmed performed Barraclough, 2006) (Fort was was al., delimitation Convergence per et Species (MCMC). all Pons parameters. as Carlo relevant 2013; for Monte method Barraclough, 200 Chain & Markov same (Fujisawa than millions model the higher Coalescent Yule used were split Mixed we (ESSs) General of delimitation, sizes deviation sample species standard average For estimated the and until 0.05, with datasets, of 2012), three maximum parameters. al., the et a Bootstrap between (Ronquist Felsenstein reached generations MrBayes classical frequencies using of with compared performed number were 2018) for varying al., analyses support a et MCMC higher Bayesian (Lemoine produce sequences (FBP). to of Proportions expected hundreds values, with (TBE) trees Expectation large Bootstrap Transfer representing option, laexpansa Ulva Areschoug, n nte ld containing clade another and 2 and h ol a n i-hr itiuinof distribution pie-chart and map world The . .fenestrata U. .ohiohilulu U. rbc , .rigida U. h MCseisdlmtto,hwvr aldt iciiaebtenfiespecies. five between discriminate to failed however, delimitation, species GMYC The . lafenestrata Ulva Ulva aae eeae nti td,ta rmFr ta,22,a ela l available all as well as 2020, al, et Fort from that study, this in generated dataset L Ulva rbc U.ohnoi laspp Ulva a efudin found be can L hlgn,namely phylogeny, nre seMtrasadMtos.Fo the From Methods). and Materials (see entries https://inkscape.org/ r hw ob oseic swl sasnl ld otiigboth containing clade single a as well as conspecific, be to shown are otl Ruprecht, & Postels ..plig&ARSewo] ulctosascae ihNB nre miss- entries NCBI with associated Publications A.R.Sherwood]. & H.L.Spalding .rotundata U. .iak S.Shimada, & M.Hiraoka . , laaustralis Ulva rbc .pseudorotundata U. eune rmpbi repositories public from sequences L .pseudorotundata U. i.S1 Fig. Bliding], rbc ). L, .lactuca U. , tuf lagigantea Ulva and , .expansa U. n ITS1. and A Ulva 4 .rigida U. al S3 Table Linnaeus, 1smlsas samples 21 , species. Ulva and http://tree.bio.ed.ac.uk/software/figtree/ Sthl)Sthl N.L.Gardner, & Setchell (Setchell) , .adhaerens U. C.Agardh, pce a rae nRuigtepackage the using R in created was species laohnoi Ulva ie S2 Files al S3 Table ). .gigantea U. laarasakii Ulva Ulva .fenestrata U. rbc and .pseudorotundata U. aiue n ogtdswere longitudes and Latitudes . eune nteNB,a well as NCBI, the in sequences , i.2 Fig. h ulmxmmlikelihood maximum full The . lalactuca Ulva lgmn,w eeae a generated we alignment, L Ulva S3 (K al 1 Table tig Bliding, utzing) ¨ ,rsrce ovouchers to restricted ), pce [i.e species , .Snesvrlspecies several Since ). lapseudorotundata Ulva 7as 47 , , sorreference our as larigida Ulva .lactuca U. .australis U. > M.Cormaci, .australis U. .arasakii U. 0 o all for 200 .laete- U. ,and ), Ulva and and rbc ˜ . , Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. ih25idvdas 1i hssuy 1i Fr ta. 00,107 2020), al., et (Fort in 11 study, this in 21 individuals, 225 with the , expected, as species, foliose the For for than the clusters using species more analysis significantly same the performed We all of Analysis 3.2 of mixture a containing tanneri clades defined poorly one for misidentifcation species, for other as Concerning same 452119310, (# set population same ohnoi either U. as annotated are 2019), which al., of et 48 (Hughey synonymous dataset, be our to shown in represented not the species and foliose large the Of five of out Two scandinavica 2020)). al., et laetevirens (Fort and study this with identity 100% showing study, five this all the from between in entries confusion individuals 21 common 63 However, of contains 2014)]. four locality al., clade and type from et the This 2020), EU484408 20 Australia, entry al., as South NCBI misidentification. et Phillip, well (Fort as species well from as likely as individuals 2020), of 2015) 38 al., al., cases et et several the Finally, Rautenberger (Fort with 2009; 2008). in al., al., described et et (Loughnane (Heesch 29 in study, the described Next, this NCBI, 452119310). likely from (number several set strains population as 12 unpublished well same as the from specimen), originate 2018; appear type one al., entries the Sharma, et all Krupnik & including 2008)), one 2004; 2016)], al., including Masuda, entries, & et Lopez-Bautista, misannotated Uenosono, (Loughnane & Shimada, from (Hiraoka, Collado-Vides, the 3 in to Melton, [described and belonging database 2020) NCBI strains al., the either 69 et from of found (Fort individuals We in all annotated. 10 with well study, discrepancy, this no shows in clade individuals This 2009). al., or 2020). et (Heesch al. et and (Fort 2010) Spain from 2019). strain al., one for et except Brittany, and Ireland fenestrata from originating all 2008), al., sequences holotype present once five entries change could from names and species All 2020) the al., indeed et that caveat (Fort the available. from with become sequences database, from from NCBI (apart comparisons the yet in with sequenced attribution been species not our have species those as 13 .pertusa U. rbc .pseudorotundata U. .intestinalis U. .gigantea U. .scandinavica U. L . .expansa U. .pseudorotundata U. orerroneous four , eune eeanttdbsdo their on based annotated were sequences ns n h te he oss igeplmrhcst.Altogether, site. polymorphic single a possess three other the and ones, r oelkl ob yoyoswith synonymous be to likely more are wihaesnnmu;see synonymous; are (which laaustralis Ulva .reticulata U. as 2 , .intestinalis U. n two and ld otissxNB nre,four entries, NCBI six contains clade tuf nre locutrwti the within cluster also entries ape showed samples eune rmpbi repositories public from sequences A .ohnoi U. .reticulata U. .compressa U. .pseudocurvata U. .laetevirens U. .rigida U. a dnie ae on based identified was oskladone and Forsskal ocesi h CIdtbs rm(inaoae l,21;Luhaeet Loughnane 2017; al., et (Biancarosa from database NCBI the in vouchers n nytrelkl ianttdsqecsi the in sequences misannotated likely three only and , .lactuca U. 2as 12 , .fenestrata U. .rigida U. .laetevirens U. and > two , .laetevirens U. inesand Linnaeus 9 iiaiywt hs ecie y(ote l,22)adthe and 2020) al., et (Fort by described those with similarity 99% niiul eg,NB oce U393adms nre from entries most and EU933943 voucher NCBI (e.g., individuals .rigida U. al 1 Table .laetevirens U. tuf U.laetevirens or nre.Teohrseisaemr rbeai,wt several with problematic, more are species other The entries. .taeniata, U. three , rbc al 1 Table .prolifera U. .fasciata U. .taeniata U. acd ( barcode A .ohnoi U. ld hw h aenm iapiainwith misapplication name same the shows clade .laetevirens U. acd 4 pce lses ofiec nevl37-46). interval confidence clusters, species (40 barcode L > en rsn ihntecae For clade. the within present being ) 9 dniywt h ooyeof holotype the with identity 99% ld otiig18srisapasmr problematic, more appears strains 138 containing clade .laetevirens U. n 3as 63 and .laetevirens U. 5 .lactuca U. > .scandinavica U. .ohnoi U. 9 dniywt hs f(rf,Kat Waller, & Kraft, (Kraft, of those with identity 99% ld eeasge as assigned were clade ial,the Finally, . lafenestrata Ulva .intestinalis U. ianttdsequences). misannotated neetnl,alsix all Interestingly, . wihaesynonymous, are (which Giy&Giy 00,adtofo hn (Du from two and 2020), Guiry, & (Guiry Sthl)Sthl ..ade.Ms entries Most N.L.Gardner. & Setchell (Setchell) one .arasakii U. .expansa U. , i.3 Fig. .linza U. .beytensis U. ld,2i hssuy 57 study, this in 2 clade, .laetevirens U. three , nre tofo Kate l,21) Port 2010), al., et (Kraft from [two entries ld,wt l six all with clade, . , .fenestrata U. .lactuca U. i.S2 Fig. nre r nitnusal from indistinguishable are entries , srpeetdb igeindividual, single a by represented is .fasciata U. n two and inesaewl end ihno with defined, well are Linnaeus Hge ta. 09) ebased we 2019)), al., et (Hughey ; .flexuosa U. one .rigida U. eas othltpsof holotypes most Because . and .lobata, U. ld otis5 sequences, 58 contains clade .rigida U. .armoricana U. .laetevirens, U. nre n 86 and entries al 1 Table one , .rigida U. al S3 Table .armoricana U. , .armoricana U. .fenestrata U. .californica U. .compressa U. niaigalikely a indicating , .beytensis U. .ohnoi U. hc aebeen have which nre rmthe from entries .gigantea U. ,toerroneous two ), ld contains clade .W found We ). l rmthe from all .australis U. nre and entries .lactuca U. .lactuca U. vouchers (Hughey and and entries Thivy clade: Ulva (13 U. U. U. Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. 09 rpi ta. 08 e,Kn,&Km 09 etne l,21;Mld ta. 08,btas the also but 2018), al., et Miladi 2016; al., et Melton 2019; Kim, & Kang, three Lee, of 2018; presence al., et Krupnik 2019; 2020). al., et (Fort rbc CIetiso l he pce netie ihnalrecae usd fteLPcmlx other complex, Discussion LPP 4) the of misannotated Outside likely twelve intestinalis. contains clade. clade the 2011) large Similarly, Ichihara, a vouchers. & Masakiyo, within (Horimoto, intertwined S.Shimada & with species delimited, three poorly is all narrow-tubular complex of (LPP) Linza-Procera-Prolifera entries the NCBI species, narrow-tubular to regard With as as annotated other annotated clade. two most are the vouchers, with 21 within four identity sequences, contains 100% other clade show the the which Of of Hillary all 2014)). 2002; clade, as al., Waaland, annotated et entries & fenestrata Mao NCBI Hayden U. 2010; of S al., six extraneous Hillary et only An and 1998; Kraft 2020), al., 1999). al., et al., et (Coat (Fort et Tan from rigida rest 2004; Waaland, U. the erroneous & 2020), Three Hayden al., The 2018)]. S. et al., (Fort sources. et from unpublished Monotilla 29 from 2013; Paul, all 23 & found, as Nys, were well de as Mata, study, Lawton, this 2004; from sequences three contains clade erroneous and 19 and 2002), Waaland, misannotated & Hayden S (Hillary lactuca by annotated/submitted AY260562, entry NCBI with particularly with barcodes, The 40 previous with the (compared 59. with clusters to 34 species agreement of ( 42 general dataset predicts in barcode delimitation are ITS1 species results the on the repeated repositories again, was public analysis from the sequences Finally, ITS1 all of Analysis 3.3 the with in identity defined 100% displaying both ers, one from as apart misidentifications such species, foliose between tion common Less species, other Wilson, For 2010). Kim, Kucera, Mao, 2019; & al., Saunders et Kang 2018; clades. H. al., (J. database et NCBI Miladi the expansa from 2014; 25 2012), Yarish, and Moro, 2020), & al., & The et Andreoli, (Fort Sciuto, 2020). in Wolf, previously al., 2019; et al., (Fort et and the (Steinhagen to dataset NCBI belong the all in present are for sequences as same 452119404, number australis Ulva The entries. results. L .gigantea U. .fenestrata U. nadto,ti ld otisasingle a contains clade this addition, In . , .arasakii U. ld.Fnly the Finally, clade. .lactuca U. .linza U. sfor As . .pseudorotundata U. and tuf .pedrtnaatuf pseudorotundata U. Ulva T1sqecsaeol ecie ntepeetsuyad(ote l,22) The 2020). al., et (Fort and study present the in described only are sequences ITS1 .fasciata U. .laetevirens U. tuf dataset A .iatatuf U.gigantea laohnoi Ulva ld 2 nti td,1 n(ote l,22) nycnan single a contains only 2020)) al., et (Fort in 11 study, this in (21 clade pce perwl eiie,wt w xetos The exceptions. two with delimited, well appear species and rbc and per oeaporaethan appropriate more appears A ld otis1 eune,tno hc noae as annotated which of ten sequences, 16 contains clade .tepida U. .ohiohilulu U. eut,w found we results, L .prolifera U. . nre n one and entries .laetevirens U. .laetevirens U. sgnrlywl icmcie,wt 92 with circumscribed, well generally is .lactuca U. ld.Ide,the Indeed, clade. .mediterranea U. ldsapaswl end ihn aemsplcto,smlrt the to similar misapplication, name no with defined, well appears clades A .aaio&SSiaacaecnan eea nre noae as annotated entries several contains clade S.Shimada & Y.Masakiyo eune,wihbln othe to belong which sequences, swl sbetween as well as , eune nycnanidvdasdsrbdi hssuyadin and study this in described individuals contain only sequences A r ersne ihmr hntoetis ahwt hi separate their with each entries, two than more with represented are .flxoa lacompressa Ulva flexuosa. U. ianttdsqecs.Itrsigy hl 18 while Interestingly, sequences). misannotated ld otis14sqecswt 2fo hssuy 8from 38 study, this from 62 with sequences 134 contains clade .prolifera U. ld otis5 tan dnie nti td,3 identified 38 study, this in identified strains 59 contains clade .armoricana U. .rigida U. .californica U. lni omc .unr n one and G.Furnari & Cormaci Alongi, .rigida U. .expansa U. 6 nr,alfo nulse tde pplto set (population studies unpublished from all entry, rbc .laetevirens U. ld otis5 eune 1 rmti study, this from (14 sequences 50 contains clade .californica U. .ohnoi U. o pce eiiain ihacersepara- clear a with delimitation, species for L ld nycnan eune rmti study this from sequences contains only clade and .laetevirens U. n w as two and nr,lkl ianttd efudtwo found We misannotated. likely entry, i.4 Fig. .lactuca U. .scandinavica U. .australis U. oces[ecie n(iak tal., et (Hiraoka in [described vouchers and .ohnoi U. eune.Regarding sequences. and , tuf i.S3 Fig. .intestinalis U. .lobata U. .flexuosa U. dsrbdb D ta. 2014; al., et (Du by (described oce a on ihnthe within found was voucher ) ihacndneinterval confidence a with A), clade. .meridionalis U. .fasciata U. oces(.H age al., et Kang H. (J. vouchers ihnthe within and .rigida U. lapseudorotundata Ulva .fasciata U. ih10 identity 100% with , .prolifera U. al S3 Table ihu apparent without , r iial well similarly are [[?] .rgd tuf rigida U. tuf .laetevirens U. .fenestrata U. .expansa U. n n as one and , .lactuca U. R.Horimoto .prolifera U. .Indeed, A. sequences .ohnoi U. .Once ). vouch- U. U. U. A ] . , Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. ec,cuinsol etknwe sinn pce ae otoevouchers. those to names species assigning when taken be should caution Hence, . show we addition, In specimens with of clade. epitype synonymous each are corresponding to 1998) names and Tan, species neotype assign for conclusively lectotype, that specimens to necessary, here holotype sequenced Where Unfortunately, and designated 2020). available. be al., not both should three et since those (Fort are surprising within hybrids contained species not interspecific SNPs is of both discriminating perhaps of viability It handful the a and 2018). only al., with barcodes, close, et relatively (Miladi are by sequences hinted species recently was as which classified repositories, public be erroneously to will comparing belonging matches 2 when sequences best exercised the of of be of ˜40% should use some caution Indeed, since Hence, BLAST significant. 334). using / is (127 2019) misidentification misannotated al., this et are (Hughey that by quality the reported found underlying recently within the was we found is it sequences were instance, to For misidentifications name several repository. species the that assign within to annotation repositories species public the of all the of use of the or with subset available, issue a are main repositories. contains The they public only if studies likely in sequenced past database misidentifications be in NCBI Species should the of morphologically 4.2) specimens as number described These resampled, entities the from evidence. species of originate genetic numerous they that no site of than is discrepancy presence its lower there apparent the which This despite significantly by from 2020)). are analysis, Guiry, explained present & numbers this the (Guiry be to These remaining, agreed could according broadly NCBI. (90 names which thus taxonomically in of species species sequences, nine accepted two described unique currently with than 47 species 56, more is containing the of dataset clusters of number NCBI species He, Out the ˜40 & from found Cai, 2019; synonymous. entries limitations, Zhang, the al., as with Wang, the in et those classified Zhou, of names” Hughey and being data “species 2017; 2017; 2020), of barcode Jiao, available number Jiao, al., & the the & et Interestingly, He, all (Fort He, of Zhou, in Zhou, delimitation Cai, studied Wang, species Wang, genomes species Cai, precise 2018; organellar foliose 2016), 2018; full Gabrielson, six al., as & the such et Miller, information, of Hughey, avoided, (Cai genetic outside be genomes of should Hence, organellar amount literature sequenced larger the 2020).. a in al., on elsewhere et and performed (Fort here be used ideally separate barcodes should to single and able on are analysis we delimitation while species However, species. ITS1 two The compressa those clusters. of two separation the between the lactuca delimitation for species allowed a SNPs similarly for allowing of for barcode 2020). respectively), observed number two, al., speciation we versus limited et intervals (nine the (Fort species a confidence skew study large containing can relatively previous barcode dataset the our single Barraclough, entire explains likely a using the & contrast, This when Fontaneto, in In used. especially sequences Humphreys, is Tang, analysis, spurious species GMYC possibly 2014; between the of al., of presence et threshold the (Leliaert as Indeed, information such 2019), genetic 2014). al., species limited et Notably, a Hughey 2004; using al., barcodes. et common Hiraoka ohnoi 2020; U. three al., intervals. for et confidence (Fort clusters NCBI species species large the relatively using yields interest. from sequences instance, of of For amount sequences taxa large our available a for such available using publicly information delimitation genetic all the avoided. assess used exhaustively be We to endeavoured should we barcodes study, this common In three using delimitation Species 4.1) .Frhroe u td hw that shows study our Furthermore, ). .rigida U. .fenestrata U. and and .armoricana U. , .intestinalis U. .ohnoi U. .lactuca U. Luhaee l,20)btti nlsssget ti nfc yoyosto synonymous fact in is it suggests analysis this but 2008) al., et (Loughnane tuf rbc ooyedsrbdb Hge ta. 09 stebsln o hsseis( species this for baseline the as 2019) al., et (Hughey by described holotype ewr bet separate to able were we A using i o lo osprt eti aata eepeiul hw ob separate be to shown previously were that taxa certain separate to allow not did L dsrbdb Do,D eir,&Ca,19) and 1998)) Coat, & Reviers, De (Dion, by (described uhadsrpnyi neett ag-cl pce eiiainaaye when analyses delimitation species large-scale to inherent is discrepancy a Such . and tuf .laetevirens U. .prolifera U. A, .ohnoi U. tuf .rigida U. ldsaesprtdit eea u-rus loehr precise Altogether, sub-groups. several into separated are clades ipasmr Nsthan SNPs more displays A ssprtdit w ieetgop.Similarly, groups. different two into separated is . lascandinavica Ulva .laetevirens U. and .fenestrata U. 7 .laetevirens U. and a rvosytogtt esynonymous be to thought previously was ld.Hr,uigalsqecsavailable, sequences all using Here, clade. .rigida U. r locmol ianttdin misannotated commonly also are rbc Ulva .rigida U. hncmaigtoetwo those comparing when L .lactuca U. hc si gemn with agreement in is which , .scandinavica U. eeasol eavoided. be should genera .fenestrata U. and espotthe support We . .laetevirens U. Ulva .laetevirens U. .fenestrata U. .linza U. Btel & (Battelli sequences species. Table rbc , U. U. L. or Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. ial,teohrsxlrefoliose large with six while hemispheres, other South the and Finally, North ( the latitudes of narrow waters in temperate present are species Three australis tiigy no Strikingly, Zealand, New and coast and Atlantic East the to sam- of density the ( Indeed, particular foliose Netherlands. in of the diversity Ireland and species in Brittany Zealand. the dates as New such sampling and regions and Australia nearby Americas, pling in the individuals , of East number sea, ( Mediterranean sequenced the however, coast, specimens Interestingly Atlantic unique the of in unknown number areas. present currently highest those the from within is information, with present identity originating genetic tributed, are containing sequence species species locations its NCBI new/un-sequenced the that but no From likely decade with also is a 2009), is repositories It than Kandjengo, public eleven country. more & in this those for Shuuluka, released of foliose and presence Robertson-Andersson, of the areas South (Bolton, distribution preclude those in not global from commercially does the originating used this characterise data While precisely genetic the to taxa. more of many needed that distribution for shows the case it of the species, is map as world Oceania, those a East from generated data we genetic entry, of NCBI each foliose for large name eleven species re-annotations. reassigning proposed and After or sequences/species foliose accessions misannotated of reference highlight distribution appropriate similar to of Global and list interest 4.3) overlooked, a of be with taxa community not different scientific should are on the repositories misidentifications provide performed public as be in species, could misidentifications those analyses to for indicates of remains potential but sequences clade study the that this the Altogether, after of within analyzing entries scope delimitation when present. the NCBI species taken outside be the of is be to clusters matter likely of also those the should re-analysis of 2020), delimitation caution further al., species Kim, that require precise et Kim, The (Xie will between Kang, resolved. viable hybrids and J. be be Since E. 2009), to 2018; al., shown specimens. al., been holotype et et of Leliaert (Cui sequencing separation complex 2014; organelle the (LPP) Kim, within Linza-Procera-Prolifera & lies the here Choi, within found species hurdle the major of the species, tubular “organism” misannotated partially the of of or incorrect, recurrence and if and tubular amplification update, Concerning the to avoid NCBI thus as the such numbers, on species, accession sequences their deposited https://www.ncbi.nlm.nih.gov/genbank/update/), of have (see information that sequences authors NCBI foliose encourage in large to eleven propose we associated the we Finally, of information barcodes the attribution. three date species all proposed iden- for and our accessions vouchers NCBI as specific reference well searching as for viewers here, Alternatively, pdf highlighted of species belong. function they search clade the which use to in to tifying provide researchers We allow results. to BLAST in on codes based annotation species for check” the encourage we foliose and large of analysis the Overall, .lna .procera U. linza, U. .ohiohilulu U. .pseudorotundata U. nBianadIeadrmist eexplained be to remains Ireland and Britain in .lactuca U. .lactuca U. etitdt h e fJpnadHwii respectively. Hawai’i, and Japan of sea the to restricted Ulva Ulva and .australis U. pce rmwihteei eei vdne( evidence genetic is there which from species etitdt h atAlni coast, Atlantic East the to restricted .prolifera U. niiul r rsn nteNrhAlni n h atcSa usd fa of outside Sea, Baltic the and Atlantic North the in present are individuals cetfi omnt ouetetesdsrbdhr sptnil“ceso quality “accession potential as here described trees the use to community scientific . Ulva Ulva pce o h fia os,SuhAeiaadSuhEs Asia/North East South and America South coast, African the for species Ulva scnpcosyasn rmteIihadBiihcat,dsiealarge a despite coasts, British and Irish the from absent conspicuously is Ulva nIead n no and Ireland, in niiul.Ti per ob nogigisewt h delimitation the with issue ongoing an be to appears This individuals. laspp. Ulva pce r oegorpial oaie ( localised geographically more are species pce hwd˜1 fmsnoae nre nteNB database, NCBI the in entries misannotated of ˜21% showed species .expansa U. i.5 Fig. al S3 Table .laetevirens U. al S1 Table 8 idepanel: middle , .australis U. oteNrhEs aic n finally and Pacific, North-East the to otisalo h ceso ubr ftefoliose the of numbers accession the of all contains .lactuca U. ieyalwdfra xasiecpueof capture exhaustive an for allowed likely ) .gigantea U. and Ulva eercvrd hs h bec of absence the Thus, recovered. were i.5 Fig. .linza U. i.S1 Fig. .fenestrata U. .australis U. pce.Frisac,foliose instance, For species. and i.5 Fig. Ulva oteNrhAtlantic, North the to o panel, top , .ohnoi U. to and pce.A ti ipet up- to simple is it As species. S3 .Ntby hr salack a is there Notably, ). .prolifera U. i.5 Fig. r h otwdl dis- widely most the are h re faltrebar- three all of trees the speeti h odto cold the in present is aorwre waters. warmer favour i.6 Fig. otmpanel), bottom , al 2 Table pce have species .Te are They ). .arasakii U. .rigida U. itof list a Ulva Ulva Ulva U. is Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. MC ultesicuigbosrpvle n aeinpseirpoaiiisaeaalbein available using species are determined probabilities and/or clusters posterior species species bayesian represent semitubular and clades or values colored tubular bootstrap and including represent Shaded trees clades study. Full Shaded this GMYC. in study. representative this no foliose in with 1,245 on distromatic found rooted of main species alignment, the ITS1 tree foliose the to phylogenetic of belonging tree Likelihood vouchers likelihood the in Maximum available using of species are determined description probabilities and/or 4: clusters posterior species species bayesian represent semitubular and clades Fig. or values colored tubular bootstrap and including represent Shaded trees clades study. Full Shaded this GMYC. in study. representative this no foliose in with 1,320 distromatic found of main species the tree foliose the to phylogenetic of belonging tree Likelihood vouchers likelihood the Maximum in of available GMYC. description no are 3: using with probabilities determined species posterior clusters bayesian and/or Fig. species and species represent values semitubular clades bootstrap or colored including tubular and trees represent Shaded Full clades study. Shaded this in study. foliose representative this distromatic in main found the species to the belonging of entries 1,062 tree of the tree of phylogenetic Likelihood tion Maximum 2: Fig. of study. this taxa in other used framework any . Analysis to 1: geographical transferred misannotations. Fig. specific be suspected to could and belong sequences detail of legends species in amount Figure foliose large here contain validation most described that databases. that accession framework those the and particularly identification, analytical interest, within oceans, species The names world’s future species the locations. for the throughout used of foliose present representation be of are accurate distribution can worldwide ensure dataset the to the Interestingly, This purposes, of classification annotation species. and and foliose composition large species ongoing highlight on the to clarify interest, to of species. all contribution synonymous taxa uncharacterised investigated of previously we to information names genetic Here, reassign potentially the and reassess issues identification and species revisit to important becoming species. one than Conclusions more (e.g., contain environments areas of most range since species, wide as a such to species localised tolerant most the the improve that from “ both indicate and of results to distribution profile name the need nutritional the together, Taken the biodiscovery, under highlights present products This at natural consumed 2016). effectively for are Calado, description essential accurate & is Furthermore, Blunt, of literature 2019). identification Munro, the ( al., Hilario, et results in (Leal, (Barbier our used France traceability with of species outside contradiction the Europe in of in Atlantic, consumption North records. food only the species for since in reporting contains challenge for located (OBIS) significant sequencing which a DNA System on of Information rely most Biodiversity not , do Ocean that the databases instance, other For in of higher reports even the is above, problem shown as As to USA. misannotated referable Massachusetts, and in aquarium specimen an single from recovered specimen .fenestrata U. rbc u oteicesnl ag ubro eune en eoie npbi eoiois tis it repositories, public in deposited being sequences of number large increasingly the to Due lgmn,roe on rooted alignment, L Ulva tuf pce n hneteErpa odrglto yicuino the of inclusion by regulation food European the change and species .pseudorotundata U. lgmn,roe on rooted alignment, A Ulva motnl,wietenme fmsnoain nteNB ssgicn,the significant, is NCBI the in misannotations of number the while Importantly, . eune nteNB ulcrpstr o he omnbroe,a a as barcodes, common three for repository public NCBI the in sequences Ulva rdcslble scontaining as labelled products Umbraulva tas ihihstewrdiec-curneo foliose of co-occurrence worldwide the highlights also It . Umbraulva Umbraulva Ulva eune.Clrdcae ersn itoai foliose distromatic represent clades Colored sequences. Ulva 9 h ito cit n otaei vial in available is software and scripts of list The pce rsne eeso htol e species few a only that show here presented species lalactuca Ulva pce ih aevrigdgeso specialization, of degrees varying have might species eune.Clrdcae ersn distromatic represent clades Colored sequences. eune.Clrdcae ersn distromatic represent clades Colored sequences. .laetevirens U. laspp Ulva .laeteviren U. ”. laspp Ulva laspp Ulva lalactuca Ulva Ulva Ulva .lactuca U. . rbc > eu olwd,wt focus a with worldwide, genus species. Vakne l,21) n a and 2018), al., et (Vranken and s eune,addescrip- and sequences, L ,0 eod for records 4,700 . T1sqecs and sequences, ITS1 . Ulva Ulva tuf i.S4 Fig. r ffiilyauthorized officially are .australis U. nmn ein r all are regions many in eune,and sequences, A aiu likelihood Maximum species. species. i.S1 Fig. Ulva .Ti poses This ). pce which species ,t more to ), .lactuca U. Maximum Maximum i.S3 Fig. S2 Fig. . ieS1 File Ulva . . Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. soito o aieSine,DnSaeadCtWlig(aieBooia raiain,Tmvan Tim Organisation), (Scottish Biological Reed (Marine Sarah Wilding and Cat Brunner and Lars Smale Galway), (NUI Dan Bermejo Science), Ricardo Marine thank for to Association like would authors The interest numbers. Acknowledgments of accession conflict NCBI no of declare list authors numbers. The a accession from NCBI names of specimen interests list retrieve Competing a to from script coordinates Python GPS S3: study. retrieve File this to in script used Python software S2: and File scripts of of records List NCBI S1: and alignment. File OBIS ITS1 of of Comparison trees S4: Bayesian Fig. and ML of Complete trees S3: Bayesian Fig. and ML of Complete trees S2: Bayesian Fig. and ML foliose Complete main eleven S1: the Fig. to coordinates. GPS belonging vouchers and NCBI attribution study. of name this List in S3: used Table of primers vouchers of NCBI List and S2: coordinates Table GPS study. species, this samples, in of lected List S1: Table Data Supplementary as +Annotated ohiohilulu Ulva expansa Ulva arasakii Ulva fenestrata Ulva lactuca NCBI Ulva gigantea accession Ulva ITS NCBI pseudorotundata Ulva ohnoi Ulva rigida Ulva laetevirens Ulva australis Ulva Species foliose for sequences study. reference species. this Proposed foliose in each 2: used for Table synonyms available and sequences Names specimens km NCBI 1: ˜12,100 Table unique a of within Number entries specimen The 6: of species. Fig. number foliose localised the distromatic bottom: to specimens. latitudes, large unique proportional narrow 1336 is eleven in contains circles present the the species of of middle: size distribution species, distributed species widely Worldwide Top: 5: Fig. .fasciata U. AN T396Sadn ta,2016 al, et Spalding 2019 al, 2018 et al, Kang et KT932996 2003, Hughey al, 2019 et al, Shimada et Hughey 2018 2020, al, al, et et Miladi Fort 2019, al, et Hughey 2020 2010, al, al, et 2019 et Fort al, Kraft 2020 et al, Kang et 2004; Fort al, et Shimada MH731007 2020 al, et MK992126 Fort 2020 al, et Fort MK456404 2020 NA al, et Fort + MF172082 MT160676 MT160719 MK992234 MH746437 AB097621 MT160696 MK456393 MT160697 MT160674 MK456395 MT160566 NA MT160609 MH730161 AB116037 AB097650 MT160586 MT894736 MT160587 + EU933990 MT160564 MT894472 MT894650 AB116031 MT894503 MT894611 MT894708 10 tuf rbc alignment. L alignment. A Ulva rbc .lactuca U. ceso NCBI accession L species . Ulva tuf pce,proposed species, Ulva ceso Reference accession A 2 ais h map The radius. Ulva tan col- strains species. Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. in . eRves . ot .(98.Uv roiaas.nv(lae,Clrpya rmtecoasts the from ) nov.(Ulvales, sp. heuristics identification. armoricana new Ulva Morphological (1998). models, I. G. (France). more Coat, Brittany 2: & of jModelTest B., Reviers, (2012). De D. P., Dion, Posada, & R., computing. and Doallo, parallel L., locality and G. type Taboada, Taxonomic the (2018). D., from M. Darriba, Hiraoka, specimens . on . based . tides. K., Chlorophyta) green Ichihara, (Ulvophyceae, Sea S., Yellow prolifera Shimada, Y., Ulva Takano, of W., Zhu, reassessment P., A. Monotilla, J., Go¨er,Cui, Loiseaux-De & Y., Berger-Perrot, analysis. J.-M., sequence Fontaine, ITS B., Reviers, De (1998). M.-C., S. Noailles, P., Dion, G., Coat, alignment analyses. automated phylogenetic for algae large-scale tool tide a in green trimAl: trimming of (2009). genome Gabald´on, T. chloroplast of & Complete M., genomes Silla-Mart´ınez, (2017). Capella-Guti´errez, J. S., mitochondrial B. Jiao, complete & The P., (2018). He, L., B. flexuosa Zhou, Jiao, L., Wang, & C., P., Cai, He, analysis. L., SWOT Zhou, a algae T., tide Africa: Jiang, green South L., in Wang, feed C., abalone Cai, for Growing crop Amino (2009). commercial 21 L. (2017). Phycology, a Kandjengo, E. Applied from as & Lock, systems species D., . integrated Shuuluka, . in D., 21 . Robertson-Andersson, of R., J., Waagbø, factors Bolton, conversion N., Liland, nitrogen-to-protein S., and waters. Heesch, content, Norwegian C., protein Bruckner, composition, M., acid (2019). Espe, B. I., Charrier, Biancarosa, . (1998). H. . I. Sea. Tan, . & C., C., Rebours, Battelli, . at B., of Jacquemin, aquaculture S., sustainable France NCBI Roscoff, a Holdt, FA1406. for R., the guidelines Araujo, European technical B., PEGASUS-PHYCOMORPH and in Charrier, manuscript. administrative the M., provided reviewed available Barbier, exper- PP authors All the and openly manuscript. performed KC the AF results; wrote References material; the are RS biological and analysed MM provided RS study MDG, authors and AF, all MDG support; this experiments; MM, the AF, designed iments; of RS and findings AF the Contributions. Authors support MT895108. MT894471- that numbers reference https://www.ncbi.nlm.nih.gov/, data The Periphery Northern the Statement in Availability Producers Data the Pro- Seaweed by Arctic for and funded Innovations Periphery was - exploitation Northern project; SW-GROW work Union diversity Area 366, This European GENetic number the study. - (project and this 19/FFP/6841) gramme GenialG number in 727892, grant used Coasts, ID biorefinery, strains Pristine (project the Macro-ALGal programme Innovative of 2020 He- some for (SeaGrown), Horizon providing Crookes Union for Wave and +), European Company) (Alga Seaweed Abreu Cornish lena (The Warwick-Evans Caroline and Berkel ae rsne tteAnls nasfrIta n eierna Studies. Mediterranean and Istran for Annals Annales: the at presented Paper Uvpyee hooht)wt oprtv analysis. comparative with Chlorophyta) (Ulvophyceae, laarmoricana Ulva http://sw-grow.interreg-npa.eu/ laflexuosa Ulva ora fApidPyooy 29 Phycology, Applied of Journal 5,575-583. (5), uoenJunlo hclg,33 Phycology, of Journal European hclga 57 Phycologia, aueMtos 9 Methods, Nature . Uvls hooht)fo h osso rtay(rne.I.NcerrDNA Nuclear II. (France). Brittany of coasts the from Chlorophyta) (Ulvales, Uvpyee Chlorophyta). (Ulvophyceae, lasadnvc ldn,Clrpya:anwseisfrteAdriatic the for species new a Bliding,(Chlorophyta): scandinavica Ulva 6,692-704. (6), http://genialgproject.eu/ 8,772-772. (8), ) iifrais 25 Bioinformatics, 11 2,1001-1009. (2), uoenJunlo hclg,33 Phycology, of Journal European 1,81-86. (1), osrainGnt eor,10 Resour., Genet. Conservation lSoe 12 one, PloS ,SIFotesfrteFtr (Project Future the for Frontiers SFI ), 1) 1972-1973. (15), (9). Ulva 1,73-80. (1), (Chlorophyta) 3,415-418. (3), OTaction COST ora of Journal Ulva Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. ag .H,Jn,J . i,J . yo,S . i,S,Co,S . e,H .(09.Species (2019). J. H. Lee, . on based . Korea . Island, K., Jeju S. of Choi, coast S., the Kim, along Y., Ulva S. tide-forming genus Byeon, green the H., analysis. of of phylogenetic J. Re-evaluation molecular distribution Kim, (2014). and E., diversity, Y. J. composition, K. Jang, Kim, H., & J. H.-G., Kang, Choi, Sea. K., Yellow Kim, the in J.-H., species Kim, forming J., tide green E. a Kang, of as analysis identity Mitogenome 3 its (2018). B, W. confirms P. Part W. USA Gabrielson, P. & , Gabrielson, A., & from K. Miller, H., R., S. J. Hughey, Shabaka, A., synonymies. K. new and Miller, origins, 503-508. C., Linnaean unexpected (3), misapplications, Jarvis, the name F., of reveals Mineur, specimens analysis A., Genetic C. Horimoto (2019). Maggs, meridionalis Ulva R., to J. Chlorophyta) reference Hughey, (Ulvophyceae, special Ulva with nov. Enteromorpha-like of Japan, sp. Island, diversity indigenous (2011). Shimada, Ishigaki Cryptic K. River, et and Todoroki Ichihara, introduced the (2010). & in USA): C. Y., growing (Atlantic A. Masakiyo, System R., Mathieson, Horimoto, & Estuarine D., Bay C. Great Neefus, species. the distromatic C., in J. Chlorophyta) Nettleton, alga, (Ulvales, green-tide-forming C., new L. Japan. A from Hofmann, (2004). Ulvophyceae) M. nov.(Ulvales, Masuda, sp. & Shimada M., et (2009). Uenosono, Hiraoka A. S., W. Shimada, Nelson, M., & Hiraoka, L., J. Dalen, J., 44 T. Phycology, Farr, of F., K. Neill, S., and E. J. of Broom, study S., systematic Heesch, molecular A (2004). Ulvophyceae) R. (Ulvales, Pacific. J. Ulvaceae Waaland, northeast the & of S., H. systematics Hayden, Phylogenetic sequences. (2002). DNA nuclear R. and J. chloroplast Waaland, using & S., H. Ireland, of Hayden, University National publication, electronic World-wide AlgaeBase. 2020. (2020). Galway. Generalized G. Guiry, the & and M., Guiry, data sets. single-locus data simulated using on evaluation species and Delimiting method 62 revised (2013). a rare approach: Foliose G. Coalescent Yule the (2020). T. Mixed R. and Barraclough, Sulpice, variation, & & T., genetic wild. no- Fujisawa, D., the intra-specific M. in low Guiry, hybrids effective diversity, B., inter-specific genetic of Usadel, inter-specific P., particularly occurrence Potin, considerable (2019). K., show R. a Cascella, Sulpice, species M., . McHale, beads, . A., . Fort, among F., Magnetic Lopez, metabolism macroalgae. M., and McHale, (2018). patterns A., from growth 180 A. diurnal Esteves-Ferreira, R. in DNA M., variations Allaire, Extensive Sulpice, NGS-ready M., Lebrault, A., ma- & of Fort, green extraction of D., assessment for barcoding M. doi:https://doi.org/10.1016/j.algal.2018.04.015 DNA method Guiry, (2014). G. vel A., Bi, & Fort, H., China. Xue, , S., around Guo, zone doi:10.1007/s11802-014-2197-1 Y., coastal Mao, in F., croalgae Wu, G., Du, 5,707-724. (5), Gemina 1,109-123. (1), 2,1302-1303. (2), eei uvyo e eln aarvasdvriyaditoue species. introduced and diversity reveals taxa Zealand New of survey genetic : R:ht:/w.ageae org algaebase. http://www. URL: 2,1314 doi:10.1080/09670260802422477 143-154. (2), hclga 43 Phycologia, uoenJunlo hclg,45 Phycology, of Journal European ul al u.Nt c.Sr o,37 Bot, B Ser. Sci. Nat. Mus. Natl. Bull. la,29 Algae, 4,3432 doi:10.2216/i0031-8884-43-4-364.1 364-382. (4), lSoe 14 one, PloS 4,267-277. (4), lalactuca Ulva ora fPyooy n/a Phycology, of Journal 7,e0219958. (7), ora fPyooy 38 Phycology, of Journal laexpansa Ulva . ora fOenUiest fCia 13 China, of University Ocean of Journal 12 3,2029 doi:10.1080/09670261003746201 230-239. (3), Uvls hooht)hltp n eae type related and holotype Chlorophyta) (Ulvales, Uvca,Chlorophyta). (Ulvaceae, hclgclRsac,52 Research, Phycological 155-167. , 6,1200-1212. (6), laspp. Ulva Ulva na.doi:10.1111/jpy.13079 (n/a). la eerh 32 Research, Algal Uvca,Uvls rmthe from Ulvales) (Ulvaceae, ora fPyooy 55 Phycology, of Journal strains. iohnra DNA Mitochondrial ytmtcBiology, Systematic uoenJournal European Ulva ln physiology, Plant 1,17-29. (1), , 1,97-103. (1), laohnoi Ulva 308-313. , Umbraulva Ulva Ulva Ulva Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. iai . agii . iiat,S . eoee . bekfi . oaio .(08.ADNA A (2018). M. Morabito, & S., Abdelkafi, G., Genovese, A., ohnoi S. of Minicante, survey A., barcoding USA. Manghisi, , nutrient a Atlantic R., Chlorophyta), and and (Ulvophyceae, Miladi, identification 2004 of Molecular Shimada, Gulf S. the & in (2016). Hiraoka species M. M. 11 nonnative ohnoi likely J. Ulva and species Lopez-Bautista, report tide new & (Ulvophyceae, green L., the laetevirens of Ulva Collado-Vides, analysis of T., appearance J. The Melton, America. (2014). of States C. United Yarish, the & of 13 R., coast northeast Wilson, the K., in Chlorophyta) J. Kim, Y., Mao, comparison. ecological species and Netherlands: southwest Morphology, molecular the morphological, in 34 Ulva (2008). Free-floating A D. (1999). P. morphotypes? M. Kamermans, or Guiry, & S., & Draisma, E.-J., B., Malta, D. Stengel, sequence F., The 47 Rindi, (2009). Phycologia, distromatic M., R. of Durbin, L. distribution and . McIvor, phylogeny . J., . C. N., Loughnane, Homer, J., Ruan, SAMtools. T., and Fennell, & format A., T., alignment/map Wysoker, Oliveira, B., Handsaker, De data. H., M., big Li, Felipe, of era Davila Research the D., in (2009). Correia, bootstrap doi:10.1038/s41586-018-0043-0 O. phylogenetic E., Clerck, Felsenstein’s 452-456. De Wilkinson, Renewing . B., (2018). O. J. . Gascuel, Entfellner, . Domelevo F., Mineur, F., bloom. H., Lemoine, algal A. Qingdao Engelen, j., the E. of Malta, identity N., note: Ye, De X., & Zhang, C., F., Leliaert, G. Zuccarello, M., algae. J. in Lopez-Bautista, delimitation of doi:10.1080/09670262.2014.904524 species F., DNA-based Steen, (2014). P., O. Vanormelingen, Clerck, (2019). H., Verbruggen, S. F., M. Leliaert, Kim, & species, C., doi:10.4490/algae.2019.34.12.9 new from needs J. discovery waters with products Kang, Natural waste Korea W., (2016). of R. H. bioremediation Calado, & Algal Lee, W., J. information. (2013). Blunt, geographic H., A. and M. taxonomic N. Munro, improved A., Paul, Hilario, & C., M. Native, R., Leal, Nys, (2018). A. de C. using L., Maggs, aquaculture Mata, land-based . J., . R. . Lawton, N., Carmel, A., 19 Israel, Science, E., Marine Lewinsohn, cryptogenic J., and Douek, invasive G., Paz, N., Krupnik, species cryptic scalable 46 endemic Australian Phycology, and southern fast, into cosmopolitanism of Investigations a both Journal indicate (2010). RAxML-NG: phylogeny F. molecular R. Waller, (2019). and & taxonomy A. T., Chlorophyta) G. Stamatakis, Kraft, inference. & G., L. phylogenetic B., Kraft, likelihood Morel, maximum alignment, T., for sequence Flouri, tool multiple D., user-friendly service: and Darriba, online M., MAFFT A. Kozlov, (2019). D. visualization. K. and Yamada, choice & sequence interactive J., Rozewicki, K., Katoh, 3,225-237. (3), 865-870. (5), 443-454. (5), . rpoai,Algologie Cryptogamie, 4,4649 doi:10.2216/PH07-61.1 416-429. (4), 1,132-146. (1), Ulva Clrpya nTnsaadIayrvastepeec fteoeloe alien overlooked the of presence the reveals and in (Chlorophyta) Ulva 6,1257-1277. (6), .ped-ho p nov. sp. pseudo-ohnoi U. Ulva pce rmteIreiMdtraenSa ikadpotential. and risk Sea: Mediterranean Israeli the from species . eetn agtseisadstrains. and species target selecting : iifrais 25 Bioinformatics, Ulva hclgclRsac,57 Research, Phycological rensi iifrais 20 Bioinformatics, in Briefings Uvpyee hooht)i rln n otenBritain. southern and Ireland in Chlorophyta) (Ulvophyceae, aua rdc eot,33 Reports, Product Natural 13 Uvls Chlorophyta). (Ulvales, 1) 2078-2079. (16), uoenJunlo hclg,49 Phycology, of Journal European Ulva ora fOenUiest fChina, of University Ocean of Journal 2,147-151. (2), iifrais 35 Bioinformatics, LSOe 8 One, PLoS asn losfo euIsland, Jeju from blooms causing uoenJunlo Phycology, of Journal European 4,1160-1166. (4), 6,747-750. (6), la,34 Algae, 1) e77344. (10), aue 556 Nature, Ulva qai invasions, Aquatic 2) 4453-4455. (21), Mediterranean (Ulvophyceae, 4,253-266. (4), 2,179-196. (2), (7702), rbc U. L Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. ihr,T,Care,B,Mnu,F,Btwl,J . lrk .D,&Cae,J .(05.The introductions. (2015). new (2012). C. and I. species J. Moro, cryptic & Coates, 1510-1521. Italy): C., (6), & (Mediterranean, Andreoli, Sea D., K., Adriatic O. North Sciuto, A., Clerck, M. H., Wolf, J. Bothwell, of F., sequence doi:10.3389/fpls.2015.00072 genome Mineur, chloroplast seaweed B., complete of The green assessment Charrier, risk (2017). T., B. A Jiao, Wichard, (2018). & P., O. He, Clerck, L., De Zhou, linza & C., F., Cai, waters. phylogenetic L., Mineur, European Wang, of F., in seaweed Leliaert, Effects of V., introductions Pena, trade (2014). aquarium S., G. Bosch, T. S., data. Barraclough, Vranken, single-locus & using delimitation D., 5 species Fontaneto, Evolution, of robustness and M., the A. on method Humphreys, (1999). reconstruction J. Q., M. C. Stanhope, of Tang, . morphology . gross . the G., seaweeds, controlling D. green switch 1011-1018. Mann, of morphogenetic A., genera reversible C. common Maggs, a two E., for Leskinen, evidence G., phylogenetic of Hansen, diversity Molecular J., molecular Blomster, H., species: I. Tan, lost and alien doi:10.1080/09670262.2019.1597925 Cryptic, 466-483. (2019). (3), F. data. lato Weinberger, global sensu & mapping Ulva R., for Karez, package R S., new Steinhagen, a rworldmap: (2011). A. of South, macroalgae. evaluation green An P. marine (2010). J. the H. Huelsenbeck, for space. Kucera, markers . model & large . W., a G. . across Saunders, S., choice Hohna, model A., and Darling, inference 61 L., phylogenetic Biology, Bayesian D. Systematic efficient Ayres, P., 3.2: Mark, MrBayes Der (2012). Van photosynthesis M., drives Teslenko, acidification F., M. ocean Ronquist, Roleda, (Chlorophyta). under & dioxide rigida L., Ulva C. carbon Hurd, increased in E., not growth C. and and Cornwall, light S., Saturating advantages Heesch, M., (2015). phylogenetics: Strittmatter, Y. A., in P. Fernandez, averaging R., tests. A. model Rautenberger, ratio Vogler, and likelihood selection over . approaches Model Bayesian 793-808. . and (5), (2004). criterion insects. R. . information T. Akaike undescribed S., Buckley, of of Hazell, & taxonomy P., D., DNA D. Posada, the Duran, for A., (2010). delimitation Cardoso, A. 55 species J., Biology, Sequence-based Stamatakis, Gomez-Zurita, & G., (2006). E., T. P. M. Barraclough, B. J., Pons, Moret, Necessary? P., R. Are O. Replicates doi:10.1089/cmb.2009.0179 Bininda-Emonds, of Bootstrap assessment M., Many Molecular Alipour, How (2010). D., R. N. A. Sherwood, Pattengale, islands. & Hawaiian C., the T. in Examination Shipley, for Chlorophyta) A., evidence (Ulvophyceae, (2018). Kurihara, M. Chlorophyta): J., Hiraoka, (Ulvophyceae, C. & Ulva O’Kelly, B., tropical D. in Largo, barriers Y., isolating speciation. Tanii, ongoing postzygotic M., and Adachi, prezygotic T., of Nishimura, P., A. Monotilla, . osraingntc eore,9 resources, genetics Conservation 4,595-609. (4), Ulva ln h emncat fteNrhadBli Seas. Baltic and North the of coasts German the along 1) 0619.doi:10.1111/2041-210x.12246 1086-1094. (10), ora fPyooy 54 Phycology, of Journal oe ytmt td morphogenesis. study to system model a : 3,539-542. (3), Ulva clg n vlto,5 evolution, and Ecology rpoai,Aglge 31 Algologie, Cryptogamie, 3,463-466. (3), and 4,539-549. (4), Enteromorpha 14 ora fCmuainlBooy 17 Biology, Computational of Journal ora fPyooy 46 Phycology, of Journal rbc Ulva L, ilgclIvsos 20 Invasions, Biological tuf . Clrpya lae)boiest nthe in biodiversity Ulvales) (Chlorophyta, oeua ilg n vlto,16 Evolution, and Biology Molecular ,UA S n T sDAbarcode DNA as ITS and LSU UPA, A, 4,874-888. (4), rnir nPatSine 6 Science, Plant in Frontiers 4,487-528. (4), uoenJunlo hclg,54 Phycology, of Journal European ora,3 Journal, R 4,728-735. (4), ora fPyooy 48 Phycology, of Journal ytmtcBooy 53 Biology, Systematic (1). 5,1171-1187. (5), ehd nEcology in Methods 3,337-354. (3), Systematic laspp. Ulva (72). Ulva (8), Posted on Authorea 24 Nov 2020 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.160622296.66033732/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. i,E,X,R,Zag . u . un,B,Zu . u,J 22) Growth (2020). J. Cui, & of genome mitochondrial W., Complete (2016). Zhu, M. P. He, B., & C., Huang, Cai, H., 1 B-Resources, J. C., Part Zhang, K., Xu, related linza L. Ulva Wang, closely J., J., L. by Zhou, Zhang, produced R., hybrids doi:https://doi.org/10.1016/j.aquaculture.2019.734902 Xu, of characteristics E., Xie, n ftecua pce fgenmcola losi elwSa China. Sea, Yellow in blooms macroalgal green of species causal the of one , Trim xgaps/unknown with> positions pseudorotundata Ulva 30 strains 1) Preparation of alignments Remove