Performance of amplicon and shotgun sequencing for accurate biomass

ANGOR UNIVERSITY estimation in invertebrate community samples Bista, Iliana; Carvalho, Gary; Tang, Min; Walsh, Kerry; Zhou, Xin; Hajibabaei, Mehrdad; Shokralla, Shadi; Seymour, Mathew; Bradley, David ; Liu, Shanlin; Christmas, Martin; Creer, Simon Molecular Ecology Resources

DOI: 10.1111/1755-0998.12888

PRIFYSGOL BANGOR / B

Published: 01/09/2018

Peer reviewed version

Cyswllt i'r cyhoeddiad / Link to publication

Dyfyniad o'r fersiwn a gyhoeddwyd / Citation for published version (APA): Bista, I., Carvalho, G., Tang, M., Walsh, K., Zhou, X., Hajibabaei, M., Shokralla, S., Seymour, M., Bradley, D., Liu, S., Christmas, M., & Creer, S. (2018). Performance of amplicon and shotgun sequencing for accurate biomass estimation in invertebrate community samples. Molecular Ecology Resources, 18(5), 1020-1034. https://doi.org/10.1111/1755-0998.12888

Hawliau Cyffredinol / General rights Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.

• Users may download and print one copy of any publication from the public portal for the purpose of private study or research. • You may not further distribute the material or use it for any profit-making activity or commercial gain • You may freely distribute the URL identifying the publication in the public portal ?

Take down policy If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

04. Oct. 2021 This article isThis article by protected copyright. All rightsreserved. 10.1111/1755doi: differences to lead between the of thisversion citethisart and Version Record.Please copyediting, paginationbeen throughthe andproofreadingtypesetting, process,may which This article 8 7: 6 Guelph, Ontario, N1G Canada 2W1 5 4: ofChina Republic 3 Laboratory 2 1 Simon Creer Shadi Shokralla Iliana Bista Authors in invertebrate estimation P Article :Resource Article type : (Orcid ID 0000 BISTA ILIANA MISS : : : : : : erformance of amplicon of erformance

BGI De Bangor University, SchoolBangor University, of Wellcome Trust Sanger AcceptedNa Mersey, Stockport, APEM LTD,Heaton BiologyDepartment of Integrative & Biodiversity Instituteof Ontario,University of Agency, DeaneryEnvironment Horizon House, Road, BristolBS15AH, UK Article partment of Entomology, tural History Museum oftural HistoryMuseum Denmark, - Shenzhen, Shenzhen,Shenzhen, 518083, China

1 , LL57 2UW

, 2 , has been acceptedhas been forpublication andundergone fullpeer review buthasnot 2

* , Gary R. Carvalho , GaryR. 5 , Mathew Seymour

- 0998.12888

Institute, Cambridgeshire, Hinxton, CB10 1SA, UK

and China AgriculturalChina University,100193, Beijing People’s Biological Sciences, Molecular Sciences, Biological 2 community samples community - , 0002

Min Tang Min 2

, shotgun sequencing for accurate for shotgun sequencing David Bradley - 6155 Øster VoldgadeØster 5

Cheshire, -

3

3093) , Kerry Walsh , Kerry

6 SK4 3GN , Shanlin Liu

4 - , 7, 1350KøbenhavnK X

in Zhou Ecology and Genetics Fisheries 7 , 8 , , MartinChristmas 3 , Mehrdad Hajibabaei , Mehrdad

biomass biomass

4 icle as , 5 , This article isThis article by protected copyright. All rightsreserved. between correlations significant highly provided results sequencing Shotgun bias. primer was metabarcoding species, consistent abundance low very recover to failed occasionally (mitogenome sequencing Additionally, (3 metabarcoding COI macroinvertebrates of set ( genome different to metabarcoding of ability the bias, primer to Due efficiency. and discovery Abstract Simon Creer, * C Running title: biodiversity Keywords: Iliana Bista, o

Accepted Articlerrespon New applications of DNA and DNA of applications New taxa from taxa mito

documented , ding author

, to provide reference sequences reference provide to ,

y aln t rcvr oe pce wt highe with species some recover to failing by eaacdn, eaeois biomass, metagenomics, metabarcoding, [email protected] [email protected] - e sd individual used we genome skimming) Shotgun vs. ecological monitoring, yet questions remain regarding precision and and precision regarding remain questions yet monitoring, ecological

bulk of estimated ind estimated of ) *

sequencing sequencing

c ok omnte cmrsn 1 seis f freshwater of species 13 comprising communities mock ommunities amplicon invertebrate amplicon communities different respectively.

may provide a provide may RNA sequencing are are sequencing RNA

O broig and barcoding COI ividual biomass, ividual amplicons) remains We found that even though both methods methods both though even that found We for OTU assignment and meta and assignment OTU for

unclear, and more reliable alternative. Here we used a a used we Here alternative. reliable more

to compare the compare to

htu mtgnm sequencing. mitogenome shotgun while PCR while

expanding the field of field the expanding genome de abundance r

novo accurately accurately - - skimming, free whole free

mito detection efficiency efficiency detection s chondrial , probably due to to due probably , depict genome invertebrate mitochondrial mitochondrial

biodiversity biomass mapping genome genome less less

of of of s , This article isThis article by protected copyright. All rightsreserved. Acceptedrequire WFD the as such Initiatives directives ecosystem wider variation temporal and spatial both the biomonitoring, specimens immature Moreover, approaches and loss biodiversity rapid of face the in especially relationships, function ecosystem andbiodiversity understand to order ArticleIntroduction diversity metabarcoding studies, ecological communities macroinvertebrate bulk from Overall, read the improved approach) amplicon three all of results the Combining species. 658bp) FFFR, (amplicon relationships from obtained read the Conversely, species. one but all in biomass and number read

The accurate qualitative and quantitative assessment of biodiversity is essential in in essential is biodiversity of assessment quantitative and qualitative accurate The scale scale mito

assessment. sc a te EU the as such ,

accurate f csse dgaain is degradation ecosystem of genomic sequencing sequencing genomic ht yial rqie high require typically that for for

eaacdn varied metabarcoding Caso, 1990) (Cranston, 8

robust species identification species u o 1 (mlcn B1FR (amplicons 13 of out

Jcsn t l, 04 Seny Battle, Sweeney, 2014; al., et (Jackson (Loreau & de Mazancourt, 2013) Mazancourt, de & (Loreau quantification of community composition enables detection of of detection enables composition community of quantification

ae Faeok ietv (WFD) Directive Framework Water yielded more informative more yielded currently currently . ecological status classification classification status ecological s o xeie imntrn faeok, i frameworks, biomonitoring expedite To

n h booia community biological the in across

- is

biomass ees of levels

iie b te s o taiinl taxonomic traditional of use the by limited remains the remains frequentl than ampl

- metabarcoding. 5b, FF130R 450bp,

icons. correlations for some of the species. species. the of some for correlations

y expertise

not possible not

most commonly used commonly most

pcfcly w found we Specifically, . However, However, . predictions of biomass content biomass of predictions

ako, Dpe, 2011) Dapkey, & Jackson,

and

of water bodies water of -

However 3b) or 130bp) ae en established. been have ,

in case in

n b etnin the extension, by and - COI r labour are biomass relationships relationships biomass evaluating

mlcn (multi amplicons s fr ag scale large for ,

of damaged or or damaged of

4 approach for approach nternational nternational

out of 13 13 of out

- significant significant the which intensive

speed speed

. are are In - .

This article isThis article by protected copyright. All rightsreserved. Accepted2012) al., et Yu 2012; Konynenburg, taxonomic it steps, PCR and RbcL 2003) Waard, de & mitochondrial samples community as to genes marker more or one of sequencing amplicon 2014; al.,2018) et Seymour Altermatt, & Steinmann, Deiner, Mächler, 2008; Taberlet, & Pompanon, Miaud, Ficetola, (eDNA) DNA environmental aqueous from biodiversity of detection (Mala Articlesampling invertebrate litter leaf or soil from used, been have taxa al., et Pfrender 2011; Baird, & ecosystem studies recovered increasi by biomonitoring &Ohandja, Hoare, Voulvoulis, 2012) underpinned T Most

e det f ih hogpt eunig ehoois HS i revolutionising is (HTS) technologies sequencing throughput high of advent he eaacdn ad s omny ple fr h aayi of analysis the for applied commonly is and metabarcoding include various invertebrate taxa, various invertebrate include ITS2 studies using HTS for HTS using studies s (Baird & Hajibabaei, 2012) Hajibabaei, & (Baird represent

(e.g. by

for plants for Cytochrome s considered is

the taxonomic identification of organisms from routine monitoring routine from organisms of identification taxonomic the Gibson et al., 2014; al., et Gibson

and and

(Creer et al., 2016; Yu et al., 2012) al., et Yu 2016;al., et (Creer to of ation ribosomal RNA regions RNA ribosomal

(Fahner, Shokralla, Baird, & Hajibabaei, 2016) Hajibabaei, & Baird, Shokralla, (Fahner, g h truhu ad aooi ifrain ht a be can that information taxonomic and throughput the ng C .

ise traps) ise

Oxidase Subunit I (COI) (COI) I Subunit Oxidase

diversity diversity that 2010; Shokralla et al., 2015) al., et Shokralla 2010; bio diversity assessment of assessment diversity metabarcoding . . In fact, it has been has it fact, In . Gibson et al., 2015; Hajibabaei, Shokralla, Zhou, Singer, Singer, Zhou, Shokralla, Hajibabaei, 2015; al., et Gibson . The most commonly used taxonomic groups for for groups taxonomic used commonly most The .

(Ji et al., 2013) al., et (Ji in such as benthic macroinvertebrates for freshwater macroinvertebratesfor asbenthic such uk samples bulk such as such Yn e a. 2014) al., et (Yang

. rdcs bias produces barcod Amplicon sequencing Amplicon . More recent work recent More . .

16S C

Hjbbe, S (Hajibabaei,

ommonly used markers include the include markers used ommonly reported . Similarly, terrestrial invertebrate terrestrial Similarly, .

e bulk (Epp et al., 2012) al., et (Epp

region

samples to date utilise date to samples

that PCR bias might alter alter might bias PCR that o fo aoe ground above from or , n eain to relation in

(Hebert, Ratn (Hebert, pall, Shokralla, & van van & Shokralla, pall, .

Due t Due bulk/environmental (Bista et al., 2017; 2017; al., et (Bista encompasses

is

o intermediate o f often or or asingham,

accurate

referr (Collins, (Collins, such such PCR , or , the the ed

This article isThis article by protected copyright. All rightsreserved. Acceptedsamples. bulk in relationship reads to biomass bias 2013) approach simultaneously species multiple of reconstruction phylogenetic assessment ecological for samples bulk of characterisation (Crampton sequencing Illumina throughput to promoteaccessibility and eventualuptakeof technologies. such data independent of comparability improve to only not possible, Article2014) bioma and recovery richness estimations species improve to suggested been has region same the from amplicons multiple of combination and pairs, primer multiple of 2014) use pipelines, al., et Kelly 2014; al., et Hiiesalu 2017; Leese, reported been also have biomass and number 2015) Leese, & Elbrecht 2014; mis introduce species the

Seiial, t s ugse ta mtgnmc eunig results sequencing mitogenomic that suggested is it Specifically, . species The more recently introduced m introduced recently more The , .

advocat

Such Such (Piñol, Mir, Gomez Mir, (Piñol,

has been suggested been has - lt, u Zo, Vge, 2016) Vogler, & Zhou, Yu, Platt,

biomass ratio ratio biomass ( Gibson et al., 2014; Hajibabaei et al., 2012; Zhan, Bailey, Heath, & Macisaac, & Heath, Bailey, Zhan, 2012; al., et Hajibabaei 2014; al., et Gibson mrvmns to improvements - ing ersnain f atclr groups particular of representation The efficiency of this approach could be related to the assembly method method assembly the to related be could approach this of efficiency The

that the absence of absence the that - or Polo, & Agustí, 2015) Agustí, & Polo,

s produce inaccurate representation of of representation inaccurate produce

and as an alternative to PCR to alternative an as . methodological aspects aspects methodological On On

f hl mtcodil genome mitochondrial whole of possibly h ohr hand other the itochondrial metagenomics (mitogenomics) use (mitogenomics) metagenomics itochondrial initial more reliable representation reliable more

PCR PCR , where ,

amplification

. Ebeh, ao, esnr Aoit, & Aroviita, Meissner, Vamos, (Elbrecht,

( , Clarke, Soubrier, Weyrich, & Cooper, Cooper, & Weyrich, Soubrier, Clarke, oe c Some

correlations between between correlations -

primer based metabarcoding based . r ciia to critical are Optimisation of metabarcoding metabarcoding of Optimisation urrent applications involve involve applications urrent - step will step

template mismatches template

Tn e a. 2015) al., et (Tang sets, but importantly but sets, Glet t l, 2014) al., et (Gillett

s relative relative

rm uk samples bulk from reduce PCR related related PCR reduce tnads where standardise

of species of n oe accurate more in amplicon abundance of of abundance

(Zhou et al., al., et (Zhou

biomass s , might . This This . ,

read read high high also and ss

This article isThis article by protected copyright. All rightsreserved. future compr a provide to aim method performance. their evaluate to considered, e shotgun and MiSeq, Illumina were extracts DNA community (e.g. work previous densities natural of P depending from biomass, of gradients comprise species. target per individuals taxo various including freshwater their for measured individually 13 comprising samples. extracted bulk in species macroinvertebrate investigate mappingCrampton for available are sequences reference whether and used, prmna workflow xperimental Acceptedotamopyrgus Article Here, utility was - Platt, Timmermans, Baselga, &Vogler,2015) we provid we

on the species the on h aiiy of ability the

biomonitoring sample collected for environmental assessment purposes, purposes, assessment environmental for collected sample biomonitoring for ecological applications in biodiversity and f and biodiversity applicationsin ecological for assessed while providing suggestions for future improvements. Ultimately, we Ultimately, improvements. future for suggestions providing while assessed

antipodarum co mmon macroinvertebrate species, species, macroinvertebrate mmon . DNA . e Crampton

nomic orders, while accounting for availability of large numbers of of numbers large of availability for accounting while orders, nomic a direct comparison between metabarcoding and mitogenomics and metabarcoding between comparison direct a ). ehensive evaluation of the two methods and troubleshoot their their troubleshoot and methods two the of evaluation ehensive

was (e.g each method to depict the richness and biomass presence of of presence biomass and richness the depict to method each Possible limitations and advantages of each method were were method each of advantages and limitations Possible , and 1 to 17 specimens for specimens 17 to 1 and , . eune on sequenced extracted from extracted

-

amplicon sequenced for three fragments three for sequenced amplicon step increase of specimens per community from 1 to 33 for for 33 to 1 from community per specimens of increase step lt e al., et Platt diinly te elct communities replicate the Additionally, biomass. as low low

The samples were samples The

as a as ( Furthermore, the overall applicability of each each of applicability overall the Furthermore, 2015 Illumina pooled single specimen specimen single ) Gómez ;

Ten whole bodies whole Hi . R

Seq adix

for which all specimens were were specimens all which for artificial communities artificial reshwater ecosystemmonitoring reshwater -

selected Rod

( balthica see see íuz t al., et ríguez up iue 1 Figure of

) to to to represent a typical a represent to ,

invertebrates to

multiple specimens multiple (Gómez of the COI gene gene COI the of were represent a range a represent

for

( designed to to designed overview of of overview 2015)

- were used were Rodríguez, , unlike , . The ). ,

and on on , .

This article isThis article by protected copyright. All rightsreserved. preservatio during al.,1999)(Benke et mass dry of loss to due inaccurate be might specimens preserved speed and precision increased to due specimens, of weighing or biovolume of determination as such 2016)Dolman, produc could possible, wherever type ecosystem and region geographic account measurements the for accounting USA Rockville, species E marinus, m S ( Coleoptera and Ephemeroptera Amphipoda, Isopoda, Hemiptera, from: each of one extraction environment cool Ltd APEM ( Specimen and Materials September September 1, Figure Acceptedethodologies Article pcmn fr hs ok w work this for Specimens oy esrmns ee efre bsd n ulse work published on based performed were measurements Body

( e.g.

, U.K. , . S collection and phemera

In total, 13 species were used for analysis, including 8 species of and Gastropoda of species 8 including analysis, for used were species 13 total, In 1 – Bne Hrn Sok & alc, 1999) Wallace, & Smock, Huryn, (Benke, ) , e

October 2014 October which were selectedwhich were to biomonitoring represent atypical sample . ) : P )

were used were intraspecific Conversion of length to mass is considered superior to other methodologies to other superior considered is to mass length of Conversion . otamopyrgus c a used was Methods S allipers for larger larger for allipers pecimens were preserved in absolute ethanol, ethanol, absolute in preserved were pecimens . curvature of specimens of curvature

o u t 6 months 6 to up for danica morphological measurements morphological f ) or estimation of biomass for each species each for biomass of estimation or f

) r h amphipod the or and , and were identified to species level species to identified were and , variation

antipodarum

a microscope fitted with an ocular micrometre ocular an with fitted microscope a r cletd rm ra o Smre ad Suffolk and Somerset of areas from collected ere species s

n eeomn rates development in .

until morphological measurements and DNA DNA and measurements morphological until ) P ( . N ublished regressions ublished otonecta S G fwr Iae Pro Image oftware ammarus

. lua A glauca,

Additionally, direct weighing of of weighing direct Additionally, pulex

(Mährlein, Pätzig, Brauns, & & Brauns, Pätzig, (Mährlein, , to facilitate facilitate to ,

( and stored in a in stored and sellus based on length to mass mass to length on based by county surveyors county by

( as Table ( ei Cybernetics Media sn te following the using

qaiu, G aquaticus, these parameters parameters these

S 2 . ) accuracy , taking into into taking ,

for dark

smaller yrinus Table , UK UK ,

and and and

by n , ,

This article isThis article by protected copyright. All rightsreserved. extract community each extracted separately was which asses to community, communities controls positive ( individuals, only below) see species, communitiesDesign ofmock 2007) Corporation, (CodonCode v.3.7.1 Aligner Massachusetts) CodonCode using edited were sequences barcoding Ephemeroptera regarding successful not was sequencing barcode which for (Ephemeroptera), types protocol extraction with performed primers metazoan universal LibrarDNA barcode Reference AcceptedTable Article I Ten 6 ( 6 ndividual specimens were extracted and sequenced for the COI the for sequenced and extracted were specimens ndividual . .

High 1 N. glauca N. , omnte wr cetd otiig ihr 3 r 14 or 13 either containing created were communities Table

each ult broe wr otie fo al pce ( species all from obtained were barcodes quality . is, he seies of specimens three First,

n aind with aligned and 2 9 or )

DN were used were t gain o boas pres biomass of gradient a at ) Te t The . , except , e ( s s Bod Tsu kit Tissue & Blood asy gastropods

extraction efficiency. Second, DNA Second, efficiency. extraction ( P. fontinalis P. tl u of sum otal at 1% concentration concentration 1% at

for (Folmer, Black, Hoeh, Lutz, & Vrijenhoek, 1994) Vrijenhoek, & Lutz, Hoeh, Black, (Folmer, to

y two species, where low numbers allowed representation in in representation allowed numbers low where species, two

assess the quality of sequencing performance across mock across performance sequencing of quality the assess , u t best to due ), (previously assembled by assembled (previously lsaW n MEGA in ClustalW )

of the communities. the of

specimens see D

rosophila (QIAGEN) (QIAGEN) oiir e al et Morinière utblt o protocol of suitability ence (at (at cos 10 across BGI

rm snl t mlil specimens multiple to single a from v5 melanogaster (

-

arthropods Specimens ranged from ranged Specimens Shenz

of the of (Tamura, Dudley, Nei, & Kumar, Kumar, & Nei, Dudley, (Tamura, Tang communities communities . h

Table n China) en, bu et al. et 2017) species (including control control (including species t , and ), terfly

ee de i each in added were

S s barcod

Sne generated Sanger . 2014)

1), (for similar findings findings similar (for for different tissue tissue different for

Mycalesis mineus Mycalesis

CTAB chloroform chloroform CTAB , except except . Extraction was was Extraction . was to account for for account to ,

e was

region using region 1479 13

E. danica danica E. added in added 9

to 15 to .

T wo wo 7 ,

This article isThis article by protected copyright. All rightsreserved. samples. between species of proportions biomass the skewing avoid to order in method enrichment yield 5min. for 2500g at centrifuged 4ml) volume (total (Sigma speed medium at shaker a in 65°C (MO then 37 bulk in sequencing. 2 of amount minimum A electrophoresis. gel agarose (Roche) Qiagen qu and mitogenomes DNA extractionforreference toselected due existing reference mitogenomeinformation. in variability o Accepted Articlefrom extracted was DNA genomic ality fr hours. 2 for C -

For the construction of individual shotgun reference genomes for each species, high high species, each for genomes reference shotgun individual of construction the For For the mock communities, DNA was extracted from extracted was DNA communities, mock the For

BIO) and vortexed at high speed for 5min. for speed high at vortexed and BIO) transferred into 50ml Power Bead tubes from the Power Max Soil DNA Isolation Kit Kit Isolation DNA Soil Max Power the from tubes Bead Power 50ml into transferred ( iue S2 Figure . -

Aldrich) ] First, ethanol ethanol First, DN

and e

asy

DNA subsampling before before subsampling DNA DNA . , usqet tp followed steps Subsequent lo ad ise xrcin kit extraction Tissue and Blood Table h did pcmn wr gon uig trl mra ad ete sets, pestle and mortar sterile using ground were specimens dried The concentration concentration te oun wr incubate were columns the , preservative S 3 ).

For the purposes of this work, we did not use any mtDNA mtDNA any use not did we work, this of purposes the For The elution The and was , with the addition of addition the with , ult ws sesd ih dsQubit with assessed was quality removed htu sequencing. shotgun leg or muscle tissue muscle or leg step was repeated a second time to time second a repeated was step manufacturer’s Lysis was per was Lysis

and mock

d [ f

inal elution elution inal f then r 30min or .

5µg total 5µg communities

the entire the specimens 450µl of Proteinase K Proteinase of 450µl

formed by incubation for 3h at at 3h for incubation by formed The of instructions a single specimen using the the using specimen single a at room temperature temperature room at DNA was DNA

in two

bodies of i of bodies 50µl PCR Grade water water Grade PCR 50µl were allowed to dry at dry to allowed were

control Fr f For .

used for shotgun shotgun for used DNA

nvertebrate pce were species nl elution inal assays ( 20mg/ml maximise and and s ) ,

This article isThis article by protected copyright. All rightsreserved. Acceptedbleaching included Protocols DNA. community of l appropriate Appendix in available are protocols PCR Detailed protocol Illumina indexes Illumina with two round from amplicons target COI 2016) Port, & Lowell, community each of efficiency step, indexing extracts. community Articlebarcodes barcode aligned with comparison communities from (450bp) primer 1994) al., et (Folmer barcod Metabarcoding primer, Metabarcoding was performed performed was Metabarcoding

esir t al. et Meusnier

specific primers with added Illumina tails Illumina added with primers specific

e (130bp)

( region

ie uig ie V raet (0 cce) olwn the following cycles) (500 reagents V2 MiSeq using MiSeq I , (PE) -

n pies I primers and B1 B1 and performed PCRsetupswere inadedicated clean cabinet.

Round 2: Round ( ] aboratory at , primer primer

to minimise the effects of variant index sequences on the amplification amplification the on sequences index variant of effects the minimise to ; using the following the using Figure Figure -

and Primer selection selection Primer h Boiest Isiue f Ontario, of Institute Biodiversity the

. wa

(658bp);

3 Libraries were prepared using a a using prepared were Libraries purified purified R ( . precautions were taken to establish clean conditions clean establish to taken were precautions 2 2008) ) ound s

- 1 n I and B1 mlcn 1R using B1FR amplicon and modified from from modified

1:

both degenerate, modified for use with macroinvertebrate macroinvertebrate with use for modified degenerate, both

PCR 2. T Bry Mhod, anr & o, 01 O’donn 2011; Loy, & Wagner, Mahfoudh, (Berry,

able able amplification was performed using only the target specific specific target the only using performed was amplification ) and amplicon

amplicon FF130R using using FF130R amplicon -

primer pairs primer 130R product sequences hog s through 3 ) . )

Hajibabaei Hajibabaei ,

These primers were selected based on visual visual on based selected were primers These

and amplification performance tests on mock mock on tests performance amplification and steril from round 1 was 1 round from

: , and a final s final a and , qecn of equencing

I 1 (Appendix I, sequence alignment of COI COI of alignment sequence I, (Appendix library preparation zto o eupet y uolvn or autoclaving by equipment of ization - . B1 forward primer primer forward B1 ) . amplicon FFFR using the using FFFR amplicon II et al. et . h smls were samples The

hogot h lbay preparation, library the Throughout

two

( Folmer forward primer primer forward Folmer nvriy of University 2012) tep was used for indexing of all all of indexing for used was tep - step three three used as template using the the using template as used , and and ,

C protocol PCR

amplicons and I -

( 130R

2x sequenced on an an on sequenced ulh (Canada) Guelph

250)

omr reverse Folmer Folmer primers Folmer for the analysis analysis the for

wa

of the COI COI the of

paired s modified modified s with final final with l, Kelly, ell, –

I - 130R - end end .

This article isThis article by protected copyright. All rightsreserved. (Canada). Guelph BIO at run was sequencing MiSeq and (UK), Bangor at performed was analysis data Neighbo MEGA6 in insertions systems. Wales (HPC) including NCBI downloaded BLAST+ ( community a as used sequences) abundant most the through selected are case USEARCH in table OTU an of generation and clustering with removed 2x250 Illumina of limitations current the to due forwar the between overlap sufficient allow not did (658bp) amplicon >25) score (phred ( whole (FFFR, 1 amplicon 450bp) de to Amplicon data analysis 2017) 230bp at truncation - Accepted ArticleBista elcto ad egn o sequences. of merging and replication Quality control and analysis of the Illumina MiSeq sequences was performed according performed was sequences MiSeq Illumina the of analysis and control Quality , ,

using Quantitative Insights In Microbial Ecology (QIIME) Ecology Microbial In Insights Quantitative using et al. et u the forward and reverse reads were merged with a 25bp minimum overlap. For For overlap. minimum 25bp a with merged were reads reverse and forward the (

n approximate n megablast

r USEARCH, QIIME and BLAST and QIIME USEARCH, - J oining samples.

(

2017) the . Th .

de novo de (NJ) method method (NJ) COI ) , using a a using ,

e truncation e Cmco t l, 2009) al., et (Camacho Similarly, taxonomic assignment was performed as in in as performed was assignment taxonomic Similarly, )

Tmr e a. 2007) al., et (Tamura bsd on based ,

h ietfe OU wr frhr hce for checked further were OTUs identified The acds ad u cso gnrtd O barcodes COI generated custom our and barcodes, average

option in USEARCH in option barcode USEARCH v7 USEARCH (Sait

value for characterisation of the diverse taxa present in the in present taxa diverse the of characterisation for value

strategy was selected because the length of the original the of length the because selected was strategy ou & Nei, 1987) Nei, & ou atC ( FastQC ein 658bp) region were (Edgar, 2010) (Edgar,

For amplicons 2 (FF130R, 130bp) and 3 (B1FR, (B1FR, 3 and 130bp) (FF130R, 2 amplicons For

performed using High Performance Computing Performance High using performed , www.bioinformatics.babraham.ac.uk , and a 97% similarity level was used for OTU OTU for used was level similarity 97% a and ,

n through and ih ≥ with .

bp Metabarcoding library preparation and and preparation library Metabarcoding ny h frad re forward the only

8 smlrt cut similarity 98%

custom pipeline fo pipeline custom (the dominant (the

MiSeq chemistry. Chimeras were were Chimeras chemistry. MiSeq . Th .

phylogenetic phylogenetic e OTU e

(Caporaso et al., 2010) al., et (Caporaso

level of similarity was was similarity of level

OTU centroids in this in centroids OTU d and reverse reads reads reverse and d r filtering, sorting, sorting, filtering, r stop codons and and codons stop - off nlss sn a using analysis d wr used were ads . , All against Bista Bista ) analys

results t al. et , and , the e s

This article isThis article by protected copyright. All rightsreserved. follo Accepted 0.6.2 BWA using on based scaffolds protein inside generated w frame translation and and gene mitogenomes reference by mitogenome for used were construction which sequences, mitogenome candidate 2012) Chin, scaffolds Articlel with reads removing described previously Shenzhen skimming mitogenome HiSeq2000 species instruction manufacturer’s following constructed F mitogenomes. reference Construction mitogenomes ofreference ag t al. et Tang Onco wing Siphlonurus immanis Siphlonurus eoi DA xrce fo individual from extracted DNA Genomic s

from 4 arthropod 4 from

libraries were pooled and and pooled were libraries using SOAPdenovo melania hupensis melania , China , Tang Tang .

. Assembled sequences were annotated following annotated were sequences Assembled . . irr construction Library

, Subsequently

a et al. nd each of them were aligned with corresponding reference protein reference corresponding with aligned were them of each nd .

( L & ubn 2009) Durbin, & (Li 2014) Fo PE

r ( as

2014, 2015) ow quality or adaptor contamination. Clean data Clean contamination. adaptor or quality ow reads. In addition, the original read original the addition, In reads. reference mitogenomes r mitogenomes reference in

. and analysis of analysis and

hce i MEGA6 in checked

species ) and 3 and ) hu t al. et Zhou Thirteen - ) using CLUSTALW 2.1 CLUSTALW using ) oig gene coding

- manual correction and checking were were checking and correction manual each or Trans ( Trans

( mollusc species mollusc .

Macrogyrus oblongus, Gammarus duebeni, Ligia oceanica Ligia duebeni, Gammarus oblongus, Macrogyrus

n asml o rfrne mito reference of assembly and - protein K 71)K 100

s ( n SMOL 0.1.19 SAMTOOLS and pecies 2013

mock community mock s

and

bp PE PE bp (Xie et al., 2014)etal., (Xie by - coding genes genes coding ), Tmr e a. 2007) al., et (Tamura a , mito

ag t al. et Tang the aw data aw

eune on sequenced ( s (Thompson, Higgins, & Gibson, 1994) Gibson, & Higgins, (Thompson, geno Biomphalaria tenagophila, Physella acuta Physella tenagophila, Biomphalaria irr wt isr sz o 200 of size insert with library pecimens s

seby rga we constructing when program assembly Ilmn, Nextera) (Illumina, me

from each species were filtered as filtered were species each from -

ma

samples and IDBA and skimming (PCG) (2014) and and (2014) pping a ue for used was

Tang et al. et Tang L e a. 2009) al., et (Li

a whole whole a ee xrce fo all from extracted were was carried out carried were -

UD

to correct gap length length gap correct to genomes

we done and monitored and done

. ag t al. et Tang (Peng, Leung, Yiu, & & Leung,Yiu, (Peng,

performed performed The 12 individual individual 12 The ( re 2015) ae of lane

assembled generation

as described described as r , s el as well as

espectively espectively to identify identify to bp was was bp Illumina - ( at coding coding 2015) . The The . BGI into of of , ,

This article isThis article by protected copyright. All rightsreserved. index the in function metaMDS the using (nMDS), performed was scaling multidimensional nonmetric program R (AIC) analys statistical criterion information Akaike using selected was model best The ( biomass and reads of number between relationship the explaining model best the select mock reads of number total the from reads OTU of amount as performed was normalisation (pShotgun) (M normalis data, shotgun the For data sequencing Statistical from the onto BWA by mitogenomes reference constructed previously sample librar mg

AccepteditoNorm), Article ), linear and linear ), vegan Each Each o iuls cmuiy variation community visualise To of normalisation performed we samples, between comparisons statistical allow To ies the s maue of measure a as community ) , according to manufacturer’s instructions manufacturer’s to according , wih ee 0b P sequenced PE 100bp were which ,

focal reference of the 10 the of

and community package in R (version 3.3.0) (version R in package Core

( e dtie ifrain in information detailed see and specie Team s icuig aclto o mdl aaees wr promd sn the using performed were parameters, model of calculation including es, logistic

per mock community mock per as proportion of of proportion as ,

s community for each amplicon amplicon each for

with 100% read coverage and coverage read 100% with s. (

2015)

eaie abundance relative models were models analysis .

ation was performed performed was ation DNA mito

sample compared , , based on on based , chondrial while (target_species_reads/total_community_reads) across bsd n pce lvl data level species on based ,

s ag t al., et Tang n w lns f a of lanes two on

was used for constructio for used was accounting for variation of sequencing depth. depth. sequencing of variation for accounting eunig treatment sequencing

for each species and sequencing methods. methods. sequencing and species each for reads on total reads total on reads calculation of the of calculation . Filter . ≥ 99% identity were considered as reads reads as considered were identity 99% by

mitogenome length mitogenome , ed data were aligned onto the 12 the onto aligned were data ed

( and rea and 2015

) HiSeq ) .

ds that uniquely mapped mapped uniquely that ds For

Bray

2000 n s per m per

the of 200 of for amplicon data, data, amplicon for - Curtis dissimil Curtis .

diinly t Additionally, ( H, 1987) (Hu, o ca. mlcn data amplicon and ck community ck bp insert bp 2 - mito 3

G b - ratio arity arity . - .

si per per per All All he To ze

This article isThis article by protected copyright. All rightsreserved. 0.6 (mito species 14,760 and phontinalis bp 16,159 DNA. extracted experiment; this OTUs data our in diversity intraspecific for (B1FR) 14 S (Table 16 produced 1,430,531 was control S Results community sequencingmethodscomposition between PRIMER plot ordination the on communities treatments) same sequencing the different connect from (resulting to used was vegan package in “ordispider” function equencing

Accepted ,815 Article 92 The amount of data attributed to mito to attributed data of amount The the sequenced We metabarcoding For % ( % assigned tothe assigned ) bp , 2.) 2.) , - 4

v6 E E. )

. . wt three with , h following the

OTUs respectively OTUs

After OTU clustering OTU After danica results F3R 10450 ( 1,004,530 FF130R: - ratio) varied largely between species, ranging between 0.011% ( 0.011% between ranging species, between largely varied ratio) K R Cak & oly 2006) Gorley, & Clarke R. (K.

Gyrinus oa lnt o mtcodil N asmle rne between ranged assemblies DNA mitochondrial of length Total ), with ), species

same speciessame wasgenerally low(Table , sequenced on a fraction of an Illumina MiSeq lane MiSeq Illumina an of fraction a on sequenced ,

pce as aheig iclr eoe ( genomes circular achieving also species marinus reference mitochondrial genomes mitochondrial reference data total an . vortex A. average mito average . , t , Collapsing of multiple OTUs per species was used to account account to used was species per OTUs multiple of Collapsing number of reads ( reads of number he , f , ) ( ) 0,5 ± 87 ± 100,453 or each amplicon, each or Table total

a nt nldd n h rn u t lw ult of quality low to due run the in included not was although number of number S - 6 ratio ). chondrial

The average mitochondrial genome length was was length genome mitochondrial average The ,

366), of the observed intraspecific diversity amongst amongst diversity intraspecific observed the a as ue t eaie ifrne in differences examine to used also was Mean Mean

0.1 sequencing sequencing we obtained obtained we 9 3.) reads compared to the total reads per per reads total the to compared reads %. The aver The %.

(nMDS, Bray (nMDS, ±

1R 177 B1FR:

SD),

for S 5

1.) ). 12 out out 12 reads

49 (FF130R), 20 (FFFR) and (FFFR) 20 (FF130R), 49 FFFR1: 248 FFFR1: age sequencing depth sequencing age Notonecta , 22 - Curtis) obtained (17 5 of of 13 species used in in used species 13 . ,

2. ± 24 ± 722.5 . .

, E 776 (24 776 glauca h soft The ach amplicon amplicon ach G. pulex G. after quality quality after 13,3 , Physa Physa , ,878 ) and ) ,418 ware ware 2

per per 6

± - )

This article isThis article by protected copyright. All rightsreserved. power of calculations. we total 125 = 9) x (1sp. species Occurrences (known) expected of cut lower a on based Detection ratesperspecies (FFFR1). amplicon no but 0.008), = p 0.610, = R² (FF130R: 0) = p 0.939, = R² (B1FR: amplicons the of two for found were reads of number total the and sample control positive the the found for was relationship significant no while 0.002), = p 0.717, = (R² sample per achieved reads genome mitochondrial for and reads, of number reads S mitogenome 6 )

Acceptedspecies Article e osdrn te ubr f pce across species of number the considering re .

o te w pstv controls positive two the For Cases of Cases . melanogaster D. htu sqecn of sequencing Shotgun P. fontinalis P. ) .

The proportion presence of false negatives is reported here based on number of number on based here reported is negatives false of presence proportion The

was 177.4was we D. melanogaster false negatives and false positives false and negatives false . mineus M. re calculated normally as 10 per species (10 communities), except for for except communities), (10 species per 10 as normally calculated re with anaverageof2,398,420,020

occurrences

(9 -

occurrences off of reads of off occurrences s positive control was used. Significant relationships between the the between relationships Significant used. was control positive 7

X only)

(min depth: 6.4depth: (min

n vrg o 773 (± 787.3 of average an the Te la The .

read vs. number totalnumber.

( presence in sample in presence ok community mock

/cases] ( /cases] present , ) and ) . melanogaster D. t e ws infcnl creae wt te ubr of number the with correlated significantly was ter N. glauca N. X

based on counts on based (

– defined as zero for indicating presence/absence presence/absence indicating for zero as defined

G.pulex

were found were all

(6 reads reads

ape returned samples

frec pce nte communities. the in species each for )

125.2) 0 omnte t ices statistical increase to communities 10 occurrences , max depth: 670.4max , depth: returned an average average an returned

( Figure S Figure from Fgr S (Figure

for both sequencing methods sequencing both for F or amplicon s amplicon or ) [(11sp. x 10) + (1sp. x 6) + 6) x (1sp. + 10) x [(11sp. ) t for the whole COI region region COI whole the for t Table 3 , TableS 4

) 23,984,200,200 total total 23,984,200,200

1 X (r ).

– a number ead

4 For this step this For E. danicaE. f 4. (± 344.2 of equencing ).

)

( , Table 51.3) 51.3) s

, o

nly nly we for for , This article isThis article by protected copyright. All rightsreserved. were community each for obtained data total the Additionally, best appropriate the with plotted and each species, and data) sequen shotgun and data, each amplicon all for of sum individually amplicons, COI (three examined treatment was biomass with reads of relationship sequencing different the across species to linked generally type relationship biomass to reads of number the characterising for appropriate Associations betweenb specimens cross to attributed 1 34,511 reads posi except species gastropod from came mostly negatives false Overall, community. that in present be to known were biomass species of B1FR for species) FFFR, for species) rare the for present biomass of amount lowest second and lowest the for only occurred method Generall species. 5 for (5.6%), cases 125 of out 7 in samples , Acceptedamplicon). (FFFR 10 community in reads 204 Article tives

h sogn prah ald o eet h peec o seis n the in species of presence the detect to failed approach shotgun The ttsia m Statistical pce. o the For species.

were detected for detected were .

detected . dl investigation odel (Table (Table - , b

contamination but also when up to 10 (FF130R, FFFR), 13 (FFFR) or 17 (B1FR) specimens 17 specimens or 13(B1FR) FFFR), (FFFR) to10(FF130R, but also when up .)

three css 45 i 3 pce) o F10, and FF130R, for species) 3 in (4.5% cases 6 in iomass 1 , communities 7 and 7 and communities

, Table S Table , G. pulex G.

mlcn, false amplicons, species species

and

due to carry over from species during common storage common during species from over carry to due

7 ( s amphipod, number ofreads )

. N. glauca N. ugse that suggested

Here false negatives appeared not only for the lowest lowest the for only not appeared negatives false Here 9 respectively

eaie ocre in occurred negatives

The presence of false positives false of presence The - in two cases cases two in and cases) 2 fit model (Figure model

both

, and for species species for , and logistic E. danica E. ( y, the false negatives with this this with negatives false the y, FFFR amplicon FFFR 3

, Figure 4 Figure , n linear and methods (Table (Table methods iutnosy plotted simultaneously a

c.) ( .) 3 cases (2.2% in 3 3 in (2.2% cases 3 .) mayfly, css (5. cases 7 s, with the model the with s, P. , Figures , ) ,

fontinalis here could be be could here with models were were models 1 case). case). 1 community 111 and and 111 6 4 S i 5 in % ). The The ). 5 False False

- with cing S

8 of of ). ).

This article isThis article by protected copyright. All rightsreserved. case this in although groups, the of separation vertical observe only again could we reads shotgun W (Figure data shotgun and amplicon of plotting Simultaneous amplicons. different the by obtained results 10 and (Figure 9 communities of exception the with axis vertical the along communities same Community analysis comparison to theothertwoamplicons in not but data, amplicon (Folmer) FFFR only using to comparison in improvement an was reads significant Nevertheless, 4) Figure thr all from obtained ( the using generated amplicon FFFR the for only but relationship, of exception the with relationships, for relationships across varied results reads data,FiguresS9amplicon of sum and (Shotgun community per trends overall visualise to biomass specimen against Figure Figure

Acceptedamplico the hen Article P Comparison between the three COI amplicons on the on amplicons COI three the between Comparison

stv ad oty infcn relationships significant mostly and ositive (11 out of 12 species wit species 12 of out (11 S S mc cerr hn hn h idvda apios ee lte. oevr the Moreover, plotted. were amplicons individual the when than clearer much , 11 7 . In this case, the detection rates rates detection the case, this In . ). ). e lo netgtd the investigated also We S12a uh findings Such ) shows each sequencing treatment separated along the horizontal axis. axis. horizontal the along separated treatment sequencing each shows ) 4 n data were plotted as a sum (SumAmplicon) (Figure (Figure (SumAmplicon) sum a as plotted were data n

- e mlcn v. nw boas as biomass, known vs. amplicons ee

ot f 3 species 13 of out 8 amplicons, - S10 suggest a qualitatively similar community composition in the the in composition community similar qualitatively a suggest ) .

– h 1 trending towards significance; p = 0.08). = p significance; towards trending 1 h

bio but we found significant positive read read positive significant found we but mass relationships were found in only 8 species, which species, 8 only in found were relationships mass . danica E.

( eainhp otie we adn u te reads the up adding when obtained relationships Table

4 )

( . Table which presented negative reads reads negative presented which

improved,

ee on bten biomass between found were 4

).

l seis rsne positive presented species All su n y removing by MDS o apio dt ( data amplicon of m universal showed grouping of the the of grouping showed S12b as negatives. false

Folmer primers Folmer M etabarcoding etabarcoding ) against the the against ) – –

- biomass biomass biomass biomass T shotgun shotgun able able 4 ,

This article isThis article by protected copyright. All rightsreserved. a would coverage 10X that Accepted reference complete almost of assembly enabling Sequencing performance and coverage sample combined. amp three all from results when improved slightly method the of accuracy the biomass the approach, gene. COI the of metabarcoding amplicon to and reads between relationship also metabarcoding mitogen while cases, several in h species misrepresented biomass low detect to failed multiple Article different of testing allowed which replicates of design structured a through used, specimen each for measurements biomass exact used Gómez of biomass metabarcoding Discussion ord similarity er ofcommunitiesas B,C,G, I ee we Here ufcet oeae f eeec mtgnm sqecn ws civd ( achieved was sequencing mitogenome reference of coverage Sufficient - Rodríguez et al et Rodríguez

mc eunig rvds more a provides sequencing omic of of per species). D species). per ranking of communities was almost identical for the two types of sequencing (see (see sequencing of types two the for identical almost was communities of ranking

a large proportion of the species the of proportion large a uk netbae samples invertebrate bulk

metabarcoding used and ge boas pce a well. as species biomass igher

htu mitogenomics shotgun two ., 2015; Yu et al., 2012) al., et Yu 2015; ., etection of rare taxa proved challenging for both methods, as they they as methods, both for challenging proved taxa rare of etection llow shotgun mitogenome assembly ( assembly mitogenome shotgun llow HTS

data did not provide accurate quantitative information on the the on information quantitative accurate provide not did data

biomass from biomass based etc.). etc.). .

Unlike previous work work previous Unlike methods consistent and representative estimate of the the of estimate representative and consistent

, in bulk macroinvertebrate samples macroinvertebrate bulk in

ims seais fo snl seie to specimen single (from scenarios biomass

we extracted whole extracted we bulk macroinvertebrate samples macroinvertebrate bulk to

hn osdrn te single the considering When Our results results Our mito characterize for

genomes. Previous reports suggest suggest reports Previous genomes.

N based DNA (Crampton suggest

Zhou et al. 2013) al. et Zhou species composition and and composition species

bodies in bulk and also and bulk in bodies

idvriy analysis biodiversity that using shotgun shotgun using that - Platt et al., 2015; 2015; al., et Platt licons were were licons ,

Table , , but even but , compared

amplicon although S 6 ), ,

This article isThis article by protected copyright. All rightsreserved. Acceptedboth methods for r Sequence run. same therefor should included) molecules of taxa. size to of (according sequencing range during contents a library acrossNormalising used are pairs primer different when efficiency amplification binding reads shorter to attributed be could sequencing amplicons two amplicon shorter Article2016) to compared sufficient mapping, for baits as sequences (mito reads sequencing of proportion 8.5 size genome genome nuclear related tolarge Leese & Weigand, Zizka, duebeni length for coverage 6.4X at The majority majority The varied coverage sequencing work, metabarcoding the For . ,

, 15,651 bp 15,651 , y oprsn o te aalbe lsl re closely available other to comparison by Ar e a. 2011) al., et (Aird

eads when

e be taken into consideration when multiple amplicons multiple when consideration into taken be e BF, 5b; FR 658bp) FFFR, 450bp; (B1FR, – , while only while , - (FF130R, 130bp) (FF130R,

of species presented positive relationships of biomass with the read data read the with biomass of relationships positive presented species of biomass 05 pg 10.5 ,

G. pulex G. in eunig ie leng mixed sequencing

Krebes & Bastrop & Krebes e novo de ( 2017) relationships . Jfey & (Jeffery

, we assembled 13 assembled we ,

one species showed negative relationship ( relationship negative showed species one diinly ti vrain ol b atiue t different to attributed be could variation this Additionally, ]

. for this species for

The low coverage achieved for achieved coverage low The seby ra bsd approach) based (read assembly obtaining u pipeline our luia ie sqecn peeetal amplifying preferentially sequencing MiSeq Illumina

- rgr, 2014) Gregory, ratio 0.011%, Table Table 0.011%, ratio

for both methodsfor both ( 2012)

higher number of reads reads of number higher h amplicons th Tbe S (Table ,

326 bp, 326 (as reportedwith(as forspecies nuclear related loe lwr eunig oeae o be to coverage sequencing lower allowed

and ae seis [ species lated Gammarus wih cone fr h largest the for accounted which , 4 at

) . an estim an

Such Such or variable efficiency of primer primer of efficiency variable or S 6). among variation in the depth of of depth the in variation y using By G. pulex G. roeselli ated 8 ated see (Crampton compared to compared

are sequenced in the in sequenced are

amplicon species species

species 3 15,989 in 15,989

could be could - xsig barcode existing 85 % of expected of % - E. danica E.

lt e al., et Platt s Gammarus , with the the with , the other other the Macher, possibly possibly ,

in

This article isThis article by protected copyright. All rightsreserved. Accepted Zhang,Hartl, (Bensasson, & Hewitt,2001 of removal and detection easier allows diversity. of recovery the outperforms genomes & Vogler,2017) beetles data genome reference a depth sequencing reduced of use allowed which genomes, mitochondrial Perspective & Bernatchez,2016;etal., Tang 2015; etal., Zhou 2013) Articlemitogenomic relationship biomass to reads of representation the for s and number read amplicon increased amplicon models linear to compared number logistic r be to likely are results the that species same The (Figure data metabarcoding . We

O

S hr studies ther (Gómez

model was a better descriptor of the relationship between biomass and read read and biomass between relationship the of descriptor better a was model 7 perform ), which was sequenced using the universal Folmer primers primers Folmer universal the using sequenced was which ),

on mito

studies , -

Rodríguez et al., 2015) al., et Rodríguez have also

ed also failed to amplify during individual barcoding (Table (Table barcoding individual during amplify to failed also logistically ). Th ). genomic genomic ,

(Doi et al., 2017; Elbrecht & Leese, 2015; Lacoursière 2015; Leese, & Elbrecht 2017; al., et (Doi using species

mito Additionally, e

E. danica E. shown that that shown

genom ioeoi sqecn to sequencing mitogenomic e novo de pecies based analysis A. vortex A.

with increasing biomass, suggesting a biological link between link biological a suggesting biomass, increasing with elated to primer incompatibility. In incompatibility. primer to elated e

biomass.

reverse trend was found for the FFFR (658bp) amplicon amplicon (658bp) FFFR the for found was trend reverse viaiiy f eeec mito reference of availability approach skimming

mitogenome mitogenome

(

was not included in downstream analysis of shotgun of analysis downstream in included not was al 4 Table or mass or )

from shotgun sequencing data from shotgunsequencing ula Mtcodil suoee (NUMTs) pseudogenes Mitochondrial Nuclear

of biodiversity Both Both

). wtot eeec li reference (without - trapped arthropods trapped

based on custom custom on based I l n most cases cases most n inear inear assembly on based .

and

i publi in s characterise assemblages of leaf leaf of assemblages characterise

logistic h nme o ras per reads of number the genomes during analysis analysis during genomes hd eaacdn and metabarcoding shed some models brary) (Choo, Crampton (Choo, eeae reference generated (Folmer et al., 1994) al., et (Folmer reference sequencedreference (Tang etal.,2014)

. cases, the use of a of use the cases,

- In t In

Roussel, Rosabal, Rosabal, Roussel, n cuay and accuracy in have been have S 1), suggesting suggesting 1), he absence of absence he -

Platt, Platt, used used .

.

This article isThis article by protected copyright. All rightsreserved. data total the of 0.5% about for accounted which achieved, was reads Accepted2016) al., et (Liu extraction) DNA here demonstrated did it nevertheless danica the In wi a on work further as difficult 2013) al., et 0.5 only leaving DNA, genomic to attributed are reads the of 99% approximately generally but vary, might 2013) al., et data, Article workused inthepresent have available are options normalisation et size body of variability the by produced concentrations DNA different for account to mito normali reads sequencing of amount total the to compared mito and reads, l, 2014) al., - were reads shotgun the analyses, core the For T Mitogenomic ratio and proportion of total reads (Table (Table reads total of proportion and ratio as o enhance the mtDNA contribution to the data the to contribution mtDNA the enhance o ) present work mito work present , with an average of 0.19 of average an with , s the genomic DNA represents the largest amount of total DNA in the sample the in DNA total of amount largest the represents DNA genomic the to mtos hwd iia fnig bten ed normali reads between findings similar showed methods ation . Depending on the taxon, the genomic to mitochondrial DNA ratio (mito ratio DNA mitochondrial to genomic the taxon, the on Depending . . nme o idvdas n rltv abundance relative and individuals of number ,

Attempts to generalise the expected genomic to mitochondrial DNA ratio are are ratio DNA mitochondrial to genomic expected the generalise to Attempts - -

1% of the data to be used (for used be to data the of 1% ratio, (Macher et al., 2017; Zhou et al., 2013) al., et Zhou 2017; al., et (Macher have been used been have

eunig urnl ue a eysal rcin f h ttl sequencing total the of fraction small very a uses currently sequencing in

. not

which accounted for the variability of mitochondrial sequencing effort effort sequencing of mitochondrial variability the for accounted which

- seem to present a huge challenge for the method, as was was as method, the for challenge huge a present to seem nuclear ratio nuclear

also been der variety of taxa is necessary is taxa of variety der % . In In .

Table S6) Table

(Weiss et al., 2017) al., et (Weiss Zhou et al. (2013) al. et Zhou

ranged between 0.011% ( 0.011% between ranged used , which w which ,

in ( other studies insects the mito the insects 4 ).

Normali as lower than the previously described, described, previously the than lower as normalised according to prop to according normalised ,

e community per enrichment via centrifugation (during (during centrifugation via enrichment , only a moderate increase of mtDNA of increase moderate a only .

,

Similar normali Similar and s ation of sequencing data is used used is data sequencing of ation

oligonucleotide capture array capture oligonucleotide n communities. in (Crampton e.g. - ratio is 0.5% is ratio

G in . pulex .

Tang Tang . Our investigation of of investigation Our . , while , s ation strategies strategies ation - Platt et al., 2016) al., et Platt sed according to to according sed et al. ) and 0.692 and )

or lower or

Macher et al. al. et Macher

vrey of variety A ( 2015) ortion of of ortion ) (Gillett

- (Zhou (Zhou (Zhou ratio) ). % ( %

as E. E. .

This article isThis article by protected copyright. All rightsreserved. introgression Acceptedgroup, linkage single a represent (Dupuis work marker single to compared species related closely for success delimitation increases presence. taxon establishing for criteria on selection and sequences longer of use the by reduced be also could positives sequencing variable or degradation to due genes of out drop random by caused negatives false of effects the reducing while barcodes, COI single than rate effective more a at diversity intraspecific of measurement These genes. coding contigs mitochondrial long through content information Additionally, multiplexing. sample as app traditional over advantages presenting Articlepicking probes the designing limited be array. the designing for used species the to compared the of distance phylogenetic the on depending variations array capture ca. for accounted 100 by ratio mitochondrial the arrays, capture 140 to 129 a report (2017) plctos of Applications -

up non taxa target et al. et by the availability of sequencing information for the target organisms used for used organisms target the for information sequencing of availability the by , 2012), (Weigand et al., 2017) al., et (Weigand

maintain designed based on 379 mitochondrial genomes mitochondrial 379 on based designed 2 o te eunig data sequencing the of 42% .

Nevertheless, since multiple markers derived from mitochondrial reads reads mitochondrial from derived markersmultiple since Nevertheless, mito og contigs long Hjbbe, igr Cae & eet 2007) Hebert, & Clare, Singer, (Hajibabaei, ed

genomic genomic (Liu et al.,(Liu et 2016) the original ratio of species biomass in the sample, with a few few a with sample, the in biomass species of ratio original the - od nihet f tN truh centrifugation through mtDNA of enrichment fold or - od oprd o rvos teps mDA reads (mtDNA attempts previous to compared fold

, the , eunig a b ue fr idvriy assessment, biodiversity for used be can sequencing could represent could ol pro could coverage use utemr, obntos f utpe markers multiple of combinations Furthermore, . mito roaches similar to those of metabarcoding, such such metabarcoding, of those to similar roaches

of vide vide

independent nuclear marker nuclear independent genomic Tn e a. 2014) al., et (Tang

fe enrichment after improved

mito

The accuracy of accuracy The

- nuclear discordance discordance nuclear eunig rvds increased provides sequencing

test sample species composition species sample test which contain multiple protein protein multiple contain which phylogenetic resolution and and resolution phylogenetic

(Liu et al., 2016) al., et (Liu ).

.

or due to occasionally occasionally to due or Additionally, h nme o false of number The capture capture s should also be also should as a result of result a as array

increase stringent stringent . use of a a of use

s Use of of Use

could d

This article isThis article by protected copyright. All rightsreserved. FF130R amplicon to second macroinv for modified and designed were primers FF130R and B1FR depth. sequencing or specificity primer to related be could amplicons between Th amplicons. two other the for 13 of out 8 to compared species, read significant showed primers) Folmer (universal in biomass to failed work metabarcoding relative support field the in findings of majority the to 2016) Trites, & Harsch, Eveson, Deagle, reflect S samples environmental traditional increasingly is work metabarcoding Additionally, methods using traditional to achieve difficult samples community Metabarcoding 2015; Adkins,Janecek,& Miyamoto, 1994)Allard, Rodney, considered Accepted species of proportion higher where abundance, read Articleequencing eaacdn hs en any sd o te eoey f pce rcns from richness species of recovery the for used mainly been has Metabarcoding d y a by ed ly species

rvos findings previous because the sequencing coverage achieved for the Folmer region (FFFR) (FFFR) region Folmer the for achieved coverage sequencing the because csse mntrn, hr acrt etmtos of estimations accurate where monitoring, ecosystem to resolve the presence of cryptic species cryptic of presence the resolve to h mc communities mock the

- ihr rprin f eunig reads sequencing of proportion higher abundance as abundance based ,

. Summing of sequencing results from all three amplicons slightly slightly amplicons three all from results sequencing of Summing . uncovering in uncovering biodiversity biodiversity are are ,

and eeal desirable generally derived from derived detect significant relationships significant detect

mainly reflect the larger uncertainty of assumptions on on assumptions of uncertainty larger the reflect mainly

analysis many cases extensi cases many

in , but th but ,

( several e.g.

metabarcoding metabarcoding

(Leray & Knowlton, 2015; Sinniger et al.,2016)et Sinniger & Knowlton,2015; (Leray lrct Lee 2015) Leese, & Elbrecht ese observations do not generally correspond correspond generally not do observations ese

cases (Table (Table cases

J e a. 21; hkal e a. 2015) al., et Shokralla 2013; al., et (Ji - biomass relationships in only in relationships biomass en pooe a a atraie to alternative an as proposed being

(Campos ve diversity ve

a be suggested been has .

workflows between read data and known known and data read between 3 - Soto, Soto, . h FF apio data amplicon FFFR The ). e

discrepancy in efficiency efficiency in discrepancy which would have been been have would which species species Torres . More specifically, the . Or re Our . ’

rert taxa ertebrate ims would biomass - Pérez, & Solari, Solari, & Pérez,

ab ( ut partially sults e.g. B undance undance 4 ecause the ecause compared

out of 13 of out Thomas, ,

and be be in . . This article isThis article by protected copyright. All rightsreserved. the support partially results Our content. known of samples in detection species of levels Acceptedused were region amplicons samples community taxa of number large available already sequences repositoriesreference large of to the due markers to other 2012) metabarcoding through biodiversity 2014) al., detection specific group universal for designed sites priming has marker COI Article resembling more data shotgun the but amplicons), (individual assessed comparable, still reads amplico individual based composition community in similarities revealed from themetabarcoding data relationships reads/biomass the improved Multi Selection of the of Selection - , though the COI still retains its superior value superior its retains still COI the though , biomass . - In . against Additionally dimensional scaling analysis (nMDS, Bray (nMDS, analysis scaling dimensional

Gibson Dal e a. 2014) al., et (Deagle relationships for individual species, the community profiles obtained were were obtained profiles community the species, individual for relationships on occasion been occasion on

closely , showing , ns (Figure (Figure ns albeit the

target region can influence metabarcoding results, and the metabarcoding and results, influence regioncan target

ol as b ahee truh the through achieved be also could , t al. et (

Clarke (Gibson et al., 2014; Hajibabaei et al., 2012; Leray et al., 2013) al., et Leray 2012; al., et Hajibabaei 2014; al., et (Gibson different different htu dt, iia patterns similar data, shotgun results results obtained fromtheB1FRamplicon ( ih oe xetos cmuiis 9 (communities exceptions some with e.g. freshwater invertebrates freshwater e.g.

that combinations of several primers significantly increased the increased significantly primers several of combinations that

( S

2014) (Table (Table 11 t al. et

markers –

questioned , S12 2017) ,

4 , . a set of 11 primer pairs targeting the COI barcoding barcoding COI the targeting pairs primer 11 of set a

) s A . ).

uch

lternative Such trends trends Such have been have , but mainly assisted in assisted mainly but ,

. as nrae acrc i boiest detection biodiversity in accuracy Increased

18S with regards to universality of the available the of universality to regards with we

COI - e oe cond more re

Za e a. 2014) al., et (Zhan Curtis index) index) Curtis with some taxonomic groups taxonomic some with

proposed for use in characteris in use for proposed i ndicate , metaba

Elbrecht & Leese, 2017; Leese, & Elbrecht were were on

simultaneous that despite the variations in in variations the despite that rcoding the sequencing results for for results sequencing the

of the metabarcoding data data metabarcoding the of

on ars treatments across found removal of false negatives false of removal (Figure S12) (Figure ne ars te y the across ensed - 10,

, or 16S 16S or , rmr hv been have primers Figure Figure

use of multiple multiple of use .

S utility Ep t al., et (Epp 11

compared compared Gibson et Gibson ). When When ). ation of of ation

of the of the

- for a a for axis , or or , in in ,

This article isThis article by protected copyright. All rightsreserved. Accepted 2012) al., et (Hajibabaei or biomass smaller en to order in suggested been has primers multiple of use between v while primer abnormal or negatives false caused with species species rare of detection for depth sequencing higher for need p biomass of end lowest the at species rare missed of specimens 10 (e.g. communities the in biomass higher with species for occurring also negatives false to due unpredictable more 5. and (FF130R) 4.5% (B1FR), 2.2% at amplicon per lower somewhat or comparable either were metabarcoding for detections Article negative false False al., 2016) time analysis data and handling as well as preparation library and primers tagged consideration into studies marker different the between ra detection the increase can amplicons multiple from reads sequencing of combination the that idea The percentage of false negative detections for the shotgun work reached 5.6%, while while 5.6%, reached work shotgun the for detections negative false of percentage The negatives incompatibility rain n eunig et c depth sequencing in ariation

. reads

te of species richness richness species of te limited

, and species biomass species and (Dupuis et al., 2012; Zhan et al., 2014) al., et Zhan 2012; al., et (Dupuis

detection ofrarediversity

t hat the use of multiple amplicons or loci also creates additional costs for for costs additional creates also loci or amplicons multiple of use the hat

contribution to overall biomass overall to contribution lower (as was likely the case for species species for case the likely was (as species ( species Gnrly te nblt o ete mto t dtc rare detect to method either of inability the Generally, .

eaie bnac i te ape truh metabarcoding through samples the in abundance relative Table of metab of biomass representation in metabarcoding data, through through data, metabarcoding in representation biomass (Hajibabaei et al., 2011) al., et (Hajibabaei 4 E. danica E. ). ould Similar findings are also reported by o by reported also are findings Similar , and arcoding, as improvement of improvement as arcoding,

also influence the quantitative relationships relationships quantitative the influence also , FFFR amplicon). FFFR , closely relatedclosely species 6 . Primer binding related bias could have have could bias related binding Primer . % (FFFR). Ho (FFFR). % resence, which could be indicative of a a of indicative be could which resence, .

E. Furthermore, Furthermore, hance the detection of species of of species of detection the hance danica . Increased sequencing depth or depth sequencing Increased . wever, metabarcoding was was metabarcoding wever, ) a wl a sal bodied small as well as ,

The or low sequencing depth sequencing low or

shotgun method only only method shotgun

we our

should also also should results varied results ther multi ther (Creer et (Creer or low low or take take - , This article isThis article by protected copyright. All rightsreserved. Leese, stream (e.g. results assessment identification, based morphology to comparable are costs exist. groups target the of databases reference good accur where cases in especially monitoring, and cost), analytical the reducing is multiplexing cases both (in sequencing shotgun than lower remains currently metabarcoding COI date to used widely more and biomass Overall F sequences. p Using sequences. reference barcode individual against OTUs of identification BLAST by achieved was species results. the in confidence more previ the onto mapping by species two species two the between differentiate effectively to able species samples for manyendangered biomass Accepteduture persp Article ial, h aiiy o iciiae ewe coey eae seis in species related closely between discriminate to ability the Finally, ( , the ,

2017)

of should also be taken into consideration. We analysed We consideration. into taken be also should species could have significant implications for conservatio for implications significant have could species

gastropod

mito ectives ) shifts in in shifts .

genomic approach genomic hylogenetic hylogenetic

s

biomass or invasive ,

w B ould allow allow ould ithynia

in mixed mixed in (NJ) (NJ)

Similarly, Similarly,

leachii species species i a bio analysis analysis could present could eaacdn t metabarcoding

ssessment and bulk ul gnrtd eeec mitog reference generated ously (Zhan (Zhan & MacIsaac, 2015) identification of metabarcoding reads for the two the for reads metabarcoding of identification further supported the correct annotation of the the of annotation correct the supported further B s t boas nomto i nt required not is information biomass ate amples ithynia . Nevertheless, the cost of metabarcoding metabarcoding of cost the Nevertheless,

more effective more lrct Vms Mise, rvia & Aroviita, Meissner, Vamos, Elbrecht,

tentaculata taken from the wild the from taken

o be used for for used be o Additionally, current metabarcoding metabarcoding current Additionally,

.

A nnotation of the reads the of nnotation whilst two closely related congener related closely two Sogn eunig was sequencing Shotgun . . and

n surveys, as is the case case the is as surveys, n

providing comparable comparable providing

accurate detection of detection accurate applied applied , compared to the the to compared , nms provi enomes community community ecosystem

for the the for ,

ded and

This article isThis article by protected copyright. All rightsreserved. Government’s the Welsh programme Fund(ESF) convergence European Social initiative led byBangorUniversity sector HE onbehalfofinWales.Itis partfunded by the experiments Grail forassistance Ec This funded work was Acknowledgements to environmentalof response change. subtle increasinglycapturing wellas as scales, temporal of scale unprecedented investig and networks ecological reconstruct to methods learning machine and samplers automated utilizing pipelines automated through future sequencing si ecological meaningful more reveal might predation, and competition to linked interactions biotic other as well as interactions trophic of dynamics and nature the where especially change ecological to responses subtle on applied as levels, generating to nl rlvn t biomonitoring to relevant gnals

Acceptedonomy Skills Scholarship(KESS) Article HTS The i The

plctos bt i gnrtn cmaal dt st ars lre pta and spatial large across sets data comparable generating in both applications,

mportance of mportance el s csse functioning ecosystem as well

niiul or individual (HTS) .

Knowledge Economy Skills Scholarships (KESS) is Scholarships(KESS) Knowledge Economy Skills a pan more

data of species richness and relative abundance could be could abundance relative and richness species of data

and informative (Bohan et al., 2017)al., (Bohanet a

nd supported Andrian ChalkleyforcollectionAndrian of specimens invertebrate used in

optimising lmtd number limited a estimates of estimates . W

and standardising and (Woodward, Gray, & Baird, Baird, & Gray, (Woodward, e by the Environment Agency (EA) UK, (EA) EnvironmentAgency by the

also . .

Such Such

S thank Bangor University forsupportthank BangorUniversity Drig t l, 2017) al., et (Darling

imultaneous of ecological interactions across taxa and trophic and taxa across interactions ecological species, species, advances advances such , but ecologically informative, signals informative, ecologically but ,

which in oioig cos rpi levels, trophic across monitoring

ate ecological interactions at an an at interactions ecological ate sequencing sequencing technologies

may not necessarily capture capture necessarily not may Traditional . 2013) - Wales higherWales approach underpin the underpin . High throughput throughput High . and enhanced enhanced

a Knowledge a Knowledge for West monitor

and es - level skills

Wendy Wendy is linked linked is in the the in utility ing is is ing

This article isThis article by protected copyright. All rightsreserved. Accepted C.,Camacho, Coulouris,G.,Avagyan, V.,Papadopoulos, J.,Bealer, Ma,N., K., T. & Madden, TamaddoniBohan, D. C., A.,Vacher, Bista, I.,Walsh, K.,M., Carvalho, G.R.,Seymour, Hajibabaei,M., Lallias,D., …Creer, S. Berry, D.,Mahfoudh,Wagner, primersK. Ben, M.,used& Loy,A.(2011).in Barcoded Bensasson, Hartl,D. D.,G. L., &Hewitt, Zhang,D., M.(2001).pseudogenes: Mitochondrial A.C.,Benke, A.D., Huryn, Smock,L.a,& Wallace, J. B. (1999).Length Article D. Baird, J., & M. Hajibabaei, Biomonitoring2.0: (2012). newparadigminecosystem A Aird, D.,Ross, T., M. Chen,W.M.,Fennell, Russ,A. G., S., Danielsson, C.,…Gnirke, (2011). References through AgriculturalChina University. manuscript. We alsothankProf. and Leese for two reviewers commen Florian anonymous andtheValleys.WethankHPCWales Walesforallowing use for oftheirsystems analysis. https://doi.org/10.1186/1471 architectureL. (2009). BLAST+: and applications. https://doi.org/10.1016/j.tree.2017.03.001 Networks.Reconstruction of Ecological G. (2017).Next https:/ oflakeecosystemdynamics biodiversity. (2017). time Annual Microbiology amplicon pyrosequencingmultiplex biasamplification. https://doi.org/10.5061/dryad.d2f79 evolution ’smisplaced witnesses. 308 southeast inNorthfreshwater macroinvertebrates America withparticularreference to the 21 possible next assessment made by BiologyGenome andAnalyzing bias minimizingPCRamplification in Illum (8), 2039 – 343.https://doi.org/10.2307/1468447 /doi.org/10.1038/ncomms14087 X.Z. is also supported by the ChineseFund bythe UniversitiesX.Z. is Scientific (2017 alsosupported

ern United ern States. – 2044. https://doi.org/10.1111/j.1365 , 77 - Generation Global Large Biomonitoring: , (21 12 - (2), R18. https://doi.org/10.1186/gb ), 7846 series of analysis aqueouseDNAreveals ecologically relevant – 7849.https://doi.org/10.1128/AEM.05220 Journal NorthJournal of the AmericanBenthological Society - 21 - Nezhad, A., Raybould, Nezhad, A.,Raybould, Dumbrell, A. J., & Woodward, 05

TRENDS in TRENDS - generation DNAsequencing. - 10

- 421 Trends in Ecology and inEcology Evo Trends

Nature Nature Communications

Ecology &Ecology Evolution BMC BioinformaticsBMC - 294X.2012.05519.x

Applied andEnvironmental Applied - ina sequencing libraries. ina sequencing - scale, Automated 2011 - 12 - Molecular Ecolo Molecular mass relationshipsfor , - lution , 2 16 8 - , 14087. r18 , (6), 314 10

- .

11 (1), 421. t s on the

– 321. QC114) QC114) gy , 18 , (3), (3),

This article isThis article by protected copyright. All rightsreserved. B.E., S. N.,Deagle, Jarman, Coissac AcceptedDarling, J.A., G.Galil, B. Carvalho, S., R., Viard,Rius, M., F.,& S., Creer, Deiner, K., Frey, S., D.,P.,Porazinska, Taberlet, Thomas, W. K., …Bik,H. M.(2016). Cranston, P. S.(1990).andinvertebrate taxonomy.Biomonitoring Crampton Crampton Collins, A., D.G.,Hoare,D.,& Ohandja, Voulvoulis, N. theWater (2012).Implementing ArticleClarke, J., L.J.,Soubrier, Weyrich, L.S., metabarcodes &A.(2014).Environmental Cooper, Clarke, J.M., L.J.,Beard, Swadling,K. M., &B.E.(2017). ofmarkerchoiceand Effect Deagle, Clarke, K.R., & Gorley,v6: N. R. UserManual/Tutorial. (2006). Plymouth: Primer PRIMER Choo, Crampton L. Q., Caporaso, G., J.,J. Kuczynski, Stombaugh,J., Bittinger,K.,Bushman,F.D.,Costello, E.K.,… Campos https://doi.org/10.1016/j.marpol.2017.08.014 biological ecosystems. invasions inmarine Recommendations fordeveloping andapplying genetictools andmanage to assess and Ecology The ecologist’s fieldguidetosequence and Assessmen https://doi.org/10.1186/s13742 outletting thegenes ofthebottle. Evolution mitochondrial metagenomicsrainforest ofabornean sample. V.,Khen, C. &Vogler,A. https://doi.org/10.1016/j.envsci.2011.11.003 and Wales. Framework Directive: A transition fromestabl Resources for insects:Insilico PCR Evolution th https://doi.org/10.1111/mec.14258 mitochondrial abundancespectrum. size classesin Phylogeny, alocal assemblageoftropicalDiptera: species diversityand data. Knight, R. (2010).allows of analysis high QIIME https:// (Hemiptera, Reduviidae). with two mitochondrial genesinMepraia spp. andTriatomaeratyrusiformis ermal cyclingermal protocol onDNA metabarcoding zooplankton studies. - Soto, Soto, R., Torres - - Nature Methods Platt, A., Yu,D.W., X., Zhou, & Platt, A., Timmermans, M.M. J.T. L., T. N.,Gimmel, Kutty,S. Cockerill, D., doi.org/10.1590/S1415 , , , pp. 1160 32 7 Environmental Science andPolicy (3), 873 Evolution (9), 2302 t , - 14 Platt, A., & Vogler,A.P. Shotgunmitogenomics (2017). acrossbody - – (2 Pérez, F.,&Pérez, Solari,A. incongruence inferred(2015). Phylogenetic – 883. https://doi.org/10.1002/ece3.2667 1170. https://doi.org/10.1111/1755 – – , , 3), 265 56 2316. https://doi.org/10.1093/molbev/msv111 7

reveals potentialreveals fortaxonomic bias. (5), 335 P. (2015). Soup to tree: The phylogeny of beetles P. toThephylogenyofbeetles inferred (2015).Soup tree: by , 68 Genetics andGenetics Molecular Biology – – , E., Pompanon, F., P.,, Taberlet, Taberlet, P., … 74. https://doi.org/10.1111/2041 - 273. https://doi.org/10.1007/BF00677921 475738320140301 - – 016 336. ht GigaScience

- Vogler, metagenomics: Mitochondrial A.P. (2016). Molecul 0120 - based identification of biodiversity. tps://doi.org/10.1038/nmeth0510 - Marine Policy y

, ar Ecology

ished monitoring networksinEngland 17

- ,

throughput community sequencing communitysequencing throughput 5 , 49 (1), 15.

– 61. ,

, Piraino, S. (2017). 17 , - 85 0998.12265 38 (8), 2045 (May), 54 (May), (3), 390

Molecular Molecular Ecology Environmental MonitoringEnvironmental Molecular Biology and Molecular - 210X.12574 – – 2054. – 395.

Ecology and Ecology

64.

- Methods Methods in 335

- E.

This article isThis article by protected copyright. All rightsreserved. Accepted C.,Baird,D.J.,Gibson, Shokralla,S., Curry, Monk, J.F., W. A.,King,I., & Hajibabaei, M O.,Folmer, Black,M., Hoeh, W.,R.,& forLutz, Vrijenhoek,R.(1994). primers DNA G.Ficetola, Pompanon, F., P.S Miaud, C., &Taberlet, (2008). N. Fahner, A., Shokralla,S.,Baird,D.J.,&M.(2016).Hajibabaei, Large Epp, L.S.,Bellemain, E.P.,J.,Esposito,A.,Riaz,T.,…Brochmann,C. Boessenkool, Haile, V.,Elbrecht, Vamos, Meissner, K., E. Aroviita,E., J.,& F.(2017). Leese, strengths Assessing Article V.,Elbrecht, & F.(2017). Leese, Validation andDevelopment ofPrimers COIMetabarcoding V.,Elbrecht, & F.(2015). Leese, DNA Can R.C. Edgar, (2010). andclusteringorder Search Dupuis, J.Roe,A.D.,& Spe R., Doi, H., R.,Akam Inui, throughput sequencing. throughput sequencing. (2015). Large https://doi.org/10.1371/journal.pone.0013102 invertebrates. amplification cytochrom of mitochondrial https://doi.org/10.1098/rsbl.2008.0118 environmental DNA fromwatersamples. https://doi.org/10.1371/journal.pone.0157505 annotation offourDNAmarkers. environmentalDNAplants through metabarcoding of Recovery, soil: resolution, and https://doi.org/10.1111/j.1365 past andpresentecosystems. (2012). environmentalmetabarcod New https://doi.org/10.1111/2041 routine stream monitoring. and weaknesses of DNAmetabarcoding 5 Macroinvertebrate Bioassfor Freshwater https://doi.org/10.1371/journal.pone.0130324 InnovativeMetabarcoding Protocol. Biasand TestingAbundance? Primer Biomass Bioinformatics 4436. https://doi.org/10.1111/j.1365 related animalsandfungi:isnot Onemarker enough. Freshwater Biology DNAanalysisforestimatingtheabundance ofEnvironmental and biomass streamfish. https://doi.org/10.1098/rsbl.2014.0562 not marker: a perfectmatch. Hajibabaei, M. (2014).cytochromeoxidase DNAmetabarcodingandthe c subunitI (April), 1 (April), – 11. https://doi.org/10.3389/fenvs.2017.00011 - scale biomonitoringscale of remoteand threatened ecosystems via high , Molecular Marine Biology Molecular Marine andBiotechnology 26 atsu, Y., Kanno, K., Yamanaka, H.,Takahara,T., T.& Minamoto, (2017). (19), 2460 , 62 (1), 30 PLoS ONE rling, F. Multi A. H. (2012). Methods in Ecology and inEcology Evolution Methods – – Biology Letters 2461. https://doi.org/10.1093/bioinformatics/btq461 - Molecular Ecology 39. https://doi.org/10.1111/fwb.12846 210X.12789 - 294X.2012.05537.x PLoS ONE PLoS , 10 - Based Ecosystem Assessments Species Ecosystem Based Quantify PLOS ONE - (10), 1 294X.2012.05642.x -

es foranalysingsoil DNA:Potential for studying based macroinvertebrat essment. essment. Biology Letters e c oxidasee c subunitI metazoan from diverse s than faster of BLAST. magnitude

, , 11 – — 10 15. , (6), e0157505.

Sequence ReSequence 10 (9), 1789 , 21 (7), e0130324. Frontiers inEnvironmentalScience

-

locus species delimitation in speciesdelimitation locus closely (8), 1821 Molecular EcologyMolecular , –

4 1793. pecies detectionusingpecies (4), 423 , lationships with an 3 – . (5), 294 1833. e identification for - scale monitoringscale – 425. –

299. , 21 (18), 4422 (18), . -

of

, – This article isThis article by protected copyright. All rightsreserved. AcceptedJackson, J.K., J. Battle, M., White, B.P.,Pilgrim, E.M., St Hu, S.(1987). Hiiesalu, Hebert, P. N., D. Ratna Hajibabaei, M., Spall,J.L.,Shokralla,S.,&van Konynenburg, Hajibabaei, M., Singer,G. Article Hajibabaei, M., Shokralla, S., Zhou, Gómez Gillett, C.P. D.T.,Crampton Gibson, S.,Porter, J., T.King,Shokralla, Konynenburg,S.,D. M., I.,van Janzen, H., … Science communities andDNAmorphological based on identifications. barcode biodiversitya B. W.Cryptic instreams: comparison ofmacroinvertebrate (2014). 4754(87)90094 Simulation https://doi.org/10.1111/nph andrichness biomass. Species mycorrhizalfungi: richness of Associations withplant arbuscular grassland https://doi.org/10.1098/rsbl.2003.0025 Proceedings RoyalSciences ofthe Society B: Biological among subunitspecies.cytochrome c oxidase 1 divergences closely related https://doi.org/10.1186/1472 environmental barcodingof DNAfrompreservativeethanol. of afreshwaterbenthic macroinvertebrate through community non https://doi.org/10.1186/1741 in of DNA andDNAbarcodes biodiversity monitoring. arrays https://doi.org/10.1371/journal.pone.0017497 using river benthos. barcoding: Anext 883 of assemblages.and phylogenetics complex P. the powermetagenomics (2015).Validating of ecology mitochondrial forcommunity and Evolution ofelucidates thephylogeny weevils(Coleoptera: Curculionoidea). & Vogler,A.P. (2014).denovofrom assembly Bulk mitogenome pool https://doi.org/10.1073/pnas.1406468111 AcademyNational ofSciences a bulksampleoftropicalmetasystematics. arthropods DNA through Hajibabaei, microbiome in M. (2014).assessmentof and Simultaneous themacrobiome https://doi.org/10.1371/journal.pone.0138432 - Rodríguez, C., CramptonRodríguez, C., – I., Pärtel,M., Davison, J.,P., M., Gerhold, Moora, Metsis, … Wilson, M.,D. S. (2014). 894.https://doi.org/10.1111/2041 , 33 Akaike informationcriterionAkaike statistics

(1), 312 (Vol. 29). KTKScientificPublishers,Tokyo. https://doi.org/10.1016/0378 , 31 - 2

(8), 2223 singham, S.,& Barcodinganimallife: J.R.singham, deWaard, (2003). - generation sequencing approach for biomonitoring applications for generation applications approach biomonitoringsequencing – 324. https://doi.org/10.1086/675225 PLoS ONE PLoS a C., Clare, E. L., E. &a C.,Clare, P. D. Hebert, andapplicability N.(2007).Design New Phytologist New - Platt, A., Timmermans, M. J. T. N., Jordal, B. H., Emerson, Emerson, Platt, A.,Timmermans,M.N.,Jordal,B.H., C., J.T. – - 2237. https://doi.org/10.1093/molbev/msu154 Platt, A., M. Timmermans, J.T.N.,Baselga,A.,& Vogler, .12765 , - - , X., Singer,G. A. C.,&Baird,D.J.Environmental (2011). 6785 7007 111 6 (4), e17497. (22), 8007 - -

12 5 - 24 , - 28 -

203 210X.12376

(1), 233 Methods in andEvolution Methods Ecology – 8012. .

Mathematics and Computers in andComputers Mathematics ein, E. D.,ein, E. Miller,P.& Sweeney, E., –

244. , 270

S. (2012). Assessing S. (2012). biodiversity Assessing

, S96 BMC BiologyBMC BMC EcologyBMC – S99. Molecular Biology - destructive destructive Proceedings of thProceedings of ed total DNA ed Freshwater , , 5

12 (1), 24. (1), 28. , 6 (8), -

A. e This article isThis article by protected copyright. All rightsreserved. J. Macher, Loreau, M.,& asynthesis C. andecosystemstability: deMazancourt, (2013).Biodiversity of Liu, S., Wang, X., L.,Tan,M.,Xie, Z., Su,X.,Zhou,X. capture Li, … (2016). Mitochondrial B.,Wysoker,Li, H.,Handsaker, The A.,Fennell,T.,Ruan,J.,Homer,N.,R.(2009). … Durbin, short Fastandaccurate readalignmentLi, H.,&R.(2009). withBurrows Durbin, M.,Yang, C.P.,Leray, N., J. Meyer, Mills,S.C.,Agudelo, Y., Ranwez,V., R.J.… Machida, M.,&Leray, Knowlton,ofstandardized N. andmetabarcoding (2015).DNAbarcoding Lacoursière L.,Krebes, & Kelly, R. Ji, Y., D. Ashton,L.,Pedley,Y., S.M.,A.,… P., Tang, Nakamura, Yu,D.Edwards, W.(2013). Je

Accepted Article ffery, N. W., & Gregory,T. usingGenome sizeestimatesR. (2014). Feulgen forcrustaceans protocol formetagenomic studies increases mitochondrial DNAyieldby twoorders of https://doi.org/10.1111/ele.12073 underlying mechanisms. 0998.12472 Molecular EcologyResources enriches mito https://doi.org/10.1093/bioinformatics/btp352 Alignment/MapSequence format andSAMtools. https://doi.org/10.1093/bioinformatics/btp324 transf 10 fish gut contents. region for metabar (2013). versatile settargeting primer ashortA new fragmentof themitochondrialCOI Academy ofSciences revealsamples patterns diversity. ofbenthic marine https://doi.org/10.1111/1755 environmental conditions. biomass eDNAconcentrations: amongvariability from methodsand capture Proteomics in thecontrolregion. Anewgene andnon Amphipoda): order Science L. DNAtoCrowder, environmentalmanagement.B. (2014). improve Harnessing Letters Reliable, verifiable andefficient monitoring of viametabarcoding.biodiversity 862 analysisimage of ethanol densitometry - 34 P., Port, J. a., Yamahara, K.P.,Port, J.a.,Yamahara, G., R. M.,Martone, Lowell, N., P.Thomsen, F.,… – 868.https://doi.org/10.1002/cyto.a.22516 -

orm. N., Zizka, - , , Roussel, A.,Rosabal, M.,& Bernatchez, L.(2016). Estimatingfish and abundance

16 Bastrop, R. (2012). The mitogenome of Gammarus duebeni (Crustacea (Crustacea mitogenome ofGammarus(2012). duebeni Bastrop, R. The 344 (10), 1245 , Bioinformatics

(6191), 1455 7 (2), 201 - DNA 100 fold, enablingPCRDNA 100 fold,

V.M. A., Weigand,A.M.,& Leese, F.(2017).centrifugation Asimple Frontiers inZoology Frontiers coding metazoan diversity: applicationcoding metazoandiversity: for characterizing coralreef , – – Comparative Biochemistry Comparative and Physiology 11 1257. https://doi.org/10.1111/ele.12162 211. https://doi.org/10.1016/j.cbd.2012.02.004 2 Ecology Letters Ecology – (7), 2076 , 1456. https://doi.org/10.1126/science.1251156 25 Molecular Molecular Ecology , (14), 1754 - 1 0998.12522 6 (2), 470 –

2081. https://doi.org/10.1073/pnas.1424997112 , 10 - – , preserved tissues. preserved - – 1760. neutral sequence repeats sequence evolutionneutral of tandem 16 (1), 34. https://doi.org/10.1186/1742(1), 34. 479. https://doi.org/10.1111/1755 -

free free mitogenomics biodiversity analysis. , 106

Resources

– Bioinformatics 115.

Proceedings of the ofthe Proceedings National , 16 Cytometry PartCytometry A (6), 1401

, -

25 Part D: Genomics and Part D:Genomics (16), 2078(16), – 1414.

- Wheeler Wheeler , - 85 – Ecology Ecology 20 - (10), 9994 79. -

This article isThis article by protected copyright. All rightsreserved. Accepted M.,Seymour, Durance,I.,Cosby,Ransom B.J., Saitou, N., &Nei,M.(1987).Theneighbor Piñol, G., J.,Mir, Gomez M.,Pfrender, Hawkins,C.,Bagley, M., Courtney,G.,Creutzburg, B.,J.,… Epler, Whiting, M. Peng, Y., H.Yiu, M., Leung, C.M., &Chin,F. S. L.(2012).IDBA Y. O’donnell, L., R.P., Kelly, J. Lowell, C., N. Port,J. A.PCRinduce& primers (2016). Indexed Article J.,Morinière, L., Hendrich, Balke,M., Beermann, A. J., König,T., …Haszprunar,G. Hess, M., Miyamoto, M. M., M.Allard, R. W., L.L.,& M., Janecek, Adkins, Rodney, I.,Meusnier, Singer,G.A. C.,Landry,J.F. M.,Mährlein, Pätzig, Brauns,M., M., Mä chler, E., Deiner, K.,chler, E.,Deiner, Steinmann, P., Utility & F.(2014). ofenvironmental Altermatt, DNA 0005 mesocosms. S. (2018). promotes degradation Acidity of multi https://doi.org/citeulike phylogenetic trees. https://doi.org/10.1111/1755 metabarcoding of arthropods. mismatches limit theuse of high ReviewQuarterly ofBiology and Challenges inDNA (2010). Macroinvertebrate Assessing Biodiversity inFreshwaterEcosystems: Advances Bioinformatics single https://doi.org/10.137 template 1293 (Ephemeroptera, Plecoptera andTrichoptera). (2017). barcode forGermany′slibrary A DNA stoneflies andcaddisflies mayflies, Biology Congruence Test of Using Reliability Linked MitochondrialDNASequences. 214. https://doi.org/10.1186/1471 DNAmini (2008). A universal Hydrobiologia macroinvertebrateslake corrected forback 1174 andindicatorfor monitoringrare macroinvertebrate species. https://doi.org/10.1111/2041 magnitude. – – - 3 - 1307. https://doi.org/10.1111/17551307. https://doi.org/10.1086/678128.1183. cell andcell metagenomic sequencingdatawithhighlyunevendepth.

, 43 -

Specific biasin Large (2), 236 Methods in and Methods Ecology Evolution

Communications BiologyCommunications , , 768 28 (11), 1420 – - (1), 37 Polo, N. Universalandblockingprimer P.,&Agustí, (2015). Molecular Biology and E 249. https://doi.org/10.1093/sysbio/43.2.236 - 1/journal.pone.0148698 based ApproachesReview TheQuarterly of - article – 50. https://doi.org/10.1007/s10750 , – - 85 1428. https://doi.org/10.1093/bioinfo barcode for barcode biodiversity analysis. - - 210X.12937 0998.12355 Molecular Ecology Resources Molecular - - (3), 319

Scale DNA sequencing studies.Scale DNAsequencing id:93683 & Dolman,A.M.Length (2016). - throughput DNAsequencing forthequant throughput - 2164 , Hickey,D. P. A.,Hebert, M. D.N.,&Hajibabaei, - , joining method: a new method method:anewmethod forreconstructing joining 1 – (1), 4. https://doi.org/10.1038/s42003(1), 4. 340. https://doi.org/10.1086/655118 -

9 - Jones, E., Deiner, K., Ormerod, S. J., … Creer, Jones, E.,Deiner,K.,Ormerod,S. J.,…Creer, -

- 0998.12683 214 - volution transformation and preservationtransformation effects.

, 2017 Molecular EcologyResources

- species environmentalspecies DNAin lotic

(0), 1 , 4 (4), 406

– 5. - UD: Ade novo assemblerfor , 15 Freshwater Science – - PLoS ONE BMC Genomics BMC 015 (4), 819 25. – mass relationshipsfor

- Biology. 2526 rmatics/bts174

(1994). A – 830. , - 4 11

Systematic Systematic itative Source: The Source: , (3), 1 17

, - 017 9 (6), (1), , – 33 11.

- , This article isThis article by protected copyright. All rightsreserved. AcceptedWeiss, S., S.,Amir,Xu, Z.Z.,Peddada, A., K.,Bittinger, A.,…Knight, Gonzalez, R.(2017). Weiss, H.,Li, Weigand, H., M.,Cai, C., Y., Yu,L., & Leese,F.(2017).Decipheringthe Zhang, Thomp Thomas, Deagle,B. A.C., E., J. P.,Eveson, Harsch,C.H C. Computing.Team, R. (2015). R:ALanguageandEnvironmentforStatistical Vienna, Tang, M., Tan,Meng,G.,sequencing Su,X.,S.,…X.(2014). Yang,Liu, Zhou, Multiplex S., Article C.J.,Tang, M., Hardman, Ji,Y., G., Tan, Meng, Yu,D. Liu,S., W.(2015). M., … High Tamura, K.,Dudley,J.,Nei,M., S. &(2007). MEGA4:MolecularEvolutionary Kumar, Genetics B.Sweeney, T.of W., J. J.K.,barcodes Battle, M., Jackson, &Dapkey, (2011).CanDNA F., Sinniger, Pawlowski, J., S.,Gooday,Harii, Chevaldonné,P., A. Yamamoto, H., …Creer, J., Shokralla, S.,Porter, W., T.M.,Gibson,J.F.,R.,Janzen,D.H.,Hallwachs, … Dobosz, characteristics. Normalization strategies upon anddifferential abundance depend data microbial 26 origin of mito 4 gap specific and choice. penalties weightmatrix alignmentof progressivemultiple sequence weighting, throughsequence position 714 correction derivedfromcontrol factors material. DNA metabarcoding: of Improvedestimates species proportional biomassusing Computing. RFoundation Austria: Statistical forStatistical Computing;2014.for RFoundation https://doi.org/ mito of pooledmitochondrialgenomes 210X.12416 in and Methods Ecology Evolution of wildthroughput monitoringdiversityand bee abunda 1599. https://doi.org/10.1093/molbev/msm092 (MEAnalysis https://doi.org/10.1899/10 quality? macroinvertebratesstream improve descr https://doi.org/10.3389/fmars.2016.00092 Knowledge of Deep of S. (2016).Analysis SedimentaryDNAReveals Worldwide Maj https://doi.org/10.1038/srep09687 identificat Hajibabaei, M. (2015). parallel Massively multiplex DNAsequencing for specimen 680. https://doi.org/10.1093/nar/22.22.4673 son, J.son, D., D. & Higgins, G., Gibson, W: improving T.J.CLUSTAL thesensitivity (1994). (20), 5705 – - 726.https://doi.org/10.1111/1755 metagenomics. metagenomics. Journal North ofthe American BenthologicalSociety ion using MiSeqplatform.an Illumina

– GA) softwareversion4.0. - 5715. https://d nuclear discordance in discordance two caddisflyspecies. nuclear sibling Microbiome 10.1093/nar/gku917 - Sea Benthos. Nucleic AcidsResearch - , 016.1 5 oi.org/10.1111/mec.14292 (1), 27. https://doi.org/10 , Frontiers inMarineFrontiers Science — 6

a crucial stepa crucial toward using biodiversityanalysis (9), 1034

Molecular Biology and Molecular Biology Evolution - 0998.12490 iptions of community structure iptions structure ofandwater community

, – 42 1043. https://doi.org/10.1111/2041

(22), e166

Scientific Reports Nucleic Acids Research ., &Trites, A. W.(2016). Quantitative Molecular ResourcesMolecular Ecology

.1186/s40168

nce viamitogenomics.nce – e166. , , 30 3 , 92. or Gaps in Taxonomic or GapsinTaxonomic (1), 195 , 5 Molecular Ecology , 9687. - 017 , , – 24 22 216. - (8), 1596 (22), 4673 0237 - , 16 - y (3),

- - – – , This article isThis article by protected copyright. All rightsreserved. performed prep,metabarcoding library ranbioinformatics of metabarcoding data IB theexperimentsIB, SC,GRCdesigned Author contributions accession(Study The dataforthis work canbeaccessed Nucleotide attheEuropean Archive (ENA) database Data accessibility Zhou, X.,Y., Yang,Q.,Zhou,L.,Q. Li, Huang, Liu,S., Su,X., (2013). … Ultra H. A., Zhan, & MacIsaac, J. biosphereexploration Rare (2015). using high A.,S. Zhan, H.Bailey, Heath, D. Macisaac, D.,& J. Performance (2014). comparisonof Yu, D. W.,Ji,Y., Eme X.,Yang, Miller,J.A.,M.,Ji,Y.,C., Wang, De Blécourt, Yan Y.,Xie, Wu,G., Tang, Luo,R.,Patter J., G.,Woodward, & Gray,C., (2013). the21stnew Baird,D.J. Biomonitoringfor Century:

Accepted measurements, performed optimised andperformed morphometric DNA extractions and Article amplification. highenables 522. https://doi.org/10.1007/s10592 sequencing: progressresearch perspectives. and https://doi.org communities.complex markersgenetic for high https://doi.org/10.1111/j.2041 biomonitoring. and assessment arthropodsforrapidbiodiversity of soup: Metabarcoding https://doi.org/10.1016/j.ecolind.2014.06.028 biodiversitygeneral indicator. metabarcoding tocollected askifeasily soilleaf and 1660 De novo withassembly transcriptome short RNA 32 perspectives threats. globalisation andemerging inanageof environmental (2), 159 – 1666. https://doi.org/10.1093/bioinformatics/btu0771666. statistical analysis. SS,MHran – 174. PRJEB23036 - fidelity recoveryfidelity ofbiodiversity forbulkarthropod sampleswithoutPCR GigaScience /10.1111/1755 rson, B. C., Wang, X., B.C.,rson, Wang, Ye,C.,Yang,&Ding,Z. (2012).Biodiversity Methods in Ecology and inEcology Evolution Methods

Molecular EcologyResources ). - throughput sequencing throughput

, 2 (1), 4. https://doi.org/10.1186/2047 - Ecological Indicators 0998.12254 - 210X.2012.00198. . son, S.,J. J., Liu, … Wang, (2014).SOAPdenovo

KW, MH, XZ, MC contributed MCcontributed XZ, KW, MH, the - 014 metabarcoding sequencing. MT, SL, XZ - 0678

- 9 - - Conservation Genetics , based biodiversity assessmentbased biodiversity in

Seq reads.Seq x 3 ,

(4), 613 - 46 , g, C., … Yu, D. W. (2014). Using g, C., … Yu,D.W.(2014).Using litter canbe samples usedasa 14 , 379 (5), 1049 – –

Bioinformatics 623. 389. to – - . MS andIB. MS 217X experimental 1059. - - throughput throughput deep sequencing deep - , 2 16 - 4

, (3), 513 30 Limnetica (12), - design. Trans: ran – , This article isThis article by protected copyright. All rightsreserved. an invertebrate IB specimens. wrotefirstversionSC,GRCeditedmanuscript, ofmanuscript. sequen

Accepted Articled allauthorsprovidedcomments cing analysis ofand bioinformatics mitogenomics

.

.

DB taxonomically identifiedDB taxonomically This article isThis article by protected copyright. All rightsreserved. community. foreachmock positive control community. in (lightgrey) each abundance highest grey) (dark and abundance lowest with thespecies Highlighted Last col used. species Table 1 Accepted ArticleNumber 10 14 13 12 11 9 8 7 6 5 4 3 2 1 :

Design of mock communities. See columns for the detailed contents of each community (1 community each of thecontents for detailed columns See communities. mock of Design Total Number of species of Total Number Total specimens Drosophila melanogaster Gammarus pulex Bithynia leachi Notonecta glauca Physa fontinalis Radix balthica Potamopyrgus antipodarum planorbis Gyrinus marinus Ephemera danica Bithynia tentaculata Asellus aquaticus Anisus vortex umn: total number ofspecimens/ totalnumber umn: Species

157 14 12 10 10 24 16 24 14 35 3 2 1 3 2 1 1 species, bottom line: total number of specimens/ numberof total line: species, bottom 157 13 15 32 25 10 13 40 3 5 3 0 3 1 3 4 2

145 13 28 19 12 45 3 6 5 0 4 5 3 1 6 8 3 149 14 17 25 22 10 25 11 10 3 4 1 4 6 6 5 4 155 14 16 21 26 10 14 25 3 8 9 2 8 1 4 8 5 Community - 10), with numbers referring to specimens from each from specimens to referring numbers with 10), 146 14 11 10 10 33 12 17 20 3 8 1 4 8 1 8 6 community and number of species/ numberof and community 139 13 12 12 14 10 27 19 15 3 1 8 0 7 5 6 7 Drosophila melanogaster Drosophila 139 14 13 17 10 18 15 21 10 3 8 7 6 1 9 1 8 151 13 13 13 13 14 20 24 30 3 3 0 9 1 6 2 9 community. was used as used as was 141 10 13 14 16 20 26 24 3 7 8 0 6 5 7 4 1 per species Specimens Specimens 1479 186 141 108 180 142 226 30 52 83 31 70 94 55 81

This article isThis article by protected copyright. All rightsreserved. Total (mg) marinus Gyrinus pulex Gammarus Asellus aquaticus glauca Notonecta Radix balthica antipodarum Potamopyrgus Physa fontinalis tentaculata Bithynia leachi Bithynia planorbis Planorbis contortus Bathyomphalus Anisus Species Table 2

Accepted danica Ephemera Article

vortex

:

Estimated biomass Estimated

for each species included species for each inthemockcommunities 370.25 21.69 71.92 30.49 1.00 77.38 20.47 3.82 1.16 115.60 11.97 8.87 0.79 5.08 1 Community

255.46 9.36 14.24 54.70 4.92 0.00 89.72 13.37 4.14 46.33 3.19 9.04 0.70 5.75 2

211.02 30.00 4.18 75.11 9.00 0.00 31.37 11.66 4.78 26.58 4.52 6.75 0.65 6.44 3

479.23 104.94 28.62 51.18 13.02 33.88 106.12 9.89 7.91 113.80 0.94 7.66 0.56 0.70 4

464.82 40.35 35.93 90.38 19.50 16.09 101.69 8.10 10.81 128.77 8.95 0.37 0.47 3.41 5

. 377.48 85.77 54.17 104.72 22.01 7.47 57.27 13.87 13.10 4.01 10.30 1.56 0.40 2.84 6 Values are presented are inmilligramsValues (mg).

371.80 52.20 46.02 10.13 22.68 0.00 70.67 5.27 16.42 135.45 7.45 3.10 0.31 2.10 7

503.18 98.29 80.35 128.80 27.56 46.54 6.08 6.46 17.02 79.47 6.61 4.53 0.05 1.42 8

395.20 60.07 62.88 42.39 30.73 0.00 53.80 0.36 18.70 104.71 12.50 4.77 0.10 4.21 9

524.22 72.01 86.31 83.34 37.72 62.09 35.91 1.74 0.00 124.64 14.54 5.55 0.22 0.13 10

This article isThis article by protected copyright. All rightsreserved. Table I-130R I-B1 HC02198 LCO1490 Accepted ArticlePrimer Name 3 :

COI primers used primers COI for metabarcoding. GAAAATYATAAIGAAIGCRTGAGC CCHGATATAACITTYCCICG TAAACTTCAGGGTGACCAAAAAATCA GGTCAACAAATCATAAAGATATTGG Primer Sequence Primer Sequence

Direction

R R F F Meusnier Meusnier Hajibabaei Folmer Folmer et al. et al. et et al. et et al et 1994 1994 Citation 2008 (modified) . 2012. (modified)

Number 13 12 11 10 9 8 7 6 5 4 3 2 1

Accepted isThis article by protected copyright. All rightsreserved. Article( with indicated are (<0.05) relationships Significant t The normalised). ratio SumAmpl 658bp, FFFR: 130bp, Table

4 Gyrinidae Ephemeridae Gammaridae Asellidae Notonectidae Lymnaeidae Hydrobiidae Physidae Bithyniidae Bithyniidae Planorbidae Planorbidae Family Taxa :

Summary table of significance of of ofsignificance table Summary

ype of model used used ofmodel ype Radix balthica antipodarum Potamopyrgus Physa fontinalis tentaculata Bithynia leachi Bithynia planorbis Planorbis contortus Bathyomphalus Anisus vortex Species Gyrinus marinus Gyrinus danica Ephemera pulex Gammarus Asellus aquaticus glauca Notonecta icon

: sum of all amplicon data per species), and shotgun data (“Shotgun”, (“Shotgun”, data shotgun and dataperspecies), ofallamplicon sum :

reads to biomass to biomass reads

is

indicated by shading (light grey: shading by indicated

*). *).

correlations <0.01* 0.04 0.52 0.05* 0.02* 0.03* 0.04* 0.25 0.58 0.11 0.03* <0.01* 0.01* B1FR Amplicon

,

for each sequencing treatment. Amplicon data (“Amplicon”, B1FR:450bp, FF130R: FF130R: B1FR:450bp, (“Amplicon”, data Amplicon treatment. each for sequencing <0.01* 0.02* 0.06 0.07 0.01* 0.01* <0.01* <0.01* 0.27 0.01* 0.06 0.06 <0.01* FF130R

logistic

, white:

linear <0.01* 0.06 0.32 0.15 0.51 0.03* 0.02* 0.57 0.37 0.04* 0.09 0.07 0.06 FFFR

). Shotgun data not shown for species species for shown not data ). Shotgun

pShotgun: proportion of reads, Mit ofreads, proportion pShotgun: <0.01* 0.02* 0.46 0.06 0.09 0.01* 0.02* 0.04* 0.46 0.03* 0.07 0.02* 0.02* SumAmpl

icon

<0.01* <0.01* <0.01* 0.02* 0.01* <0.01* <0.01* <0.01* 0.08 <0.01* 0.01* <0.01* NA pShotgun Shotgun

A. vortex A.

oNorm: mito oNorm:

0.01* <0.01* <0.01* 0.03* 0.02* 0.01* <0.01* <0.01* 0.09 <0.01* 0.01* <0.01* NA MitoNorm (NA). (NA).

-

This article isThis article by protected copyright. All rightsreserved. Accepted Article

This article isThis article by protected copyright. All rightsreserved. Accepted Article

This article isThis article by protected copyright. All rightsreserved. Accepted Article