Posted on Authorea 10 May 2021 | The copyright holder is the author/funder. All rights reserved. No reuse without permission. | https://doi.org/10.22541/au.162067941.16111128/v1 | This a preprint and has not been peer reviewed. Data may be preliminary. pdrCtlg 01,i ihydvregopo rdtr htcnb on nnal l terrestrial all nearly in found be can that (World predators species of known group ~49,000 diverse least highly at 2020; with a 2021). Arachnida, al., al., is within et et 2021), Sheffer fact, group Thomas 2021; Catalog, In largest al., 2018; incomplete. Spider et the (Fan al., and Araneae), date et fragmented to (order currently reported highly (Garb Spiders are been them group have there of genomes in this Acari), spider most interest from chromosome-level (mites, two being special species Garb, S´anchez-Gracia, control only natural 2018), and of & pest bioactive 2004; representation spiders), sequences Rozas, or Walter, from great Vizueta, genome scorpions), (silk and available and this sciences L., few spiders Despite material very 130,000 from as Prendini, (about such toxins 2014). S., Arachnida areas, (venom al., research class M. compounds et translational the Harvey, and Sharma to G., basic belong 2018; many which Giribet, Ayoub, Chelicerata of & them, A., most Among Sharma, Jonathan species, Trilobita. arthropod (Coddington, extinct Chelicerata, of namely about species; the 10% subphyla, includes and extant about and four Hexapoda, for ecosystems, encompasses accounts and all It (paraphyletic) in 2013). found Crustacea 2011, virtually (Zhang, Myriapoda, Earth, species animal on known phylum of species-rich 80% most the is Arthropoda of genomes, biology chelicerate extraordinary of the in organization Introduction played and families structure gene large the and This into elements insights them. repetitive among of gain variants, third many structural to spiders. the of that just significantly resource origin role and a valuable recent clade, the with a Synspermiata new including clusters the suggesting genomic a of family, 83 representative representing the in genome of high-quality spiders, localize average first them the the of is than 54% assembly the them least chromosome-level illustrate At between (i.e., To distances families with chromosome. pseudochromosomes, gene evolutionary X chromosome. chemoreceptor all lower the across arthropod X in distributed major large sequences underrepresentation with two chemoreceptor notable characteristic consistently the previous 545 of a identified match the We the analysis and including comprehensive receptors). to size a ionotropic species, assembly and performed respect gustatory this we total with conformation resource of the new chromosome times karyotype of this the 4,500 of the 87% using value cover of than performed pseudochromosomes chromosomes more was or seven Mb), of scaffolds the 174.2 improvement largest seven continuity of adaptive mostly of The N50 a drivers archipelago scaffold genomic version. represents this the and technique, study in Gb; to diversification scaffolding model (1.37 spider remarkable excellent capture assembly an ground-dwelling a new becoming undergone nocturnal specialization, The has trophic a of Dysdera radiations. level 1981, genus the Schmidt, in The shifts silvatica with Islands. Dysdera associated Canary of the assembly from genome endemic chromosome-level the present We Abstract 2021 10, May 2 1 Sanchez-Herrero spiders in Escuer families Paula gene chemoreceptor of structure genome and spider endemic silvatica Island Dysdera Canary the of assembly Chromosome-scale ntttdIvsiaioe iece el au emn ra Pujol i Trias Germans Salut la Ci`encies d’Investigaci´o de en Institut Barcelona de Universitat 1 ai Pisarenco Vadim , 2 iulArnedo Miquel , Aahia rna)seslgto h origin the on light sheds Araneae) (Arachnida, 1 ne Fern´andez-Ruiz Angel , 1 ljnr S´anchez-Gracia Alejandro , 1 1 olVizueta Joel , 1 n ui Rozas Julio and , 1 Jose , 1 Posted on Authorea 10 May 2021 | The copyright holder is the author/funder. All rights reserved. No reuse without permission. | https://doi.org/10.22541/au.162067941.16111128/v1 | This a preprint and has not been peer reviewed. Data may be preliminary. h e eoeasml of assembly genome new The assembly Genome and Sequencing DNA the Sampling, of understanding Methods our gene improve and chelicerates. large to Material in and contribute evolution inaccessible elements, will genome is repetitive and data that variants, organization, genomic data structural structure, a high-quality variation as islands on this performed in such Additionally, based diversification and genomes, families. analyses rapid fragmented genome genomic underlying available new basis comparative this currently This annotated molecular allowing in in scale. the by and chromosome-level of identified shifts families at studies have ecological members gene further we and family chemoreceptor foster 2.0, all Dovetail will version of arthropod resource libraries; clustering the genomic physical major of Hi-C the utility two assembly of and enhanced continuous analysis the (Chicago highly the comprehensive improved, of of libraries an example obtain members ligation to an proximity the 2016) As al., used genome. et this we (Putnam of pipeline point, HiRise starting the and a genomics), as 2019) species al., the 2.0 of & assembly version chromosome-level Porter, draft, populations, high-quality Mentzer, first between Gurdasani, the Pollard, divergence present 2016; We and (Bleidorn, hybridization speciation 2019). of in Saha, regions, impact evolution 2018; repetitive chromosomal of Sandhu, and of analysis genes impact role of the the the distribution continuous families, processes, or content, gene highly base adaptive of a recombination, evolution having on the (e.g., and of va- etc..) features of structure benefits structural genome genome meaning the number the of evolutionary a of of identification of and examples study the the clear biological chromosomes, are Some or the assembly the organization. clusters, chromosome-level understanding gene across gene for and mapping prevented and fundamental structure gene it genes are genome resource, frag- as long features research very such fruitful very These a continuity, a in of riation. greater been resulted annotation requiring has (53.8%), draft comprehensive aspects species first the genomic this this of While of kb). sequences study which, 38 genome the reads, of the short (N50 of on draft nature based genome of repetitive was mented genome high assembly the the the of sequenced to first most we added the Nevertheless, genus, 2019). generated this al., (S´anchez-Herrero and for et strategy genome Gb) make reference (~1.37 a shifts 1981 obtaining eco-phenotypic Schmidt, of independent aim multiple the with With coupled proliferation species different dera in of recurrently rates occurred high have shifts The diet these species, rejected continental usually in areas. these as geographic in Islands, accumulated Canary metals the heavy In and 2017). cap- 1985; morphological substances Martin, to toxic by & adaptations the (Hopkin accompanied physiological assimilate prey and was to strategies) and on specialization hunting feeding woodlice trophic (unique in ture behavioral This specialized mouthparts), obligatorily of even Isopoda). or (modifications facultatively (Crustacea: species) woodlice island (Pek´ar, and terrestrial L´ıznarov´a, spiders ( continental generalists group (both mostly & this species the in some across diversification specialization) island prey of (i.e. drivers phagy main preferences the dietary of in one Mac´ıas-Hern´andez, as Shifts identified & S´anchez-Gracia, 2019). been spec- have & most Mac´ıas-Hern´andez, the Rozas, Mac´ıas-Hern´andez, Orom´ı, of Vizueta, M´urria, Arnedo, 2001; Arnedo, one (Arnedo, Mac´ıas-Hern´andez, Orom´ı,2016; representing Ribera, L´opez, & spiders Orom´ı, Roca-Cusachs, archipelago, & within & Arnedo, Islands islands 2007; Approximately Canary on family. Ribera, the diversification Dysderidae from of the endemic examples are of genus tacular genus diversity this ground-dwelling the its of nocturnal of delimitate species The half 50 and 2020). nearly phylogeny represents al., their distribution, et elucidate Mediterranean Kallal 2017; to al., helped et Dysdera greatly (Wheeler have lineages studies evolutionary Recent main 1). (Figure ecosystems necletmdlfrudrtnigtegnmcbsso dpierdain Vzeae l,2019). al., et (Vizueta radiations adaptive of basis genomic the understanding for model excellent an aril 84 hc otis26seis(ol pdrCtlg 01,msl ihacircum- a with mostly 2021), Catalog, Spider (World species 286 contains which 1804, Latreille ) sn h rtvrino h eoeasml fti pce (S´anchez-Herrero et species this of assembly genome the of version first the Using . Rez´aˇcov´a, Indeed, 2021). ˇ Rez´aˇc, 2008; Lubin, Pek´ar, & ˇ .silvatica D. a banduigtepeiu eoeasml dntdas (denoted assembly genome previous the using obtained was Dysdera enovo de 2 nldssm ftefwrpre ae fsteno- of cases reported few the of some includes Rez´aˇc Mac´ıas-Hern´andez, & Toft Pek´ar, 2007; & eoeasml fti pce sn hybrid a using species this of assembly genome ˇ .slaia(.silvatica (D. silvatica D. Rez´aˇc, Pek´ar, Arnedo, ˇ Rez´aˇc, with 2016), ˇ ydr silvatica Dysdera genome Dys- Posted on Authorea 10 May 2021 | The copyright holder is the author/funder. All rights reserved. No reuse without permission. | https://doi.org/10.22541/au.162067941.16111128/v1 | This a preprint and has not been peer reviewed. Data may be preliminary. ;TbeS) swl salohraahis rhoosadedszaseissree nSnhzHreoet Sanchez-Herrero in surveyed species ecdysozoa and arthropods arachnids, other all as well as S1), Table 1; Araneidae the and iorum with together Araneomorphae (Sicariidae), suborder mimosarum the 1940 of Mulaik, representatives & the and Mygalomorphae, order namely Araneae, order the of across homologs for searched We encoded (tRNAs) genes RNA searches Homology transfer pathways the and detected enzymes We KEGG of 2000). five associate 2019). sequence Goto, top to genomic (using & Blast2GO the BLAST (Kanehisa, the used in genes from We annotated inherited the obtained results. were to InterProScan 2000) and al, hits) et (Ashburner significant evidence. terms InterProScan functional (GO) in all ontology signatures ArthropodDB). integrated Gene domain of and protein description 2014) specific detailed al., a for et for peptides (Jones 2017, predicted (including v5.31.70.0 al., the the et of searched genus Vizueta annotation also this functional (see we the Furthermore, databases of perform ArthropodDB To ( species the 2019). searches of al., five v2.4 version et BLASTP from used (Hoff we (–etpmode) sequences models, evidence gene orthologous as 2019) & and both al., Baertsch, et using 2017) Diekhans, (Vizueta 2006), Stanke, al., Waack, 2019; et & Stanke, Sch¨offmann,(Vizueta & Morgenstern, Borodovsky, Stanke, Lomsadze, 2008; Hoff, Haussler, (Br˚una, 2021; Lomsadze, BRAKER2 Borodovsky, with Hoff, accomplished & was Stanke, genome soft-masked the of annotation Structural annotation steps. option genome annotation (-xsmall) functional genome the and the using Structural sequences in repetitive used RepBase- these be and the for to Dfam_Consensus-20171107 masked used RepeatMasker plus was we of assembly RepeatModeler, new analysis with the The generated the of databases. details), For 20181026 1 more version RepeatMasker). for the a Green; (2019) similarity-based for of & al. a built Hubley, combination and database (Smit, RepeatModeler), elements a v.4.0.7 Hubley; repetitive using & RepeatMasker assembly (Smit, with new v1.0.11 search RepeatModeler the with in identification regions ments repetitive analyzed We datasets. genes) 2934 255 (odb10; identification (odb10; arachnida eukaryota elements the Repetitive and searching data genes) 2019), Single Zdobnov, short-read Universal 1013 Benchmarking & (odb10; illumina of Manni, arthropoda Seppey, pipeline using the genes), 5.0.0; 2020) applying v. by Liu, (BUSCO; assembly as Orthologs new & Copy identified the Sun, of ( unequivocally Fan, completeness assembly the individual (Hu, and determined final NextPolish single library) The with (one a libraries. Hi-C polished from sequencing library), further lane) (one and one Chicago 2016), platform; of ten newly combination X The a California). silvatica (HiSeq Cruz; as paired-end (Santa obtained 150-bp Genomics was Illumina Dovetail in data performed sequence were generated sequencing and extraction DNA genus as the laboratory of the female -80 in single at libraries. identified a Islands), generated Hi-C We using (Canary 2019. S´anchez-Herrero and al. libraries Gomera in et 1 Chicago sequencing Table see using latest libraries; sequencing the scaffolded of hybrid types further a four and using was individuals five generated which was assembly 2019), 1 al., Version S´anchez-Herrero et 1; version ° ni t use. its until C C .Kc,14)and 1841) Koch, L. (C. eeec eoevrin2 a eeae sn h iiesaodn otae(unme al., et (Putnam software scaffolding HiRise the using generated was 2) version genome reference aei 83and 1883 Pavesi, rcoehl clavipes Trichonephila .silvatica D. .dumicola S. .silvatica D. ardcu hesperus Latrodectus cnhsurageniculata Acanthoscurria sn h RAcnS 207sfwr Ca,Ln a,&Lowe, & Mak, Lin, (Chan, software v2.0.7 tRNAscan-SE the using ntegnm aao ersnaie fabodtxnmcrange taxonomic broad a of representatives of data genome the in ook 88(rsde,adteAraneoidea the and (Eresidae), 1898 Pocock, Lnau,16)and 1767) (Linnaeus, .silvatica D. E vle=10 = -value enovo de hmeln&Ii,13,bt ntefml Theridiidae, family the in both 1935, Ivie, & Chamberlin 3 .silvatica D. .silvatica D. ebro h ld Synspermiata, clade the of member eoeasml ycmiigifrainfrom information combining by assembly genome .silvatica D. -5 C .Kc,14)(hrpoia) ntesub- the in (Theraphosidae), 1841) Koch, L. (C. gis CIn,SisPo,ada updated an and Swiss-Prot, NCBI-nr, against ) rip bruennichi Argiope serslsaddsuso eto) We section). discussion and results (see Dysdera rzni irgnlqi n stored and liquid nitrogen in frozen , rf eoeseS´anchez-Herrero et see genome draft ape nMrh21 nLa in 2012 March in sampled , ooclsreclusa Loxosceles enovo de .silvatica D. Sooi 72 (Figure 1772) (Scopoli, aataoatepidar- Parasteatoda rnpsbeele- transposable .silvatica D. Nsqdata RNAseq Stegodyphus Gertsch D. ) Posted on Authorea 10 May 2021 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.162067941.16111128/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. epromdacmrhniecrto falmmeso h ao hmrcpo eefmle encoded families gene chemoreceptor major the of members ( of all genome of the curation in comprehensive a performed We genes chemoreceptor the of Annotation orthogroups 2017). for searched 2005) also spiders IV, we tick Finally, Black the positive). & namely a database, Piesman, as 2019) hit al., among a et (Kriventseva relationships consider to homology ( length searches establish BLASTP alignment and of >30% series of a by filter conducted a was search The (2019). al., lsee fte r ragdwti eoi einta pn esta eti cut-off certain than less spans that region genomic a within as copies arranged paralogous that are the consider they classified we We if scaffold. Operationally a clustered and clustered”. (forming physical family “non pairwise closer gene and of particular physically distribution a “clustered” are the of analyzing family members by the gene chance between by chemoreceptor distances expected given than a pseudochromosomes the of in members cluster) the whether determined We minimum analysis ( the and and genes computed definition different GR to Cluster we the attributed family, for unequivocally chemoreceptor (2018). acids, using be al. each recovered least, amino could et For be at that 180 Vizueta fragments. not contains, sequences and could protein chemoreceptor “partial” (235 The that of encoded as family number models input. the classified gene the of as incomplete length were of sequence remaining the characteristic Apollo genome The if length respectively). and “complete” domain proteins, as browser models protein IR/iGluR profiles gene genome gene the a Apollo (HMM) the annotated classified of model We in 80% the Markov necessary, 2013). when hidden using al., re-annotated and et and and (Lee set validated, (2018) were data al. proteins sequence identified et resulting homologous S´anchez-Gracia, Vizueta the Vizueta, in with 2020; used along Rozas, S´anchez-Gracia, & 2020), Escuer, Rozas, (Vizueta, & BITACORA pipeline the used S MA)fralsqecs(h uldt e,cmrs h opeeadprilcpe dnie in identified copies partial and in complete identified the (MSA) comprise sequences alignments set, complete sequence data only multiple full with (the three (MSAc) sequences build silvatica MSA all to for a 2013) (MSAf) generated MSA Standley, we & First, (Katoh family. silvatica 7.475 gene each v. per Mafft used We alignments sequence and scaffolds. Multiple 2016), across Schlesner, clustering analyses & gene evolutionary Eils, of and (Gu, visualization Phylogenetic ComplexHeatmap the package facilitate R to the heatmaps with as processed plotted were copies more) family (or gene two chance by conservative weguarantees finding Here of distribution, clustered. probability (Poisson are 0.0004 the the they in genome, that lower the consider even to across (and family Mb given 3.32 a every of of copies value two the between set distance maximum the is where (2007): Rozas S´anchez-Gracia, & Vieira, Ir/iGluR C tgdpu mimosarum Stegodyphus L sn h at–dfamnsoto –epegh oainfamn pril eune othe to sequences (partial) fragment align to (–keeplength) option –addfragments Mafft the using ) sn h -N- loih ihotos–oapi mxtrt 00 hn ebitasecond a built we Then, 1000. –maxiterate –localpair options with algorithm L-INS-i the using stemxmmlnt facutrta otistoo oecpe ftesm aiy and family, same the of copies more or two contains that cluster a of length maximum the is aiis(iut,Ece,FısLpz ta. 00 iut ta. 08.Frti ak we task, this For 2018). al., et Vizueta 2020; al., et Frıas-Lopez, Escuer, (Vizueta, families ) g o10k.Tegn est fthe of density gene The kb. 100 to .silvatica D. C L egh o h w hmrcpo aiis ariepyia itne between distances physical Pairwise families. chemoreceptor two the for lengths , λ h mite the aeyteGsaoyrcpo ( Gustatory-receptor the namely , .31;this 0.0301); = Sngade l,21)and 2014) al., et (Sanggaard Gr erncu urticae Tetranychus aiy.Asmn nfr itiuino eefml members family gene of distribution uniform a Assuming family). .silvatica D. C p L vlei vnlwrfrthe for lower even is -value = g xdsscapularis Ixodes Ir 4 ( n aiyi the in family − n h rcnd nlddi h rhD (v10) OrthoDB the in included arachnids the and 1) .L oh 86(rice l,2011) al., (Grbi´c et 1836 Koch, L. C. n aataoatepidariorum Parasteatoda Gr lsl ikdgnsfo eefml are family gene a from genes linked closely n oorpc(ltmt)receptor (glutamate) Ionotropic and ) .silvatica D. E a,12 Ulan ia Guerrero, Lima, (Ullmann, 1821 Say, Ir vlectff<10 < cutoff -value ee na10k tec is stretch kb 100 a in genes Gr aiy hs h selected the Thus, family. eoei bu n copy one about is genome C -3 Shae tal., et (Schwager L eas applied also we ; au following value S MIN , n the and sin as ) p D. D. = g g Posted on Authorea 10 May 2021 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.162067941.16111128/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. eefml ois setmtdas: estimated is copies, family gene where ie,uigMA;seaoe.Frteaayi eue h etfi mn cdsbttto oe found model amino substitution whether acid test To amino replicates. best-fit bootstrap the ultrafast used 1000 we from species analysis support both from the node proteins For estimated complete and only above). and IQ-TREE, see by Arthropods), in MSAp; families using these for (i.e., far so genome annotated identified best families gene chemoreceptor melanogaster the Drosophila of members of genome the the among in estimated relationships phylogenetic those the from inferred We different significantly are the scaffold) analyses of particular Phylogenetic copies a between distances (in evolutionary genes. clusters unclustered the genomic across whether in determine to family U-test same Mann–Whitney the used We i where as: estimated is genome), the across (or scaffold same the of clusters all across And sequences between distance evolutionary the where We genes. unclustered to attributable is that genome). distance whole evolutionary the of the means of by computed proportion distances evolutionary the and measures physical which respectively). between families, software relationship model Ir and the 10.2.4 substitution (5 investigated and We sites MEGA-CC JTT Gr al., among the the with variation et for rate using performed heterogeneous (Minh classes 2012), gamma-distributed was discrete 2.1.2 with Tamura, 7 analysis 1992), & Thornton, The v. amino & Peterson, Taylor, per software Stecher, (Jones, replacements comparisons. IQ-TREE acid (Kumar, pairwise amino by version) of all number found (command-line the across model as site) (measured substitution distances acid evolutionary acid the amino estimate to best-fit 2020) the each used for We MSA third distances a evolutionary generate versus Physical to algorithm of genome L-INS-i the the from in used sequences annotated complete also the we included Finally, which family, MSAc. computed previous and D j n D n C . T h vrg ftepiws vltoaydsac ewe oisfo ihnacutr averaged cluster, a within from copies between distance evolutionary pairwise the of average the , stenme fgn aiymmesi h uvydsaod(ri h niegnm) and genome), entire the in (or scaffold surveyed the in members family gene of number the is stenme fcpe ncluster in copies of number the is C h vrg ftepiws vltoay(mn cdrpaeet e ie itne between distances site) per replacements acid (amino evolutionary pairwise the of average the , ST neednl ntetoceoeetrfmle,adsprtl o ahsaod(rfrthe for (or scaffold each for separately and families, chemoreceptor two the in independently C .silvatica D. ST setmtdas: estimated is hmrcposi h nlssa vltoayrfrne ( references evolutionary as analysis the in chemoreceptors rspiamelanogaster Drosophila ytemxmmlklho ehdipeetdi QTE.W loincluded also We IQ-TREE. in implemented method likelihood maximum the by D D Ck k T D C i and , ST = C = and = n = n ( ( dij n j. m .silvatica D. n 1 D 2 5 − 2 − egn 80(MSAp). 1830 Meigen, k T X steaioai-ae itnebtensequences between distance acid-based amino the is 1) m =1 1) D − T D X D i
41. Mate position 8 1.00 Gbp 1.20 Gbp % 200 Mbp 400 Mbp 600 Mbp 800 Mbp 0 bp 0 bp U niq u
e 200 Mbp 7 .6 % Ar S t hr y Ar ns opo an per
Ar 400 Mbp eomo da mi an 31. a ea A t r a r pha e 0 ach Che 0 % 2 . . 3 3 e ni % li % 4. cer da 7% 7 a 600 Mbp t . Read position 5 a % 4 . 8 % 18
800 Mbp B)
1.00 Gbp 0
1.20 Gbp 10,000 Chr5 Chr6 Chr4 ChrX Chr1 Chr2 Chr3 20,000 10 10 10 10 0 1 2 3 Posted on Authorea 10 May 2021 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.162067941.16111128/v1 — This a preprint and has not been peer reviewed. Data may be preliminary. families-in-spiders arachnida-araneae-sheds-light-on-the-origin-and-genome-structure-of-chemoreceptor-gene- chromosome-scale-assembly-of-the-canary-island-endemic-spider-dysdera-silvatica- Fig4B_IRs_chromosomes.pdf file Hosted A) Gr12295 Gr10964 Gr10899 Gr10898 Gr10897 Gr10895 Gr10894 Gr10875 Gr10625 Gr10523 Gr10522 Gr10520 Gr10508 Gr10489 Gr10332 Gr10331 Gr10232 A) Gr familyinChr1 Gr9343 Gr132 Gr025 Gr024 Gr029 Gr183 Gr149 Gr066
Gr9343 Gr10232 Gr066 Gr10331 Gr10332 Gr149 Gr10489 Gr10508 Gr183 Gr10520 Gr10522 Gr10523 Gr10625 Gr10875
Gr10894 at available Gr10895 Gr029 Gr024 Gr10897 Gr10898 Gr10899 Gr025 Gr10964 Gr12295 Gr132 0 <0.1 0.1-0.2 0.2-0.5 0.5-1 1-2 2-5 5-10 >10 B) https://authorea.com/users/413060/articles/521514- Evolutionary distance 0 2 4 6 19 1 0 1 1 Ph 0 2 ysi ca l
d i 1 st 0 a 3 n ce 1 0 4 1 0 5
Posted on Authorea 10 May 2021 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.162067941.16111128/v1 — This a preprint and has not been peer reviewed. Data may be preliminary.
1:2 1:1 1:2
1:8 1:1 1:4 1:6
1:3 1:7 1:3 Ir familyinChr1 C)
1:2 Ir12320 Ir12098 Ir11738 Ir11680 Ir11679 Ir11557 Ir11555 Ir11469 Ir11452 Ir11451 Ir11230 Ir11229 Ir11228 Ir11088 Ir11005 Ir10748 Ir10675 Ir10673 Ir10324 Ir10250 Ir9850 Ir9745 1:5
1:1 Ir191 Ir188 Ir589 Ir491 Ir542 Ir144 Ir511 Ir225 Ir466 Ir314 Ir640 Ir514
1:1
Gr13401 Gr13399
1:2 Gr18962
Gr18549 Gr18960 Ir514 Gr10522
Gr055
Gr10520 Gr014 Gr189 Ir9745 Gr018
Gr019 Gr053
Gr050 Ir9850
Gr054
2:2 Gr052 Ir10250 1 Gr183
r
2:1 Gr14025Gr6120 Gr156 DmelG Ir10324
r DmelG Ir640
Ir314 r
Gr093 DmelG
Ir10673
r Gr190 DmelG
Gr21 Ir10675
Gr1 64b r
DmelG Ir466
64d Ir225
r DmelG
Gr528 15 64c
Gr5805 131 Ir10748
r DmelG
Gr24309 64a Ir11005
61a Ir11088
r
DmelG Ir511
r DmelG
Gr22760Gr82 5a Ir144
64e Ir542
r
DmelG Ir11228
Gr124 64f
r DmelG Ir11229
1:2 63a
Gr125 Ir11230
r Gr33267 DmelG
1:1 21a Ir11451
Ir11452
39a r
Gr8322 DmelG Ir11469
68a Ir11555
r
Gr8321 DmelG Ir11557
32a Ir11679 r Gr9083 DmelG
Ir11680
43a Ir11738 r
Gr002 DmelG 33a Ir491
Ir12098
r Gr491 DmelG
Gr14452 28b Ir12320
1:6 Ir589 r 1 DmelG 28a Ir188
1:7 Ir191 Gr041 66a r
1:3 Gr042 DmelG 59f r DmelG
1:2 Gr084 59e 0 <0.1 0.1-0.2 0.2-0.5 0.5-1 1-2 2-5 5-10 >10 r DmelG
Gr040 2a
r 1:1 DmelG
Gr038 98a
r 1:4 DmelG
Gr039 98b
r 1:5 DmelG
Gr030 98d
D)
r
1:8 DmelG
Gr031 98c
r
1:10 DmelG Evolutionary distance
Gr184 9a
0 2 4 r DmelG
1:9 Gr037 8a
20
D m e l G r 3 9
Gr032 b G r 1 2 0 : 8 5 9 7
Gr10894
2:1 DmelGr23a
Gr024
2:4 DmelGr57a
Gr025 1 2:7 DmelGr
47b 0
Gr10875 DmelGr 1
Gr10899 77a
DmelGr
2:6
Gr029 10b
DmelGr
Gr10895
2:3 DmelGr 97a
Gr13421
2:2 DmelGr 94a
Gr194 DmelGr 93a 1 Gr134 DmelGr 93c 0
2
Gr31761 DmelGr 93b Ph Gr16172 DmelGr
93d ysi
Gr15138 DmelGr Gr16077 92a
DmelGr ca
Gr166 DmelGr 47a
l Gr7207 1
DmelGr 85a
d 0 Gr32826
DmelGr 22f i
3
st Gr32978
2:2 DmelGr 22c
Gr20822 DmelGr a Gr20824 22b
DmelGr n
2:1 ce
Gr20825 DmelGr 22d
Gr20827 DmelGr 22e
Gr20826
1:1 DmelGr 22a
Gr20829 DmelGr 1 1:2 Gr20828 10a
DmelGr
0
Gr154 89a
1:3 DmelGr
Gr5432 DmelGr 59b 4
1:5
Gr3454 DmelGr
Gr106 DmelGr 59a 1:4
Gr149 DmelGr 58b Gr3475
1:7 DmelGr
Gr1814 DmelGr 58a
Gr1815 Gr2301
1:6
Gr32418 Gr2300 58c
Gr32419 Gr21408
Gr32421 Gr10232
Gr12295 Gr27759 59c
Gr16255 Gr132 Gr14277 Gr20330 Gr20696 Gr27305 59d 36c 36b 1 36a 0 5
Minor Sc ChrU1 Chr6 Chr5 Chr4 Chr3 Chr2 Chr1 ChrX
1:1
1:2 2:2
1:1 2:1
1:2 1:3 Posted on Authorea 10 May 2021 — The copyright holder is the author/funder. All rights reserved. No reuse without permission. — https://doi.org/10.22541/au.162067941.16111128/v1 — This a preprint and has not been peer reviewed. Data may be preliminary.
U1705: :
U1705: U71:1 U1705
2:1
: U666
2
U189: U71:2
: U3226
1 4 2
3
U512:
U445: 2 Ir571
Ir084
Ir076
Ir101
1 Ir
1 Ir Ir066 Ir1 Ir081
Ir1
Ir057 Ir1 Ir089
Ir064 Ir091 Ir068 3 Ir046 Ir099
Ir086
Ir095 Ir088 Ir374
1 Ir103 Ir097
Ir255 Ir124 10 Ir062
Ir1 15
U560: 17 19
Ir1 18 Ir249
Ir094 Ir052
U666: Ir090 Ir075
Ir061 Ir069 Ir123 Ir083
U560: 12
Ir514 16 Ir058
Ir100 Ir056
Ir063 Ir054 Ir105 Ir120
Ir014 1 1 1 Ir Ir375 U2894: 1 Ir107
2 Ir047
Ir050 14
Ir104
C Dmel C Ir082 Dmel U189: Ir102 Ir080
1 Ir078
5 Ir106 iGluR14
6 Ir077 iGluR31
1 Ir096
U13508 G9935
155 11
Ir109 G 2 iGluR46
1 iGluR
u Ir141 DmelGl
Ir254 u
Ir087 DmelGl
0
Ir1 iGluR25
:1 59
5 Ir251 iGluR26
41
1245 1 Ir256 13 iGluR25
Ir051 6 iGluR31
6
U325: Ir259 RIB
5 iGluR14
Ir260 RIA
5
Ir390 iGluR24
98
8 U13508 iGluR14 C
Ir018 Dmel
2 01
2:2 Ir085
06
8
Ir092 iGluR32
U3226: 38 0 Ir053 iGluR10 58
:2 G3822
4 Ir067 iGluR24 20
Ir059 97 6
U1209: iGluR31
Ir079
5
1 iGluR24 C
U1209: Dmel
Ir121 45
u DmelGl
Ir055 1
1 1 u DmelGl Ir049 64
G5621
2 Ir562 26
l Dmelc
22
u
U5666: Ir1 DmelGl
Ir016 RIIE
u
11 DmelGl
Ir614 RIID
u U189: DmelGl
U1705: 2 Ir029 umsy
2 Ir512 RIIC
RIIB
r Ir093 DmelI
3 RIIA Ir070 iGluR618
r
Ir210 DmelI 5 iGluR28
U2894: Ir622 8a 1 iGluR17
Ir432 25a 6 iGluR21
N
2 Ir262 Dmel
98
N Ir392 Dmel
U445: Ir620 32 r mda
65 1
Ir621 iGluR r
3 Ir4612 mda 1
8 iGluR45
1240
2 0 Ir060 iGluR46
U445: Ir074 5
Ir034 Ir28190 r DmelI
2 Ir368 0
Ir389 76b
Ir367 Ir3440 Ir1995
Ir509 Ir29164
r
Ir098 DmelI
r DmelI
2:3 Ir364 93a
r DmelI
2:4 Ir508 75a
r DmelI
Ir510 31a
r U5666: DmelI
Ir434 75d
r DmelI
U325: 1 Ir126 84a
r DmelI
3 Ir294 64a
r DmelI
U325: Ir261 68a
r DmelI
1 Ir370 21a
r DmelI
U5121:2: Ir207 10a
100a
r
U189: 1 Ir265 DmelI 40a r
Ir048 DmelI
U512: 4
92a
r
2 Ir266 DmelI
7c
r 1:4 Ir427 DmelI
U467: Ir647 7b r
1 DmelI
Ir466 7f r
1:5 DmelI
7e
r Ir615 DmelI
1:1
87a
r Ir225 DmelI
2:2 68b
r Ir073 DmelI
41a
r
U2027: Ir511 DmelI
2 Ir025 85a r DmelI
Ir5552 67c r DmelI
Ir8785 48c r DmelI
Ir9146 67b
r DmelI
Ir8499 60e
r DmelI
Ir8703 56d
r DmelI
U3722: Ir506 56c
r 1 DmelI
Ir1019 62a
r DmelI
Ir414 56b
r DmelI
Ir424 60d
D m e l I r 9 4
21 Ir553 h
D m e l I r 9 4
U1751:2 Ir033 g
D m e l I r 6 7
Ir504 a
I r 5 5
Ir62 947
I r 2 2 3 1
6 DmelIr94c
Ir548 DmelIr94b
Ir7782 DmelIr94d
Ir549 DmelIr52c
Ir505 DmelIr52d
Ir558 DmelIr52b
Ir557 DmelIr52a
Ir554 Ir3161
Ir555 Ir31848
Ir556 Ir26867
Ir589 :
U3722 Ir134
2
Ir6501 Ir11005
1
Ir8 Ir332 17
: U742
2
Ir423 Ir658
Ir8302 Ir8078
Ir13590 Ir7733
Ir5527 Ir923
: U1845
Ir419 Ir7794
2
Ir503 : U2770 Ir6486
1
Ir7082 Ir698
Ir501 Ir6484 U301:
Ir33246 Ir33000 2 : U1751
Ir837 Ir7164
1 U301:
1
Ir3 Ir31520 1 169 Ir7160 Ir7183
U1566:
Ir6500 Ir7165
Ir5528 Ir20603 1
Ir5551 Ir8944
Ir13449
: U742 Ir8943 U1566:
Ir13589 Ir1 : U2770
1 1680 2
Ir3981 Ir1
: U2027 U872:
2 1679
Ir18377 Ir227 U872: 2
1
Ir632 Ir7860
:
U1845 6:2
Ir445 Ir17653 1
Ir451 Ir22423 6:1 1
Ir019 Ir7641 2:1
Ir416 Ir7643
Ir1015 Ir22427 :
U1631 2:2
Ir330 Ir22426 2:1
Ir25673
1 Ir23242
:
U1631 U37:1
Ir247 Ir6960
4:2 U37:2
Ir250 Ir13473
2
Ir253 Ir1 2:3
Ir590
4:1 Ir4911738 2:2
Ir371 Ir9745 4:1
Ir369 Ir24267 5:8
Ir633 Ir7021
Ir19895 Ir17156
5:5 7:1 Ir19896 Ir16984
5:7
Ir3328 Ir16986 7:2
1 Ir2
5:6 Ir16985
Ir310 Ir748 5:2
1
Ir22258 Ir746 5:1
Ir212 Ir745
5:3
Ir661 Ir3629
5:4
Ir489 Ir4848 1:1
Ir507 Ir4846 1:3
Ir16444 Ir4844
4:3 1:2
Ir33213 Ir2366 U166:
Ir1277 Ir7536
Ir17980 Ir475 U166:
Ir17979
1:2 Ir15210 U166: 3
Ir17981 Ir15209 1:1 Ir17942 2
Ir537 2:3
Ir150 Ir536 1
: U467
Ir151 Ir7601 2:2
Ir8533 Ir8262 2:1
Ir17891 Ir20563
Ir22433 Ir6910 2
Ir10250 Ir478
Ir383
3:2 Ir319
Ir384 Ir361
3:1 1:2
Ir385 Ir8232
3:3
Ir382 Ir7522 1:1
Ir3876 Ir8849
Ir3877 Ir7028
Ir10673 Ir576
Ir10675 Ir8084
Ir22895 Ir583
Ir23243 Ir581
Ir23982 Ir577
Ir7906 Ir7988
Ir462 Ir575
1
Ir46 Ir7994
Ir31241 Ir8912
Ir301 Ir8358
Ir398 Ir20188
Ir15346 Ir20187
Ir18225 Ir1
Ir18942 Ir32753
1
1
Ir Ir12098
1 Ir Ir24047
1:1
Ir7099 Ir22031 1088
Ir7101 Ir1
Ir7098
1:2 Ir1 1557
Ir5535 Ir1
1555
Ir7097 Ir1 1:2
Ir7100 Ir314
Ir8790 Ir12320 1229
1:3 Ir8788 Ir071
Ir31622 Ir1 1469
Ir24551 Ir542 1452 Ir9195 Ir144
Ir21058
Ir23765 1451
Ir640 Ir20479
Ir8799 Ir22814
Ir10748 Ir184 Ir7927
4:2 Ir10324 Ir1404
1
Ir228 Ir188
Ir21391 Ir191 Ir1403 Ir1405
Ir5723 Ir1407
1230
U74:2 U74:1
1:2 1
1:1 Sc Minor ChrU1 Chr6 Chr5 Chr4 Chr3 Chr2 Chr1 ChrX
2:1
5:2
5:1
: U203
: U203 3:3 :
U203 4:2
: U203 4:1
: U203 1:1
: U1788 :
U1788 3:4 3:2
3:1
3:2
3 U1
3:1
1 U U1
U1 5
2
1
4 1:2
1:1 1:3
1:4
2 1