The Common Dragonet, Callionymus Lyra 4 5 Authors 6 Sven Winter1,2*, Stefan Prost3,4, Jordi De Raad1,2,3, Raphael T
Total Page:16
File Type:pdf, Size:1020Kb
bioRxiv preprint doi: https://doi.org/10.1101/2020.09.08.287078; this version posted September 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 Title 2 Chromosome-level genome assembly of a benthic associated 3 Syngnathiformes species: the common dragonet, Callionymus lyra 4 5 Authors 6 Sven Winter1,2*, Stefan Prost3,4, Jordi de Raad1,2,3, Raphael T. F. Coimbra1,2, Magnus Wolf1,2, 7 Marcel Nebenführ1, Annika Held1, Melina Kurzawe1, Ramona Papapostolou1, Jade Tessien1, 8 Julian Bludau1, Andreas Kelch1, Sarah Gronefeld1, Yannis Schöneberg1, Christian Zeitz1, 9 Konstantin Zapf1, David Prochotta1, Maximilian Murphy1, Monica M. Sheffer5, Moritz 10 Sonnewald6, Maria A. Nilsson2,3, Axel Janke1,2,3 11 12 Affiliations 13 1Institute for Ecology, Evolution and Diversity, Goethe University, Frankfurt am Main, Germany 14 2Senckenberg Biodiversity and Climate Research Centre, Frankfurt am Main, Germany 15 3LOEWE-Centre for Translational Biodiversity Genomics, Frankfurt am Main, Germany 16 4South African National Biodiversity Institute, National Zoological Garden, Pretoria, South 17 Africa 18 5Zoological Institute and Museum, University of Greifswald, Greifswald, Germany. 19 6Senckenberg Research Institute, Department of Marine Zoology, Section Ichthyology, 20 Frankfurt am Main, Germany 21 *Correspondence: [email protected] 22 23 Email and Orcid: Name e-mail ORCID Sven Winter [email protected] 0000-0002-1890-0977 Stefan Prost [email protected] 0000-0002-6229-3596 Jordi de Raad [email protected] 0000-0002-4991-4431 Raphael T.F. Coimbra [email protected] 0000-0002-6075-7203 Magnus Wolf [email protected] 0000-0001-9212-9861 Marcel Nebenführ [email protected] Annika Held [email protected] Melina Kurzawe [email protected] Ramona Papapostolou [email protected] frankfurt.de Jade Tessien [email protected] Julian Bludau [email protected] Andreas Kelch [email protected] 0000-0002-8265-4643 Sarah Gronefeld [email protected] 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.09.08.287078; this version posted September 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Yannis Schöneberg [email protected] 0000-0003-1113- 973X Christian Zeitz [email protected] Konstantin Zapf [email protected] David Prochotta [email protected] Maximilian Murphy [email protected] Monica M. Sheffer [email protected] 0000-0002-6527-4198 Moritz Sonnewald [email protected] Maria A. Nilsson [email protected] 0000-0002-8136-7263 Axel Janke [email protected] 0000-0002-9394-1904 24 25 Abstract 26 Background 27 The common dragonet, Callionymus lyra, is one of three Callionymus species inhabiting the 28 North Sea. All three species show strong sexual dimorphism. The males show strong 29 morphological differentiation, e.g., species-specific colouration and size relations, while the 30 females of different species have few distinguishing characters. Callionymus belongs to the 31 ‘benthic associated clade’ of the order Syngnathiformes. The ‘benthic associated clade’ so far 32 is not represented by genome data and serves as an important outgroup to understand the 33 morphological transformation in ‘long-snouted’ syngnatiforms such as seahorses and 34 pipefishes. 35 Findings 36 Here, we present the chromosome-level genome assembly of C. lyra. We applied Oxford 37 Nanopore Technologies’ long-read sequencing, short-read DNBseq, and proximity-ligation- 38 based scaffolding to generate a high-quality genome assembly. The resulting assembly has a 39 contig N50 of 2.2 Mbp, a scaffold N50 of 26.7 Mbp. The total assembly length is 568.7 Mbp, 40 of which over 538 Mbp were scaffolded into 19 chromosome-length scaffolds. The 41 identification of 94.5% of complete BUSCO genes indicates high assembly completeness. 42 Additionally, we sequenced and assembled a multi-tissue transcriptome with a total length of 2 bioRxiv preprint doi: https://doi.org/10.1101/2020.09.08.287078; this version posted September 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 43 255.5 Mbp that was used to aid the annotation of the genome assembly. The annotation 44 resulted in 19,849 annotated transcripts and identified a repeat content of 27.66%. 45 Conclusions 46 The chromosome-level assembly of C. lyra provides a high-quality reference genome for 47 future population genomic, phylogenomic, and phylogeographic analyses. 48 49 Keywords 50 Callionymidae, genome annotation, genome assembly, Hi-C, Nanopore, Syngnathiformes 51 52 Data Description 53 Background information 54 Until recently, the family Callionymidae was placed into the order Perciformes, which is often 55 considered a “polyphyletic taxonomic wastebasket for families not placed in other orders” [1]. 56 However, recent phylogenetic analyses suggest a placement of Callionymidae within the order 57 Syngnathiformes, which currently contains ten families with highly derived morphological 58 characters such as the pipefish and seahorses [1]. Syngnathiformes has recently been divided 59 into two clades, a ‘long-snouted clade’ and a ‘benthic associated clade,’ each comprising five 60 families [2]. The ‘long-snouted clade’ (Syngnathidae, Solenostomidae, Aulostomidae, 61 Centriscidae, and Fistulariidae) is currently represented by genomes from the Gulf pipefish 62 (Syngnathus scovelli) and the tiger tail seahorse (Hippocampus comes) [3,4] and additional 63 draft assemblies of pipefish [5]. A genome of the ‘benthic associated clade’ (Callionymidae, 64 Draconettidae, Dactylopteridae, Mullidae, and Pegasidae) has not been sequenced and 65 analysed yet. Callionymidae comprises 196 species [6], of which the common dragonet, 66 Callionymus lyra (Linnaeus, 1758) (Fig. 1), is one of three Callionymus species inhabiting the 67 North Sea [7]. All three species also occur in the East Atlantic, and the Mediterranean Sea [6]. 68 They represent essential prey fish for commercially important fish species such as the cod 69 (Gadus morhua) [8]. The males of the North Sea dragonet species (C. lyra, C. maculatus, C. 70 reticulatus) show strong morphological differentiation in the form of species-specific 3 bioRxiv preprint doi: https://doi.org/10.1101/2020.09.08.287078; this version posted September 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 71 colouration and size relations. The much less conspicuous females can be distinguished 72 morphologically, with rather high inaccuracy, by the presence or absence of their preopercular, 73 basal spine and by various percentual length ratios. The great resemblance among the 74 different species’ females, together with the fact that all three species can be found in 75 sympatry, suggest the possibility of hybridization among them. 76 77 Here, we present the chromosome-level genome of the common dragonet, representing the 78 first genome of the ‘benthic associated’ Syngnathiformes clade as a reference for future 79 population genomic, phylogenomic, and comparative genomic analyses. The chromosome- 80 level genome assembly was generated as part of a six-week university master’s course. For 81 a detailed description and outline of the course, see [9]. 82 83 Sampling, DNA extraction, and Sequencing 84 We sampled two Callionymus lyra individuals (one of each sex) during a yearly monitoring 85 expedition to the Dogger Bank in the North Sea (Female: 54°59.189'N 1°37.586'E; Male: 86 54°48.271'N 1°25.077'E) with the permission of the Maritime Policy Unit of the UK Foreign 87 and Commonwealth Office in 2019. The samples were initially frozen at -20°C on the ship and 88 later stored at -80°C until further processing. The study was conducted in compliance with the 89 ‘Nagoya Protocol on Access to Genetic Resources and the Fair and Equitable Sharing of 90 Benefits Arising from Their Utilization’. 91 We extracted high molecular weight genomic DNA (hmwDNA) from muscle tissue of the 92 female individual following the protocol by [10]. Quantity and quality of the DNA was evaluated 93 using the Genomic DNA ScreenTape on the Agilent 2200 TapeStation system (Agilent 94 Technologies). Library preparation for long-read sequencing followed the associated protocols 95 for Oxford Nanopore Technologies (ONT, Oxford, UK) Rapid Sequencing Kit (SQK-RAD004). 96 A total of seven sequencing runs were performed using individual flow cells (FLO-MIN106 97 v.9.41) on a ONT MinION v.Mk1B. 4 bioRxiv preprint doi: https://doi.org/10.1101/2020.09.08.287078; this version posted September 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 98 Additionally, we sent tissue samples to BGI Genomics (Shenzhen, China) to generate 99 additional sequencing data. A 100 bp paired-end short-read genomic DNA sequencing library 100 was prepared from the muscle tissue of the female individual. This library was later used for 101 genome assembly polishing. Moreover, a 100 bp paired-end RNAseq library was prepared for 102 pooled RNA isolates derived from kidney, liver, gill, gonad, and brain tissue of the male 103 individual. Both libraries were sequenced on BGI’s DNBseq platform.