Downloaded 27 Additional Transcriptomes from Genbank
Total Page:16
File Type:pdf, Size:1020Kb
bioRxiv preprint doi: https://doi.org/10.1101/2021.07.29.454256; this version posted July 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 A phylotranscriptome study using silica gel-dried leaf tissues 2 produces an updated robust phylogeny of Ranunculaceae 3 4 Running title: Phylotranscriptomics using silica gel-dried tissues 5 Jian He1†, Rudan Lyu1†, Yike Luo1†, Jiamin Xiao1†, Lei Xie1*, Jun Wen2*, Wenhe Li1, 6 Linying Pei4, Jin Cheng3 7 1 School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, 8 100083 PR China 9 2Department of Botany, National Museum of Natural History, MRC 166, Smithsonian 10 Institution, Washington, DC 20013-7012, USA 11 3 Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, 12 College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 13 100093 PR China 14 4Beijing Engineering Technology Research Center for Garden Plants, Beijing Forestry 15 University Forest Science Co. Ltd., Beijing, 100083, PR China 16 †These authors contributed equally to this work. 17 Correspondence: Lei Xie, email: [email protected]; , Jun Wen, email: [email protected]. 18 Funding information: This study was supported by the National Natural Science 19 Foundation of China [grant number 31670207] and the Beijing Natural Science 20 Foundation [grant number 5182016]. 21 1 bioRxiv preprint doi: https://doi.org/10.1101/2021.07.29.454256; this version posted July 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 22 Abstract 23 The utility of transcriptome data in plant phylogenetics has gained popularity in recent years. 24 However, because RNA degrades much more easily than DNA, the logistics of obtaining 25 fresh tissues has become a major limiting factor for widely applying this method. Here, we 26 used Ranunculaceae to test whether silica-dried plant tissues could be used for RNA 27 extraction and subsequent phylogenomic studies. We sequenced 27 transcriptomes, 21 from 28 silica gel-dried (SD-samples) and six from liquid nitrogen-preserved (LN-samples) leaf 29 tissues, and downloaded 27 additional transcriptomes from GenBank. Our results showed that 30 although the LN-samples produced slightly better reads than the SD-samples, there were no 31 significant differences in RNA quality and quantity, assembled contig lengths and numbers, 32 and BUSCO comparisons between two treatments. Using this data, we conducted 33 phylogenomic analyses, including concatenated- and coalescent-based phylogenetic 34 reconstruction, molecular dating, coalescent simulation, phylogenetic network estimation, and 35 whole genome duplication (WGD) inference. The resulting phylogeny was consistent with 36 previous studies with higher resolution and statistical support. The 11 core Ranunculaceae 37 tribes grouped into two chromosome type clades (T- and R-types), with high support. 38 Discordance among gene trees is likely due to hybridization and introgression, ancient genetic 39 polymorphism and incomplete lineage sorting. Our results strongly support one ancient 40 hybridization event within the R-type clade and three WGD events in Ranunculales. 41 Evolution of the three Ranunculaceae chromosome types is likely not directly related to WGD 42 events. By clearly resolving the Ranunculaceae phylogeny, we demonstrated that SD-samples 2 bioRxiv preprint doi: https://doi.org/10.1101/2021.07.29.454256; this version posted July 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 43 can be used for RNA-seq and phylotranscriptomic studies of angiosperms. 44 Keywords 45 chromosomal type, phylotranscriptomics, Ranunculaceae, RNA-seq, silica-dried leaf tissue 3 bioRxiv preprint doi: https://doi.org/10.1101/2021.07.29.454256; this version posted July 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 46 1 INTRODUCTION 47 With the recent advances in high-throughput sequencing and analytical methods, 48 phylogenetic reconstruction using genome-wide sequence data has become widely used 49 in plant evolutionary studies (Johnson et al., 2012; Yu et al., 2018). However, whole- 50 genome sequencing of densely sampled phylogenetic analyses has remained impractical 51 and unnecessary due to high costs and computational limitations. Hence researchers 52 have developed reduced-representation methods (e.g., genome skimming, restriction 53 site-associated DNA sequencing or RAD-seq, target enrichment sequencing such as 54 Hyb-seq, and transcriptome sequencing or RNA-seq) as practical tools for phylogenetic 55 studies (Zimmer & Wen, 2015; McKain et al., 2018). 56 Genome skimming is one of the most widely applied partitioning strategies for 57 phylogenetic inferences, and especially efficient for obtaining complete plastid genome 58 sequences of plants (Dodsworth, 2015; Liu et al., 2018; Zhai et al., 2019; He et al, 2019; 59 Liu et al., 2020; Wang et al, 2020). However, it is often of limited utility to obtain 60 enough single copy nuclear genes for phylogenetic analyses, especially for non-model 61 plant taxa possessing both huge genome sizes and no whole-genome reference (McKain 62 et al., 2018; but see Liu et al., 2021). Hyb-seq is another widely used genome 63 partitioning method that can target low-copy nuclear genes. Like genome skimming, 64 Hyb-seq can use almost all kinds of tissue samples (e.g., silica gel-preserved, flash- 65 frozen, fresh, or even old herbarium materials) (Yu et al., 2018; McKain et al., 2018; 66 Reichelt et al., 2021; Wang et al., 2021). However, Hyb-seq requires a complex 67 laboratory protocol involving bait capture. Furthermore, this method often results in a 4 bioRxiv preprint doi: https://doi.org/10.1101/2021.07.29.454256; this version posted July 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 68 high proportion of missing data, and also cannot be used to detect ancient whole 69 genome duplication (WGD) based on paralogous genes. 70 In recent years, using RNA-seq to reconstruct phylogenetic relationships 71 (phylotranscriptomics) and gene family evolution has gained popularity because of its 72 relatively low cost and improved analytical pipelines (Wen et al., 2013, 2015; Wickett et 73 al., 2014; Landis et al., 2017; Zeng et al., 2017; One Thousand Plant Transcriptomes 74 Initiative, 2019; Cheon et al., 2020; Alejo-Jacuinde et al., 2020). Using that method, 75 researchers can often assemble thousands of genes (especially single-copy nuclear 76 genes) from plant taxa under study and use them for both inferring phylogenetic 77 relationships and gene family evolution (Yang and Smith, 2014; Yang et al., 2015; 78 Xiang et al., 2017). Compared to Hyb-seq, RNA-seq uses relatively simple 79 experimental protocols to generate more complete nuclear data, and paralogous genes 80 from RNA-seq data can also be used to infer WGD (McKain et al., 2018). 81 While improvements in sequencing and extraction protocols have made RNA-seq 82 much easier in plants (Romero et al., 2014; Yang et al., 2017), its application in 83 phylogenetic study still remains challenging. Because RNA is more unstable than DNA, 84 RNA-seq requires more stringent material preservation techniques. Previous studies 85 have used fresh or liquid-nitrogen flash frozen plant tissues or fresh tissue quickly 86 soaked in RNA stabilization solution, and then subsequently preserved in an ultra-low 87 temperature (-80 ℃) freezer (Yu et al., 2018; McKain et al., 2018; Dodsworth et al., 88 2019). However, because phylotranscriptomic studies often focus on non-model plant 89 taxa, they usually require extensive field work and broad taxon sampling schemes. The 5 bioRxiv preprint doi: https://doi.org/10.1101/2021.07.29.454256; this version posted July 29, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 90 logistics of using liquid nitrogen tanks in the field or expensive RNA stabilization 91 solution to preserve collected plant tissues at multiple locations, not to mention quick 92 access to an ultra-low temperature freezer for subsequent laboratory work, greatly limits 93 the practicality of using RNA-seq for phylogenetic studies (Zimmer & Wen, 2015; Yang 94 et al., 2017). 95 Traditional DNA-based molecular phylogenetics, DNA barcoding, as well as 96 genome skimming and RAD-seq methods, often use silica gel-dried leaf tissues 97 (Narzary et al., 2015; Yu et al., 2018). Such sampling method is cheaper and amenable 98 to collecting and transporting large number of samples. Much emphasis has been placed 99 on how different plant tissues, preservation methods, and RNA extraction protocols may 100 impact the quantity and quality of extracted RNA (Johnson et al., 2012; Romero et al., 101 2014; Yang et al., 2017), but no empirical study has explored silica gel-dried plant 102 tissues for RNA-seq. This may be due to the reasons that even though total RNA may be 103 extracted from silica gel-dried plant tissues, it is not possible to quantitatively measure 104 gene expression using this kind of samples for evo-devo studies.