De Novo Sequencing and Comparative Analysis of Three Red Algal
Total Page:16
File Type:pdf, Size:1020Kb
Acta Oceanol. Sin., 2014, Vol. 33, No. 2, P. 45–53 DOI: 10.1007/s13131-014-0440-7 http://www.hyxb.org.cn E-mail: [email protected] De novo sequencing and comparative analysis of three red algal species of Family Solieriaceae to discover putative genes associated with carrageenan biosysthesis SONG Lipu1,3,4†, WU Shuangxiu1,3†, SUN Jing1,3,4†, WANG Liang1,3,4, LIU Tao2, CHI Shan2, LIU Cui2, LI Xingang1,3, YIN Jinlong1, WANG Xumin1,3*, YU Jun1,3* 1 CAS Key Laboratory of Genome Sciences and Information, Beijing Key Laboratory of Genome and Precision Medicine Technologies, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China 2 College of Marine Life Science, Ocean University of China, Qingdao 266003, China 3 Beijing Key Laboratory of Functional Genomics for Dao-di Herbs, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China 4 University of Chinese Academy of Sciences, Beijing 100049, China Received 25 March 2013; accepted 22 July 2013 ©The Chinese Society of Oceanography and Springer-Verlag Berlin Heidelberg 2014 Abstract Betaphycus gelatinus, Kappaphycus alvarezii and Eucheuma denticulatum of Family Solieriaceae, Order Gi- gartinales, Class Rhodophyceae are three important carrageenan-producing red algal species, which pro- duce different types of carrageenans, beta (β)-carrageenan, kappa (κ)-carrageenan and iota (ι)-carrageenan. So far the carrageenan biosynthesis pathway is not fully understood and few information is about the So- lieriaceae genome and transcriptome sequence. Here, we performed the de novo transcriptome sequencing, assembly, functional annotation and comparative analysis of these three commercial-valuable species using an Illumina short-sequencing platform Hiseq 2000 and bioinformatic software. Furthermore, we compared the different expression of some unigenes involved in some pathways relevant to carrageenan biosynthe- sis. We finally found 861 different expressed KEGG orthologs which contained a glycolysis/gluconeogenesis pathway (21 orthologs), carbon fixation in photosynthetic organisms (16 orthologs), galactose metabolism (5 orthologs), and fructose and mannose metabolism (9 orthologs) which are parts of the carbohydrate me- tabolism. We also found 8 different expressed KEGG orthologs for sulfur metabolism which might be impor- tantly related to biosynthesis of different types of carrageenans. The results presented in this study provided valuable resources for functional genomics annotation and investigation of mechanisms underlying the biosynthesis of carrageenan in Family Solieriaceae. Key words: Betaphycus gelatinus, Kappaphycus alvarezii, Eucheuma denticulatum, Solieriaceae, de novo transcriptome sequencing, carrageenan Citation: Song Lipu, Wu Shuangxiu, Sun Jing, Wang Liang, Liu Tao, Chi Shan, Liu Cui, Li Xingang, Yin Jinlong, Wang Xumin, Yu Jun. 2014. De novo sequencing and comparative analysis of three red algal species of Family Solieriaceae to discover putative genes as- sociated with carrageenan biosysthesis. Acta Oceanologica Sinica, 33(2): 45–53, doi: 10.1007/s13131-014-0440-7 1 Introduction ducing species which produce different types of carrageenans Solieriaceae is a red algal family in Order Gigartinales of (Lobban and Harrison, 1994), i.e., beta (β)-carrageenan, kappa Class Rhodophyceae. In this family, there are many valuable (κ)-carrageenan and iota (ι)-carrageenan, which are different in species which generally have multiaxial thalli and often in- their chemical structures and properties, and are mainly pro- habite the warm seas, including Betaphycus gelatinus, Kappa- duced by B. gelatinus, K. alvarezii and E. denticulatum, respec- phycus alvarezii and Eucheuma denticulatum. B. gelatinus is tively (Rudolph, 2000) . most abundant in about one meter below the low tide region Carrageenans are sulfated galactans clustering in cell walls under cultivation in the Qionghai and Wenchang districts of of numerous red seaweeds, which are especially rich in some the Hainan Province, China (Tseng et al., 1981). K. alvarezii is species such as B. gelatinus, K. alvarezii and E. denticulatum often cultivated in the upper part of the sublittoral zone, from (Cole and Sheath, 1990). In the food industry, carrageenans are just below the low tide line to rocky substrates where water widely utilized due to their excellent physical properties, such flow is slow to moderate. E. denticulatum is produced in large as thickening, gelling and stabilizing abilities (McHugh, 2003). quantity in Taiwan and is known in China as Qilincai (mean- They are also used in various non-food products, such as phar- ing unicorn vegetable), a rather common algal food in China. maceutical, cosmetics, printing and textile formulations (Ime- These three species are the most significant carrageenan-pro- son, 2000). Carrageenans are linear units of D-galactose resi- Foundation item: The National Natural Science Foundation of China under contract Nos 31140070, 31271397 and 41206116; the algal transcrip- tome sequencing was supported by 1KP Project (www.onekp.com). *Corresponding author, E-mail: [email protected], [email protected] †Contributed equally. 46 SONG Lipu et al. Acta Oceanol. Sin., 2014, Vol. 33, No. 2, P. 45–53 dues composed of alternating 3-linked β-D-galactopyranose those on gene structure and expression profiling, were carried and 4-linked α-D-galactopyranose or 4-linked 3,6-anhydroga- out not only on model species (Mortazavi et al., 2008; Nagalak- lactose (Gordon-Mills et al., 1978). Sulfated galactans are clas- shmi et al., 2008), but also on non-model organisms from de sified according to the presence of the 3,6-anhydro-bridge on novo analysis (Wang et al., 2010; Yuan et al., 2012), leading to a the 4-linked-galactose residue and the position and number of dramatic acceleration in gene discovery, especially for the genes sulfate groups (Fig. 1), which influences the physical properties having low expression levels which were ignored by tradition- of carrageenans (Chiovitti et al., 1995; Gordon-Mills et al., 1978; al method (Barrero et al., 2011; Garg et al., 2011), and rapidly van de Velde et al., 2002). broadening our knowledge on gene regulations and networks Carrageenans from Family Solieriaceae have attracted con- in metabolism. Hence we can study the organisms without the siderable attention because of the economically important reference genome information at the genome-wide level by the genera Eucheuma and Kappaphycus, of which algal species are cost-effective RNA-seq method. the main resources for the production of ι- and κ-carrageenans In this article, we, for the first time, performed the de novo (Doty, 1988; Santos, 1989). Moreover, B. gelatinus, previously transcriptome sequencing, assembly, functional annotation known as Eucheuma gelatinae, produces a hybrid of carrageen- and compararive analysis of the three most significant car- ans with significant β-carrageenan character (Greer and Yaphe, rageenan-producing species, B. gelatinus, K. alvarezii and E. 1984; Santos, 1989). However, carrageenans have very hetero- denticulatum of Family Solieriaceae collected from the Hainan geneous chemical structures, depending on the algal source, Province, using an Illumina short-sequencing platform Hiseq life stage, and extraction procedure (Cole and Sheath, 1990; 2000 and bioinformatic software. Furthermore, we used the se- Usov, 1998). Although the carrageenan biosynthesis pathway quencing reads to map back to the assembly result to get the is not fully understood, it is usually accepted that the last step expression profile by software RSEM (Li and Dewey, 2011) and consists of the formation of a 3,6-anhydro ring found in κ- and compared the differences of some unigenes possibly involved ι-carrageenans through the enzymatic conversion of D-galac- in some pathways relevant to carrageenan biosynthesis. The tose-6-sulfate or D-galactose-2,6-disulfate occurring in μ- and assembled and annotated transcriptomic sequences provided ν-carrageenan, respectively (van de Velde et al., 2002). Accord- comprehensive genomic resources for further gene identifica- ing to the information above the structures of carrageenans, tion, characterization and metabolic studies of the carrageen- β-carrageenan, κ-carrageenan and ι-carrageenan have different an-producing species or relative species of Family Solieriaceae percentages of sulfate. However, so far, there is no report on the and even other red algae. gene characteristics related to the carrageenan biosynthesis on the molecular level. 2 Materials and methods Over the past several years, the next generation sequencing technology has been widely used for transcriptome analysis 2.1 Algal sample collection because it has evolved to be more cost-effective, have high- Branches of the algal samples of B. gelatinus, K. alvarezii throughput and is deep depth. By using this technology, the and E. denticulatum were collected from the Hainan Province genome-wide transcriptome studies (RNA-seq), including (19°27′59″N, 108°52′59″E; 18°24′30″N, 110°03′43″E; 18°24′30″N, ab n B. gelatinus β-carrageenan n K. alvarezii κ-carrageenan n E. denticulatum ι-carrageenan Fig.1. Photos of B. gelatinus, K. alvarezii and E. denticulatum (a) and the structures of three different carrageenans (b). SONG Lipu et al. Acta Oceanol. Sin., 2014, Vol. 33, No. 2, P. 45–53 47 110°03′43″E) in China in February 2012, March 2011 and Oc- Groups (COG) classification was performed against COG data- tober 2010, respectively. The algal sample, B. gelatinus, was base (Tatusov et