Comparison of Cytochrome P450 Genes from Six Plant Genomes

Comparison of Cytochrome P450 Genes from Six Plant Genomes

Tropical Plant Biol. DOI 10.1007/s12042-008-9022-1 Comparison of Cytochrome P450 Genes from Six Plant Genomes David R. Nelson & Ray Ming & Maqsudul Alam & Mary A. Schuler Received: 2 November 2008 /Accepted: 14 November 2008 # Springer Science + Business Media, LLC 2008 Abstract Plants depend on cytochrome P450 (CYP) (grape), Carica papaya (papaya), Populus trichocarpa enzymes for nearly every aspect of their biology. In several (poplar), Oryza sativa (rice), Arabidopsis thaliana (Arabi- sequenced angiosperms, CYP genes constitute up to 1% of dopsis or mouse ear’s cress) and Physcomitrella patens the protein coding genes. The angiosperm sequence (moss). An evolutionary analysis is presented that tracks diversity is encapsulated by 59 CYP families, of which 52 land plant P450 innovation over time from the most ancient families form a widely distributed core set. In the 20 years and conserved sequences to the newest dicot-specific since the first plant P450 was sequenced, 3,387 P450 families. The earliest or oldest P450 families are devoted sequences have been identified and annotated in plant to the essential biochemistries of sterol and carotenoid databases. As no new angiosperm CYP families have been synthesis. The next evolutionary radiation of P450 families discovered since 2004, it is now apparent that the sampling appears to mediate crucial adaptations to a land environ- of CYP diversity is beginning to plateau. This review ment. And, the newest CYP families appear to have driven presents a comparison of 1,415 cytochrome P450 sequen- the diversity of angiosperms in mediating the synthesis of ces from the six sequenced genomes of Vitis vinifera pigments, odorants, flavors and order-/genus-specific sec- ondary metabolites. Family-by-family comparisons allow Electronic supplementary material The online version of this article the visualization of plant genome plasticity by whole (doi:10.1007/s12042-008-9022-1) contains supplementary material, genome duplications and massive gene family expansions which is available to authorized users. via tandem duplications. Molecular evidence of human D. R. Nelson (*) domestication is quite apparent in the repeated P450 gene Department of Molecular Sciences, duplications occurring in the grape genome. University of Tennessee Health Sciences Center, 858 Madison Ave., Suite G01, Keywords Cytochrome P450 . Evolution . P450 Memphis, TN 38163, USA e-mail: [email protected] superfamily. CYP. Papaya . Grape . Comparative genomics R. Ming Department of Plant Biology, Introduction University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA As genome sequencing projects were undertaken over the M. Alam past 15 years, leaders of these projects realized quite Department of Microbiology, Advance Studies in Genomics, quickly that individual genomes were hard to annotate Proteomics and Bioinformatics, University of Hawaii at Manoa, 2565 McCarthy Mall Keller 301, without comparison to closely related genomes. The Honolulu, HI 96822, USA subsequent sequencing of closely related pairs of organisms has provided the annotation bootstrapping necessary for M. A. Schuler more accurate assignment of open reading frames and Departments of Cell and Developmental Biology, Biochemistry and Plant Biology, University of Illinois at Urbana-Champaign, intron positions. In animals, this strategy has been carried Urbana, IL 61801, USA forward to some extremes with sequenced genomes now Tropical Plant Biol. 90 available for 12 species of Drosophila, dozens of mammals, five fish and pairs of more unusual creatures, such as 80 nematodes and sea squirts. Although plant genomes have not 70 been deeply sampled at present, six land plant genomes (Vitis 60 vinifera (grape), Carica papaya (papaya), Populus tricho- 50 carpa (poplar), Oryza sativa (rice), Arabidopsis thaliana 40 (Arabidopsis or mouse ear’s cress) and Physcomitrella 30 patens (moss)) and several algal genomes (Chlamydomonas 20 reinhardtii and Volvox carteri (green algae), Ostreococcus 10 tauri and O. lucimarinus (microalgae), Cyanidioschyzon 0 merolae and Galdiera sulphuraria (red algae)) have been fully sequenced and more plant species, including Zea mays CYP51 CYP71 CYP72 CYP73 CYP74 CYP75 CYP76 CYP77 CYP78 CYP79 CYP80 CYP81 CYP82 CYP83 CYP84 CYP85 CYP86 CYP87 CYP88 CYP89 (corn) and Medicago truncatula (barrel medic or barrel 30 clover), are in the sequencing pipeline. Among these, the 25 land plant genomes are close enough in sequence to aid in gene identification but far enough apart to provide insights 20 into the evolutionary differences between these quite diverse species. 15 In this review, we focus on similarities and differences 10 that exist in the cytochrome P450 (CYP) monooxygenase superfamily that in these six land plant genomes collec- 5 tively contains 1,415 full-length and putatively functional 0 genes. In this large number of genes, there exist only 73 families as defined in the standardized nomenclature CYP90 CYP92 CYP93 CYP94 CYP96 CYP97 CYP98 CYP99 developed for cytochrome P450 sequences [1] (Fig. 1, CYP701 CYP702 CYP703 CYP704 CYP705 CYP706 CYP707 CYP708 CYP709 CYP710 CYP711 18 Supplementary Table 1). Sixteen of these families are moss- 16 specific and the remaining 57 families encompass the P450 14 diversity existing in the five angiosperms. Four additional CYP families are not included in this collection because 12 they are of limited taxonomic range. Of these, CYP719 is 10 found only in Ranunculales and Aristolochiales, which is 8 one of the ANITA (Amborella-Nymphaeales-Illiciales- 6 Trimeniales-Aristolochiales) basal angiosperms [2]. 4 CYP725 is only found in Taxus but has some overlap with 2 the more common CYP716 family and may be an offshoot 0 of that family. CYP726 is found only in some Euphorbia species producing vernolic acid [3] and it clusters within CYP712 CYP714 CYP715 CYP716 CYP718 CYP720 CYP721 CYP722 CYP723 CYP724 CYP727 CYP728 CYP729 CYP733 CYP734 CYP735 CYP736 CYP746 CYP749 the CYP71 family suggesting that it is part of a CYP71 subfamily. CYP750 is present only in gymnosperms, but is Fig. 1 Cytochrome P450 genes in each family among the six plant genomes. The order of the species in each family block is Arabidopsis, somewhat related to the CYP75 and CYP92 families. The papaya, grape, poplar, rice and moss. The numbers are given in most recent review comparing Arabidopsis and rice CYP Supplementary Table 1. The number of genes tends to decrease at genes is [4] and the most recent review comparing moss higher CYP family numbers that were historically discovered and and angiosperm CYP genes is [1]. named later The recently sequenced genomes of grapevine and papaya have added 457 P450 genes to the expanding synthesis of resveratrol and monoterpenes that contribute to collection of plant sequences. With both being fruiting the aroma of grapes, wine and foods derived from them. woody plants and important crops, grapevine CYP genes Despite the obvious expansion of its secondary metabolic are especially interesting since this plant has been cultivated pathways, no mention of the multiplicity of grape P450 for thousands of years and selected for flavors and colors sequences has been included in this analysis except in their generated by many P450-mediated biosynthetic steps. As Supplemental Table S9 [5]. More recent analysis of a noted in the French-Italian Consortium paper on the inbred heterozygous Pinot Noir ENTAV 115 strain [6] identified PN40024 grape genome [5], the grape genome contains 43 21 stilbene synthases and 35 terpene synthases in this stilbene synthases and 89 terpene synthases important in the genome with the large differences in gene counts for these Tropical Plant Biol. two protein families suggesting that the initial grape cus 0.2 genome paper counted alleles for these loci as separate entities. The discussion of CYP genes in [6] focused on five 97C1 tomato enzymes in the phenolic and terpenoid pathways, including olvox 97C9 carrot 97C12 Ostreococ cinnamate 4-hydroxylase (C4H/CYP73, three genes in 97C5 moss ko Supplemental Table S4), flavonoid 3’-hydroxylase (F3’H/ 97B17 V CYP75B, one gene), flavonoid 3’,5’-hydroxylase (F3’5’H/ 97B8 moss 97B12 ging 97B7 poplar CYP75A, ten genes) and two unidentified P450s in Fig. 4b. 97B15 grape Other than cursory asides on CYP genes in the text, Velasco 97C12 grape 97B2 soybean C B 97C4 poplar O et al. Supplemental Table S5 lists five CYP707A genes, and O O E A orthologues of CYP97A3, CYP701A1/GA3 and F 97E4 Phaeodactylum 97E2 Thalassios CYP711A1 [6]. To better appreciate the divergence of the 97E1 Skeletone ira plant CYP superfamily, more detailed analyses are clearly 97E6 Ostreococ justified. 97A8 moss 97F3 Phae ma 97F2 Euglena 97A13 V 97F1 Thalassiosira cus 1 grape odactylum 97A1 olvox An Evolutionary Tour of Plant CYP Genes from most 97A14 Ostreococcus Ancient to most Recent CYP Gene Families Found in all Six Plant Genomes, 97A5 Chlamydomonas the Core CYPs of Land Plants Fig. 2 An unrooted Neighbor-joining tree of CYP97 plant sequences. Sequences were aligned using the program ClustalW and the NJ tree The first of several CYP gene categories to be discussed are was computed using the Phylip package those highly conserved orthologs that exist in a clear 1:1:1:1:1:1 pattern across all six species (Fig. 1, Supple- hypothetical proteins that are not syntenic with the genes mental Tables S1 and S2). The only three genes in this most around rice CYP51G1 and the percent sequence identity highly conserved group are CYP97A [7] and CYP97C [8], with Arabidopsis CYP51G1 is only 55% compared to 76% which are carotenoid hydroxylases, and the closely related for the ortholog CYP51G1 pair. In addition to these CYP97B, whose function has not yet been identified. The uncharacterized CYP51G subfamily members, rice is oldest of these three genes is CYP97B as evidenced by its unusual in having a divergent CYP51H subfamily with higher phylogenetic identity with the CYP97E and eight intact genes. A paralog of these, oat (Avena strigosa) CYP97F sequences of diatoms and other protists (Fig. 2 CYP51H10 is synonomous with SAD2 (saponin-deficient and Supplemental Fig.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    20 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us