Single Nucleotide Polymorphism Identification in Candidate Gene

Single Nucleotide Polymorphism Identification in Candidate Gene

The Pharmacogenomics Journal (2001) 1, 193–203 2001 Nature Publishing Group All rights reserved 1470-269X/01 $15.00 www.nature.com/tpj ORIGINAL ARTICLE Single nucleotide polymorphism identification in candidate gene systems of obesity K Irizarry1,3 ABSTRACT GHu1,3 We have constructed a large panel of single nucleotide polymorphisms (SNP) 2 identified in 68 candidate genes for obesity. Our panel combines novel SNP M-L Wong identification methods based on EST data, with public SNP data from large- J Licinio2 scale genomic sequencing, to produce a total of 218 SNPs in the coding CJ Lee1,3 regions of obesity candidate genes, 178 SNPs in untranslated regions, and over 1000 intronic SNPs. These include new non-conservative amino acid 1Dept of Chemistry and Biochemistry, changes in thyroid receptor beta, esterase D, acid phosphatase 1. Our data University of California, Los Angeles, CA, USA; show evidence of negative selection among these polymorphisms implying 2Laboratory of Pharmacogenomics, Neuropsychiatric Institute, University of functional impacts of the non-conservative mutations. Comparison of overlap California, Los Angeles, CA, USA; 3UCLA-DOE between SNPs identified independently from EST data vs genomic sequen- Laboratory for Structural Biology and cing indicate that together they may constitute about one half of the actual Molecular Medicine, University of California, total number of amino acid polymorphisms in these genes that are common Los Angeles, CA, USA in the human population (defined here as a population allele frequency Correspondence: above 5%). We have analyzed our polymorphism panel to construct a data- Dr CJ Lee, UCLA-DOELaboratory for base of detailed information about their location in the gene structure and Structural Biology and Molecular effect on protein coding, available on the web at http://www.bioinformat Medicine, University of California, Los ics.ucla.edu/snp/obesity. We believe this panel can serve as a valuable new Angeles, CA 90095–1570, USA Tel: +1 310 825 7374 resource for genetic and pharmacogenomic studies of the causes of obesity. Fax: +1 310 267 0248 The Pharmacogenomics Journal (2001) 1, 193–203. E-mail: leecȰmbi.ucla.edu Keywords: obesity; SNP; neuropeptides; body weight regulation: appetite; polymor- phisms INTRODUCTION The draft sequence of the human genome has now been completed to 93% cover- age, with an estimated total of 32000 genes.1,2 Of those, 22000 have been ident- ified so far by stricter experimental criteria. This is likely to have profound and far-reaching impact on the genetics, pharmacogenetics, and pharmacogenomics of complex diseases. The most interesting question for investigators today is, where to start? While genome-wide single-nucleotide polymorphism (SNP) map- ping has the potential to identify markers associated with treatment responses of common and complex disorders, use of this approach is still in its infancy and has not demonstrated success yet.3 Meanwhile, association approaches have examined associations of specific SNPs or small groups of SNPs to complex traits. With some notable exceptions, those efforts have also not yet borne fruit. An alternative approach would be to identify large batches of SNPs in candidate gene systems, that could be used for genomic and pharmacogenomic studies.4 Obesity provides a logical target for such an SNP discovery effort. Obesity is a common and complex disorder that represents the outcome of gene–environ- ment interactions. In the general population of the US, the prevalence of over- −2 Received: 2 March 2001 weight individuals (body mass index—BMI—between 25 and 29.9 kg m )is Revised: 17 July 2001 34.9% among women and 31.7% among men, and it more than doubles between Accepted: 18 July 2001 the ages of 20 and 55. The incidence of obesity (BMI equal to or higher than 30 SNP identification in candidate gene systems of obesity K Irizarry et al 194 kg m−2) has been conservatively estimated to be at 11%. RESULTS Among women, obesity is strongly associated with socio- Identification of SNPs in Obesity Candidate Genes economic status, being twice as common among those with We based our SNP discovery effort on the Human Gene lower socioeconomic status as it is among those with higher Glossary of the Human Obesity Gene Map (105 candidate status. Weight gain has been described as epidemic among obesity genes; http://www.obesity.chair.ulaval.ca/gloss children and adolescents. The data from the Third National ary.html), as a well-established set of genes for future genetic Health and Nutrition Examination Survey (NHANES III and pharmacogenetic studies of obesity.19 We were able to 1988–1994) indicate that approximately 14% of children map 75 genes from this set to EST clusters from Unigene by and 12% of adolescents are overweight. Among adults, matching gene identifiers. We used these 75 genes for EST- approximately 33% of men and 36% of women were over- based SNP identification as described in this paper. This set weight. Among women, 34% of non-Hispanic whites, 52% included genes encoding neuropeptides and their receptors, of non-Hispanic blacks, and 50% of Mexican Americans transporters, enzymes and regulatory proteins involved in were overweight. Racial/ethnic group-specific variation neurotransmitter synthesis, reuptake, and signaling, as well among men was less than that among women.5 Obesity is as metabolic enzymes. also a well-recognized major risk factor for heart disease. The To identify SNPs in this set of genes, we searched through American Heart Association (AHA) has declared obesity as a the Unigene database of expressed sequence tags (EST) for overlapping cDNA sequence fragments of these genes from ‘major risk factor’ and an expert committee of the National different people. We assembled a total of 9293 sequences Institutes of Health/National Heart Lung Blood Institute (8942 ESTs, 222 mRNAs, and 129 DNAs) for this candidate established obesity as an independent risk factor for heart gene set (Unigene release July 2000), representing 698 disease based on a meta-analysis of over 300 prospectively- libraries. Since most of these libraries were prepared from a controlled randomized trials in literature. A small fraction single individual’s DNA, and a few from multiple individ- of obese patients have been identified who have specific uals, these data represent sequence from more than 700 Mendelian gene mutations that appear to cause obesity. individuals. Sequences from about nine people on average These include mutations in the leptin, POMC, prohormone were available for each gene. From this set of 75 clusters, 45 6–13 convertase 1, and MCR4 receptor genes. contained at least one SNP. The pharmacogenomics of obesity is a new and understu- Based on this gene set, we identified 152 polymorphisms, died area. While some treatments for obesity can have a including 76 cSNPs causing 51 amino acid changes (Table positive outcome for a minority of patients, it is unknown 1). We obtained the original sequencing chromatogram data why some patients respond to pharmacological and/or for the ESTs, and performed a detailed analysis consisting of 14–17 behavioral interventions, while others do not. multiple sequence alignment of overlapping ESTs for each Considerable progress has been achieved on the neurobi- gene, removal of potential paralogous ESTs, and statistical ology of obesity. A variety of new and established candidate scoring of candidate SNPs (see Methods). We selected candi- genes are now known to interact at the central level, parti- date SNPs with a likelihood odds ratio greater than 103 in cularly in the hypothalamus, to regulate food intake, energy favor of a polymorphism as opposed to a sequencing error. expenditure and body weight. The interactions of those We limited this study to candidate SNPs with population gene products and their role in human energy homeostasis allele frequency greater than 1%, and at least one EST obser- have been one of the most exciting areas of biomedical vation of the polymorphic band with good chromatogram investigation in the last decade. For reviews see Refs 18– quality (see Methods), or an observation in a curated 24.18–24 The Human Obesity Gene Map provides an excellent mRNA sequence. resource of candidate genes that may be involved in human Our analysis also estimated population allele frequency obesity.19 We have based our work on this extensive listing for each SNP, reporting an expectation value and 95% rank of candidate genes, to identify polymorphisms that can confidence interval (see Table 2). Population allele fre- serve as useful tools for analyzing the genetic components quencies for the obesity gene SNPs ranged from 3% to 37% of obesity. We have constructed a large panel of single nucleotide Table 1 Obesity gene SNP discovery based on EST data polymorphisms (SNP) in these candidate obesity genes, combining novel SNP identification methods based on EST Analysis stage Genes SNPs data with public SNP data from large-scale genomic sequen- cing. We believe this panel can serve as a valuable new Human Obesity Map gene list 105 resource for genetic and pharmacogenomic studies of the Mapped to Unigene clusters by gene identifier 75 causes of obesity. It includes over 1400 SNPs in obesity can- SNPs identified in EST data 45 152 Untranslated region (UTR) SNPs 27 76 didate genes, as well as a database of detailed information Protein coding region SNPs (cSNPs) 28 76 about their location in the gene structure and effect on Non-conservative amino acid changes 13 22 protein coding, which will be available

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    11 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us