A Dataset for the Analysis of Gene Expression and Alternative Splicing Events in Colon Disease Wilfrido Mojica1, Lesleyann Hawthorn2*

A Dataset for the Analysis of Gene Expression and Alternative Splicing Events in Colon Disease Wilfrido Mojica1, Lesleyann Hawthorn2*

Mojica and Hawthorn BMC Genomics 2010, 11:5 http://www.biomedcentral.com/1471-2164/11/5 RESEARCH ARTICLE Open Access Normal colon epithelium: a dataset for the analysis of gene expression and alternative splicing events in colon disease Wilfrido Mojica1, Lesleyann Hawthorn2* Abstract Background: Studies using microarray analysis of colorectal cancer have been generally beleaguered by the lack of a normal cell population of the same lineage as the tumor cell. One of the main objectives of this study was to generate a reference gene expression data set for normal colonic epithelium which can be used in comparisons with diseased tissues, as well as to provide a dataset that could be used as a baseline for studies in alternative splicing. Results: We present a dependable expression reference data set for non-neoplastic colonic epithelial cells. An enriched population of fresh colon epithelial cells were obtained from non-neoplastic, colectomy specimens and analyzed using Affymetrix GeneChip EXON 1.0 ST arrays. For demonstration purposes, we have compared the data derived from these cells to a publically available set of tumor and matched normal colon data. This analysis allowed an assessment of global gene expression alterations and demonstrated that adjacent normal tissues, with a high degree of cellular heterogeneity, are not always representative of normal cells for comparison to tumors which arise from the colon epithelium. We also examined alternative splicing events in tumors compared to normal colon epithelial cells. Conclusions: The findings from this study represent the first comprehensive expression profile for non-neoplastic colonic epithelial cells reported. Our analysis of splice variants illustrate that this is a very labor intensive procedure, requiring vigilant examination of the data. It is projected that the contribution of this set of data derived from pure colonic epithelial cells will enhance studies in colon-related disease and offer a vital baseline for studies aimed at elucidating the mechanisms of alternative splicing. Background biomarkers, and lead to the discovery of new therapeutic Colorectal cancer (CRC) is considered to develop as a targets. As with any microarray study, however, tissue result of a multi-step progression and accumulation of heterogeneity is a confounding factor. Previous studies genetic alterations [1]. Gene expression profiling using on CRC have often used “adjacent normal colon” as a microarrays has been used extensively in attempts to control cell type. However, this tissue usually represents identify the genetic events associated with this progres- a mixture of cell types which do not accurately repre- sion. For colon cancer, expression profiling can readily sent the epithelial cell lining of the luminal surface of discriminate between tumor and normal tissue as well the colon that gives rise to CRC. Clearly, a comprehen- as distinguish between tumors of different histopatholo- sive and representative geneexpressionprofileexclu- gical stage and prognosis to some extent [2,3]. It is sively from normal colon epithelial cells would be a anticipated that the identification of these gene expres- great benefit for comparison with tumor cells, especially sion differences may lead to a better understanding of since these cells are not readily cultured. neoplasia, help identify diagnostic and prognostic Here we report a reliable expression reference data set for non-neoplastic colonic epithelial cells. These cells * Correspondence: [email protected] were derived from fresh, unfixed, clinically excised non- 2 Molecular Oncology Program, Medical College of Georgia Cancer Center, tumor related colectomy specimens and enriched using Augusta, GA, USA © 2010 Mojica and Hawthorn; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Mojica and Hawthorn BMC Genomics 2010, 11:5 Page 2 of 14 http://www.biomedcentral.com/1471-2164/11/5 magnetic beads coated with a colon cell specific anti- 6 were female and 4 male, with the ages ranging from body. RNA from these preparations was used to probe 47 to 80 (table 1). The surgeries were performed by 6 Affymetrix GeneChip® EXON 1.0 ST arrays. These different surgeons with four of the specimens excised by arrays not only report gene expression levels but can hand assisted laparoscopy and the other 6 performed by also identify differential splicing events. the conventional open surgical approach. The procure- The findings from this study represent the first com- ment protocol was designed to effectively enrich for the prehensive expression profile for non-neoplastic colonic selected colonic cell population in the shortest time pos- epithelial cells. We have compared the data derived sible to control for variables that may influence the from these colonic epithelial cells to a publically avail- quality and integrity of clinical biospecimens, as able set of tumor and normal matched colon data that described in the Materials and Methods section. The had been hybridized to the same arrays. The dataset is data discussed in this publication have been deposited available from Affymetrix.com http://www.affymetrix. in NCBI’s Gene Expression Omnibus and are accessible com/support/technical/sample_data/exon_array_data. through GEO Series accession number GSE19163 affx. This analysis allowed an assessment of global gene http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi? expression alterations in the tumors and demonstrated acc=GSE19163 that adjacent normal tissues are not always the optimal Gene Expression control. Twelve samples of isolated colonic epithelial cells from Alternative splicing (AS) is the process by which the 10 different patients (two of these samples were repli- exons of primary transcripts (pre-mRNAs) can be cates; see table 1 and figure 1a) were analyzed using the spliced in different arrangements to produce structurally Exon 1.0 ST arrays. These data were then compared and functionally distinct mRNA and protein variants. with a publically available data set http://www.affyme- Alternative splicing leads to massive diversity by gener- trix.com/support/technical/sample_data/exon_array_- ating multiple different mRNAs from a single template. data.affx, [7] which provided raw Exon1.0 ST data for Current estimates indicate that up to 95% of genes in ten colon tumors and matched ‘normal ‘tissues from the the human genome exhibit alternative splicing [4]. same patients. Total RNA from these samples were pur- Furthermore, alternative splicing has increasingly been chased from Biochain Institute Inc. All of the raw data acknowledged as critical during normal development. was imported into PARTEK and the arrays normalized Previous studies of alternative gene splicing have, for using quantile normalization followed by probeset sum- the most part, focused on the identification and charac- marization using RMA. PCA analysis for these data sets terization of the control of splice site selection on a is shown in figure 1a, where it is evident that the NOR- gene-by-gene basis. While these studies have provided MAL and TUMOR tissues from the public data set clus- extensive information about the factors and interactions tered together, while the CELL samples obtained from that control formation of the spliceosome, relatively lit- our enrichment procedure separated distinctly from the tle is known about the global regulatory properties of two other tissue sources. A3-wayANOVAidentified alternative splicing [5,6]. that a major source of variance was due to the sample In this analysis, we also compared AS events in processing differences between the 2 laboratories, tumors and the enriched normal colon epithelial cells. referred to as “SCAN DATE“ (Figure 1b). Patient “SEX“ This analysis of splice variants on a genome-wide scale, however, is very labor intensive, and requires careful supervised examination of the data. It is anticipated that Table 1 Clinical Correlates of Patient Cohort the contribution of this set of data derived from pure Case # Age Sex Location Surgical Approach Diagnosis colonic epithelial cells will facilitate future studies in 1** 55 F Sigmoid Resection Diverticulitis colon-related disease and offer a unique baseline for stu- 2 58 M Sigmoid Resection Diverticulosis dies aimed at elucidating the mechanisms of alternative 3 67 M Sigmoid Laparoscopic Diverticulosis splicing. 4** 58 M Sigmoid Laparoscopic Diverticulosis 5 62 M Sigmoid Laparoscopic Diverticulitis Results 6 80 F Sigmoid Resection Diverticulosis One of the main objectives of this study was to generate 7 47 F Sigmoid Resection Diverticulosis a reference gene expression data set for normal colonic 8 65 F Sigmoid Resection Diverticulosis epithelium which can be used in comparisons with 9 62 F Sigmoid Resection Diverticulosis tumor samples. Non-adenomatous and non-neoplastic 10 58 F Sigmoid Laparoscopic Diverticultis colonic epithelial cells were procured from 10 different Demographics of patients from where the enriched samples of non-neoplastic colectomy specimens with the clinical indication for colonic epithelial cells were derived. ** Indicate those samples on which each case being diverticular disease. Of the 10 patients, replicates were performed. Mojica and Hawthorn BMC Genomics 2010, 11:5 Page 3 of 14 http://www.biomedcentral.com/1471-2164/11/5

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    14 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us