University of Southampton Bioinformatic

Total Page:16

File Type:pdf, Size:1020Kb

University of Southampton Bioinformatic University of Southampton Faculty of Environmental and Life Sciences Biological Sciences Bioinformatic Analysis of Human Next Generation Sequencing Data; extracting Additional Information, Optimising Mapping and Variant Calling, and Application in a Rare Disease Volume 1 of 1 By Roshan Kumar Sood A thesis presented for the degree of Doctor of Philosophy May 2019 Bioinformatic Analysis of Human NGS Data; extracting Additional Info., Optimising Mapping & Variant Calling, Application in Rare Disease Roshan Kumar Sood Page 2 University of Southampton Abstract Faculty of Environmental and Life Science Biological Sciences A thesis presented for the degree of Doctor of Philosophy Bioinformatic Analysis of Human Next Generation Sequencing Data; extracting Additional Information, Optimising Mapping and Variant Calling, and Application in a Rare Disease by Roshan Kumar Sood With the increased application of Next Generation Sequencing (NGS) to medicine it is important to test and develop approaches to extract the optimum information from datasets. In this thesis five aspects of NGS are investigated ranging from quality control to variant calling. Firstly a method to estimate contamination from a VCF file was developed which would be useful in cases where no BAM file was available to use existing tools. Unmapped reads were investigated to extract additional information from NGS samples and were able to detect the abundance of oral microbes from saliva samples relative to blood collected samples, but failed to identify differences between inflammatory bowel disease patients and controls. For a familial trio with a reported rare case of Sedaghatian-type spondylometaphyseal dysplasia (SSMD) sequenced both by whole exome (WES) and genome (WGS) sequencing it was shown that nearly all coding variants from WES were called in WGS despite differences in mean depth of coverage. This comparison highlighted that as sequencing costs decrease WGS will offer the greatest diagnostic value with potential for future re-analysis of cases currently unable to be resolved. Using the familial trio attempts were made to identify causal variant(s) in the gene currently implicated in causing SSMD { Glutathione peroxidase 4 (GPX4 ). However no variants either small SNPs or large structural were identified over the GPX4 gene and no plausible candidates were identified from the trio. Finally variant calling of the FCGR low affinity locus was performed using targeted NGS. FCGR genes have been highly duplicated and so by using customised references it was possible to infer the combinations of alleles across homologous sites. Using this approach it was possible to predict SNPs in the FCGR3B gene and predict human neutrophil antigen haplotypes involved in the immune response to treatments such as monoclonal antibodies. Bioinformatic Analysis of Human NGS Data; extracting Additional Info., Optimising Mapping & Variant Calling, Application in Rare Disease Roshan Kumar Sood Page 2 Contents 1 Introduction 25 1.1 Next generation sequencing . 25 1.1.1 First generation . 25 1.1.2 Second generation . 27 1.1.3 Third generation . 33 1.2 Sequencing projects . 35 1.2.1 Human Genome Project . 35 1.2.2 dbSNP . 36 1.2.3 HapMap . 36 1.2.4 ENCODE project . 36 1.2.5 1,000 Genomes . 38 1.2.6 Other sequencing projects . 39 1.2.7 100,000 Genomes project . 40 1.3 Software pipelines . 41 1.3.1 Pre-processing . 42 1.3.2 Read mapping or assembly . 44 1.3.3 Human reference sequence . 45 1.3.4 Post-mapping . 46 1.3.5 Variant calling . 46 1.3.6 Variant annotation . 48 1.3.7 Structural & copy number variants . 50 1.4 Aims . 53 2 Estimating contamination levels in exome sequencing using alternate allele frequencies and variant zygosity 55 3 Bioinformatic Analysis of Human NGS Data; extracting Additional Info., Optimising Mapping & Variant Calling, Application in Rare Disease Roshan Kumar Sood 2.0.1 Contamination estimation tools . 57 2.0.2 Alternate allele frequency changes with contamination . 59 2.0.3 Applications of machine learning with NGS . 63 2.0.4 Regression models . 63 2.1 Aims . 65 2.2 Materials & methods . 66 2.2.1 Contamination simulations . 66 2.2.2 Alignment and variant calling pipeline . 68 2.2.3 Alternate allele frequency profiles . 69 2.2.4 Measurements used for contamination estimation . 71 2.2.5 Investigating relationships of measurements . 72 2.2.6 Principal component & clustering analysis . 72 2.2.7 Regression analysis - model selection and training . 72 2.2.8 Application of regression models . 73 2.3 Results . 74 2.3.1 Alternate allele frequency profiles and measurements used in contamination estimation . 74 2.3.2 Principal component and clustering analysis . 82 2.3.3 Training regression models . 84 2.3.4 Application of regression models . 93 2.4 Discussion . 105 2.5 Conclusions . 111 3 Unmapped reads provide insights to potentially clinically important information and can distinguish collection methods 113 3.1 Introduction . 113 3.1.1 Unmapped reads . 113 3.1.2 Cross-species contamination . 114 3.1.3 Microbiome information from unmapped reads . 115 3.1.4 Analysis programs . 118 3.1.5 MEGAN6 . 119 3.1.6 Sequencing methods . 119 3.2 Aims . 120 Page 4 Bioinformatic Analysis of Human NGS Data; extracting Additional Info., Optimising Mapping & Variant Calling, Application in Rare Disease Roshan Kumar Sood 3.3 Materials & methods . 121 3.3.1 Samples . 121 3.3.2 Extraction of unmapped reads . 123 3.3.3 FastQC . 124 3.3.4 BLAST classification of sequences . 124 3.3.5 Creation of taxonomic trees . 126 3.3.6 Clustering analysis . 127 3.3.7 Tandem repeat finder . 127 3.3.8 Depth of coverage across bacterial genomes . 127 3.3.9 Calculating sequence similarity between species . 127 3.3.10 Plotting of bacterial genomes . 128 3.4 Results . 129 3.4.1 Extraction & classification of unmapped reads from exome case and control samples . 129 3.4.2 Comparing collection methods . 140 3.4.3 Unmapped reads of RNA-SEQ and WES from the same individual145 3.4.4 Comparison of 16S rRNA sequencing with unmapped WES reads mapped to bacteria . 154 3.4.5 Investigating cronobacter sakazakii read matches . 155 3.4.6 Mapping unmapped reads back against the human genome . 161 3.4.7 Unclassifiable reads . 164 3.5 Discussion . 165 3.6 Conclusions . 174 4 Comparing variant calling using a whole exome and genome sequencing trio 175 4.1 Introduction . 175 4.1.1 Non-coding variant annotation . 176 4.1.2 Limitations of WES compared to WGS . 179 4.1.3 Variant calling with GATK HaplotypeCaller . 182 4.1.4 ACMG genes . 183 4.2 Aims . 183 4.3 Materials & methods . 184 Page 5 Bioinformatic Analysis of Human NGS Data; extracting Additional Info., Optimising Mapping & Variant Calling, Application in Rare Disease Roshan Kumar Sood 4.3.1 Samples & sequencing . 184 4.3.2 Quality control . 185 4.3.3 Sample coverage . 187 4.3.4 Variant calling . 188 4.3.5 Annotation of VCF variants . 188 4.3.6 Annotation of non-coding variants . 190 4.3.7 GEMINI variant analysis . 190 4.3.8 Structural and copy number variant pipeline . 191 4.3.9 Comparison of exome and genome variants . 194 4.4 Results . 195 4.4.1 Quality control . 195 4.4.2 Variant calling . 202 4.4.3 Non-coding variants . 211 4.4.4 GEMINI variant calls . 214 4.4.5 Structural & copy number variants . 217 4.4.6 Comparing whole exome with whole genome sequencing . 226 4.5 Discussion . 234 4.6 Conclusions . 241 5 Prioritisation of trio called variants 243 5.1 Introduction . 243 5.1.1 Using NGS to identify disease causing variants . 243 5.1.2 Sedaghatian-type SpondyloMetaphyseal Dysplasia . 243 5.1.3 Short-Rib polydactyly syndromes . 246 5.2 Aims . 251 5.3 Materials & methods . 252 5.3.1 Family & sequencing . 252 5.3.2 Variant calling . 253 5.3.3 GPX4 variant, transcription factors and binding sites . 253 5.3.4 GPX4 coverage . 254 5.3.5.
Recommended publications
  • Evolution, Expression and Meiotic Behavior of Genes Involved in Chromosome Segregation of Monotremes
    G C A T T A C G G C A T genes Article Evolution, Expression and Meiotic Behavior of Genes Involved in Chromosome Segregation of Monotremes Filip Pajpach , Linda Shearwin-Whyatt and Frank Grützner * School of Biological Sciences, The University of Adelaide, Adelaide, SA 5005, Australia; fi[email protected] (F.P.); [email protected] (L.S.-W.) * Correspondence: [email protected] Abstract: Chromosome segregation at mitosis and meiosis is a highly dynamic and tightly regulated process that involves a large number of components. Due to the fundamental nature of chromosome segregation, many genes involved in this process are evolutionarily highly conserved, but duplica- tions and functional diversification has occurred in various lineages. In order to better understand the evolution of genes involved in chromosome segregation in mammals, we analyzed some of the key components in the basal mammalian lineage of egg-laying mammals. The chromosome passenger complex is a multiprotein complex central to chromosome segregation during both mitosis and meio- sis. It consists of survivin, borealin, inner centromere protein, and Aurora kinase B or C. We confirm the absence of Aurora kinase C in marsupials and show its absence in both platypus and echidna, which supports the current model of the evolution of Aurora kinases. High expression of AURKBC, an ancestor of AURKB and AURKC present in monotremes, suggests that this gene is performing all necessary meiotic functions in monotremes. Other genes of the chromosome passenger complex complex are present and conserved in monotremes, suggesting that their function has been preserved Citation: Pajpach, F.; in mammals.
    [Show full text]
  • Lecture9'21 Chromatin II
    Genetic Organization -Chromosomal Arrangement: From Form to Function. Chapters 9 & 10 in Genes XI The Eukaryotic chromosome – Organized Structures -banding – Centromeres – Telomeres – Nucleosomes – Euchromatin / Heterochromatin – Higher Orders of Chromosomal Structure 2 Heterochromatin differs from euchromatin in that heterochromatin is effectively inert; remains condensed during interphase; is transcriptionally repressed; replicates late in S phase and may be localized to the centromere or nuclear periphery Facultative heterochromatin is not restricted by pre-designated sequence; genes that are moved within or near heterochromatic regions can become inactivated as a result of their new location. Heterochromatin differs from euchromatin in that heterochromatin is effectively inert; remains condensed during interphase; is transcriptionally repressed; replicates late in S phase and may be localized to the centromere or nuclear periphery Facultative heterochromatin is not restricted by pre-designated sequence; genes that are moved within or near heterochromatic regions can become inactivated as a result of their new location. Chromatin inactivation (or heterochromatin formation) occurs by the addition of proteins to the nucleosomal fiber. May be due to: Chromatin condensation -making it inaccessible to transcriptional apparatus Proteins that accumulate and inhibit accessibility to the regulatory sequences Proteins that directly inhibit transcription Chromatin Is Fundamentally Divided into Euchromatin and Heterochromatin • Individual chromosomes can be seen only during mitosis. • During interphase, the general mass of chromatin is in the form of euchromatin, which is slightly less tightly packed than mitotic chromosomes. TF20210119 Regions of compact heterochromatin are clustered near the nucleolus and nuclear membrane Photo courtesy of Edmund Puvion, Centre National de la Recherche Scientifique Chromatin: Basic Structures • nucleosome – The basic structural subunit of chromatin, consisting of ~200 bp of DNA wrapped around an octamer of histone proteins.
    [Show full text]
  • Antibodies Products
    Chapter 2 : Gentaur Products List • Human Signal peptidase complex catalytic subunit • Human Sjoegren syndrome nuclear autoantigen 1 SSNA1 • Human Small proline rich protein 2A SPRR2A ELISA kit SEC11A SEC11A ELISA kit SpeciesHuman ELISA kit SpeciesHuman SpeciesHuman • Human Signal peptidase complex catalytic subunit • Human Sjoegren syndrome scleroderma autoantigen 1 • Human Small proline rich protein 2B SPRR2B ELISA kit SEC11C SEC11C ELISA kit SpeciesHuman SSSCA1 ELISA kit SpeciesHuman SpeciesHuman • Human Signal peptidase complex subunit 1 SPCS1 ELISA • Human Ski oncogene SKI ELISA kit SpeciesHuman • Human Small proline rich protein 2D SPRR2D ELISA kit kit SpeciesHuman • Human Ski like protein SKIL ELISA kit SpeciesHuman SpeciesHuman • Human Signal peptidase complex subunit 2 SPCS2 ELISA • Human Skin specific protein 32 C1orf68 ELISA kit • Human Small proline rich protein 2E SPRR2E ELISA kit kit SpeciesHuman SpeciesHuman SpeciesHuman • Human Signal peptidase complex subunit 3 SPCS3 ELISA • Human SLAIN motif containing protein 1 SLAIN1 ELISA kit • Human Small proline rich protein 2F SPRR2F ELISA kit kit SpeciesHuman SpeciesHuman SpeciesHuman • Human Signal peptide CUB and EGF like domain • Human SLAIN motif containing protein 2 SLAIN2 ELISA kit • Human Small proline rich protein 2G SPRR2G ELISA kit containing protein 2 SCUBE2 ELISA kit SpeciesHuman SpeciesHuman SpeciesHuman • Human Signal peptide CUB and EGF like domain • Human SLAM family member 5 CD84 ELISA kit • Human Small proline rich protein 3 SPRR3 ELISA kit containing protein
    [Show full text]
  • Single Cell Derived Clonal Analysis of Human Glioblastoma Links
    SUPPLEMENTARY INFORMATION: Single cell derived clonal analysis of human glioblastoma links functional and genomic heterogeneity ! Mona Meyer*, Jüri Reimand*, Xiaoyang Lan, Renee Head, Xueming Zhu, Michelle Kushida, Jane Bayani, Jessica C. Pressey, Anath Lionel, Ian D. Clarke, Michael Cusimano, Jeremy Squire, Stephen Scherer, Mark Bernstein, Melanie A. Woodin, Gary D. Bader**, and Peter B. Dirks**! ! * These authors contributed equally to this work.! ** Correspondence: [email protected] or [email protected]! ! Supplementary information - Meyer, Reimand et al. Supplementary methods" 4" Patient samples and fluorescence activated cell sorting (FACS)! 4! Differentiation! 4! Immunocytochemistry and EdU Imaging! 4! Proliferation! 5! Western blotting ! 5! Temozolomide treatment! 5! NCI drug library screen! 6! Orthotopic injections! 6! Immunohistochemistry on tumor sections! 6! Promoter methylation of MGMT! 6! Fluorescence in situ Hybridization (FISH)! 7! SNP6 microarray analysis and genome segmentation! 7! Calling copy number alterations! 8! Mapping altered genome segments to genes! 8! Recurrently altered genes with clonal variability! 9! Global analyses of copy number alterations! 9! Phylogenetic analysis of copy number alterations! 10! Microarray analysis! 10! Gene expression differences of TMZ resistant and sensitive clones of GBM-482! 10! Reverse transcription-PCR analyses! 11! Tumor subtype analysis of TMZ-sensitive and resistant clones! 11! Pathway analysis of gene expression in the TMZ-sensitive clone of GBM-482! 11! Supplementary figures and tables" 13" "2 Supplementary information - Meyer, Reimand et al. Table S1: Individual clones from all patient tumors are tumorigenic. ! 14! Fig. S1: clonal tumorigenicity.! 15! Fig. S2: clonal heterogeneity of EGFR and PTEN expression.! 20! Fig. S3: clonal heterogeneity of proliferation.! 21! Fig.
    [Show full text]
  • Supplementary Table 1
    Supplementary Table 1. 492 genes are unique to 0 h post-heat timepoint. The name, p-value, fold change, location and family of each gene are indicated. Genes were filtered for an absolute value log2 ration 1.5 and a significance value of p ≤ 0.05. Symbol p-value Log Gene Name Location Family Ratio ABCA13 1.87E-02 3.292 ATP-binding cassette, sub-family unknown transporter A (ABC1), member 13 ABCB1 1.93E-02 −1.819 ATP-binding cassette, sub-family Plasma transporter B (MDR/TAP), member 1 Membrane ABCC3 2.83E-02 2.016 ATP-binding cassette, sub-family Plasma transporter C (CFTR/MRP), member 3 Membrane ABHD6 7.79E-03 −2.717 abhydrolase domain containing 6 Cytoplasm enzyme ACAT1 4.10E-02 3.009 acetyl-CoA acetyltransferase 1 Cytoplasm enzyme ACBD4 2.66E-03 1.722 acyl-CoA binding domain unknown other containing 4 ACSL5 1.86E-02 −2.876 acyl-CoA synthetase long-chain Cytoplasm enzyme family member 5 ADAM23 3.33E-02 −3.008 ADAM metallopeptidase domain Plasma peptidase 23 Membrane ADAM29 5.58E-03 3.463 ADAM metallopeptidase domain Plasma peptidase 29 Membrane ADAMTS17 2.67E-04 3.051 ADAM metallopeptidase with Extracellular other thrombospondin type 1 motif, 17 Space ADCYAP1R1 1.20E-02 1.848 adenylate cyclase activating Plasma G-protein polypeptide 1 (pituitary) receptor Membrane coupled type I receptor ADH6 (includes 4.02E-02 −1.845 alcohol dehydrogenase 6 (class Cytoplasm enzyme EG:130) V) AHSA2 1.54E-04 −1.6 AHA1, activator of heat shock unknown other 90kDa protein ATPase homolog 2 (yeast) AK5 3.32E-02 1.658 adenylate kinase 5 Cytoplasm kinase AK7
    [Show full text]
  • Mammalian Germ Cells Are Determined After PGC Colonization of the Nascent Gonad
    Mammalian germ cells are determined after PGC colonization of the nascent gonad Peter K. Nichollsa, Hubert Schorlea,b, Sahin Naqvia,c, Yueh-Chiang Hua,d,e, Yuting Fana,f, Michelle A. Carmella, Ina Dobrinskig, Adrienne L. Watsonh, Daniel F. Carlsonh, Scott C. Fahrenkrugh, and David C. Pagea,c,i,1 aWhitehead Institute, Cambridge, MA 02142; bDepartment of Developmental Pathology, Institute of Pathology, University of Bonn Medical School, 53127 Bonn, Germany; cDepartment of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139; dDivision of Developmental Biology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH 45229; eDepartment of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH 45267; fReproductive Medicine Center, Sixth Affiliated Hospital, Sun Yat-sen University, 510655 Guangzhou, China; gDepartment of Comparative Biology & Experimental Medicine, Faculty of Veterinary Medicine, University of Calgary, Calgary, AB T2N 4N1, Canada; hRecombinetics, Inc., Saint Paul, MN 55104; and iHoward Hughes Medical Institute, Whitehead Institute, Cambridge, MA 02142 Contributed by David C. Page, October 15, 2019 (sent for review June 28, 2019; reviewed by Katherine L. Nathanson and Dustin L. Updike) Mammalian primordial germ cells (PGCs) are induced in the embry- transplanted to ectopic sites (8) and give rise to pluripotent cell onic epiblast, before migrating to the nascent gonads. In fish, lines in culture (9–11). It has also been suggested that pre- frogs, and birds, the germline segregates even earlier, through the sumptive PGCs (labeled genetically by Prdm1-Cre) in the pos- action of maternally inherited germ plasm. Across vertebrates, terior region of the embryo during allantoic elongation may migrating PGCs retain a broad developmental potential, regardless contribute to nongametogenic lineages (12, 13).
    [Show full text]
  • A Temporally Controlled Sequence of X-Chromosome Inactivation and Reactivation Defines Female Mouse in Vitro Germ Cells with Meiotic Potential
    bioRxiv preprint doi: https://doi.org/10.1101/2021.08.11.455976; this version posted August 11, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. A temporally controlled sequence of X-chromosome inactivation and reactivation defines female mouse in vitro germ cells with meiotic potential Jacqueline Severino1†, Moritz Bauer1,9†, Tom Mattimoe1, Niccolò Arecco1, Luca Cozzuto1, Patricia Lorden2, Norio Hamada3, Yoshiaki Nosaka4,5,6, So Nagaoka4,5,6, Holger Heyn2, Katsuhiko Hayashi7, Mitinori Saitou4,5,6 and Bernhard Payer1,8* Abstract The early mammalian germ cell lineage is characterized by extensive epigenetic reprogramming, which is required for the maturation into functional eggs and sperm. In particular, the epigenome needs to be reset before parental marks can be established and then transmitted to the next generation. In the female germ line, reactivation of the inactive X- chromosome is one of the most prominent epigenetic reprogramming events, and despite its scale involving an entire chromosome affecting hundreds of genes, very little is known about its kinetics and biological function. Here we investigate X-chromosome inactivation and reactivation dynamics by employing a tailor-made in vitro system to visualize the X-status during differentiation of primordial germ cell-like cells (PGCLCs) from female mouse embryonic stem cells (ESCs). We find that the degree of X-inactivation in PGCLCs is moderate when compared to somatic cells and characterized by a large number of genes escaping full inactivation.
    [Show full text]
  • Impairment of Spermatogenesis and Sperm Motility by the High-Fat Diet
    Gut microbiota ORIGINAL RESEARCH Gut: first published as 10.1136/gutjnl-2019-319127 on 2 January 2020. Downloaded from Impairment of spermatogenesis and sperm motility by the high- fat diet- induced dysbiosis of gut microbes Ning Ding ,1 Xin Zhang,2 Xue Di Zhang,1 Jun Jing,3,4 Shan Shan Liu,5 Yun Ping Mu,1 Li Li Peng ,6 Yun Jing Yan,1 Geng Miao Xiao,1 Xin Yun Bi,1 Hao Chen,1 Fang Hong Li,1 Bing Yao,3,4 Allan Z Zhao1 ► Additional material is ABSTRact published online only. To view, Objective High- fat diet (HFD)- induced metabolic Significance of this study please visit the journal online disorders can lead to impaired sperm production. We aim (http:// dx. doi. org/ 10. 1136/ What is already known on this subject? gutjnl- 2019- 319127). to investigate if HFD- induced gut microbiota dysbiosis can functionally influence spermatogenesis and sperm ► High- fat diet (HFD) leads to obesity and 1The School of Biomedical motility. metabolic syndromes and affects the functions and Pharmaceutical Sciences, in reproductive system. Guangdong University of Design Faecal microbes derived from the HFD- fed or Technology, Guangzhou, normal diet (ND)- fed male mice were transplanted to the ► HFD- induced gut microbiota dysbiosis and Guangdong, China endotoxaemia have been reported. 2 mice maintained on ND. The gut microbes, sperm count Dizal Pharma, Shanghai, China Emerging evidence demonstrated that gut 3 and motility were analysed. Human faecal/semen/blood ► Jinling Hospital Department microbiota dysbiosis is closely associated Reproductive Medical Center, samples were collected to assess microbiota, sperm Nanjing Medicine University, quality and endotoxin.
    [Show full text]
  • The Characterization of Cell Surface Receptor Complexes by Affinity Chromatography, Liquid Chromatography and Tandem Mass Spectrometry
    The Characterization of Cell Surface Receptor Complexes by Affinity Chromatography, Liquid Chromatography and Tandem Mass Spectrometry by Jaimie Dufresne BSc, MSc, Ryerson University A dissertation presented to Ryerson University in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Program of Molecular Science Toronto, Ontario, Canada, 2017 © Jaimie Dufresne 2017 AUTHOR'S DECLARATION FOR ELECTRONIC SUBMISSION OF A DISSERTATION I hereby declare that I am the sole author of this dissertation. This is a true copy of the dissertation, including any required final revisions, as accepted by my examiners. I authorize Ryerson University to lend this dissertation to other institutions or individuals for the purpose of scholarly research. I further authorize Ryerson University to reproduce this dissertation by photocopying or by other means, in total or in part, at the request of other institutions or individuals for the purpose of scholarly research. I understand that my dissertation may be made electronically available to the public. ii Abstract Cell surface receptors are of critical importance to the treatment of disease but are difficult to isolate and identify by classical approaches. Here, a robust and general method for capturing a receptor complex from the surface of live cells with ligands presented on nanoscopic beads is demonstrated. Two forms of affinity chromatography: the presentation of a biotinylated ligand to the surface of live cells and recovered by classical affinity chromatography was compared to the presentation of the ligand on the surface of nanoscopic chromatography beads for the isolation of the IgG-FcR complex from the surface of live cells.
    [Show full text]
  • Content Based Search in Gene Expression Databases and a Meta-Analysis of Host Responses to Infection
    Content Based Search in Gene Expression Databases and a Meta-analysis of Host Responses to Infection A Thesis Submitted to the Faculty of Drexel University by Francis X. Bell in partial fulfillment of the requirements for the degree of Doctor of Philosophy November 2015 c Copyright 2015 Francis X. Bell. All Rights Reserved. ii Acknowledgments I would like to acknowledge and thank my advisor, Dr. Ahmet Sacan. Without his advice, support, and patience I would not have been able to accomplish all that I have. I would also like to thank my committee members and the Biomed Faculty that have guided me. I would like to give a special thanks for the members of the bioinformatics lab, in particular the members of the Sacan lab: Rehman Qureshi, Daisy Heng Yang, April Chunyu Zhao, and Yiqian Zhou. Thank you for creating a pleasant and friendly environment in the lab. I give the members of my family my sincerest gratitude for all that they have done for me. I cannot begin to repay my parents for their sacrifices. I am eternally grateful for everything they have done. The support of my sisters and their encouragement gave me the strength to persevere to the end. iii Table of Contents LIST OF TABLES.......................................................................... vii LIST OF FIGURES ........................................................................ xiv ABSTRACT ................................................................................ xvii 1. A BRIEF INTRODUCTION TO GENE EXPRESSION............................. 1 1.1 Central Dogma of Molecular Biology........................................... 1 1.1.1 Basic Transfers .......................................................... 1 1.1.2 Uncommon Transfers ................................................... 3 1.2 Gene Expression ................................................................. 4 1.2.1 Estimating Gene Expression ............................................ 4 1.2.2 DNA Microarrays ......................................................
    [Show full text]
  • Mitotic Checkpoints and Chromosome Instability Are Strong Predictors of Clinical Outcome in Gastrointestinal Stromal Tumors
    MITOTIC CHECKPOINTS AND CHROMOSOME INSTABILITY ARE STRONG PREDICTORS OF CLINICAL OUTCOME IN GASTROINTESTINAL STROMAL TUMORS. Pauline Lagarde1,2, Gaëlle Pérot1, Audrey Kauffmann3, Céline Brulard1, Valérie Dapremont2, Isabelle Hostein2, Agnès Neuville1,2, Agnieszka Wozniak4, Raf Sciot5, Patrick Schöffski4, Alain Aurias1,6, Jean-Michel Coindre1,2,7 Maria Debiec-Rychter8, Frédéric Chibon1,2. Supplemental data NM cases deletion frequency. frequency. deletion NM cases Mand between difference the highest setswith of theprobe a view isdetailed panel Bottom frequently. sorted totheless deleted theprobe are frequently from more and thefrequency deletion represent Yaxes inblue. are cases (NM) metastatic for non- frequencies Corresponding inmetastatic (red). probe (M)cases sets figureSupplementary 1: 100 100 20 40 60 80 20 40 60 80 0 0 chr14 1 chr14 88 chr14 175 chr14 262 chr9 -MTAP 349 chr9 -MTAP 436 523 chr9-CDKN2A 610 Histogram presenting the 2000 more frequently deleted deleted frequently the 2000 more presenting Histogram chr9-CDKN2A 697 chr9-CDKN2A 784 chr9-CDKN2B 871 chr9-CDKN2B 958 chr9-CDKN2B 1045 chr22 1132 chr22 1219 chr22 1306 chr22 1393 1480 1567 M NM 1654 1741 1828 1915 M NM GIST14 GIST2 GIST16 GIST3 GIST19 GIST63 GIST9 GIST38 GIST61 GIST39 GIST56 GIST37 GIST47 GIST58 GIST28 GIST5 GIST17 GIST57 GIST47 GIST58 GIST28 GIST5 GIST17 GIST57 CDKN2A Supplementary figure 2: Chromosome 9 genomic profiles of the 18 metastatic GISTs (upper panel). Deletions and gains are indicated in green and red, respectively; and color intensity is proportional to copy number changes. A detailed view is given (bottom panel) for the 6 cases presenting a homozygous 9p21 deletion targeting CDKN2A locus (dark green).
    [Show full text]
  • Dual Functions for Insulinoma-Associated 1 in Retinal Development
    University of Kentucky UKnowledge Theses and Dissertations--Biology Biology 2015 DUAL FUNCTIONS FOR INSULINOMA-ASSOCIATED 1 IN RETINAL DEVELOPMENT Marie A. Forbes-Osborne University of Kentucky, [email protected] Right click to open a feedback form in a new tab to let us know how this document benefits ou.y Recommended Citation Forbes-Osborne, Marie A., "DUAL FUNCTIONS FOR INSULINOMA-ASSOCIATED 1 IN RETINAL DEVELOPMENT" (2015). Theses and Dissertations--Biology. 31. https://uknowledge.uky.edu/biology_etds/31 This Doctoral Dissertation is brought to you for free and open access by the Biology at UKnowledge. It has been accepted for inclusion in Theses and Dissertations--Biology by an authorized administrator of UKnowledge. For more information, please contact [email protected]. STUDENT AGREEMENT: I represent that my thesis or dissertation and abstract are my original work. Proper attribution has been given to all outside sources. I understand that I am solely responsible for obtaining any needed copyright permissions. I have obtained needed written permission statement(s) from the owner(s) of each third-party copyrighted matter to be included in my work, allowing electronic distribution (if such use is not permitted by the fair use doctrine) which will be submitted to UKnowledge as Additional File. I hereby grant to The University of Kentucky and its agents the irrevocable, non-exclusive, and royalty-free license to archive and make accessible my work in whole or in part in all forms of media, now or hereafter known. I agree that the document mentioned above may be made available immediately for worldwide access unless an embargo applies.
    [Show full text]