INTRODUCING a NOVEL METHOD for GENETIC ANALYSIS of AUTISM SPECTRUM DISORDER Sepideh Nouri

Total Page:16

File Type:pdf, Size:1020Kb

INTRODUCING a NOVEL METHOD for GENETIC ANALYSIS of AUTISM SPECTRUM DISORDER Sepideh Nouri The Texas Medical Center Library DigitalCommons@TMC The University of Texas MD Anderson Cancer Center UTHealth Graduate School of The University of Texas MD Anderson Cancer Biomedical Sciences Dissertations and Theses Center UTHealth Graduate School of (Open Access) Biomedical Sciences 12-2013 INTRODUCING A NOVEL METHOD FOR GENETIC ANALYSIS OF AUTISM SPECTRUM DISORDER sepideh nouri Follow this and additional works at: https://digitalcommons.library.tmc.edu/utgsbs_dissertations Part of the Bioinformatics Commons, Computational Biology Commons, and the Genomics Commons Recommended Citation nouri, sepideh, "INTRODUCING A NOVEL METHOD FOR GENETIC ANALYSIS OF AUTISM SPECTRUM DISORDER" (2013). The University of Texas MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences Dissertations and Theses (Open Access). 417. https://digitalcommons.library.tmc.edu/utgsbs_dissertations/417 This Thesis (MS) is brought to you for free and open access by the The University of Texas MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences at DigitalCommons@TMC. It has been accepted for inclusion in The University of Texas MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences Dissertations and Theses (Open Access) by an authorized administrator of DigitalCommons@TMC. For more information, please contact [email protected]. INTRODUCING A NOVEL METHOD FOR GENETIC ANALYSIS OF AUTISM SPECTRUM DISORDER by Sepideh Nouri, M.S. APPROVED: Eric Boerwinkle, Ph.D., Supervisor Kim-Anh Do, Ph.D. Alanna Morrison, Ph.D. James Hixson, Ph.D. Paul Scheet, Ph.D. APPROVED: Dean, The University of Texas Graduate School of Biomedical Sciences at Houston INTRODUCING A NOVEL METHOD FOR GENETIC ANALYSIS OF AUTISM SPECTRUM DISORDER A THESIS Presented to the Faculty of The University of Texas Health Science Center at Houston and The University of Texas M. D. Anderson Cancer Center Graduate School of Biomedical Sciences in Partial Fulfillment of the Requirements for the Degree of MASTER OF SCIENCE by Sepideh Nouri, M.S. Houston, Texas December 2013 ACKNOWLEDGEMENTS I would like to thank my advisor, Dr. Eric Boerwinkle for giving me the opportunity to study under his supervision at the Graduate School of Biomedical Sciences at Houston. I would also like to thank my committee members, Dr. Kim- Anh Do, Dr. Alanna Morrison, Dr. James Hixson, Dr. Paul Scheet, and my previous member Dr. John Cook for their guidance throughout my master’s degree. iii INTRODUCING A NOVEL METHOD FOR GENETIC ANALYSIS OF AUTISM SPECTRUM DISORDER Sepideh Nouri, M.S. Supervisory Professor: Eric Boerwinkle, Ph.D. Autism is a spectrum of neurological disorders that is characterized by repetitive and stereotyped behaviors, lack of social skills in verbal and non-verbal communications, and intellectual disability. Recent statistics shows that 1 out of every 88 children in the US is affected by autism. In this thesis, I first review previous studies on genetic association analyses of autism spectrum disorder. A large number of these studies fall into two categories: Genome Wide Association Studies (GWAS) and sequencing studies. Although GWAS are able to identify multiple common risk variants associated with different diseases, these common variants explain only a small portion of disease occurrence. In the case of autism, GWAS has had only limited success in identification of common risk variants. Recent studies suggest that rare variants have an important role in causing this disorder. There are a number of sequence-based association studies that have analyzed the relationship of rare variants with autism. Chapter 2 of my thesis presents some of the most popular sequencing- based rare variant association analysis methods along with their advantages and disadvantages. From previously published methods, the Weighted Sum Method (WSM) and Sequence Kernel Association Test (SKAT) are used to analyze the iv autism disease sequencing data since they have some advantages over similar methods. A new method called the Single Variant Method (SVM) is introduced in section 2.4. This new method provides a better understanding of the data and identifies associated variants not found by the previous methods. In Chapter 3, the results of the analysis of the autism data using WSM, SKAT and SVM are presented and discussed. Although there is no gold standard, there is insightful information to be learned by applying multiple methods, including the novel SVM method proposed here. v TABLE OF CONTENTS ACKNOWLEDGEMENTS ............................................................................................... iii ABSTRACT ................................................................................................................... iv LIST OF FIGURES ........................................................................................................ viii LIST OF TABLES ............................................................................................................ ix ABBREVIATIONS ........................................................................................................... x CHAPTER I: INTRODUCTION AND LITERATURE REVIEW ................................................. 1 1.1 Introduction ...................................................................................................... 1 1.2 GEnetiCs of Autism- a rEviEw of thE litEraturE .................................................... 2 1.2.1 GWAS and Sequencing Studies ......................................................................... 4 1.3 Conclusion ......................................................................................................... 7 CHAPTER II: REVIEW OF RARE VARIANT ANALYSIS METHODS ....................................... 8 2.1 Introduction ...................................................................................................... 8 2.2 WEightEd-Sum MEthod (WSM) ........................................................................ 11 2.3 SEquencE KErnel Association Test (SKAT) ......................................................... 12 2.4 SinglE Variant MEthod (SVM) ........................................................................... 16 2.4.1 Choosing a Proper Constant and Initial Results .............................................. 22 2.4.2 Estimating the P-values of the Measured Scores ........................................... 26 2.4.3 The Effect of Constant on the Score of Rare Variants .................................... 29 2.4.4 Calculating the Score of Genes based on the Score of Variants ..................... 34 CHAPTER III: AUTISM SPECTRUM DISORDER DATA AND RESULTS ............................... 36 vi 3.1 Autism SpeCtrum Disorder Data ....................................................................... 36 3.3 DisCussion ........................................................................................................ 40 3.4 Conclusions ...................................................................................................... 42 APPENDIX A: MATLAB CODES ..................................................................................... 43 REFERENCES ............................................................................................................... 48 VITA ........................................................................................................................... 55 vii LIST OF FIGURES Figure 1: Beta(MAF; 0.3, 0.7) ............................................................................. 14 Figure 2: Beta(MAF; 0.3, 0.9) ............................................................................. 15 Figure 3: Beta(MAF; 0.5, 0.5) ............................................................................. 15 Figure 4: Beta(MAF; 1, 25) ................................................................................. 16 Figure 5: Weight as a function of SDcontrolsi for constant values 0.001, 0.01, 0.03, 0.1 ....................................................................................................... 22 Figure 6: Histogram of SDcontrolsi for all variants .............................................. 24 Figure 7: Categorizing variants based on their total number of minor alleles in controls ........................................................................................................ 25 Figure 8: Calculated scores for variant ID=17318, score=0.99 ........................... 27 Figure 9: Calculated scores for variant ID=224, score=0.66 ............................... 28 Figure 10: Calculated scores for a low-rank variant ............................................ 28 Figure 11: Calculated scores for multiple constants ........................................... 29 Figure 12: Calculated scores for multiple constants. Zoomed version ............... 30 viii LIST OF TABLES Table 1: Example 1, calculated scores for three variants based on TestA……....18 Table 2-a: Comparison between TestA and TestB with constant C=0.01…….….20 Table 2-b: Comparison between TestA and TestB with constant C=1……………20 Table 2-c: Comparison between TestA and TestB with constant C=100…………20 Table 3: Variants with top 10 scores for constant C=0.03………………………...26 Table 4: Calculated scores for multiple constants……………….…………………31 Table 5: Scores for variants calculated for C=0.001, 0.01, 0.02, 0.03, 0.06, 0.08…...……………………………………………………………………………33 Table 6: P-values for variants calculated for C=0.001, 0.01, 0.02, 0.03, 0.06, 0.08………………………………………………………………………...………34 Table
Recommended publications
  • Polyclonal Antibody to APIP (Full Length) - Purified
    OriGene Technologies, Inc. OriGene Technologies GmbH 9620 Medical Center Drive, Ste 200 Schillerstr. 5 Rockville, MD 20850 32052 Herford UNITED STATES GERMANY Phone: +1-888-267-4436 Phone: +49-5221-34606-0 Fax: +1-301-340-8606 Fax: +49-5221-34606-11 [email protected] [email protected] AP08426PU-N Polyclonal Antibody to APIP (full length) - Purified Alternate names: APAF1-interacting protein, CGI-29 Quantity: 50 µg Concentration: 0.5 mg/ml Background: The mammalian homologues of the key cell death gene CED-4 in C. elegans has been identified recently from human and mouse and designated Apaf1 (for apoptosis protease activating factor 1). Apaf1 binds to cytochrome c (Apaf2) and caspase 9 (Apaf3), which leads to caspase 9 activation. Activated caspase 9 in turn cleaves and activates caspase 3 that is one of the key proteases, being responsible for the proteolytic cleavage of many key proteins in apoptosis. A new Apaf1 Interacting Protein (APIP) also known as CG129 and MMRP19, has been identified as a negative regulator of ischemic injury. APIP competes with Caspase 9 binding site of Apaf1. APIP is predicted to code for a 204 amino acid. An isoform of APIP, APIP2 encodes a 242 amino acid protein, which is an alternative splicing variant differing in its N terminus from APIP. APIP transcript is ubiquitously expressed in most adult tissue with high expression in skeletal muscle, heart, and kidney. Uniprot ID: Q96GX9 NCBI: NP_057041.2 GeneID: 51074 Host / Isotype: Rabbit / IgG Immunogen: Recombinant protein corresponding to a full-length His-tagged recombinant Human APIP protein Format: State: Liquid purified Ig fraction Purification: Protein G Chromatography Buffer System: PBS containing 0.2% Gelatin as stabilizer and 0.05% Sodium Azide as preservative Applications: Immunohistochemistry on Paraffin Sections: 10 µg/ml.
    [Show full text]
  • BIRC7 Sirna (Human)
    For research purposes only, not for human use Product Data Sheet BIRC7 siRNA (Human) Catalog # Source Reactivity Applications CRJ2387 Synthetic H RNAi Description siRNA to inhibit BIRC7 expression using RNA interference Specificity BIRC7 siRNA (Human) is a target-specific 19-23 nt siRNA oligo duplexes designed to knock down gene expression. Form Lyophilized powder Gene Symbol BIRC7 Alternative Names KIAP; LIVIN; MLIAP; RNF50; Baculoviral IAP repeat-containing protein 7; Kidney inhibitor of apoptosis protein; KIAP; Livin; Melanoma inhibitor of apoptosis protein; ML-IAP; RING finger protein 50 Entrez Gene 79444 (Human) SwissProt Q96CA5 (Human) Purity > 97% Quality Control Oligonucleotide synthesis is monitored base by base through trityl analysis to ensure appropriate coupling efficiency. The oligo is subsequently purified by affinity-solid phase extraction. The annealed RNA duplex is further analyzed by mass spectrometry to verify the exact composition of the duplex. Each lot is compared to the previous lot by mass spectrometry to ensure maximum lot-to-lot consistency. Components We offers pre-designed sets of 3 different target-specific siRNA oligo duplexes of human BIRC7 gene. Each vial contains 5 nmol of lyophilized siRNA. The duplexes can be transfected individually or pooled together to achieve knockdown of the target gene, which is most commonly assessed by qPCR or western blot. Our siRNA oligos are also chemically modified (2’-OMe) at no extra charge for increased stability and Application key: E- ELISA, WB- Western blot, IH- Immunohistochemistry,
    [Show full text]
  • Genome Wide Association Study of Response to Interval and Continuous Exercise Training: the Predict‑HIIT Study Camilla J
    Williams et al. J Biomed Sci (2021) 28:37 https://doi.org/10.1186/s12929-021-00733-7 RESEARCH Open Access Genome wide association study of response to interval and continuous exercise training: the Predict-HIIT study Camilla J. Williams1†, Zhixiu Li2†, Nicholas Harvey3,4†, Rodney A. Lea4, Brendon J. Gurd5, Jacob T. Bonafglia5, Ioannis Papadimitriou6, Macsue Jacques6, Ilaria Croci1,7,20, Dorthe Stensvold7, Ulrik Wislof1,7, Jenna L. Taylor1, Trishan Gajanand1, Emily R. Cox1, Joyce S. Ramos1,8, Robert G. Fassett1, Jonathan P. Little9, Monique E. Francois9, Christopher M. Hearon Jr10, Satyam Sarma10, Sylvan L. J. E. Janssen10,11, Emeline M. Van Craenenbroeck12, Paul Beckers12, Véronique A. Cornelissen13, Erin J. Howden14, Shelley E. Keating1, Xu Yan6,15, David J. Bishop6,16, Anja Bye7,17, Larisa M. Haupt4, Lyn R. Grifths4, Kevin J. Ashton3, Matthew A. Brown18, Luciana Torquati19, Nir Eynon6 and Jef S. Coombes1* Abstract Background: Low cardiorespiratory ftness (V̇O2peak) is highly associated with chronic disease and mortality from all causes. Whilst exercise training is recommended in health guidelines to improve V̇O2peak, there is considerable inter-individual variability in the V̇O2peak response to the same dose of exercise. Understanding how genetic factors contribute to V̇O2peak training response may improve personalisation of exercise programs. The aim of this study was to identify genetic variants that are associated with the magnitude of V̇O2peak response following exercise training. Methods: Participant change in objectively measured V̇O2peak from 18 diferent interventions was obtained from a multi-centre study (Predict-HIIT). A genome-wide association study was completed (n 507), and a polygenic predictor score (PPS) was developed using alleles from single nucleotide polymorphisms= (SNPs) signifcantly associ- –5 ated (P < 1 ­10 ) with the magnitude of V̇O2peak response.
    [Show full text]
  • SPATA33 Localizes Calcineurin to the Mitochondria and Regulates Sperm Motility in Mice
    SPATA33 localizes calcineurin to the mitochondria and regulates sperm motility in mice Haruhiko Miyataa, Seiya Ouraa,b, Akane Morohoshia,c, Keisuke Shimadaa, Daisuke Mashikoa,1, Yuki Oyamaa,b, Yuki Kanedaa,b, Takafumi Matsumuraa,2, Ferheen Abbasia,3, and Masahito Ikawaa,b,c,d,4 aResearch Institute for Microbial Diseases, Osaka University, Osaka 5650871, Japan; bGraduate School of Pharmaceutical Sciences, Osaka University, Osaka 5650871, Japan; cGraduate School of Medicine, Osaka University, Osaka 5650871, Japan; and dThe Institute of Medical Science, The University of Tokyo, Tokyo 1088639, Japan Edited by Mariana F. Wolfner, Cornell University, Ithaca, NY, and approved July 27, 2021 (received for review April 8, 2021) Calcineurin is a calcium-dependent phosphatase that plays roles in calcineurin can be a target for reversible and rapidly acting male a variety of biological processes including immune responses. In sper- contraceptives (5). However, it is challenging to develop molecules matozoa, there is a testis-enriched calcineurin composed of PPP3CC and that specifically inhibit sperm calcineurin and not somatic calci- PPP3R2 (sperm calcineurin) that is essential for sperm motility and male neurin because of sequence similarities (82% amino acid identity fertility. Because sperm calcineurin has been proposed as a target for between human PPP3CA and PPP3CC and 85% amino acid reversible male contraceptives, identifying proteins that interact with identity between human PPP3R1 and PPP3R2). Therefore, identi- sperm calcineurin widens the choice for developing specific inhibitors. fying proteins that interact with sperm calcineurin widens the choice Here, by screening the calcineurin-interacting PxIxIT consensus motif of inhibitors that target the sperm calcineurin pathway. in silico and analyzing the function of candidate proteins through the The PxIxIT motif is a conserved sequence found in generation of gene-modified mice, we discovered that SPATA33 inter- calcineurin-binding proteins (8, 9).
    [Show full text]
  • NEK2 Antibody (Aa287-299) Rabbit Polyclonal Antibody Catalog # ALS11259
    10320 Camino Santa Fe, Suite G San Diego, CA 92121 Tel: 858.875.1900 Fax: 858.622.0609 NEK2 Antibody (aa287-299) Rabbit Polyclonal Antibody Catalog # ALS11259 Specification NEK2 Antibody (aa287-299) - Product Information Application IHC Primary Accession P51955 Reactivity Human Host Rabbit Clonality Polyclonal Calculated MW 52kDa KDa NEK2 Antibody (aa287-299) - Additional Information Gene ID 4751 Anti-NEK2 antibody IHC of human testis. Other Names Serine/threonine-protein kinase Nek2, 2.7.11.1, HSPK 21, Never in mitosis NEK2 Antibody (aa287-299) - Background A-related kinase 2, NimA-related protein kinase 2, NimA-like protein kinase 1, NEK2, Protein kinase which is involved in the control NEK2A, NLK1 of centrosome separation and bipolar spindle formation in mitotic cells and chromatin Target/Specificity condensation in meiotic cells. Regulates aa 287-299 of Human NEK2 protein. centrosome separation (essential for the formation of bipolar spindles and high-fidelity Reconstitution & Storage chromosome separation) by phosphorylating Store vial at -20 C prior to opening. Dilute centrosomal proteins such as CROCC, CEP250 only prior to immediate use. For extended and NINL, resulting in their displacement from storage aliquot contents and freeze at -20 C or below. Avoid cycles of freezing and the centrosomes. Regulates kinetochore thawing. microtubule attachment stability in mitosis via phosphorylation of NDC80. Involved in Precautions regulation of mitotic checkpoint protein NEK2 Antibody (aa287-299) is for research complex via phosphorylation of CDC20 and use only and not for use in diagnostic or MAD2L1. Plays an active role in chromatin therapeutic procedures. condensation during the first meiotic division through phosphorylation of HMGA2.
    [Show full text]
  • Seq2pathway Vignette
    seq2pathway Vignette Bin Wang, Xinan Holly Yang, Arjun Kinstlick May 19, 2021 Contents 1 Abstract 1 2 Package Installation 2 3 runseq2pathway 2 4 Two main functions 3 4.1 seq2gene . .3 4.1.1 seq2gene flowchart . .3 4.1.2 runseq2gene inputs/parameters . .5 4.1.3 runseq2gene outputs . .8 4.2 gene2pathway . 10 4.2.1 gene2pathway flowchart . 11 4.2.2 gene2pathway test inputs/parameters . 11 4.2.3 gene2pathway test outputs . 12 5 Examples 13 5.1 ChIP-seq data analysis . 13 5.1.1 Map ChIP-seq enriched peaks to genes using runseq2gene .................... 13 5.1.2 Discover enriched GO terms using gene2pathway_test with gene scores . 15 5.1.3 Discover enriched GO terms using Fisher's Exact test without gene scores . 17 5.1.4 Add description for genes . 20 5.2 RNA-seq data analysis . 20 6 R environment session 23 1 Abstract Seq2pathway is a novel computational tool to analyze functional gene-sets (including signaling pathways) using variable next-generation sequencing data[1]. Integral to this tool are the \seq2gene" and \gene2pathway" components in series that infer a quantitative pathway-level profile for each sample. The seq2gene function assigns phenotype-associated significance of genomic regions to gene-level scores, where the significance could be p-values of SNPs or point mutations, protein-binding affinity, or transcriptional expression level. The seq2gene function has the feasibility to assign non-exon regions to a range of neighboring genes besides the nearest one, thus facilitating the study of functional non-coding elements[2]. Then the gene2pathway summarizes gene-level measurements to pathway-level scores, comparing the quantity of significance for gene members within a pathway with those outside a pathway.
    [Show full text]
  • Genomic Analyses Reveal Foxg As an Upstream Regulator of Wnt1 Required For
    bioRxiv preprint doi: https://doi.org/10.1101/2020.12.08.416008; this version posted December 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Genomic analyses reveal FoxG as an upstream regulator of wnt1 required for posterior identity specification in planarians E. Pascual-Carreras1, M. Marín-Barba3, S. Castillo-Lara1, P. Coronel-Córdoba1, M.S. Magri2, G.N. Wheeler3 J.F. Abril1, J.L. Gomez-Skarmeta2, E. Saló1* & T. Adell1* 1 Department of Genetics, Microbiology and Statistics, Universitat de Barcelona (UB) & Institute of Biomedicine of Universitat de Barcelona (IBUB), Barcelona, Spain. 2 Centro Andaluz de Biología del Desarollo (CABD), Universidad Pablo de Olavide, Sevilla, Spain. 3 School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, UK. *Corresponding authors: Emili Saló ([email protected]) and Teresa Adell ([email protected]) 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.12.08.416008; this version posted December 9, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Abstract Embryonic specification of the first body axis requires the formation of an Organizer, a group of cells with the ability to instruct fates in the surrounding tissue. The existence of organizing regions in adults, i.e.
    [Show full text]
  • A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
    Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated.
    [Show full text]
  • Interconversion Between Active and Inactive TATA-Binding Protein
    1446–1459 Nucleic Acids Research, 2012, Vol. 40, No. 4 Published online 19 October 2011 doi:10.1093/nar/gkr802 Interconversion between active and inactive TATA-binding protein transcription complexes in the mouse genome Mohamed-Amin Choukrallah, Dominique Kobi1, Igor Martianov1, W. W. M. Pim Pijnappel2,4, Nikolai Mischerikow2,3, Tao Ye1, Albert J. R. Heck3,4,5, H. Th. Marc Timmers2,4 and Irwin Davidson1,* 1Institut de Ge´ ne´ tique et de Biologie Mole´ culaire et Cellulaire, CNRS/INSERM/ULP, 1 Rue Laurent Fries, 67404 Illkirch Ce´ dex, France, 2Molecular Cancer Research, University Medical Center Utrecht, Universiteitsweg 100, 3Biomolecular Mass Spectrometry and Proteomics Group, Bijvoet Centre for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, 4Netherlands Proteomics Centre and 5Centre for Biomedical Genetics, Padualaan 8, 3584 CH Utrecht, The Netherlands Received June 10, 2011; Revised September 12, 2011; Accepted September 13, 2011 ABSTRACT preinitiation complex comprising B-TFIID that primes the promoter for productive preinitiation The TATA binding protein (TBP) plays a pivotal role complex formation in mammalian cells. in RNA polymerase II (Pol II) transcription through incorporation into the TFIID and B-TFIID complexes. The role of mammalian B-TFIID composed of TBP INTRODUCTION and B-TAF1 is poorly understood. Using a com- Accurate initiation of transcription by RNA polymerase plementation system in genetically modified mouse II (Pol II) requires the assembly of the multiprotein cells where endogenous TBP can be condi- preinitiation complex (PIC) on the core promoter around tionally inactivated and replaced by exogenous the mRNA start site (1–3). Amongst the basal transcrip- mutant TBP coupled to tandem affinity purification tion factors in this process is the TFIID complex com- and mass spectrometry, we identify two TBP prising the TATA binding protein (TBP) and a set of 13–14 TBP-associated factors (TAFs) (4–7).
    [Show full text]
  • Epigenetic Mechanisms of Lncrnas Binding to Protein in Carcinogenesis
    cancers Review Epigenetic Mechanisms of LncRNAs Binding to Protein in Carcinogenesis Tae-Jin Shin, Kang-Hoon Lee and Je-Yoel Cho * Department of Biochemistry, BK21 Plus and Research Institute for Veterinary Science, School of Veterinary Medicine, Seoul National University, Seoul 08826, Korea; [email protected] (T.-J.S.); [email protected] (K.-H.L.) * Correspondence: [email protected]; Tel.: +82-02-800-1268 Received: 21 September 2020; Accepted: 9 October 2020; Published: 11 October 2020 Simple Summary: The functional analysis of lncRNA, which has recently been investigated in various fields of biological research, is critical to understanding the delicate control of cells and the occurrence of diseases. The interaction between proteins and lncRNA, which has been found to be a major mechanism, has been reported to play an important role in cancer development and progress. This review thus organized the lncRNAs and related proteins involved in the cancer process, from carcinogenesis to metastasis and resistance to chemotherapy, to better understand cancer and to further develop new treatments for it. This will provide a new perspective on clinical cancer diagnosis, prognosis, and treatment. Abstract: Epigenetic dysregulation is an important feature for cancer initiation and progression. Long non-coding RNAs (lncRNAs) are transcripts that stably present as RNA forms with no translated protein and have lengths larger than 200 nucleotides. LncRNA can epigenetically regulate either oncogenes or tumor suppressor genes. Nowadays, the combined research of lncRNA plus protein analysis is gaining more attention. LncRNA controls gene expression directly by binding to transcription factors of target genes and indirectly by complexing with other proteins to bind to target proteins and cause protein degradation, reduced protein stability, or interference with the binding of other proteins.
    [Show full text]
  • Bioinformatics Analyses of Genomic Imprinting
    Bioinformatics Analyses of Genomic Imprinting Dissertation zur Erlangung des Grades des Doktors der Naturwissenschaften der Naturwissenschaftlich-Technischen Fakultät III Chemie, Pharmazie, Bio- und Werkstoffwissenschaften der Universität des Saarlandes von Barbara Hutter Saarbrücken 2009 Tag des Kolloquiums: 08.12.2009 Dekan: Prof. Dr.-Ing. Stefan Diebels Berichterstatter: Prof. Dr. Volkhard Helms Priv.-Doz. Dr. Martina Paulsen Vorsitz: Prof. Dr. Jörn Walter Akad. Mitarbeiter: Dr. Tihamér Geyer Table of contents Summary________________________________________________________________ I Zusammenfassung ________________________________________________________ I Acknowledgements _______________________________________________________II Abbreviations ___________________________________________________________ III Chapter 1 – Introduction __________________________________________________ 1 1.1 Important terms and concepts related to genomic imprinting __________________________ 2 1.2 CpG islands as regulatory elements ______________________________________________ 3 1.3 Differentially methylated regions and imprinting clusters_____________________________ 6 1.4 Reading the imprint __________________________________________________________ 8 1.5 Chromatin marks at imprinted regions___________________________________________ 10 1.6 Roles of repetitive elements ___________________________________________________ 12 1.7 Functional implications of imprinted genes _______________________________________ 14 1.8 Evolution and parental conflict ________________________________________________
    [Show full text]
  • Methods in and Applications of the Sequencing of Short Non-Coding Rnas" (2013)
    University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations 2013 Methods in and Applications of the Sequencing of Short Non- Coding RNAs Paul Ryvkin University of Pennsylvania, [email protected] Follow this and additional works at: https://repository.upenn.edu/edissertations Part of the Bioinformatics Commons, Genetics Commons, and the Molecular Biology Commons Recommended Citation Ryvkin, Paul, "Methods in and Applications of the Sequencing of Short Non-Coding RNAs" (2013). Publicly Accessible Penn Dissertations. 922. https://repository.upenn.edu/edissertations/922 This paper is posted at ScholarlyCommons. https://repository.upenn.edu/edissertations/922 For more information, please contact [email protected]. Methods in and Applications of the Sequencing of Short Non-Coding RNAs Abstract Short non-coding RNAs are important for all domains of life. With the advent of modern molecular biology their applicability to medicine has become apparent in settings ranging from diagonistic biomarkers to therapeutics and fields angingr from oncology to neurology. In addition, a critical, recent technological development is high-throughput sequencing of nucleic acids. The convergence of modern biotechnology with developments in RNA biology presents opportunities in both basic research and medical settings. Here I present two novel methods for leveraging high-throughput sequencing in the study of short non- coding RNAs, as well as a study in which they are applied to Alzheimer's Disease (AD). The computational methods presented here include High-throughput Annotation of Modified Ribonucleotides (HAMR), which enables researchers to detect post-transcriptional covalent modifications ot RNAs in a high-throughput manner. In addition, I describe Classification of RNAs by Analysis of Length (CoRAL), a computational method that allows researchers to characterize the pathways responsible for short non-coding RNA biogenesis.
    [Show full text]