Computational Methods to Advance from Genetic Association to Biological Insight

Total Page:16

File Type:pdf, Size:1020Kb

Computational Methods to Advance from Genetic Association to Biological Insight Computational Methods to Advance From Genetic Association to Biological Insight The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters Citation Fine, Rebecca S. 2020. Computational Methods to Advance From Genetic Association to Biological Insight. Doctoral dissertation, Harvard University, Graduate School of Arts & Sciences. Citable link https://nrs.harvard.edu/URN-3:HUL.INSTREPOS:37365548 Terms of Use This article was downloaded from Harvard University’s DASH repository, and is made available under the terms and conditions applicable to Other Posted Material, as set forth at http:// nrs.harvard.edu/urn-3:HUL.InstRepos:dash.current.terms-of- use#LAA ! ! ! ! ! ! ! ! ! "#$%&'(')#*(+!$,'-#./!'#!(.0(*1,!23#$!4,*,')1!(//#1)(')#*!'#!5)#+#4)1(+!)*/)4-'! ! 6!.)//,3'(')#*!%3,/,*',.! 57! 8,5,11(!9:!;)*,! '#! <-,!=)0)/)#*!#2!>,.)1(+!91),*1,/! )*!%(3')(+!2&+2)++$,*'!#2!'-,!3,?&)3,$,*'/! 2#3!'-,!.,43,,!#2!! =#1'#3!#2!@-)+#/#%-7! )*!'-,!/&5A,1'!#2! B,*,')1/!(*.!B,*#$)1/! ! ! ! ! C(30(3.!D*)0,3/)'7! "($53).4,E!>(//(1-&/,''/!! ;,53&(37!FGFG! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! H!FGFG!8,5,11(!9:!;)*,! 6++!3)4-'/!3,/,30,.:! ! ! ! =)//,3'(')#*!6.0)/#3I!J#,+!K:!C)3/1--#3*! ! ! ! ! !!!!!8,5,11(!9:!;)*,! ! "#$%&'(')#*(+!$,'-#./!'#!(.0(*1,!23#$!4,*,')1!(//#1)(')#*!'#!5)#+#4)1(+!)*/)4-'! !"#$%&'$( <-,!)*13,(/)*4!(0()+(5)+)'7!#2!+(34,L/1(+,!4,*,')1!.('(!-(/!,*(5+,.!$(//)0,!,22#3'/!'#! &*.,3/'(*.!'-,!4,*,')1!5(/)/!#2!-&$(*!'3()'/!(*.!.)/,(/,/:!C#M,0,3E!3#&4-+7!NGO!#2!4,*#$,L M).,!(//#1)(')#*!/'&.7!PBQ69R!/)4*(+/!1(*!5,!(''3)5&',.!'#!*#*1#.)*4!0(3)(')#*:!9&1-!0(3)(*'/! (3,!1-(++,*4)*4!'#!)*',3%3,'!(*.!1#**,1'!'#!5)#+#4)1(+!)*/)4-'!5,1(&/,!PSR!'-,3,!(3,!#2',*!$&+')%+,! %#//)5+,!1(&/(+!4,*,/!('!,(1-!+#1&/!(*.!PFR!+)*T)*4!*#*1#.)*4!0(3)(*'/!'#!'-,!4,*,/!'-,7!3,4&+(',! )/!.)22)1&+'E!M-)1-!#5/1&3,/!'-,!3,+,0(*'!5)#+#4)1(+!%3#1,//,/:!U*,!&/,2&+!(%%3#(1-!)*!(..3,//)*4! '-)/!%3#5+,$!)/!'-,!(%%+)1(')#*!#2!(+4#3)'-$/!'-('!/,(31-!2#3!1#$$#*(+)'),/!(13#//!+#1)E!M-)1-!1(*! )$%+)1(',!5)#+#4)1(+!%3#1,//,/!'-3#&4-!4,*,!/,'!,*3)1-$,*'!(*(+7/)/!(*.!%3)#3)')V,!+)T,+7!1(&/(+! 4,*,/:!<#!2(1)+)'(',!%3#43,//!23#$!4,*,')1!(//#1)(')#*!'#!5)#+#4)1(+!-7%#'-,/,/E!W!-(0,!5&)+'!&%#*! '-)/!/'3(',47!'#!.,0,+#%!'M#!.)22,3,*'!$,'-#./:! ;)3/'E!W!.,0,+#%,.!(!4,*,!/,'!,*3)1-$,*'!(*(+7/)/!$,'-#.!2#3!'-,!XY#$,"-)%E!(! 4,*#'7%)*4!(33(7!M-)1-!2#1&/,/!#*!3(3,!(*.!+#ML23,?&,*17!1#.)*4!0(3)(')#*:!<#!.#!'-)/E!W! (.(%',.!=X@W"<E!(!4,*,!/,'!,*3)1-$,*'!(*(+7/)/!$,'-#.!.,0,+#%,.!57!#&3!+(5!2#3!BQ69!.('(:! =X@W"<!)/!%(3')1&+(3+7!%#M,32&+!(/!(!'##+!2#3!5)#+#4)1(+!)*',3%3,'(')#*!.&,!'#!)'/!&/,!#2!4,*,!/,'/! '-('!-(0,!5,,*!,Y',*.,.!0)(!1#,Y%3,//)#*!.('(!'#!$(T,!%3,.)1')#*/!(5#&'!'-,!2&*1')#*!#2! &*1-(3(1',3)V,.!4,*,/:!W!(%%+),.!'-)/!$,'-#.!'#!$(*7!.)22,3,*'!'7%,/!#2!'3()'/E!)*1+&.)*4! (*'-3#%#$,'3)1!P,:4:!-,)4-'RE!-#3$#*(+!P,:4:!(.)%#*,1')*RE!(*.!4+71,$)1!P,:4:!2(/')*4!4+&1#/,R:! 9,1#*.E!$(*7!.)22,3,*'!4,*,!%3)#3)')V(')#*!(+4#3)'-$/!2#3!BQ69!-(0,!5,,*!.,0,+#%,.! &/)*4!(!1#$$#*(+)'7L5(/,.!/'3(',47E!5&'!)'!)/!.)22)1&+'!'#!.,',3$)*,!M-)1-!(3,!'-,!$#/'!(11&3(',:! )))! ! ! ! >#/'!*,M!$,'-#./!5,*1-$(3T!&/)*4!Z4#+.!/'(*.(3./[!P4,*,/!(+3,(.7!T*#M*!'#!%+(7!(!3#+,!)*!'-,! '3()'!#2!)*',3,/'R:!C#M,0,3E!/&1-!4#+.!/'(*.(3./!(3,!5)(/,.!'#M(3.!M,++L/'&.),.!4,*,/!(*.! %('-M(7/:!<-,3,2#3,E!W!.,0,+#%,.!\,*1-$(3T,3E!(!+,(0,L#*,L1-3#$#/#$,L#&'!$,'-#.!2#3! 5,*1-$(3T)*4!%3)#3)')V(')#*!$,'-#./!'-('!3,+),/!#*+7!#*!'-,!#3)4)*(+!BQ69!.('(!(*.!,0(+&(',/! $,'-#.!%,32#3$(*1,!&/)*4!/'3(')2),.!]=!/1#3,!3,43,//)#*:! <-,!$,'-#./!.,/13)5,.!)*!'-)/!.)//,3'(')#*!1#*'3)5&',!'#!(*!)$%#3'(*'!(*.!43#M)*4!5#.7! #2!M#3T!#*!$#3,!,22,1')0,!(*.!3)4#3#&/!(*(+7/)/!#2!BQ69!.('(!'#!#5'()*!5)#+#4)1(+!)*/)4-':! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! )0! ! ! ! )&"*+(,-(.,/$+/$#( ! !"#$%&'$(00000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000(111! )&"*+(,-(.,/$+/$#(00000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000(2! !'3/,4*+56+7+/$#(0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000(211! 8+51'&$1,/(00000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000(9! !$$%1":$1,/#(00000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000(91! !"#$%&'()(************************************************************************************************************************************(+,! !"#$%&'(-(***********************************************************************************************************************************(+,,! !"#$%&'(.(***********************************************************************************************************************************(+,,! .;&<$+%(=>(?/$%,5:'$1,/(0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000(=! /#'&(#01(23456'&78&09:(931,0;(<#',#%,30(***********************************************************************************(-! =&0&(>&%(&0',9"?&0%(#0#2:>,>(******************************************************************************************************(@! A#'%,%,30,0;("&',%#B,2,%:(***************************************************************************************************************(C! =&0&($',3',%,D#%,30(********************************************************************************************************************(EE! F&09"?#'G,0;(>%'#%&;,&>(***********************************************************************************************************(E@! H,>>8&(#01(9&22(%:$&(&0',9"?&0%(************************************************************************************************(EI! J8??#':(************************************************************************************************************************************()E! .;&<$+%(@>(A.B8AC?.)>(&(6+/+(#+$(+/%1';7+/$(&/&*D#1#(7+$;,5(-,%(A9,7+.;1<(5&$&(000000(@@! KB>%'#9%(*************************************************************************************************************************************().! L0%'3189%,30(*******************************************************************************************************************************()@! M&%"31>(*************************************************************************************************************************************()C! /&>82%>(***************************************************************************************************************************************(-N! O,>98>>,30(*********************************************************************************************************************************(N@! .;&<$+%(E>(F+/';7&%3+%>(&/(:/"1&#+5G(&##,'1&$1,/B5&$&B5%12+/(#$%&$+6D($,(+2&*:&$+(6+/+( <%1,%1$1H&$1,/(&*6,%1$;7#(0000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000(IJ! KB>%'#9%(*************************************************************************************************************************************(IP! L0%'3189%,30(*******************************************************************************************************************************(IE! M#%&',#2>(#01(M&%"31>(**************************************************************************************************************(I.! 0!! ! ! /&>82%>(***************************************************************************************************************************************(Q@! O,>98>>,30(*******************************************************************************************************************************(EPP! .;&<$+%(K>(!<<*1'&$1,/(,-(#1/6*+B'+**(LM!(#+N:+/'1/6(5&$&($,(1/$+%<%+$(6+/,7+B415+( &##,'1&$1,/(#$:51+#(,-(",5DB7&##(1/5+9(000000000000000000000000000000000000000000000000000000000000000000000000000000000000(=OI! KB>%'#9%(***********************************************************************************************************************************(EPI! L0%'3189%,30(*****************************************************************************************************************************(EPQ! M&%"31>(***********************************************************************************************************************************(EE-! /&>82%>(*************************************************************************************************************************************(E)P! O,>98>>,30(*******************************************************************************************************************************(E-.! .;&<$+%(P>(81#':##1,/(000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000(=EJ! M#R3'(6,01,0;>(#01(,?$2,9#%,30>(*********************************************************************************************(E.P! S8%8'&(1,'&9%,30>(*********************************************************************************************************************(E.N! !30928>,30>(*****************************************************************************************************************************(E@.! !<<+/519(!>(Q:<<*+7+/$&%D(7&$+%1&*#(-,%(.;&<$+%(@(0000000000000000000000000000000000000000000000000000000000(=PP! J8$$2&?&0%#':(6,;8'&>(63'(!"#$%&'()(**************************************************************************************(E@N!
Recommended publications
  • WO 2019/079361 Al 25 April 2019 (25.04.2019) W 1P O PCT
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization I International Bureau (10) International Publication Number (43) International Publication Date WO 2019/079361 Al 25 April 2019 (25.04.2019) W 1P O PCT (51) International Patent Classification: CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, C12Q 1/68 (2018.01) A61P 31/18 (2006.01) DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, C12Q 1/70 (2006.01) HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, (21) International Application Number: MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, PCT/US2018/056167 OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, (22) International Filing Date: SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, 16 October 2018 (16. 10.2018) TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. (25) Filing Language: English (84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, (26) Publication Language: English GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, (30) Priority Data: UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, 62/573,025 16 October 2017 (16. 10.2017) US TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, ΓΕ , IS, IT, LT, LU, LV, (71) Applicant: MASSACHUSETTS INSTITUTE OF MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TECHNOLOGY [US/US]; 77 Massachusetts Avenue, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, Cambridge, Massachusetts 02139 (US).
    [Show full text]
  • Integrated Genomic Analysis Identifies ANKRD36 Gene As a Novel and Common Biomarker of Disease Progression in Chronic Myeloid Leukemia
    Preprints (www.preprints.org) | NOT PEER-REVIEWED | Posted: 25 August 2021 doi:10.20944/preprints202108.0498.v1 Article Integrated Genomic Analysis Identifies ANKRD36 Gene as a Novel and Common Biomarker of Disease Progression in Chronic Myeloid Leukemia Zafar Iqbal 1,2,3, Muhammad Absar2, Tanveer Akhtar2, Aamer Aleem4, Abid Jameel5, Sulman Basit 6, Anhar Ullah7, Sibtain Afzal8, Khushnooda Ramzan9, Mahmood Rasool10, Sajjad Karim10, Zeenat Mirza10, Mudassar Iqbal11, Mar- yam AlMajed1, Buthinah AlShehab1, Sarah AlMukhaylid1, Nouf AlMutairi1, Nawaf Al-anazi1, 12, Muhammad Farooq Sabar13, Muhammad Arshad14, Muhammad Asif15, Masood Shammas16 and Amer Mehmood15 1: Clinical Laboratory Sciences, College of Applied Medical Sciences (CoAMS-A), King Saud Bin Abdulaziz University for Health Sciences, King Abdulaziz Medical City/Kind Abdullah International Medical Research Centre (KAIMRC)/Saudi Society for Blood and Marrow Transplantation (SSBMT), King Abdulaziz Medical City, National Guard Health Affairs, Al-Ahsa, Saudi Arabia. 2:Hematology, Oncology and Pharmaco-genetic Engineering Sciences (HOPES) Group, Health Sciences Labora- tories (HaSiL), Department of Zoology, University of the Punjab (ZPU), Lahore 54590, Pakistan. 3: Next-Generation Medical Biotechnology & Genomic Medicine Division, Department of Biotechnology, Qarshi University, Lahore, Pakistan. 4Division of Hematology/Oncology, Department of Medicine, KKUH and College of Medicine, King Saud University, Riyadh, Saudi Arabia. 5 Khyber Teaching Hospital and Hayatabad Medical Complex, Peshawar, Pakistan. 6Taibah University Madinah-Center for Genetics and, Inherited Diseases Center for Genetics and Inherited Diseases,, Taibah University Madinah, 30001, Saudi Arabia, Almadinah Almunawarah 30001, Saudi Arabia. 7 National Heart and Lung Institute, Imperial College London, UK. 8Biomedical Research Laboratory, College of Medicine, Alfaisal University, Riyadh, Saudi Arabia. 9Research Centre, King Faisal Specialist Hospital and Research Centre, Riyadh, Saudi Arabia.
    [Show full text]
  • An Investigation of the Contribution of Centrosomal Genes to Schizophrenia and Cognitive Function
    Provided by the author(s) and NUI Galway in accordance with publisher policies. Please cite the published version when available. Title An investigation of the contribution of centrosomal genes to schizophrenia and cognitive function Author(s) Flynn, Mairéad Publication Date 2020-03-11 Publisher NUI Galway Item record http://hdl.handle.net/10379/15846 Downloaded 2021-09-25T16:18:14Z Some rights reserved. For more information, please see the item record link above. An investigation of the contribution of centrosomal genes to schizophrenia and cognitive function By Mairéad Flynn, B.Sc. A thesis submitted for the Degree of Doctor of Philosophy to the Discipline of Biochemistry, School of Natural Sciences, National University of Ireland, Galway. Supervisor: Dr Derek Morris Discipline of Biochemistry, School of Natural Sciences, National University of Ireland, Galway Co-Supervisor: Prof Ciaran Morrison Discipline of Biochemistry, School of Natural Sciences, National University of Ireland, Galway August 2019 Table of Contents Table of Contents Table of Contents ............................................................................................................. I Declaration .................................................................................................................... VI Statement of Contribution ......................................................................................... VII Acknowledgements .................................................................................................... VIII List
    [Show full text]
  • Supplementary Table S4. FGA Co-Expressed Gene List in LUAD
    Supplementary Table S4. FGA co-expressed gene list in LUAD tumors Symbol R Locus Description FGG 0.919 4q28 fibrinogen gamma chain FGL1 0.635 8p22 fibrinogen-like 1 SLC7A2 0.536 8p22 solute carrier family 7 (cationic amino acid transporter, y+ system), member 2 DUSP4 0.521 8p12-p11 dual specificity phosphatase 4 HAL 0.51 12q22-q24.1histidine ammonia-lyase PDE4D 0.499 5q12 phosphodiesterase 4D, cAMP-specific FURIN 0.497 15q26.1 furin (paired basic amino acid cleaving enzyme) CPS1 0.49 2q35 carbamoyl-phosphate synthase 1, mitochondrial TESC 0.478 12q24.22 tescalcin INHA 0.465 2q35 inhibin, alpha S100P 0.461 4p16 S100 calcium binding protein P VPS37A 0.447 8p22 vacuolar protein sorting 37 homolog A (S. cerevisiae) SLC16A14 0.447 2q36.3 solute carrier family 16, member 14 PPARGC1A 0.443 4p15.1 peroxisome proliferator-activated receptor gamma, coactivator 1 alpha SIK1 0.435 21q22.3 salt-inducible kinase 1 IRS2 0.434 13q34 insulin receptor substrate 2 RND1 0.433 12q12 Rho family GTPase 1 HGD 0.433 3q13.33 homogentisate 1,2-dioxygenase PTP4A1 0.432 6q12 protein tyrosine phosphatase type IVA, member 1 C8orf4 0.428 8p11.2 chromosome 8 open reading frame 4 DDC 0.427 7p12.2 dopa decarboxylase (aromatic L-amino acid decarboxylase) TACC2 0.427 10q26 transforming, acidic coiled-coil containing protein 2 MUC13 0.422 3q21.2 mucin 13, cell surface associated C5 0.412 9q33-q34 complement component 5 NR4A2 0.412 2q22-q23 nuclear receptor subfamily 4, group A, member 2 EYS 0.411 6q12 eyes shut homolog (Drosophila) GPX2 0.406 14q24.1 glutathione peroxidase
    [Show full text]
  • Downloaded the “Top Edge” Version
    bioRxiv preprint doi: https://doi.org/10.1101/855338; this version posted December 6, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 Drosophila models of pathogenic copy-number variant genes show global and 2 non-neuronal defects during development 3 Short title: Non-neuronal defects of fly homologs of CNV genes 4 Tanzeen Yusuff1,4, Matthew Jensen1,4, Sneha Yennawar1,4, Lucilla Pizzo1, Siddharth 5 Karthikeyan1, Dagny J. Gould1, Avik Sarker1, Yurika Matsui1,2, Janani Iyer1, Zhi-Chun Lai1,2, 6 and Santhosh Girirajan1,3* 7 8 1. Department of Biochemistry and Molecular Biology, Pennsylvania State University, 9 University Park, PA 16802 10 2. Department of Biology, Pennsylvania State University, University Park, PA 16802 11 3. Department of Anthropology, Pennsylvania State University, University Park, PA 16802 12 4 contributed equally to work 13 14 *Correspondence: 15 Santhosh Girirajan, MBBS, PhD 16 205A Life Sciences Building 17 Pennsylvania State University 18 University Park, PA 16802 19 E-mail: [email protected] 20 Phone: 814-865-0674 21 1 bioRxiv preprint doi: https://doi.org/10.1101/855338; this version posted December 6, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 22 ABSTRACT 23 While rare pathogenic copy-number variants (CNVs) are associated with both neuronal and non- 24 neuronal phenotypes, functional studies evaluating these regions have focused on the molecular 25 basis of neuronal defects.
    [Show full text]
  • Metastatic Adrenocortical Carcinoma Displays Higher Mutation Rate and Tumor Heterogeneity Than Primary Tumors
    ARTICLE DOI: 10.1038/s41467-018-06366-z OPEN Metastatic adrenocortical carcinoma displays higher mutation rate and tumor heterogeneity than primary tumors Sudheer Kumar Gara1, Justin Lack2, Lisa Zhang1, Emerson Harris1, Margaret Cam2 & Electron Kebebew1,3 Adrenocortical cancer (ACC) is a rare cancer with poor prognosis and high mortality due to metastatic disease. All reported genetic alterations have been in primary ACC, and it is 1234567890():,; unknown if there is molecular heterogeneity in ACC. Here, we report the genetic changes associated with metastatic ACC compared to primary ACCs and tumor heterogeneity. We performed whole-exome sequencing of 33 metastatic tumors. The overall mutation rate (per megabase) in metastatic tumors was 2.8-fold higher than primary ACC tumor samples. We found tumor heterogeneity among different metastatic sites in ACC and discovered recurrent mutations in several novel genes. We observed 37–57% overlap in genes that are mutated among different metastatic sites within the same patient. We also identified new therapeutic targets in recurrent and metastatic ACC not previously described in primary ACCs. 1 Endocrine Oncology Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA. 2 Center for Cancer Research, Collaborative Bioinformatics Resource, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892, USA. 3 Department of Surgery and Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA. Correspondence and requests for materials should be addressed to E.K. (email: [email protected]) NATURE COMMUNICATIONS | (2018) 9:4172 | DOI: 10.1038/s41467-018-06366-z | www.nature.com/naturecommunications 1 ARTICLE NATURE COMMUNICATIONS | DOI: 10.1038/s41467-018-06366-z drenocortical carcinoma (ACC) is a rare malignancy with types including primary ACC from the TCGA to understand our A0.7–2 cases per million per year1,2.
    [Show full text]
  • Cell Type-Specific Analysis of Human Interactome and Transcriptome Shahin Mohammadi Purdue University
    Purdue University Purdue e-Pubs Open Access Dissertations Theses and Dissertations January 2016 Cell Type-specific Analysis of Human Interactome and Transcriptome Shahin Mohammadi Purdue University Follow this and additional works at: https://docs.lib.purdue.edu/open_access_dissertations Recommended Citation Mohammadi, Shahin, "Cell Type-specific Analysis of Human Interactome and Transcriptome" (2016). Open Access Dissertations. 1371. https://docs.lib.purdue.edu/open_access_dissertations/1371 This document has been made available through Purdue e-Pubs, a service of the Purdue University Libraries. Please contact [email protected] for additional information. Graduate School Form 30 Updated 12/26/2015 PURDUE UNIVERSITY GRADUATE SCHOOL Thesis/Dissertation Acceptance This is to certify that the thesis/dissertation prepared By Shahin Mohammadi Entitled CELL TYPE-SPECIFIC ANALYSIS OF HUMAN INTERACTOME AND TRANSCRIPTOME For the degree of Doctor of Philosophy Is approved by the final examining committee: Ananth Grama Wojciech Szpankowski Chair David Gleich Jennifer Neville Markus Lill To the best of my knowledge and as understood by the student in the Thesis/Dissertation Agreement, Publication Delay, and Certification Disclaimer (Graduate School Form 32), this thesis/dissertation adheres to the provisions of Purdue University’s “Policy of Integrity in Research” and the use of copyright material. Approved by Major Professor(s): Ananth Grama William Gorman, Assistant to the Department Head 11/1/2016 Approved by: Head of the Departmental Graduate Program Date CELL TYPE-SPECIFIC ANALYSIS OF HUMAN INTERACTOME AND TRANSCRIPTOME A Dissertation Submitted to the Faculty of Purdue University by Shahin Mohammadi In Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy December 2016 Purdue University West Lafayette, Indiana ii I dedicate this thesis to my mom, whose role in my life I can not even begin to describe.
    [Show full text]
  • Newfound Coding Potential of Transcripts Unveils Missing Members Of
    bioRxiv preprint doi: https://doi.org/10.1101/2020.12.02.406710; this version posted December 3, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1 Newfound coding potential of transcripts unveils missing members of 2 human protein communities 3 4 Sebastien Leblanc1,2, Marie A Brunet1,2, Jean-François Jacques1,2, Amina M Lekehal1,2, Andréa 5 Duclos1, Alexia Tremblay1, Alexis Bruggeman-Gascon1, Sondos Samandi1,2, Mylène Brunelle1,2, 6 Alan A Cohen3, Michelle S Scott1, Xavier Roucou1,2,* 7 1Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, 8 Quebec, Canada. 9 2 PROTEO, Quebec Network for Research on Protein Function, Structure, and Engineering. 10 3Department of Family Medicine, Université de Sherbrooke, Sherbrooke, Quebec, Canada. 11 12 *Corresponding author: Tel. (819) 821-8000x72240; E-Mail: [email protected] 13 14 15 Running title: Alternative proteins in communities 16 17 Keywords: alternative proteins, protein network, protein-protein interactions, pseudogenes, 18 affinity purification-mass spectrometry 19 20 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.12.02.406710; this version posted December 3, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 21 Abstract 22 23 Recent proteogenomic approaches have led to the discovery that regions of the transcriptome 24 previously annotated as non-coding regions (i.e.
    [Show full text]
  • Open Data for Differential Network Analysis in Glioma
    International Journal of Molecular Sciences Article Open Data for Differential Network Analysis in Glioma , Claire Jean-Quartier * y , Fleur Jeanquartier y and Andreas Holzinger Holzinger Group HCI-KDD, Institute for Medical Informatics, Statistics and Documentation, Medical University Graz, Auenbruggerplatz 2/V, 8036 Graz, Austria; [email protected] (F.J.); [email protected] (A.H.) * Correspondence: [email protected] These authors contributed equally to this work. y Received: 27 October 2019; Accepted: 3 January 2020; Published: 15 January 2020 Abstract: The complexity of cancer diseases demands bioinformatic techniques and translational research based on big data and personalized medicine. Open data enables researchers to accelerate cancer studies, save resources and foster collaboration. Several tools and programming approaches are available for analyzing data, including annotation, clustering, comparison and extrapolation, merging, enrichment, functional association and statistics. We exploit openly available data via cancer gene expression analysis, we apply refinement as well as enrichment analysis via gene ontology and conclude with graph-based visualization of involved protein interaction networks as a basis for signaling. The different databases allowed for the construction of huge networks or specified ones consisting of high-confidence interactions only. Several genes associated to glioma were isolated via a network analysis from top hub nodes as well as from an outlier analysis. The latter approach highlights a mitogen-activated protein kinase next to a member of histondeacetylases and a protein phosphatase as genes uncommonly associated with glioma. Cluster analysis from top hub nodes lists several identified glioma-associated gene products to function within protein complexes, including epidermal growth factors as well as cell cycle proteins or RAS proto-oncogenes.
    [Show full text]
  • An Ancient Germ Cell-Specific RNA-Binding Protein Protects
    RESEARCH ARTICLE An ancient germ cell-specific RNA-binding protein protects the germline from cryptic splice site poisoning Ingrid Ehrmann1, James H Crichton2, Matthew R Gazzara3,4, Katherine James5, Yilei Liu1,6, Sushma Nagaraja Grellscheid1,7, TomazˇCurk8, Dirk de Rooij9,10, Jannetta S Steyn11, Simon Cockell11, Ian R Adams2, Yoseph Barash3,12*, David J Elliott1* 1Institute of Genetic Medicine, Newcastle University, Newcastle, United Kingdom; 2MRC Human Genetics Unit, MRC Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, United Kingdom; 3Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, United States; 4Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, United States; 5Life Sciences, Natural History Museum, London, United Kingdom; 6Department of Plant and Microbial Biology, University of Zu¨ rich, Zu¨ rich, Switzerland; 7School of Biological and Biomedical Sciences, University of Durham, Durham, United Kingdom; 8Laboratory of Bioinformatics, Faculty of Computer and Information Sciences, University of Ljubljana, Ljubljana, Slovenia; 9Reproductive Biology Group, Division of Developmental Biology, Department of Biology, Faculty of Science, Utrecht University, Utrecht, The Netherlands; 10Center for Reproductive Medicine, Academic Medical Center, University of Amsterdam, Amsterdam, The Netherlands; 11Bioinformatics Support Unit, Faculty of Medical Sciences, Newcastle University, Newcastle, United Kingdom; 12Department of Computer and Information Science, University of Pennsylvania, Philadelphia, United States *For correspondence: [email protected] (YB); [email protected] (DJE) Competing interests: The Abstract Male germ cells of all placental mammals express an ancient nuclear RNA binding authors declare that no protein of unknown function called RBMXL2. Here we find that deletion of the retrogene encoding competing interests exist.
    [Show full text]
  • Genomics of Inherited Bone Marrow Failure and Myelodysplasia Michael
    Genomics of inherited bone marrow failure and myelodysplasia Michael Yu Zhang A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy University of Washington 2015 Reading Committee: Mary-Claire King, Chair Akiko Shimamura Marshall Horwitz Program Authorized to Offer Degree: Molecular and Cellular Biology 1 ©Copyright 2015 Michael Yu Zhang 2 University of Washington ABSTRACT Genomics of inherited bone marrow failure and myelodysplasia Michael Yu Zhang Chair of the Supervisory Committee: Professor Mary-Claire King Department of Medicine (Medical Genetics) and Genome Sciences Bone marrow failure and myelodysplastic syndromes (BMF/MDS) are disorders of impaired blood cell production with increased leukemia risk. BMF/MDS may be acquired or inherited, a distinction critical for treatment selection. Currently, diagnosis of these inherited syndromes is based on clinical history, family history, and laboratory studies, which directs the ordering of genetic tests on a gene-by-gene basis. However, despite extensive clinical workup and serial genetic testing, many cases remain unexplained. We sought to define the genetic etiology and pathophysiology of unclassified bone marrow failure and myelodysplastic syndromes. First, to determine the extent to which patients remained undiagnosed due to atypical or cryptic presentations of known inherited BMF/MDS, we developed a massively-parallel, next- generation DNA sequencing assay to simultaneously screen for mutations in 85 BMF/MDS genes. Querying 71 pediatric and adult patients with unclassified BMF/MDS using this assay revealed 8 (11%) patients with constitutional, pathogenic mutations in GATA2 , RUNX1 , DKC1 , or LIG4 . All eight patients lacked classic features or laboratory findings for their syndromes.
    [Show full text]
  • Small Supernumerary Marker Chromosomes: a Legacy of Trisomy Rescue?
    UNIVERSITA’ DEGLI STUDI DI PAVIA Dipartimento di Medicina Molecolare Small supernumerary marker chromosomes: a legacy of trisomy rescue? Nehir Edibe Kurtas Dottorato di Ricerca in Genetica, Biologia Molecolare e Cellulare XXXI Ciclo – A.A. 2015-2018 UNIVERSITA’ DEGLI STUDI DI PAVIA Dipartimento di Medicina Molecolare Small supernumerary marker chromosomes: a legacy of trisomy rescue? Nehir Edibe Kurtas Supervised by Prof. Orsetta Zuffardi Dottorato di Ricerca in Genetica, Biologia Molecolare e Cellulare XXXI Ciclo – A.A. 2015-2018 Abstract Abstract We studied by a whole cytogenomics approach 12 de novo, non-recurrent small supernumerary marker chromosomes (sSMC), detected as mosaics during pre- or postnatal diagnosis. In 11 cases maternal age was increased. Trios genotyping, WGS, and CGH+SNP array, demonstrated that (i) four sSMCs contained pericentromeric portions only, whereas eight had additional non-contiguous portions of the same chromosome, assembled together in a disordered fashion by repair-based mechanisms in a chromothriptic event; (ii) in four cases maternal hetero/isodisomy was detected with a sharp paternal origin of the sSMC in two cases, whereas in a fifth case two maternal alleles in the sSMC region and biparental haplotypes of the homologs were detected. In five other cases the homologs were biparental while the sSMC had the same haplotype of the maternally inherited chromosome. In one case, the sSMC was of paternal origin while the homologs were biparental. These findings strongly suggest that most sSMCs are the result of a multiple-step mechanism, initiated by maternal meiotic non-disjunction followed by post-zygotic anaphase lagging of the supernumerary chromosome and its subsequent insertion within a micronucleus, whose segregation to one of the two daughter cells accounts for the mosaic condition.
    [Show full text]