Transcriptomics-Guided Design of Synthetic Promoters for a Mammalian System

Total Page:16

File Type:pdf, Size:1020Kb

Transcriptomics-Guided Design of Synthetic Promoters for a Mammalian System Transcriptomics-guided design of synthetic promoters for a mammalian system Joseph K. Cheng1 and Hal S. Alper1,2,* 1Department of Chemical Engineering, The University of Texas at Austin, 200 E Dean Keeton St. Stop C0400, Austin, Texas 78712 2Institute for Cellular and Molecular Biology, The University of Texas at Austin, 2500 Speedway Avenue, Austin, Texas 78712 *Correspondence and requests for materials should be addressed to H.S.A. ([email protected]) Supporting Information Accession codes of nucleotide sequences used in this manuscript Human cytomegalovirus: M60321.1 Simian Virus 40: J02400.1 Human chromosomes, GRCh38.p2 assembly: NC_000001.11, NC_000002.12, NC_000003.12, NC_000004.12, NC_000005.10, NC_000006.12, NC_000007.14, NC_000008.11, NC_000009.12, NC_000010.11, NC_000011.10, NC_000012.12, NC_000013.11, NC_000014.9, NC_000015.10, NC_000016.10, NC_000017.11, NC_000018.10, NC_000019.10, NC_000020.11, NC_000021.9, NC_000022.11, NC_000023.11, NC_000024.10 Equations From the derived GMM, we can determine the probabilities of several key concerns: 1) the false positive probability at a particular threshold expression value (log-transformed) of belonging to the high expression group; 2) the false negative probability at a particular threshold expression value (log- transformed) of belonging to the high expression group; and 3) the probability of observing an expression value (log-transformed), X , less than or equal to a specified expression value (e.g. median expression value) if X belongs to the high expression group. ∞, , , , ∞, , , , , , , , Supporting Tables and Graphics Nucleotides considered from ref. file Gene Chromosome (GRCh38.p2 Primary Assembly) 2000-bp region 500-bp region EEF1A1* 6 73,521,382 - 73,520,027 73,521,074 - 73,520,027 CLUAP1 16 3,498,945 - 3,500,944 3,500,445 - 3,500,944 TPT1 13 45,343,284 - 45,341,285 45,341,784 - 45,341,285 TUBA1B 12 49,133,521 - 49,131,522 49,132,021 - 49,131,522 GGA1 22 37,606,385 - 37,608,384 37,607,885 - 37,608,384 LAIR1 19 54,372,553 - 54,370,554 54,371,053 - 54,370,554 UBC* 12 124,915,549 - 124,913,772 124,915,081 - 124,913,772 VIM 10 17,225,935 - 17,227,934 17,227,435 - 17,227,934 ITIH5 10 7,668,998 - 7,666,999 7,667,498 - 7,666,999 F2R 5 76,714,043 - 76,716,042 76,715,543 - 76,716,042 TMEM158 3 45,228,322 - 45,226,323 45,226,822 - 45,226,323 MSH3 5 80,652,648 - 80,654,647 80,654,148 - 80,654,647 CCR6 6 167,109,807 - 167,111,806 167,111,307 - 167,111,806 ACTG1 17 81,514,866 - 81,512,867 81,513,366 - 81,512,867 FTL 19 48,963,309 - 48,965,308 48,964,809 - 48,965,308 TMSB10 2 84,903,639 - 84,905,638 84,905,139 - 84,905,638 GNB2L1 5 181,245,906 - 181,243,907 181,244,406 - 181,243,907 ROCK2 2 11,346,585 - 11,344,586 11,345,085 - 11,344,586 HSP90AA1 14 102,141,749 - 102,139,750 102,140,249 - 102,139,750 EIF4A1* 17 7,571,970 - 7,572,705 7,572,206 - 7,572,705 Table S1a. Details of 20 most highly expressed genes. *denotes alternative length of promoter considered for annotation. Nucleotides considered from ref. file Gene Chromosome (GRCh38.p2 Primary Assembly) 2000-bp region 500-bp region SUMO4 6 149,398,359 - 149,400,358 149,399,859 - 149,400,358 MAGED4B 23, X 52,071,272 - 52,069,273 52,069,572 - 52,069,273 ARTN 1 43,931,320 - 43,933,319 43,932,820 - 43,933,319 DUSP23 1 159,778,940 - 159,780,939 159,780,440 - 159,780,939 KLC4 6 43,057,594 - 43,059,593 43,059,094 - 43,059,593 NR1I2 3 119,778,484 - 119,780,483 119,779,984 - 119,780,483 PRIM2 6 57,314,305 - 57,314,804 57,312,805 - 57,314,804 MAP2 2 209,422,009 - 209,424,008 209,423,509 - 209,424,008 SMN1 5 70,922,941 - 70,924,940 70,924,441 - 70,924,940 COL25A1 4 109,304,643 - 109,302,644 109,303,143 - 109,302,644 NPTX1 17 80,478,604 - 80,476,605 80,477,104 - 80,476,605 NPSR1 7 34,656,285 - 34,658,284 34,657,785 - 34,658,284 LRRIQ1 12 85,034,321 - 85,036,320 85,035,821 - 85,036,320 MED31 17 6,653,799 - 6,651,800 6,652,299 - 6,651,800 CACNB4 2 152,101,079 - 152,099,080 152,099,579 - 152,099,080 HDAC9 7 18,084,942 - 18,086,941 18,086,442 - 18,086,941 SHC2 19 462,996 - 460,997 461,496 - 460,997 POMC 2 25,170,690 - 25,168,691 25,169,190 - 25,168,691 AGTR1 3 148,695,871 - 148,697,870 148,697,371 - 148,697,870 FGL1 8 17,912,365 - 17,910,366 17,910,865 - 17,910,366 Table S1b. Details of 20 genes with median expression. Nucleotides considered from ref. file Gene Chromosome (GRCh38.p2 Primary Assembly) 2000-bp region 500-bp region ALB 4 73,402,255 - 73,404,254 73,403,755 - 73,404,254 ALG8 11 78,141,660 - 78,139,661 78,140,160 - 78,139,661 ARHGEF11 1 157,048,640 - 157,046,641 157,047,140 - 157,046,641 ASPSCR1 17 81,975,550 - 81,977,549 81,977,050 - 81,977,549 CLIP1 12 122,424,632 - 122,422,633 122,423,132 - 122,422,633 CRNN 1 152,416,274 - 152,414,275 152,414,774 - 152,414,275 DRAM 12 101,875,327 - 101,877,326 101,876,827 - 101,877,326 FAM134B 5 16,619,058 - 16,617,059 16,617,558 - 16,617,059 HDHD3 9 113,379,009 - 113,377,010 113,377,509 - 113,377,010 KRTAP4-2 17 41,180,208 - 41,178,209 41,178,708 - 41,178,209 LYSMD4 15 99,735,458 - 99,733,459 99,733,958 - 99,733,459 MUC17 7 101,018,076 - 101,020,075 101,019,576 - 101,020,075 OR13F1 9 104,502,263 - 104,504,262 104,503,763 - 104,504,262 PLSCR4 3 146,253,179 - 146,251,180 146,251,679 - 146,251,180 POLR3H 22 41,546,606 - 41,544,607 41,545,106 - 41,544,607 POP2 3 191,459,163 - 191,461,162 191,460,663 - 191,461,162 SFI1 22 31,486,688 - 31,488,687 31,488,188 - 31,488,687 TAF9 5 69,372,013 - 69,370,014 69,370,513 - 69,370,014 TCP1 6 159,791,703 - 159,789,704 159,790,203 - 159,789,704 TFDP3 23, X 133,220,348 - 133,218,349 133,218,848 - 133,218,349 Table S1c. Details of 20 randomly selected genes. Nucleotides considered from ref. file promoter Chromosome 2000-bp region 500-bp region CMV IE viral 1 - 2,105 534 - 1,193 SV40 viral 5,176 - 346 5,176 - 346 HSV-TK viral 48,633 - 47,881 48,633 - 47,881 ACTB 7 5,532,044 - 5,529,628 78,103,669 - 78,104,168 PGK1 23, X 78,102,169 - 78,104,168 6,533,905 - 6,534,404 GAPDH 12 6,532,405 - 6,534,404 94,394,061 - 94,394,560 COL1A2 7 94,392,561 - 94,394,560 534 - 1,193 Table S1d. Details of some common viral-derived and endogenous promoters. Human cytomegalovirus immediate-early (CMV IE) promoter taken from M60321.1; Simian Virus 40 (SV40) promoter taken from J02400.1, reverse complement; Herpes Simplex Virus thymidine-kinase (HSV-TK) promoter taken from NC_001806.1, human herpesvirus 1, reverse complement. Endogenous sequences are taken from GRCh38.p2 primary assembly. transcription factor consensus sequence AR ARGAACANNNTGTNC BATF::JUN RNWATGASTCA BRCA1 NCAACMS CDX2 NNGYMATAAAA CEBPA ATTGCAYAAYN CEBPB KATTGCAYMAY CREB (half) CGTCA CREB1 TGACGTCA CREB1 (other) BNBGRTGACGYN CTCF YNRCCASYAGRKGGCRSYN CTCF (short) CCGCGNGGNGGCAG DUX4 TAAYYYAATCA E2F1 NGGGCGGGARV E2F4 NGGCGGGARRN E2F6 RGGCGGGARRN EBF1 NYCCCCWGGGA EGR1 NCNCCGCCCCCKCN EHF MCTTCCTS ELF1 NRANCMGGAAGTG ELK1 NNNCCGGAAR ELK4 NCRCTTCCGGN ESR1 NNNNNAGGTCACCCTGACCY ESR2 AGGTCASNNTGNCCY ESRRA YCAAGGTCACN ETS1 YWTCCK EWSR1-FLI1 GGAAGGAAGGAAGGAAGG FEV CAGGAART FLI1 RCAGGAAGTGR FOS NNTGASTCATN FOSL1 RRTGASTCAKN FOSL2 NRRTGASTCAB FOXA1 TGTTTRCWYWN FOXC1 NNNNNGTA FOXD1 GTAAACAN FOXF2 NNANSGTAAACAAN FOXH1 NNNAATCCACA FOXI1 NNNTRTTTRTTT FOXL1 WNNANATA FOXO3 KGTAAACA FOXP1 NNNANGTAAACAAAN FOXP2 NWGTAAACARN GABPA ACCGGAAGNS GATA2 NNNTTCTTATCTSN GATA3 AGAGAAGA HIF1A::ARNT VNACGTGN HINFP NAACGTCCGC HLF NRTTACRYAATN HNF1A GGTTAATNATTANC HNF1B YYAATRWTTAAC HNF4A NTGRACTTTGNNCYN HNF4G NRRGNNCAAAGKYCA HOXA5 CDBWAATK HSF1 NTTCTRGAANNTTCY INSM1 TGYCWGGGGGCR IRF1 NNNYRSTTTCACTTTCNNTTT transcription factor consensus sequence IRF2 SGAAAGYGAAASCNWWWM JUN NNRATGATGTMAT JUN (var.2) NNNNRRTGASTCAN JUN::FOS TGACTCA JUNB NRRTGASTCAK JUND NRTGASTCATN JUND (var.2) NNNRATGABGTCATN KLF5 GCCCCDCCCH MAFF NCTNASTCAGCANWWWNN MAFK MNNASTCAGCANWWW MAX RRGCACATGK MEF2A NKCTAAAAATAGMNN MEF2C NNNCYAAAAATAGMN MYC::MAX RASCACGTGGT MZF1_1-4 NGGGGA MZF1_5-13 NKAGGGGKAR NF1 (consensus) TTGGCNNNNNNCC NFATC2 TTTTCCA NFE2::MAF ATGACTCAGCANWWN NFE2L2 RTGACWMAGCA NFIC TTGGCN NFIL3 TTAYGTAAYNN NFKB1 KGGRNTTTCCM NFYA AGNSYKCTGATTGGTNNR NFYB NNNYNRRCCAATCAG NHLH1 NCGCAGCTGCGN NKX3-1 ATACTTA NR1H2::RXRA AAAGGTCAAAGGTCAAC NR2C2 NGRGGTCARAGGTCA NR2F1 TGAMCTTTGVMCHT NR4A2 AAGGTCAC NRF1 GCGCNTGCGCR PAX5 NNGNKCAGYSRAGCRTGAC Pax6 TTCACGCWTSANTK PBX1 NCATCAATCAAW PLAG1 GGGGCCCWAGGG POU2F2 NNNATTTGCATRW PPARG STRGGTCACNGTGACCYANT PPARG::RXRA NNWGRGGTCAAAGGTCANNN PRDM1 NRAAAGTGAAAGTNN REL BGGRNWTTCC RELA GGGRRTTTCC REST TTCAGCACCATGGACAGCKCC RFX2 GTYNCCATGGCAACNRNNN RFX5 YNSCMTRGCAACAGN RORA_1 AWNNAGGTCA RORA_2 NWWAWNTAGGTCAN RREB1 CCCCMAAMCAMCCMCMMMCN RUNX1 WWYTGYGGTWW RUNX2 KNKNNYTGTGGTYTK RXR::RAR_DR5 RGKTCANNNRSAGGTCA RXRA::VDR GGGTCAWNGRGTTCA SMAD2::SMAD3::SMAD4 NTGTCTGNCACCT SOX10 CWTTGT SOX9 CYATTGTTN transcription factor consensus sequence SP1 NCCCCKCCCCC SP1 (short) CCKCCY SP2 GYCCCGCCYCYNNNN SPI1 AGGAAGT SPIB WSMGGAA SREBF1 VTCACCCCAY SREBF2 RTGGGGTGAY SRF NNTKNCCAWATAWGGNAA SRY NWWAACAAT STAT1 NTTCYRGGAAA STAT2::STAT1 TNAGTTTCNNTTTCY STAT3 NTTCYKGGAAN TAL1::GATA1 NTTATCWNNNNNNNNCAG TAL1::TCF3 NNAMCATCTGKT TBP STATAWAWRNNNNNN TCF7L2 NNASWTCAAAGNNN TEAD1 YACATTCCWSNG TFAP2A NNNNGCCYSAGGGCA TFAP2C NNNNSCCYCAGGSCN THAP1 YTGCCCNNA TLX1::NFIC TGGCASSRNGCCAA TP53 RCATGYCCAGACATG TP63 NNRCAWGYNCARRCWTGYNN USF1 NNCAYGTGACC USF2 RYCAYGTGACY YY1 CAARATGGCNGC ZBTB33 NTCTCGCGAGANYTN ZEB1 NCWCACCTG ZNF263 GGAGGAGGRRGRGGRGGRRRR ZNF354C MTCCAC Table S2.
Recommended publications
  • Epstein-Barr Virus Viral Protein Lmp1 Increases Rap1
    EPSTEIN-BARR VIRUS VIRAL PROTEIN LMP1 INCREASES RAP1 SUMOYLATION, ENHANCING TELOMERE MAINTENANCE DURING TUMORIGENESIS By WYATT TYLER CRAMBLET B.S., Biology, Gordon State College, 2016 A Thesis Submitted to the Faculty of Mercer University School of Medicine in Partial Fulfillment of the Requirements for the Degree MASTER OF SCIENCE IN BIOMEDICAL SCIENCE Macon, GA 2018 EPSTEIN-BARR VIRUS VIRAL PROTEIN LMP1 INCREASES RAP1 SUMOYLATION, ENHANCING TELOMERE MAINTENANCE DURING TUMORIGENESIS By WYATT TYLER CRAMBLET Approved: ___________________________________________ Date _________ Gretchen L. Bentz, Ph.D. Faculty Advisor for Thesis ___________________________________________ Date _________ Richard O. McCann, Ph.D. Thesis Committee Member ___________________________________________ Date _________ Laura Silo-Suh, Ph.D. Thesis Committee Member ___________________________________________ Date _________ Jon Shuman, Ph.D. Thesis Committee Member ___________________________________________ Date _________ Jean Sumner, MD Dean, Mercer University School of Medicine ACKNOWLEDGEMENTS I would like to thank the National Institutes of Health for providing the funding for this project and Mercer University School of Medicine for the opportunity for discovery and knowledge this experience has provided me. My utmost respect and gratitude goes to Dr. Gretchen Bentz. Along the course of my research Dr. Bentz has always provided what I required to be successful: materials, focus, experience, and understanding. Her guidance at both the lab and the writing desk is what has allowed me to accomplish the greatest achievement of my academic career. I thank Dr. Richard McCann and Dr. Christy Bridges who have been the Directors of the Master of Science Biomedical Sciences Program. Their leadership has been invaluable to my peers and me for becoming the professionals we set out to be.
    [Show full text]
  • Site-Specific Identification and Quantitation of Endogenous SUMO Modifications Under Native Conditions
    ARTICLE DOI: 10.1038/s41467-017-01271-3 OPEN Site-specific identification and quantitation of endogenous SUMO modifications under native conditions Ryan J. Lumpkin1, Hongbo Gu2, Yiying Zhu2, Marilyn Leonard3, Alla S. Ahmad 1, Karl R. Clauser4, Jesse G. Meyer 1, Eric J. Bennett 3 & Elizabeth A. Komives1 fi fi 1234567890 Small ubiquitin-like modi er (SUMO) modi cation regulates numerous cellular processes. Unlike ubiquitin, detection of endogenous SUMOylated proteins is limited by the lack of naturally occurring protease sites in the C-terminal tail of SUMO proteins. Proteome-wide detection of SUMOylation sites on target proteins typically requires ectopic expression of mutant SUMOs with introduced tryptic sites. Here, we report a method for proteome-wide, site-level detection of endogenous SUMOylation that uses α-lytic protease, WaLP. WaLP digestion of SUMOylated proteins generates peptides containing SUMO-remnant diglycyl- lysine (KGG) at the site of SUMO modification. Using previously developed immuno-affinity isolation of KGG-containing peptides followed by mass spectrometry, we identified 1209 unique endogenous SUMO modification sites. We also demonstrate the impact of protea- some inhibition on ubiquitin and SUMO-modified proteomes using parallel quantitation of ubiquitylated and SUMOylated peptides. This methodological advancement enables deter- mination of endogenous SUMOylated proteins under completely native conditions. 1 Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093-0378, USA. 2 Proteomic Service Group Cell Signaling Technology, 3 Trask Lane, Danvers, MA 01923, USA. 3 Cell and Developmental Biology, University of California, San Diego, La Jolla, CA 92093-0380, USA. 4 Broad Institute of MIT and Harvard, 415 Main St, Cambridge, MA 02142, USA.
    [Show full text]
  • Emerging Applications of Metabolomic and Genomic Profiling in Diabetic Clinical Medicine
    Reviews/Commentaries/ADA Statements REVIEW Emerging Applications of Metabolomic and Genomic Profiling in Diabetic Clinical Medicine AINE M. MCKILLOP, PHD however, powerful methodologies are now PETER R. FLATT, PHD available for exploitation of novel quantita- tive and qualitative disease-related biomark- ers. Novel biomarkers are needed that are Clinical and epidemiological metabolomics provides a unique opportunity to look at genotype- independent of known clinical risk factors. phenotype relationships as well as the body’s responses to environmental and lifestyle factors. fl Fundamentally, metabolomics aims to Fundamentally, it provides information on the universal outcome of in uencing factors on monitor changes in products of metabo- disease states and has great potential in the early diagnosis, therapy monitoring, and understand- lism and provide valuable information on ing of the pathogenesis of disease. Diseases, such as diabetes, with a complex set of interactions fl between genetic and environmental factors, produce changes in the body’s biochemical profile, arangeofin uencing factors and gene- thereby providing potential markers for diagnosis and initiation of therapies. There is clearly related outcomes. Exploitation of genomic a need to discover new ways to aid diagnosis and assessment of glycemic status to help reduce technology in recent times has resulted in diabetes complications and improve the quality of life. Many factors, including peptides, proteins, many technical advances, and genomic metabolites, nucleic acids, and polymorphisms, have been proposed as putative biomarkers for analysis has now emerged as a valuable diabetes. Metabolomics is an approach used to identify and assess metabolic characteristics, changes, tool in predicting the body’s response to and phenotypes in response to influencing factors, such as environment, diet, lifestyle, and path- fi stimuli caused by disease or injury.
    [Show full text]
  • Role of Ubiquitin and SUMO in Intracellular Trafficking
    Curr. Issues Mol. Biol. (2020) 35: 99-108. Role of Ubiquitin and SUMO in Intracellular Trafcking Maria Sundvall1,2,3* 1Institute of Biomedicine, University of Turku, Turku, Finland. 2Western Cancer Centre FICAN West, Turku University Hospital, Turku, Finland. 3Department of Oncology and Radiotherapy, University of Turku, Turku, Finland. *Correspondence: [email protected] htps://doi.org/10.21775/cimb.035.099 Abstract behaviour. In addition to e.g. chemical groups, Precise location of proteins at a given time within such as phosphate groups, proteins can be modi- a cell is essential to convey specifc signals and fed by small polypeptides, such as ubiquitin and result in a relevant functional outcome. Small SUMO. Ubiquitin and SUMO share a similar ubiquitin-like modifcations, such as ubiquitin three-dimensional structure and are principally and SUMO, represent a delicate and diverse way covalently linked at lysine (K) residues of sub- to transiently regulate intracellular trafcking strate proteins by a similar conjugation pathway events of existing proteins in cells. Trafcking (Hershko et al., 1998; Hay, 2013). Unlike ubiq- of multiple proteins is controlled reversibly by uitin, SUMO is preferentially atached at SUMO ubiquitin and/or SUMO directly or indirectly via consensus sites ΨKxE in substrates Ψ = hydropho- regulation of transport machinery components. bic residue with high preference for I or V, x = any Regulation is dynamic and multilayered, involv- amino acid) under steady state conditions, but ing active crosstalk and interdependence between under stress conditions in particular more non- post-translational modifcations. However, in consensus sites are SUMOylated (Hendriks et al., most cases regulation appears very complex, and 2016).
    [Show full text]
  • Full Text (PDF)
    medRxiv preprint doi: https://doi.org/10.1101/2020.12.29.20248986; this version posted January 4, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license . Insights into the molecular mechanism of anticancer drug ruxolitinib repurposable in COVID-19 therapy Manisha Mandal Department of Physiology, MGM Medical College, Kishanganj-855107, India Email: [email protected], ORCID: https://orcid.org/0000-0002-9562-5534 Shyamapada Mandal* Department of Zoology, University of Gour Banga, Malda-732103, India Email: [email protected], ORCID: https://orcid.org/0000-0002-9488-3523 *Corresponding author: Email: [email protected]; [email protected] Abstract Due to non-availability of specific therapeutics against COVID-19, repurposing of approved drugs is a reasonable option. Cytokines imbalance in COVID-19 resembles cancer; exploration of anti-inflammatory agents, might reduce COVID-19 mortality. The current study investigates the effect of ruxolitinib treatment in SARS-CoV-2 infected alveolar cells compared to the uninfected one from the GSE5147507 dataset. The protein-protein interaction network, biological process and functional enrichment of differentially expressed genes were studied using STRING App of the Cytoscape software and R programming tools. The present study indicated that ruxolitinib treatment elicited similar response equivalent to that of SARS-CoV-2 uninfected situation by inducing defense response in host against virus infection by RLR and NOD like receptor pathways. Further, the effect of ruxolitinib in SARS- CoV-2 infection was mainly caused by significant suppression of IFIH1, IRF7 and MX1 genes as well as inhibition of DDX58/IFIH1-mediated induction of interferon- I and -II signalling.
    [Show full text]
  • Bioinformatic Gene Analysis for Potential Biomarkers And
    Zou et al. J Transl Med (2019) 17:45 https://doi.org/10.1186/s12967-019-1790-x Journal of Translational Medicine RESEARCH Open Access Bioinformatic gene analysis for potential biomarkers and therapeutic targets of atrial fbrillation-related stroke Rongjun Zou1†, Dingwen Zhang1†, Lei Lv1†, Wanting Shi2, Zijiao Song3, Bin Yi1, Bingjia Lai4, Qian Chen5, Songran Yang6,7* and Ping Hua1* Abstract Background: Atrial fbrillation (AF) is one of the most prevalent sustained arrhythmias, however, epidemiological data may understate its actual prevalence. Meanwhile, AF is considered to be a major cause of ischemic strokes due to irregular heart-rhythm, coexisting chronic vascular infammation, and renal insufciency, and blood stasis. We studied co-expressed genes to understand relationships between atrial fbrillation (AF) and stroke and reveal potential biomarkers and therapeutic targets of AF-related stroke. Methods: AF-and stroke-related diferentially expressed genes (DEGs) were identifed via bioinformatic analysis Gene Expression Omnibus (GEO) datasets GSE79768 and GSE58294, respectively. Subsequently, extensive target prediction and network analyses methods were used to assess protein–protein interaction (PPI) networks, Gene Ontology (GO) terms and pathway enrichment for DEGs, and co-expressed DEGs coupled with corresponding predicted miRNAs involved in AF and stroke were assessed as well. Results: We identifed 489, 265, 518, and 592 DEGs in left atrial specimens and cardioembolic stroke blood samples at < 3, 5, and 24 h, respectively. LRRK2, CALM1, CXCR4, TLR4, CTNNB1, and CXCR2 may be implicated in AF and the hub- genes of CD19, FGF9, SOX9, GNGT1, and NOG may be associated with stroke. Finally, co-expressed DEGs of ZNF566, PDZK1IP1, ZFHX3, and PITX2 coupled with corresponding predicted miRNAs, especially miR-27a-3p, miR-27b-3p, and miR-494-3p may be signifcantly associated with AF-related stroke.
    [Show full text]
  • Genes Predisposing to Type 1 Diabetes Mellitus and Pathophysiology: a Narrative Review Tajudeen Yahaya,¹ Titilola Salisu²
    100 Med J Indones 2020;29(1) Review Article Genes predisposing to type 1 diabetes mellitus and pathophysiology: a narrative review Tajudeen Yahaya,¹ Titilola Salisu² pISSN: 0853-1773 • eISSN: 2252-8083 ABSTRACT https://doi.org/10.13181/mji.rev.203732 The possibility of targeting the causal genes along with the mechanisms of Med J Indones. 2020;29:100–9 pathogenically complex diseases has led to numerous studies on the genetic etiology Received: April 24, 2019 of some diseases. In particular, studies have added more genes to the list of type 1 Accepted: January 23, 2020 diabetes mellitus (T1DM) suspect genes, necessitating an update for the interest of Authors' affiliations: all stakeholders. Therefore this review articulates T1DM suspect genes and their ¹Department of Biology, Faculty of pathophysiology. Notable electronic databases, including Medline, Scopus, PubMed, Science, Federal University Birnin Kebbi, and Google-Scholar were searched for relevant information. The search identified Kebbi State, Nigeria, ²Department of Zoology and Environmental Biology, over 73 genes suspected in the pathogenesis of T1DM, with human leukocyte antigen, Olabisi Onabanjo University Ago-Iwoye, insulin gene, and cytotoxic T lymphocyte-associated antigen 4 accounting for most of Ogun State, Nigeria the cases. Mutations in these genes, along with environmental factors, may produce Corresponding author: a defective immune response in the pancreas, resulting in β-cell autoimmunity, insulin Tajudeen Yahaya deficiency, and hyperglycemia. The mechanisms leading to these cellular reactions are Department of Biology, Faculty of gene-specific and, if targeted in diabetic individuals, may lead to improved treatment. Science, Federal University Birnin Kebbi, Medical practitioners are advised to formulate treatment procedures that target these Kebbi State, Nigeria Tel/Fax: +234-8033550788/ genes in patients with T1DM.
    [Show full text]
  • Analyses of Multiple Single-Nucleotide Polymorphisms in the SUMO4/IDDM5 Region in Affected Sib-Pair Families with Type I Diabetes
    Genes and Immunity (2009) 10, S16–S20 & 2009 Macmillan Publishers Limited All rights reserved 1466-4879/09 $32.00 www.nature.com/gene ORIGINAL ARTICLE Analyses of multiple single-nucleotide polymorphisms in the SUMO4/IDDM5 region in affected sib-pair families with type I diabetes R Podolsky, MV Prasad Linga-Reddy, J-X She and the Type I Diabetes Genetics Consortium Center for Biotechnology and Genomic Medicine, Medical College of Georgia, Augusta, GA, USA Previous studies suggested that the SUMO4 gene, located in the IDDM5 interval on chromosome 6q25, was associated with type I diabetes (T1D) and several other autoimmune diseases. Subsequent analyses of the SUMO4 variants with T1D suggested that the association was stronger and more consistent in the Asian populations. In addition, considerable heterogeneity has been observed in the Caucasian populations. In this report, a 40-kb genomic interval including the SUMO4 gene was tagged with 15 single-nucleotide polymorphisms. A total of 2317 affected sib-pair families from the Type I Diabetes Genetic Consortium were genotyped using both the Illumina and Sequenom genotyping platforms. In these Caucasian families, we found little evidence supporting an association between SUMO4 and T1D. Genes and Immunity (2009) 10, S16–S20; doi:10.1038/gene.2009.86 Keywords: type I diabetes; SUMO4; genetic susceptibility; linkage disequilibrium; single-nucleotide polymorphism Introduction influence NFkB activity.1,6 Subsequently, this initial observation was a subject of intensive debate because SUMO4 is a member of a highly conserved family of of the failure to replicate the association in several genes that encode small ubiquitin-related modifiers Caucasian populations.6–9 Since the initial report, a (SUMO).1 The process to attach SUMO to proteins is number of studies have also revealed significant associa- known as sumoylation.
    [Show full text]
  • Genome-Wide Association Analyses of Symptom Severity Among Clozapine- Treated Patients with Schizophrenia Spectrum Disorder
    Genome-wide association analyses of symptom severity among clozapine- treated patients with schizophrenia spectrum disorder Supplementary Information Supplementary Methods…………………………………………………………………………..……….……..3 Supplementary Table 1. Conversion Table of PANSS Scores and CGI-S Scores…………………...……….….6 Supplementary Figure 1. Principal Components Plots for Primary and Sensitivity Analyses……..…………....8 Supplementary Table 2. Number of Samples and SNPs Included in Each Step of the Quality Control……..….9 Supplementary Table 3. Complex-LD Regions and Long-Range LD Regions that Were Excluded from PRS- Analyses……………………………………………………………………………..……………….…………...10 Supplementary Table 4. Activity Scoring Assignments for Each of the Observed Alleles……………..……...11 Supplementary Table 5. List of Inhibitors and Inducers by Enzyme…………………………….…..……..…..13 Supplementary Table 6. Descriptive Statistics of the Study Population………………………….…..………...14 Supplementary Figure 2. Q-Q plots of the Genome-wide Association Analyses…………………………...….15 Supplementary Figure 3. Regional Associations Plots for the Top SNPs identified in GWA Analysis………..16 Supplementary Table 7. Top SNPs Identified in the GWA Analyses…………………………………………..17 Supplementary Results………………………………………………………………………………………….18 Supplementary Figure 4. Dose-corrected clozapine concentrations in relation to the rs1923778 polymorphisms……………………………………………………………………………………………………19 Supplementary Table 8. Tukey post-hoc test for dose-adjusted clozapine concentrations of the rs1923778 polymorphism……………………………………………………………………………………………………..19
    [Show full text]
  • Metascape Provides a Biologist-Oriented Resource for the Analysis of Systems-Level Datasets
    UC San Diego UC San Diego Previously Published Works Title Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Permalink https://escholarship.org/uc/item/44d0x399 Journal Nature communications, 10(1) ISSN 2041-1723 Authors Zhou, Yingyao Zhou, Bin Pache, Lars et al. Publication Date 2019-04-03 DOI 10.1038/s41467-019-09234-6 Peer reviewed eScholarship.org Powered by the California Digital Library University of California ARTICLE https://doi.org/10.1038/s41467-019-09234-6 OPEN Metascape provides a biologist-oriented resource for the analysis of systems-level datasets Yingyao Zhou1, Bin Zhou1, Lars Pache2, Max Chang 3, Alireza Hadj Khodabakhshi1, Olga Tanaseichuk1, Christopher Benner3 & Sumit K. Chanda2 A critical component in the interpretation of systems-level studies is the inference of enri- ched biological pathways and protein complexes contained within OMICs datasets. Suc- 1234567890():,; cessful analysis requires the integration of a broad set of current biological databases and the application of a robust analytical pipeline to produce readily interpretable results. Metascape is a web-based portal designed to provide a comprehensive gene list annotation and analysis resource for experimental biologists. In terms of design features, Metascape combines functional enrichment, interactome analysis, gene annotation, and membership search to leverage over 40 independent knowledgebases within one integrated portal. Additionally, it facilitates comparative analyses of datasets across multiple independent and orthogonal experiments. Metascape provides a significantly simplified user experience through a one- click Express Analysis interface to generate interpretable outputs. Taken together, Metascape is an effective and efficient tool for experimental biologists to comprehensively analyze and interpret OMICs-based studies in the big data era.
    [Show full text]
  • 1 Supplementary Material Figure S1. Volcano Plot of Differentially
    Supplementary material Figure S1. Volcano Plot of differentially expressed genes between preterm infants fed own mother’s milk (OMM) or pasteurized donated human milk (DHM). Table S1. The 10 most representative biological processes filtered for enrichment p- value in preterm infants. Biological Processes p-value Quantity of DEG* Transcription, DNA-templated 3.62x10-24 189 Regulation of transcription, DNA-templated 5.34x10-22 188 Transport 3.75x10-17 140 Cell cycle 1.03x10-13 65 Gene expression 3.38x10-10 60 Multicellular organismal development 6.97x10-10 86 1 Protein transport 1.73x10-09 56 Cell division 2.75x10-09 39 Blood coagulation 3.38x10-09 46 DNA repair 8.34x10-09 39 Table S2. Differential genes in transcriptomic analysis of exfoliated epithelial intestinal cells between preterm infants fed own mother’s milk (OMM) and pasteurized donated human milk (DHM). Gene name Gene Symbol p-value Fold-Change (OMM vs. DHM) (OMM vs. DHM) Lactalbumin, alpha LALBA 0.0024 2.92 Casein kappa CSN3 0.0024 2.59 Casein beta CSN2 0.0093 2.13 Cytochrome c oxidase subunit I COX1 0.0263 2.07 Casein alpha s1 CSN1S1 0.0084 1.71 Espin ESPN 0.0008 1.58 MTND2 ND2 0.0138 1.57 Small ubiquitin-like modifier 3 SUMO3 0.0037 1.54 Eukaryotic translation elongation EEF1A1 0.0365 1.53 factor 1 alpha 1 Ribosomal protein L10 RPL10 0.0195 1.52 Keratin associated protein 2-4 KRTAP2-4 0.0019 1.46 Serine peptidase inhibitor, Kunitz SPINT1 0.0007 1.44 type 1 Zinc finger family member 788 ZNF788 0.0000 1.43 Mitochondrial ribosomal protein MRPL38 0.0020 1.41 L38 Diacylglycerol
    [Show full text]
  • Genetic Contribution to Type 1 Diabetes Microvascular Complications Ettie
    Genetic contribution to type 1 diabetes microvascular complications Ettie M. Lipner Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy under the Executive Committee of the Graduate School of Arts and Sciences COLUMBIA UNIVERSITY 2013 ! ©2013 Ettie M. Lipner All rights reserved ! ABSTRACT Genetic contribution to type 1 diabetes microvascular complications Ettie M. Lipner Type 1 diabetes (T1D) is an autoimmune disease characterized by destruction of beta cells in the pancreas resulting in insulin deficiency, which leads to hyperglycemia and organ damage. T1D patients experience an increased risk of morbidity and mortality due to long-term complications, specifically retinopathy, nephropathy, and neuropathy. Studies demonstrating familial aggregation support the claim that a genetic contribution may influence the development of complications. This dissertation aims to identify genes/chromosomal regions that predispose T1D patients to, or protect them from, the expression of the chronic microvascular complications: retinopathy, neuropathy and nephropathy. In my first chapter, I introduce the history of type 1 diabetes and microvascular complications and their importance as a public health concern. Data show that the prevalence for T1D is increasing, and thus it is likely that the prevalence for the associated complications will also increase. Further, a large proportion of T1D patients develop at least one microvascular complication within 15 years of T1D diagnosis. By identifying risk alleles for the microvascular complications of T1D, the findings of this study could allow physicians to determine which patients are at greater/lesser risk for developing complications, help to develop interventions to delay or protect against the development of complications and thus reduce medical expenditure and suffering due to diabetes.
    [Show full text]