Implementation of an Hybrid Machine Learning Methodology for Pharmacological Modeling
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Accurate Annotation of Human Protein-Coding Small Open Reading Frames
HHS Public Access Author manuscript Author ManuscriptAuthor Manuscript Author Nat Chem Manuscript Author Biol. Author Manuscript Author manuscript; available in PMC 2020 June 09. Published in final edited form as: Nat Chem Biol. 2020 April ; 16(4): 458–468. doi:10.1038/s41589-019-0425-0. Accurate annotation of human protein-coding small open reading frames Thomas F. Martinez1,*, Qian Chu1, Cynthia Donaldson1, Dan Tan1, Maxim N. Shokhirev2, Alan Saghatelian1,* 1Clayton Foundation Laboratories for Peptide Biology, Salk Institute for Biological Studies, La Jolla, California 92037, USA 2Razavi Newman Integrative Genomics Bioinformatics Core, Salk Institute for Biological Studies, La Jolla, California 92037, USA Abstract Functional protein-coding small open reading frames (smORFs) are emerging as an important class of genes. However, the number of translated smORFs in the human genome is unclear because proteogenomic methods are not sensitive enough, and, as we show, Ribo-Seq strategies require additional measures to ensure comprehensive and accurate smORF annotation. Here, we integrate de novo transcriptome assembly and Ribo-Seq into an improved workflow that overcomes obstacles with previous methods to more confidently annotate thousands of smORFs. Evolutionary conservation analyses suggest that hundreds of smORF-encoded microproteins are likely functional. Additionally, many smORFs are regulated during fundamental biological processes, such as cell stress. Peptides derived from smORFs are also detectable on human leukocyte antigen complexes, revealing smORFs as a source of antigens. Thus, by including additional validation into our smORF annotation workflow, we accurately identify thousands of unannotated translated smORFs that will provide a rich pool of unexplored, functional human genes. Annotation of open reading frames (ORFs) from genome sequencing was initially carried out by locating in-frame start (AUG) and stop codons1–3. -
Environmental Influences on Endothelial Gene Expression
ENDOTHELIAL CELL GENE EXPRESSION John Matthew Jeff Herbert Supervisors: Prof. Roy Bicknell and Dr. Victoria Heath PhD thesis University of Birmingham August 2012 University of Birmingham Research Archive e-theses repository This unpublished thesis/dissertation is copyright of the author and/or third parties. The intellectual property rights of the author or third parties in respect of this work are as defined by The Copyright Designs and Patents Act 1988 or as modified by any successor legislation. Any use made of information contained in this thesis/dissertation must be in accordance with that legislation and must be properly acknowledged. Further distribution or reproduction in any format is prohibited without the permission of the copyright holder. ABSTRACT Tumour angiogenesis is a vital process in the pathology of tumour development and metastasis. Targeting markers of tumour endothelium provide a means of targeted destruction of a tumours oxygen and nutrient supply via destruction of tumour vasculature, which in turn ultimately leads to beneficial consequences to patients. Although current anti -angiogenic and vascular targeting strategies help patients, more potently in combination with chemo therapy, there is still a need for more tumour endothelial marker discoveries as current treatments have cardiovascular and other side effects. For the first time, the analyses of in-vivo biotinylation of an embryonic system is performed to obtain putative vascular targets. Also for the first time, deep sequencing is applied to freshly isolated tumour and normal endothelial cells from lung, colon and bladder tissues for the identification of pan-vascular-targets. Integration of the proteomic, deep sequencing, public cDNA libraries and microarrays, delivers 5,892 putative vascular targets to the science community. -
A Computational Approach for Defining a Signature of Β-Cell Golgi Stress in Diabetes Mellitus
Page 1 of 781 Diabetes A Computational Approach for Defining a Signature of β-Cell Golgi Stress in Diabetes Mellitus Robert N. Bone1,6,7, Olufunmilola Oyebamiji2, Sayali Talware2, Sharmila Selvaraj2, Preethi Krishnan3,6, Farooq Syed1,6,7, Huanmei Wu2, Carmella Evans-Molina 1,3,4,5,6,7,8* Departments of 1Pediatrics, 3Medicine, 4Anatomy, Cell Biology & Physiology, 5Biochemistry & Molecular Biology, the 6Center for Diabetes & Metabolic Diseases, and the 7Herman B. Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, IN 46202; 2Department of BioHealth Informatics, Indiana University-Purdue University Indianapolis, Indianapolis, IN, 46202; 8Roudebush VA Medical Center, Indianapolis, IN 46202. *Corresponding Author(s): Carmella Evans-Molina, MD, PhD ([email protected]) Indiana University School of Medicine, 635 Barnhill Drive, MS 2031A, Indianapolis, IN 46202, Telephone: (317) 274-4145, Fax (317) 274-4107 Running Title: Golgi Stress Response in Diabetes Word Count: 4358 Number of Figures: 6 Keywords: Golgi apparatus stress, Islets, β cell, Type 1 diabetes, Type 2 diabetes 1 Diabetes Publish Ahead of Print, published online August 20, 2020 Diabetes Page 2 of 781 ABSTRACT The Golgi apparatus (GA) is an important site of insulin processing and granule maturation, but whether GA organelle dysfunction and GA stress are present in the diabetic β-cell has not been tested. We utilized an informatics-based approach to develop a transcriptional signature of β-cell GA stress using existing RNA sequencing and microarray datasets generated using human islets from donors with diabetes and islets where type 1(T1D) and type 2 diabetes (T2D) had been modeled ex vivo. To narrow our results to GA-specific genes, we applied a filter set of 1,030 genes accepted as GA associated. -
Supplementary Table 3 Complete List of RNA-Sequencing Analysis of Gene Expression Changed by ≥ Tenfold Between Xenograft and Cells Cultured in 10%O2
Supplementary Table 3 Complete list of RNA-Sequencing analysis of gene expression changed by ≥ tenfold between xenograft and cells cultured in 10%O2 Expr Log2 Ratio Symbol Entrez Gene Name (culture/xenograft) -7.182 PGM5 phosphoglucomutase 5 -6.883 GPBAR1 G protein-coupled bile acid receptor 1 -6.683 CPVL carboxypeptidase, vitellogenic like -6.398 MTMR9LP myotubularin related protein 9-like, pseudogene -6.131 SCN7A sodium voltage-gated channel alpha subunit 7 -6.115 POPDC2 popeye domain containing 2 -6.014 LGI1 leucine rich glioma inactivated 1 -5.86 SCN1A sodium voltage-gated channel alpha subunit 1 -5.713 C6 complement C6 -5.365 ANGPTL1 angiopoietin like 1 -5.327 TNN tenascin N -5.228 DHRS2 dehydrogenase/reductase 2 leucine rich repeat and fibronectin type III domain -5.115 LRFN2 containing 2 -5.076 FOXO6 forkhead box O6 -5.035 ETNPPL ethanolamine-phosphate phospho-lyase -4.993 MYO15A myosin XVA -4.972 IGF1 insulin like growth factor 1 -4.956 DLG2 discs large MAGUK scaffold protein 2 -4.86 SCML4 sex comb on midleg like 4 (Drosophila) Src homology 2 domain containing transforming -4.816 SHD protein D -4.764 PLP1 proteolipid protein 1 -4.764 TSPAN32 tetraspanin 32 -4.713 N4BP3 NEDD4 binding protein 3 -4.705 MYOC myocilin -4.646 CLEC3B C-type lectin domain family 3 member B -4.646 C7 complement C7 -4.62 TGM2 transglutaminase 2 -4.562 COL9A1 collagen type IX alpha 1 chain -4.55 SOSTDC1 sclerostin domain containing 1 -4.55 OGN osteoglycin -4.505 DAPL1 death associated protein like 1 -4.491 C10orf105 chromosome 10 open reading frame 105 -4.491 -
4-6 Weeks Old Female C57BL/6 Mice Obtained from Jackson Labs Were Used for Cell Isolation
Methods Mice: 4-6 weeks old female C57BL/6 mice obtained from Jackson labs were used for cell isolation. Female Foxp3-IRES-GFP reporter mice (1), backcrossed to B6/C57 background for 10 generations, were used for the isolation of naïve CD4 and naïve CD8 cells for the RNAseq experiments. The mice were housed in pathogen-free animal facility in the La Jolla Institute for Allergy and Immunology and were used according to protocols approved by the Institutional Animal Care and use Committee. Preparation of cells: Subsets of thymocytes were isolated by cell sorting as previously described (2), after cell surface staining using CD4 (GK1.5), CD8 (53-6.7), CD3ε (145- 2C11), CD24 (M1/69) (all from Biolegend). DP cells: CD4+CD8 int/hi; CD4 SP cells: CD4CD3 hi, CD24 int/lo; CD8 SP cells: CD8 int/hi CD4 CD3 hi, CD24 int/lo (Fig S2). Peripheral subsets were isolated after pooling spleen and lymph nodes. T cells were enriched by negative isolation using Dynabeads (Dynabeads untouched mouse T cells, 11413D, Invitrogen). After surface staining for CD4 (GK1.5), CD8 (53-6.7), CD62L (MEL-14), CD25 (PC61) and CD44 (IM7), naïve CD4+CD62L hiCD25-CD44lo and naïve CD8+CD62L hiCD25-CD44lo were obtained by sorting (BD FACS Aria). Additionally, for the RNAseq experiments, CD4 and CD8 naïve cells were isolated by sorting T cells from the Foxp3- IRES-GFP mice: CD4+CD62LhiCD25–CD44lo GFP(FOXP3)– and CD8+CD62LhiCD25– CD44lo GFP(FOXP3)– (antibodies were from Biolegend). In some cases, naïve CD4 cells were cultured in vitro under Th1 or Th2 polarizing conditions (3, 4). -
Molecular Profiling of Peripheral Blood Is Associated with Circulating Tumor Cells Content and Poor Survival in Metastatic Castration-Resistant Prostate Cancer
www.impactjournals.com/oncotarget/ Oncotarget, Vol. 6, No. 12 Molecular profiling of peripheral blood is associated with circulating tumor cells content and poor survival in metastatic castration-resistant prostate cancer Mercedes Marín-Aguilera1, Òscar Reig1,2, Juan José Lozano3, Natalia Jiménez1, Susana García-Recio1,4, Nadina Erill5, Lydia Gaba2, Andrea Tagliapietra2, Vanesa Ortega2, Gemma Carrera6, Anna Colomer5, Pedro Gascón4 and Begoña Mellado1,2 1 Translational Genomics Group and Targeted Therapeutics in Solid Tumors Group, Institut d’Investigacions Biomèdiques August Pi i Sunyer (IDIBAPS), Barcelona, Spain 2 Medical Oncology Department, Hospital Clínic, Barcelona, Spain 3 Bioinformatics Platform Department, Centro de Investigación Biomédica en Red en el Área temática de Enfermedades Hepáticas y Digestivas (CIBEREHD), Hospital Clínic, Barcelona, Spain 4 Laboratory of Translational Oncology, Fundació Clínic per a la Recerca Biomèdica, Barcelona, Spain 5 Althia, Barcelona, Spain 6 Medical Oncology Department, Hospital Plató, Barcelona, Spain Correspondence to: Begoña Mellado, email: [email protected] Keywords: circulating tumor cells, peripheral blood, microarrays, cell search system Received: January 22, 2015 Accepted: February 14, 2015 Published: March 12, 2015 This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. ABSTRACT The enumeration of circulating -
Genetic and Genomic Analysis of Hyperlipidemia, Obesity and Diabetes Using (C57BL/6J × TALLYHO/Jngj) F2 Mice
University of Tennessee, Knoxville TRACE: Tennessee Research and Creative Exchange Nutrition Publications and Other Works Nutrition 12-19-2010 Genetic and genomic analysis of hyperlipidemia, obesity and diabetes using (C57BL/6J × TALLYHO/JngJ) F2 mice Taryn P. Stewart Marshall University Hyoung Y. Kim University of Tennessee - Knoxville, [email protected] Arnold M. Saxton University of Tennessee - Knoxville, [email protected] Jung H. Kim Marshall University Follow this and additional works at: https://trace.tennessee.edu/utk_nutrpubs Part of the Animal Sciences Commons, and the Nutrition Commons Recommended Citation BMC Genomics 2010, 11:713 doi:10.1186/1471-2164-11-713 This Article is brought to you for free and open access by the Nutrition at TRACE: Tennessee Research and Creative Exchange. It has been accepted for inclusion in Nutrition Publications and Other Works by an authorized administrator of TRACE: Tennessee Research and Creative Exchange. For more information, please contact [email protected]. Stewart et al. BMC Genomics 2010, 11:713 http://www.biomedcentral.com/1471-2164/11/713 RESEARCH ARTICLE Open Access Genetic and genomic analysis of hyperlipidemia, obesity and diabetes using (C57BL/6J × TALLYHO/JngJ) F2 mice Taryn P Stewart1, Hyoung Yon Kim2, Arnold M Saxton3, Jung Han Kim1* Abstract Background: Type 2 diabetes (T2D) is the most common form of diabetes in humans and is closely associated with dyslipidemia and obesity that magnifies the mortality and morbidity related to T2D. The genetic contribution to human T2D and related metabolic disorders is evident, and mostly follows polygenic inheritance. The TALLYHO/ JngJ (TH) mice are a polygenic model for T2D characterized by obesity, hyperinsulinemia, impaired glucose uptake and tolerance, hyperlipidemia, and hyperglycemia. -
Redefining the Specificity of Phosphoinositide-Binding by Human
bioRxiv preprint doi: https://doi.org/10.1101/2020.06.20.163253; this version posted June 21, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. Redefining the specificity of phosphoinositide-binding by human PH domain-containing proteins Nilmani Singh1†, Adriana Reyes-Ordoñez1†, Michael A. Compagnone1, Jesus F. Moreno Castillo1, Benjamin J. Leslie2, Taekjip Ha2,3,4,5, Jie Chen1* 1Department of Cell & Developmental Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801; 2Department of Biophysics and Biophysical Chemistry, Johns Hopkins University School of Medicine, Baltimore, MD 21205; 3Department of Biophysics, Johns Hopkins University, Baltimore, MD 21218; 4Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205; 5Howard Hughes Medical Institute, Baltimore, MD 21205, USA †These authors contributed equally to this work. *Correspondence: [email protected]. bioRxiv preprint doi: https://doi.org/10.1101/2020.06.20.163253; this version posted June 21, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC 4.0 International license. ABSTRACT Pleckstrin homology (PH) domains are presumed to bind phosphoinositides (PIPs), but specific interaction with and regulation by PIPs for most PH domain-containing proteins are unclear. Here we employed a single-molecule pulldown assay to study interactions of lipid vesicles with full-length proteins in mammalian whole cell lysates. -
Supplementary Table 1: Adhesion Genes Data Set
Supplementary Table 1: Adhesion genes data set PROBE Entrez Gene ID Celera Gene ID Gene_Symbol Gene_Name 160832 1 hCG201364.3 A1BG alpha-1-B glycoprotein 223658 1 hCG201364.3 A1BG alpha-1-B glycoprotein 212988 102 hCG40040.3 ADAM10 ADAM metallopeptidase domain 10 133411 4185 hCG28232.2 ADAM11 ADAM metallopeptidase domain 11 110695 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 195222 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 165344 8751 hCG20021.3 ADAM15 ADAM metallopeptidase domain 15 (metargidin) 189065 6868 null ADAM17 ADAM metallopeptidase domain 17 (tumor necrosis factor, alpha, converting enzyme) 108119 8728 hCG15398.4 ADAM19 ADAM metallopeptidase domain 19 (meltrin beta) 117763 8748 hCG20675.3 ADAM20 ADAM metallopeptidase domain 20 126448 8747 hCG1785634.2 ADAM21 ADAM metallopeptidase domain 21 208981 8747 hCG1785634.2|hCG2042897 ADAM21 ADAM metallopeptidase domain 21 180903 53616 hCG17212.4 ADAM22 ADAM metallopeptidase domain 22 177272 8745 hCG1811623.1 ADAM23 ADAM metallopeptidase domain 23 102384 10863 hCG1818505.1 ADAM28 ADAM metallopeptidase domain 28 119968 11086 hCG1786734.2 ADAM29 ADAM metallopeptidase domain 29 205542 11085 hCG1997196.1 ADAM30 ADAM metallopeptidase domain 30 148417 80332 hCG39255.4 ADAM33 ADAM metallopeptidase domain 33 140492 8756 hCG1789002.2 ADAM7 ADAM metallopeptidase domain 7 122603 101 hCG1816947.1 ADAM8 ADAM metallopeptidase domain 8 183965 8754 hCG1996391 ADAM9 ADAM metallopeptidase domain 9 (meltrin gamma) 129974 27299 hCG15447.3 ADAMDEC1 ADAM-like, -
5' Untranslated Region Elements Show High Abundance and Great
International Journal of Molecular Sciences Article 0 5 Untranslated Region Elements Show High Abundance and Great Variability in Homologous ABCA Subfamily Genes Pavel Dvorak 1,2,* , Viktor Hlavac 2,3 and Pavel Soucek 2,3 1 Department of Biology, Faculty of Medicine in Pilsen, Charles University, 32300 Pilsen, Czech Republic 2 Biomedical Center, Faculty of Medicine in Pilsen, Charles University, 32300 Pilsen, Czech Republic; [email protected] (V.H.); [email protected] (P.S.) 3 Toxicogenomics Unit, National Institute of Public Health, 100 42 Prague, Czech Republic * Correspondence: [email protected]; Tel.: +420-377593263 Received: 7 October 2020; Accepted: 20 November 2020; Published: 23 November 2020 Abstract: The 12 members of the ABCA subfamily in humans are known for their ability to transport cholesterol and its derivatives, vitamins, and xenobiotics across biomembranes. Several ABCA genes are causatively linked to inborn diseases, and the role in cancer progression and metastasis is studied intensively. The regulation of translation initiation is implicated as the major mechanism in the processes of post-transcriptional modifications determining final protein levels. In the current bioinformatics study, we mapped the features of the 50 untranslated regions (50UTR) known to have the potential to regulate translation, such as the length of 50UTRs, upstream ATG codons, upstream open-reading frames, introns, RNA G-quadruplex-forming sequences, stem loops, and Kozak consensus motifs, in the DNA sequences of all members of the subfamily. Subsequently, the conservation of the features, correlations among them, ribosome profiling data as well as protein levels in normal human tissues were examined. The 50UTRs of ABCA genes contain above-average numbers of upstream ATGs, open-reading frames and introns, as well as conserved ones, and these elements probably play important biological roles in this subfamily, unlike RG4s. -
HIC1 Modulates Prostate Cancer Progression by Epigenetic Modification
Author Manuscript Published OnlineFirst on January 22, 2013; DOI: 10.1158/1078-0432.CCR-12-2888 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited. HIC1 modulates prostate cancer progression by epigenetic modification Jianghua Zheng#1, 2, Jinglong Wang#1, Xueqing Sun#1,Mingang Hao1,Tao Ding3, Dan Xiong 1, Xiumin Wang1, Yu Zhu 4, Gang Xiao 1, Guangcun Cheng1, Meizhong Zhao5, Jian Zhang6, Jianhua Wang1* 1 Department of Biochemistry and Molecular Cell Biology, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China. 2 Shanghai Public Health Clinical Center, Fudan University, Shanghai, 201508, China. 3 Department of Urology, Shanghai Putuo Hospital, Shanghai Traditional Chinese Medicine University, Shanghai, 200062, China. 4 Department of Urology, Shanghai Ruijin Hospital, Shanghai, 200025, China. 5 Shanghai Ruijin Hospital, Comprehensive Breast Health Center, Shanghai, 200025, China. 6 Guangxi Medical University, Nanning, 530021, China. # J Zheng, J Wang and X Sun contributed equally to this work. *Corresponding author: Jianhua Wang, PhD, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China. Phone: 021-54660871; Fax: 021-63842157; E-mail:[email protected]. No potential conflicts of interest were disclosed 1 Downloaded from clincancerres.aacrjournals.org on October 1, 2021. © 2013 American Association for Cancer Research. Author Manuscript Published OnlineFirst on January 22, 2013; DOI: 10.1158/1078-0432.CCR-12-2888 Author manuscripts have been peer reviewed and accepted for publication but have not yet been edited. Statement of Translational Relevance This study aimed to further our understanding of the role that hypermethylatioted in cancer 1 (HIC1) plays in prostate cancer (PCa) progression. -
Why Cells and Viruses Cannot Survive Without an ESCRT
cells Review Why Cells and Viruses Cannot Survive without an ESCRT Arianna Calistri * , Alberto Reale, Giorgio Palù and Cristina Parolin Department of Molecular Medicine, University of Padua, 35121 Padua, Italy; [email protected] (A.R.); [email protected] (G.P.); [email protected] (C.P.) * Correspondence: [email protected] Abstract: Intracellular organelles enwrapped in membranes along with a complex network of vesicles trafficking in, out and inside the cellular environment are one of the main features of eukaryotic cells. Given their central role in cell life, compartmentalization and mechanisms allowing their maintenance despite continuous crosstalk among different organelles have been deeply investigated over the past years. Here, we review the multiple functions exerted by the endosomal sorting complex required for transport (ESCRT) machinery in driving membrane remodeling and fission, as well as in repairing physiological and pathological membrane damages. In this way, ESCRT machinery enables different fundamental cellular processes, such as cell cytokinesis, biogenesis of organelles and vesicles, maintenance of nuclear–cytoplasmic compartmentalization, endolysosomal activity. Furthermore, we discuss some examples of how viruses, as obligate intracellular parasites, have evolved to hijack the ESCRT machinery or part of it to execute/optimize their replication cycle/infection. A special emphasis is given to the herpes simplex virus type 1 (HSV-1) interaction with the ESCRT proteins, considering the peculiarities of this interplay and the need for HSV-1 to cross both the nuclear-cytoplasmic and the cytoplasmic-extracellular environment compartmentalization to egress from infected cells. Citation: Calistri, A.; Reale, A.; Palù, Keywords: ESCRT; viruses; cellular membranes; extracellular vesicles; HSV-1 G.; Parolin, C.