Gene Co-Expression Network Mining Using Graph Sparsification

Total Page:16

File Type:pdf, Size:1020Kb

Gene Co-Expression Network Mining Using Graph Sparsification Gene Co-expression Network Mining Using Graph Sparsification THESIS Presented in Partial Fulfillment of the Requirements for the Degree Master of Science in the Graduate School of The Ohio State University By Jinchao Di Graduate Program in Electrical and Computer Engineering The Ohio State University 2013 Master’s Examination Committee: Dr. Kun Huang, Advisor Dr. Raghu Machiraju Dr. Yuejie Chi ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! Copyright by Jinchao Di 2013 ! ! Abstract Identifying and analyzing gene modules is important since it can help understand gene function globally and reveal the underlying molecular mechanism. In this thesis, we propose a method combining local graph sparsification and a gene similarity measurement method TOM. This method can detect gene modules from a weighted gene co-expression network more e↵ectively since we set a local threshold which can detect clusters with di↵erent densities. In our algorithm, we only retain one edge for each gene, which can generate more balanced gene clusters. To estimate the e↵ectiveness of our algorithm we use DAVID to functionally evaluate the gene list for each module we detect and compare our results with some well known algorithms. The result of our algorithm shows better biological relevance than the compared methods and the number of the meaningful biological clusters is much larger than other methods, implying we discover some previously missed gene clusters. Moreover, we carry out a robustness test by adding Gaussian noise with di↵erent variance to the expression data. We find that our algorithm is robust to noise. We also find that some hub genes considered as important genes could be artifacts. ii Dedication This document is dedicated to my family. iii Acknowledgments Iwouldliketoo↵er my gratitude to my advisor, Dr. Kun Huang, for his guidance. I would like to thank Dr. Raghu Machiraju for providing suggestions for my project. IwouldliketothankDr.YuejieChiforbeingmycommitteemember.Iwouldlike to thank Chao Wang, Qihang Li, Nan Meng and Hao Ding for the help they o↵ered me during my study at OSU. iv Vita 2011................................................................B.S. Electrical and Computer Engineering, Harbin Institute of Technology, China 2011 to present ...............................................Graduate Student, ECE Graduate Program, The Ohio State University, USA Fields of Study Major Field: Electrical and Computer Engineering v Table of Contents Page Abstract....................................... ii Dedication . iii Acknowledgments.................................. iv Vita......................................... v List of Tables . viii ListofFigures ................................... x Chapters: 1. Introduction . 1 1.1 Background............................... 1 1.2 Previous work . 2 1.3 Problem statement . 8 1.4 Thesis statement . 8 1.5 Roadmap . 9 2. BiologicalNetworkStructure . 10 2.1 BiologicalNetwork ........................... 10 2.2 Scale Free Topology . 12 vi 3. Sparsification of WGCN with TOM . 16 3.1 DatasetsUsedinthisThesis. 16 3.2 Algorithm................................ 17 3.2.1 Co-expression Measure . 17 3.2.2 Local Graph Sparsification . 18 3.2.3 Choosing the Parameter for Adjacency Function . 21 3.3 Properties and Observation . 23 3.3.1 Property 1: In a cluster with N nodes, there are N 1edges23 − 3.3.2 Observation: . 24 4. Experiment and Evaluation . 27 4.1 GeneOntologyAnalysis . 27 4.1.1 AlgorithmsComparison . 27 4.1.2 Evaluation Method . 29 4.2 Robustness test . 31 5. Discussion and Conclusion . 43 Bibliography . 45 vii List of Tables Table Page 4.1 The biological relevance analysis result for both ABA and BC dataset usingthefouralgorithms. 30 4.2 Number of common genes of BC dataset and BC dataset with noise when k=1 . 33 4.3 Hub genes of clusters of BC dataset and hub genes of corresponding clusters of BC dataset with noise when k=1 . 33 4.4 Number of common genes of BC dataset and BC dataset with noise when k=5 . 34 4.5 Hub genes of clusters of BC dataset and hub genes of corresponding clusters of BC dataset with noise when k=5 . 34 4.6 Number of common genes of BC dataset and BC dataset with noise when k=10 . 35 4.7 Hub genes of clusters of BC dataset and hub genes of corresponding clusters of BC dataset with noise when k=10 . 35 4.8 Number of common genes of BC dataset and BC dataset with noise when k=20 . 36 4.9 Number of common genes of ABA dataset and ABA dataset with noise when k=1 . 37 viii 4.10 Hub genes of clusters of ABA dataset and hub genes of corresponding clustersofABAdatasetwithnoisewhenk=1 . 38 4.11 Number of common genes of ABA dataset and ABA dataset with noise when k=5 . 39 4.12 Hub genes of clusters of ABA dataset and hub genes of corresponding clustersofABAdatasetwithnoisewhenk=1 . 40 4.13 Number of common genes of ABA dataset and ABA dataset with noise when k=10 . 41 4.14 Number of common genes of ABA dataset and ABA dataset with noise when k=20 . 42 ix List of Figures Figure Page 2.1 Degree distribution for biological data: breast cancer data and ABA data,;(a) degree distribution p(k) k of beast cancer data; (b) degree ⇠ distribution log(p(k)) log(k)forbreastcancerdata;(c)degree ⇠ distribution p(k) k of ABA data; (d) degree distribution log(p(k)) ⇠ ⇠ log(k)forABAdata ........................... 13 2.2 Degree distribution for random network with 1000 nodes and 100 sam- ples..................................... 14 3.1 Maximum cluster size as e changes; (a) Maximum cluster size as e changes for GDS2250 breast cancer dataset; (b) Maximum cluster size as e changesforABAdataset . 19 3.2 Cluster size distribution for e =0,e =2ande =5;(a)Cluster size distribution for GDS2250 breast cancer dataset; (b) Cluster size distributionforABAdataset . 20 3.3 Maximum cluster size as β changes; (a) Maximum cluster size as β changes for GDS2250 breast cancer datase; (b) Maximum cluster size as β changesforABAdataset.. 22 3.4 Cluster size distribution as β changes; (a) Cluster size distribution for GDS2250 breast cancer dataset, β =2,β =4,β =6,β =8;(b)Cluster size distribution for ABA dataset,β =6,β =8,β =10,β =12. ... 23 3.5 Visualization result of a subnetwork of breast cancer dataset . 25 x 3.6 Visualization result of a subnetwork of simulated dataset . 26 xi Chapter 1 Introduction 1.1 Background A gene is a subunit of DNA that is usually coding for a particular protein and conductive to phenotype [29]. Genes are not working separately, the interaction of genes can carry out biological function [28]. A gene module is a group of genes that share common biological properties and functions. It is important to identify and analysis gene modules since it can help understand gene function globally and reveal the underlying molecular mechanism. A disease related gene module may also contribute to therapy development since it can elucidate how abnormal phenotype is induced [2]. If two genes are regulated by common transcription factors, they tend to be on the same pathway and have similar expression patterns [23]. Therefore, the genes within a gene module tend to have similar expression pattern, also called co- expression [14]. Microarray gene expression data can measure the expression levels of many genes in one biological sample. We can identify gene modules by analyzing the gene co-expression patterns using a microarray gene expression dataset. 1 1.2 Previous work Many methods can be used to measure the strength of gene co-expression. One of the most commonly used matrc is Pearson Correlation Coefficient (PCC) [8]. Pearson correlation coefficient is defined as the ratio of the covariance of two gene expression profiles (vectors) to the product of their standard deviations. Usually, we use the absolute value of the Pearson correlation that lies between 0 and 1. The higher the value is, the more significantly the two genes are co-expressed. If the absolute value of the correlation is close to one, that means the two genes are highly co-expressed. In contrast, the two genes are uncorrelated to each other if the value is close to zero. Pearson correlation coefficient measures the linear dependence between two genes, which may miss the non-linear correlation between the genes. A low value of the absolute correlation does not necessarily mean independence. Mutual Information o↵ers a more general and accurate measurement of the similarity of gene expression [26]. The concept of entropy is introduced by Shannon [25], it is the self-information of the expression profile of a gene. This concept can be extended to mutual infor- mation which describes the distance of two genes [4]. Assume we have a random variable A with MA possible states ai, i =1,...,MA, the entropy of the variable H(A) is defined as the minus sum of the probability corresponding to each state times the log scale probability: MA H(A)= p(a )logp(a ). (1) − i i i=1 X 2 The mutual information of variables A and B I(A, B)isdefinedasthesumofthe entropy of A and entropy of B minus the joint entropy of A and B [4]: I(A, B)=H(A)+H(B) H(A, B), (2) − where the joint entropy of A and B is defined as: MA MB H(A, B)= p(a ,b )logp(a ,b ). (3) − i j i j i=1 j=1 X X To analyze the gene expression data using mutual information, one can partition the expression data into bins and calculate the probability for each bin by counting the number of data in the bin [5]. The mutual information of two genes then can be calculated by the definition above. Some studies show that both Pearson correlation and mutual information can generate good results in measuring the co-expression [16][13]. However, there is no clearly demonstrated advantage of mutual information over PCC [15].
Recommended publications
  • Old Data and Friends Improve with Age: Advancements with the Updated Tools of Genenetwork
    bioRxiv preprint doi: https://doi.org/10.1101/2021.05.24.445383; this version posted May 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. Old data and friends improve with age: Advancements with the updated tools of GeneNetwork Alisha Chunduri1, David G. Ashbrook2 1Department of Biotechnology, Chaitanya Bharathi Institute of Technology, Hyderabad 500075, India 2Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA Abstract Understanding gene-by-environment interactions is important across biology, particularly behaviour. Families of isogenic strains are excellently placed, as the same genome can be tested in multiple environments. The BXD’s recent expansion to 140 strains makes them the largest family of murine isogenic genomes, and therefore give great power to detect QTL. Indefinite reproducible genometypes can be leveraged; old data can be reanalysed with emerging tools to produce novel biological insights. To highlight the importance of reanalyses, we obtained drug- and behavioural-phenotypes from Philip et al. 2010, and reanalysed their data with new genotypes from sequencing, and new models (GEMMA and R/qtl2). We discover QTL on chromosomes 3, 5, 9, 11, and 14, not found in the original study. We narrowed down the candidate genes based on their ability to alter gene expression and/or protein function, using cis-eQTL analysis, and variants predicted to be deleterious. Co-expression analysis (‘gene friends’) and human PheWAS were used to further narrow candidates.
    [Show full text]
  • Genome Sequence, Population History, and Pelage Genetics of the Endangered African Wild Dog (Lycaon Pictus) Michael G
    Campana et al. BMC Genomics (2016) 17:1013 DOI 10.1186/s12864-016-3368-9 RESEARCH ARTICLE Open Access Genome sequence, population history, and pelage genetics of the endangered African wild dog (Lycaon pictus) Michael G. Campana1,2*, Lillian D. Parker1,2,3, Melissa T. R. Hawkins1,2,3, Hillary S. Young4, Kristofer M. Helgen2,3, Micaela Szykman Gunther5, Rosie Woodroffe6, Jesús E. Maldonado1,2 and Robert C. Fleischer1 Abstract Background: The African wild dog (Lycaon pictus) is an endangered African canid threatened by severe habitat fragmentation, human-wildlife conflict, and infectious disease. A highly specialized carnivore, it is distinguished by its social structure, dental morphology, absence of dewclaws, and colorful pelage. Results: We sequenced the genomes of two individuals from populations representing two distinct ecological histories (Laikipia County, Kenya and KwaZulu-Natal Province, South Africa). We reconstructed population demographic histories for the two individuals and scanned the genomes for evidence of selection. Conclusions: We show that the African wild dog has undergone at least two effective population size reductions in the last 1,000,000 years. We found evidence of Lycaon individual-specific regions of low diversity, suggestive of inbreeding or population-specific selection. Further research is needed to clarify whether these population reductions and low diversity regions are characteristic of the species as a whole. We documented positive selection on the Lycaon mitochondrial genome. Finally, we identified several candidate genes (ASIP, MITF, MLPH, PMEL) that may play a role in the characteristic Lycaon pelage. Keywords: Lycaon pictus, Genome, Population history, Selection, Pelage Background Primarily a hunter of antelopes, the African wild dog is a The African wild dog (Lycaon pictus) is an endangered highly distinct canine.
    [Show full text]
  • It Is NOT Junk a Blog About Genomes, DNA, Evolution, Open Science, Baseball and Other Important Things
    it is NOT junk a blog about genomes, DNA, evolution, open science, baseball and other important things The Past, Present and Future of Scholarly Publishing By MIC H A EL EISEN | Published: MA RC H 2 8 , 2 0 1 3 Search To search, type and hit enter I gave a talk last night at the Commonwealth Club in San Francisco about science publishing and PLoS. There will be an audio link soon, but, for the first time in my life, Michael Eisen I actually gave the talk (largely) from prepared remarks, so I thought I’d post it here. I'm a biologist at UC An audio recording of the talk with Q&A is available here. Berkeley and an Investigator of the —— Howard Hughes Medical Institute. I work primarily On January 6, 2011, 24 year old hacker and activist Aaron Swartz was arrested by on flies, and my research encompases police at the Massachusetts Institute of Technology for downloading several million evolution, development, genetics, articles from an online archive of research journals called JSTOR. genomics, chemical ecology and behavior. I am a strong proponent of After Swartz committed suicide earlier this year in the face of legal troubles arising open science, and a co-founder of the from this incident, questions were raised about why MIT, whose access to JSTOR he Public Library of Science. And most exploited, chose to pursue charges, and what motivated the US Department of Justice importantly, I am a Red Sox fan. (More to demand jail time for his transgression. about me here).
    [Show full text]
  • Viewed Graphically with a Java Graphical User Interface (GUI), Allowing Users to Evaluate Contig Sequence Quality and Predict Snps
    BMC Research Notes BioMed Central Short Report Open Access BEAP: The BLAST Extension and Alignment Program- a tool for contig construction and analysis of preliminary genome sequence James E Koltes†, Zhi-Liang Hu†, Eric Fritz and James M Reecy* Address: Department of Animal Science, Iowa State University, Ames, Iowa 50011-3150, USA Email: James E Koltes - [email protected]; Zhi-Liang Hu - [email protected]; Eric Fritz - [email protected]; James M Reecy* - [email protected] * Corresponding author †Equal contributors Published: 22 January 2009 Received: 29 October 2008 Accepted: 22 January 2009 BMC Research Notes 2009, 2:11 doi:10.1186/1756-0500-2-11 This article is available from: http://www.biomedcentral.com/1756-0500/2/11 © 2009 Reecy et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Background: Fine-mapping projects require a high density of SNP markers and positional candidate gene sequences. In species with incomplete genomic sequence, the DNA sequences needed to generate markers for fine-mapping within a linkage analysis confidence interval may be available but may not have been assembled. To manually piece these sequences together is laborious and costly. Moreover, annotation and assembly of short, incomplete DNA sequences is time consuming and not always straightforward. Findings: We have created a tool called BEAP that combines BLAST and CAP3 to retrieve sequences and construct contigs for localized genomic regions in species with unfinished sequence drafts.
    [Show full text]
  • Molecular Features of Triple Negative Breast Cancer Cells by Genome-Wide Gene Expression Profiling Analysis
    478 INTERNATIONAL JOURNAL OF ONCOLOGY 42: 478-506, 2013 Molecular features of triple negative breast cancer cells by genome-wide gene expression profiling analysis MASATO KOMATSU1,2*, TETSURO YOSHIMARU1*, TAISUKE MATSUO1, KAZUMA KIYOTANI1, YASUO MIYOSHI3, TOSHIHITO TANAHASHI4, KAZUHITO ROKUTAN4, RUI YAMAGUCHI5, AYUMU SAITO6, SEIYA IMOTO6, SATORU MIYANO6, YUSUKE NAKAMURA7, MITSUNORI SASA8, MITSUO SHIMADA2 and TOYOMASA KATAGIRI1 1Division of Genome Medicine, Institute for Genome Research, The University of Tokushima; 2Department of Digestive and Transplantation Surgery, The University of Tokushima Graduate School; 3Department of Surgery, Division of Breast and Endocrine Surgery, Hyogo College of Medicine, Hyogo 663-8501; 4Department of Stress Science, Institute of Health Biosciences, The University of Tokushima Graduate School, Tokushima 770-8503; Laboratories of 5Sequence Analysis, 6DNA Information Analysis and 7Molecular Medicine, Human Genome Center, Institute of Medical Science, The University of Tokyo, Tokyo 108-8639; 8Tokushima Breast Care Clinic, Tokushima 770-0052, Japan Received September 22, 2012; Accepted November 6, 2012 DOI: 10.3892/ijo.2012.1744 Abstract. Triple negative breast cancer (TNBC) has a poor carcinogenesis of TNBC and could contribute to the develop- outcome due to the lack of beneficial therapeutic targets. To ment of molecular targets as a treatment for TNBC patients. clarify the molecular mechanisms involved in the carcino- genesis of TNBC and to identify target molecules for novel Introduction anticancer drugs, we analyzed the gene expression profiles of 30 TNBCs as well as 13 normal epithelial ductal cells that were Breast cancer is one of the most common solid malignant purified by laser-microbeam microdissection. We identified tumors among women worldwide. Breast cancer is a heteroge- 301 and 321 transcripts that were significantly upregulated neous disease that is currently classified based on the expression and downregulated in TNBC, respectively.
    [Show full text]
  • Trevor John Hastie 1040 Campus Drive Stanford, CA 94305 Home Phone&FAX: (650) 326-0854
    Trevor John Hastie 1040 Campus Drive Stanford, CA 94305 Home Phone&FAX: (650) 326-0854 Department of Statistics Born: June 27, 1953, South Africa Sequoia Hall Married, two children Stanford University U. S. citizen, S.A. citizen Stanford, CA 94305 E-Mail: [email protected] (650) 725-2231 Fax: 650/725-8977 Updated: June 22, 2021 Present Position 2013{ John A. Overdeck Professor of Mathematical Sciences, Stanford University. 2006{2009 Chair, Department of Statistics, Stanford University. 2005{2006 Associate Chair, Department of Statistics, Stanford University. 1999{ Professor, Statistics and Biostatistics Departments, Stanford University. Founder and co-director of Statistics department industrial affiliates program. 1994{1998 Associate Professor (tenured), Statistics and Biostatistics Departments, Stan- ford University. Research interests include nonparametric regression models, computer in- tensive data analysis techniques, statistical computing and graphics, and statistical consulting. Currently working on adaptive modeling and predic- tion procedures, signal and image modeling, and problems in bioinformatics with many more variables than observations. Education 1984 Stanford University, Stanford, California { Ph.D, Department of Statis- tics (Werner Stuetzle, advisor) 1979 University of Cape Town, Cape Town, South Africa { First Class Masters Degree in Statistics (June Juritz, advisor). 1976 Rhodes University, Grahamstown, South Africa { Bachelor of Science Honors Degree in Statistics. 1975 Rhodes University, Grahamstown, South Africa { Bachelor of Science Degree (cum laude) in Statistics, Computer Science and Mathematics. Awards and Honors 2020 \Statistician of the year" award, Chicago chapter ASA. 2020 Breiman award (senior) 2019 Recipient of the Sigillum Magnum, University of Bologna, Italy. 2019 Elected to The Royal Netherlands Academy of Arts and Science. 2019 Wald lecturer, JSM Denver.
    [Show full text]
  • A Chromosome Level Genome of Astyanax Mexicanus Surface Fish for Comparing Population
    bioRxiv preprint doi: https://doi.org/10.1101/2020.07.06.189654; this version posted July 6, 2020. The copyright holder for this preprint (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. 1 Title 2 A chromosome level genome of Astyanax mexicanus surface fish for comparing population- 3 specific genetic differences contributing to trait evolution. 4 5 Authors 6 Wesley C. Warren1, Tyler E. Boggs2, Richard Borowsky3, Brian M. Carlson4, Estephany 7 Ferrufino5, Joshua B. Gross2, LaDeana Hillier6, Zhilian Hu7, Alex C. Keene8, Alexander Kenzior9, 8 Johanna E. Kowalko5, Chad Tomlinson10, Milinn Kremitzki10, Madeleine E. Lemieux11, Tina 9 Graves-Lindsay10, Suzanne E. McGaugh12, Jeff T. Miller12, Mathilda Mommersteeg7, Rachel L. 10 Moran12, Robert Peuß9, Edward Rice1, Misty R. Riddle13, Itzel Sifuentes-Romero5, Bethany A. 11 Stanhope5,8, Clifford J. Tabin13, Sunishka Thakur5, Yamamoto Yoshiyuki14, Nicolas Rohner9,15 12 13 Authors for correspondence: Wesley C. Warren ([email protected]), Nicolas Rohner 14 ([email protected]) 15 16 Affiliation 17 1Department of Animal Sciences, Department of Surgery, Institute for Data Science and 18 Informatics, University of Missouri, Bond Life Sciences Center, Columbia, MO 19 2 Department of Biological Sciences, University of Cincinnati, Cincinnati, OH 20 3 Department of Biology, New York University, New York, NY 21 4 Department of Biology, The College of Wooster, Wooster, OH 22 5 Harriet L. Wilkes Honors College, Florida Atlantic University, Jupiter FL 23 6 Department of Genome Sciences, University of Washington, Seattle, WA 1 bioRxiv preprint doi: https://doi.org/10.1101/2020.07.06.189654; this version posted July 6, 2020.
    [Show full text]
  • Ddawson2012.Pdf (700.6Kb)
    Open Science and Crowd Science: Selected Sites and Resources Diane (DeDe) Dawson Natural Sciences Liaison Librarian University of Saskatchewan Saskatoon, Saskatchewan, Canada [email protected] Table of Contents Introduction Open Science Crowd Science Methods and Scope Open Science – Definitions and Principles Open Science – Open Lab Notebooks of Individuals and Lab Groups Open Science – Blogs Crowd Science – Projects for Individuals or Small Teams Crowd Science – Volunteer Distributed Computing Projects The Main Software Organizations Selected Projects Further Sources for Projects Selected Examples of Collaborative Science Sites for Specialists Main Software & Online Tools for Open Science Open Science Conferences and Community Conferences Further Reading/Viewing Videos Reports and White Papers Open e-Books Selected Essays, Articles, and Interviews References Introduction “To take full advantage of modern tools for the production of knowledge, we need to create an open scientific culture where as much information as possible is moved out of people’s heads and laboratories, and onto the network” (Nielsen 2011, p.183). New Internet technologies are radically enhancing the speed and ease of scholarly communications, and are providing opportunities for conducting and sharing research in new ways. This webliography explores the emerging “open science” and “crowd science” movements which are making use of these new opportunities to increase collaboration and openness in scientific research. The collaboration of many researchers on a project can enhance the rate of data-collection and analysis, and ignite new ideas. In addition, since there are more eyes to spot any inaccuracies or errors, collaborative research is likely to produce better quality results. Openness early in the research process alerts others to the work resulting in less duplication of efforts.
    [Show full text]
  • Single Cell Derived Clonal Analysis of Human Glioblastoma Links
    SUPPLEMENTARY INFORMATION: Single cell derived clonal analysis of human glioblastoma links functional and genomic heterogeneity ! Mona Meyer*, Jüri Reimand*, Xiaoyang Lan, Renee Head, Xueming Zhu, Michelle Kushida, Jane Bayani, Jessica C. Pressey, Anath Lionel, Ian D. Clarke, Michael Cusimano, Jeremy Squire, Stephen Scherer, Mark Bernstein, Melanie A. Woodin, Gary D. Bader**, and Peter B. Dirks**! ! * These authors contributed equally to this work.! ** Correspondence: [email protected] or [email protected]! ! Supplementary information - Meyer, Reimand et al. Supplementary methods" 4" Patient samples and fluorescence activated cell sorting (FACS)! 4! Differentiation! 4! Immunocytochemistry and EdU Imaging! 4! Proliferation! 5! Western blotting ! 5! Temozolomide treatment! 5! NCI drug library screen! 6! Orthotopic injections! 6! Immunohistochemistry on tumor sections! 6! Promoter methylation of MGMT! 6! Fluorescence in situ Hybridization (FISH)! 7! SNP6 microarray analysis and genome segmentation! 7! Calling copy number alterations! 8! Mapping altered genome segments to genes! 8! Recurrently altered genes with clonal variability! 9! Global analyses of copy number alterations! 9! Phylogenetic analysis of copy number alterations! 10! Microarray analysis! 10! Gene expression differences of TMZ resistant and sensitive clones of GBM-482! 10! Reverse transcription-PCR analyses! 11! Tumor subtype analysis of TMZ-sensitive and resistant clones! 11! Pathway analysis of gene expression in the TMZ-sensitive clone of GBM-482! 11! Supplementary figures and tables" 13" "2 Supplementary information - Meyer, Reimand et al. Table S1: Individual clones from all patient tumors are tumorigenic. ! 14! Fig. S1: clonal tumorigenicity.! 15! Fig. S2: clonal heterogeneity of EGFR and PTEN expression.! 20! Fig. S3: clonal heterogeneity of proliferation.! 21! Fig.
    [Show full text]
  • Sperm Differentiation
    International Journal of Molecular Sciences Review Sperm Differentiation: The Role of Trafficking of Proteins Maria E. Teves 1,* , Eduardo R. S. Roldan 2,*, Diego Krapf 3 , Jerome F. Strauss III 1 , Virali Bhagat 4 and Paulene Sapao 5 1 Department of Obstetrics and Gynecology, Virginia Commonwealth University, Richmond, VA 23298, USA; [email protected] 2 Department of Biodiversity and Evolutionary Biology, Museo Nacional de Ciencias Naturales (CSIC), 28006 Madrid, Spain 3 Department of Electrical and Computer Engineering, Colorado State University, Fort Collins, CO 80523, USA; [email protected] 4 Department of Physiology and Biophysics, Virginia Commonwealth University, Richmond, VA 23298, USA; [email protected] 5 Department of Chemistry, Virginia Commonwealth University, Richmond, VA 23298, USA; [email protected] * Correspondence: [email protected] (M.E.T.); [email protected] (E.R.S.R.) Received: 4 March 2020; Accepted: 20 May 2020; Published: 24 May 2020 Abstract: Sperm differentiation encompasses a complex sequence of morphological changes that takes place in the seminiferous epithelium. In this process, haploid round spermatids undergo substantial structural and functional alterations, resulting in highly polarized sperm. Hallmark changes during the differentiation process include the formation of new organelles, chromatin condensation and nuclear shaping, elimination of residual cytoplasm, and assembly of the sperm flagella. To achieve these transformations, spermatids have unique mechanisms for protein trafficking that operate in a coordinated fashion. Microtubules and filaments of actin are the main tracks used to facilitate the transport mechanisms, assisted by motor and non-motor proteins, for delivery of vesicular and non-vesicular cargos to specific sites.
    [Show full text]
  • Nº Ref Uniprot Proteína Péptidos Identificados Por MS/MS 1 P01024
    Document downloaded from http://www.elsevier.es, day 26/09/2021. This copy is for personal use. Any transmission of this document by any media or format is strictly prohibited. Nº Ref Uniprot Proteína Péptidos identificados 1 P01024 CO3_HUMAN Complement C3 OS=Homo sapiens GN=C3 PE=1 SV=2 por 162MS/MS 2 P02751 FINC_HUMAN Fibronectin OS=Homo sapiens GN=FN1 PE=1 SV=4 131 3 P01023 A2MG_HUMAN Alpha-2-macroglobulin OS=Homo sapiens GN=A2M PE=1 SV=3 128 4 P0C0L4 CO4A_HUMAN Complement C4-A OS=Homo sapiens GN=C4A PE=1 SV=1 95 5 P04275 VWF_HUMAN von Willebrand factor OS=Homo sapiens GN=VWF PE=1 SV=4 81 6 P02675 FIBB_HUMAN Fibrinogen beta chain OS=Homo sapiens GN=FGB PE=1 SV=2 78 7 P01031 CO5_HUMAN Complement C5 OS=Homo sapiens GN=C5 PE=1 SV=4 66 8 P02768 ALBU_HUMAN Serum albumin OS=Homo sapiens GN=ALB PE=1 SV=2 66 9 P00450 CERU_HUMAN Ceruloplasmin OS=Homo sapiens GN=CP PE=1 SV=1 64 10 P02671 FIBA_HUMAN Fibrinogen alpha chain OS=Homo sapiens GN=FGA PE=1 SV=2 58 11 P08603 CFAH_HUMAN Complement factor H OS=Homo sapiens GN=CFH PE=1 SV=4 56 12 P02787 TRFE_HUMAN Serotransferrin OS=Homo sapiens GN=TF PE=1 SV=3 54 13 P00747 PLMN_HUMAN Plasminogen OS=Homo sapiens GN=PLG PE=1 SV=2 48 14 P02679 FIBG_HUMAN Fibrinogen gamma chain OS=Homo sapiens GN=FGG PE=1 SV=3 47 15 P01871 IGHM_HUMAN Ig mu chain C region OS=Homo sapiens GN=IGHM PE=1 SV=3 41 16 P04003 C4BPA_HUMAN C4b-binding protein alpha chain OS=Homo sapiens GN=C4BPA PE=1 SV=2 37 17 Q9Y6R7 FCGBP_HUMAN IgGFc-binding protein OS=Homo sapiens GN=FCGBP PE=1 SV=3 30 18 O43866 CD5L_HUMAN CD5 antigen-like OS=Homo
    [Show full text]
  • Download Special Issue
    BioMed Research International Novel Bioinformatics Approaches for Analysis of High-Throughput Biological Data Guest Editors: Julia Tzu-Ya Weng, Li-Ching Wu, Wen-Chi Chang, Tzu-Hao Chang, Tatsuya Akutsu, and Tzong-Yi Lee Novel Bioinformatics Approaches for Analysis of High-Throughput Biological Data BioMed Research International Novel Bioinformatics Approaches for Analysis of High-Throughput Biological Data Guest Editors: Julia Tzu-Ya Weng, Li-Ching Wu, Wen-Chi Chang, Tzu-Hao Chang, Tatsuya Akutsu, and Tzong-Yi Lee Copyright © 2014 Hindawi Publishing Corporation. All rights reserved. This is a special issue published in “BioMed Research International.” All articles are open access articles distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Contents Novel Bioinformatics Approaches for Analysis of High-Throughput Biological Data,JuliaTzu-YaWeng, Li-Ching Wu, Wen-Chi Chang, Tzu-Hao Chang, Tatsuya Akutsu, and Tzong-Yi Lee Volume2014,ArticleID814092,3pages Evolution of Network Biomarkers from Early to Late Stage Bladder Cancer Samples,Yung-HaoWong, Cheng-Wei Li, and Bor-Sen Chen Volume 2014, Article ID 159078, 23 pages MicroRNA Expression Profiling Altered by Variant Dosage of Radiation Exposure,Kuei-FangLee, Yi-Cheng Chen, Paul Wei-Che Hsu, Ingrid Y. Liu, and Lawrence Shih-Hsin Wu Volume2014,ArticleID456323,10pages EXIA2: Web Server of Accurate and Rapid Protein Catalytic Residue Prediction, Chih-Hao Lu, Chin-Sheng
    [Show full text]