A Data Mining Approach for Detecting Evolutionary Divergence in Transcriptomic Data

Total Page:16

File Type:pdf, Size:1020Kb

A Data Mining Approach for Detecting Evolutionary Divergence in Transcriptomic Data A Data Mining Approach for Detecting Evolutionary Divergence in Transcriptomic Data by Owen Zeno Woody A thesis presented by the University of Waterloo in fulfilment of the thesis requirement for the degree of Doctor of Philosophy in Biology Waterloo, Ontario, Canada, 2019 © Owen Zeno Woody 2019 Examining Committee Membership The following served on the Examining Committee for this thesis. The decision of the Examining Committee is by majority vote. External Examiner Dr. Teresa Crease Professor, Department of Integrative Biology, University of Guelph Supervisor(s) Dr. Brendan J. McConkey Associate Professor, Department of Biology, University of Waterloo Internal Member Dr. Kirsten M. Müller Professor, Department of Biology, University of Waterloo Internal-external Member Dr. Dan Brown Professor, Cheriton School of Computer Science, University of Waterloo Other Member(s) Dr. Josh D. Neufeld Professor, Department of Biology, University of Waterloo ii I hereby declare that I am the sole author of this thesis. This is a true copy of the thesis, including any required final revisions, as accepted by my examiners. I understand that my thesis may be made electronically available to the public. iii Abstract It has become common to produce genome sequences for organisms of scientific or popular interest. Although these genome projects provide insight into the gene and protein complements of a species including their evolutionary relationships, it remains challenging to determine gene regulatory behavior from genome sequence alone. It has also become common to produce “expression atlas” transcriptomic data sets. These atlases employ high-throughput transcript assays to survey an assortment of tissues, developmental states, and responses to stimuli that each may individually elicit or inhibit the transcription of genes. Although genomic and transcriptomic data sets are both routinely collected, they are seldom analyzed in tandem. Here I present a novel approach to combining these complementary data with a software package called BranchOut. BranchOut uses genomic information to construct gene family phylogenies, and then attempts to map gene expression activity onto this phylogeny to allow estimation of ancestral expression states. This allows the identification of specific innovations due to gene duplications that resulted in fundamental diversification in the roles of otherwise closely related genes. As a proof of concept, the BranchOut technique is first applied to a tangible small- scale example in Apis mellifera. Subsequently, the power of BranchOut to analyze complete genomes is shown for two mammalian genomes, Sus scrofa and Bos taurus. The transcriptomic data sets for these two mammals employ microarray and RNAseq platforms, respectively, for expression analysis, demonstrating BranchOut’s applicability to both future and historic expression atlases. Potential refinements to the approach are also discussed. iv Acknowledgements I would like to thank my supervisor, Brendan McConkey, for his patience, kindness and insight throughout the preparation of this work. I’d also like to thank my committee members for their suggestions and support over many years: Dan Brown, Kirsten Müller, and Josh Neufeld. It was thanks to your mentorship that this was all possible. I am very grateful for the support I received through both NSERC and OGS for this project. v Table of Contents List of Figures vii List of Tables ix 1 Introduction – Sporks, Soups and Sausages 1 2 Preprocessing Methodology for Genomic and Transcriptomic Data 23 3 BranchOut Software Specifications 31 4 Small-scale Application Involving the Honeybee, Apis mellifera 47 5 Application of BranchOut to Sus scrofa Microarray Expression Atlas 63 6 Application of BranchOut to High-Throughput Sequencing: Bos taurus Data Set 89 7 Future Directions and Conclusion 117 Bibliography 128 Appendices 141 vi List of Figures Figure 3.1: An example of an “all conditions” heatmap from BranchOut 39 Figure 3.2: A reconstruction block produced by BranchOut 41 Figure 3.3: An example state assignment diagram 43 Figure 3.4: An example BranchOut single-condition reconstruction 45 Figure 4.1: Heatmap showing gene expression behavior for the yellow gene family in Apis mellifera 55 Figure 4.2: MCLUST results for the yellow protein family in Apis mellifera 57 Figure 4.3: MrBayes tree for the yellow protein family in Apis mellifera 59 Figure 4.4: Hypothetical expression states of ancestral yellow gene family members from two Apis mellifera tissues 60 Figure 5.1a & b: Map of Expression Clustering Assignments for the Disintegrin protein family in Sus scrofa 73, 74 Figure 5.2: BranchOut reconstruction of the Disintegrin protein family in the prefrontal cortex 75 Figure 5.3: BranchOut reconstruction of the Disintegrin protein family in a blood sample 76 Figure 5.4: BranchOut reconstruction of the Disintegrin protein family in the testis 77 Figure 5.5a & b: Map of Expression Clustering Assignments for the ABC transporter protein family in Sus scrofa 79, 80 Figure 5.6: BranchOut reconstruction of the ABC transporter protein family in the ileum 81 Figure 5.7: Screenshot of Ensembl exon/intron model for selected ABC-transporters 82 Figure 5.8a & b: Map of Expression Clustering Assignments for the Cytochrome P450 protein family in Sus scrofa 84, 85 Figure 5.9: BranchOut reconstruction of the cytochrome P450 protein family in the cortex of the kidney 86 Figure 6.1a & b: Map of Expression Clustering Assignments for the WW-domain protein family in Bos taurus 98, 99 Figure 6.2: BranchOut reconstruction of the WW protein family in an ovarian follicle sample 100 Figure 6.3: BranchOut reconstruction of the WW protein family in a lactating mammary gland 102 vii Figure 6.4a & b: Map of Expression Clustering Assignments for the Tubulin FtsZ family: GTPase domain protein family in Bos taurus 104, 105 Figure 6.5: BranchOut reconstruction of the Tubulin protein family in the temporal cortex 106 Figure 6.6: BranchOut reconstruction of the Tubulin protein family in the supraspinatus 107 Figure 6.7a & b: Map of Expression Clustering Assignments for the Ligand binding domain of nuclear hormone receptor protein family in Bos taurus 109, 110 Figure 6.8: BranchOut reconstruction of the ligand binding domain of nuclear hormone receptor family in the rumen 111 Figure 6.9: BranchOut reconstruction of the ligand binding domain of nuclear hormone receptor family in the pituitary gland 112 Figure 6.10: BranchOut reconstruction of the ligand binding domain of nuclear hormone receptor family in the salivary gland 113 Figure 6.11: BranchOut reconstruction of the ligand binding domain of nuclear hormone receptor family in the infundibulum 114 Figure 7.1: Pairwise comparisons of BranchOut scores with varying input sources 124 viii List of Tables Table 5.1: Sus scrofa tissues with many high-scoring BranchOut signal scores and a summary of findings 68 Table 5.2: Rank-ordered list of Sus scrofa tissues that contained a large number of high- scoring BranchOut reconstruction signals 70 Table 5.3: Rank-ordered list of Sus scrofa gene families that contained a large number of high-scoring BranchOut reconstruction signals 71 Table 6.1: Bos taurus tissues with many high-scoring BranchOut signal scores and a summary of findings 92 Table 6.2: Rank-ordered list of Bos taurus tissues that contained a large number of high- scoring BranchOut reconstruction signals 94 Table 6.3: Rank-ordered list of Bos taurus gene families that contained a large number of high-scoring BranchOut reconstruction signals 96 ix Chapter 1: Introduction – Sporks, Soups and Sausages The work presented in this thesis addresses a specific aspect of evolutionary biology: the evolution of novel function at the gene family level. Biological organisms are capable of remarkably complex activities, and all this functionality is somehow encoded in the organism’s genomic complement. In order to discuss the evolution of function – broadly, how an organism can become capable of “new” activities – it is important first to establish a working understanding of a few key concepts. First, I will describe the features and activities of a gene that make it a functional unit in the cell. Next, I will cover the means by which new genes can arise, and what it means for a gene to be part of a family. Lastly, I will describe what I define as a “novel function”, and how novelty can be introduced into the genome through the process of gene origination and duplication. 1.1 Defining the gene as a functional unit This work will focus on genes as fundamental units of inheritance and evolution. There are some intriguing examples of inheritable adaptation that do not require changes at the genetic level (RNA interference, for example (Spracklin, Fields et al. 2017)), but they will lie outside the scope of this discussion. The “Central Dogma of Biology” – that DNA genes encode (messenger) RNA molecules, which are in turn translated to form proteins – remains a powerful explanatory tool, and my research will make a similar assumption about the central role genes play in biological activity and inheritance. 1 A gene is a string of information in the genome with two main parts: the coding sequence and its regulatory control (MacCarthy, Bergman 2007). The coding sequence of a gene provides a blueprint for the construction of an active biomolecule – typically a protein, but sometimes an RNA molecule with catalytic properties (ribozyme). Because the number of building blocks for these biomolecules is limited (4 nucleotides or approximately 20 amino acids), the sequence and arrangement of these components is the fundamental characteristic that determines a biomolecule’s role. Consequently, when the coding sequence is the focus, the Central Dogma can be restated as follows: (nucleotide) sequence directs (biomolecule) structure, which in turn directs function. Changes to the coding sequence can fundamentally alter the structure of the encoded biomolecule and may impact the biological capability of the resulting product. A first draft for the definition of “gene function” might stop there.
Recommended publications
  • Searching for Novel Peptide Hormones in the Human Genome Olivier Mirabeau
    Searching for novel peptide hormones in the human genome Olivier Mirabeau To cite this version: Olivier Mirabeau. Searching for novel peptide hormones in the human genome. Life Sciences [q-bio]. Université Montpellier II - Sciences et Techniques du Languedoc, 2008. English. tel-00340710 HAL Id: tel-00340710 https://tel.archives-ouvertes.fr/tel-00340710 Submitted on 21 Nov 2008 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. UNIVERSITE MONTPELLIER II SCIENCES ET TECHNIQUES DU LANGUEDOC THESE pour obtenir le grade de DOCTEUR DE L'UNIVERSITE MONTPELLIER II Discipline : Biologie Informatique Ecole Doctorale : Sciences chimiques et biologiques pour la santé Formation doctorale : Biologie-Santé Recherche de nouvelles hormones peptidiques codées par le génome humain par Olivier Mirabeau présentée et soutenue publiquement le 30 janvier 2008 JURY M. Hubert Vaudry Rapporteur M. Jean-Philippe Vert Rapporteur Mme Nadia Rosenthal Examinatrice M. Jean Martinez Président M. Olivier Gascuel Directeur M. Cornelius Gross Examinateur Résumé Résumé Cette thèse porte sur la découverte de gènes humains non caractérisés codant pour des précurseurs à hormones peptidiques. Les hormones peptidiques (PH) ont un rôle important dans la plupart des processus physiologiques du corps humain.
    [Show full text]
  • The EMILIN/Multimerin Family
    View metadata, citation and similar papers at core.ac.uk brought to you by CORE REVIEW ARTICLE published: 06 Januaryprovided 2012 by Frontiers - Publisher Connector doi: 10.3389/fimmu.2011.00093 The EMILIN/multimerin family Alfonso Colombatti 1,2,3*, Paola Spessotto1, Roberto Doliana1, Maurizio Mongiat 1, Giorgio Maria Bressan4 and Gennaro Esposito2,3 1 Experimental Oncology 2, Centro di Riferimento Oncologico, Istituto di Ricerca e Cura a Carattere Scientifico, Aviano, Italy 2 Department of Biomedical Science and Technology, University of Udine, Udine, Italy 3 Microgravity, Ageing, Training, Immobility Excellence Center, University of Udine, Udine, Italy 4 Department of Histology Microbiology and Medical Biotechnologies, University of Padova, Padova, Italy Edited by: Elastin microfibrillar interface proteins (EMILINs) and Multimerins (EMILIN1, EMILIN2, Uday Kishore, Brunel University, UK Multimerin1, and Multimerin2) constitute a four member family that in addition to the Reviewed by: shared C-terminus gC1q domain typical of the gC1q/TNF superfamily members contain a Uday Kishore, Brunel University, UK Kenneth Reid, Green Templeton N-terminus unique cysteine-rich EMI domain. These glycoproteins are homotrimeric and College University of Oxford, UK assemble into high molecular weight multimers. They are predominantly expressed in *Correspondence: the extracellular matrix and contribute to several cellular functions in part associated with Alfonso Colombatti, Division of the gC1q domain and in part not yet assigned nor linked to other specific regions of the Experimental Oncology 2, Centro di sequence. Among the latter is the control of arterial blood pressure, the inhibition of Bacil- Riferimento Oncologico, Istituto di Ricerca e Cura a Carattere Scientifico, lus anthracis cell cytotoxicity, the promotion of cell death, the proangiogenic function, and 33081 Aviano, Italy.
    [Show full text]
  • Supplementary Table 1: Adhesion Genes Data Set
    Supplementary Table 1: Adhesion genes data set PROBE Entrez Gene ID Celera Gene ID Gene_Symbol Gene_Name 160832 1 hCG201364.3 A1BG alpha-1-B glycoprotein 223658 1 hCG201364.3 A1BG alpha-1-B glycoprotein 212988 102 hCG40040.3 ADAM10 ADAM metallopeptidase domain 10 133411 4185 hCG28232.2 ADAM11 ADAM metallopeptidase domain 11 110695 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 195222 8038 hCG40937.4 ADAM12 ADAM metallopeptidase domain 12 (meltrin alpha) 165344 8751 hCG20021.3 ADAM15 ADAM metallopeptidase domain 15 (metargidin) 189065 6868 null ADAM17 ADAM metallopeptidase domain 17 (tumor necrosis factor, alpha, converting enzyme) 108119 8728 hCG15398.4 ADAM19 ADAM metallopeptidase domain 19 (meltrin beta) 117763 8748 hCG20675.3 ADAM20 ADAM metallopeptidase domain 20 126448 8747 hCG1785634.2 ADAM21 ADAM metallopeptidase domain 21 208981 8747 hCG1785634.2|hCG2042897 ADAM21 ADAM metallopeptidase domain 21 180903 53616 hCG17212.4 ADAM22 ADAM metallopeptidase domain 22 177272 8745 hCG1811623.1 ADAM23 ADAM metallopeptidase domain 23 102384 10863 hCG1818505.1 ADAM28 ADAM metallopeptidase domain 28 119968 11086 hCG1786734.2 ADAM29 ADAM metallopeptidase domain 29 205542 11085 hCG1997196.1 ADAM30 ADAM metallopeptidase domain 30 148417 80332 hCG39255.4 ADAM33 ADAM metallopeptidase domain 33 140492 8756 hCG1789002.2 ADAM7 ADAM metallopeptidase domain 7 122603 101 hCG1816947.1 ADAM8 ADAM metallopeptidase domain 8 183965 8754 hCG1996391 ADAM9 ADAM metallopeptidase domain 9 (meltrin gamma) 129974 27299 hCG15447.3 ADAMDEC1 ADAM-like,
    [Show full text]
  • Multimerin-2 Is a Ligand for Group 14 Family C-Type Lectins CLEC14A, CD93 and CD248 Spanning the Endothelial Pericyte Interface
    OPEN Oncogene (2017) 36, 6097–6108 www.nature.com/onc ORIGINAL ARTICLE Multimerin-2 is a ligand for group 14 family C-type lectins CLEC14A, CD93 and CD248 spanning the endothelial pericyte interface KA Khan1, AJ Naylor2, A Khan1, PJ Noy1, M Mambretti1, P Lodhia1, J Athwal1, A Korzystka1, CD Buckley2, BE Willcox3, F Mohammed3 and R Bicknell1 TheC-typelectindomaincontaininggroup14familymembers CLEC14A and CD93 are proteins expressed by endothelium and are implicated in tumour angiogenesis. CD248 (alternatively known as endosialin or tumour endothelial marker-1) is also a member of this family and is expressed by tumour-associated fibroblasts and pericytes. Multimerin-2 (MMRN2) is a unique endothelial specific extracellular matrix protein that has been implicated in angiogenesis and tumour progression. We show that the group 14 C-type lectins CLEC14A, CD93 and CD248 directly bind to MMRN2 and only thrombomodulin of the family does not. Binding to MMRN2 is dependent on a predicted long-loop region in the C-type lectin domainandisabrogatedbymutationwithinthedomain. CLEC14A and CD93 bind to the same non-glycosylated coiled-coil region of MMRN2, but the binding of CD248 occurs on a distinct non-competing region. CLEC14A and CD248 can bind MMRN2 simultaneously and this occurs at the interface between endothelium and pericytes in human pancreatic cancer. A recombinant peptide of MMRN2 spanning the CLEC14A and CD93 binding region blocks CLEC14A extracellular domain binding to the endothelial cellsurfaceaswellasincreasingadherenceofhumanumbilical vein endothelial cells to the active peptide. This MMRN2 peptide is anti-angiogenic in vitro and reduces tumour growth in mouse models. These findings identify novel protein interactions involving CLEC14A, CD93 and CD248 with MMRN2 as targetable components of vessel formation.
    [Show full text]
  • Supplementary Table S4. FGA Co-Expressed Gene List in LUAD
    Supplementary Table S4. FGA co-expressed gene list in LUAD tumors Symbol R Locus Description FGG 0.919 4q28 fibrinogen gamma chain FGL1 0.635 8p22 fibrinogen-like 1 SLC7A2 0.536 8p22 solute carrier family 7 (cationic amino acid transporter, y+ system), member 2 DUSP4 0.521 8p12-p11 dual specificity phosphatase 4 HAL 0.51 12q22-q24.1histidine ammonia-lyase PDE4D 0.499 5q12 phosphodiesterase 4D, cAMP-specific FURIN 0.497 15q26.1 furin (paired basic amino acid cleaving enzyme) CPS1 0.49 2q35 carbamoyl-phosphate synthase 1, mitochondrial TESC 0.478 12q24.22 tescalcin INHA 0.465 2q35 inhibin, alpha S100P 0.461 4p16 S100 calcium binding protein P VPS37A 0.447 8p22 vacuolar protein sorting 37 homolog A (S. cerevisiae) SLC16A14 0.447 2q36.3 solute carrier family 16, member 14 PPARGC1A 0.443 4p15.1 peroxisome proliferator-activated receptor gamma, coactivator 1 alpha SIK1 0.435 21q22.3 salt-inducible kinase 1 IRS2 0.434 13q34 insulin receptor substrate 2 RND1 0.433 12q12 Rho family GTPase 1 HGD 0.433 3q13.33 homogentisate 1,2-dioxygenase PTP4A1 0.432 6q12 protein tyrosine phosphatase type IVA, member 1 C8orf4 0.428 8p11.2 chromosome 8 open reading frame 4 DDC 0.427 7p12.2 dopa decarboxylase (aromatic L-amino acid decarboxylase) TACC2 0.427 10q26 transforming, acidic coiled-coil containing protein 2 MUC13 0.422 3q21.2 mucin 13, cell surface associated C5 0.412 9q33-q34 complement component 5 NR4A2 0.412 2q22-q23 nuclear receptor subfamily 4, group A, member 2 EYS 0.411 6q12 eyes shut homolog (Drosophila) GPX2 0.406 14q24.1 glutathione peroxidase
    [Show full text]
  • Identification of the Rassf3 Gene As a Potential Tumor Suppressor
    Clemson University TigerPrints All Dissertations Dissertations 12-2006 Identification of the Rassf3 Gene as a Potential Tumor Suppressor Responsible for the Resistance to Mammary Tumor Development in MMTV/ neu Transgenic Mice Isabelle Jacquemart Clemson University, [email protected] Follow this and additional works at: https://tigerprints.clemson.edu/all_dissertations Part of the Microbiology Commons Recommended Citation Jacquemart, Isabelle, "Identification of the Rassf3 Gene as a Potential Tumor Suppressor Responsible for the Resistance to Mammary Tumor Development in MMTV/neu Transgenic Mice" (2006). All Dissertations. 26. https://tigerprints.clemson.edu/all_dissertations/26 This Dissertation is brought to you for free and open access by the Dissertations at TigerPrints. It has been accepted for inclusion in All Dissertations by an authorized administrator of TigerPrints. For more information, please contact [email protected]. IDENTIFICATION OF THE Rassf3 GENE AS A POTENTIAL TUMOR SUPPRESSOR RESPONSIBLE FOR THE RESISTANCE TO MAMMARY TUMOR DEVELOPMENT IN MMTV/neu TRANSGENIC MICE A Dissertation Presented to the Graduate School of Clemson University In Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy Microbiology by Isabelle C. Jacquemart December 2006 Accepted by: Dr. Wen Y. Chen, Committee Chair Dr. Charles D. Rice Dr. Lyndon L. Larcom Dr. Lesly Temesvari i ABSTRACT The MMTV/neu transgenic mouse line is a well-documented animal model for studying HER2/neu-related breast cancer. It has been reported that a small percentage, approximately 20%, of the virgin female MMTV/neu mice seems resistant to the development of mammary gland adenoma, despite the overexpression of the neu oncogene. To identify the factors that are responsible for the tumor resistance in these MMTV/neu female transgenic mice, comparative genetic profiling was used to screen the alterations in gene expression in the mammary gland.
    [Show full text]
  • EGFL7 Meets Mirna-126: an Angiogenesis Alliance
    - http://vascularcell.com/ REVIEW | OPEN ACCESS EGFL7 meets miRNA-126: an angiogenesis alliance Journal of Angiogenesis Research 2:9 | DOI: 10.1186/2040-2384-2-9 | © Li et al.; licensee Publiverse Online S.R.L. 2010 Received: 21 Apr 2010 | Accepted: 8 Apr 2010 | Published: 8 Apr 2010 Nikolic Iva, Plate Karl-Heinz, Schmidt Mirko HH@ + Contributed equally@ Corresponding author Abstract Blood vessels form de novo through the tightly regulated programs of vasculogenesis and angiogenesis. Both processes are distinct but one of the steps they share is the formation of a central lumen, when groups of cells organized as vascular cords undergo complex changes to achieve a tube-like morphology. Recently, a protein termed epidermal growth factor-like domain 7 (EGFL7) was described as a novel endothelial cell-derived factor involved in the regulation of the spatial arrangement of cells during vascular tube assembly. With its impact on tubulogenesis and vessel shape EGFL7 joined the large family of molecules governing blood vessel formation. Only recently, the molecular mechanisms underlying EGFL7's effects have been started to be elucidated and shaping of the extracellular matrix (ECM) as well as Notch signaling might very well play a role in mediating its biological effects. Further, findings in knock-out animal models suggest miR-126, a miRNA located within the egfl7 gene, has a major role in vessel development by promoting VEGF signaling, angiogenesis and vascular integrity. This review summarizes our current knowledge on EGFL7 and miR-126 and we will discuss the implications of both bioactive molecules for the formation of blood vessels.
    [Show full text]
  • A Proteomic Analysis of Salivary Glands of Female Anopheles Gambiae Mosquito
    Proteomics 2005, 5, 3765–3777 DOI 10.1002/pmic.200401210 3765 REGULAR ARTICLE A proteomic analysis of salivary glands of female Anopheles gambiae mosquito Dário E. Kalume1, Mobolaji Okulate2, Jun Zhong1, Raghunath Reddy1, Shubha Suresh3, Nandan Deshpande3, Nirbhay Kumar2* and Akhilesh Pandey1 1 McKusick-Nathans Institute of Genetic Medicine and Department of Biological Chemistry, Johns Hopkins School of Medicine, Baltimore, MD, USA 2 Department of Molecular Microbiology and Immunology, Johns Hopkins Malaria Research Institute, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA 3 Institute of Bioinformatics, Discoverer Unit 1, Bangalore, India Understanding the development of the malaria parasite within the mosquito vector at the mo- Received: September 3, 2004 lecular level should provide novel targets for interrupting parasitic life cycle and subsequent Revised: December 16, 2004 transmission. Availability of the complete genomic sequence of the major African malaria vector, Accepted: December 27, 2004 Anopheles gambiae, allows discovery of such targets through experimental as well as computa- tional methods. In the female mosquito, the salivary gland tissue plays an important role in the maturation of the infective form of the malaria parasite. Therefore, we carried out a proteomic analysis of salivary glands from female An. gambiae mosquitoes. Salivary gland extracts were digested with trypsin using two complementary approaches and analyzed by LC-MS/MS. This led to identification of 69 unique proteins, 57 of which were novel. We carried out a functional annotation of all proteins identified in this study through a detailed bioinformatics analysis. Even though a number of cDNA and Edman degradation-based approaches to catalog transcripts and proteins from salivary glands of mosquitoes have been published previously, this is the first report describing the application of MS for characterization of the salivary gland proteome.
    [Show full text]
  • WO 2013/184908 A2 12 December 2013 (12.12.2013) P O P C T
    (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization I International Bureau (10) International Publication Number (43) International Publication Date WO 2013/184908 A2 12 December 2013 (12.12.2013) P O P C T (51) International Patent Classification: Jr.; One Procter & Gamble Plaza, Cincinnati, Ohio 45202 G06F 19/00 (201 1.01) (US). HOWARD, Brian, Wilson; One Procter & Gamble Plaza, Cincinnati, Ohio 45202 (US). (21) International Application Number: PCT/US20 13/044497 (74) Agents: GUFFEY, Timothy, B. et al; c/o The Procter & Gamble Company, Global Patent Services, 299 East 6th (22) Date: International Filing Street, Sycamore Building, 4th Floor, Cincinnati, Ohio 6 June 2013 (06.06.2013) 45202 (US). (25) Filing Language: English (81) Designated States (unless otherwise indicated, for every (26) Publication Language: English kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, (30) Priority Data: BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DK, DM, 61/656,218 6 June 2012 (06.06.2012) US DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, (71) Applicant: THE PROCTER & GAMBLE COMPANY HN, HR, HU, ID, IL, IN, IS, JP, KE, KG, KN, KP, KR, [US/US]; One Procter & Gamble Plaza, Cincinnati, Ohio KZ, LA, LC, LK, LR, LS, LT, LU, LY, MA, MD, ME, 45202 (US). MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SC, (72) Inventors: XU, Jun; One Procter & Gamble Plaza, Cincin SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, nati, Ohio 45202 (US).
    [Show full text]
  • The Structure, Function and Evolution of the Extracellular Matrix: a Systems-Level Analysis
    The Structure, Function and Evolution of the Extracellular Matrix: A Systems-Level Analysis by Graham L. Cromar A thesis submitted in conformity with the requirements for the degree of Doctor of Philosophy Department of Molecular Genetics University of Toronto © Copyright by Graham L. Cromar 2014 ii The Structure, Function and Evolution of the Extracellular Matrix: A Systems-Level Analysis Graham L. Cromar Doctor of Philosophy Department of Molecular Genetics University of Toronto 2014 Abstract The extracellular matrix (ECM) is a three-dimensional meshwork of proteins, proteoglycans and polysaccharides imparting structure and mechanical stability to tissues. ECM dysfunction has been implicated in a number of debilitating conditions including cancer, atherosclerosis, asthma, fibrosis and arthritis. Identifying the components that comprise the ECM and understanding how they are organised within the matrix is key to uncovering its role in health and disease. This study defines a rigorous protocol for the rapid categorization of proteins comprising a biological system. Beginning with over 2000 candidate extracellular proteins, 357 core ECM genes and 524 functionally related (non-ECM) genes are identified. A network of high quality protein-protein interactions constructed from these core genes reveals the ECM is organised into biologically relevant functional modules whose components exhibit a mosaic of expression and conservation patterns. This suggests module innovations were widespread and evolved in parallel to convey tissue specific functionality on otherwise broadly expressed modules. Phylogenetic profiles of ECM proteins highlight components restricted and/or expanded in metazoans, vertebrates and mammals, indicating taxon-specific tissue innovations. Modules enriched for medical subject headings illustrate the potential for systems based analyses to predict new functional and disease associations on the basis of network topology.
    [Show full text]
  • Periostin (POSTN) Regulates Tumor Resistance to Antiangiogenic Therapy in Glioma Models Soon Young Park1, Yuji Piao1, Kang Jin Jeong2, Jianwen Dong1, and John F
    Published OnlineFirst June 15, 2016; DOI: 10.1158/1535-7163.MCT-15-0427 Cancer Biology and Signal Transduction Molecular Cancer Therapeutics Periostin (POSTN) Regulates Tumor Resistance to Antiangiogenic Therapy in Glioma Models Soon Young Park1, Yuji Piao1, Kang Jin Jeong2, Jianwen Dong1, and John F. de Groot1 Abstract Periostin (POSTN) interacts with multiple integrins to volume. Resistance to anti-VEGF-A therapy regulated by coordinate a variety of cellular processes, including epitheli- POSTN was associated with increased expression of TGFb1 al-to-mesenchymal transition (EMT) and cell migration. In and hypoxia-inducible factor-1a (HIF1a)inGSCs.Atthe our previous study, anti-VEGF-A therapy was associated with molecular level, POSTN regulated invasion and expression of resistance and EMT. This study sought to determine the role of EMT (caveolin-1) and angiogenesis-related genes (HIF1a and POSTN in the resistance of glioma stem cells (GSC) to anti- VEGF-A) through activation of STAT3. Moreover, recombinant angiogenic therapy. In mouse xenograft models of human POSTN increased GSC invasion. Collectively, our findings glioma, POSTN expression was associated with acquired resis- suggest that POSTN plays an important role in glioma inva- tance to anti-VEGF-A therapy and had a synergistic effect with sion and resistance to antiangiogenic therapy. Mol Cancer Ther; bevacizumab in prolonging survival and decreasing tumor 15(9); 2187–97. Ó2016 AACR. Introduction vascularization mechanisms may contribute to resistance of glio- blastoma to antiangiogenic therapies (5). Tumor angiogenesis is regulated by angiogenic factors, includ- Periostin (POSTN; osteoblast-specific factor 2) is a 90-kDa ing VEGF, platelet-derived growth factor, and hypoxia-inducible extracellular matrix (ECM) protein containing an amino-terminal factor-1a (HIF1a; ref.
    [Show full text]
  • Table S5.Xlsx
    Table S5. List of PFAM domains identified in predicted V. nonalfalfae secretome PFAM description PFAM ID Number of hits EFFECTOR-SPECIFIC PFAM 66 Calcineurin-like phosphoesterase PF00149 6 Cerato-platanin PF07249 2 CFEM (Common in Fungal Extracellular Membranes) domain PF05730 13 Chitin recongnition protein PF00187 1 Chitin recongnition protein PF03067 3 CVNH (Cyanovirin-N) domain PF08881 2 Cysteine-rich secretory protein family PF00188 4 Fungal hydrophobin PF06766 4 LysM domain PF01476 9 Lytic transglycolase PF03330 3 Necrosis inducing protein (NPP1) PF05630 6 PAN domain PF00024 1 Hce2 (Homologs of C.
    [Show full text]