Structure and Function of Per-ARNT-Sim Domains and Their Possible Role in the Life-Cycle Biology of Trypanosoma Cruzi
Total Page:16
File Type:pdf, Size:1020Kb
Accepted Manuscript Title: Structure and function of Per-ARNT-Sim domains and their possible role in the life-cycle biology of Trypanosoma cruzi Authors: Maura Rojas-Pirela, Daniel J. Rigden, Paul A. Michels, Ana J. Caceres,´ Juan Luis Concepcion,´ Wilfredo Quinones˜ PII: S0166-6851(17)30135-4 DOI: https://doi.org/10.1016/j.molbiopara.2017.11.002 Reference: MOLBIO 11096 To appear in: Molecular & Biochemical Parasitology Received date: 30-4-2017 Revised date: 12-10-2017 Accepted date: 2-11-2017 Please cite this article as: Rojas-Pirela Maura, Rigden Daniel J, Michels Paul A, Caceres´ Ana J, Concepcion´ Juan Luis, Quinones˜ Wilfredo.Structure and function of Per-ARNT-Sim domains and their possible role in the life- cycle biology of Trypanosoma cruzi.Molecular and Biochemical Parasitology https://doi.org/10.1016/j.molbiopara.2017.11.002 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. Review Structure and function of Per-ARNT-Sim domains and their possible role in the life-cycle biology of Trypanosoma cruzi Maura Rojas-Pirelaa, Daniel J. Rigdenb, Paul A. Michelsc, Ana J. Cáceresa, Juan Luis Concepcióna, Wilfredo Quiñonesa,* a Laboratorio de Enzimología de Parásitos, Departamento de Biología, Facultad de Ciencias, Universidad de Los Andes, Mérida 5101, Venezuela b Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, United Kingdom. c Centre for Immunity, Infection and Evolution and Centre for Translational and Chemical Biology, School of Biological Sciences, The University of Edinburgh, The King's Buildings, Edinburgh EH9 3FL, Scotland, United Kingdom * Corresponding author. Laboratorio de Enzimología de Parásitos. Facultad de Ciencias. Universidad de Los Andes. La Hechicera, 5101-Mérida, Venezuela. Tel.: +58 274 2401302. Fax: +58 274 2401291. E-mail address: [email protected] Graphicalabstract ACCEPTED MANUSCRIPT 1 HIGHLIGHTS PAS domains are ubiquitous in nature and endow multiple functions to proteins Analysis of kinetoplastid genomes revealed PAS domains in various protein kinases A PAS-phosphoglycerate kinase (PGK)-like protein was identified in T. cruzi The PAS-PGK-like protein contains a PTS1 sequence for import into glycosomes ACCEPTED MANUSCRIPT 2 Abstract Per-ARNT-Sim (PAS) domains of proteins play important roles as modules for signalling and cellular regulation processes in widely diverse organisms such as Archaea, Bacteria, protists, plants, yeasts, insects and vertebrates. These domains are present in many proteins where they are used as sensors of stimuli and modules for protein interactions. Characteristically, they can bind a broad spectrum of molecules. Such binding causes the domain to trigger a specific cellular response or to make the protein containing the domain susceptible to responding to additional physical or chemical signals. Different PAS proteins have the ability to sense redox potential, light, oxygen, energy levels, carboxylic acids, fatty acids and several other stimuli. Such proteins have been found to be involved in cellular processes such as development, virulence, sporulation, adaptation to hypoxia, circadian cycle, metabolism and gene regulation and expression. Our analysis of the genome of different kinetoplastid species revealed the presence of PAS domains also in different predicted kinases from these protists. Open-reading frames coding for these PAS-kinases are unusually large. In addition, the products of these genes appear to contain in their structure combinations of domains uncommon in other eukaryotes. The physiological significance of PAS domains in these parasites, specifically in Trypanosoma cruzi, is discussed. ACCEPTED MANUSCRIPT 1. Introduction 3 Protein domains are parts of the polypeptide sequence that can evolve, fold into a three- dimensional structure, and function independently from the rest of the protein. Domain sizes can vary widely; however, the majority of domains comprise between 100 and 200 residues, with the most common size being about 100 residues [1]. Protein domains are present in the three domains of life: archaea, bacteria and eukaryotes. In the case of eukaryotes, many proteins possess one or multiple domains, with different architectures [2]. It has been estimated that there are at least 1200 families of protein domains [3]. Many of these families have specific roles in diverse cellular processes, such as apoptosis, modulation of the cytoskeleton, vesicle trafficking, DNA binding, protein-protein interactions and regulation of intercellular or intracellular signalling. However, some families are considered to contain “promiscuous domains” or “versatile domains”. Moreover, the possibility of combining different domains in a single polypeptide provides proteins with the ability to be involved in broad spectra of processes that are key in interaction networks in the cell, especially those that contribute to signal translation [4,5]. In eukaryotes about 215 promiscuous protein domains have been identified [4], which include the Per-ARNT-Sim (PAS) domains. The versatile PAS domain is present in all kingdoms of life [6]. PAS is an acronym derived from the three eukaryotic proteins in which the domain was first recognized by sequence homology: the period circadian protein (Per), the vertebrate aryl hydrocarbon receptor nuclear translocator (ARNT) and single- minded (Sim), a Drosophila protein involved in embryonic development [7]. One of the first structural studies of a PAS domain involved the photoactive yellow protein (PYP), a mediator of the phototactic negative response by the phototrophic bacterium Ectothiohodospira halophila [8]. This globular protein is considered to contain the prototype structure of a PAS domain [9,10] (Fig. 1). During the last few years, studies of proteins from all domains of life have resulted in 21,000 entries annotated as PAS domain in the Pfam database [6]. The more than 200 proteinsACCEPTED having a PAS domain comprise receptors, MANUSCRIPT signal transducers, kinases, transcription factors, ion channels, chemotaxis proteins, cyclic nucleotide phosphodiesterases and proteins involved in embryological development of the central nervous system [11]. In this paper, we review the known functionality of PAS domains as a prelude to considering the 4 enigmatic role of this domain in the biology of trypanosomatid parasites, with particular emphasis on Trypanosoma cruzi, since the presence of these regulatory domains in parasitic protists was so far unknown. 2. Localization and structure of PAS domains PAS domain proteins have different subcellular locations. The domain may be contained in proteins present in the plasma membrane, exposed to either the intra- or the extracellular environment [11]. Membrane proteins with PAS domains (Aer, FixL, ArcB and DcuS) have been previously studied [12-16]. In all such membrane proteins analysed until now, the PAS domain is located adjacent to a transmembrane region. Possibly, such a location allows the PAS domain to interact with other modules present in the same or other, adjacent membrane proteins [17]. PAS domains can also be part of soluble cytoplasmic proteins (such as NifL, guanylyl cyclase and histidine kinase (HK), photoactive yellow protein (PYP) and transcription factors). These proteins may have a single or multiple PAS domains in their structure [18-21]. In all these proteins, the PAS domain functions as a sensor of changes or various stimuli in its environment (oxygen, pH, light, ions, glucose, voltage, oligomerisation state, carboxylic acids, energy level or fatty acids) and as modulator of protein-protein interactions, allowing organisms to “sense” and respond appropriately to the changes or stimuli [22-24]. These domains are often associated with other regulatory modules in multi-domain proteins. Moreover, a PAS protein can contain a single or multiple, tandemly-organized PAS domains [11]. Some studies suggest that PAS domains in tandem can regulate recruitment and oligomerisation state of proteins [25,26]. The presence of a PAS domain in a protein can confer association specificity of its effector domain. In prokaryotes, PAS domains have been shown to induce the formation of homodimers [25], and in eukaryotes the formation of heterodimers [24]. Oligomerisation is a necessary requirement for the function of many proteinsACCEPTED. This process is crucial to trigger MANUSCRIPTand regulate different physiological processes, such as gene expression, activity of enzymes and cell-cell adhesion [27]. In the case of some PAS proteins with enzymatic activity, such as the histidine kinases, dimerisation is a prerequisite to achieve phosphorylation in trans [11]. In addition, the PAS domain can 5 mediate the oligomerisation of transcription factors such as bHLH/PAS, and it can confer specificity and distinct recognition of target genes [21]. Initially, PAS domains were considered homologous regions of about 50 amino acids in the Per, ARNT and Sim proteins [28]. However, subsequent studies revealed that PAS domains can also be formed by regions comprising approximately 100 to 150 amino acids [10]. PAS domains adopt a globular structure formed by several α-helices