Genome-Wide Analysis of Regulatory Proteases Sequences Identified
Total Page:16
File Type:pdf, Size:1020Kb
Himmelfarb Health Sciences Library, The George Washington University Health Sciences Research Commons Microbiology, Immunology, and Tropical Medicine Microbiology, Immunology, and Tropical Medicine Faculty Publications 6-4-2014 Genome-wide analysis of regulatory proteases sequences identified through bioinformatics data mining in Taenia solium Hong-Bin Yan Chinese Academy of Agricultural Sciences, Lanzhou, China Zhong-Zi Lou Chinese Academy of Agricultural Sciences, Lanzhou, China Li Li Chinese Academy of Agricultural Sciences, Lanzhou, China Paul J. Brindley George Washington University Yadong Zheng Chinese Academy of Agricultural Sciences, Lanzhou, China See next page for additional authors Follow this and additional works at: http://hsrc.himmelfarb.gwu.edu/smhs_microbio_facpubs Part of the Medical Immunology Commons, and the Medical Microbiology Commons Recommended Citation Yan, H., Lou, Z., Li, L., Brindley, P.J., Zheng, Y. et al. (2014). Genome-wide analysis of regulatory proteases sequences identified through bioinformatics data mining in Taenia solium. BMC Genomics, 15:428. This Journal Article is brought to you for free and open access by the Microbiology, Immunology, and Tropical Medicine at Health Sciences Research Commons. It has been accepted for inclusion in Microbiology, Immunology, and Tropical Medicine Faculty Publications by an authorized administrator of Health Sciences Research Commons. For more information, please contact [email protected]. Authors Hong-Bin Yan, Zhong-Zi Lou, Li Li, Paul J. Brindley, Yadong Zheng, Xuenong Luo, Junling Hou, Aijiang Guo, Wan-Zhong Jia, and Xuepeng Cai This journal article is available at Health Sciences Research Commons: http://hsrc.himmelfarb.gwu.edu/smhs_microbio_facpubs/ 124 Yan et al. BMC Genomics 2014, 15:428 http://www.biomedcentral.com/1471-2164/15/428 RESEARCH ARTICLE Open Access Genome-wide analysis of regulatory proteases sequences identified through bioinformatics data mining in Taenia solium Hong-Bin Yan1,2*, Zhong-Zi Lou1,LiLi1, Paul J Brindley2, Yadong Zheng1, Xuenong Luo1, Junling Hou1, Aijiang Guo1, Wan-Zhong Jia1* and Xuepeng Cai1* Abstract Background: Cysticercosis remains a major neglected tropical disease of humanity in many regions, especially in sub-Saharan Africa, Central America and elsewhere. Owing to the emerging drug resistance and the inability of current drugs to prevent re-infection, identification of novel vaccines and chemotherapeutic agents against Taenia solium and related helminth pathogens is a public health priority. The T. solium genome and the predicted proteome were reported recently, providing a wealth of information from which new interventional targets might be identified. In order to characterize and classify the entire repertoire of protease-encoding genes of T. solium, which act fundamental biological roles in all life processes, we analyzed the predicted proteins of this cestode through a combination of bioinformatics tools. Functional annotation was performed to yield insights into the signaling processes relevant to the complex developmental cycle of this tapeworm and to highlight a suite of the proteases as potential intervention targets. Results: Within the genome of this helminth parasite, we identified 200 open reading frames encoding proteases from five clans, which correspond to 1.68% of the 11,902 protein-encoding genes predicted to be present in its genome. These proteases include calpains, cytosolic, mitochondrial signal peptidases, ubiquitylation related proteins, and others. Many not only show significant similarity to proteases in the Conserved Domain Database but have conserved active sites and catalytic domains. KEGG Automatic Annotation Server (KAAS) analysis indicated that ~60% of these proteases share strong sequence identities with proteins of the KEGG database, which are involved in human disease, metabolic pathways, genetic information processes, cellular processes, environmental information processes and organismal systems. Also, we identified signal peptides and transmembrane helices through comparative analysis with classes of important regulatory proteases. Phylogenetic analysis using Bayes approach provided support for inferring functional divergence among regulatory cysteine and serine proteases. (Continued on next page) * Correspondence:[email protected]; [email protected]; [email protected] 1State Key Laboratory of Veterinary Etiological Biology, Key Laboratory of Veterinary Parasitology of Gansu Province, Key Laboratory of Veterinary Public Health of Agriculture Ministry, Lanzhou Veterinary Research Institute, Chinese Academy of Agricultural Sciences, Lanzhou 730046, Gansu Province, PR China 2Department of Microbiology, Immunology & Tropical Medicine, and Research Center for Neglected Diseases of Poverty, School of Medicine & Health Sciences, The George Washington University, Washington DC, USA © 2014 Yan et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. Yan et al. BMC Genomics 2014, 15:428 Page 2 of 13 http://www.biomedcentral.com/1471-2164/15/428 (Continued from previous page) Conclusion: Numerous putative proteases were identified for the first time in T. solium, and important regulatory proteases have been predicted. This comprehensive analysis not only complements the growing knowledge base of proteolytic enzymes, but also provides a platform from which to expand knowledge of cestode proteases and to explore their biochemistry and potential as intervention targets. Keywords: Proteases, Taenia solium, Drug target, Vaccine candidate antigen, Genome-wide analysis, Cysticercosis, Platyhelminth Background however, have been registered in recent years [1]. Accord- Taeniosis and cysticercosis caused by adult and larval stages ingly, it is sensible to explore and seek novel molecular tar- of the Taenia solium (Platyhelminthes: Cestoda, Cyclophyl- gets and their potential for vaccines and chemotherapeutic lidea, Taeniidae) parasite, respectively, remain important agents to block transmission of this cestode. parasitic diseases and a major health and economic burdens Proteases have been examined in depth in immunological in less developed countries. Moreover, these infectious dis- or chemotherapeutic studies aiming to develop anti- eases also are increasingly seen in more developed coun- protozoa agents; this focus relates to the critical roles pro- tries because of immigration from endemic areas where teolytic enzymes play in the developmental cycles of the pigs are reared and pork is consumed [1]. T. solium has a parasites [7]. Moreover, proteases are important regulatory complex, two-host developmental cycle. Humans are the elements in all cells [8,9]. They also play a key role as effec- only definitive host - harboring the adult tapeworm, which tors of virulence in pathogens through converting host sig- result in taeniasis, whereas pigs, humans and other mam- nal transduction and modifying the immune response mals can serve as intermediate hosts for the larval (cysticer- [10-13]. However, few proteolytic enzymes have been iden- cus) form [2,3]. Infection of the human nervous system by tified or characterized for functions and interactions in T. the cysticercus leads to neurocysticercosis with the symp- solium and other cestodes (Cestoda). toms of acquired epilepsy and seizure. Although five main catalytic classes of proteases have Therapeutic measures available to treat neurocysticerco- been identified from activated oncospheres in vitro of T. sis include steroids, treatments for symptoms, surgery, and solium by proteomic analysis, only several have been de- antiparasitic drugs to kill cysticerci in the central nervous scribed in depth [14]. Three proteases of T. solium have system, muscles, and other sites. By contrast, infection of been named and classified in MEROPS database; among the human small intestine by the adult developmental stage them, a cDNA encoding TsCL-1 - cathepsin L-like cyst- of tapeworm is usually asymptomatic, and readily treated eine protease from the T. solium metacestode has been with the oral medication praziquantel. However, attempts identified and the biochemical properties of the recombin- to date to control transmission of the parasite have often ant enzyme characterized [15]. Few proteases have been been poorly effective and not sustainable [1-3]. investigated for the potential to serve as chemotherapeutic Over the past decade, research has been undertaken to targets or vaccine candidates against cysticercosis [15-20]. develop vaccines and novel chemotherapeutic agents for use The newly available genome sequences of T. solium pro- in pigs to prevent and control transmission of T. solium. vide new avenues to discover novel vaccine candidates or Whereas noteworthy progress has been made [3-6], no ideal therapeutic targets [21]. These abundant new data com- vaccine is currently available for immunization to prevent in- bined with specialized databases and bioinformatics tech- fection of porcine or human populations at risk of infection. niques should accelerate the identification of anti-cestode The screening and identification of