The Genome and Transcriptome of the Zoonotic Hookworm Ancylostoma Ceylanicum Identify Infection-Specific Gene Families
Total Page:16
File Type:pdf, Size:1020Kb
LETTERS OPEN The genome and transcriptome of the zoonotic hookworm Ancylostoma ceylanicum identify infection-specific gene families Erich M Schwarz1, Yan Hu2,3, Igor Antoshechkin4, Melanie M Miller3, Paul W Sternberg4,5 & Raffi V Aroian2,3 Hookworms infect over 400 million people, stunting and closely related to the free-living Caenorhabditis elegans than is the impoverishing them1–3. Sequencing hookworm genomes and free-living Pristionchus pacificus (Fig. 2)12–15. Treatments effective finding which genes they express during infection should against A. ceylanicum might thus also prove useful against other help in devising new drugs or vaccines against hookworms4,5. strongylids, such as Haemonchus contortus, that infect farm ani- Unlike other hookworms, Ancylostoma ceylanicum infects mals and depress agricultural productivity16. Characterizing the both humans and other mammals, providing a laboratory genome and transcriptome of A. ceylanicum is a key step toward such model for hookworm disease6,7. We determined an comparative analysis. A. ceylanicum genome sequence of 313 Mb, with We assembled an initial A. ceylanicum genome sequence of 313 Mb transcriptomic data throughout infection showing expression and a scaffold N50 of 668 kb, estimated to cover ~95% of the genome, of 30,738 genes. Approximately 900 genes were upregulated with Illumina sequencing and RNA scaffolding17,18 (Supplementary during early infection in vivo, including ASPRs, a cryptic Tables 1–3). The genome size was comparable to those of Ancylostoma subfamily of activation-associated secreted proteins (ASPs)8. caninum (347 Mb)19 and H. contortus (320–370 Mb)20,21 but larger Genes downregulated during early infection included than those of N. americanus, C. elegans and P. pacificus (100– ion channels and G protein–coupled receptors; this 244 Mb)22–24. We found that 40.5% of the genomic DNA was repeti- downregulation was observed in both parasitic and free-living tive, twice as much as in N. americanus, C. elegans or P. pacificus nematodes. Later, at the onset of heavy blood feeding, C-lectin (17–24%). We predicted 26,966 protein-coding genes25 with products genes were upregulated along with genes for secreted clade V of ≥100 residues (Supplementary Table 4). We also predicted 10,050 proteins (SCVPs), encoding a previously undescribed protein genes with products of 30–99 residues, to uncover smaller proteins family. These findings provide new drug and vaccine targets that might aid in parasitism26. With RNA sequencing (RNA-seq), and should help elucidate hookworm pathogenesis. we detected expression of 23,855 (88.5%) and 6,883 (68.5%) of these genes, respectively (Fig. 3). The two hookworm species causing the most infections are Necator The genomes of plant-parasitic, necromenic and animal-parasitic americanus and Ancylostoma duodenale, which are generally restricted nematodes have all acquired bacterial genes through horizontal gene to human hosts1,9. Hookworms are free living during part of their transfer (HGT)27,28. We detected one instance of bacterial HGT in life cycle, with eggs hatching in soil and larvae feeding on bacteria A. ceylanicum: Acey_s0012.g1873, a homolog of the N-acetylmuramoyl- through the first and second larval stages. At the infectious third- l-alanine amidase amiD, which encodes a protein that may help stage larval phase (L3i), hookworms cease feeding and wait until they bacteria recycle their murein29. Acey_s0012.g1873 was strongly encounter a human host. They generally enter their host by burrow- expressed in L3i and then downregulated in all later stages of ing into skin, although Ancylostoma can alternatively enter by being infection. It has nine predicted introns, presumably acquired after swallowed. Hookworms then pass through the bloodstream, lungs HGT; it has only one homolog in the entire nematode phylum and digestive tract to the small intestine, where they affix themselves, (NECAME_15163 from N. americanus) but many bacterial homologs mature to adulthood, mate and lay eggs that are excreted by the (Supplementary Fig. 1 and Supplementary Table 5). The sap-feeding host1. The ability to culture A. ceylanicum in golden hamster allows insects Acyrthosiphon pisum and Planococcus citri also have amiD it to be used as a model system for the human-specific hookworms genes, acquired by HGT, that may promote bacterial lysis30,31. N. americanus and A. duodenale, upon which new drug and vaccine To find genes acting at specific points of infection, we carried out candidates can be tested (Fig. 1)6,10,11. Human-specific hookworms RNA-seq on specimens collected at developmental stages spanning belong to a class of parasitic nematodes, strongylids, that are more the onset and establishment of infection by A. ceylanicum in golden 1Department of Molecular Biology and Genetics, Cornell University, Ithaca, New York, USA. 2Program in Molecular Medicine, University of Massachusetts Medical School, Worcester, Massachusetts, USA. 3Section of Cell and Developmental Biology, University of California, San Diego, La Jolla, California, USA. 4Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, USA. 5Howard Hughes Medical Institute, California Institute of Technology, Pasadena, California, USA. Correspondence should be addressed to E.M.S. ([email protected]) Received 12 January 2014; accepted 5 February 2015; published online 2 March 2015; doi:10.1038/ng.3237 NATURE GENETICS ADVANCE ONLINE PUBLICATION 1 LETTERS Figure 1 Life cycle of A. ceylanicum. A. ceylanicum hatch in feces and 19.D L3i 24.HCM grow as free-living first- to third-stage (L1–L3) larvae. Before exiting the third larval stage, they mature into infectious third-stage (L3i) larvae, arresting further development until they are inside a host. In 24 h after gavage into golden hamsters, A. ceylanicum are still in the stomach but 17.D 24.PI have exited the L3i stage (24.PI). A standard model for parasite infection is to incubate L3i larvae for 24 h in hookworm culture medium (24.HCM), which evokes changes in larval shape and behavior thought to mimic those of 24.PI larvae in vivo. By 5 d after infection, larvae have migrated further to the intestine, affixed themselves there and grown into early fourth-stage 12.D 5.D (L4) larvae (5.D; female shown) with visible sexual differentiation. By 12 d (12.D; female shown), they start heavy blood feeding and become young adults with mature males and a few gravid females, with little or no egg laying. By 17 d (17.D; male shown), they are fully mature adults. They begin laying many eggs that are deposited outside the host during defecation, renewing the life cycle. From 19 d onward (19.D; male shown), they remain fertile adults for weeks in hamsters. Scale bars: with N. americanus or other nematodes might encode previously 100 m (L3i through 5.D), 500 m (12.D) and 1 mm (17.D and 19.D). µ µ undescribed components of infection (Supplementary Table 10). Proteases, protease inhibitors, nucleases and protein synthesis were hamster (Figs. 1 and 3, and Supplementary Table 6), beginning at upregulated during early infection (L3i to 24.PI; Supplementary L3i and followed by 24 h either of incubation in hookworm culture Tables 9a and 11a); proteases and protease inhibitors were medium (24.HCM), a standard model for early hookworm infection32, also upregulated after L3i in N. americanus24, as were proteases in or infection in the hamster stomach (24.PI). We found 942 genes to H. contortus21. Secreted proteases could allow hookworms to digest be significantly upregulated from L3i after 24 h of infection in vivo host proteins in blood and intestinal mucosa6,11,35–37. Secreted (Supplementary Table 7). In contrast, we observed only 240 genes proteases might also digest and inactivate proteins of the host’s significantly upregulated from L3i after 24 h of incubation in HCM, immune system37,38. Conversely, secreted protease inhibitors could of which 141 were also upregulated with in vivo infection. This lower also suppress host immunity39–41. number matches previous observations32 and shows that infection G protein–coupled receptors (GPCRs), receptor-gated ion channels in vivo has stronger effects on gene activity than its in vitro model. and neurotransmission-related functions in general were downregu- We linked known or probable gene functions to steps of infec- lated during early infection (L3i to 24.PI), along with transcription tion by assigning gene ontology (GO) terms to A. ceylanicum genes33 factors (Supplementary Tables 9b and 11b). We observed the and computing which GO terms were over-represented among same pattern among genes downregulated in the transition from L3 genes upregulated or downregulated in developmental transitions to fourth-stage (L4) larvae both in H. contortus21 and C. elegans42 (Supplementary Tables 8 and 9)34. We also analyzed homologous (Supplementary Table 8). This finding is consistent with down- gene families for disproportionate upregulation or downregulation; regulation after L3 of sensory perception and transcription genes in in particular, gene families identified by orthology of A. ceylanicum both C. elegans43 and N. americanus24 and of ion channel genes in A. caninum and Brugia malayi32,44. Such downregulation might thus A. ceylanicum C. elegans be conserved in both parasitic and free-living nematodes. A. ceylanicum 9,860 7,042 Among gene families upregulated during early infection, we found N. americanus 7,506 6,225 some already known from other parasitic nematodes, such as ASPs H. contortus 5,268 4,576 C. elegans 7,042 11,323 L3i 24.HCM 24.PI 5.D 12.D 17.D 19.D C. briggsae 6,933 10,795 P. pacificus 4,833 4,979 B. xylophilus 5,373 5,587 M. hapla 4,161 4,376 A. suum 5,955 6,209 B. malayi 5,312 5,545 D. immitis 5,166 5,308 T. spiralis 3,015 3,186 Figure 2 Evolutionary relatedness of A. ceylanicum to other nematodes.