View metadata, citation and similar papers at core.ac.uk brought to you by CORE

provided by PubMed Central

http://genomebiology.com/2002/3/11/reviews/1031.1

Minireview

Plasmodium falciparum virulence determinants unveiled comment Brendan S Crabb and

Address: The Walter and Eliza Hall Institute of Medical Research, , Victoria 3050, Australia.

Correspondence: Brendan Crabb or Alan Cowman. E-mail: [email protected] or [email protected]

Published: 25 October 2002 reviews Genome 2002, 3(11):reviews1031.1–1031.4 The electronic version of this article is the complete one and can be found online at http://genomebiology.com/2002/3/11/reviews/1031 © BioMed Central Ltd (Print ISSN 1465-6906; Online ISSN 1465-6914)

Abstract reports

The human parasite , one of the world’s most devastating pathogens, has an astonishing array of sequences and genes that play key roles in pathogenesis and immune evasion. We must understand the functions of these elements if the chronicity and unpredictable virulence of Plasmodium is to be explained. deposited research

Despite intensive efforts over the last century to understand deep microvasculature of various tissues and organs, and is and control malaria, the causative agent of the most severe associated with certain severe disease outcomes (reviewed in form of the disease - Plasmodium falciparum - remains firmly [8-10]). A parasite protein inserted into the infected red blood entrenched as a leading cause of morbidity and mortality in cell surface, known as P. falciparum erythrocyte membrane humans. Approximately 300-500 million clinical episodes and protein 1 (PfEMP1), is considered to be a key adhesive ligand 2.7 million deaths are attributed to P. falciparum infections mediating sequestration. In a process known as antigenic vari-

each year and, with the emergence of widespread drug- ation, clonal P. falciparum parasites can vary the type of refereed research resistant parasite populations and insecticide-resistant mos- PfEMP1 molecule they express, so as to avoid antibody-medi- quitoes, this situation is predicted to worsen [1]. New ated clearance. Intriguingly, different PfEMP1 ligands mediate cost-effective strategies for controlling malaria, such as the adherence to different receptors on endothelial cells, including development of a vaccine, are urgently required. The complete the scavenger receptor CD36, chondroitin sulfate A and intra- sequence of the 14 linear chromosomes that comprise the cellular adhesion molecule-1. In some instances, parasite pop- P. falciparum genome has recently been determined [2-5]. It ulations with a predisposition to adhere to certain receptors is perhaps not surprising for such a successful pathogen that are more commonly associated with certain disease outcomes, these studies have revealed that a high proportion of the 5,300 such as cerebral and placental malaria, although the precise predicted genes encode proteins known or predicted to play a role of parasite-receptor interactions in determining disease interactions role in pathogenic processes, such as invasion of red blood severity remains to be understood. P. falciparum infections cells, cytoadherence and immune evasion. We have reviewed are persistent, and this chronicity is promoted by antigenic elsewhere the impact of the genome sequence on red blood variation at the infected red blood cell surface. Proteins of the cell invasion [6]; in this article, we comment on our increased repetitive interspersed family (rifins) are also expressed at the understanding of the virulence genes that encode proteins surface of infected red blood cells, and, like PfEMP1, these involved in cytoadherence and immune evasion. Insight into undergo antigenic variation [11,12]. Another family of proteins the function, diversity and regulation of these genes promises related to the rifins has also been described - the subtelomeric information to reveal new strategies for fighting malarial disease [7]. variable open reading frame (stevor) proteins - but their func- tion remains unknown [11]. PfEMP1 and rifin proteins are considered key virulence factors in P. falciparum. Cytoadherence and antigenic variation The adhesion of parasite-infected red blood cells to vascular The PfEMP1, rifin and stevor proteins are encoded by endothelium leads to sequestration of P. falciparum in the members of the var, rif and stevor gene families, respectively 2 Genome Biology Vol 3 No 11 Crabb and Cowman

[11,13,14]. As a measure of the significance of these genes directly adjacent to the non-coding subtelomeric repeats to parasite survival, it is now evident that around 10% of ( igure 1); but 23 of the 59 var genes are found in central the 2.3 megabase P. falciparum genome is committed to locations on chromosomes 4, 6, 7, 8 and 12, mostly in the expression and generation of diversity of these viru- head-to-tail arrays of between three and seven genes (see lence genes. This includes the genes themselves - a total of igure 1, top). 59 var, 149 rif and 28 stevor genes - as well as intergenic (regulatory) regions and non-coding subtelomeric repeat regions thought to contribute to diversity and transcrip- The subtelomeric region tional control of the neighboring virulence genes (these are rom the genome sequence, it is clear that most of the 28 discussed in more detail below). Members of the var, rif P. falciparum chromosome ends are structurally highly and stevor gene families are mostly concentrated at the conserved, and each end can be divided into five different end of each P. falciparum chromosome and are positioned subtelomeric blocks comprising up to 120 kilobases each

SBs CCB SBs Central var array 60-120 kb (Chromosomes 4, 6, 7, 8 and 12)

var rif(n) stev rif(n) stev B Rep20

var rifvar rif(n) stev rif(n) B A Rep20

var rifvar rif(n) stev rif(n)

Telomeric cluster Telomeric B A

at nuclear periphery at nuclear Rep20

SB1 SB2 SB3 SB4 SB5 (tel) (TAREs 1-5) (TARE6) (var + repeats) (Virulence genes)

Figure 1 Conserved structures at the ends of P. falciparum chromosomes. A typical chromosome is shown at the top, highlighting the position of subtelomeric blocks (SBs) and centrally located var gene arrays that occur on only a subset of chromosomes. Three typical chromosome end structures are expanded in the lower section and are represented as clustered at the nuclear periphery [19,20]. For all chromosomes, SBs 1-3 are highly conserved in both order and sequence - the individual boxes represent telomere-associated repeat elements (TAREs) 1-6, as previously described [16] - while cross-linking mediated by SB3 (made up of the degenerate 21 base-pair repeat known as Rep20; shown in blue) has been implicated in the stabilization/formation of chromosomal clusters within the cell nucleus (gray ovals) [18]. The first transcribed var gene is typically transcribed toward the centromere in SB4 (for 22 of the 24 chromosomal ends that have a terminal var). The predominant ‘type 1’ var genes (colored red) are found in the positions shown. A, B and C represent var upstream sequences, upsA, upsB and upsC, respectively, that are typically found upstream of var genes in the locations shown (note that upsB is found upstream of both subtelomeric and internal var genes). The rif, stevor and var genes in SB5 have a common arrangement in some, but not all, chromosome ends. For example, the arrangement of telomere-var-rif/stevor(n) shown at the top of the cluster in the figure is found at 10 chromosome ends, but even among this group there is variation in the arrangement and orientation of transcription of the rif and stevor genes. Hence, although chromosomal clustering within the nucleus promotes ectopic recombination and the generation of diversity in virulence genes [15,19], this is likely to be limited by the variation in the gene structure within SB5. http://genomebiology.com/2002/3/11/reviews/1031.3

( igure 1) [2]. Three subtelomeric blocks (SB1-SB3) are cysteine-rich domains). Although 16 different var types non-coding, while the remaining two (SB4 and SB5) incor- were identified in the sequenced 3D7 genome, 38 of the 59

porate virulence-gene family members amongst other var genes in this parasite line were of the same four- comment sequences ( igure 1). SB1 is located at the extreme terminus domain type, termed ‘type 1’ [2]. It is striking that the 38 and is comprised of approximately 1.2 kb of a seven base- type 1 var genes are not distributed randomly but are pair G-rich telomere repeat with the consensus sequence found either at the extreme telomere, where they are GGGTT(T/C)A [15]. SB2 and SB3 together are 30-40 kb in always present in a telomere-to-centromere orientation, or length and comprise six different non-coding repeat ele- in central chromosomal locations, where they comprise 20 ments previously known as telomere-associated repeat of the 23 var genes in this region. The dominance of one elements (TAREs) 1-6 [15,16]. var type was unexpected, as previously characterized var genes to which an adhesive phenotype had been assigned reviews Although subtelomeric repeats have been identified in other were generally not of this type, and indeed encoded more organisms, the P. falciparum SB2 and SB3 elements are than four domains. unique in eukaryotes, and even amongst the Plasmodia, in terms of their size, composition and complexity. SB2 com- The mechanisms that control expression of the P. falciparum prises a series of five ordered repeat elements (TAREs 1-5) virulence genes remain poorly understood. In the case of that are interspersed with non-repetitive sequence. These var genes, at any one time all but one are repressed. A elements are always found in the same order, and they cooperative interaction - which does not depend on chro- display remarkable sequence conservation and a CG-bias mosomal context - between the var promoter and intron reports (around 30% G+C) that is unusual in the non-coding regions can mediate this silencing [21], but the mecha- regions of the otherwise extremely AT-rich genome (nor- nism(s) that control var gene activation and switching mally about 10% G+C in non-coding regions and 19.4% G+C remain to be determined. It is interesting that the overall). It is possible that SB2 (TAREs 1-5) represents a upstream regions of the 59 var genes in the sequenced distinct functional unit, although this remains to be exam- genome could be classified into three distinct classes, ined. SB3 (or TARE6) consists entirely of a 10-20 kb stretch termed upsA, upsB and upsC [2]. These regions are not of a unique degenerate 21 base-pair repeat known as Rep20 distributed randomly but are generally associated with deposited research [17,18]. More repeat elements are found in SB4, but this particular var genes: upsA with subtelomeric genes tran- region also contains at least one var gene. Hence, the most scribed toward the telomere; upsB with subtelomeric genes telomeric var genes, which are usually transcribed in a transcribed toward the centromere; and upsC with 13 cen- telomere-to-centromere direction, are the first genes tran- trally located var genes; the remaining 10 internal var scribed at most chromosome ends (in total, in 24 of 28 genes have the upsB-type 5؅-untranslated region. The upsB ends). In some chromosome ends, fragments of SB4 are and upsC are likely to be the same as the two var promoter inverted. SB5 may extend up to 120 kb inwards, towards the elements that have previously been identified [22]; but the centromere, and contains members of the var, rif, stevor functional relevance of these three different upstream refereed research and other gene families. region types remains unknown.

The conserved arrangement of P. falciparum chromosome ends is thought to mediate chromosome-end alignment and Beyond the genome clustering in a manner that promotes recombination in The genome sequence of P. falciparum has revealed a telomere-associated genes located at the ends of heterologous long-suspected but nevertheless breathtaking array of chromosomes (ectopic recombination). This has been sequences and genes known or suspected to mediate viru- demonstrated to occur in var genes, resulting in gene con- lence. Much remains to be learned about the mechanisms version events [19]. The process of ectopic recombination that control the expression, switching, and generation of interactions in var genes, which is also likely to occur in other sub- allelic diversity of P. falciparum virulence genes, as well as telomeric gene family members, allows the rapid genera- about the nature of the adhesive and antigenic phenotypes tion of diverse antigenic and adherence phenotypes. Recent of the proteins encoded by these genes. As a result of the evidence suggests that elements in SBs 2-5, and in particular genome project, such studies are now possible, and these SB3, are necessary for chromosome-end clustering but not are likely to provide fertile ground for researchers in the for the anchoring of chromosome ends at the nuclear coming decade. It is hoped that such understanding will periphery, a phenomenon that is probably mediated by SB1 have significant impact on control measures that aim to information (the telomere tract) [18,20]. Hence, parasite populations alleviate the misery of malaria. and individual clonal lines have distinct complements of virulence genes. It is interesting to note that the comple- ment of var genes in the sequenced P. falciparum genome Acknowledgements is dominated by one particular var type (var types are clas- We thank Rebecca O’Donnell and Till Voss for critical reading of this manuscript. B.S.C. and A.F.C. are International Research Fellows of the sified according to their particular arrangement of encoded Howard Hughes Medical Institute. 4 Genome Biology Vol 3 No 11 Crabb and Cowman

References 1. Greenwood B, Mutabingwa T: Malaria in 2002. Nature 2002, 415:670-672. 2. Gardner MJ, Hall N, Fung E, White O, Berriman M, Hyman RW, Carlton JM, Nelson KE, Bowman S, Paulsen IT, et al.: Genome sequence of the human malaria parasite Plasmodium falci- parum. Nature 2002, 419:498-511. 3. Gardner MJ, Shallom SJ, Carlton JM, Salzberg SL, Nene V, Shoaibi A, Ciecko A, Lynn J, Rizzo M, Weaver B, et al.: Sequence of Plasmod- ium falciparum chromosomes 2, 10, 11 and 14. Nature 2002, 419:531-534. 4. Hall N, Pain A, Berriman M, Churcher C, Harris B, Harris D, Mungall K, Bowman S, Atkin R, Baker S, et al.: Sequence of Plasmodium fal- ciparum chromosomes 1, 3-9 and 13. Nature 2002, 419:527-531. 5. Hyman W, Fung E, Conway A, Kurdi O, Mao J, Miranda M, Nakao B, Rowley D, Tamaki T, Wang F, et al.: Sequence of Plasmodium fal- ciparum chromosome 12. Nature 2002, 419:534-537. 6. Cowman AF, Crabb BS: The Plasmodium falciparum genome—a blueprint for erythrocyte invasion. Science 2002, 298:126-128. 7. Baruch DI, Gamain B, Barnwell JW, Sullivan JS, Stowers A, Galland GG, Miller LH, Collins WE: Immunization of Aotus monkeys with a functional domain of the Plasmodium falciparum variant antigen induces protection against a lethal parasite line. Proc Natl Acad Sci USA 2002, 99:3860-3865. 8. Beeson J, Reeder J, Rogerson S, Brown G: Parasite adhesion and immune evasion in placental malaria. Trends Parasitol 2001, 17:331-337. 9. Craig A, Scherf A: Molecules on the surface of the Plasmodium falciparum infected erythrocyte and their role in malaria pathogenesis and immune evasion. Mol Biochem Parasitol 2001, 115:129-143. 10. Kyes S, Horrocks P, Newbold C: Antigenic variation at the infected red cell surface in malaria. Annu Rev Microbiol 2001, 55:673-707. 11. Cheng Q, Cloonan N, Fischer K, Thompson J, Waine G, Lanzer M, Saul A: stevor and rif are Plasmodium falciparum multicopy gene families which potentially encode variant antigens. Mol Biochem Parasitol 1998, 97:161-176. 12. Kyes SA, Rowe JA, Kriek N, Newbold CI: Rifins: a second family of clonally variant proteins expressed on the surface of red cells infected with Plasmodium falciparum. Proc Natl Acad Sci USA 1999, 96:9333-9338. 13. Baruch DI, Pasloske BL, Singh HB, Bi X, Ma XC, Feldman M, Taraschi TF, Howard RJ: Cloning the P. falciparum gene encoding PfEMP1, a malarial variant antigen and adherence receptor on the surface of parasitized human erythrocytes. Cell 1995, 82:77-87. 14. Su XZ, Heatwole VM, Wertheimer SP, Guinet F, Herrfeldt JA, Peter- son DS, Ravetch JA, Wellems TE: The large diverse gene family var encodes proteins involved in cytoadherence and anti- genic variation of Plasmodium falciparum-infected erythro- cytes. Cell 1995, 82:89-100. 15. Scherf A, Figueiredo LM, Freitas-Junior LH: Plasmodium telomeres: a pathogen’s perspective. Curr Opin Microbiol 2001, 4:409-414. 16. Figueiredo LM, Pirrit LA, Scherf A: Genomic organisation and chromatin structure of Plasmodium falciparum chromosome ends. Mol Biochem Parasitol 2000, 106:169-174. 17. Aslund L, Franzen L, Westin G, Persson T, Wigzell H, Pettersson U: Highly reiterated non-coding sequence in the genome of Plasmodium falciparum is composed of 21 base-pair tandem repeats. J Mol Biol 1985, 185:509-516. 18. O’Donnell RA, Freitas-Junior LH, Preiser PR, Williamson DH, Duraisingh M, McElwain TF, Scherf A, Cowman AF, Crabb BS: A genetic screen for improved plasmid segregation reveals a role for Rep20 in the interaction of Plasmodium falciparum chromosomes. EMBO J 2002, 21:1231-1239. 19. Freitas-Junior LH, Bottius E, Pirrit LA, Deitsch KW, Scheidig C, Guinet F, Nehrbass U, Wellems TE, Scherf A: Frequent ectopic recombination of virulence factor genes in telomeric chro- mosome clusters of P. falciparum. Nature 2000, 407:1018-1022. 20. Figueiredo LM, Freitas-Junior LH, Bottius E, Olivo-Marin JC, Scherf A: A central role for Plasmodium falciparum subtelomeric regions in spatial positioning and telomere length regula- tion. EMBO J 2002, 21:815-824. 21. Deitsch KW, Calderwood MS, Wellems TE: Malaria. Cooperative silencing elements in var genes. Nature 2001, 412:875-876. 22. Voss T, Thompson J, Waterkeyn J, Felger I, Weiss N, Cowman A, Beck, H: Genomic distribution and functional characterisation of two distinct and conserved Plasmodium falciparum var gene .5؅ flanking sequences. Mol Biochem Parasitol 2000, 107:103-115