REVIEW
CURRENT OPINION HIV subtype diversity worldwide
Nicholas Bbosaa, Pontiano Kaleebua,b, and Deogratius Ssemwangaa,b
Purpose of review To provide a summary of the current data on the global HIV subtype diversity and distribution by region. HIV is one of the most genetically diverse pathogens due to its high-mutation and recombination rates, large population size and rapid replication rate. This rapid evolutionary process has resulted in several HIV subtypes that are heterogeneously globally distributed. Recent findings Subtype A remains the most prevalent strain in parts of East Africa, Russia and former Soviet Union countries; subtype B in Europe, Americas and Oceania; subtype C in Southern Africa and India; CRF01_AE in Asia and CRF02_AG in Western Africa. Recent studies based on near full-length genome sequencing highlighted the growing importance of recombinant variants and subtype C viruses. Summary The dynamic change in HIV subtype distribution presents future challenges for diagnosis, treatment and vaccine design and development. An increase in recombinant viruses suggests that coinfection and superinfection by divergent HIV strains has become more common necessitating continuous surveillance to keep track of the viral diversity. Cheaper near full-length genome sequencing approaches are critical in improving HIV subtype estimations. However, missing subtype data and low sequence sampling levels are still a challenge in some geographical regions. Video abstract http://links.lww.com/COHA/A14. Keywords diversity, HIV, subtype
INTRODUCTION global HIV pandemic since its inception, other group As of 2017, there were 36.9 million people living with N, O and P viruses have not been disseminated widely. HIV worldwide with a majority of these infections in Group M viruses are further subdivided into nine sub-Saharan Africa [1]. AIDS is a global pandemic subtypes (A-D, F-H, J, K) [2]. The genetic distance caused by two genetically diverse lentiviruses, HIV-1 within a subtype can be between 15 and 20%, whereas and HIV-2 [2], that were introduced through multiple genetic distances across subtypes are usually 25 and cross-species transmissions of simian immunodefi- 35% [4]. HIV-2 remains largely restricted to the ciency viruses from nonhuman primates to humans. These distinct zoonotic viral transmissions gave rise to aMedical Research Council (MRC)/Uganda Virus Research Institute separate HIV-1 groups M (Major), O (Outlier), N (non- (UVRI) and London School of Hygiene and Tropical Medicine (LSHTM) M, non-O) and the most recent group P [2]. The origin Uganda Research Unit and bUganda Virus Research Institute, Entebbe, of HIV-1 has been documented around Kinshasa in the Uganda present day Democratic Republic of Congo around the Correspondence to Deogratius Ssemwanga, Medical Research Council 1920s from where it spread along a transport network (MRC)/Uganda Virus Research Institute (UVRI) and London School of to other areas in sub-Saharan Africa, West Africa, Hygiene and Tropical Medicine (LSHTM) Uganda Research Unit, P.O. Europe and the rest of the world [3]. This global spread Box 49, Entebbe, Uganda. Tel: +256 (0) 417 704170; e-mail: [email protected] was marked by a geographically defined distribution Curr Opin HIV AIDS 2019, 14:153–160 of several genetically distinct viruses. For instance, subtype B became prevalent in almost all parts of DOI:10.1097/COH.0000000000000534 Europe and the Americas while a variety of subtypes This is an open access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 and intersubtype recombinants are found in Africa (CCBY-NC-ND), where it is permissible to download and share the work with the highest diversity reported in West Central provided it is properly cited. The work cannot be changed in any way or Africa. Whereas group M viruses have dominated the used commercially without permission from the journal.
1746-630X Copyright ß 2019 The Author(s). Published by Wolters Kluwer Health, Inc. www.co-hivandaids.com Phylogenetics in HIV transmission
UNIQUE RECOMBINANT FORMS KEY POINTS Unique recombinant forms (URFs) consist of a mix- Subtypes A, C and CRF02-AG are the most prevalent ture of subtypes, but unlike the CRFs, they were on the African continent, whereas subtype B sampled only once from a single multiply-infected predominates in Europe/the Americas and CRF01_AE individual [13]. Hence, all sequences corresponding dominate in Asia. to a CRF have the same recombination break points There is an increase in the prevalence of recombinant in the genome while each URF shows unique break- viruses particularly in geographical areas where points. Intrasubtype recombination does occur but multiple subtypes cocirculate such as Asia and Africa. as it does not lead to subtype recombination, these viruses are not described as URFs. Dual infections The emerging and dynamic HIV genetic diversity needs constant monitoring due to the challenges it presents with HIV-1 and HIV-2 have frequently been towards diagnosis, phylogenetic reconstruction, reported in regions where both viruses circulate treatment and vaccine development. [15] although no recombinants between them have been described. Several AD and AC intersubtype The classification of HIV variants based on near full- recombinants have been described in Eastern Africa length genomes is still limited but needs to be scaled up in a cost-effective way. [16,17], where these three subtypes cocirculate. BF intersubtype recombinants have been found in The problem of missing HIV subtype data and low Brazil and Argentina, where subtypes B and F are sequence sampling levels needs to be addressed. both common [18,19]. Subtypes A and C cocirculate in India with AC recombinants [20].
LITERATURE SEARCH western part of Africa although viral introductions We performed a structured literature search in have been reported elsewhere in Europe (Portugal PubMed using MeSH terms. A combination of key- and France), India and the United States of America words related to the subject of ‘HIV genetic diversity [5]. HIV-2 has also been documented to be less infec- worldwide’ were used with Boolean operators (OR tious than HIV-1 [6,7]; it is comprised of at least nine and AND) in PubMed’s advanced search builder. The groups (formerly referred to as subtypes; A to I) of results generated in the search builder at each level which groups A and D are presently circulating [5]. were combined to output all articles that overlapped HIV-2 subtype data is still limited and only a few across all the search words for the specified review recombinants have been described [5,8]. period (Fig. 1). This article focused mainly on HIV-1 as there was limited data on HIV-2 during the review period. CIRCULATING RECOMBINANT FORMS Recombination is an important phenomenon for GLOBAL DISTRIBUTION OF HIV-1 BY viral diversification that allows the virus to evade REGION the host immune system and antiretroviral treat- ment [9]. A recombinant is a viral sequence that has Africa regions from two or more distinct parental strains The greatest genetic diversity of HIV-1 has been [10]. A circulating recombinant form or circulating found in Africa especially in West Central Africa recombinant form (CRF) is a combination of viral [21]; however, other parts of the continent show genomes of different subtypes in dually infected an assortment of diverse viral strains. In Southern people that results in a mosaic genome composed Africa, subtype C collectively predominates [22,23], of regions from each of the different subtypes. When whereas in Eastern Africa subtype A is the most these recombinants are transmitted and spread dominant [24&&] although other subtype D and C within a population, they are recognized as circulat- viruses cocirculate as well [25]. An increase in sub- ing strains in the HIV epidemic and are classified as type C and a decrease in subtype D have been CRFs [11]. The virus must be isolated from at least reported in Kenya together with extensive intersub- two unlinked individuals and fully sequenced [12]. type recombination [24&&]. A recent study in Uganda The designation ‘cpx’ (complex) is used when more reported an increased prevalence of HIV-1 intersub- than three subtypes are present [13]. To date, 98 type recombinants of about 46% based on near full CRFs have been identified and updated in the Los length HIV sequences [26&&]. A separate study by the Alamos National Laboratory HIV sequence database Phylogenetic And Networks for Generalized HIV [14]. These are designated with sequential number- Epidemics in Africa (PANGEA)-HIV consortium ing as they are discovered and reported. [27&] showed a high proportion (about 50%) of
154 www.co-hivandaids.com Volume 14 Number 3 May 2019 HIV subtype diversity worldwide Bbosa et al.
FIGURE 1. Literature search. Diagrammatic representation of the literature search and selection of articles.
HIV-1 intersubtype recombinants in Uganda based strain. Other subtypes have also been introduced. on deep sequencing of near full-length viral For example, in Spain, subtype F1 was identified in a genomes (unpublished data). In West and West cohort of MSM [38] or CRF02_AG among immigrant Central Africa, the majority of viruses are CRF02_AG populations [39]. [21,28]. Subtypes A and G cocirculate in some coun- tries such as Nigeria [21]. Mosaic viruses involving CFR02_AG have been reported in several African countries [29] and an increase of subtype F2 and South America other recombinant forms in Cameroon [30] (Fig. 2). Subtype B predominates in South America [40,41]. Other intersubtype BF recombinants have also been recently reported [42]. An increase in pure subtype C Europe and North America compared with subtype B has also been reported In North America [31], Europe [32–36] and Australia among HIV/viral hepatitis coinfected patients in [37], subtype B is still the most widespread viral Southern Brazil [43].
1746-630X Copyright ß 2019 The Author(s). Published by Wolters Kluwer Health, Inc. www.co-hivandaids.com 155 Phylogenetics in HIV transmission
FIGURE 2. Global distribution of major HIV subtypes. Map showing the global distribution of the major HIV subtypes and circulating recombinant forms from the review. Source: MapoftheworldmodifiedtoshowHIV-1subtype diversity worldwide. Map-menu.com.
Asia Africa and India, CRF01_AE as the major strain Several subtypes circulate in Asia and it has been throughout Southeast Asia, CRF02_AG as the domi- described as the ‘hotbed’ of recombinant viruses nant virus in West Africa and about 30% of strains in with several CRFs [44]. Subtype A predominates in East Africa to consist of URFs [13,21]. However, there is increased and extensive HIV-1 genetic diversity in Russia and other former Soviet Union countries && && although CRF02_AG is common in Kyrgyzstan Asia [47,49–52,58] and Eastern Africa [24 ,26 ]. By [45]. CRF01_AE and CRF07_BC predominate in 2010, there were about 45 reported CRFs [59], this China [46–48]; new CRFs have been found in China figure has more than doubled within a 7-year period more than in any other country [49–52]. In south- with at least 98 CRFs that have currently been east Asia, CRF01-AE is the most prevalent [53,54] described [14]. This fast rate of increase in the prev- while subtype C predominates in India with reports alence of CRFs and the fact that several subtypes of increased URFs in the north eastern part of the cocirculate globally suggests that these numbers will country [55]. likely continue to grow in the future. The largest source of HIV sequence data (subtype B) is from Europe and the Americas with less data coming from Middle East and North Africa the Middle East (Table 1). As a result, HIV sequence Middle East and North Africa countries are grouped databases like Los Alamos [60] tend to represent a together because subtype B is the predominant sub- biased sampling and therefore do not necessarily type in the region and these countries show rela- reflect the distribution of subtypes worldwide. For tively low prevalence and limited sampling of instance, subtype B accounted for 55% of HIV infections [56&,57]. sequences in the Los Alamos database [60] which is about five times its frequency in the world [61]. Information on subtype diversity based on near full- CONCLUSION length genomes is still relatively scarce with major- Previous reviews have shown that subtype C ity of the available data still based on partial HIV accounts for more than 50% of all infections world- fragments. In East Africa, classification of HIV based wide with a concentration in East and Southern on near full-length sequences showed a prevalence
156 www.co-hivandaids.com Volume 14 Number 3 May 2019 7660 Copyright 1746-630X Table 1. Global prevalence and geographical distribution of HIV-1 subtypes and recombinant forms by studies reviewed
Global Prevalence based on the level of geographic spread HIV genomic Major and available data in Geographical region/available subtype the HIV sequence database Description Countries location Study (year) sequences ß
09TeAto() ulse yWlesKue elh Inc. Health, Kluwer Wolters by Published Author(s). The 2019 A High Includes intrasubtypes like A1 Kenya, Pakistan, Armenia, East Africa, Gounder et al. (2017), Khan Gag and env and A6 Azerbaijan, Belarus, Asia, EuroAsia et al. (2018), Aibekova et al. (n ¼ 386); gag, Estonia, Georgia, (2018), Lapovok et al. (2017), pol and env Kazakhstan, Kyrgyzstan, Kazennova et al. (2017) (n ¼ 755); pol Latvia, Lithuania, (n ¼ 3022), Moldova, Russia, integrase Ukraine and Uzbekistan (n ¼ 506) B High Includes reports of other United states of America, North America, Dapp et al. (2017), Pe´rez-Losada Gag (n ¼ 1790); multiple nonsubtype B virus Canada, France, United South America, et al. (2017), Vrancken et al. env (n ¼ 1755); importations Kingdom, Belgium, Australia, Europe (2017), Dennis et al. (2017), pol and integrase Bulgaria, Cyprus, Spain, Oster et al. (2017), Tumiotto (n ¼ 63 917); pol Iceland, Mexico, Brazil, et al. (2018), Volz et al. (n ¼ 23 196), Guatemala, Honduras, (2018), Hebberecht et al. Near full length Panama, Nicaragua, (2018), Alexiev et al. (2018), (n ¼ 450), gag & Belize, El Salvador, Yebra et al. (2018), Pineda- pol (n ¼ 66), vif Amazonas, Australia, Pen˜a et al. (2018), Pe´rez-Parra (n ¼ 317) Middle East and North et al. (2017), Rojas Sa´nchez Africa that include et al. (2017), Sallam et al. Algeria, Bahrain, (2017), Hernandez-Sanchez Djibouti, Egypt, Iran, et al. (2018), Alves et al. Iraq, Jordan, Kuwait, (2017), Martin et al. (2017), Lebanon, Libya, Chaillon et al. (2017), Morocco, Palestinian Villanova et al. (2017), Territories, Oman, Delatorre et al. (2017), Qatar, Saudi Arabia, Andrade et al. (2017), Barral Somalia, Sudan, Syria, et al. (2017), Machado et al. Tunisia, United Arab (2017), Lima et al. (2017), worldwide diversity subtype HIV Emirates, and Yemen Castley et al. (2017), Sallam et al. (2017), Daw et al. (2017) www.co-hivandaids.com C High Consists of half of all HIV South Africa, Tanzania, Southern Africa, Sivay et al. (2018), Ngcapu et al. pol (n ¼ 200); infections worldwide. India, Southern Brazil East (2017), Ngandu et al. (2017), RNase H Higher prevalence has been Africa, Asia Bauer et al. (2017), Naidoo (n ¼ 140); gag found among coinfected et al. (2017), Billings et al. (n ¼ 91); gag- patients in Brazil (2017), Sharma et al. (2018), protease (n ¼ 28); Sharma et al. (2017), Avanzi Near full length et al. (2017) (n ¼ 170), vpu (n ¼ 80); nef (n ¼ 57); pol/tat- vpu-env (n ¼ 110);
pol and env Bbosa (n ¼ 38) A/D Moderate Intersubtype recombinants of Uganda East Africa Lee et al. (2017) Near full length tal. et
157 subtypes A1 and D (n ¼ 200) 158 transmission HIV in Phylogenetics Table 1 (Continued)
Global Prevalence based www.co-hivandaids.com on the level of geographic spread HIV genomic Major and available data in Geographical region/available subtype the HIV sequence database Description Countries location Study (year) sequences
F1 and B/F Low Increasing prevalence among Spain, Argentina Europe, South Patin˜o-Galindo et al. (2017), Near ful length recombinants MSM in Spain and America Cevallos et al. (2017) (n ¼ 49), Near ful Argentina length (n ¼ 25), pol (n ¼ 312) CRF01_AE High Intersubtype recombinant of Indonesia, China, Vietnam, Asia Ueda et al. (2018), Li et al. pol (n ¼ 837); Near subtypes A and E. Previously Myanmar (2018), Xu et al. (2018), Chen full length classified as subtype E et al. (2018), Chikata et al. (n ¼ 335); partial (2018), Yang et al. (2018), gag, pol and env Zhang et al. (2017), Sun et al. (n ¼ 638); gag, (2017), Li et al. (2017) pol and nef (n ¼ 388), env and gag (n ¼ 70); gag-RT (n ¼ 187) CRF02_AG High Intersubtype recombinants of Ghana, Cameroon West Africa Nii-Trebi et al. (2017), Nanfack pol (n ¼ 99); pol & subtype A and G although et al. (2017), Rodgers et al. env (n ¼ 555); may show more extensive (2017) NFL (n ¼ 1) diversity CRF07_BC Moderate Intersubtype recombinants of China Asia Yang et al. (2017), Zhang et al. pol (n ¼ 276); gag subtypes B and C (2017), Wang et al. (2017), Li (n ¼ 610); Near et al. (2017) full length (n ¼ 1); gp160 (n ¼ 749) CRF08_BC Low Recombinant between subtyes China Asia Chen et al. (2018) gag, pol and env B and C and commonly (n ¼ 291) cocirculate with CRF07_BC and CRF01_AE in parts of Yunnan, China CRF35_AD Low Recombinant between Iran (Tehran) Middle East Vahabpour et al. (2017) pol (n ¼ 42) subtypes A and D Mixed subtypes Low CRF01_AE, CRF02_BC, Several countries within Central and West Tongo et al. (2018) and Tongo Near full length oue14 Volume and new subtypes A–H, J and K. congo Basin, China, Africa, et al. (2018), Fan et al. (n ¼ 305); pol CRFs or URFs Also include newly- African Asia, N. America, (2018), Miao et al. (2018), Lu (n ¼ 18); env recommended strains like countries,Canada, South South and East et al. (2018), Kong et al. (n ¼ 48); gag/ CRF55_01B, CRF67_01B Africa, Uganda and Africa (2017), Li et al. (2017), Wu pol/env and CRF69_01B, Kenya et al. (2017), Hu et al. (2017), (n ¼ 183), Gag- CRF96_cpx, CRF01/ Kiguoya et al. (2017) protease ubr3 Number CRF07_BC, CRF07_BC/ (n ¼ 1342) CRF55_01B, CRF87_cpx/ CRF88_BC