Mtdna Variation in the South African Kung and Khwe—And Their Genetic Relationships to Other African Populations Yu-Sheng Chen,1,* Antonel Olckers,4 Theodore G
Total Page:16
File Type:pdf, Size:1020Kb
View metadata, citation and similar papers at core.ac.uk brought to you by CORE provided by Elsevier - Publisher Connector Am. J. Hum. Genet. 66:1362–1383, 2000 mtDNA Variation in the South African Kung and Khwe—and Their Genetic Relationships to Other African Populations Yu-Sheng Chen,1,* Antonel Olckers,4 Theodore G. Schurr,1,2,† Andreas M. Kogelnik,1,3 Kirsi Huoponen,1,‡ and Douglas C. Wallace1,2 1Center for Molecular Medicine and 2Department of Anthropology, Emory University, and 3Program in Biomedical Engineering, College of Computing, Georgia Institute of Technology, Atlanta; and 4Department of Human Genetics, University of Pretoria, Pretoria The mtDNA variation of 74 Khoisan-speaking individuals (Kung and Khwe) from Schmidtsdrift, in the Northern Cape Province of South Africa, was examined by high-resolution RFLP analysis and control region (CR) sequencing. The resulting data were combined with published RFLP haplotype and CR sequence data from sub-Saharan African populations and then were subjected to phylogenetic analysis to deduce the evolutionary relationships among them. More than 77% of the Kung and Khwe mtDNA samples were found to belong to the major mtDNA lineage, macrohaplogroup L* (defined by a HpaI site at nucleotide position 3592), which is prevalent in sub-Saharan African populations. Additional sets of RFLPs subdivided macrohaplogroup L* into two extended haplogroups—L1 and L2—both of which appeared in the Kung and Khwe. Besides revealing the significant substructure of macrohap- logroup L* in African populations, these data showed that the Biaka Pygmies have one of the most ancient RFLP sublineages observed in African mtDNA and, thus, that they could represent one of the oldest human populations. In addition, the Kung exhibited a set of related haplotypes that were positioned closest to the root of the human mtDNA phylogeny, suggesting that they, too, represent one of the most ancient African populations. Comparison of Kung and Khwe CR sequences with those from other African populations confirmed the genetic association of the Kung with other Khoisan-speaking peoples, whereas the Khwe were more closely linked to non–Khoisan- speaking (Bantu) populations. Finally, the overall sequence divergence of 214 African RFLP haplotypes defined in both this and an earlier study was 0.364%, giving an estimated age, for all African mtDNAs, of 125,500–165,500 years before the present, a date that is concordant with all previous estimates derived from mtDNA and other genetic data, for the time of origin of modern humans in Africa. Introduction 95% of the Kung mtDNAs were characterized by a HpaI site gain at nucleotide position (np) 3592. This marker The South African Kung San were among the first human was subsequently found at very high frequencies in groups in which mtDNA variation was analyzed by re- mtDNAs of other sub-Saharan African populations striction analysis (Denaro et al. 1981; Johnson et al. (Scozzari et al. 1988, 1994; Soodyall and Jenkins 1992, 1983). In these studies, the mtDNA sequences of the 1993; Chen et al. 1995) but was not observed in pop- Kung were surveyed for variation, by means of six rare- ulations of non-African origin, with a few exceptions cutting restriction endonucleases (AvaII, BamHI, HaeII, (Cann et al. 1987; Soodyall 1993). Furthermore, the HincII, HpaI, and MspI) and Southern blotting. Al- African mtDNAs defined by the HpaI np-3592 site gain though this procedure screened only 2%–3% of the formed a group of related mtDNA haplotypes (originally ∼ mtDNA sequence for variation, it revealed that 90%– defined as haplogroup “L” and here redefined as “L*”), which was found to be the most divergent of those iden- Received May 19, 1999; accepted for publication October 17, 1999; tified in human populations from around the world electronically published March 30, 2000. Address for correspondence and reprints: Dr. Douglas C. Wallace, (Chen et al. 1995). These findings contributed to the Director, Center for Molecular Medicine, Emory University School of hypothesis of an African origin of modern human Medicine, 1462 Clifton Road, N.E., Atlanta, GA 30322. E-mail: mtDNAs (Johnson et al. 1983; Cann et al. 1987; Chen [email protected] et al. 1995), although other interpretations of these data * Present affiliation: Department of Psychiatry, University of Chi- cago Medical School, Chicago. have also been put forward (Excoffier and Langaney † Present affiliation: Southwest Foundation for Biomedical Re- 1989; Templeton 1992). search, Department of Genetics, San Antonio. Since 1981, relatively limited data have been collected ‡ Present affiliation: Laboratory of Genetics, Department of Biology, on the RFLP variation in Kung or related Khoisan- University of Turku, Turku, Finland. ᭧ 2000 by The American Society of Human Genetics. All rights reserved. speaking populations. Those studies that were con- 0002-9297/2000/6604-0019$02.00 ducted on Khoisan-speaking populations continued to 1362 Chen et al.: mtDNA Variation in South African Populations 1363 use low-resolution (LR)–RFLP analysis with only the their phylogenetic relationships to other African and six rare-cutting restriction endonucleases listed above non-African mtDNAs. (Soodyall and Jenkins 1992, 1993). As a consequence, Similarly, until very recently, only a limited num- these data provided additional information about Khoi- ber of Kung mtDNAs had been surveyed for control san groups but could not be entirely integrated with region (CR) sequence variation. In earlier studies (Vig- those obtained, on other populations, by high-resolu- ilant et al. 1989, 1991), mtDNAs from Kung individuals tion (HR)–RFLP analysis (Cann et al. 1987; Torroni et consistently clustered at the deepest nodes of the phy- al. 1992). logenies constructed from CR sequences, a result that The value of using HR-RFLP analysis with African indicated that they are some of the most diver- mtDNAs, for phylogenetic reconstructions of human gent—hence, oldest—mtDNAs present in human pop- mtDNA, has been shown primarily by two studies of ulations. Soodyall et al. (1996) also studied CR se- African populations (Cann et al. 1987; Chen et al. quence variation in southern-African populations and Lys 1995). In the first study, Cann et al. (1987) conducted found that the COII/tRNA intergenic 9-bp deletion genomic digestions of mtDNAs, by 12 enzymes (AluI, had occurred multiple times in African populations, in- AvaII, DdeI, FnuDII, HaeIII, HhaI, HinfI, HpaI, HpaII, dependent of those appearing in Asian mtDNAs, a trend MboI, RsaI, and TaqI), with the resulting restriction first noted by Vigilant (1990). Interestingly, deletion fragments being resolved by PAGE. In the later study, mtDNAs were not observed in Khoisan-speaking pop- the entire genome of African mtDNAs was PCR am- ulations and were rare in western- and southwestern- plified in nine overlapping fragments, which were then African groups, whereas they occurred frequently in subjected to digestion by 14 enzymes (AluI, AvaII, Pygmy and so-called Negroid populations from central BamHI, DdeI, HaeII, HaeIII, HhaI, HincII, HinfI, HpaI, Africa and in Bantu-speaking peoples of southern Africa HpaII, MboI, RsaI, and TaqI), with the resulting re- (Soodyall et al. 1996). This distribution suggested that striction fragments being resolved by agarose gel elec- deletion mtDNAs arose in central Africa and were dis- trophoresis. Both of these studies revealed that African seminated into southern Africa via the recent Bantu haplotypes form the deepest branches of the human expansion. mtDNA phylogeny and that African groups are the most Although studies using HR-RFLP methods of analysis have provided new insights into the mtDNA variation divergent of all world populations. of African populations, there has been almost no effort Several additional aspects of mtDNA variation in Af- to relate the data sets generated by RFLP to those gen- rican populations were shown in the Chen et al. (1995) erated by a CR sequence analyses. To date, only the study. First, two types of length polymorphisms were western-African Mandelaku (Mandenka) have been observed in some of the central-African mtDNAs subjected to both LR-RFLP and CR sequencing analyses (Mbuti [eastern] and Biaka [western] Pygmies), one of (Graven et al. 1995), whereas the linkage between the which was the COII/tRNALys intergenic 9-bp deletion HR-RFLP data and the CR sequence data for the Af- that formerly had been thought to occur only in Asian rican samples analyzed by Cann et al. (1987) and Vig- populations. This length polymorphism had previously ilant et al. (1989, 1991) has not been made explicitly been noted in African individuals but had not been as- clear. Thus, the relationship between HR-RFLP hap- signed to a particular haplogroup or mtDNA lineage lotypes and CR sequences in African populations has (Cann and Wilson 1983). In addition, western-African yet to be adequately addressed. In addition, because of populations from Senegal were found to have haplo- the different methods of RFLP haplotyping used in the types that lacked the HpaI np-3592 site gain and, thus, Cann et al. (1987) and Chen et al. (1995) studies, there appeared not to belong to haplogroup L*. Because were some inconsistencies in the positioning of certain nearly all of these “non-L” haplotypes had the DdeI polymorphisms that were critical for phylogenetic re- np-10394 site gain, a polymorphism that was also pre- constructions of the haplotypes defined in each study. sent in almost every haplogroup L* mtDNA, the data Consequently, a more complete characterization of the suggested that the former haplotypes derived from the distribution of genetic variation in coding (i.e., RFLP) latter, although the exact subhaplogroup (L1 or L2) versus noncoding (i.e., CR) regions of the mtDNA ge- from which these non-L haplotypes originated was not nome was needed. clear. Alternatively, non-L mtDNAs could have evolved Both to increase our knowledge of the genetic rela- from haplotypes observed in Europeans, such as those tionships between Khoisan (Kung San and Khoi) pop- belonging to haplogroups I–K (Torroni et al.