Biochemical Systematics and Ecology 61 (2015) 161e168

Contents lists available at ScienceDirect

Biochemical Systematics and Ecology

journal homepage: www.elsevier.com/locate/biochemsyseco

Molecular identification and allopatric divergence of the white species in China based on the cytoplasmic DNA variation

* Zhong-Hu Li a, b, c, , 1, Chen Yang b, 1, Kang-Shan Mao b, Ya-Zhen Ma b, Jie Liu c, Zhan-Lin Liu a, Tuan-Tuan Deng a, Gui-Fang Zhao a a Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi'an 710069, China b Molecular Ecology Group, State Key Laboratory of Grassland Agro-Ecosystem, College of Life Sciences, Lanzhou University, Lanzhou 730000, Gansu, China c Key Laboratory of Biodiversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming 650201, China article info abstract

Article history: Molecular identification of species may be highly related to the geographic isolation Received 11 December 2014 and speciation stages among species. In this study, we examined these possibilities in a Received in revised form 15 May 2015 group of white in China. We sampled 449 individuals from 60 natural populations of Accepted 11 June 2015 seven species from sect. Quinquefoliae subsect. Strobus. We sequenced four chloroplast Available online xxx DNA regions (around 3100 bp in length) and two mitochondrial DNAs (around 1000 bp in length). We identified 21 chlorotypes and 10 mitotypes. Both chlorotypes and mitotypes Keywords: recovered from four species with long disjunction and restricted distributions in northern Allopatric divergence Geographic isolation or northwestern China, Pinus sibirica, Pinus koraiensis, Pinus wallichiana and Pinus pumila fi Speciation stage are species-speci c, suggesting that these cytoplasmic DNAs can distinguish them from the Species identification close relatives. Allopatric isolations should have contributed greatly to their genetic di- White pines vergences. However, both chlorotypes and mitotypes recovered for Pinus dabeshanensis and Pinus fenzeliana distributed in southeastern and southern China are shared or closely related to those found in the widely distributed . These two species may have diverged or derived from P. armandii recently. All of our findings together suggest that the discrimination power of the molecular identifications based on the cytoplasmic DNA barcodes may show variable discriminability depending on geographic isolation and speciation stages among the sampled species. © 2015 Elsevier Ltd. All rights reserved.

1. Introduction

Species identification of plant materials is critical in diverse fields, especially related to biodiversity. Traditional identi- fication depend exclusively on morphological traits. Both well-trained taxonomists and the presence of the key morpho- logical traits (e.g. flowers and seeds) are necessary in identifying plant samples into species. However, molecular

* Corresponding author. Key Laboratory of Resource Biology and Biotechnology in Western China (Ministry of Education), College of Life Sciences, Northwest University, Xi'an 710069, China. E-mail address: [email protected] (Z.-H. Li). 1 These authors contributed equally to this work. http://dx.doi.org/10.1016/j.bse.2015.06.002 0305-1978/© 2015 Elsevier Ltd. All rights reserved. 162 Z.-H. Li et al. / Biochemical Systematics and Ecology 61 (2015) 161e168 identification based on the standard DNA barcodes and special DNA fragments provides rapid and accurate species identi- fication (Kress et al., 2005; Hollingsworth et al., 2009, 2011). Cytoplasmic DNA fragments, especially chloroplast (cp) DNA regions (e.g., rbcL and matK), are suggested as the suitable candidates for identifying plant species (Kress et al., 2005; CBOL Plant Working Group, 2009). However, the commonly used cpDNA fragments (e.g., matK and rbcL) failed to develop species- specific mutations in gymnosperm groups (Liu et al., 2012a). Although some nuclear DNA regions in may be useful for identifying plant species, it is difficult to amplify and sequence these fragments. Therefore, other cpDNAs as well as the mitochondrial (mt) DNAs with moderate mutations, were suggested to identify species in conifers (Liu et al., 2012a; Hao et al., 2015). In addition to looking for candidate DNA regions in barcoding species of numerous families or a special group (CBOL Plant Working Group, 2009; Hollingsworth et al., 2009; China Plant BOL Group, 2011), the recommended barcodes were also used for diverse applications, for example, to construct biodiversity inventory (Lahaye et al., 2008) and identify cryptic species (Yassin et al., 2008; Ragupathy et al., 2009; Liu et al., 2011). However, few researches were designed to examine discrimi- nation powers of these barcodes in identifying the closely related species with different speciation ages and allopatric iso- lations. Here we conduct such an example study to address this question using seven species of the white pine species (sect. Quinquefoliae subsect. Strobus) as a system. Around 11 white pine species are recorded in China, Pinus armandii Franchet, Pinus bhutanica Grierson & al., Pinus dabeshanensis Cheng et Law, Pinus pumila (Pallas) Regel in Kuester & al., Pinus fenzeliana Handel-Mazzetti, Pinus koraiensis Siebold & Zuccarini, Pinus kwantungensis Chun & Tsiang, Pinus morrisonicola Hayata, Pinus sibirica Du Tour in Deterville, Pinus wallichiana A. B. Jackson and Pinus wangii Hu & W. C. Cheng (Fu et al., 1999; Eckert et al., 2013). Among them, P. armandii is widely distributed from western to southeast China with altitudes ranged from 1000 to 3300 m. Six other species, Pinus bhutanica, Pinus kwantungensis, Pinus fenzeliana, Pinus dabeshanensis, Pinus wangii and Pinus morrisonicola were closely related to P. armandii. All of these seven species show short-distance disjunctions to each another. The other five species occur in southern China and some of them are narrowly restricted to a small range, for example, P. morrisonicola endemic to Taiwan and P. dabeshanensis to the Dabieshan Mountains. Recent studies suggested that P. armandii shared genetic variant with the other six species at the examined loci, indicating no complete lineage sorting among them (Liu et al., 2014a; Hao et al., 2015). However, Pinus wallichiana is found at higher altitude (1600e3300 m) along temperate forests in the Himalaya while P. koraiensis has a wide distribution in northeastern China. The other two species, P. pumila and P. sibirica, are found in the northeastern and western China, respectively. These four species are closely related to each another than to P. armandii complex although they are distributed with long-distance disjunctions based on morphological and molecular evidences (Fu et al., 1999; Eckert et al., 2013). Furthermore, the sequence variations from the commonly used cpDNAs could not diagnose them effectively (Hernandez-Le on et al., 2013; Liu et al., 2014a; Hao et al., 2015). In this study, we sequenced four cpDNA fragments and two mtDNAs for 449 individuals from 60 natural populations of seven species from two tentative groups of sect. Quinquefoliae subsect. Strobus. One group comprises P. armandii, P. dabeshanensis and P. fenzeliana and distributional ranges between them are allopatric, but with short-distance. The other group consists of P. sibirica, P. koraiensis, P. wallichiana and P. pumila with long-distance allopatric distributions. We hypothesized that with the more cpDNA fragments sequenced, it is likely to distinguish these species. However, discrimination power for species identification based on sequence variations will change dependent on the geographic isolations and possible speciation ages. In addition to test this hypothesis, our range-wide sampling and two large population genetic datasets based on cytoplasmic DNA variation undoubtedly will aid to understand the interspecific relationship of these species and their response to the historical climate changes (Liu et al., 2012b, 2014b; Hao et al., 2015).

2. Materials and methods

2.1. Plant material and sampling design

We collected 449 individuals from 60 natural populations of Pinus armandi, P. dabeshanensis, P. fenzeliana, P. koraiensis, P. pumila, P. sibirica and P. wallichiana. For both P. armandi and P. dabeshanensis, our samples cover their entire distributional ranges. We collected samples for P. koraiensis, P. pumila, P. sibirica and P. wallichiana from most recorded counties in China. For most populations, 5e10 individuals were sampled and needle leaves were taken and preserved in silica gel until DNA extraction. The geographic information of each sampling site, which includes the latitude, longitude and altitude, were recorded by an Extrex GPS (Garmin, Olathe, USA). A population of Pinus tabulaeformis was sampled as outgroup (Table 1, Figs. 1 and 2).

2.2. DNA extraction, amplification and sequencing

We extracted total genomic DNA from needle leaves using the modified 2 cetyltrimethylammonium bromide (CTAB) procedure (Doyle and Doyle, 1987). For all sampled individuals, four cpDNA non-coding regions (trnL-F, trnS-fM, trnS-G, rpl16) and two mtDNA non-coding regions (nad5 intron1, nad7 intron1) were amplified and sequenced using universal primers (see primer sequences and references in Table 2). All PCR amplifications were conducted in a total volume of 25 mL on a Gene Amp PCR system 9700 thermal cycler (Applied Biosystems, Foster City, CA, USA), using 10e40 ng genomic DNA. The mixture also contains 0.2 mM each primer, 1.5 mM MgCl2, 0.5 mM dNTPs, 50 mM TriseHCl (pH 8.3), and 1.0 U Taq polymerase (TaKaRa, Z.-H. Li et al. / Biochemical Systematics and Ecology 61 (2015) 161e168 163

Table 1 Geographic information of all sampled populations used in this study.

Species ID Population n Latitude (N) Longitude (E) Altitude (m) Pinus koraiensis 1 Ning'an, HLa 10 4411.7750 12631.6170 900 2 Baishan, JL 10 4156.3320 12735.5720 829 3 Helong, JL 10 4224.3840 12845.3200 564 4 Tieli, HL 10 4656.9220 12852.9740 418 5 Anyang, JL 10 4227.5000 12809.9600 802 6 Muleng, HL 10 4453.7220 13033.060 277 7 Fusong, JL 10 4233.5450 12747.0020 769 8 Dunhua, JL 10 4830.6530 12835.5950 594 9 Anyang, JL 10 4809.5200 12902.0520 613 10 Dandong, LN 10 4007.7400 12422.2980 40 11 Fengcheng, LN 10 4025.6120 12404.1750 176 12 Huanren, LN 10 4114.1420 12523.0070 400 13 Fusong, JL 10 4156.8380 12734.9170 830 14 Anyang, JL 10 4216.8100 12811.6300 970 15 Jidong, HL 10 4506.0180 13115.5680 180 16 Yichun, HL 10 4711.7300 12856.9830 386 Pinus wallichiana 17 Gyirong, TB 10 2830.6650 8513.0320 3362 18 Yadong, TB 10 2725.8380 8854.2050 3390 Pinus pumila 19 Genhe, IM 10 5221.9850 12228.4160 862 20 Jagdaqi, HL 10 5137.3540 12359.9300 627 21 Genhe, IM 10 5150.6370 12202.2060 847 Pinus sibirica 22 Burqin, XJ 1 4839.8400 8706.0560 2088 23 Burqin, XJ 10 4825.5120 8606.0560 1822 24 Burqin, XJ 5 4836.0960 8702.9790 1321 25 Burqin, XJ 8 4828.8130 8720.3930 1384 Pinus armandii 26 Meixian, SX 10 3359.4820 10759.8270 1615 27 Fengxian, SX 10 3352.9100 10633.4200 1452 28 Bome, TB 10 2945.6750 9556.7400 2922 29 Bome, TB 10 2957.6020 9522.3430 2909 30 Bome, TB 10 2937.2440 9618.2100 3244 31 Mianning, SC 10 2830.7090 10212.5090 1932 32 Bome, TB 10 2951.7420 9446.2500 3101 33 Xunhua, QH 10 3547.9780 10241.3670 2327 34 Hanyuan, SC 10 2938.5270 10229.4020 2295 35 Lixian, SC 10 3157.5120 10218.2790 2760 36 Wenxian, GS 6 3254.6270 10438.3100 2321 37 Zhouzhi, SX 3 3353.6070 10748.6220 2025 38 Nanzhao, HN 10 3334.2570 11158.8770 727 39 Chang'an, SX 4 3355.8950 10902.3800 2242 40 Zhouzhi, SX 1 3356.1320 10811.0350 1795 41 Taibaishan, SX 10 3403.2800 10741.9800 2145 42 Taibaishan, SX 4 3402.2070 10742.9500 2832 43 Taibaishan, SX 2 3403.1150 10741.9870 2215 44 Taibaishan, SX 3 3402.4350 10742.4780 2424 45 Taibaishan, SX 8 3401.1330 10743.1080 2730 46 Taibaishan, SX 7 3403.6630 10741.9870 1878 47 Taibaishan, SX 4 3400.5400 10743.2900 2537 48 Taibaishan, SX 5 3402.4470 10742.4080 2356 49 Yuexi, AH 7 3059.4630 11605.3380 1341 50 Kangle, GS 1 3457.1970 10341.0390 2270 51 Zhouqu, GS 5 3334.7440 10430.8950 2047 52 Shangri-Lab,YN 7 2722.0560 10007.8610 3280 53 Mouli, SC 3 2741.4830 10057.3820 2918 54 Taibaishan, SX 3 3359.9620 10743.6520 2659 Pinus dabeshanensis 55 Yuexi, AH 2 3059.3680 11605.6980 1131 56 Yuexi, AH 10 3044.8750 11605.7950 1039 57 Yuexi, AH 6 3043.2880 11608.2500 1034 58 Yuexi, AH 1 3053.1650 11605.8300 1121 Pinus fenzeliana 59 Wuzhishan, HA 2 1901.3600 10931.8370 1333 60 Wuzhishan, HA 1 1857.0470 10921.9830 1232 Pinus tabulaeformisc 61 Baotou, IM 10 4047.3020 11018.5330 1510

a HL, Heilongjiang; JL, Jilin; LN, Liaoning; TB, Tibet (Xizang Autonomous Region); IM, Inner Mongolia (Nei Monggol Autonomous Region); XJ, Xinjiang Uighur Autonomous Region; SX, Shaanxi; SC, Sichuan; GS, Gansu; QH, Qinghai; HN, Henan; YN, Yunnan; AH, Anhui; HA, Hainan; n, number of individuals in each population. b Also known as name Zhongdian before 2001. c Used as outgroup in this study. 164 Z.-H. Li et al. / Biochemical Systematics and Ecology 61 (2015) 161e168

Dalian, China). Two cpDNA fragments, trnS-G and trnS-fM, were performed under the PCR programs as follows: initial denaturation at 94 C for 5 min, then 36 cycles of denaturation at 94 C for 1 min, annealing at 57 C for 45 s, extension at 72 C for 1.5 min, then a final extension at 72 C for 7 min. Another two cpDNA fragments, trnL-F and rpl16, were performed as initial denaturation at 94 C for 3 min, 37 cycles of denaturation at 94 C for 45 s, annealing at 60 C for 55 s, extension at 72 C for 75 s, and a final extension at 72 C for 7 min. For, the mtDNA fragments nad5 intron 1 and nad7 intron 1 were performed as initial denaturation at 94 C for 3 min, 30 cycles of denaturation at 94 C for 30 s, annealing at 55 C for 30 s, extension at 72 C for 2 min and a final extension at 72 C for 10 min. The amplification products were separated on 1.5% agarose gels and purified with a TIANquick Midi Purification Kit (Tiangen Biotech, Beijing, China) according to the manufacturer's instructions. Sequencing PCR reactions were performed using ABI Prism BigDye Terminator version 3.1 Cycle Sequencing Kit, and capillary electrophoresis was run on an ABI 3130 XL sequencer (PE Applied Biosystems, Foster City, CA, USA).

2.3. Data analysis

DNA sequences were aligned and manually adjusted using MEGA v. 5 (Tamura et al., 2011). Since both chloroplast genome and mitochondrial genome are single circular DNA molecule and each could be regarded as a single locus, we therefore concatenated four cpDNA and two mtDNA fragments together, respectively, to create the four-locus cpDNA dataset and the two-locus mtDNA dataset. DnaSP v. 5.10 (Librado and Rozas, 2009) was applied to identify cpDNA and mtDNA haplotypes, and all haplotype sequences generated in the present study were deposited in GenBank and their registered Num. is KT159776eKT159796. Phylogenetic relationships among haplotypes were reconstructed using Bayesian Inference (BI) in

Fig. 1. CpDNA variation in Pinus subsection Strobus. (a) Geographic distribution of the 21 chlorotypes identified. The range of the seven species is indicated by different colours of outer rings. (b) Inferred phylogenetic network among the 21 chlorotypes (see text for details). Each chlorotype is represented by a circle whose size is proportional to its frequency over all populations. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.) Z.-H. Li et al. / Biochemical Systematics and Ecology 61 (2015) 161e168 165

Fig. 2. MtDNA variation in Pinus subsection Strobus. (a) Geographic distribution of the ten mitotypes identified. The range of the seven species is indicated by different colours of outer rings. (b) Inferred phylogenetic network among the ten mitotypes. Each mitotype is represented by a circle whose size is proportional to its frequency over all populations. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

MrBayes v. 3.1.2 (Huelsenbeck and Ronquist, 2001). The MCMC algorithm was run for 5 000 000 generations with four incrementally heated chains, starting from random trees and sampling one out of every 500 generations. The first 2500 trees were discarded as burn-in, depending on when chains appeared to have become stationary. An examination of in- terrelationships among both cpDNA and mtDNA haplotypes was separately conducted using NETWORK v. 4.2.0.1 (Bandelt et al., 1999). We assumed that all coded indels and mutations evolved at an equal rate in these analysis.

3. Results

3.1. cpDNA variation and species identification

Alignments of DNA sequences that represent 449 individuals from 60 populations of seven pine species generated a 3178 bp cpDNA dataset. A total of 16 substitutions and 7 indels were identified, which resulted in 21 chlorotypes (C1eC21). Most of these chlorotypes are species-specific. However, C15 and C18 are shared between P. armandii and P. dabeshanensis or P. fenzeliana, respectively (Fig. 1). In addition, the widespread P. armandii harbored 11 chlorotypes (C6eC21). Two chlorotypes, C4 and C5 are fixed in P. sibirica. Meanwhile, P. koraiensis, P. wallichiana and P. pumila have only fixed one chlorotype C1, C2 and C3, respectively. Four species (P. sibirica, P. koraiensis, P. wallichiana and P. pumila) can be successfully identified by the combination of four chloroplast DNA regions. Network and Bayesian analyses showed chlorotypes recovered from P. kor- aiensis in northeast China and P. sibirica from western China are more closely related to each another comparing with those from P. koraiensis and P. pumila (see Supporting information, Fig. S1). 166 Z.-H. Li et al. / Biochemical Systematics and Ecology 61 (2015) 161e168

Table 2 Primers of the amplification fragments used in present study.

Fragments Primer pairs (50e30) PCR product size (bp) Primer source trnLeF trnL: GGTTCAAGTCCCTCTATCCC 900e1000 Taberlet et al., 1991 trnF: ATTTGAACTGGTGACACGAG Taberlet et al., 1991 trnSefM trnS: GAGAGAGAGGGATTCGAACC 700e800 Shaw et al., 2005 trnfM: CATAACCTTGAGGTCACGGG Shaw et al., 2005 trnSeG trnS: GCCGCTTTAGTCCACTCAGC 600e700 Hamilton, 1999 trnG: GAACGAATCACACTTTTACCAC Hamilton, 1999 rpl16 F71: GCTATGCTTAGTGTGTGACTCGTTG 700e800 Shaw et al., 2005 R15: CCCTTCATTCTTCCTCTATGTTG Shaw et al., 2005 nad5intron1 F: AGTCCAATAGGGACAGCACAC 700e800 Jaramillo-Correa et al., 2003 R: ATCTCCGGTAACCGAMATTG Du et al., 2009 nad7intron1 F: GGAACCGCATATTGGATCAC 400e500 Jaramillo-Correa et al., 2004 R: GTTGTACCGTAAACCTGCTC Jaramillo-Correa et al., 2004

F, forward; R, reverse.

3.2. MtDNA variation and species discrimination

The size of the two mtDNA markers was 729e731 (nad5 intron 1) and 379e608 bp (nad7 intron 1) in length, respectively. A total of 8 indels and 17 substitutions were identified for the concatenated mitochondrial DNA sequence, yielding 10 mitotypes (M1-M10) among the trees investigated. Most mitotypes are species-specific; however, M8 is shared between P. armandii and P. fenzeliana. The widespread P. armandii possessed four mitotypes (M6-M8, M10) while two mitotypes, M3 and M4 are fixed in P. pumila. Each of the remaining four miotypes (M1, M2, M5 and M9) is fixed for P. koraiensis, P. wallichiana, P. sibirica and P. dabeshanensis respectively. Therefore, five exampled species, P. sibirica, P. koraiensis, P. wallichiana, P. pumila and P. dabe- shanensis, can be successfully discriminated by the combination of two mtDNA fragments. Both network and Bayesian tree showed that the mitotypes (M1, M2 and M5) from P. sibirica, P. koraiensis, P. wallichiana clustered together with a moderate support (see Supporting information, Fig. S2).

4. Discussion

4.1. Interspecific relationships and allopatric divergence of the white pine species

Both sets of cytoplasmic DNA variations suggested that seven examined white pine species can be classified into two group. Both chlorotypes and mitotypes of P. dabeshanensis and P. fenzeliana were recovered from or closely related to those from P. armandii. However, those from P. sibirica, P. koraiensis, P. wallichiana and P. pumila are more closely related to one another than to those from P. armandii complex. This finding is basically consistent with the previous studies (Hernandez- Leon et al., 2013; Liu et al., 2014a; Hao et al., 2015). Both chlorotypes and mitotypes of P. dabeshanensis revealed before were also found in P. armandii by Liu et al. (2014a). However, in the present study, we surveyed each population of P. dabeshanensis and our findings suggested that the only mitotype of P. dabeshanensis distinctly difference from those found in P. armandii although the chlorotypes showed no distinct differentiation between the two species. These two species showed allopatric distribution patterns, but geological isolations are short. It is very likely that both species diverged recently or P. dabeshanensis might have derived from P. armandii, which leaded to the low genetic differentiation between them. Similarly, P. fenzeliana may have also diverged from P. armandii recently and therefore, both chlorotypes and mitotypes of P. fenzeliana were found in P. armandii as reported before (Liu et al., 2014a). It is interesting that populations from four regions of P. armandii from southeastern to western China showed great genetic differentiation, especially at the mtDNA marker, with totally different mitotypes fixed in each region (Fig. 2). Therefore, geographic isolations seem to have also contributed greatly to the intraspecfic differentiation of P. armandii. In addition, total different mitotypes and chlorotypes were fixed for each species of the other group, P. koraiensis, P. pumila, P. sibirica and P. wallichiana respectively. Because of the strong geographic isolation, these findings are not unexpected. P. sibirica and P. pumila are of sympatric distribution in southern Russia and P. koraiensis and P. pumila co-occur in southeastern Russia. Natural hybridization and gene flow between P. sibirica and P. pumila were detected in Russia (Politov et al., 1999; Vasilyeva and Semeriko, 2014). However, from the chlorotype network (Fig. 1), P. sibirica seems to be more closely related to P. koraiensis than to other two species. This is different from the interspecific relationship inferred before based on the other cpDNA sequence variations (Liu et al., 2014a). Their cpDNA phylogenetic tree suggested that P. sibirica clustered as one small clade with P. pumila while the mtDNA haplotype network showed that P. sibirica is closely related to P. wallichiana (see Supporting information, Fig. S1). Obviously, the interspecific relationships among these four species were need further in- vestigations based on more evidences in future. However, our current data together with those reported before (Liu et al., 2014a) both suggest that geographic isolations should have played an important role in driving allopatric divergences among these species. We did not calibrate these divergences due to the lack of the reliable fossils and mutation rate. However, numerous studies of species diversification for other genera suggested the rapid uplifts of the Qinghai-Tibet Plateau since the Z.-H. Li et al. / Biochemical Systematics and Ecology 61 (2015) 161e168 167

Miocene and the subsequent climate change should have caused habitat fragmentation across China, which had probably triggered allopatric speciation in numerous genera occurred there (Liu et al., 2002, 2006, 2012b, 2014b; Xu et al., 2010; Liu et al., 2013). In addition, all detected mitotypes or chlorotypes should have produced before the last glaciation maximum (LGM, around 18e21 Ka BP) of the Quaternary. Therefore, our findings based on a large scale of investigation of a group of white pine species also supported the multiple-refugia hypothesis for across China during the LGM (Liu et al., 2012b). However, as suggested before (e.g. Liu et al., 2012b; Zou et al., 2012), the Pleistocene climatic oscillations should have caused regional range retreat and expansion of these white pine species, which not only promoted allopatric diverge, but also caused their regional fixture of the monotypic mitotype (Fig. 2).

4.2. Species identification based on cytoplasmic DNA sequence variation

Both interspecific introgression and incomplete lineage sorting may have led to the shared cytoplasmic haplotypes be- tween species. Generally, introgression could be caused by successive backcross of hybrids with one of its parents, under the premise that fertile hybrids survive in the contact zones between two parental species. According to the recent simulation (Currat et al., 2008) and empirical studies (e.g. Du et al., 2009; Petit and Excoffier, 2009), genetic markers with the higher rate of gene flow can discriminate species better. In addition, the high rate of gene flow also contributes to the faster lineage sorting (into monophyly) at the species level, which may promote the maintenance of species and their cohesive evolution (Petit and Excoffier, 2009; Zhou et al., 2010; Ren et al., 2012). For conifers, cpDNA and mtDNA are inherited paternally (via pollens) and maternally (via seeds) respectively. Therefore, it is expected that the variations at the cpDNA with the higher rate of gene flow should differentiate pine species better than those from the mtDNA markers. However, our results suggested that mitotypes based on mtDNA sequence variations can diagnose the sampled species better than the chlorotypes from the cpDNA (Figs. 1 and 2). This may be due to the fact that geographic isolations between species may have promoted the faster fixture of the species-specific mitotypes during the range retreats and expansions within the historical climatic oscillations. The long-distance dispersal of pollens, however, continue to transfer chlorotypes between regions, slowing down the lineage sorting between regions and species. The genetic differentiations at both mtDNA and cpDNA markers between one group of species (P. koraiensis, P. pumila P. sibirica and P. wallichiana) with the stronger geographic isolations are much higher than those between the other group of species (P. armandii and either P. dabeshanensis or P. fenzeliana). For the latter group, it is likely that the recent speciation between them may also reduce the diagnosing powers of the cytoplasmic DNA regions. These findings suggested that species identification based on cytoplasmic DNAs may vary dependent on geographic isolations and speciation stages.

Acknowledgments

We are highly grateful for Professor Jian-Quan Liu for his direction on this work and for revising the manuscript. This study was financially co-supported by the National Natural Science Foundation of China (41101058), the Specialized Research Fund for the Doctoral Program of Higher Education (20126101120021) and the open fund of Key Laboratory of Biodiversity and Biogeography, Kunming Institute of Botany of East Asia, the Chinese Academy of Sciences (KLBB201204).

Appendix A. Supplementary data

Supplementary data related to this article can be found at http://dx.doi.org/10.1016/j.bse.2015.06.002.

References

Bandelt, H.J., Forster, P., Rohl,€ A., 1999. Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 16, 37e48. CBOL Plant Working Group., 2009. A DNA barcode for land plants. P. Natl. Acad. Sci. U. S. A. 106, 12794e12797. China Plant BOL Group, 2011. Comparative analysis of a large dataset indicates that internal transcribed spacer (ITS) should be incorporated into the core barcode for seed plants. P. Natl. Acad. Sci. U. S. A. 108, 19641e19646. Currat, M., Ruedi, M., Petit, R.J., Excoffier, L., 2008. The hidden side of invasions: massive introgression by local genes. Evolution 62, 1908e1920. Doyle, J.J., Doyle, J.L., 1987. A rapid DNA isolation procedure for small quantities of fresh leaf material. Phytochem. Bull. 19, 11e15. Du, F.K., Petit, R.J., Liu, J.Q., 2009. More introgression with less gene flow: chloroplast vs. mitochondrial DNA in the Picea asperata complex in China, and comparison with other Conifers. Mol. Ecol. 18, 1396e1407. Eckert, A.J., Bower, A.D., Jermstad, K.D., Wegrzyn, J.L., Knaus, B.J., Syring, J.V., Neale, D.B., 2013. Multilocus analyses reveal little evidence for lineage-wide adaptive evolution within major clades of soft pines (Pinus subgenus Strobus). Mol. Ecol. 22, 5635e5650. Fu, L.G., Li, N., Mill, R.R., 1999. In: Wu, C.Y., Raven, P.H. (Eds.), Flora of China, vol. 4. Science Press & Missouri Botanical Garden Press, Beijing & Missouri. Hamilton, M.B., 1999. Four primer pairs for the amplification of chloroplast intergenic regions with intraspecific variation. Mol. Ecol. 8, 513e525. Hao, Z.Z., Liu, Y.Y., Nazaire, M., Wei, X.X., Wang, X.Q., 2015. Molecular phylogenetics and evolutionary history of sect. Quinquefoliae (Pinus): implications for northern hemisphere biogeography. Mol. Phylogenet. Evol. 87, 65e79. Hernandez-Le on, S., Gernandt, D.S., Perez de la Rosa, J.A., Jardon-Barbolla, L., 2013. Phylogenetic relationships and species delimitation in Pinus section Trifoliae inferrred from plastid DNA. PLoS One 8, e70501. Hollingsworth, P.M., Forrest, L.L., Spouge, J.L., Hajibabaei, M., Ratnasingham, S., van der Bank, M., Chase, M.W., Cowan, R.S., Erickson, D.L., Fazekas, A.J., Graham, S.W., James, K.E., Kim, K.J., Kress, W.J., Schneider, H., van AlphenStahl, J., Barrett, S.C.H., van den Berg, C., Bogarin, D., Burgess, K.S., Cameron, K. M., Carine, M., Chacon, J., Clark, A., Clarkson, J.J., Conrad, F., Devey, D.S., Ford, C.S., Hedderson, T.A.J., Hollingsworth, M.L., Husband, B.C., Kelly, L.J., Kesanakurti, P.R., Kim, J.S., Kim, Y.D., Lahaye, R., Lee, H.L., Long, D.G., Madrinan, S., Maurin, O., Meusnier, I., Newmaster, S.G., Park, C.W., Percy, D.M., Petersen, G., Richardson, J.E., Salazar, G.A., Savolainen, V., Seberg, O., Wilkinson, M.J., Yi, D.K., Little, D.P., Grp, C.P.W., 2009. A DNA barcode for land plants. P. Natl. Acad. Sci. U. S. A. 106, 12794e12797. 168 Z.-H. Li et al. / Biochemical Systematics and Ecology 61 (2015) 161e168

Hollingsworth, P.M., Graham, S.W., Little, D.P., 2011. Choosing and using a plant DNA barcode. PLoS One 6, e19254. Huelsenbeck, J.P., Ronquist, F., 2001. MRBAYES: bayesian inference of phylogenetic trees. Bioinformatics 17, 754e755. Jaramillo-Correa, J.P., Bousquet, J., Beaulieu, J., Isabel, N., Perron, M., Bouille, M., 2003. Cross species amplification of mitochondrial DNA sequence tagged- site markers in conifers: the nature of polymorphism and variation within and among species in Picea. Theor. Appl. Genet. 106, 1353e1367. Jaramillo-Correa, J.P., Beaulieu, J., Bousquet, J., 2004. Variation in mitochondrial DNA reveals multiple distant glacial refugia in black spruce (Picea mariana), a transcontinental North American . Mol. Ecol. 13, 2735e2747. Kress, W.J., Wurdack, K.J., Zimmer, E.A., Weigt, L.A., Janzen, D.H., 2005. Use of DNA barcodes to identify flowering plants. P. Natl. Acad. Sci. U. S. A. 102, 8369e8374. Lahaye, R., van der Bank, M., Bogarin, D., Warner, J., Pupulin, F., Gigot, G., Maurin, O., Duthoit, S., Barraclough, T.G., Savolainen, V., 2008. DNA barcoding the floras of biodiversity hotspots. P. Natl. Acad. Sci. U. S. A. 105, 2923e2928. Librado, P., Rozas, J., 2009. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics 25, 1451e1452. Liu, J.Q., Gao, T.G., Chen, Z.D., Lu, A.M., 2002. Molecular phylogeny and biogeography of the Qinghai-Tibet plateau endemic Nannoglottis (Asteraceae). Mol. Phylogenet. Evol. 23, 307e325. Liu, J.Q., Wang, Y.J., Wang, A.L., Ohba, H., Abbott, R.J., 2006. Radiation and diversification within the Ligularia-Cremanthodium-Parasenecio complex (Asteraceae) triggered by uplift of the Qinghai-Tibetan Plateau. Mol. Phylogenet. Evol. 38, 31e49. Liu, J., Moller,€ M., Gao, L.M., Zhang, D.Q., Li, D.Z., 2011. DNA barcoding for the discrimination of Eurasian yews (Taxus L., Taxaceae) and the discovery of cryptic species. Mol. Ecol. Resour. 11, 89e100. Liu, J., Moller,€ M., Provan, J., Gao, L.M., Poudel, R.C., Li, D.Z., 2013. Geological and ecological factors drive cryptic speciation of yews in a biodiversity hotspot. New. Phytol. 199, 1093e1108. Liu, J., Provan, J., Gao, L.M., Li, D.Z., 2012a. Sampling strategy and potential utility of indels for DNA barcoding of closely related plant species: a case study in Taxus. Int. J. Mol. Sci. 13, 8740e8751. Liu, J.Q., Sun, Y.S., Ge, X.J., Gao, L.M., Qiu, Y.X., 2012b. Phylogeographic studies of plants in China: advances in the past and directions in the future. J. Syst. Evol. 50, 267e275. Liu, L., Hao, Z.Z., Liu, Y.Y., Wei, X.X., Cun, Y.Z., Wang, X.Q., 2014a. Phylogeography of Pinus armandii and its relatives: heterogeneous contributions of ge- ography and climate changes to the genetic differentiation and diversification of Chinese white Pines. PLoS One 9, e85920. Liu, J.Q., Duan, Y.W., Hao, G., Ge, X.J., Sun, H., 2014b. Evolutionary history and underlying adaptation of alpine plants on the Qinghai-Tibet Plateau. J. Syst. Evol. 52, 241e249. Petit, R.J., Excoffier, L., 2009. Gene flow and species delimitation. Trends. Ecol. Evol. 24, 386e393. Politov, D.V., Belokon, M.M., Maluchenko, O.P., Belokon, Y.S., Molozhnikov, V.N., Mejnartowicz, L.E., Krutovskii, K.V., 1999. Genetic evidence of natural hybridization between Pinus sibirica Du Tour and P. pumila (Pall.) Regel. For. Genet. 6, 41e48. Ragupathy, S., Newmaster, S.G., Murugesan, M., Balasubramaniam, V., 2009. DNA barcoding discriminates a new cryptic grass species revealed in an ethnobotany study by the hill tribes of the Western Ghats in southern India. Mol. Ecol. Resour. 9, 164e171. Ren, G.P., Abbott, R.J., Zhou, Y.F., Zhang, L.R., Peng, Y.L., Liu, J.Q., 2012. Genetic divergence, range expansion and possible homoploid hybrid speciation among pine species in Northeast China. Heredity 108, 552e562. Shaw, J., Lickey, E.B., Beck, J.T., Farmer, S.B., Liu, W., Miller, J., Siripun, K.C., Winder, C.T., Schilling, E.E., Small, R.L., 2005. The tortoise and the hare II: relative utility of 21 noncoding chloroplast DNA sequences for phylogenetic analysis. Am. J. Bot. 92, 142e166. Taberlet, P., Gielly, L., Pautou, G., Bouvet, J., 1991. Universal primers for amplification of three non-coding regions of chloroplast DNA. Plant. Mol. Biol. 17, 1105e1109. Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M., Kumar, S., 2011. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 28, 2731e2739. Vasilyeva, G., Semerikov, V., 2014. Application of amplified fragment length polymorphisms markers to study the hybridization between Pinus sibirica and P. pumila. Ann. For. Res. 57, 1e6. Xu, T.T., Abbott, R.J., Milne, R.I., Mao, K.S., Du, F.K., Wu, G.L., Ciren, Z.X., Miehe, G., Liu, J.Q., 2010. Phylogeography and allopatric divergence of cypress species (Cupressus L.) in the Qinghai-Tibetan Plateau and adjacent regions. BMC Evol. Biol. 10, 194. Yassin, A., Capy, P., Madi-Ravazzi, L., Ogereau, D., David, J.R., 2008. DNA barcode discovers two cryptic species and two geographical radiations in the invasive drosophilid Zaprionus indianus. Mol. Ecol. Resour. 8, 491e501. Zhou, Y.F., Abbott, R.J., Jiang, Z.Y., Du, F.K., Milne, R.I., Liu, J.Q., 2010. Gene flow and species delimitation: a case study of two Pine species with overlapping distributions in southeast China. Evolution 64, 2342e2352. Zou, J.B., Peng, X.L., Li, L.L., Liu, J.Q., Miehe, G., Opgenoorth, L., 2012. Molecular phylogeography and evolutionary history of Picea likiangensis in the Qinghai- Tibetan Plateau inferred from mitochondrial and chloroplast DNA sequence variation. J. Syst. Evol. 50, 341e350.