GENETIC MARKER STUDIES OF THE LARIX GMELINH COMPLEX AND THE DEVELOPMENT OF GENETIC MARKER THEORY FOR PLANT POPULATIONS
XLN-SHENG HU
A thesis submitted to the University of Edinburgh for the degree of Doctor of Philosophy
Institute of Ecology and Resource Management, University of Edinburgh
January 1998 Acknowledgements
The co-operative project between China and United Kingdom (ODA), " Early Establishment and Tree Improvement of Larix in China", provided me with a valuable opportunity to study in the University of Edinburgh, Scotland. The old, calm, handsome city with modern library is an excellent place to dig the earth of knowledge.
My supervisors Drs. Richard A. Ennos and Amanda C.M. Gillies have shown me extreme patience to guide my study throughout all stages. Dr. Ennos published an important paper that made me understand a new area in the population genetics of the plant kingdom. He used his specific method to bring me gradually into the door of population genetics. I always benefit from his stimulating discussion during the entire struggle. Dr. Gullies gave me many encouragements with careful correction and suggestions.
I deeply appreciate Dr. Nick H. Barton for helpful comments on some theoretical results.
The molecular ecology lab provides me with a friendly environment to learn new technology. Mr.J. Morman gave me much help and guidance. Dr. Eric Easton offered me and my family much help in study and life in the UK. Drs. Billy Sinclar, Anthony Langdon, Ashley Robertson and Theodore R. Allnutt were generous to let me try their methods. Miss Nicola Preston's successful help in DNA sequencing analysis brought my experiments to their conclusion.
I greatly appreciate Dr. Malcolm, Dr. Rook and Mr. W. Hogg for their frequent concern with my study. I am grateful to Profs. Shi-Ji Wang, Xiao-Shan Wang and Yi-Fan Han for much advice and encouragement. It is rather unfortunate that Prof Shi-Ji Wang has had no chance to comment on my thesis. I am greatly indebted to him for my teaching in China. Prof. Ben-Li Pan and Prof Wan-Chun Gu are appreciated for providing part of the seeds for the experiments.
To all those who are not mentioned above but gave me help in some or another, I express my sincere gratitude.
Last but not least, I thank my parents, my wife Fei Xiao and my son Yang Hu for immeasurable love and support. To them I would like to dedicate this thesis: my parents, Fei Xiao and Yang Hu.
Xin-Sheng Hu January 1998, Edinburgh
II I declare that all the work in this thesis was done purely by myself except where identified.
XIN-SHENG HU
January 1998, Edinburgh
III Abstract
The thesis is composed of two parts that are related by the theme of genetic markers. The first part involves the application of genetic markers to investigate the mating system, population genetic structure, and evolutionary relationship of the three Chinese larch taxa: Larix gmelinii, L. olgensis and L. principis-rupprechtii. The second part of the thesis explores the development of populations genetic theory that is relevant to plant populations and follows the behaviour of uniparentally as well as biparentally inherited markers.
Seventeen populations of the Larix taxa listed above were analysed using eight polymorphic allozyme markers. Results indicated that mating system was variable among taxa and
among natural populations within taxa. Outcrossing rates were tm = 0.986 ± 0.081 for one
population of L.ginelinii, tm =0.684±0.107 -4.203 ± 0.371 for six populations of L. olgensis,
and tm = 0.792 ± 0.169 -, 0.930 ± 0.149 for two populations of L. principis-rupprechtu. Population differentiation of each taxa was very small, showing that less than 2% of total genetic variation occurred among populations. Spatial distribution of genetic variation of L. g,nelinii was random, but a weak pattern of isolation by distance was detected in L. olgensis.
The genetic relationship among the three taxa elucidated by allozyme markers indicated that the genetic distances were very low between them. Nei's arithmetic genetic distance was about 0.01 between taxa and 0.002 among populations within taxa. L. gmelinii was more closely related to L. olgensis than to L. principis-rupprechtii. Analyses of PCR-RFLP and sequencing of three non-coding regions of cpDNA from genes trnL(UGU) to tm F(GAA) showed no differences at all between the three taxa. Thus it may be concluded that divergence among the three taxa has occurred within recent history. Based on morphological traits and the results obtained by allozyme and cpDNA sequence markers, it is reasonable to consider L. olgensis and L. principis-rupprechtii to be two varieties of L. gmelinii rather than two separate Larix species.
In the second part of this thesis, theories of plant population genetic structure were developed to incorporate biparentally, paternally, and maternally inherited genes into a variety of models. Population differentiation for each of the three plant genomes was
Iv formulated in the island, stepping stone and isolation by distance models of population structure. The results showed that maternally inherited organelle genes maintain larger differentiation than paternally inherited organelle genes, which in turn maintain larger differentiation than biparentally inherited nuclear genes. In the stepping-stone model, differences in genetic correlation with distance among the differently inherited genomes were conditional on the values of long and short distance migration for pollen and seeds. The relative contribution to migration of seed and pollen flow can be estimated in terms of gene frequency data or DNA sequence data. This can be carried out using Wright's F - statistics, Nei's genetic distance, and the number of segregating nucleotide sites.
When genes located on haploid genomes are under selection in a dine, results show that reparametrization may render previous dine theory suitable for plant organelle genes. One important results is that both the ratio of pollen to seed flow, and the ratio of fitnesses between paternally and maternally inherited genes play a critical role in determining dine displacement of these two types of genetic markers.
Integration of the theoretical results with practical work suggests that investigation of population structure in the L. gmelinii complex using maternally inherited markers (mtDNA markers) in addition to the nuclear markers already scored, is likely to yield the most interesting results concerning their biology, ecology and history.
V CONTENT
Acknowledgements ...... II Declaration...... m Abstract...... Iv
Chapter 1. Introduction of the Larix gmelinii complex in China ...... 1
1.1. Introduction ...... 2 1.2. Geographical distribution of Larix ...... 5 1.2.1. Distribution...... 5 1.2.2. Three Chinese larch taxa and their ecological requirements ...... 8 1.2.2.1 Larixg,nelinii ...... 8 1.2.2.2 Larix olgensis ...... 9 1.2.2.3 Larix principis-rupprechtii ...... 9 1.3. Taxonomic problems ...... 10 1.3.1. Taxonomy ...... 10 1.3.2. Genetic relationship of the three taxa ...... 12 1.4. Mating system and population structure ...... 15 1.4.1. Achievements outside China ...... 16 1.4.1.1. Mating system ...... 16 1.4.1.2. Population structure ...... 18 1.4.2. Achievements inside China ...... 20 1.4.2.1. Larixgmelinii ...... 21 1.4.2.2. Larix olgensis ...... 21 1.4.2.3. Larix principis-rupprechtii ...... 23 1.5. Objectives of this study ...... 26
Chapter 2. Use of allozymes to assess population structure and evolutionary relationships within the L. gmelinii complex 27
2.1. Introduction ...... 28 2.1.1. Use of allozyme marker in this study ...... 30 2.1.1.1. Principle ...... 30 2.1.1.2. Advantage and disadvantage of allozyme analysis ...... 30 2.1.1.3. Use of allozymes in studies of plant population structure ...... 31 2.1.1.4. Use of allozymes to study plantphylogeny ...... 33 2.1.2. Aims of present study ...... 36 2.2. Materials ...... 36 2.3. Methodology ...... 36 2.3.1. Seed preparation and enzyme extraction ...... 36 2.3.2. Buffer system and starch gel preparation ...... 38
VI
2.3.3. Electrophoresis 39 2.3.4. Scoring of gels ...... 39 2.4. Data analysis ...... 39 2.5 Results ...... 41 2.5.1. Primary screening of polymorphic markers ...... 41 2.5.2. Interpretation of banding pattern ...... 42 2.5.3. Allele frequency and polymorphism ...... 43 2.5.4. Hardy-Weinberg equilibrium ...... 51 2.5.5. Linkage disequilibrium ...... 52 2.5.6. Population differentiation ...... 55 2.5.7. Isolation by distance ...... 57 2.5.8. Genetic relationship among three taxa ...... 60 2.6. Discussion ...... 63 2.6.1. Allozyme markers ...... 63 2.6.2. Population structure and genetic conservation ...... 63 2.6.3. Genetic relationship among three taxa ...... 66 2.7. Summary ...... 68 Chapter 3 Use of allozymes to investigate the mating system of taxa within the L. gmelinii complex ...... 69
3.1 Introduction ...... 70 3.1.1. Significance of mating system ...... 70 3.1.2. Scoring of mating system ...... 71 3.1.3. Use of allozymes to study plant mating system 74 3.1.4. Aims of the chapter ...... 75 3 .2. Materials ...... 75 3.3. Methodology ...... 75 3.3.1. Seed preparation and enzyme extraction ...... 75 3.3.2. Buffer systems and starch gel preparation ...... 75 3.3.3. Electrophoresis ...... 76 3.3.4. Scoring of gel ...... 76 3.4. Data analysis ...... 76 3.5. Results ...... 77 3.5.1. Primary screening of polymorphic markers ...... 77 3.5.2. Mating system ...... 77 3.6. Discussion ...... 79 3.7. Summary ...... 84
Chapter 4. Use of chioroplast DNA to infer genetic relationships between the three Larix taxa: L. gmelinii, L. olgensis and L. principis-rupprechtii ...... 85
4.1. Introduction ...... 86 4.1.1. Use of cpDNA in studies of plant evolution ...... 86 4.1.1.1. Sequence organisation ...... 86 4.1.1.2. Features of cpDNA suitable for analysis of macroevolution ...... 88 4.1.1.3. Use of cpDNA in microevolution ...... 90 4.1.2. Aims of the present study ...... 94 4.2. Materials and methods ...... 95
VII 4.2.1. Materials 95 4.2.2. DNA extraction ...... 96 4.2.3. PCR-RFLP analysis ...... 96 4.2.3.1. PCR principle ...... 96 4.2.3.2. Setting up the PCR reaction ...... 96 4.2.3.3. Chloroplast DNA primers ...... 98 4.2.3.4. Digestion with restriction endonuclease 98 4.2.3.5. Preparation of agarose gel and electrophoresis 98 4.3. DNA sequencing ...... 98 4.4 Results ...... 99 4.4.1. PCR-RFLP ...... 99 4.4.1.1. PCR amplification ...... 99 4.4.1.2. RFLP analysis ...... 99 4.4.2 Sequencing ...... 105 4.5. Discussion ...... 111 4.6. Summary ...... 117
Chapter 5. Understanding the genetic structure of populations ...... 118
5.1. Introduction ...... 119 5.2. Theoretical considerations ...... 120 5.2.1. Island model and its variants ...... 120 5.2.1.1. The island model ...... 120 5.2.1.2. Constraints and relaxation ...... 122 5.2.1.3. Limitation to plant population ...... 122 5.2.1.4. Variants of the island model ...... 125 5.2.2. Stepping-stone model ...... 125 5.2.3. Isolation by distance and its related models ...... 128 5.2.4. Chine: a specific population genetic structure ...... 131 5.3. Testing our understanding ...... 134 5.4. Extension of the classical theories ...... 134 5.4.1. Significance in population genetics ...... 134 5.4.2. Purposes of this study ...... 137
Chapter 6. Extension to plant populations of the island
and stepping-stone models ...... 139
6.1. Introduction ...... 140 6.2. Island model ...... 141 6.2.1. Assumptions ...... 141 6.2.2. Biparentally inherited diploid genes ...... 143 6.2.3. Paternally inherited haploid genes ...... 146 6.2.4. Maternally inherited haploid genes ...... 147 6.3. Stepping-stone model ...... 147 6.3.1. Assumptions ...... 148 6.3.2. One dimensional case ...... 148 6.3.2.1. Biparentally inherited diploid genes ...... 148 6.3.2.2. Paternally inherited haploid genes ...... 153 6.3.2.3. Maternally inherited haploid genes ...... 154 6.3.3. Two dimensional case ...... 156 6.3.3.1 Biparentally inherited diploid genes ...... 156
VIII