The Evolution of Recombination and Genomic Structures: a Modeling Approach

The evolution of recombination and genomic structures: a modeling approach. Alexandra Popa To cite this version: Alexandra Popa. The evolution of recombination and genomic structures: a modeling approach.. Bioinformatics [q-bio.QM]. Université Claude Bernard - Lyon I, 2011. English. tel-00750370 HAL Id: tel-00750370 https://tel.archives-ouvertes.fr/tel-00750370 Submitted on 9 Nov 2012 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. N◦ d'ordre: Année2011 THESE présentée devant l'UNIVERSITE CLAUDE BERNARD - LYON I pour l'obtention du DIPLOME DE DOCTORAT (arrêtédu 7 août 2006) par Alexandra Mariela POPA The evolution of recombination and genomic structures: a modeling approach. Directeur de thèse: Christian GAUTIER Co-directrice de thèse: Dominique MOUCHIROUD JURY: Laurent DURET Président du jury Christian GAUTIER Directeur Sylvain GLEMIN Rapporteur Christine MEZARD Rapporteur Dominique MOUCHIROUD Directrice Matthew WEBSTER Rapporteur ii UNIVERSITE CLAUDE BERNARD - LYON 1 Président de l’Université M. A. Bonmartin Vice-président du Conseil d’Administration M. le Professeur G. Annat Vice-président du Conseil des Etudes et de la Vie Universitaire M. le Professeur D. Simon Vice-président du Conseil Scientifique M. le Professeur J-F. Mornex Secrétaire Général M. G. Gay COMPOSANTES SANTE Faculté de Médecine Lyon Est – Claude Bernard Directeur : M. le Professeur J. Etienne Faculté de Médecine et de Maïeutique Lyon Sud – Charles Directeur : M. le Professeur F-N. Gilly Mérieux UFR d’Odontologie Directeur : M. le Professeur D. Bourgeois Institut des Sciences Pharmaceutiques et Biologiques Directeur : M. le Professeur F. Locher Institut des Sciences et Techniques de la Réadaptation Directeur : M. le Professeur Y. Matillon Département de formation et Centre de Recherche en Biologie Directeur : M. le Professeur P. Farge Humaine COMPOSANTES ET DEPARTEMENTS DE SCIENCES ET TECHNOLOGIE Faculté des Sciences et Technologies Directeur : M. le Professeur F. Gieres Département Biologie Directeur : M. le Professeur F. Fleury Département Chimie Biochimie Directeur : Mme le Professeur H. Parrot Département GEP Directeur : M. N. Siauve Département Informatique Directeur : M. le Professeur S. Akkouche Département Mathématiques Directeur : M. le Professeur A. Goldman Département Mécanique Directeur : M. le Professeur H. Ben Hadid Département Physique Directeur : Mme S. Fleck Département Sciences de la Terre Directeur : Mme le Professeur I. Daniel UFR Sciences et Techniques des Activités Physiques et Sportives Directeur : M. C. Collignon Observatoire de Lyon Directeur : M. B. Guiderdoni Ecole Polytechnique Universitaire de Lyon 1 Directeur : M. P. Fournier Ecole Supérieure de Chimie Physique Electronique Directeur : M. G. Pignault Institut Universitaire de Technologie de Lyon 1 Directeur : M. le Professeur C. Coulet Institut de Science Financière et d'Assurances Directeur : M. le Professeur J-C. Augros Institut Universitaire de Formation des Maîtres Directeur : M. R. Bernard 2 iii Abstract Meiotic recombination plays several critical roles in molecular evolution. First, recombination represents a key step in the production and transmission of gametes during meiosis. Second, recombination facilitates the impact of natural selection by shuffling genomic sequences. Furthermore, the action of certain repair mechanisms during recombination affects the frequencies of alleles in populations via biased gene conversion. Lately, the numerous advancements in the study of recombination have unraveled the complexity of this process regarding both its mechanisms and evolution. The main aim of this thesis is to analyze the relationships between the different causes, characteristics, and effects of recombination from an evolutionary perspective. First, we developed a model based on the control mechanisms of meiosis and inter-crossover interference. We further used this model to compare the recombination strategies in multiple vertebrates and invertebrates, as well as between sexes. Second, we studied the impact of the sex-specific localization of recombination hotspots on the evolution of the GC content for several vertebrates. Last, we built a population genetics model to analyze the impact of recombination on the frequency of deleterious mutation in the human population. Résumé La recombinaison méiotique joue un double rôle de moteur évolutif en participant àla création d'une diversitégénétique soumise àla sélection naturelle et de contrôle dans la fabrication des gamètes lors de la méiose. De plus, en association avec certains mécanismes de réparation, la recombinaison, au travers de la conversion génique biaiséemanipule les fréquences alléliques au sein des populations. Les connaissances sur le fonctionnement même des ce processus ont considérablement augmentées ces dernières années faisant découvrir un processus complexe, autant dans son fonctionnement que dans son évolution. Le thème général de la thèseest l'analyse, dans un contexte évolutif, des relations entre les différent rôles et caractéristiques fonctionnelles de la recombinaison. Un modèle de la recombinaison prenant en compte des contraintes liéesau contrôle de la méiose et le phénomène d'interférence a permis une comparaison entre espècesau sein des vertébrés et des invertébrésde même qu'un comparaison entre sexes. Par ailleurs, nous avons montrél'impact de la localisation spécifique aux sexes des points chauds de recombinaison sur l'évolution du contenu en GC des génomes de plusieurs vertébrés. Finalement, nous proposons un modèle àl'échelle de la génétique des populations, permettant d'analyser l'impact de la recombinaison sur la fréquences de mutations délétères dans les populations humaines. Cette thèse,nous l'espérons, apportera sa pierre àl'étude interdisciplinaire de la recombinaison, àla fois au sein de la biologie et par ses relations au travers de la modélisation avec l'informatique et les mathématiques. iv Notations General abbreviations A Adenine bp base pair C Cytosine CpG A dinucleotide CG, p standing for a phosphate link. DNA Deoxyribonucleic acid G Guanine Gb Giga base kb kilo base Mb Mega base Myr Million years Ne effective population size SNP single nucleotide polymorphism T Thymine TSS Transcription Start Site Meiosis and recombination-related abbreviations CE Central Element (referring to the SC) cM centimorgan CO Crossover COI CO Interference COR Crossover Rate dHJ double Holliday Junction DSB Double-Strand Break DSBh Double-Strand Break hotspot DSBR Double-Strand Break Repair model dsDNA double stranded DNA F1 First generation of offspring in a crossing experiment F2 Second generation of offspring in a crossing experiment F/M Female/Male ratio HapMap1, 2, and 3 the 1st, 2nd, and 3rd respective phases of HapMap Project HJ Holliday Junction HR Homologous Recombination v vi HS Heterogeneous Stock (for mouse populations) LD Linkage Disequilibrium LE Lateral Element (referring to the SC) NAHR Nonallelic Homologous Recombination NCO Non-crossover NCOR Non-crossover Rate NE Nuclear Envelope NHEJ Nonhomologous End Joining PC Pairing Centres rDNA ribosomal DNA RI Recombinant Inbred lines (in a crossing experiment) SC Synaptonemal Complex SDSA Synthesis-Dependent Strand-Annealing model SEI Single End Invasion ssDNA single stranded DNA TF Transverse Filaments (referring to the SC) Mathematical symbols C coefficient of coincidence C3 Three-point coefficient of coincidence. D0 the difference between the frequency of a two locus haplotype and the product of the component alleles, divided by the most extreme possible value, given the marginal allele frequencies, measure of LD g genetic distance I Identity matrix m In the counting models, m stands for the number of NCO events that separate two consecutive COs. It is a measure of the strength of interference P Physical length (Mb) of an interval or chromosome p The fraction of COs that are not subject to interference under the two-pathway model Housworth and Stahl (2003). Q the substitution matrix along a branch of a phylogenetic tree R frequency of recombinants among the offspring r2 correlation of alleles at different loci, measure of LD y The mean number of DSB events in the counting model of Foss et al. (1993). # Number χ2 Chi-square distribution Γ Gamma distribution λ The rate parameter for Γ ν The shape parameter for Γ vii σ0 Uniform basal tensile stress in the mechanical stress model of Kleckner et al. (2004). ⊗ Tensor product of matrices BGC abbreviations BGC Biased Gene Conversion BER base excision repair gBGC GC Biased Gene Conversion GC* equilibrium or stationary GC-content MMR mismatch repair Other abbreviations AIC Akaike Information Criterion BIC Bayesian Information Criterion CEPH Centre d'Etude du Polymorphism Humain CI Confidence Interval DAF Derived Allele Frequency DT Distance to Telomeres HGMD Human Gene Mutation Database H-W test Hotteling-William's t-test L Likelihood LCR Low Copy Repeat LDT Log Distance to Telomeres LINE Long interspersed nuclear element LOD Logarithm of Odds MHC Major Histocompatibility Complex PAR Pseudoautosomal Region PCR Polymerase Chain Reaction RE Repetitive Element SINE Short Interspersed Nuclear Element TE Transposable Element viii Definitions Information assembled from Stumpf and McVean (2003); Arnheim et al. (2007); Lynch (2007); Paigen and Petkov (2010); DB-NCBI Allele One of the variant forms of a DNA sequence at a particular locus, or location, on a chromosome. Backcross Crossing experiment in which individuals in the first generation are crossed back with one or both their parents to obtain the second generation of offspring. Bouquet formation The clustering of telomeres together on the nuclear membrane early in meiosis.

The Evolution of Recombination and Genomic Structures: a Modeling Approach

Laboratory III Linkage

Revise Meiosis Process and the Relation Between Mendel Laws and Meiosis Before Proceeding with This Lecture

Issue 85 of the Genetics Society Newsletter

Linkage, Recombination, and Eukaryotic Gene Mapping 163

William Ernest Castle

Discovery of the Sex-Linked Traits. in 1910, Morgan Published Details of His Research in an Article Titled “Sex Limited Inheritance in Drosophila"

7 Linkage, Recombination, and Eukaryotic Gene Mapping

The Birth of Classical Genetics As the Junction of Two Disciplines: Conceptual Change As Representational Change Marion Vorms

Genetic Linkage and Crossing Over

Linkage, Recombination and Eukaryotic Gene Mapping

Linkage and Crossing Over

Linkage, Recombination, and Eukaryotic Gene Mapping 109 Located at the Same Position, Or Locus, on Each of the Two Experiment Homologous Chromosomes