Natural Variation of Large Plasmids in Bacterial Populations
Total Page:16
File Type:pdf, Size:1020Kb
NATURAL VARIATION OF LARGE PLASMIDS IN BACTERIAL POPULATIONS by LAURA E. WILLIAMS (Under the Direction of Anne O. Summers) ABSTRACT Plasmids encode an extraordinary range of adaptive functions, including antibiotic resistances, virulence factors and degradative enzymes. Plasmids also mediate horizontal gene transfer, which impacts evolution of bacterial chromosomes. Despite their important ecological and evolutionary roles, we lack a comprehensive picture of the variation and evolution of plasmids themselves. In this dissertation, I investigated the natural variation of large plasmids on two different scales: 1. individual plasmid sequences from Gram- negative and -positive bacteria and 2. total plasmid content of E. coli and Salmonella reference collections. For the sequence-based studies, I sequenced four large plasmids from E. coli, Staphylococcus and Corynebacterium. Whereas 98% of the E. coli plasmid was highly similar to other plasmid sequences, 40-60% of the three Gram-positive plasmids had no apparent similarity to known sequences. The E. coli plasmid’s backbone, which encodes “core” functions such as replication, was almost identical to that of a Salmonella enterica sv Typhimurium plasmid. Subsequent analysis identified this backbone in five other enterobacterial plasmids. All seven plasmids differed in accessory gene content, leading to variation in plasmid-encoded phenotypes. For the collection-based studies, I analyzed 228 E. coli and Salmonella strains from four reference collections. Plasmids, especially large plasmids, were abundant. Replicon typing identified two common families, IncF and IncI1, by the presence of characteristic backbone genes. At least 20% of strains with large plasmids in each collection were untypable, suggesting variation in plasmid backbones. Restriction fragment length polymorphism (RFLP) analysis showed high variation in large plasmid genomes. The majority of E. coli and Salmonella large plasmids (87.5% and 56%, respectively) had unique RFLP patterns. The lack of widespread RFLP patterns suggested that the prevalence of IncF and IncI1 is due to similar backbone genes on otherwise different plasmids. The only exception was the IncFII virulence plasmid pSLT, which was detected in multiple Typhimurium strains from different hosts and geographic locations, suggesting that virulence plasmids evolve differently from other plasmids. These results show that plasmid evolution is a significant force shaping the horizontal gene pool. Rather than existing as static elements, plasmids are dynamic and flexible genetic scaffolds driving gene flux in host bacteria. INDEX WORDS: plasmid evolution, horizontal gene transfer, mobile genetic element NATURAL VARIATION OF LARGE PLASMIDS IN BACTERIAL POPULATIONS by LAURA E. WILLIAMS B.S., Illinois Wesleyan University, 2001 A Dissertation Submitted to the Graduate Faculty of The University of Georgia in Partial Fulfillment of the Requirements for the Degree DOCTOR OF PHILOSOPHY ATHENS, GEORGIA 2009 © 2009 Laura E. Williams All Rights Reserved NATURAL VARIATION OF LARGE PLASMIDS IN BACTERIAL POPULATIONS by LAURA E. WILLIAMS Major Professor: Anne O. Summers Committee: Jessica Kissinger Mark Schell Patricia Sobecky Electronic Version Approved: Maureen Grasso Dean of the Graduate School The University of Georgia May 2009 DEDICATION Dedicated to Bram Tucker and Joan Williams. They made a hard journey easier. iv ACKNOWLEDGEMENTS Most importantly, I would like to thank my advisor Anne Summers for guiding me through my time here at UGA with constant enthusiasm. I admire how she fully integrates her students into the scientific community, which is one of the most important but intangible purposes of graduate school. From the beginning, she included me as a full participant in interactions with our collaborators and faculty colleagues at UGA and elsewhere. I felt very welcomed into the community of scientists, and I am grateful to her for that. My committee members Jessica Kissinger, Mark Schell and Patricia Sobecky have given me valuable help and advice during this journey. They have been supportive and patient, and I am very glad to have benefited from their expertise throughout my graduate work. I am indebted to a number of incredible mentors, including Beth Arthington- Skaggs, David Bollivar, Marcio Chedid, Jonathan Dey, Don LeBlanc, Jo Porter, Slava Titov, and Chris Zook. Each of them played an important role in getting me to this point, and I hope that their experiences were as rewarding as mine. I would also like to thank my fellow graduate students at UGA, especially Caran Cagle, Adrienne Cottrell, Erinn Howard, Bijal Patel and Julie Shearer. I wish that Bijal was still here to see his own work through to the end, but I imagine that he has all the answers now. v Joy Wireman deserves special recognition. It is easy to forget how much work it takes to keep things running smoothly, and Joy is a true master. Beyond that, she is a thoughtful and accomplished researcher. If I ever have a lab of my own, I’ll be lucky to find someone half as good as Joy. Balance in my life has always kept me sane, and during my time in Athens I have been blessed with a truly remarkable group of friends. The GSYPF crew, with special mention to Christy and Paul, and the Killer Tomahtoes tennis team have been a constant and delightful source of hilarity and friendship. I know that I will miss them when I move on. vi TABLE OF CONTENTS Page ACKNOWLEDGEMENTS.................................................................................................v LIST OF TABLES............................................................................................................. ix LIST OF FIGURES .............................................................................................................x CHAPTER 1 Introduction and Literature Review...................................................................1 Genetic Structure of Plasmids .......................................................................3 Backbone Systems.........................................................................................4 Accessory Genes .........................................................................................21 Plasmid Ecology and Evolution ..................................................................23 Objectives of the Dissertation .....................................................................35 References ...................................................................................................40 2 Facile Recovery of Individual High-Molecular-Weight, Low-Copy- Number Natural Plasmids for Genomic Sequencing.......................................63 Abstract .......................................................................................................64 Introduction .................................................................................................65 Materials and Methods ................................................................................67 Results .........................................................................................................72 Discussion ...................................................................................................79 References ...................................................................................................92 vii 3 Large Plasmids in Escherichia coli and Salmonella Reference Collections Are Highly Variable Despite Common Replicon Families .............................97 Abstract .......................................................................................................98 Introduction .................................................................................................99 Materials and Methods ..............................................................................102 Results .......................................................................................................109 Discussion .................................................................................................126 References .................................................................................................150 4 Conclusions....................................................................................................159 Update of Sequence Analysis from Chapter Two.....................................160 Broad Implications of Dissertation Research............................................165 Future Research.........................................................................................170 References .................................................................................................173 viii LIST OF TABLES Page Table 2.1: Bacterial strains and plasmids ..........................................................................83 Table 2.2: Sequencing coverage resulting in single, circularized contigs .........................84 Table 2.3: Highlights of the chimeric character of novel plasmid genomes .....................85 Table 3.1: Plasmid prevalence in reference collections of natural E. coli and Salmonella.......................................................................................................137 Table 3.2: Plasmid replicon families in reference collections of natural E. coli and Salmonella and in NCBI GenBank ................................................................138 Table 3.3: Summary of plasmid clusters by RFLP analysis ............................................139 Table 3.4: