UCC Library and UCC Researchers Have Made This Item Openly Available. Please Let Us Know How This Has Helped You. Thanks! Downlo
Total Page:16
File Type:pdf, Size:1020Kb
UCC Library and UCC researchers have made this item openly available. Please let us know how this has helped you. Thanks! Title Bacterial inhabitants of tumours: methods for exploration and exploitation Author(s) Walker, Sidney P. Publication date 2020-05-04 Original citation Walker, S. P. 2020. Bacterial inhabitants of tumours: methods for exploration and exploitation. PhD Thesis, University College Cork. Type of publication Doctoral thesis Rights © 2020, Sidney P. Walker. https://creativecommons.org/licenses/by-nc-nd/4.0/ Item downloaded http://hdl.handle.net/10468/10055 from Downloaded on 2021-10-08T08:21:36Z Coláiste na hOllscoile, Corcaigh THE NATIONAL UNIVERSITY OF IRELAND, CORK School of Microbiology Bacterial inhabitants of tumours: Methods for exploration and exploitation Thesis presented by Sidney Walker Under the supervision of Dr. Mark Tangney and Dr. Marcus Claesson For the degree of Doctor of Philosophy 2016-2020 1 Contents Abstract .................................................................................................................................................. 3 Chapter I: Literature Review .............................................................................................................. 7 Section 1: Sequencing and the Microbiome ........................................................................ 7 The Microbiome ............................................................................................................... 8 Sequencing ..................................................................................................................... 10 Analysis of Bacterial Sequence Data ............................................................................. 16 Section 2: Characterising intratumoural bacteria ............................................................... 30 Section 3: Utilising bacteria for therapeutic intervention .................................................. 54 Microbiome research as an R&D tool ............................................................................ 55 In silico platforms for protein analysis and design ........................................................ 59 Predicting protein function............................................................................................. 64 Conclusion ..................................................................................................................... 70 Chapter II: Assessing bioinformatic amplicon sequencing contamination control strategies via mock bacterial communities ........................................................................................................... 82 Chapter III: Bacteria in Breast Tumours .......................................................................................... 110 Chapter IV: Development of novel methodology for study of bacterial DNA from FFPE samples ................................................................................................................................................ 142 Chapter V: Microbiome analysis as a platform R&D tool for parasitic nematode disease management ........................................................................................................................................ 179 Chapter VI: Discussion and future perspectives ............................................................................... 232 Appendix I: Function2Form Bridge – Towards synthetic protein holistic performance- prediction ............................................................................................................................................ 238 2 Declaration This is to certify that the work I am submitting is my own and has not been submitted for another degree, either at University College Cork or elsewhere. All external references and sources are clearly acknowledged and identified within the contents. I have read and understood the regulations of University College Cork concerning plagiarism. ___________________ Sidney Walker 3 Abstract The presence of bacteria in patient tumours of various types has been reported by numerous groups since 2014, but the findings of these and many similar studies remain contentious. Tumour samples provide many obstacles to carrying out robust and reliable microbial surveys, primarily the anticipated low biomass of these samples, which leaves them vulnerable to environmental contamination. While the debate over the presence or absence of bacterial communities in these tumours continues, it impedes any research into how such bacteria might be utilised in medicine. Larger sample numbers are required, from diverse tumour tissues within the human body, and these must be analysed in a reproducible and accurate manner to allow for the drawing of definitive conclusions in this debate. To accommodate this requirement, the primary methodological aspects of this thesis were: i) The assembly and validation of a contamination control pipeline using recent advances is bioinformatic contamination control detection. ii) The development and validation of a bacterial DNA extraction protocol for formalin fixed, paraffin embedded (FFPE) samples, with accompanying FFPE biological standards for use as controls. A key aim of this thesis was to increase the accuracy and reproducibility of research into clinical tissue biopsies by eliminating the role of contamination, and to expand the applicability of FFPE tissues which represent an invaluable resource of samples for analysis. In this thesis, ecological surveys of a variety of related environments were conducted with the common goal of characterising a detectable bacterial community and identify potential bacterial biomarkers unique to these host environments. Regardless of whether or not a consistently present and detectable tumour microbiome exists, tumours possess several phenotypes making them hospitable environments for bacteria to colonise. Where the unique physiology of tumours is seen as an obstacle for traditional cancer treatments, they represent an opportunity for bacterial-mediated solutions. Therefore, findings from sequencing-based research of host environments have potential to be translated into the use of administered bacteria as delivery vehicles to locally produce biomolecules. 4 There are two considerations in this context, requiring two very different applications of bioinformatics. i) The first is to identify which bacteria colonize the desired niche in body; this can be a ‘foreign’ body such as a tumour (Chapter 3 and 4), or parasite (Chapter 5), or a distal niche such as the gut. ii) The second, often under-considered parameter, relates to what these bacteria produce. Synthetic biology presents enormous scope for sophisticated medical therapy mediated by novel synthetic proteins. However, the task of getting a bacterial cell to successfully express and secrete a stable protein that it does not produce naturally is far from trivial, and is becoming a key aspect of the synthetic biology field. To facilitate this synthetic protein aspect, a novel strategy for the performance prediction of designed protein constructs was developed. This tool was able to predict the overall performance of a protein construct in vitro using only in silico derived data. Thesis aim: This thesis aimed to develop novel strategies for the analysis of bacterial communities within tumours by i) increasing the sample sizes available to future projects by enabling the use of FFPE samples and ii) improving the accuracy of analysis by designing bioinformatics analysis pipelines appropriate for these samples. This enabled further research concerned with finding differentially present taxa between the tumour and surrounding environment, which have the potential for use as therapeutic vectors. As the key aim is to establish the presence of potential bacterial therapeutic vectors rather than to establish the role these bacteria play in tumorigenesis, this approach is easily translatable to other foreign bodies, and could therefore be validated in a parasitic nematode model. 5 GLOSSARY: 16S ribosomal RNA gene: Transcribed form of the small subunit gene located in the 30S subunit of a prokaryotic ribosome. Contains 9 variable regions that can be targeted for amplification and used for profiling of microbial communities. ASV: Amplicon Sequence Variant. Refers to individual DNA sequences recovered from high throughput marker gene sequencing following the removal of spurious sequences. BER: Base Excision Repair DADA2: Divisive amplicon denoising algorithm FFPE: Formalin-Fixed, Paraffin-Embedded LASSO: Least absolute shrinkage and selection operator MICROBIOME: The genetic material of all microorganisms that live on or in an ecological niche, such as the human body. MICROBIOTA: The ecological community of commensal, symbiotic and pathogenic microorganisms found in and on all multicellular organisms. NGS: Next Generation Sequencing NMR: Nuclear magnetic resonance OTU: Operational Toxonomic Unit. Cluster of sequences with similarity above a specified threshold, eg. 97%. PCR: Polymerase Chain Reaction qPCR: Quantitative Polymerase Chain Reaction QIIME: Quantitative insights into microbial ecology READS: DNA segments obtained from a sequencing experiment. SNP: Single Nucleotide Polymorphism. VARIANT CALLING: Process of identifying variants between closely related sets of sequence data, typically taking the form of SNPs. WGS: Whole Genome Sequencing 6 Chapter I Literature Review SECTION 1: SEQUENCING AND THE MICROBIOME A portion