Cenliau Thesis Final.Pdf (14.34Mb)
Total Page:16
File Type:pdf, Size:1020Kb
Application of second and third generation sequencing in pharmacogenetics A thesis submitted for the degree of Doctor of Philosophy in Pathology and Biomedical Science Yusmiati Liau University of Otago, Christchurch, New Zealand May 2020 Attribution of collaborative contributions to work in this thesis Chapter 3 The candidate carried out all laboratory work and bioinformatic analysis for nanopore sequencing with guidance from Dr Simone Cree (this laboratory). Dr Patrick A. Gladding (Theranostics Laboratory, Auckland, NZ) provided clinical samples with CYP2C19*2, CYP2C19*3, and CYP2C19*17 genotype information derived from Nanosphere Verigene® or Sequenom MassARRAY platform. Wubin Qu (iGeneTech Bioscience, Beijing, China) assisted with the primer design for multiplex assay using MFEprimer. Chapter 4 Dr Simran Maggo (this laboratory) consented all participants of the ADR biobanks. Dr Simran Maggo and Allison Miller (this laboratory) extracted the DNA samples. The candidate carried out all laboratory work and bioinformatic analysis on nanopore sequencing and some of the Sanger sequencing. Dr Simran Maggo performed some of the Sanger sequencing. Will Taylor (Department of Surgery, University of Otago, Christchurch) and Assoc. Prof John Pearson (Department of Pathology, University of Otago, Christchurch) helped with installation of several bioinformatic tools. Assoc. Prof John Pearson also helped with the statistics for detecting duplicated alleles. Chapter 5 Participants of the UDRUGS and GAARD biobank were consented by clinicians and nurses within the Clinical Pharmacology Department, University of Otago, Christchurch. Dr Simran Maggo and Allison Miller extracted the DNA samples. Dr Klaus Lehnert (School of Biological Sciences, The University of Auckland, NZ) processed the raw data and generated the final VCF files for WGS. The candidate performed all subsequent analyses using this dataset. With assistance from the candidate, Dr Kit Doudney (formerly: Canterbury Health Laboratories, Christchurch) performed array-based comparative genomic i hybridisation (aCGH). The candidate, with guidance from Dr Kit Doudney, performed the CNV data analysis for aCGH. The candidate performed all other laboratory work and data analysis. Chapter 6 Assoc. Prof Greg Jones (Department of Surgery, University of Otago, Dunedin, NZ) provided methylation and clinical data for the clinical cohort and guided the candidate in performing the data analysis for this dataset using the Qlucore software. John Wallace (Canterbury Health Laboratories) performed the ACE activity assay. The candidate prepared the DNA library for the first BSAS run, which was performed together with the library prepared by Alex Noble, an opportunity generously offered by Dr Amy Osborne from the University of Canterbury. The candidate prepared the initial library for the second BSAS run, and Dr Meik Dilcher (Canterbury Health Laboratories) performed the final step of BSAS and loading into MiSeq with the assistance from the candidate. The candidate performed data analysis for BSAS. Dr Aaron Stevens guided the candidate to analyse the methylation array data The candidate performed cell culture work and overall data analysis. ii Abstract The next generation sequencing technologies have advanced pharmacogenetics by enabling rapid assessment of the entire exome or even whole genome for associations between variants or gene regions with drug response. Third generation sequencing such as nanopore sequencing overcomes the limitations of short read sequencing by offering low capital cost and advantages in deciphering complex genes, detecting structural variants, and providing accurate haplotypes. The first aim of this thesis was to explore nanopore sequencing for pharmacogenetic implementation. The second part of this thesis applied genomic technologies to investigate genetic and epigenetic contributors to a rare but serious adverse drug reaction. At the outset of this thesis, a multiplex assay comprising common and rare genetic variants attributed to response variability to clopidogrel and warfarin was developed and validated on the nanopore sequencing technology. A one-tube multiplex PCR was employed to amplify 15 regions containing 27 variants in seven genes. Use of sample barcodes enabled inclusion of 84 samples in one sequencing run to provide a cost- effective assay. With the relatively error-prone state of the nanopore sequencing device and chemistry at the time of study, accuracy of variant genotyping relied heavily on the choice of bioinformatic tools and was sequence-dependent. Using Albacore v2 as the basecalling tool and nanopolish or Clairvoyante as variant calling tools, accuracy of ≥90% was achieved for all variants except one. Nanopolish resulted in <90% accuracy only for VKORC1 rs9923231 and Clairvoyante resulted in <90% only for CYP2C19*2 rs4244285. Accuracy did not improve by increasing read depth. This illustrated that it was possible to develop multiplexed, targeted genotyping assays on the nanopore platform, although further improvements in the platform may be required to obtain optimal accuracy. Nanopore sequencing offers great potential for genotyping complex genes such as CYP2D6, which is hindered by the very polymorphic nature of the gene, high homology with its pseudogene CYP2D7, and the occurrence of structural variants. Therefore, a second goal of this PhD was to establish and apply an effective nanopore sequencing method for CYP2D6. DNA amplicons encompassing the CYP2D6 gene from seven reference samples of known genotype (Coriell Institute, Camden, NJ, iii USA), and 25 clinical samples from adverse drug reaction cohorts were sequenced and analysed over two sequencing runs on the nanopore sequencer. All alleles were genotyped and all duplicated alleles were determined accurately. In addition, five novel variants, one novel allele, CYP2D6*127, and seven novel subvariants were identified in this study. Although the method proved efficient and generally effective, it was noteworthy that the presence of a hybrid allele CYP2D6*68 in two of the reference samples was undetected by this method. Although pharmacogenetic studies have been fruitful in elucidating the response variability of a large number of drugs, variation in response to many other drugs remains unresolved. An example is angioedema induced by angiotensin converting enzyme inhibitor (ACEi-A). This potentially fatal reaction occurs in approximately three of 1000 individuals taking the drugs. Genome-wide analysis by whole exome sequencing was performed for five DNA samples from participants having this ADR and whole genome sequencing was performed on another 15 samples. Initial analysis on 15 previously reported variants potentially associated with the ADR did not show any significant enrichment in the ACEi-A group compared to population allele frequency. Analysis on the whole exome and whole genome datasets only resulted in one rare coding variant found to be consistently enriched in both datasets, rs4365 in ACE, with allele frequency of 40% in the ACEi-A cohort compared to 3% in the population. This synonymous variant is predicted to be benign, and the possible significance of this variant to ACEi-A is yet to be elucidated. A number of moderately enriched variants were identified, however, no single rare variant with minor allele frequency of <1% was found in more than seven of the 15 whole genome sequencing datasets. Limited CNV analysis on subsets of samples did not find any variant affecting candidate genes. Further understanding of the genetic make-up of this rare ADR will likely benefit from collection of more cases and application of different approaches including analysis of combined effect of multiple variants, analysis of the ADR as complex trait, or more comprehensive analysis of structural variants. As attempts to find the genetic variants responsible for ACEi-A did not implicate any strong candidates; the potential of epigenetic regulation involvement in this ADR was explored by investigating the effect of ACEi on DNA methylation in clinical samples and cell culture model. Analysis of existing methylation array data from patient iv samples with and without ACEi consumption showed a putative differentially methylated CpG site on chromosome 10. However, this was not validated in the in vitro experiment. Methylation array data from the cell culture experiment resulted in several potential CpG sites showing differential methylation, however none of these was validated using bisulfite amplicon sequencing. Overall, this work showed little evidence of effect of ACEi on DNA methylation, and it seems unlikely that DNA methylation change by ACEi is involved in the association of angioedema with the drug. The work in this thesis applied various approaches and technologies to advance knowledge in pharmacogenetics. Nanopore sequencing was shown to be beneficial for pharmacogenetic applications and with the continual improvement in accuracy, this technology will continue to be a valuable tool, especially in elucidating complex structural variants and providing accurate haplotype information. Short read sequencing technologies for whole exome or whole genome sequencing are useful for interrogation of rare variants; however other approaches might be needed to elucidate the basis of potentially complex phenotypes such as ADR. Lastly, studies of epigenetic regulation in pharmacogenetics are still limited, and the potential role of epigenetics in ADR merits further exploration. v Acknowledgment “But the dream