UNIVERSIDAD POLITÉCNICA DE MADRID ESCUELA TÉCNICA SUPERIOR DE INGENIERÍA AGRONÓMICA, ALIMENTARIA Y DE BIOSISTEMAS

GRADO EN BIOTECNOLOGÍA

Validación del polimorfismo rs2395029 del gen HCP5 para la predicción de hipersensibilidad a abacavir.

Validation of HCP5 rs2395029 polymorphism as a biomarker for abacavir hypersensitivity prediction. .

TRABAJO FIN DE GRADO

Autor: Gonzalo Villapalos García

Tutoras: Miriam Saiz Rodríguez y Eva Miedes Vicente

Junio de 2019

UNIVERSIDAD POLITÉCNICA DE MADRID ESCUELA TÉCNICA SUPERIOR DE INGENIERÍA AGRONÓMICA, ALIMENTARIA Y DE BIOSISTEMAS

GRADO EN BIOTECNOLOGÍA

VALIDACIÓN DEL POLIMORFISMO RS2395029 DEL GEN HCP5 PARA LA PREDICCIÓN DE HIPERSENSIBILIDAD A ABACAVIR

VALIDATION OF HCP5 RS2395029 GENE POLYMORPHISM AS A BIOMARKER FOR ABACAVIR HYPERSENSITIVITY PREDICTION

TRABAJO FIN DE GRADO Gonzalo Villapalos García

MADRID, 2019

Directora: Dra. Miriam Saiz Rodríguez Servicio de Farmacología Clínica. Hospital Universitario de La Princesa

Profesora: Dra. Eva Miedes Vicente Dpto. de Biotecnología-Biología Vegetal

i

VALIDACIÓN DEL POLIMORFISMO RS2395029 DEL GEN HCP5 PARA LA PREDICCIÓN DE HIPERSENSIBILIDAD A ABACAVIR

VALIDATION OF HCP5 RS2395029 GENE POLYMORPHISM AS A BIOMARKER FOR ABACAVIR HYPERSENSITIVITY PREDICTION

Memoria presentada por Gonzalo Villapalos García para la obtención del título de Graduado en Biotecnología por la Universidad Politécnica de Madrid

Fdo: Gonzalo Villapalos García

VºBº Tutor y directores del TFG Directora: Miriam Saiz Rodríguez

Profesora: Eva Miedes Vicente

Dpto de Biotecnología-Biología Vegetal ETSIAAB – Universidad Politécnica de Madrid

Madrid, 25, julio, 2019

ii

INDEX

I. INTRODUCTION. 1

1. HUMAN IMMUNODEFICIENCY TREATMENT. 1 2. ABACAVIR HYPERSENSITIVITY REACTION. 2 3. PHARMACOGENETICS. 3 4. HLA-B*57:01 BIOMARKER: TRADITIONAL GENOTYPING AND ALTERNATIVE METHODS. 4 5. HOSPITAL DE LA PRINCESA HLA-B*57:01 DETERMINATION PROCESS. 6 6. SINGLE NUCLEOTIDE POLYMORPHISM GENOTYPING TECHNIQUES. 6 7. HLA COMPLEX P5 AND LINKAGE DISEQUILIBRIUM. 7

II. OBJECTIVES OF THE STUDY. 8

III. MATERIALS AND METHODS 9

1. STUDY POPULATION 9 2. DNA EXTRACTION USING MAGNA PURE SYSTEM 9 3. REAL TIME PCR (QPCR) USING LIGHTSNIP® TECHNOLOGY 9 4. SEQUENCE-SPECIFIC OLIGONUCLEOTIDE REVERSE HYBRIDIZATION: AUTOLIPA. 10 5. SANGER SEQUENCING 11 6. CNV ASSAY. 11 7. STATISTICAL ANALYSIS. 12

IV. RESULTS. 13

1. HCP5 GENOTYPING AND CORRELATION WITH HLA-B*57:01. 13 2. HCP5 CNV ANALYSIS. 15 3. ECONOMIC AND PRACTICAL COMPARISON BETWEEN HLA-B DIRECT GENOTYPING AND HCP5 RS2395029 USED AS BIOMARKER. 16

V. DISCUSSION. 18

REFERENCES. 21

iii

TABLE INDEX

TABLE 1. DRUG ADVERSE RESPONSE ASSOCIATED WITH HLA-B ALLELES.. 4

TABLE 2. CORRELATION BETWEEN HLA-B*57:01 AND HCP5 RS2395029. 14

TABLE 3. GENOTYPE FOR SINGLE PEAK FOR HCP5 RS2395029 G ALLELE AND THEIR CORRESPONDING HLA-B GENOTYPE. 15

TABLE 4. HS03608991_CN AND HS03593070_CN COPY NUMBER AND CONFIDENCE OF THE CALCULOUS...... 16

TABLE 5. TIME RESPONSE COMPARISON BETWEEN HCP5 RS2395029 AND HLA-B METHODS. .………….. ..16

TABLE 6. COSTS COMPARISON BETWEEN HCP5 RS2395029 AND HLA-B METHODS...... 17

FIGURE INDEX

FIGURE 1. HIV STRUCTURE AND COMPONENTS. MAIN TARGETS FOR ANTI-RETROVIRAL THERAPY ARE

HIGHLIGHTED IN RED. 1

FIGURE 2. A. ABACAVIR AND B. CARBOVIR 5’ TRIPHOSPHATE. 3

FIGURE 3. HLA NOMENCLATURE. 5

FIGURE 4. FRET PROBES MECHANISM OF ACTION (GREEN AND PURPLE). F: FLUORESCEIN; L: LIGHTCYCLER®

RED; P: PHOSPHATE. 9

FIGURE 5. WORKFLOW TO PREDICT ABC-HSR SUSCEPTIBILITY. BLUE PATH REPRESENTS PROCESS FOLLOWED

FOR EVERY PROBE. ORANGE PATH REPRESENTS PROCESS FOR HLA-B*57 POSITIVES OBTAINED AT THE END OF

BLUE PATH. 13

FIGURE 6. FRET PROBES-BASED GENOTYPING FOR HCP5 RS2395029 MELTING PEAKS. SAMPLES OBSERVED

GENOTYPE WERE: T/G HETEROZYGOTIC (PINK) AND G/G HOMOZYGOTIC (BLACK). 14

FIGURE 7. SAMPLE COPY NUMBER FOR SEQUENCES TARGETED BY HS03593070_CN AND HS03608991_CN

PROBES. BOLD SAMPLES CORRESPOND TO HCP5 RS2395029 SINGLE G ALLELE SIGNAL. 15

iv

ABREVIATIONS ABC Abacavir ABC-HSR Abacavir hypersensitivity reaction ADME Absorption, distribution, metabolization and excretion ADR Adverse drug response AIDS Acquired immunodeficiency syndrome BCI 5-Bromo-4-Chloro-3-Indolyl BCIP 5-Bromo-4-Chloro-3-Indolyl Phosphate CBV-TP Carbovir triphosphate CNV Copy number variation dNTP Deoxyribonucleotide triphosphate ddNTP Dideoxyribonucleotide triphosphate DNA Deoxyribonucleic acid ELISA Enzyme linked immunosorbent assay FN False negatives FP False positives FRET Föster resonance energy transfer G Guanine HAART High activity antiretroviral therapy HCP5 HLA Complex P5 HIV Human immunodeficiency virus HLA Human leukocyte antigen LD Linkage disequilibrium MHC Major histocompatibility complex NBT Nitro blue tetrazolium chloride NRTI nucleotide analogue reverse-transcriptase inhibitors PCR Polymerase chain reaction qPCR real time PCR RNA ribonucleic acid SNP Single nucleotide polymorphism SSO-PCR Sequence-specific oligonucleotide PCR SSP-PCR Sequence-specific primer PCR

v T Thymine Tm Melting temperature

TN True negatives

TP True positives

vi

Abstract

Abacavir is one of the most widely used drugs in the treatment of HIV. However, 4% of patients taking abacavir suffer from a life-threatening hypersensitivity reaction. Multiple research groups have confirmed the association between hypersensitivity and a particular genotype of a component of the major human histocompatibility complex type I: the HLA- B*57:01 allele. Predictive tests are, therefore, performed on all HIV-positive patients prior to abacavir treatment, as recommended by the drug label. The procedure used at Hospital Universitario de La Princesa to determine the HLA-B*57:01 genotype consists of inverse hybridization prior to sequencing. This technique determines the HLA-B allele but lacks the resolution necessary to discriminate the HLA-B*57:01 subvariant. Thus, the rest of the HLA-B genotypes other than HLA-B*57:01 are discriminated, for which abacavir can be administered. In patients carrying the HLA-B*57 allele, a second determination is made by PCR and Sanger sequencing to confirm the existence of HLA-B*57:01. Therefore, the current procedure is technically complex, costly and involves a moderate response time; therefore, substitution by another method, such as genotyping using hybridization probes, will shorten response times and reduce costs. The objective of this study is to validate a method based on the analysis of a theoretically perfect linkage disequilibrium between HLA-B*57:01 and the G allele of the rs2395029 polymorphism in the HLA complex P5 gene (HCP5), to be used as a biomarker predictive for abacavir hypersensitivity. From 1225 patients genotyped for HLA-B since 2008, 49 patients positive and 177 negatives for HLA-B*57:01 were genotyped by quantitative PCR with allele-specific hybridization probes for polymorphism rs239529. Specificity and sensitivity values were 100%, which 95% confidence intervals were 93−100% and 98−100% respectively. Positive predictive value was estimated as 84.4%, which 95% confidence interval was 48.1−93.9%. The negative predictive value was estimated as 99.9%; likewise, when setting the confidence level at 95%, the confidence level was 99.4−100%. A quantitative study of HCP5 was also performed, since this gene is a copy number variation zone, so we studied whether 5 patients apparently homozygous for the allele G of HCP5 but heterozygous for HLA-B had the most frequent deletions described for this zone. The results showed that they do not have such deletions. Also, economic and practical comparison between both methods resulted in an improvement in time and cost if the HCP5 rs2395029 method is incorporated. In conclusion, HCP5 genotyping is a viable alternative method for predicting hypersensitivity to abacavir, being able to replace the conventional HLA-B*57:01 typing.

vii

0 I. INTRODUCTION

I. Introduction

1. Human Immunodeficiency Virus treatment The Human Immunodeficiency Virus (HIV) (Figure 1) is categorized into the genus within the family of Retroviridae, subfamily . Its genome consists of two identical single-stranded RiboNucleic Acid (RNA) molecules enclosed within the virus core. Once these RNA particles are reverse-transcripted by an enzyme called reverse-transcriptase into a DeoxyRibonucleic Acid (DNA) molecule, the viral genome is integrated in the human genome by an integrase enzyme. Afterwards, the human cells start expressing viral from the integrated DNA and continuing the HIV lifecycle. HIV can infect humans via intact mucous membranes, injured skin or mucosa and by parenteral inoculation. HIV attaches first to dendritic cells or macrophages/monocytes and, when exposed to blood cells, can result in the direct infection of T helper cells (1,2).

Protein g120 Matrix

Protein g41 Viral genome (RNA)

Protease

Nucleocapside Reverse-transcriptase

Capside Integrase

Lipidic membrane

Figure 1. HIV structure and components. Main targets for anti-retroviral therapy are highlighted in red.

HIV infection is responsible for one of the most extended diseases in the world, infecting around 37 million people. If not treated, HIV can produce the Acquired ImmunoDeficiency Syndrome (AIDS), causing progressive degeneration and failure of the . It may eventually cause death by opportunistic infections. It emerged as a mortal pandemic disease in the 1980s. Nevertheless, thanks to prevention and medical advances, such as earlier diagnosis and anti-retroviral treatment, it has become a chronic disease. Although mortality and morbidity have been considerably reduced, the number of new cases and not diagnosed individuals are still considerable (2,3).

The treatment for the HIV infection is based on the usage of a variety of drugs that target different viral molecules. Attacking multiple targets of the virus life-cycle enhance the

1 I. INTRODUCTION treatment efficacy (3). This combination therapy is called High Activity Antiretroviral Therapy (HAART), which consists in the combination of three anti-retroviral drugs. It includes two Nucleoside analogue Reverse-Transcriptase Inhibitors (NRTI), namely ABaCavir (ABC), tenofovir, lamivudine or emtricitabine. These drugs act emulating nucleotides, competing with the nucleotide they are analogue to, so they are used by the reverse-transcriptase enzyme, truncating DNA viral genome and hampering viral replication (2,4). The other drug included can be one non-nucleoside analogue reverse-transcriptase inhibitor (e.g. efavirenz), one protease inhibitor (e.g. ritonavir, lopinavir or fosamprenavir) or one integrase inhibitor (e.g. bictegravir, raltegravir, elvitegravir or dolutegravir) (4).

Hence, some of the recommended treatments include: bictegravir/ emtricitabine /tenofovir or dolutegravir/ABC/ lamivudine (4).

Despite the sustained viral load achieved by HAART, the incidence of the HIV infection is still significant. In Spain 2017, 3,381 first time infections were reported (5), who were incorporated into public health treatment and evaluation protocols. After a new infection and diagnosis, clinicians evaluate the new patient’s clinical history and choose the most appropriate HAART strategy based on the HIV resistance, sustained viral response and history of hypersensitivity reactions and also actual HIV positive patient with HAART to change treatment in case of resistance appearance (4).

2. Abacavir hypersensitivity reaction As previously mentioned, ABC is one of the most commonly used NRTI for HIV treatment. It is one of the first-line prescriptions after HIV-positive first diagnosis and one of the recommended elections for a salvage treatment after resistance appearance (4).

ABC (Figure2A) is metabolized inside the cell into its active form, carbovir 5´-triphosphate (CBV-TP) (Figure2B), a guanosine (G) analogue, which competes for the incorporation in the reverse-transcription of viral DNA. Once incorporated, the viral DNA elongation is stopped. In addition, CBV-TP has a significantly lower affinity for the human polymerase than fort the reverse-transcriptase, which avoids the incorporation of CBV-TP to human DNA. This preference is important to avoid toxicity and to interfere only with viral replication (6).

2 I. INTRODUCTION

A. B.

Figure 2. A. Abacavir and B. Carbovir 5’ triphosphate.

However, ABC, as every drug, is not free from causing Adverse Drug Reactions (ADRs), such as the one called ABC HyperSensitivity Reaction (ABC-HSR). Around 4% of the patients suffer from ABC-HSR, which appears as fever, rash, fatigue, nausea, vomiting, headache, diarrhea, pruritus, abdominal pain and respiratory symptoms like dyspnea, cough or pharyngitis (7). The severity of ABC-HSR can be life-threatening and forces the cessation of ABC therapy. Thus, a predictive test for ABC-HSR susceptibility is required before treating new-diagnosed or treatment-changed patients with this drug.

3. Pharmacogenetics When taking a certain drug, not every patient experiences the same response. Indeed, some of them might suffer a lack of response while others may develop an ADR. This phenomenon is explained by how the patient’s plasma levels fit into the therapeutic window. Hence, patients with plasma concentrations below the minimum effective concentration may exhibit ineffectiveness. On the contrary, patients with plasma levels above the maximum tolerated concentration may show problems with drug tolerability. The objective of every therapy is to prescribe the right drug at the right dose to the right patient from the beginning, to avoid as far as possible the development of ADRs. Also, ADRs can derive from the drug interaction with immune system.

Pharmacogenetics is the subject area which studies associations between genetic variations and differences in drug response. There are multiple variations in genes that alter the effect a drug has on the organism. These variations can be located in genes encoding metabolizing enzymes, receptors, transporters and drug targets. These genes that are involved in the metabolism or response to one or more drugs are known as pharmacogenes.

Drug pharmacokinetics comprises its Absorption, Distribution, Metabolism and Elimination (ADME) processes in which several pharmacogenes are usually involved. Polymorphisms on them could result in enhancing or decreasing the effectiveness of the drug, but also in sharpening its secondary effects or even producing severe unexpected reactions.

3 I. INTRODUCTION

One of the main goals of modern medicine is to provide patients with therapy adapted to their condition, understanding that each patient is unique and that it will respond differently to same treatment from other with the same illness. This approach is known as precision or personalized medicine (8). Therefore, pharmacogenetic assays should be done for each patient if they are going to be treated with a drug whose pharmacogenetic information is known, in order to be given a personalized treatment.

4. HLA-B*57:01 biomarker: traditional genotyping and alternative methods The Human Leukocyte Antigen (HLA) is a gene complex that encodes the proteins corresponding to the Major Histocompatibility Complex in humans (MHC). This group of proteins is essential for the activation of the acquired immune response since they are responsible for the antigen presentation to T-lymphocytes.

There are two types of MHC: 1) MHC type I (MHC-I), composed of HLA genes A, B and C, presents peptides expressed by the cell itself and viral or other intracellular parasite elements and 2) MHC type II (MHC-II), composed of HLA genes DP, DM, DO, DQ, and DR, presents phagocyted peptides, such as pathogens, cell debris o elements from extracellular matrix (2).

HLA-B is one of the most important pharmacogenes because of its involvement in ADRs of several drugs, including those shown in Table 1 (9).

Table 1. Drug adverse response associated with HLA-B alleles.

HLA-B RISK DRUG ASSOCIATED ADR ALLELE 57:01 Abacavir Hypersensitivity reaction 57:01 Flucloxacillin Drug-induced livery injury Severe cutaneous adverse reactions, maculopapular 58:01 Allopurinol eruption 15:02 Carbamazepine Stevens-Johnson Syndrome/toxic epidermal necrolysis 15:02 Phenytoin Stevens-Johnson Syndrome/toxic epidermal necrolysis

The HLA-B gene has many possible variants, allowing the immune system to react to a wide range of foreign invaders. Hundreds of HLA-B alleles are known, for each of which a specific number is given (e.g. HLA-B*57). Closely related alleles are categorized together. For instance, more than 60 very similar alleles are subtypes of HLA-B*57. These subtypes are designated as presented in Figure 3 (10).

4 I. INTRODUCTION

Human leukocyte antigen Allele group HLA-B*57:01 Gene Specific HLA protein Figure 3. HLA nomenclature. Several independent studies have reported association between ABC-HSR and one specific variant of a component of human major histocompatibility complex type I: HLA-B*57:01 allele (11–13). The reason behind this remains unclear. Nonetheless, several theories have been suggested. One of them proposes ABC as a hapten that can bind to peptides. This aggregate can bind to immune receptors such as HLA-B unchaining immune reaction. Other theory is the p-i theory: drugs can interact allosterically with immune receptors, without being recognized by them, unchaining the immune reaction (14). Most recent studies suggest an alternative to these two. Difference in abundance, HLA-peptide stability and HLA bound conformation between HLA-B allotypes can result in ABC binding to the HLA-B pocket and causing its presentation as an antigen, unchaining hypersensitivity reactions (15–17). Genetic variation in the HLA-B pocket lead to changes in some aminoacids. Since aminoacids have different physical-chemical properties, the affinity to substrates may change. This would explain ABC recognition as a ligand.

A clinical trial was performed with a prospective genotyping of HLA-B*57:01 before ABC treatment (12,13). As a result, ABC-HSR was reduced significantly and, therefore, the need of HLA typing prior to ABC treatment was included in its drug label.

Thus, as previously described, HLA typing is required to predict ABC-HSR. There are multiple approaches to determine the HLA-B allotype. Serological typing assays use anti-HLA-B anti- bodies, including complement-dependent cytotoxicity assays, Enzyme Linked ImmunoSorbent Assay (ELISA)-based methods and flow cytometry techniques. Their principal limitation is the lack of the resolution needed for an accurate typing. They cannot discern between same allele protein variants (e.g. HLA-B*57:01 from HLA-B*57:03) (18). On the other side, genetic typing assays based on DNA, namely Sequence Specific Primers- Polymerase Chain Reaction (SSP- PCR), Sequence Specific Oligonucleotide-PCR (SSO-PCR), reverse SSO-PCR or sequencing-based typing. Their principal advantages are their simplicity, rapidness and resolution (they can identify the exact allele) (18,19). Therefore, these are the predominant techniques for ABC- HSR susceptibility prediction.

5 I. INTRODUCTION

5. Hospital de La Princesa HLA-B*57:01 determination process Since the compulsory HLA-B typing was included in ABC drug label in 2008, the Hospital Universitario de La Princesa Clinical Pharmacology Department determines HLA-B genotype prior to initiating ABC treatment. The method nowadays used includes an inverse hybridization process before sequencing, technique explained at later stage 1. HCP5 genotyping and correlation with HLA-B*57:01. Although this technique is not able to distinguish between HLA- B*57 subvariants, which is important because only HLA-B*57:01 patients are prone to suffer from ABC-HSR, it avoids direct sequencing on every single sample. Because sequencing is outsourced, the main drawback is the test response time. Therefore, this alternative enables faster response to most patients, but the samples from HLA-B*57 positive patients continue to be sequenced. The whole process includes: DNA extraction from blood, two different conventional PCR, inverse hybridization, two different electrophoresis runs, agarose DNA purification and outsourced sequencing. Therefore, the current procedure is technically complex, expensive and entails a moderate response time. These latter facts justify the need for an alternative method that could shorten response time and costs.

6. Single nucleotide polymorphism genotyping techniques Single Nucleotide Polymorphisms (SNPs) are variations of a single base, in one position in DNA that are present, at least, in 1% of population. Several million SNPs have been described through human genome, with an approximate frequency of 1 per 300 base pairs, contributing to 90% of genetic variability (20). Because of their abundance in genome and their importance in individual variability, they are commonly used as biomarkers.

Normally, for a specific position on DNA, also known as locus, there are two types of alleles. The wild-type allele, which it is the ancestral allele and usually the most frequent and the mutated alleles, which are the result of replication mechanism errors or mutagenic agent exposure, some of which have been selected by nature, and that are typically the less frequent variants.

There are several methods to analyse SNPs: SSP-PCR and real-time PCR (qPCR) based techniques, such as a primer extension–based method (SNaPshot®, Applied Biosystems, CA, USA), allelic discrimination probes (Taqman®, Applied Biosystems, CA, USA) or hybridization probes (LightSNiP®, Roche Diagnostics, Mannheim, Germany). As explained before, DNA-based techniques are simple, rapid and they have high resolution. Hence, SNP screening has been made in order to find biomarkers for different diseases and to develop easier and more affordable ways to predict them.

6 I. INTRODUCTION

However, HLA-B variants are not determined by a unique SNP, but by several different changes in multiple positions of the sequence. Thus, one single SNP that identified unequivocally HLA- B*57:01 would be desirable.

7. HLA Complex P5 and linkage disequilibrium Linkage Disequilibrium (LD) refers to the non-casual disposition of alleles between two nearby loci, so that these alleles will be inherited together over multiple generations. This is explained by events such as inbreeding, population subdivision and population bottlenecks, genetic drift, natural selection, DNA structure or low probability of recombination between two nearby loci (21). Therefore, if an allele is associated with a phenotype, it is possible that other alleles of other close SNPs may also be associated with that phenotype. LD is a useful tool to study genetic structure and to find biomarkers, e.g. SNPs. There are multiple SNPs (usually called tag- SNPs) in LD with genes of interest, which allows using the convenient methods for SNP genotyping with the certainty of working with the gene of interest, although it is necessary to assure complete LD.

HLA Complex P5 (HCP5) is a long non-coding RNA human gene located in chromosome 6, position 31463180 to 31465809 on GRCh38, cytogenetic band 6p21.33, near HLA-B. Its function is unknown, but it has been suggested to have a regulatory purpose in immunity and retroviral infection (22,23). This gene contains one SNP, rs2309529, whose variants are: T (wild-type allele) and G (mutated allele). Several independent groups reported complete LD between HLA-B*57:01 and this G allele (24–26). This makes rs2309529 a great candidate to be used as a predictor of HLA-B*57:01 presence and, therefore, to predict ABC-HSR susceptibility.

Nevertheless, other groups have reported patients with the HCP5 rs2395029 G allele that were negative for HLA-B*57:01 and vice versa (27–29). These results suggest an incomplete LD between this tagSNP and HLA-B*57:01. Another inconvenience is that HCP5 is present in a copy number variation (CNV) zone of the chromosome, which could lead to the absence or duplication of HCP5 that could alter predictive power of this test (29). Therefore, a more in- depth study is needed to assess the validity of rs2395029 as a diagnostic test in our population.

7 II.OBJECTIVES

II. Objectives of the study The main objective of this study is to validate a method based on the polymorphism HCP5 rs2395029, in theoretical complete linkage disequilibrium with HLA-B*57:01, as a biomarker for abacavir hypersensitivity reaction susceptibility prediction.

The specific objectives derived from the main objective are as follows:

1. To verify the carriage of the G allele of HCP5 in every HLA-B*57:01 positive samples, validating an alternative genetic solution to this biological problem.

2. To calculate the specificity and sensibility of the new method.

3. To determine HCP5 copy number variations.

4. To determine feasibility of the new method in daily practice, by reducing the time response and costs.

8 III. MATERIALS AND METHODS

III. Materials and methods

1. Study population DNA was extracted from 224 HIV positive patients’ blood, derived from the Infectious Diseases Department of Hospital Universitario de La Princesa. Patients were chosen from a total of 1225 that were screened from June 2008 to May 2019. We included all 49 confirmed HLA-B*57:01 positives and 177 randomly chosen HLA-B*57:01 negatives. These 224 samples were subsequently genotyped for HCP5 rs2395019. Ethnic data was not available since it is not documented on patients’ medical record.

2. DNA extraction using MagNA Pure system DNA extraction from 1 ml of blood was carried out by MagNA Pure LC 2.0 Instrument (Roche Diagnostics, Mannheim, Germany), with DNA LV 1000 Purification protocol. Reagents were contained in the MagNA Pure LC DNA Isolation Kit – Large volume. Extraction was done following manufacturer instructions.

3. Real time PCR (qPCR) using LightSNiP® technology HCP5 genotyping was carried out using primers and hybridization fluorescent probes designed by TIB MOLBIOL (GmbH, Berlin, Germany) in LightCycler 2.0 instrument (Roche Diagnostics, Mannheim, Germany).

This technology is based on melting curves, detected by fluorescence fall on a qPCR thermal cycler. Two probes complementary to the patient´s sequence are used: 1) Probe labelled with 3'-Fluorescein and 2) Probe labelled with 5'-LightCycler® Red dye which also contains a 3'- phosphate to avoid probe elongation (Figure 4).

Figure 4. FRET probes mechanism of action (green and purple). F: fluorescein; L: LightCycler® Red; P: phosphate.

9 III. MATERIALS AND METHODS

Both probes hybridize next to each other with the DNA, where 3'-Fluorescein (the donor) transmits its energy to 5'-LightCycler® Red (the acceptor). The acceptor emits at 530 nm wavelength and it is detected by the thermal cycler. This phenomenon is known as Föster Resonance Energy Transfer (FRET). Thus, only when both probes hybridize, fluorescence will be emitted and detected. After 40 cycles of amplification like a conventional PCR, fluorescence reaches its maximum value.

After this, the temperature increases progressively until it reaches 85 ºC, resulting in DNA pairs denaturalization. This is the parameter, known as the melting temperature (Tm), from which individual allelic variant is determined.

Depending on HCP5 rs2395029 genotype, the complementarity of the probes will be different (depends on the SNP rs2395029): allele G will have 100% complementarity with the probe whereas allele T will have a mismatch in said position. Therefore, the temperature at which the fluorescence decays by half is lower for allele T than for allele G. In case of homozygosis, only one melting peak will be shown, corresponding with melting temperature of either variant. Conversely, for heterozygous individuals, two melting peaks appear.

4. Sequence-specific oligonucleotide reverse hybridization: AutoLiPA HLA-B typing was performed by sequence-specific oligonucleotide reverse hybridization on an AutoLiPA instrument (Fujirebio-Europe, Belgium). Workflow was previously described in the section 5. Hospital de La Princesa HLA-B*57:01 determination process In order to perform this technique, HLA-B exons 2, 3 and 4 were amplified by multiplex PCR with intention to hybridize them with oligonucleotide-attached strips. The primers used were biotinylated, i.e. biotinylated PCR products were obtained. This is necessary for following steps of the assay.

An electrophoretic separation was performed on a 2% agarose gel to confirm amplification by observing bands with 555 base pairs (bp) (exon 2), 436 bp (exon 3) and 323 bp (exon 4).

SSO-PCR is based on sequence complementarity between problem DNA and known probes. First, DNA and probes bind. Second, revealing reaction occurs, identification of the corresponding DNA fragment. Reverse hybridization implies that it is the amplified DNA the one that hybridizes with pre-fixed probes.

The probes are attached to two different strips, displaying parallel bands. Each band correspond to a different fragment of different HLA-B alleles. Therefore, after hybridization and revealing reaction, a specific band pattern is displayed. Each HLA-B allele has its own pattern.

10 III. MATERIALS AND METHODS

A tray with gaps for the strips is placed in the AutoLiPA instrument, which adds reagents and incubates the reactions while maintaining agitation. Reagents are added as indicated on INNO- LiPA HLA-B Update Plus kit (Fujirebio). DNA sample (10 µl) and strips are also placed on the incubation tray.

The reaction includes a first period of denaturalization, hybridization and washing of non- specifically hybridized DNA molecules. Secondly, solution with streptavidin labelled with alkaline phosphatase is added. Streptavidin has high affinity for biotin due to non-covalent forces. Biotinylated PCR products are attached to alkaline phosphatase by the streptavidin– biotin union. Alkaline phosphatase is the enzyme that catalyses the colour development reaction. After this, substrates 5-Bromo-4-Chloro-3-Indolyl Phosphate (BCIP) and 4-NitroBlue Tetrazolium (NBT) are added. Alkaline phosphatase dephosphorylates BCIP into 5-Bromo-4- Chloro-3-Indolyl (BCI). Then, BCI is oxidized by NBT resulting in a violet precipitate. The band pattern is subsequently read with Line Reader Analysis Software for LiPA HLA and manually checked by a trained researcher, leading to a final HLA-B typing.

However, the resolution of this technique only reaches allele type. Protein subtype cannot be determined. For example, HLA-B*57 can be identified but not HLA-B*57:01 or HLA-B*57:08 subtypes.

5. Sanger sequencing Once HLA-B*57 is confirmed, conventional PCR is carried out to amplify 193 bp fragment of exon 3. Amplicon is sent to Parque científico de la Universidad Complutense de Madrid were Sanger sequencing is performed. Sequence is returned and compared with subvariants sequences such as HLA-B*57:01, HLA-B*57:03 or HLA-B*57:08 (30).

Sanger sequencing is based on the PCR reaction with both normal deoxyriboNucleotides TriPhosphate (dNTPs) and dideoxyriboNucleotides TriPhosphate (ddNTPs) marked with a different fluorophore for each nitrogenous base. Since ddNTPs stop the DNA elongation due to lack of 3’-OH, different length DNA fragments are obtained. The difference between this DNA fragments is 1 nucleotide length. After this, capillary electrophoresis is performed in order to separate them by size. Subsequently, fluorescence is read and sequence is identified.

6. CNV assay In the light of the results discussed later, the CNV assay was required in order to determine HCP5 copy number. The technology used in this assay is based on TaqMan® hydrolysis probes. These DNA molecules are attached to a quencher on 5’ and a fluorophore on 3’. The quencher

11 III. MATERIALS AND METHODS molecule prevents the fluorophore from emitting. On PCR amplification, probes hybridize, and DNA polymerase starts polymerizing DNA from 3’ end. When it reaches the probe, DNA polymerase 5’-3’ exonuclease activity degrades it, releasing the fluorophore from the quencher and producing signal.

The signal is proportional to DNA quantity. However, a standardization signal required, i.e. housekeeping gene. This type of genes always has a known number of copies. In this study, the RNAseP was chosen. Every individual has two copies of it; thus, it allows comparing signal of RNAseP with signal from the target sequence. Target probe and reference probe are attached to different fluorophores to measure both signals separately.

The regions studied were Hs03608991_cn, which is located in chromosome 6, position 31475384 on GRCh38, cytoband 6p21.33 and Hs03593070_cn, which is located in chromosome 6, position 31441708 on GRCh38, cytoband 6p21.33.

The qPCR assay was carried out in QuantStudio 12k Flex instrument (ThermoFisher Scientific, USA).

7. Statistical analysis The experiment was design as a case-control assay. For validation of the method, sensitivity and specificity are calculated as:

TP TN sensitivity = · 100% specificity = · 100% TP + FN TN + FP

Where True Positives (TP) were HCP5 rs2395029 G allele and HLA-B*57:01 positives, True Negatives (TN) were HCP5 rs2395029 T allele and HLA-B*57:01 negative, False Negatives (FN) were HCP5 rs2395029 T allele and HLA-B*57:01 positives and False Positives (FP) were HCP5 rs2395029 G allele and HLA-B*57:01 negative.

The exact conservative Clopper Pearson method (31) is used to calculate sensitivity and specificity confidence intervals (CI). Positive and negative predictive values and their CI were calculated using Mercaldo et al. asymptotic adjusted logit intervals (32). Confidence level was set at 95%. All calculous were computed in R-3.5.1 software (33) using BDtest function from package bdpv (34).

12 IV. RESULTS

IV. Results

1. HCP5 genotyping and correlation with HLA-B*57:01 Hospital Universitario de la Princesa Clinical Pharmacology Department has genotyped 1225 patients for HLA-B as an assistance labour performed by this department since 2008. During the stay in which the study was conducted, 15 patients were genotyped for HLA-B.

From the 1225 total, 60 samples were positive for HLA-B*57, which were sequenced by Sanger method (Figure 5).

A. 1225 patients. Electrophoresis gel patients

HLA-B genotyping by Blood HLA-B inverse DNA HLA-B PCR sample exons hybridization

Band cutting

B. 60 patients.

Sequencing PCR Electrophoresis gel DNA purification from band Sanger sequencing

Figure 5. Workflow to predict ABC-HSR susceptibility. Blue path represents process followed for every probe. Orange path represents process for HLA-B*57 positives obtained at the end of blue path.

The sequencing results revealed 49 patients with HLA-B*57:01, 7 patients with HLA-B*57:03, 2 patients with HLA-B*57:02, 1 patient with HLA-B*57:07 and 1 patient with HLA-B*57:16. Thus, the prevalence of HLA-B*57:01 was 4%.

All 49 HLA-B*57:01 positives and 177 randomly chosen HLA-B*57:01 negatives were genotyped for HCP5 rs2395029. HCP5 rs2395029 T allele Tm was found at 53.61 ± 0.27 ºC and G allele Tm at 62.35 ± 0.23 ºC.

No false negatives and false positives were found. All 49 HLA-B*57:01 positives carried HCP5 rs2395029 G allele and all 177 HLA-B*57:01 negatives did not carry HCP5 rs2395029 G allele, as displayed at Table 2.

13 IV. RESULTS

Table 2. Correlation between HLA-B*57:01 and HCP5 rs2395029.

HLA-B*57:01 + − + 49 0 HCP5 − 0 177 S = 100% E = 100%

Key: S: sensitivity, E: specificity; PPV: positive predictive value, NPV: negative predictive value. The confidence interval (CI) with a 95% confidence level was 93−100% for sensibility and 98−100% for specificity.

After applying the statistical approach based on adjusted logit intervals, the positive predictive value was estimated as 84.4%; when setting the confidence level at 95%, the CI was 48.1−93.9%. The negative predictive value was estimated as 99.9%; likewise, when setting the confidence level at 95%, the confidence level was 99.4−100%.

All 177 negative HLA-B*57:01 samples showed only one melting peak corresponding to HCP5 rs2395029 T allele; 44 samples HLA-B*57:01 positive exhibited two melting peaks corresponding to HCP5 rs2395029 T and G alleles. Five positives for HLA-B*57:01 showed one melting peak corresponding to HCP5 rs2395029 G allele, i.e. G/G homozygotes (Figure 6). Melting Peaks

2.685 2.485 Homozygotic sample 2.285 Heterozygotic sample 2.085 1.885 1.685 1.485 1.285 1.085 Negative control 0.885 0.685

-(d/dT)(530) Fluorescence 0.485 0.285

40 45 50 55 60 65 70 75 80 85 Temperature (°C)

Figure 6. FRET probes-based genotyping for HCP5 rs2395029 melting peaks. Samples observed genotype were: T/G heterozygotic (pink) and G/G homozygotic (black).

HLA-B genotypes for HCP5 rs2395029 G/G carriers were consulted in order to explain homozygosis. Genotype for these samples is exhibited at Table 3.

14 IV. RESULTS

Table 3. Genotype for single peak for HCP5 rs2395029 G allele and their corresponding HLA-B genotype.

Sample HLA-B genotype 01 B*51, B*57:01 02 B*52, B*57:01 03 B*52, B*57:01 04 B*57, B*57:01 05 B*51, B*57:01

HLA-B*51 and HLA-B*52 were present in HCP5 rs2395029 T homozygotes and no correlation was found between G allele and this HLA-B variant.

2. HCP5 CNV analysis Homozygotes for the G allele were further studied. All HCP5 rs2395029 G/G carriers were subjected to TaqMan® CNV assay. The variations that can be detected by Hs03608991_cn and Hs03593070_cn probes and that involve HCP5 are the following ones (named as registered in Database of Genomic Variants) (35): nsv1161369, dgv215e55, esv2759414, nsv441990, nsv1112900, dgv10404n54, esv26480, dgv10393n54, esv2752123, dgv10454n54, esv2432528, nsv435829, dgv1751e212, esv3576141, esv33563, dgv10403n54, nsv511868, nsv462854, dgv10453n54, dgv214e55, nsv823497, nsv5246, dgv995e201, dgv10459n54, esv2762584, esv2731822, dgv10458n54, nsv10820, esv3608563, dgv1098e199, esv34118, nsv1126749, nsv517441, nsv970363, nsv1073969, esv3411275, esv29968, nsv513736, dgv5944n100, dgv5943n100, dgv5942n100. The copy number calculation was 2 copy number (Figure 7). For this prediction, each region for every sample there had 99% confidence (Table 4). Therefore, no CNV for this region was detected in this assay.

Figure 7. Sample copy number for sequences targeted by Hs03593070_cn and Hs03608991_cn probes. Bold samples correspond to HCP5 rs2395029 single G allele signal.

15 IV. RESULTS

Table 4. Hs03608991_cn and Hs03593070_cn copy number and confidence of the calculous.

Copy number Copy number Sample Name Target Reference Confidence calculated predicted Hs03608991_cn 2.05 2 > 0.99 Sample 01 Hs03593070_cn 1.56 2 0.99 Hs03608991_cn 1.98 2 > 0.99 Sample 02 Hs03593070_cn 1.79 2 > 0.99 Hs03608991_cn 2.23 2 0.99 Sample 03 Hs03593070_cn 1.73 2 > 0.99 RNAseP Hs03608991_cn 1.95 2 > 0.99 Sample 04 Hs03593070_cn 1.77 2 > 0.99 Hs03608991_cn 1.55 2 > 0.99 Sample 05 Hs03593070_cn 1.57 2 > 0.99 Hs03608991_cn 1.96 2 > 0.99 Positive control Hs03593070_cn 2.00 2 0.99

3. Economic and practical comparison between HLA-B direct genotyping and HCP5 rs2395029 used as biomarker For comparing both methods, response time and costs were estimated and compared at Table 5 and Table 6.

Table 5. Time response comparison between HCP5 rs2395029 and HLA-B methods

HCP5 rs2395029 genotyping HLA-B genotyping DNA extraction 1.5 h DNA extraction 1.5 h qPCR 1.5 h Multiplex PCR 3 h Electrophoresis gel 1 h Inverse hybridization 3 h Total time 3 h Total time 8.5 h Although positive samples for HLA-B*57 are less frequent, they need more time to be determined: Sequencing PCR 3 h Electrophoresis gel 1 h Band purification 1 h Sanger sequencing (externalized) 48 h* Total time 61.5 h Key: *Sanger sequencing experiment does not need 48 h itself, but because of externalization it does last that much.

16 IV. RESULTS

Table 6. Costs comparison between HCP5 rs2395029 and HLA-B methods.

HCP5 rs2395029 genotyping HLA-B genotyping: DNA extraction 15 € DNA extraction 15 € qPCR 9 € Multiplex PCR Consumables + 30% Inverse hybridization kit 40 € Electrophoresis gel (x2) 12 € Consumables + 30% Estimated total 32 € Estimated total 87.1 € Although positive samples for HLA-B*57 are less frequent, they increase the costs: Sequencing PCR 20 € Sanger sequencing 15 € Consumables + 30% Estimated total 133 €

This alternative method entails a less time-consuming technique. It is 5.5 h faster and externalization is not required. Furthermore, HCP5 rs2395029 genotyping method had better time response and costs. Total investment for HLA-B genotyping reached 109,427.5 €. This number was calculated as the cost of all 1225 samples since 2008. Otherwise, HCP5 rs2395029 genotyping would have cost 39,200 €. Therefore, switching from the determination of HLA- B*57:01 to the alternative HCP5 rs2395029 to predict ABC-HSR would have resulted in 70,227.5 € (64%) cost reduction.

17 V. DISCUSSION

V. Discussion

1. HCP5 genotyping and correlation with HLA-B*57:01 HCP5 rs2395029 G allele was predictive for all HLA-B*57:01 carriers, with an adequate CI except the positive predictive value. The critical parameter for this diagnostic test is negative predictive value. It represents the proportion of people not susceptible for abacavir hypersensitivity reaction (true HLA-B*57:01 negatives) regarding the total of negatives predicted by the test. Because of the severe consequences of not detecting true HLA-B*57:01, it is desired that this parameter is close to 100%. If that is so, no true positive patient will be treated with abacavir because of being reported as negative by this assay. Negative predictive value was 99.9%, which CI was 99.4−100%. Therefore, this test is safe for ABC-HSR prediction.

Nevertheless, confidence interval for positive predictive value was 48.1−93.9%. This parameter represents the proportion of people susceptible for abacavir hypersensitivity reaction (true HLA-B*57:01 positives) regarding the total of positives predicted by the test. The results obtained exhibited inaccurate CI. This may be due to limited sample size. Although abacavir is an effective drug and one of election treatment, it is not imperative that this value is as close to 100% as negative predictive value; due to other treatments availability (4). If the total amount of patients genotyped for HLA-B*57:01 at Hospital Universitario de La Princesa were genotyped for HCP5, the statistical power would be much higher. Not only because of sample size but also because the direct positive and negative predictive values could be calculated. However, case-control design was the best option given the features of the study due to time and cost restrictions. On the other side, HLA-B typing or Sanger sequencing could be performed in order to confirm positives reported by HCP5 genotyping assay.

The results obtained in this study do not report any false positive, i.e. HCP5 positive and HLA- B*57:01 negative; or false negatives, i.e. HCP5 negative and HLA-B*57:01 positive. However, other groups that reported no false positives and negatives had different limitations, namely reduced number of HLA-B*57:01 positives samples (25) and limited sample size (26).

Nonetheless, other groups have described false positives (27−29) and false negatives (29). Despite that results, these groups reported similar positive and predictive values and confidence intervals. In one of these studies (29) one false negative was reported from 1777 individuals. The presence of false negatives implies severe consequences because this patient would have been treated with abacavir if ABC-HSR susceptibility were screened by HCP5

18 V. DISCUSSION rs2395029 genotyping. However, this could be understood as a rare case, what would imply assuming the risk of a false negative.

Consequently, HCP5 rs2395029 genotyping is a reliable method to predict ABC-HSR susceptibility despite the possibility of false positives, which can be resolved by applying an additional method to positive samples to this test, e.g. HLA-B typing or Sanger sequencing.

2. HCP5 CNV analysis Regarding the samples with HCP5 rs2395029 single G allele signal, a deletion which involves HCP5 could be possible. Therefore, CNV analysis was done. Initially, a direct CNV assay for HCP5 was planned, but its sequence is not suitable for a specific CNV TaqMan® assay due to similarity with its surroundings and other genome regions. Thus, two flanking regions were studied with the aim to search for described CNV that involve HCP5. Regions for Hs03608991_cn and Hs03593070_cn probes were selected because of involving several described deletions for HCP5 and commercial kit availability.

Deletions were expected for samples 01, 02, 03 and 05 if LD is complete for these two genes. The HCP5 rs2395029 G allele is expected in a chromosome without the HLA-B*57:01 allele. On the other side, sample 04 was reported as B*57/B*57 by AutoLiPA instrument and confirmed as B*57:01 by Sanger sequencing. Unfortunately, the latter technique does not allow sequencing both parental and maternal DNA. Nevertheless, HLA-B*57:01 homozygosis could be expected in this sample.

Our results display absence of the deletions screened, even the one described by Melis et al. (29). The cause of these results is unknown, yet it could be expected that the studied regions were located far from HCP5, resulting in missing unknown smaller deletions where Hs03608991_cn and Hs03593070_cn probes are not involved. The CNV assay was not tested in HCP5 rs2395029 T homozygotes because of this possibility.

Although the latter conclusions may be contradictory, they do not interfere with HCP5 robustness as the G allele presence is enough to prevent from ABC prescription, regardless if it is found in heterozygosis or homozygosis. However, it should be noted that if the deletion for HCP5 was confirmed, it is possible that HLA-B*57:01 was paired with no HCP5 gene, which would result in a false negative. Thus, the complete LD must be furtherly studied. Additional studies that could be performed in order to enlighten these results are comparative genome hybridization (29) or direct HCP5 CNV study.

19 V. DISCUSSION

3. Economic and practical comparison between HLA-B direct genotyping and HCP5 rs2395029 used as biomarker Despite CNV and wide positive predictive value confidence interval, the main advantage of the HCP5 rs2395029 method is its reduced working time by 5.5 h per assay and costs by 64%. However, since positive predictive value could not be precisely calculated, if further study expanding sample size is not performed, it is possible that Sanger sequencing or HLA-B conventional typing is required for HCP5 rs2395029 G allele positives to complement the new method. This is still an improvement because number of patients that requires the previous method is significantly reduced. Therefore, although this would raise the costs, yet the new method would still be better in this aspect.

In Spain 2017, 3,381 new diagnoses were reported (5). Thus, it is crucial to any healthcare system to be economically efficient and to respond rapidly to all this patients. ABC is one of the election treatments (4) and HCP5 genotyping test would prevent the life-threatening hypersensitivity reaction susceptible patients could suffer, but also it would confirm that suitable patients will receive the best possible treatment for them. Therefore, the implementation of this alternative would lead to this objective.

Conclusions o HCP5 rs2395029 correlates with HLA-B*57:01 in our population. However, sample size was a limitation for obtaining more precise positive predictive value CI. o Allele G presence is enough to avoid ABC prescription, regardless if it is found in heterozygosis or homozygosis despite the CNV inconvenience. o HCP5 rs2995029 was a good biomarker for predicting HLA-B*57:01 allele due to the adequate positive predictive value and the great negative predictive value. Therefore, it could be used as a reliable ABC-HSR predictor. o This new method presents advantages versus HLA-B genotyping methods, such as time response and cost reduction, important to every health system and routine clinical practice.

20

References

1. German Advisory Committee Blood (Arbeitskreis Blut), Subgroup ‘Assessment of Pathogens Transmissible by Blood’. Human immunodeficiency virus (HIV). Transfus Med Hemotherapy. 2016;43(3):203–22.

2. Rich RR, Fleisher TA, Shearer WT, Schroeder H, Frew AJ, Weyand CM. Clinical immunology: principles and practice. Elsevier.2019; 545–59.

3. Lohse N, Obel N. Update of survival for persons with HIV infection in Denmark. Ann Intern Med. 2016;165(10):749.

4. Panel de expertos de GeSIDA y Plan Nacional sobre el Sida. Documento de consenso de Gesida/Plan Nacional sobre el Sida respecto al tratamiento antirretroviral en adultos infectados por el virus de la inmunodeficiencia humana (Actualización enero 2019). [Internet] Available from: http://gesida-seimc.org/documento-de-consenso-de-gesidaplan-nacional-sobre-el-sida-respecto-al- tratamiento-antirretroviral-en-adultos-infectados-por-el-virus-de-la-inmunodeficiencia-humana- actualizacion-enero-2019.

5. D.G. de salud pública, calidad e innovación, Centro Nacional de Epidemiología, ISCIII. Área de Vigilancia de VIH y Comportamientos de riesgo. Vigilancia epidemiológica del VIH y SIDA en España 2017: sistema de información sobre nuevos diagnósticos de VIH y registro nacional de casos de SIDA. Plan Nacional sobre el Sida. 2018. [Internet] Available from: https://www.mscbs.gob.es/ciudadanos/enfLesiones/enfTransmisibles/sida/vigilancia/home.htm

6. Daluge SM, Good SS, Faletto MB, Miller WH, St Clair MH, Boone LR, et al. 1592U89, a novel carbocyclic nucleoside analog with potent, selective anti-human immunodeficiency virus activity. Antimicrob Agents Chemother. 1997;41(5):1082–93.

7. Hetherington S, McGuirk S, Powell G, Cutrell A, Naderer O, Spreen B, et al. Hypersensitivity reactions during therapy with the nucleoside reverse transcriptase inhibitor abacavir. Clin Ther. 2001;23(10):1603–14.

8. National Research Council (U.S.). Toward precision medicine: building a knowledge network for biomedical research and a new taxonomy of disease. National Academies Press.2011; 128. Available from: http://www.ncbi.nlm.nih.gov/books/NBK91503/

9. Whirl-Carrillo M, McDonagh EM, Hebert JM, Gong L, Sangkuhl K, Thorn CF, et al. Pharmacogenomics Knowledge for Personalized Medicine. Clin Pharmacol Ther. 2012;92(4):414–7.

10. Genetic Home Reference, U.S National Library of Medicine. HLA-B gene [Internet]. Available from: https://ghr.nlm.nih.gov/gene/HLA-B# [consulted in May 2019]

11. Hetherington S, Hughes AR, Mosteller M, Shortino D, Baker KL, Spreen W, et al. Genetic variations in HLA-B region and hypersensitivity reactions to abacavir. The Lancet. 2002;359(9312):1121–2.

12. Simon M, Elizabeth P, Giampiero C, Jean-Michel M, Cassy W, Janez T, et al. HLA-B*5701 screening for hypersensitivity to abacavir. N Engl J Med. 2008;12.

21

13. Hughes S, Hughes A, Brothers C, Spreen W, Thorborn D, on behalf of the CNA106030 Study Team. PREDICT-1 (CNA106030): the first powered, prospective trial of pharmacogenetic screening to reduce drug adverse events. Pharm Stat. 2008;7(2):121–9.

14. Pichler WJ, Beeler A, Keller M, Lerch M, Posadas S, Schmid D, et al. Pharmacological interaction of drugs with immune receptors: the p-i concept. Allergol Int Off J Jpn Soc Allergol. 2006;55(1):17–25.

15. Ostrov DA, Grant BJ, Pompeu YA, Sidney J, Harndahl M, Southwood S, et al. Drug hypersensitivity caused by alteration of the MHC-presented self-peptide repertoire. Proc Natl Acad Sci. 2012;109(25):9959–64.

16. Illing PT, Pymm P, Croft NP, Hilton HG, Jojic V, Han AS, et al. HLA-B57 micropolymorphism defines the sequence and conformational breadth of the immunopeptidome. Nat Commun. 2018;9(1):4693.

17. Illing PT, Vivian JP, Dudek NL, Kostenko L, Chen Z, Bharadwaj M, et al. Immune self- reactivity triggered by drug-modified HLA-peptide repertoire. Nature. 2012;486(7404):554–8.

18. Bontadini A. HLA techniques: Typing and antibody detection in the laboratory of immunogenetics. Methods. 2012;56(4):471–6.

19. Erlich HA, Opelz G, Hansen J. HLA DNA typing and transplantation. Immunity. 2001;14(4):347–56.

20. Feuk L, Carson AR, Scherer SW. Structural variation in the human genome. Nat Rev Genet. 2006;7(2):85–97.

21. Slatkin M. Linkage disequilibrium — understanding the evolutionary past and mapping the medical future. Nat Rev Genet. 2008;9(6):477–85.

22. Kulski JK, Dawkins RL. The P5 multicopy gene family in the MHC is related in sequence to human endogenous HERV-L and HERV-16. Immunogenetics. 1999;49(5):404- 12.

23. Vernet C, Ribouchon MT, Chimini G, Jouanolle AM, Sidibé I, Pontarotti P. A novel coding sequence belonging to a new multicopy gene family mapping within the human MHC class I region. Immunogenetics. 1993;38(1):47-53.

24. de Bakker PIW, McVean G, Sabeti PC, Miretti MM, Green T, Marchini J, et al. A high- resolution HLA and SNP haplotype map for disease association studies in the extended human MHC. Nat Genet. 2006;38(10):1166–72.

25. Sanchez-Giron F, Villegas-Torres B, Jaramillo-Villafuerte K, Silva-Zolezzi I, Fernandez-Lopez JC, Jimenez-Sanchez G, et al. Association of the genetic marker for abacavir hypersensitivity HLA-B*5701 with HCP5 rs2395029 in Mexican Mestizos. Pharmacogenomics. 2011;12(6):809–14.

26. He Y, Hoskins JM, Clark S, Campbell NH, Wagner K, Motsinger-Reif AA, et al. Accuracy of SNPs to predict risk of HLA alleles associated with drug-induced hypersensitivity events across racial groups. Pharmacogenomics. 2015;16(8):817–24.

22

27. Colombo S, Rauch A, Rotger M, Fellay J, Martinez R, Fux C, et al. The HCP5 single‐ nucleotide polymorphism: A simple screening tool for prediction of hypersensitivity reaction to abacavir. J Infect Dis. 2008;198(6):864–7.

28. Rodriguez-Novoa S, Cuenca L, Morello J, Cordoba M, Blanco F, Jimenez-Nacher I, et al. Use of the HCP5 single nucleotide polymorphism to predict hypersensitivity reactions to abacavir: correlation with HLA-B*5701. J Antimicrob Chemother. 2010;65(8):1567–9.

29. Melis R, Lewis T, Millson A, Lyon E, McMillin GA, Slev PR, et al. Copy number variation and incomplete linkage disequilibrium Interfere with the HCP5 genotyping assay for abacavir hypersensitivity. Genet Test Mol Biomark. 2012;16(9):1111–4.

30. Martin AM, Nolan D, Mallal S. HLA-B*5701 typing by sequence-specific amplification: validation and comparison with sequence-based typing. Tissue Antigens. 2005;65(6):571– 4.

31. Clopper CJ, Pearson ES. The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika. 1934;26(4):404–13.

32. Mercaldo ND, Lau KF, Zhou XH. Confidence intervals for predictive values with an emphasis to case–control studies. Stat Med. 2007;26(10):2170–83.

33. R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing. 2014. Available from: http://www.R-project.org/.

34. Schaarschmidt F. bdpv: Inference and Design for Predictive Values in Diagnostic Tests. R package version 1.3. 2019. Available from: https://CRAN.R-project.org/package=bdpv.

35. MacDonald JR, Ziman R, Yuen RKC, Feuk L, Scherer SW. The Database of Genomic Variants: a curated collection of structural variation in the human genome. Nucleic Acids Res. 2014;42(Database issue):D986-992.

23