Genomic Profiling of Primary Histiocytic Sarcoma Reveals Two Molecular
Total Page:16
File Type:pdf, Size:1020Kb
Hystiocyte DIsordes SUPPLEMENTARY APPENDIX Genomic profiling of primary histiocytic sarcoma reveals two molecular subgroups Caoimhe Egan, 1 Alina Nicolae, 1 Justin Lack, 2 Hye-Jung Chung, 1 Shannon Skarshaug, 1 Thu Anh Pham, 1 Winnifred Navarro, 1 Zied Abdullaev, 1 Nadine S. Aguilera, 3 Liqiang Xi, 1 Svetlana Pack, 1 Stefania Pittaluga, 1 Elaine S. Jaffe 1 and Mark Raffeld 1 1Laboratory of Pathology, National Cancer Institute, National Institutes of Health, Bethesda, MD; 2NIAID Collaborative Bioinformatics Resource (NCBR), National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD and 3University of Virginia Health System, Charlottesville, VA, USA ©2020 Ferrata Storti Foundation. This is an open-access paper. doi:10.3324/haematol. 2019.230375 Received: June 25, 2019. Accepted: August 21, 2019. Pre-published: August 22, 2019. Correspondence: MARK RAFFELD - [email protected] Supplementary Material: Genomic profiling of primary histiocytic sarcoma reveals two molecular subgroups Caoimhe Egan1, Alina Nicolae1, Justin Lack2, Hye-Jung Chung1, Shannon Skarshaug1, Thu Anh Pham1, Winnifred Navarro1, Zied Abdullaev1, Nadine S Aguilera3, Liqiang Xi1, Svetlana Pack1, Stefania Pittaluga1, Elaine S Jaffe1 and Mark Raffeld1. 1Laboratory of Pathology, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA. 2NIAID Collaborative Bioinformatics Resource (NCBR), National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA. 3 University of Virginia Health System, 1215 Lee Street, Box 800214, Charlottesville, VA, USA. 1 Supplementary Methods Case selection. Twenty-one cases of primary histiocytic sarcoma (pHS) with a tumor content of at least 40% were identified from the files of the Hematopathology Section of the National Cancer Institute under an IRB approved protocol. All cases were reviewed by a panel of hematopathologists (ESJ, SP, CE, AN). Cases were defined as pHS if they showed typical histomorphology and expression of at least two of CD68, CD163, CD4 and lysozyme, no expression of CD20, CD3, CD30, no markers of Langerhans cell or follicular dendritic cell differentiation, and no known history of lymphoid malignancy or mediastinal germ cell tumor. One patient had a history of myelodysplastic syndrome with del(5q), but the cytogenetic aberration was not present in the HS. One case had insufficient material for a complete immunohistochemical panel but had an appropriate morphology and expressed histiocytic markers. The histologic and immunophenotypic features and clonality characteristics of the 21 cases are detailed in Figure 1 and Supplementary Table S3. Immunohistochemistry. Additional immunohistochemical studies were performed using antibodies to CD68 (KP-1, predilute; Roche, Indianapolis, IN, USA), CD163 (MRQ-26, predilute; Cell Marque, Rocklin, CA, USA), lysozyme (EC.3.2.1.17, 1:2000; Dako, Santa Clara, CA, USA), CD4 (SP35, predilute; Roche), CD21 (1F8, 1:30; Dako), clusterin (41D, 1:5000; Millipore, Burlington, MA, USA), CD1a (EP3622, predilute; Roche), langerin (12D6, 1:200; Leica Biosystems, Buffalo Grove, IL, USA), CD30 (JCM182, predilute; Leica Biosystems), CD20 (L26, predilute; Roche), CD3 (2GV6, predilute; Ventana, Tucson, AZ, USA), S100 (15E2E2, 1:8000; BioGenex, Fremont, CA, USA), AE1/AE3 (AE1/AE3, 1:100; Dako) and Ki67 (MIB-1, 1:200; Dako) if required. Immunohistochemistry was performed on either a Ventana Benchmark Ultra automated immunostainer using UltraView detection (Ventana) or on a Leica Bond Max using Bond Polymer Refine detection systems (Leica Biosystems). The proliferation index obtained using Ki67 immunohistochemistry was the percentage of positive cells counted in three high power fields (400x). Images were taken using an Olympus Microscope Digital Camera Model DP71 on an Olympus BX50 microscope and cells were counted with the Cell Counter plugin in ImageJ (https://imagej.nih.gov/ij/index.html)1. Nucleic acid extraction. DNA and RNA were extracted from formalin-fixed paraffin embedded (FFPE) tissue sections using QIAGEN QIAamp DNA FFPE Tissue Kit and RNeasy FFPE Kit on a QIAcube robotic system according to the manufacturer’s protocol (QIAGEN, Germantown, MD, USA). Co-extraction, when performed, used a modified version of the QIAGEN AllPrep DNA/RNA FFPE protocol. 2 Immunoglobulin gene and T-cell receptor gene rearrangement studies. Polymerase chain reaction (PCR) for immunoglobulin gene (IGH and IGK loci) and T-cell receptor gene (TRG locus) rearrangements was performed according to methods described in detail elsewhere2, 3. To interrogate IGH rearrangements, laboratory designed consensus primers directed against VH 4, 5 framework II, VH framework III and the joining region (JH) were employed as described previously , or alternatively, commercially available BIOMED-2 primer sets to all three VH framework regions (Invivoscribe Technologies, San Diego, CA, USA) were used. IGK analysis was performed using the commercially available BIOMED-2 primer sets (Invivoscribe Technologies) and interrogated rearrangements involving the Vκ loci and Jκ (Tube A), the Vκ locus and the κDE locus (Tube B) and the κ-intron RSS locus and the κDE locus (Tube B)6 as described by the supplier. The TRG locus was assessed using a single multiplexed PCR reaction, with primers directed at all known Vγ family members and the Jγ1/2, JP1/2 and JP joining segments3. PCR products were separated by capillary electrophoresis on an ABI 3130xl Genetic Analyzer (Applied Biosystems, Foster City, CA, USA) and electropherograms were analyzed using GeneMapper software version 4.0. IGH/BCL2 (MBR) translocation. PCR was performed using assays modified from a previous study7 designed to detect translocations between the major breakpoint region (MBR) of BCL2 on chromosome 18q21 and the IGH joining region (JH) on chromosome 14q32. The primers/probe sequences used were: BCL2: 5’ –TTAGAGAGTTGCTTTACGTGGCCTG‐3’, JHa: 5’‐ ACCTGAGGAGACGGTGACC‐3’, BCL2 probe: FAM‐CACACAGACCCACCCAG. The PCR reaction was 50 μL in total volume composed of 25 μL QUANTITECT Master Mix Buffer 2X (QIAGEN), 2.5 μL of GAPDH primer/probe mix, 2.5 μL of IGH/BCL2 primer/probe mix and 15 μL nuclease-free water with a sample DNA input of 5 μL. The reaction was incubated at 95˚C for 15 minutes and run for 45 cycles of 95˚C for 45 seconds followed by 60˚C for 60 seconds. Whole exome sequencing. Library preparation was performed using Agilent SureSelectXT Human All Exon V5 + UTRs target enrichment kit (Agilent Technologies, Santa Clara, CA, USA). Sequencing was performed on either an Illumina HiSeq2500 with TruSeq V4 chemistry or on an Illumina HiSeq3000 with TruSeq V2 chemistry (Illumina, San Diego, CA, USA). Reads were trimmed using Trimmomatic v0.338 and mapped to the hs37d5 version of the human reference genome (ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/technical/reference/phase2_reference_assembly_seque nce/hs37d5.fa.gz) using BWA-MEM v07.159. Following alignment, BAM files were processed using Samtools v1.3 (http://www.htslib.org/)10 and duplicates were marked using Picard v2.1.1 3 (http://broadinstitute.github.io/picard/). GATK v3.5.011 was used to perform indel realignment and base recalibration. Read- and alignment-level quality control visualization was performed using MultiQC v1.1 (http://multiqc.info/)12 to aggregate QC metrics from FastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/), FastQ Screen (https://www.bioinformatics.babraham.ac.uk/projects/fastq_screen/), Picard, BamTools stats (http://github.org/pezmaster31/bamtools)13 and Trimmomatic. Variant calling was performed with MuTect 1.1.7 and MuTect214. A panel of normals, developed from ExAC (excluding the TCGA) r0.3.115 and the 1000 Genomes Project16 including only variants > 0.001 in frequency in the general population was used in cases without a matched germline. Tumor variants with an allele frequency of ≤ 0.05 and ≥ 0.99 were excluded. Variants were annotated using Ensembl VEP 9217, CADD GRCh37-v1.418 and the COSMIC database v88 (http://cancer.sanger.ac.uk)19. Identification of target genes. As matched germline samples were unavailable in most cases, we focused on genes with a known role in cancer to reduce the number of variants for assessment, an approach similar in concept to using a targeted sequencing panel. Genes were obtained from the COSMIC Cancer Gene Census v88 (http://cancer.sanger.ac.uk)20, excluding genes that were germline only or only associated with the terms ‘T’ (translocation), ‘F’ (fusion), ‘A’ (amplification) or ‘D’ (large deletions) and from a review of relevant literature (Supplementary Table S2), designed to also include genes associated with RASopathies21-24 and lymphoid malignancies25-49, as histiocytic sarcoma is known to harbor RAS pathway mutations and may be associated with B- and T-cell neoplasms. Genes from the exome data were included if they were mutated in ≥ 3 samples following application of the coverage, CADD score and minor allele frequency filters (described below) and verification of the mutations in the Integrative Genomics Viewer (IGV)50. The resulting targeted gene list (430 genes) is presented in Supplementary Table S9. Variant filtration. Synonymous mutations and variants in UTRs, intronic and intergenic locations and in the mitochondrial genome were excluded. Variants with < 6 reads and < 20x coverage by read proportions were also excluded. We applied a stringent filter based on population allele frequencies to remove as many potential germline events as possible,