RESEARCH ARTICLE

PAX5 is part of a functional transcription factor network targeted in lymphoid leukemia

Kazuki Okuyama1, Tobias Strid1,2, Jacob Kuruvilla1,2, Rajesh Somasundaram1, Susana Cristobal1, Emma Smith2, Mahadesh Prasad1, Thoas Fioretos3, Henrik Lilljebjörn3, Shamit Soneji2,3, Stefan Lang2, Jonas Ungerbäck2,4☯, Mikael Sigvardsson1,2,4☯*

1 Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden, 2 Division of Molecular , Lund University, Lund, Sweden, 3 Division of Clinical Genetics, Lund University, Lund, Sweden, 4 Lund Stemcell Center, Lund University, Lund, Sweden

☯ These authors contributed equally to this work.
* [email protected]

Abstract

One of the most frequently mutated proteins in human B-lineage leukemia is the transcription factor PAX5. These mutations often result in partial rather than complete loss of function of the transcription factor. While the functional dose of PAX5 has a clear connection to human malignancy, there is limited evidence that heterozygous loss of PAX5 has a dramatic effect on the development and function of B-cell progenitors. One possible explanation comes from the finding that PAX5 mutated B-ALL often displays complex karyotypes and additional mutations. Thus, PAX5 might be one component of a larger transcription factor network targeted in B-ALL. To investigate the functional network associated with PAX5 we used BioID technology to isolate proteins associated with this transcription factor in the living cell. This identified 239 proteins, several of which could be found mutated in human B-ALL. Most prominently we identified the commonly mutated IKZF1 and RUNX1, involved in the formation of ETV6-AML1, among the interaction partners. ChIP- as well as PLAC-seq analysis supported the idea that these factors share a multitude of target genes in human B-ALL cells. Gene expression analysis of mouse models and primary human leukemia suggested that reduced function of PAX5 increased the ability of an oncogenic form of IKZF1 or ETV6-AML1 to modulate gene expression. Our data reveal that PAX5 belongs to a regulatory network frequently targeted by multiple mutations in B-ALL, shedding light on the molecular interplay in leukemia cells.

OPEN ACCESS

Citation: Okuyama K, Strid T, Kuruvilla J, Somasundaram R, Cristobal S, Smith E, et al. (2019) PAX5 is part of a functional transcription factor network targeted in lymphoid leukemia. PLoS Genet 15(8): e1008280. https://doi.org/10.1371/journal.pgen.1008280

Editor: Jun J Yang, St. Jude Children's Research Hospital, UNITED STATES

Received: March 6, 2019
Accepted: July 2, 2019
Published: August 5, 2019

Copyright: © 2019 Okuyama et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability Statement: ChIP-, RNA-, ATAC- and PLAC-sequencing data generated for this paper have been deposited in GEO under the accession numbers GSE126375 for murine data and GSE126300 for data on the human cell line NALM6.

Funding: This work was supported by grants from the Swedish Cancer Society, the Swedish Childhood Cancer Foundation, the Swedish Research Council, including program grants to Stem Therapy and BioCare, Knut and Alice Wallenberg's Foundation, a donation from Henry Hallberg, and Lund as well as Linköping University, and by Lions forskningsfond (to TS). KO was sponsored by a stipend from the Japan Society for the Promotion of Science (JSPS). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Author summary

The use of modern high throughput DNA-sequencing has dramatically increased our ability to identify genetic alterations associated with cancer. However, while the mutations per se are rather easily identified, our understanding of how these mutations impact cellular functions and drive malignant transformation is more limited. We have explored the function of the transcription factor PAX5, commonly mutated in human B-lymphocyte leukemia, to identify a regulatory network of transcription factors often targeted in human disease. Hence, we propose that malignant conversion of B-lymphocyte progenitors involves multiple targeting of a central transcription factor network, aggravating the impact of the individual mutations. These data increase our understanding of how individual mutations collaborate to drive the formation of B-lineage leukemia.

PLOS Genetics | https://doi.org/10.1371/journal.pgen.1008280 August 5, 2019 1 / 22 PAX5 is part of a transcription factor network in B-ALL

Introduction

It is becoming increasingly clear that transcription factors essential for normal B-cell development are frequently mutated in B-lineage Acute Lymphoblastic Leukemia (B-ALL) [1]. One of the most common targets for genetic alterations in B-ALL is the transcription factor PAX5, targeted both by translocations generating fusion proteins [2, 3] and by partial inactivation by deletions [4–6] or point mutations [6, 7]. Mutations reducing PAX5 function are among the most common alterations detected in B-ALL, involving about one third of the malignancies [4, 5]. However, the findings that PAX5 deletion in human B-ALL is associated with complex karyotypes [4, 5] and that reduced function of PAX5 in mouse models generates a mild phenotype [8–10] indicate that PAX5 mutations collaborate with other oncogenic events to cause malignant transformation. This has been verified in mouse models where heterozygous deletion of Pax5 in combination with expression of constitutively active STAT5 [11] or partial inactivation of the Ebf1 gene [10] markedly increases leukemia formation.
In order to better understand the regulatory networks in B-ALL and to resolve how other genetic events may be functionally linked to PAX5 mutations, we have used BioID to identify collaboration partners for PAX5. BioID is based on the generation of a fusion between the factor of interest and a mutated form of the bacterial protein biotinylase BIRA (BIRA*) [12, 13]. This mutant enzyme lacks substrate specificity and covalently attaches biotin to any protein within about 10 nm of the fusion protein (Fig 1A). Biotinylated proteins are isolated and identified by MS/MS, allowing us to identify Proximity Interaction Partners (PXIs) for PAX5 in the living cell. Analysis of the Cosmic database (http://cancer.sanger.ac.uk/cosmic) [14] revealed that a substantial fraction of the PAX5 mutated tumors carried additional mutations in identified PXIs. Among these PXIs were RUNX1, involved in the 12;21 translocation generating the ETV6-RUNX1 fusion protein, and IKZF1, both commonly mutated together with PAX5 in B-ALL [4, 5]. Using Chromatin Immunoprecipitation (ChIP)-seq, proximity ligation assisted ChIP-seq (PLAC-seq) [15] and RNA-seq analysis we confirmed that PAX5, RUNX1 and IKZF1 share a large number of target genes and that a dominant negative form of IKZF1 or the ETV6-RUNX1 fusion protein acted collaboratively with heterozygous deletion of Pax5 to modulate gene expression. This suggests that the transformation process in B-ALL involves multiple mutations of genes that are part of a regulatory network, possibly causing an exacerbated effect on the transcriptional programming in the B-cell progenitor.

Results

PAX5 associates with regulators of transcription commonly mutated in human leukemia

To identify interaction partners for PAX5, we generated N-terminal fusions of BIRA* with a full length human PAX5 protein. A control protein was created by fusion of BIRA* to a SV40 nuclear localization signal (Fig 1A). The fusion proteins were ectopically expressed in a mouse Abelson virus transformed Pre-B cell line (230–238), after which biotinylated proteins were purified on streptavidin beads. Tandem mass spectrometry and bioinformatic analysis


Fig 1. PAX5 is part of a complex regulatory network of DNA binding proteins and co-regulators of transcription. Panel (A) displays a schematic drawing of the basic principle for PXI identification by BioID and the fusion proteins (PAX5-BIRA* and SV40NLS-BIRA*) used to resolve the interactome of PAX5 in mouse Pre-B cells. (B) Cytoscape (version 3.6.1) generated protein interaction map for PAX5 based on BioID data analyzed using the Trans-Proteomic Pipeline (TPP) software and SAINT Express v.3.3. A Bayesian FDR of 0.02 (corresponding to a SAINT score of ~0.80) was used as a cut-off to identify high confidence interactors. The red color of the node indicates a protein defined as involved in transcriptional regulation. The analysis is based on 3 biological and 2 technical replicates with BIRA* conjugated to NLS collapsed to the 2 highest spectral counts for each prey. (C) Data from de novo motif enrichment analysis (mm10, -size 200) for PAX5 sites in mouse 230–238 Pre-B cells. T/BG% indicates the frequency of targets enriched for the motif/the background frequency of the motif. Identified PXIs associated with enriched motifs are indicated. https://doi.org/10.1371/journal.pgen.1008280.g001

(see extended Materials and Methods) identified 239 high confidence (SAINT score above 0.8) PAX5 PXIs (Fig 1B, S1 Table). Among the identified PXIs we found six proteins previously identified as direct interaction partners [16], and STRING (https://string-db.org) based interactome analysis identified another 14 PXIs as linked to PAX5 (S1 Fig, S1 Table). Protein analysis through evolutionary relationships (PANTHER) (http://pantherdb.org/geneListAnalysis.do) analysis (S2 Fig), as well as Gene Ontology (GO) analysis performed with PANTHER14.0 Database for Annotation (S1 Table), identified a large number of PXIs as sequence specific DNA binding proteins or transcriptional co-factors. Several PXIs constituted components of complexes previously linked to PAX5, including the SWI/SNF activator- [16] as well as the Mi-2/NuRD-complexes [17] and NCOR1 [16]. Because multiple PXIs represented DNA binding transcription factors, we analyzed PAX5 ChIP-seq data from 230–238 Pre-B cells by de novo motif enrichment analysis (Fig 1C). Among the top 10 enriched motifs, two were identified as PAX binding sites. However, an additional six top ranked motifs represented putative binding sites for TFs identified as PXIs, providing an independent line of support that these proteins indeed share regulatory elements with PAX5. Thus, BioID analysis suggests that PAX5 is part of a complex network of transcriptional regulators in early B-cell progenitors.

To unravel a potential involvement of PAX5 PXIs in human leukemia we used information from the Cosmic cancer database (Cancer Gene Census v76) (http://cancer.sanger.ac.uk/cosmic) [14] to estimate the mutation frequency of PAX5 PXIs in human hematopoietic malignancies. Genes encoding PAX5 PXIs covered approximately 7% of the mutations reported (Fig 2A), involving 15% of the PAX5 associated proteins (Fig 2A). These included IKZF1, ARID1A, KMT2D, STAT3 and PAX5 itself (Fig 2B). The PXI mutations were enriched in lymphoid and NK lineage malignancies as compared to myeloid leukemias (S3 Fig), indicating a degree of lineage selectivity. To resolve if mutations in PXIs are linked to genetic alterations in PAX5 we extracted data from PAX5 mutated hematological malignancies. The vast majority of these tumors were classified as B-ALL or ALL (Fig 2C). Several of these PAX5 mutated ALL cells carried additional mutations (Fig 2D). Among these we identified 5 genes encoding identified PAX5 PXIs (Fig 2D). Even though IKZF1 was the most prominently co-mutated PXI, we did identify tumors carrying combined PAX5 and FOXO1, EBF1, KMT2D or ARID1A mutations (Fig 2D). These data suggest that the transformation process may involve targeting of multiple proteins belonging to the same regulatory network.
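The SAINT score cut-off used to define high confidence PXIs can be illustrated with a short sketch. The Python below is a minimal illustration and not the SAINT Express implementation: it assumes each prey has already been assigned a posterior probability of being a true interactor, and it estimates the Bayesian FDR at a threshold as the mean of (1 - probability) over the preys that pass. The prey names and scores are hypothetical.

```python
def bayesian_fdr(probabilities, threshold):
    """Estimate the Bayesian FDR among preys scoring at or above `threshold`.

    The estimate is the average probability of being a false interactor,
    (1 - p), over all preys that pass the threshold.
    """
    selected = [p for p in probabilities if p >= threshold]
    if not selected:
        return 0.0
    return sum(1.0 - p for p in selected) / len(selected)


def high_confidence(prey_scores, threshold=0.8):
    """Return the sorted names of preys whose score is at or above `threshold`."""
    return sorted(name for name, p in prey_scores.items() if p >= threshold)


# Hypothetical prey scores, for illustration only.
example_scores = {
    "prey_a": 0.99,
    "prey_b": 0.95,
    "prey_c": 0.85,
    "prey_d": 0.60,
    "prey_e": 0.10,
}

if __name__ == "__main__":
    passing = high_confidence(example_scores)
    fdr = bayesian_fdr(example_scores.values(), 0.8)
    print(passing, round(fdr, 3))
```

Raising the threshold lowers the estimated FDR at the cost of fewer retained preys, which is the trade-off a fixed cut-off settles.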

PAX5 is part of a functional transcription factor network

While the analysis of the Cosmic data revealed that IKZF1 was the most commonly mutated PXI (Fig 2D), PAX5 mutations are frequently detected in combination with the 12;21 translocation [4, 5]. This generates a fusion protein between ETV6 and the DNA binding domain of


Fig 2. PAX5 PXIs are frequently mutated in combination with PAX5 in human leukemia. (A) Pie charts showing the fraction of PAX5 PXIs (list of 239 entries) found mutated in the public cancer database COSMIC (Cancer Gene Census v76) from the Sanger Institute Catalogue Of Somatic Mutations In Cancer web site (http://cancer.sanger.ac.uk/cosmic) [14]. The left chart displays the frequency of mutated PXIs of all reported mutations (a total of 449 genes) in hematological and lymphoid malignancies while the right chart displays the fraction of PAX5 PXIs found mutated in the same malignancies. The diagram in panel (B) displays the number of PAX5 and PAX5 PXI mutations identified in hematological malignancies in the database. Panel (C) displays a pie chart presenting the clinical classification of tumors carrying PAX5 mutations while panel (D) displays a heatmap presenting the mutational spectra of 91 unique PAX5 mutated hematological and lymphoid tumors from the Cosmic database carrying reported co-mutations. A red dot indicates that the mutated gene encodes a PAX5 PXI. https://doi.org/10.1371/journal.pgen.1008280.g002

RUNX1 (ETV6-RUNX1) suggested to act as a repressor at elements normally controlled by RUNX1 [18–20]. To explore the extent to which IKZF1, RUNX1 and PAX5 share regulatory elements we performed ChIP-seq analysis in 230–238 mouse Pre-B cells. This identified just over 21,000 binding sites for PAX5, of which 65% overlapped with either IKZF1, RUNX1 or both (Fig 3A). Furthermore, 46% of the IKZF1 binding sites and 45% of the RUNX1 sites overlapped with binding of PAX5. Shared binding to target elements in the Igll1 and CD79a promoters could also be verified using ChIP-QPCR (S4A Fig). De novo motif enrichment analysis of peak positions identified enrichment of the expected motif as well as that of putative binding sites for the other factors of this putative network (S4B Fig). In the case of IKZF1, the enriched motif was identified as an ETS protein binding site, as previously reported [21]. Annotation of shared and unique binding sites to specific targets identified 7046 genes (S2 Table Metadata) that could be assigned to only one of the categories of overlapping or unique binding defined (Fig 3A). GO-term analysis of single, double, or triple bound genes failed to reveal any strong enrichment of genes encoding proteins belonging to any defined biological process (S3 Table). However, genes linked to binding of all three factors were enriched (Benjamini-Hochberg p.adj-value ≤ 0.05) for genes encoding proteins involved in cell cycle and DNA repair as well as protein transport (S3 Table). To explore the expression patterns of the genes targeted by single or multiple binding of PAX5, RUNX1 and/or IKZF1 in the hematopoietic system in mice, we uploaded the unique gene sets defined in S2 Table to Gene Expression Commons (https://gexc.riken.jp). As expected, Ikzf1 and Runx1 were expressed in multiple lineages and stages of development (S5A Fig) while the expression of Pax5 as well as a set of known PAX5 target genes was restricted to B-lineage cells (S5A and S5B Fig). The expression of genes bound only by IKZF1 was broadly detected in the hematopoietic system (S5B Fig). In contrast, expression of genes linked to binding of only RUNX1 was limited, and the PAX5 unique gene signature was virtually not expressed in the hematopoietic system (S5B Fig). In sharp contrast, we noted high and broad expression of gene sets associated with binding of more than one factor (S5C Fig). These data support the idea that PAX5, IKZF1 and RUNX1 are part of a regulatory network in hematopoiesis.

While IKZF1 and RUNX1 are commonly mutated in human malignancies, other PXIs could not be detected as targeted together with PAX5 in our dataset (Fig 2B). These included the Ets-protein FLI1 [22], reported to share binding sites with IKZF1 in B-cell progenitors [21]. In order to explore if PAX5 is part of other regulatory networks not directly targeted in leukemia, we performed ChIP-seq experiments to identify FLI1 binding sites in 230–238 mouse Pre-B cells. While the majority of the IKZF1 binding sites were shared with FLI1, a defined set of sites was unique for the individual factors (S4D Fig). Annotating the binding sites to genes (S4 Table) followed by GO-term analysis of genes unique to only one binding category (S5 Table) suggested that the genes bound by PAX5, IKZF1 and FLI1 were enriched for functions related to regulation of DNA repair and proliferation. The genes annotated to sites unique for combined PAX5-FLI1 binding were broadly expressed (S5D Fig) and coded for proteins involved in


Fig 3. PAX5 is a coordinator of a transcription factor network in pro-B cells. Panel (A) displays a Venn diagram of ChIP-seq peaks based on PAX5, IKZF1 and RUNX1 ChIP-seq analysis using the 230–238 Pre-B cell line. Peaks were called using the HOMER platform (findPeaks -style factor) and resulting files were filtered for peaks with ≥15 normalized tags. Overlapping peaks were identified using the mergePeaks command in HOMER. (B-C) Heat maps displaying overlapping PAX5 and IKZF1 or RUNX1 binding sites as defined by ChIP-seq analysis of in vitro expanded Wt Pro-B cells. Peaks were identified using HOMER (findPeaks -style factor) and filtered for ≥15 (PAX5, RUNX1) or ≥10 (IKZF1) normalized tags. Overlapping peaks were identified using the mergePeaks command in HOMER. ChIP-seq and ATAC-seq (GSE92434) [43] signals were visualized in heatmaps with data obtained from primary in vitro expanded Wt or Pax5-/- mouse Pre-B cells as indicated. Heatmaps were clustered in Cluster3 using average linkage uncentered correlation. (D) Venn diagrams displaying results from RNA-seq experiments from primary in vitro expanded FL-Wt and Pax5+/- Pre-B cells transduced with retroviruses encoding either a dominant negative form of IKZF1 (IKZF1-dn), an ETV6-RUNX1 or a control vector (pMIG). Differentially expressed genes (FDR ≤ 0.05, fold change ≥ 2) were identified using HOMER as described in Materials and Methods. (E) Pie charts displaying the fraction of genes differentially expressed in ETV6-RUNX1 or IKZF1dn as compared to MIGR1 transduced Pax5+/- cells that can be defined as direct target genes for the transcription factors. Definition of target genes was based on proximity annotation of IKZF1 and RUNX1 peak files merged with PAX5 peaks from 230–238 Pre-B cells to identify IKZF1 and PAX5 or RUNX1 and PAX5 co-bound genes. https://doi.org/10.1371/journal.pgen.1008280.g003

DNA-repair (S5 Table). This indicates that PAX5 may be part of several defined transcription factor networks.

The extensive degree of overlapping binding of a lineage and stage specific factor such as PAX5 and broadly expressed proteins such as IKZF1 and RUNX1, expressed earlier in the developmental trajectory (S5A Fig), opens the possibility that the early factors act as molecular beacons targeting PAX5 to regulatory elements. In order to explore the interplay between these factors we performed ChIP-seq analysis targeting PAX5, IKZF1 and RUNX1 using in vitro expanded primary Wt or Pax5-/- mouse Pre-B cells. While the majority of the shared binding sites were occupied by transcription factors in the absence of PAX5 (Fig 3B and 3C), a finding verified by ChIP-QPCR analyzing binding to the Igll1 promoter (S4C Fig), a subgroup of sites was not bound by IKZF1 and/or RUNX1 in Pax5-/- Pre-B cells. Assay for Transposase-Accessible Chromatin (ATAC-seq) [23] analysis revealed that sites with reduced IKZF1 or RUNX1 binding in Pax5-/- cells displayed low epigenetic accessibility as compared to Wt cells (Fig 3B and 3C). Hence, PAX5 has the ability to target IKZF1 and RUNX1 to a subset of epigenetically silent regulatory elements.

In order to investigate the functional interplay between PAX5, RUNX1 and IKZF1 we studied the impact of combined heterozygous deletion of Pax5 in mouse Pre-B cells and expression of ETV6-RUNX1 or a functionally impaired IKZF1 protein (IKZF1DN) lacking all four Zn-fingers in the DNA binding domain, which is commonly observed in B-ALL [24, 25]. Comparative analysis of gene expression in Wt as compared to Pax5+/- mouse Pre-B cells transduced with control vector (pMIG) identified 809 downregulated genes (Fig 3D, S6 Table). In addition to Pax5 itself, we detected reduced expression of several genes encoding proteins identified as PAX5 PXIs. These included transcription factors such as Ebf1, Lef1, Foxo1 as well as Ikzf3, suggesting PAX5 to be directly involved in the regulation of co-factor expression. We noted no significant downregulation of other classical PAX5 target genes such as Cd19 or Cd79a, indicating that B-cell identity is maintained despite reduced expression of several important transcription factors. In total, 1275 genes were upregulated in Pax5+/- as compared to Wt cells and GO-analysis revealed a significant enrichment of genes associated with "immune system process" including Tlr1, Myd88 and Icosl (S7 Table). Ectopic expression of ETV6-RUNX1 in Wt Pre-B cells caused significant reduction in the expression of 70 genes, and combined deletion of one allele of Pax5 and ectopic expression of ETV6-RUNX1 resulted in the repression of 540 genes. Additionally, we detected upregulation of 84 genes upon expression of ETV6-RUNX1 in Wt mouse Pre-B cells and more than 900 genes in Pax5+/- cells. GO-analysis identified genes involved in "immune system processes" as enriched among both up- and downregulated ETV6-RUNX1 responsive genes (S7 Table), possibly reflecting a disruption of the differentiation process.


Ectopic expression of IKZF1DN in Wt Pre-B cells resulted in downregulation of 112 genes and increased expression of 71 genes, while expression in Pax5+/- cells reduced the mRNA levels of 184 genes and increased those of 332 genes as compared to the control cells (Fig 3D, S6 Table). Even though we were able to identify a few genes such as Il2ra, Ifi30 and Aim2 associated with the GO-term "immune system processes" as upregulated upon IKZF1DN expression, the most striking GO-term enrichment was detected for genes classified as involved in "cell adhesion and cell migration" being downregulated in response to expression of the truncated IKZF1 protein (S7 Table). By proximity annotation of ChIP-seq peaks we detected combined PAX5 and RUNX1 binding at over 50% of the genes modulated by ETV6-RUNX1 expression in Pax5+/- mouse Pre-B cells (Fig 3E). The corresponding figure for IKZF1 was close to 25%. Despite the dramatic changes in gene expression patterns, we were unable to detect any short term (4–5 weeks) malignant expansion of these cells after transplantation. Hence, combined reduction of PAX5 dose and expression of oncogenic variants of RUNX1 or IKZF1 resulted in a synergistic modulation of gene expression, in line with the idea that these factors constitute a functional network.
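The overlap percentages reported in this section rest on two steps: filtering peaks by normalized tag count and intersecting the resulting peak sets. The Python sketch below is a simplified stand-in for that logic and not HOMER's mergePeaks. It assumes peaks are (chromosome, start, end, tags) tuples with half-open coordinates and counts any shared base pair as an overlap. All coordinates and tag counts are made up.

```python
from collections import defaultdict


def filter_peaks(peaks, min_tags):
    """Keep peaks whose normalized tag count is at least `min_tags`."""
    return [peak for peak in peaks if peak[3] >= min_tags]


def fraction_overlapping(query, reference):
    """Return the fraction of `query` peaks sharing at least 1 bp with `reference`."""
    if not query:
        return 0.0
    # Group reference intervals by chromosome for lookup.
    by_chrom = defaultdict(list)
    for chrom, start, end, _tags in reference:
        by_chrom[chrom].append((start, end))
    hits = 0
    for chrom, start, end, _tags in query:
        # Two half-open intervals overlap when each starts before the other ends.
        if any(start < r_end and r_start < end for r_start, r_end in by_chrom[chrom]):
            hits += 1
    return hits / len(query)


# Hypothetical peak sets, for illustration only.
pax5_peaks = [("chr1", 100, 300, 20), ("chr1", 1000, 1200, 30), ("chr2", 50, 250, 5)]
ikzf1_peaks = [("chr1", 250, 450, 12), ("chr2", 60, 200, 40)]

if __name__ == "__main__":
    strong_pax5 = filter_peaks(pax5_peaks, min_tags=15)
    print(fraction_overlapping(strong_pax5, ikzf1_peaks))
```

The linear scan per query peak is adequate for a small example; for tens of thousands of peaks a sorted sweep or an interval tree would be the more usual choice.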

PAX5, IKZF1, RUNX1 and EBF1 form a genetic network in human B-ALL cells

Having established that PAX5 is part of a functional regulatory network during B-cell development in mice, we wanted to explore the potential collaborative actions of PAX5 and the PXIs IKZF1, RUNX1 as well as EBF1 in human B-ALL cells. To this end we performed ChIP-seq analysis on chromatin from the human B-ALL cell line NALM6 (Fig 4A). We detected overlapping binding of IKZF1 and/or RUNX1 at about 70% of the identified PAX5 binding sites, and de novo motif enrichment analysis identified binding sites for PAX as well as IKZF1, EBF1 and RUNX1 (S6 Fig). Proximity annotation of TF binding sites revealed that as many as 2645 genes were linked to binding of all four proteins in this human B-ALL cell line.

To explore how the activities of PAX5, IKZF1, RUNX1 and EBF1 are coordinated in the context of the chromatin conformation in human B-ALL cells we performed proximity ligation assisted ChIP-seq (PLAC-seq) analysis [15]. This method combines chromatin capture technology with ChIP-seq to enrich for chromatin interactions associated with binding of a specific protein or a unique protein modification. In order to identify active regulatory elements in NALM6 cells we used ChIP antibodies targeting promoter associated H3K4-trimethylation (H3K4Me3) or H3K27-acetylation (H3K27Ac), marking transcriptionally active elements including enhancers [26, 27]. This allowed us to identify distal interactions (FDR ≤ 0.05; see Methods) between regions separated by at least 10 kb in the NALM6 genome with a 5 kb resolution.

The complexity of promoter-enhancer interactions is exemplified by analysis of interactions between the human PAX5 promoter and distal regions (Fig 4B), identifying multiple interactions with elements in the ZCCHC7 gene. ATAC-seq analysis suggested these distal regions to be epigenetically accessible, and the low level of H3K4Me3 signal in combination with high levels of H3K27Ac (Fig 4B) supports the idea that several of these regions represent active enhancer elements [26, 27]. Analysis of ChIP-seq data revealed that multiple regions, including the PAX5 promoter, are bound by IKZF1, RUNX1 and EBF1 (Fig 4B). Hence, the PAX5 gene is a target for both autoregulation and potential modulation by RUNX1, IKZF1 and EBF1. Interactions between distal regions at the EBF1 (S7A Fig), IKZF1 (S7B Fig) or RUNX1 genes (S7C Fig) and their promoters identified multiple regions with binding of the transcription factors. This suggests that these four TFs create a regulatory network where these


Fig 4. PAX5 is a transcriptional network key regulator in human leukemia cells. Panel (A) displays a 4-part Venn diagram based on ChIP-seq experiments targeting PAX5, EBF1, RUNX1 and IKZF1 in the human PreB-ALL cell line NALM6. Overlapping ChIP-seq peaks were identified with the mergePeaks command in HOMER. (B) Visualization of H3K4me3 and H3K27ac anchored PLAC-seq interactions at the PAX5 gene in NALM6 B-ALL cells. Binding of EBF1, PAX5, RUNX1 and IKZF1 as well as H3K4me3 and H3K27ac chromatin marks was determined by ChIP-seq. ATAC-seq was used to determine chromatin accessibility. The data were displayed using the WashU Genome Browser. (C) Chord diagram based on H3K4me3 anchored PLAC-seq from NALM6 displaying interactions between TSS and distal elements. PAX5-EBF1-RUNX1-IKZF1 gene networks anchored distal-to-TSS (transcription start site) are indicated in the diagram. Chromatin loops were filtered for overlapping PAX5, EBF1, RUNX1 or IKZF1 ChIP-seq peaks in either one or both anchor-points (for details, see Methods) with the constraints that one anchor-point had to be within 2.5 kb from a TSS and the other had to be located more than 2.5 kb away from a TSS (distal binding). The chord diagram was visualized with the circlize package in R. https://doi.org/10.1371/journal.pgen.1008280.g004

proteins not only share target genes, but also regulate each other's expression via inter- and auto-regulatory loops.

The ChIP-seq analysis supported the idea that PAX5, IKZF1, RUNX1 and EBF1 share binding at a substantial set of putative regulatory elements in the human Pre-B cell genome (Fig 4A). In order to investigate the interplay between these TFs over longer distance, we


identified interacting regions bound by one or several of the TFs from the PLAC-seq data. Based on H3K27Ac PLAC-seq data, the most common category of interacting elements in the human Pre-B cell genome bound all four TFs at the same region (anchor point (ap)1 or ap2) (S8A Fig). The second most common category contained functional binding sites for all four factors at both the interacting elements. Even though binding of IKZF1 alone to one or the other element was the most commonly detected variant in the DNA-interactome defined by the H3K4Me3 PLAC-seq data (S8B Fig), shared binding of all factors to the same element was prominent also in this data set. To explore the nature of the long-range interactions in more detail we annotated regions within 2.5 kb of a transcriptional start site (TSS) as promoters. The H3K27Ac PLAC-seq data indicate that the majority of the interactions detected in the human Pre-B cell genome were generated through chromatin loops between non-TSS containing elements (S8D Fig). In the H3K4Me3 based data, TSS containing elements were, as expected, enriched and involved in the majority of the detected interactions (Figs 4C and S8C). Binding of all TFs was detected at both promoters and enhancers without any clear specific preference. The major part of the PAX5, IKZF1, RUNX1 or EBF1 bound promoters interacted with enhancers lacking detectable binding of any of these factors. The presented data do, however, support the general idea that PAX5 is part of a regulatory network involving IKZF1, RUNX1 and EBF1 in human B-ALL cells.
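The promoter annotation used for the long-range interactions can be sketched as a small classifier. The Python below is an illustrative reading of that rule and not the authors' pipeline: an anchor counts as TSS-proximal if its midpoint lies within 2.5 kb of any TSS on the same chromosome, and each loop is then labelled by how many of its two anchors are proximal. Measuring from the anchor midpoint is an assumption, and the coordinates are hypothetical.

```python
TSS_WINDOW = 2500  # bp; regions within this distance of a TSS are called promoters


def near_tss(anchor, tss_by_chrom, window=TSS_WINDOW):
    """Return True if the anchor midpoint lies within `window` bp of a TSS."""
    chrom, start, end = anchor
    midpoint = (start + end) // 2
    return any(abs(midpoint - tss) <= window for tss in tss_by_chrom.get(chrom, []))


def classify_loop(anchor1, anchor2, tss_by_chrom):
    """Label a chromatin loop by the promoter status of its two anchors."""
    first = near_tss(anchor1, tss_by_chrom)
    second = near_tss(anchor2, tss_by_chrom)
    if first and second:
        return "TSS-TSS"
    if first or second:
        return "distal-TSS"
    return "distal-distal"


# Hypothetical TSS position and loop anchors, for illustration only.
example_tss = {"chr9": [37_000_000]}
promoter_anchor = ("chr9", 36_997_500, 37_002_500)
enhancer_anchor = ("chr9", 37_100_000, 37_105_000)
far_anchor = ("chr9", 37_300_000, 37_305_000)

if __name__ == "__main__":
    print(classify_loop(promoter_anchor, enhancer_anchor, example_tss))
    print(classify_loop(enhancer_anchor, far_anchor, example_tss))
```

With anchors binned at 5 kb, a 2.5 kb window around the midpoint roughly corresponds to asking whether the TSS falls inside the anchor bin.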

Mutations of PAX5 and IKZF1 or formation of ETV6-RUNX1 impact target gene expression in primary human leukemia cells Having identified an extensive functional interplay between PAX5, RUNX1 and IKZF1 we wanted to determine how functional perturbations to this regulatory network impact the expression of direct target genes in primary human B-ALL. In order to link binding of PAX5, EBF1, IKZF1 and RUNX1 to regulatory elements associated with a given gene we used a com- binatorial approach of proximity analysis (ChIP-seq) and H3K4Me3-PLAC-seq defined inter- actions. Because our chromatin configuration analysis allowed for a resolution of 5 kb we assigned TF binding peaks within 2500 base pairs of a TSS as involved in the regulation of the gene defined by the TSS. Elements located at larger distances from the TSS were only assigned to the gene if they were defined as TSS proximal regions in the H3K4Me3 PLAC-assay from NALM6 cells. This approach should increase the precision of the annotation of transcription factor binding sites to defined genes. Next, we analyzed the expression of these putative target genes in an RNA-seq data set con- taining 264 primary B-ALL samples with known mutational status of PAX5 and IKZF1 as well as he presence of ETV6-RUNX1 [6, 28]. To explore the impact of transcription factor muta- tions on target gene expression we extracted information about mutational status of PAX5, IKZF1 and RUNX1 (ETV6-RUNX1) and classified the tumors as single or double mutants accordingly (S8 Table). Tumors with unknown status for any of these genetic abnormalities were excluded from the analysis. Expression levels of direct transcriptional targets in PAX5 mutated (PAX5M) samples were compared to normal PAX5 (PAX5WT) tumor samples. This identified 56 significantly upregulated and 93 downregulated (p.adj � 0.05) PAX5 target genes (Fig 5A). 
Considering the large number of binding sites for PAX5 in the genome, only a small fraction of the putative target genes displayed detectable sensitivity to reduced function of PAX5. To study whether re-expression of PAX5 in a PAX5-mutant human B-ALL cell would impact the expression of genes identified as downregulated in the patient samples, we took advantage of an RNA-seq dataset generated from REH cells [29]. These cells harbor a mutated PAX5 gene and carry a tetracycline-inducible PAX5-encoding construct [29] to allow for

PLOS Genetics | https://doi.org/10.1371/journal.pgen.1008280 August 5, 2019 11 / 22 PAX5 is part of a transcription factor network in B-ALL

Fig 5. Mutations in B-lineage transcriptional regulators affect expression of their target genes in human leukemia. The figure displays a differential expression analysis in a cohort of 264 B-ALL samples and 13 samples from normal HSC (CD34-) or B-cell progenitor (Pro-B, Pre-B or Immature B) populations as indicated. PAX5, RUNX1 and/or IKZF1 target genes are defined in the human B-ALL cell line NALM6 using ChIP- and PLAC-sequencing (for details see Methods). Boxplots describe the mean log2 RPKM gene expression for each sample category as defined by differential genes between mutated and unmutated samples (A-C): (A) expression of PAX5-bound genes in PAX5-mutated vs unmutated leukemia cells, (B) RUNX1-bound genes in RUNX1-mutated vs unmutated B-ALL, and (C) IKZF1-bound genes in IKZF1-mutated vs unmutated B-ALL. Panels (D, E) display boxplots of genes co-bound by (D) PAX5-RUNX1 or (E) PAX5-IKZF1 that are differentially expressed in PAX5 single-mutated vs double-mutated tumor cells. n = the number of differential genes between mutated and unmutated samples (A-C) or between double-mutated and PAX5 single-mutated samples (D, E). Mann-Whitney U-test p-values are shown. https://doi.org/10.1371/journal.pgen.1008280.g005

controlled expression of the factor. 86 genes identified as targeted by PAX5 binding in NALM6 cells and downregulated in PAX5-mutated primary B-ALL were found to be more PAX5-responsive than genes that were PAX5-unbound, or bound but not downregulated, in primary B-ALL (S8E Fig). This suggests that the approach we have taken allows us to identify relevant PAX5 target genes in primary patient samples. Gene expression analysis of RUNX1 target genes in ETV6-RUNX1-positive tumors identified 799 up- and 1362 down-regulated genes (Fig 5B), supporting the idea that this fusion protein indeed targets RUNX1-regulated genes. IKZF1-mutated cells had higher expression of 93 and lower expression of 236 direct target genes (Fig 5C) out of the 6138 IKZF1-bound genes expressed in B-lineage cells. Hence, while ETV6-RUNX1 expression in primary human leukemia cells is associated with altered expression of a substantial fraction of the RUNX1 target genes, reduced function of IKZF1 or PAX5 had limited impact on the expression of their target genes. GO analysis based on the differentially expressed genes linked to TF mutations revealed little evidence that these changes would result in dramatic alterations in cellular function. A significant enrichment (Benjamini-Hochberg p.adj ≤ 0.05) was detected for GO terms linked to cell proliferation and metabolism among the upregulated RUNX1 target genes, while the downregulated genes were enriched for GO terms linked to protein transport and transcription (S9 Table). In general, among the genes identified as differentially expressed within the different categories, a large proportion consisted of transcription factors and epigenetic modulators (S8 and S9 Tables), indicating that disruptions in these transcription factor networks may be part of a general mechanism of action.
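The Benjamini-Hochberg adjustment behind the p.adj ≤ 0.05 threshold can be sketched in a few lines of Python. This is a generic illustration of the procedure, not the DAVID or DESeq2 implementation used in the study; the function name and the p-values below are made up for the example.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so that adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Made-up p-values for five hypothetical GO terms.
toy_p = [0.001, 0.008, 0.039, 0.041, 0.20]
toy_adj = benjamini_hochberg(toy_p)
print([p <= 0.05 for p in toy_adj])  # [True, True, False, False, False]
```

In this toy case two terms with raw p-values below 0.05 (0.039 and 0.041) no longer pass after adjustment, which is the behaviour the correction is meant to provide when many terms are tested at once.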
To assay whether shared target genes displayed a synergistic impact of combined PAX5 and RUNX1 loss, we used ChIP-seq data and binding site annotation as above to identify 3362 shared target genes expressed in the B-lineage cells. Analyzing the expression levels of these in PAX5 mutated as compared to PAX5M/ETV6-RUNX1 tumors identified 91 up- and 237


down-regulated genes (Fig 5D). Comparing expression of these genes in ETV6-RUNX1-carrying tumors with normal PAX5 genes to that in double-mutant tumors revealed a significant difference in expression, supporting the idea that the two targeting events are associated with an exacerbation of the impact on gene expression. Although GO analysis did not detect statistically significant enrichment of any specific GO term describing cellular processes, we noted downregulation of CDKN1A as well as changes in genes involved in the regulation of cell cycle and (S9 Table). We were also able to detect similar effects on a low number of genes (4 up- and 55 down-regulated) in PAX5-IKZF1 targeted cells (Fig 5E). Hence, even though the mutations in this regulatory network impact the expression of a subset of target genes, they do not cause a collapse of stage- and lineage-specific programs, a finding well in line with the stable cellular state of pre-B-ALL cells.
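The target-gene annotation used throughout this section (peaks within 2.5 kb of a TSS are assigned by proximity, more distant peaks only through an H3K4Me3 PLAC-seq interaction, and remaining distal peaks are discarded) can be sketched as follows. This is a simplified illustration under stated assumptions, not the HOMER and R scripts used in the study; the function name, the input layout and the gene names GENE_A and GENE_B are hypothetical.

```python
PROXIMAL_BP = 2500  # distance cut-off around a TSS, as stated in the text


def assign_peaks(peaks, tss, loops, proximal_bp=PROXIMAL_BP):
    """Assign TF peaks to genes by TSS proximity or through a PLAC-seq loop.

    peaks: list of (chrom, position) peak centres
    tss:   dict mapping gene name -> (chrom, TSS position)
    loops: list of ((chrom, start, end), gene); the interval is the distal
           anchor of a loop whose other anchor is the promoter of `gene`
    """
    assigned = {}
    for chrom, pos in peaks:
        nearby = [
            (abs(pos - t), gene)
            for gene, (c, t) in tss.items()
            if c == chrom and abs(pos - t) <= proximal_bp
        ]
        if nearby:
            genes = [min(nearby)[1]]  # proximal peak: closest gene only
        else:
            genes = sorted(
                {gene for (c, s, e), gene in loops if c == chrom and s <= pos <= e}
            )
        if genes:  # distal peaks without a loop are dropped
            assigned[(chrom, pos)] = genes
    return assigned


# Toy input (hypothetical genes and coordinates).
toy_tss = {"GENE_A": ("chr1", 10_000), "GENE_B": ("chr1", 200_000)}
toy_loops = [(("chr1", 100_000, 105_000), "GENE_B")]
toy_peaks = [("chr1", 11_000), ("chr1", 102_000), ("chr1", 500_000)]
print(assign_peaks(toy_peaks, toy_tss, toy_loops))
```

In the toy example the first peak is assigned to GENE_A by proximity, the second to GENE_B through the loop, and the third is discarded because it is neither TSS-proximal nor part of an interaction.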

Discussion

The fact that PAX5 is a frequent target for mutations in B-ALL indicates a central role in the malignant conversion of B-lineage cells [3–7, 30]. Considering its role in normal B-cell differentiation in mice [8], it has been suggested that functional impairment of PAX5 causes a developmental arrest. This idea is supported by the findings that mutant forms of human PAX5 are functionally impaired [6, 7] and that restoration of PAX5 expression in a human leukemia cell allows for a progressive maturation of the B-ALL-like cells and loss of the malignant phenotype [29]. However, a complete block of differentiation, as imposed by RAG deficiency, does not result in leukemia formation in collaboration with activated STAT5 as efficiently as heterozygous loss of Pax5 in mouse models [11]. Furthermore, it has been reported that PAX5 acts as a metabolic gatekeeper, so that partial inactivation results in increased metabolic activity in human B-cell progenitors, likely promoting a malignant state [31]. In addition, PAX5 acts as a critical regulator of cell identity, and reduced function results in lineage plasticity in normal as well as malignant B-lineage cells from both mice and humans [32–34]. These findings highlight the complex function of this transcription factor in normal and malignant B-cell development and expose our limited understanding of its mechanism of action in leukemogenesis, which likely extends far beyond impaired differentiation.

Although the PAX5 gene is involved in a multitude of oncogenic events in human malignancies [2, 3], the most common alteration is partial inactivation of the gene [4–7], suggesting that the normal function of PAX5 is dose dependent.
It does, however, appear as if the dose per se does not dramatically impact the formation of CD19+ cells in mice [8, 9] or induce leukemia unless combined with additional oncogenic events, such as expression of a constitutively active STAT5 [11] or heterozygous deletion of Ebf1 [10], in mouse models. The combined effect of STAT5 activation and PAX5 deficiency is well in line with the observation that activating mutations in the IL7 or TSLP signaling pathways are frequently observed in human B-ALL [35, 36]. The data presented in this paper suggest additional mechanisms by which multiple oncogenic events may synergize to drive malignant transformation in PAX5-mutated B-ALL. Analysis of the PAX5 PXIs, all representing putative collaboration partners for the protein, suggests that they are frequently mutated in human B-ALLs carrying PAX5 mutations (Fig 2D). Hence, the transcriptional network around PAX5 can be targeted by multiple mutations that may serve to aggravate the impact on gene expression. It is also notable that mutations in the identified PXIs appear to be enriched in lymphoid malignancies (S3 Fig). Even though we did not observe direct malignant transformation by the combination of ectopic expression of ETV6-RUNX1 or oncogenic forms of IKZF1 and heterozygous deletion of Pax5 in normal mouse Pre-B cells, this had a profound impact on target gene


expression. Based on this, we believe that multiple targeting of proteins involved in the same regulatory network may be an important part of the oncogenic process. One general idea of the malignant conversion process is that initial mutations arise in early progenitor cells and that additional mutations accumulate during the differentiation process, resulting in the generation of fully developed leukemia in lineage-restricted progenitors [37]. This model is well in line with our data; however, the ability of PAX5 to direct factors such as RUNX1 and IKZF1 to sites that are epigenetically silent in Pax5-/- mouse Pre-B cells (Fig 3C) opens up other mechanisms that may contribute to the ability of a modified protein to cause transformation in a lineage-restricted manner. Both IKZF1 and RUNX1 are expressed in, and impact, blood cell development outside of the B-lymphoid compartment [38–42] (S5 Fig), and genetic alterations in early progenitors could therefore result in expression of the oncogenic protein in early multipotent progenitors as well as in other lineages. However, since the target gene spectra of the oncogenic proteins could be modified by a B-cell-restricted factor such as PAX5, their impact on cellular function may be lineage specific. Hence, the molecular context of the B-cell progenitors could possibly activate the oncogenic potential of mutated proteins by targeting them to epigenetically silenced target sites. In all, existing data create a strong link between the level of functional PAX5 activity and malignant transformation. We believe that increased understanding of the regulatory network coordinated by PAX5 will aid our understanding of how apparently modest disruptions in the regulatory circuitry can contribute to catastrophic events such as malignant transformation and, in the long term, open novel avenues for diagnosis and targeted treatments.

Materials and methods

Please see S1 Text for details.

Ethics statement. All work was done in line with the regulations defined by the Swedish national agency responsible for animal welfare, "Jordbruksverket". The ethical permit was granted by the Animal Ethics Committee at Linköpings Tingsrätt (approval number 28–14).

Animal models. Wt, Pax5+/- and Pax5-/- [8] mice were on a C57BL/6 (CD45.2) background.

Cells and cell culture. Primary fetal liver (FL) Pre-B cells were cultured in vitro on OP9 stroma cells using Opti-MEM (ThermoFisher Scientific, Waltham, MA) supplemented with 10% heat-inactivated fetal calf serum (FCS) as in [43]. The mouse Pre-B cell line 230–238 and human B-ALL NALM6 cells were cultured in RPMI1640 with UltraGlutamine (Lonza, Basel, Switzerland) supplemented with 10% heat-inactivated FCS, 20 mM HEPES, 50 μg/ml Gentamicin and 50 μM β-ME.

BioID assay. Human PAX5 cDNA as well as an SV40 NLS-encoding cDNA were ordered from GenScript (Piscataway, NJ) and sub-cloned into a retroviral pMIG vector carrying BIRA*. Retroviruses were produced and used to infect 230–238 mouse Pre-B cells. After infection, sorted GFP+ cells were pulsed with 50 μM d-biotin. Following biotinylation, cells were frozen at -80˚C before being resuspended in lysis buffer for sonication. Pull-down was achieved by incubating the cell lysate with streptavidin-sepharose beads, followed by trypsin digestion. Tryptic peptides were analyzed by reverse-phase nano liquid chromatography coupled online to an LTQ Orbitrap Velos Pro (ThermoFisher Scientific) mass spectrometer. Identification and quantitation were achieved using Proteome Discoverer (Thermo Scientific, version 1.3), the SEQUEST algorithm (Thermo Fisher Scientific, San Jose, CA, USA; version 1.4.0.288) and X! Tandem (CYCLONE (2010.12.01.1)). Data analysis was reconfirmed using Trans-Proteomic Pipeline (TPP) software [44] and the Prohits software suite generated valid interactions with a


SAINT score. A Bayesian FDR of 0.02 (corresponding to a SAINT score of ~0.80) was used as a cut-off to define high-confidence interactors. PXIs were analyzed by the PANTHER Overrepresentation Test (release 2018/10/10) (http://geneontology.org/), and enrichment analyses were run with the Gene Ontology database released on 2018/9/6. Differentially up- or downregulated genes were subject to GO-term analysis using The Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.8 (https://david.ncifcrf.gov/) [45, 46].

RNA-sequencing and data analysis from cultured transduced pre-B cells. RNA was prepared from primary mouse FL Pre-B cells from Wt and Pax5+/- mice transduced with retroviruses encoding a functionally impaired IKZF1 protein (IKZF1DN) or the ETV6-RUNX1 fusion protein. RNA-seq was performed as in [43]. Data were mapped to the reference genome mm10 and analyzed using the HOMER package. For details see the extended Materials and Methods (S1 Text).

Chromatin immunoprecipitation and ATAC-seq analysis. ChIP-seq and ATAC-seq analysis was performed essentially as in [43]. Briefly, ChIP was carried out using 10 μg per 10^7 cells of rabbit anti-Ikzf1 [ab26083, Abcam], anti-FLI1 polyclonal IgG [ab15289, Abcam], anti-Ebf1 [ABE1294, Millipore], anti-Pax5 [ab183575, Abcam], anti-RUNX1 polyclonal IgG [ab23980, Abcam], 10 μl of rabbit anti-H3K4Me3 polyclonal IgG [07–473, Millipore], or 25 μg of rabbit anti-H3K27Ac IgG [ab4729, Abcam]. NALM6 ChIP-seq peaks were called with a HOMER-adapted version of the IDR (Irreproducible Discovery Rate) package [47] (Karmel A. 2015. homer-idr: Second pass updated) according to https://sites.google.com/site/anshulkundaje/projects/idr. For details on ChIP-seq or ChIP-QPCR see the extended Materials and Methods (S1 Text).

Proximity Ligation-Assisted ChIP (PLAC)-sequencing. PLAC-seq was carried out and analyzed in duplicate, similar to what has previously been reported [15], with minor modifications.
Approximately 250M valid H3K4me3 interaction pairs and 310M valid H3K27ac interaction pairs were generated. Bias-corrected significant interactions (FDR ≤ 0.05) between anchor points and other anchor points/non-anchor points (peak-to-all) were identified with the FitHiChIP pipeline (https://www.biorxiv.org/content/early/2018/09/10/412833). Interactions shared between samples within one bin size (5 kb) were merged with a custom adaptation of merge2Dbed.pl from the HOMER platform [48]. Interaction anchor points were annotated against hg19 with a custom bash script utilizing the HOMER annotation database (annotatePeaks.pl). A custom bash script utilizing Bedtools intersect [49] was used to derive interactions overlapping transcription factor ChIP-seq peaks in either or both end-points. For details see the extended Materials and Methods (S1 Text).

RNA-seq analysis of primary human B-ALL. We extracted normalized RNA-seq data from two previously described patient cohorts [28] and part of the B-ALL phase II dataset [6] accessible in the TARGET repository. PAX5, RUNX1 and IKZF1 ChIP-seq peaks from NALM6 cells were annotated against hg19 with annotatePeaks.pl from the HOMER platform [48]. Peaks within 2.5 kb (upstream or downstream) of a TSS were assigned to the closest gene by proximity-based annotation. To assign distal TF peaks (more than 2.5 kb away from a TSS) to genes, the NALM6 H3K4me3 PLAC-seq interactions were utilized using a custom R script. TF sites further than 2.5 kb from a TSS that did not overlap with an interaction were discarded. Genes were considered co-bound by two transcription factors if peaks for both factors were annotated to the same gene, independent of their position in the gene. Bound genes for each category (PAX5, RUNX1, IKZF1, PAX5-RUNX1, PAX5-IKZF1) were tested for differential expression with DESeq2 [50] between mutated (PAX5, RUNX1 or IKZF1) and unmutated B-ALL cases (Fig 5A–5C) or double-mutated (PAX5-RUNX1 or PAX5-IKZF1) and PAX5 single-mutated B-ALL cases (Fig 5D and 5E).
Only genes with 10 or more counts in at least two samples were included in the analysis. Samples with missing mutation information were excluded from a given comparison. Means of log2 normalized RPKM for differentially expressed genes between


two conditions were plotted as boxplots and significance between categories was tested with the Mann-Whitney U-test.

Data availability. In line with the data availability policy, ChIP-, RNA-, ATAC- and PLAC-sequencing data generated for this paper are freely available and have been deposited in GEO under the accession numbers GSE126375 for murine data and GSE126300 for data on the human cell line NALM6.
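The group comparison described in the RNA-seq analysis above (one mean log2 RPKM value per sample, compared between two sample categories with a Mann-Whitney U-test) can be illustrated with a small standard-library sketch. It uses the normal approximation with tie correction and no continuity correction, which is an assumption of this example; the software and settings used for the published p-values are not stated in this section, and the input values below are made up.

```python
import math


def mann_whitney_u(x, y):
    """Two-sided Mann-Whitney U test using the normal approximation.

    Returns (U statistic for x, approximate p-value). Tied values receive
    average ranks and the variance is tie-corrected. Both samples are
    assumed to be non-empty.
    """
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    n1, n2, n = len(x), len(y), len(x) + len(y)
    rank_sum_x = 0.0
    tie_term = 0.0
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + 1 + j) / 2.0  # mean of the 1-based ranks i+1 .. j
        t = j - i
        tie_term += t ** 3 - t
        rank_sum_x += avg_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j
    u = rank_sum_x - n1 * (n1 + 1) / 2.0
    mean_u = n1 * n2 / 2.0
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))
    if var_u == 0:
        return u, 1.0
    z = (u - mean_u) / math.sqrt(var_u)
    p = math.erfc(abs(z) / math.sqrt(2.0))  # two-sided standard normal tail
    return u, p


# Made-up per-sample means of log2 RPKM for two hypothetical categories.
mutated = [1.8, 2.1, 2.4, 1.9]
unmutated = [2.9, 3.1, 2.6, 3.4, 3.0]
print(mann_whitney_u(mutated, unmutated))
```

For small groups an exact test would be preferable to the normal approximation; the sketch is only meant to make the rank-based logic of the comparison explicit.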

Supporting information

S1 Text. Extended supplementary materials and methods. This supplement contains detailed protocols for the methods used in this report. (DOCX)

S1 Table. Identification of PAX5 proximity interactors (PXIs) in Pre-B cells. The diagram displays proteins identified as PAX5 PXIs in 230–238 mouse B-ALL cells. The SAINT score and the enrichment of a given protein as compared to what was observed in cells transduced with an NLS-BIRA* are indicated. Proteins previously identified as PAX5 interacting factors [16] or proteins identified by STRING analysis (https://string-db.org) as direct or indirect partners (S1 Fig) are indicated. The table also displays GO terms defined using Gene Ontology (GO) analysis performed with PANTHER14.0. (XLSX)

S2 Table. PAX5, RUNX1 and IKZF1 display overlapping binding at putative regulatory elements annotated to defined genes. The table displays genes annotated to specific ChIP-seq peaks with unique or combined binding of transcription factors as shown in Fig 3A. Genes annotated to several categories were excluded from the analysis. (XLSX)

S3 Table. PAX5, RUNX1 and IKZF1 target genes involved in the regulation of cell proliferation and transcription. The table displays a GO analysis (DAVID v6.8 (https://david.ncifcrf.gov/)) [45, 46] of genes annotated to specific ChIP-seq peaks with unique or combined binding of transcription factors (S2 Table) as shown in Fig 3A. Genes annotated to several categories were excluded from the analysis. (XLSX)

S4 Table. PAX5, RUNX1 and IKZF1 display overlapping binding at putative regulatory elements annotated to defined genes. The table displays genes annotated to specific ChIP-seq peaks with unique or combined binding of transcription factors as shown in S4D Fig. Genes annotated to several categories were excluded from the analysis. (XLSX)

S5 Table. PAX5, FLI1 and IKZF1 target genes involved in the regulation of cell proliferation and transcription.
The table displays a GO analysis (DAVID v6.8 (https://david.ncifcrf.gov/)) [45, 46] of genes annotated to specific ChIP-seq peaks with unique or combined binding of transcription factors (S4 Table) as shown in S3D Fig. Genes annotated to several categories were excluded from the analysis. (XLSX)

S6 Table. Differential gene expression patterns upon expression of ETV6-RUNX1 or DN-IKZF1 in Wt or Pax5+/- mouse Pre-B cells. Differentially expressed genes between Wt and Pax5+/- Pro-B cells transduced with constructs encoding either TEL-AML or IKZF1-DN, compared to control vector (pMIG) transduced counterparts as indicated, were identified by


RNA-seq as described in materials and methods and extended methods. Log2 fold change, p-value and adjusted p-value (FDR) from the output file of the getDiffExpression.pl command (using edgeR as statistical tool), run in HOMER, are listed for each comparison in the form of a table listing all 24072 examined genes. (XLSX)

S7 Table. Differential gene expression patterns upon expression of ETV6-RUNX1 or DN-IKZF1 in normal or Pax5+/- cells. Differentially up- or downregulated genes defined in S6 Table were subject to GO-term analysis using The Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.8 (https://david.ncifcrf.gov/) [45, 46]. (XLSX)

S8 Table. A fraction of the PAX5, IKZF1 and/or RUNX1 target genes are differentially expressed in tumors carrying transcription factor mutations. The tables show the identity as well as the expression levels of transcription factor target genes in normal and malignant cells identified as up- or downregulated by DESeq2 [50] in correlation to mutation in the targeting transcription factor. Genes were identified as in Fig 5A–5E. (XLSX)

S9 Table. Identification of biological functions of significantly differentially expressed target genes in primary human leukemia. Differentially up- or downregulated transcription factor target genes identified in Fig 5 were subject to GO-term and KEGG pathway analysis using The Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7 (https://david.ncifcrf.gov/) [45, 46]. (XLSX)

S1 Fig. STRING analysis identifies multiple PXIs as potential partners for PAX5. The figure displays a network map generated by analysis of the PAX5 interactome using Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) (https://string-db.org). Colored nodes indicate the query protein (PAX5, red) and the first shell of interactors, while the light grey nodes indicate the second shell of interactors. Edges represent protein-protein associations.
The minimum required interaction score was 0.9. (PDF)

S2 Fig. PAX5 is associated with a large variety of transcriptional regulators in pre-B cells. The diagram displays a functional enrichment analysis that identified overrepresented protein classes in the dataset by Gene Ontology (GO) analysis performed with PANTHER14.0. (PDF)

S3 Fig. Mutations in PAX5 and PXIs are enriched in lymphoid leukemias. The diagram displays the fraction of tumors of different histological origin with reported PAX5 PXI mutations vs the total number of reported tumors per type. The analysis was based on a list of 239 PAX5 PXIs that were investigated in the public cancer database COSMIC (Cancer Gene Census v76) obtained from the Sanger Institute Catalogue Of Somatic Mutations In Cancer web site (http://cancer.sanger.ac.uk/cosmic) [14]. Only entries classified as hematopoietic and lymphoid tissue are considered. (PDF)

S4 Fig. PAX5, IKZF1 and RUNX1 share binding to multiple sites in the mouse Pre-B cell genome. (A) Diagrams displaying Q-PCR data for co-precipitations of the Igll1 and Cd79a/Mb-1 promoters after ChIP of EBF1 (n = 7), PAX5 (n = 3), FLI1 (n = 4), IKZF1 (n = 4), and CBFβ (n = 3) from 230–238 Pre-B cells. The diagrams are based on the relative enrichment as


compared to a polyclonal IgG. Statistical analysis was based on Student’s t-tests (two-tailed). ***: p < 0.0005. (B) Peak files from ChIP-seq data of PAX5, IKZF1 and RUNX1 in 230–238 cells (Fig 3A) were analyzed for motif enrichment using findMotifsGenome.pl in HOMER (mm10 -size 200). Rank, enriched motif, P-value, % of target (T) and background (Bg) and best match to known motifs are listed for the top 3 motifs, plus the PAX5, EBF1 and RUNX1 motifs when present, for each peak set. (C) Diagrams displaying Q-PCR data for co-precipitations of the Igll1 promoter after ChIP experiments in Wt or Pax5-/- pro-B cells. EBF1 (n = 5), PAX5 (n = 3), FLI1 (n = 3), IKZF1 (n = 5), and CBFβ (n = 3). P-values were based on Student’s t-test (two-tailed). ***: p < 0.0005, **: p < 0.005, *: p < 0.05. RQ: relative quantity. Panel (D) displays a Venn diagram of ChIP-seq peaks based on PAX5, IKZF1 and FLI1 ChIP-seq analysis using the 230–238 Pre-B cell line. Peaks were called using the HOMER platform (findPeaks -style factor) and the resulting files were filtered for peaks with ≥15 normalized tags. Overlapping peaks were identified using the mergePeaks command in HOMER. (PDF)

S5 Fig. Combined binding of PAX5, IKZF1 and RUNX1 is associated with high levels of gene expression in hematopoietic cells. The schematic diagrams in panels (A-D) display Gene Expression Commons generated representations of expression patterns for transcription factors or genes annotated to transcription factor binding in S2 Table (A-C) and S4 Table (D). Gene lists were uploaded as csv files and analysis was performed using the Gene-set activity function. Expression levels are indicated with heatmaps ranging from dark blue (low expression) to dark brown (high expression) as indicated. (PDF)

S6 Fig. PAX5, EBF1, IKZF1 and RUNX1 share binding to multiple sites in the human pre-B cell genome.
Peak files from ChIP-seq data of PAX5, EBF1, IKZF1 and RUNX1 in NALM6 cells (Fig 4A) were analyzed for motif enrichment using findMotifsGenome.pl in HOMER (hg19 -size 200). Rank, enriched motif, P-value, % of target (T) and background (Bg) and best match to known motifs are listed for the top 3 motifs, plus the PAX5, EBF1 and RUNX1 motifs when present, for each peak set. (PDF)

S7 Fig. PAX5, EBF1, IKZF1 and RUNX1 are part of an intricate network of regulatory loops. Panels (A-C) display WashU Genome Browser tracks of the EBF1, IKZF1 and RUNX1 genes in NALM6 B-leukemic cells. ChIP-seq data of EBF1, PAX5, RUNX1 and IKZF1 were linked to H3K4me3 and H3K27ac chromatin marks and ATAC-seq as well as H3K4me3- and H3K27ac-anchored PLAC-seq interactions on the EBF1 (A), IKZF1 (B) and RUNX1 (C) genes. The data are visualized in the WashU Genome Browser. (PDF)

S8 Fig. PAX5, EBF1, RUNX1 and IKZF1 distal interactions are common in human B-leukemic cells. Panels (A, B) show diagrams displaying the abundance of combined PAX5, EBF1, RUNX1 and/or IKZF1 binding at identified PLAC-seq anchor points. The top 40 combinations out of 251 possible for H3K27ac (A) and 252 for anchor-point bound H3K4me3 (B) PLAC-seq interactions are visualized as UpSet plots. Interactions were filtered for PAX5, EBF1, RUNX1 and/or IKZF1 overlapping ChIP-seq peaks in either one or both anchor points (for details, see Methods). UpSet plots describe how the combination of TFs in one anchor point (ap) relates to the combination of TFs in the other. Panels (C, D) show chord diagrams displaying distal-to-TSS (transcription start site) anchored PAX5-EBF1-RUNX1-IKZF1 gene networks in NALM6 cells. (C) H3K4me3- or (D) H3K27ac-anchored PLAC-seq was used to define chromatin interactions in NALM6 cells. Interactions were filtered for PAX5, EBF1, RUNX1 and/or IKZF1 overlapping ChIP-seq peaks in either one or both anchor points. Anchor points were


defined as either distal (more than 2.5 kb away from a TSS) or as TSS-anchored (within 2.5 kb of a TSS). The chord diagrams were visualized with the circlize package in R. (E) ECDF plot displaying the association of PAX5 binding in NALM6 cells and gene expression changes upon introduction of PAX5 in PAX5-mutated REH cells (GSE57480). The following gene categories are shown: the red line describes genes identified as downregulated in patient samples carrying a mutated PAX5 gene and targeted for PAX5 binding in NALM6 cells (Fig 5A, S8 Table). The blue line shows all genes with PAX5 binding in NALM6 independent of their gene expression status in primary human B-ALL samples (genes in the differentially expressed (red) category were excluded). The black line describes the expression changes of PAX5-unbound genes upon PAX5 reintroduction. Kolmogorov-Smirnov p-values are shown. (PDF)

Author Contributions

Conceptualization: Kazuki Okuyama, Tobias Strid, Susana Cristobal, Thoas Fioretos, Jonas Ungerbäck, Mikael Sigvardsson.

Data curation: Tobias Strid, Jacob Kuruvilla, Stefan Lang, Jonas Ungerbäck.

Formal analysis: Tobias Strid, Jacob Kuruvilla, Rajesh Somasundaram, Jonas Ungerbäck, Mikael Sigvardsson.

Funding acquisition: Kazuki Okuyama, Tobias Strid, Mikael Sigvardsson.

Investigation: Kazuki Okuyama, Tobias Strid, Jacob Kuruvilla, Rajesh Somasundaram, Mahadesh Prasad, Henrik Lilljebjörn, Stefan Lang, Jonas Ungerbäck.

Methodology: Kazuki Okuyama, Tobias Strid, Jacob Kuruvilla, Susana Cristobal, Emma Smith, Mahadesh Prasad, Jonas Ungerbäck.

Project administration: Mikael Sigvardsson.

Resources: Emma Smith, Thoas Fioretos, Henrik Lilljebjörn.

Software: Shamit Soneji, Stefan Lang, Jonas Ungerbäck.

Supervision: Thoas Fioretos, Shamit Soneji, Mikael Sigvardsson.

Validation: Kazuki Okuyama, Jacob Kuruvilla, Rajesh Somasundaram, Jonas Ungerbäck, Mikael Sigvardsson.

Visualization: Kazuki Okuyama, Tobias Strid, Jacob Kuruvilla, Jonas Ungerbäck, Mikael Sigvardsson.

Writing – original draft: Kazuki Okuyama, Tobias Strid, Jacob Kuruvilla, Jonas Ungerbäck, Mikael Sigvardsson.

Writing – review & editing: Kazuki Okuyama, Tobias Strid, Jacob Kuruvilla, Rajesh Somasundaram, Henrik Lilljebjörn, Jonas Ungerbäck, Mikael Sigvardsson.

References

1. Somasundaram R, Prasad MA, Ungerback J, Sigvardsson M. Transcription factor networks in B-cell differentiation link development to acute lymphoid leukemia. Blood. 2015; 126(2):144–52. https://doi.org/10.1182/blood-2014-12-575688 PMID: 25990863
2. Smeenk L, Fischer M, Jurado S, Jaritz M, Azaryan A, Werner B, et al. Molecular role of the PAX5-ETV6 oncoprotein in promoting B-cell acute lymphoblastic leukemia. EMBO J. 2017; 36(6):718–35. https://doi.org/10.15252/embj.201695495 PMID: 28219927


3. Coyaud E, Struski S, Prade N, Familiades J, Eichner R, Quelen C, et al. Wide diversity of PAX5 alterations in B-ALL: a Groupe Francophone de Cytogenetique Hematologique study. Blood. 2010; 115(15):3089–97. https://doi.org/10.1182/blood-2009-07-234229 PMID: 20160164
4. Mullighan CG, Goorha S, Radtke I, Miller CB, Coustan-Smith E, Dalton JD, et al. Genome-wide analysis of genetic alterations in acute lymphoblastic leukaemia. Nature. 2007; 446(7137):758–64. https://doi.org/10.1038/nature05690 PMID: 17344859
5. Kuiper RP, Schoenmakers EF, van Reijmersdal SV, Hehir-Kwa JY, van Kessel AG, van Leeuwen FN, et al. High-resolution genomic profiling of childhood ALL reveals novel recurrent genetic lesions affecting pathways involved in lymphocyte differentiation and cell cycle progression. Leukemia. 2007; 21(6):1258–66. https://doi.org/10.1038/sj.leu.2404691 PMID: 17443227
6. Gu Z, Churchman ML, Roberts KG, Moore I, Zhou X, Nakitandwe J, et al. PAX5-driven subtypes of B-progenitor acute lymphoblastic leukemia. Nat Genet. 2019.
7. Shah S, Schrader KA, Waanders E, Timms AE, Vijai J, Miething C, et al. A recurrent germline PAX5 mutation confers susceptibility to pre-B cell acute lymphoblastic leukemia. Nat Genet. 2013; 45(10):1226–31. https://doi.org/10.1038/ng.2754 PMID: 24013638
8. Urbánek P, Wang Z-Q, Fetka I, Wagner EF, Busslinger M. Complete block of early B cell differentiation and altered patterning of the posterior midbrain in mice lacking Pax5/BSAP. Cell. 1994; 79:901–12. https://doi.org/10.1016/0092-8674(94)90079-5 PMID: 8001127
9. Ahsberg J, Ungerback J, Strid T, Welinder E, Stjernberg J, Larsson M, et al. Early B-cell Factor 1 regulates the expansion of B-cell progenitors in a dose dependent manner. J Biol Chem. 2013.
10. Prasad MA, Ungerback J, Ahsberg J, Somasundaram R, Strid T, Larsson M, et al. Ebf1 heterozygosity results in increased DNA damage in pro-B cells and their synergistic transformation by Pax5 haploinsufficiency. Blood. 2015; 125(26):4052–9. https://doi.org/10.1182/blood-2014-12-617282 PMID: 25838350
11. Heltemes-Harris LM, Willette MJ, Ramsey LB, Qiu YH, Neeley ES, Zhang N, et al. Ebf1 or Pax5 haploinsufficiency synergizes with STAT5 activation to initiate acute lymphoblastic leukemia. J Exp Med. 2011; 208(6):1135–49. https://doi.org/10.1084/jem.20101947 PMID: 21606506
12. Cronan JE. Targeted and proximity-dependent promiscuous protein biotinylation by a mutant Escherichia coli biotin protein ligase. The Journal of nutritional biochemistry. 2005; 16(7):416–8. https://doi.org/10.1016/j.jnutbio.2005.03.017 PMID: 15992681
13. Roux KJ, Kim DI, Raida M, Burke B. A promiscuous biotin ligase fusion protein identifies proximal and interacting proteins in mammalian cells. J Cell Biol. 2012; 196(6):801–10. https://doi.org/10.1083/jcb.201112098 PMID: 22412018
14. Bamford S, Dawson E, Forbes S, Clements J, Pettett R, Dogan A, et al. The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website. Br J Cancer. 2004; 91(2):355–8. https://doi.org/10.1038/sj.bjc.6601894 PMID: 15188009
15. Fang R, Yu M, Li G, Chee S, Liu T, Schmitt AD, et al. Mapping of long-range chromatin interactions by proximity ligation-assisted ChIP-seq. Cell Res. 2016; 26(12):1345–8. https://doi.org/10.1038/cr.2016.137 PMID: 27886167
16. McManus S, Ebert A, Salvagiotto G, Medvedovic J, Sun Q, Tamir I, et al. The transcription factor Pax5 regulates its target genes by recruiting chromatin-modifying proteins in committed B cells. EMBO J. 2011; 30(12):2388–404. https://doi.org/10.1038/emboj.2011.140 PMID: 21552207
17. Gao H, Lukin K, Ramirez J, Fields S, Lopez D, Hagman J. Opposing effects of SWI/SNF and Mi-2/NuRD chromatin remodeling complexes on epigenetic reprogramming by EBF and Pax5. Proc Natl Acad Sci U S A. 2009; 106(27):11258–63. https://doi.org/10.1073/pnas.0809485106 PMID: 19549820
18. Uchida H, Downing JR, Miyazaki Y, Frank R, Zhang J, Nimer SD. Three distinct domains in TEL-AML1 are required for transcriptional repression of the IL-3 promoter. Oncogene. 1999; 18(4):1015–22. https://doi.org/10.1038/sj.onc.1202383 PMID: 10023677
19. Hiebert SW, Sun W, Davis JN, Golub T, Shurtleff S, Buijs A, et al. The t(12;21) translocation converts AML-1B from an activator to a repressor of transcription. Mol Cell Biol. 1996; 16(4):1349–55. https://doi.org/10.1128/mcb.16.4.1349 PMID: 8657108
20. Morrow M, Samanta A, Kioussis D, Brady HJ, Williams O. TEL-AML1 preleukemic activity requires the DNA binding domain of AML1 and the dimerization and corepressor binding domains of TEL. Oncogene. 2007; 26(30):4404–14. https://doi.org/10.1038/sj.onc.1210227 PMID: 17237815
21. Schwickert TA, Tagoh H, Gultekin S, Dakic A, Axelsson E, Minnich M, et al. Stage-specific control of early B cell development by the transcription factor Ikaros. Nat Immunol. 2014; 15(3):283–93. https://doi.org/10.1038/ni.2828 PMID: 24509509
22. Li Y, Luo H, Liu T, Zacksenhaus E, Ben-David Y. The ets transcription factor Fli-1 in development, cancer and disease. Oncogene. 2015; 34(16):2022–31. https://doi.org/10.1038/onc.2014.162 PMID: 24909161

23. Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods. 2013; 10(12):1213–8. https://doi.org/10.1038/nmeth.2688 PMID: 24097267
24. Mullighan CG, Miller CB, Radtke I, Phillips LA, Dalton J, Ma J, et al. BCR-ABL1 lymphoblastic leukaemia is characterized by the deletion of Ikaros. Nature. 2008; 453(7191):110–4. https://doi.org/10.1038/nature06866 PMID: 18408710
25. Churchman ML, Low J, Qu C, Paietta EM, Kasper LH, Chang Y, et al. Efficacy of Retinoids in IKZF1-Mutated BCR-ABL1 Acute Lymphoblastic Leukemia. Cancer Cell. 2015; 28(3):343–56. https://doi.org/10.1016/j.ccell.2015.07.016 PMID: 26321221
26. Schneider R, Bannister AJ, Myers FA, Thorne AW, Crane-Robinson C, Kouzarides T. Histone H3 lysine 4 methylation patterns in higher eukaryotic genes. Nat Cell Biol. 2004; 6(1):73–7. https://doi.org/10.1038/ncb1076 PMID: 14661024
27. Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, et al. High-resolution profiling of histone methylations in the human genome. Cell. 2007; 129(4):823–37. https://doi.org/10.1016/j.cell.2007.05.009 PMID: 17512414
28. Lilljebjorn H, Henningsson R, Hyrenius-Wittsten A, Olsson L, Orsmark-Pietras C, von Palffy S, et al. Identification of ETV6-RUNX1-like and DUX4-rearranged subtypes in paediatric B-cell precursor acute lymphoblastic leukaemia. Nature communications. 2016; 7:11790. https://doi.org/10.1038/ncomms11790 PMID: 27265895
29. Liu GJ, Cimmino L, Jude JG, Hu Y, Witkowski MT, McKenzie MD, et al. Pax5 loss imposes a reversible differentiation block in B-progenitor acute lymphoblastic leukemia. Genes Dev. 2014; 28(12):1337–50. https://doi.org/10.1101/gad.240416.114 PMID: 24939936
30. Iida S, Rao PH, Nallasivam P, Hibshoosh H, Butler M, Louie DC, et al. The t(9;14)(p13;q32) chromosomal translocation associated with lymphoplasmacytoid lymphoma involves the PAX-5 gene. Blood. 1996; 88(11):4110–7. PMID: 8943844
31. Chan LN, Chen Z, Braas D, Lee JW, Xiao G, Geng H, et al. Metabolic gatekeeper function of B-lymphoid transcription factors. Nature. 2017; 542(7642):479–83. https://doi.org/10.1038/nature21076 PMID: 28192788
32. Somasundaram R, Ahsberg J, Okuyama K, Ungerback J, Lilljebjorn H, Fioretos T, et al. Clonal conversion of B lymphoid leukemia reveals cross-lineage transfer of malignant states. Genes Dev. 2016; 30(22):2486–99. https://doi.org/10.1101/gad.285536.116 PMID: 27913602
33. Simmons S, Knoll M, Drewell C, Wolf I, Mollenkopf HJ, Bouquet C, et al. Biphenotypic B-lymphoid/myeloid cells expressing low levels of Pax5: potential targets of BAL development. Blood. 2012; 120(18):3688–98. https://doi.org/10.1182/blood-2012-03-414821 PMID: 22927250
34. Jacoby E, Nguyen SM, Fountaine TJ, Welp K, Gryder B, Qin H, et al. CD19 CAR immune pressure induces B-precursor acute lymphoblastic leukaemia lineage switch exposing inherent leukaemic plasticity. Nature communications. 2016; 7:12320. https://doi.org/10.1038/ncomms12320 PMID: 27460500
35. Mullighan CG, Collins-Underwood JR, Phillips LA, Loudin MG, Liu W, Zhang J, et al. Rearrangement of CRLF2 in B-progenitor- and Down syndrome-associated acute lymphoblastic leukemia. Nat Genet. 2009; 41(11):1243–6. https://doi.org/10.1038/ng.469 PMID: 19838194
36. Mullighan CG, Zhang J, Harvey RC, Collins-Underwood JR, Schulman BA, Phillips LA, et al. JAK mutations in high-risk childhood acute lymphoblastic leukemia. Proc Natl Acad Sci U S A. 2009; 106(23):9414–8. https://doi.org/10.1073/pnas.0811761106 PMID: 19470474
37. Corces-Zimmerman MR, Majeti R. Pre-leukemic evolution of hematopoietic stem cells: the importance of early mutations in leukemogenesis. Leukemia. 2014; 28(12):2276–82. https://doi.org/10.1038/leu.2014.211 PMID: 25005245
38. Nichogiannopoulou A, Trevisan M, Neben S, Friedrich C, Georgopoulos K. Defects in hemopoietic stem cell activity in Ikaros mutant mice. J Exp Med. 1999; 190:1201–13. https://doi.org/10.1084/jem.190.9.1201 PMID: 10544193
39. Ng SY, Yoshida T, Zhang J, Georgopoulos K. Genome-wide lineage-specific transcriptional networks underscore Ikaros-dependent lymphoid priming in hematopoietic stem cells. Immunity. 2009; 30(4):493–507. https://doi.org/10.1016/j.immuni.2009.01.014 PMID: 19345118
40. Klug CA, Morrison SJ, Masek M, Hahm K, Smale ST, Weissman IL. Hematopoietic stem cells and lymphoid progenitors express different Ikaros isoforms, and Ikaros is localized to heterochromatin in immature lymphocytes. Proc Natl Acad Sci U S A. 1998; 95(2):657–62. https://doi.org/10.1073/pnas.95.2.657 PMID: 9435248
41. Okuda T, van Deursen J, Hiebert SW, Grosveld G, Downing JR. AML1, the target of multiple chromosomal translocations in human leukemia, is essential for normal fetal liver hematopoiesis. Cell. 1996; 84(2):321–30. https://doi.org/10.1016/s0092-8674(00)80986-1 PMID: 8565077

42. Wang Q, Stacy T, Binder M, Marin-Padilla M, Sharpe AH, Speck NA. Disruption of the Cbfa2 gene causes necrosis and hemorrhaging in the central nervous system and blocks definitive hematopoiesis. Proc Natl Acad Sci U S A. 1996; 93(8):3444–9. https://doi.org/10.1073/pnas.93.8.3444 PMID: 8622955
43. Jensen CT, Ahsberg J, Sommarin MNE, Strid T, Somasundaram R, Okuyama K, et al. Dissection of progenitor compartments resolves developmental trajectories in B-lymphopoiesis. J Exp Med. 2018.
44. Pedrioli PG. Trans-proteomic pipeline: a pipeline for proteomic analysis. Methods Mol Biol. 2010; 604:213–38. https://doi.org/10.1007/978-1-60761-444-9_15 PMID: 20013374
45. Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature protocols. 2009; 4(1):44–57. https://doi.org/10.1038/nprot.2008.211 PMID: 19131956
46. Huang da W, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009; 37(1):1–13. https://doi.org/10.1093/nar/gkn923 PMID: 19033363
47. Li Q, Brown JB, Huang H, Bickel PJ. Measuring reproducibility of high-throughput experiments. Ann Appl Stat. 2011; 5(5):1752–79.
48. Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, et al. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010; 38(4):576–89. https://doi.org/10.1016/j.molcel.2010.05.004 PMID: 20513432
49. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010; 26(6):841–2. https://doi.org/10.1093/bioinformatics/btq033 PMID: 20110278
50. Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome biology. 2014; 15(12):550. https://doi.org/10.1186/s13059-014-0550-8 PMID: 25516281
