Published OnlineFirst October 14, 2016; DOI: 10.1158/1078-0432.CCR-16-1125
Statistics in CCR Clinical Cancer Research Toxicity and Efficacy Probability Interval Design for Phase I Adoptive Cell Therapy Dose-Finding Clinical Trials Daniel H. Li1, James B. Whitmore2, Wentian Guo3, and Yuan Ji4
Abstract
Recent trials of adoptive cell therapy (ACT), such as the rates efficacy outcomes to inform dosing decisions to optimize chimeric antigen receptor (CAR) T-cell therapy, have demon- efficacy and safety simultaneously. Rather than finding the strated promising therapeutic effects for cancer patients. A main maximum tolerated dose (MTD), the TEPI design is aimed at issue in the product development is to determine the finding the dose with the most desirable outcome for safety and appropriate dose of ACT. Traditional phase I trial designs for efficacy. The key features of TEPI are its simplicity, flexibility, cytotoxic agents explicitly assume that toxicity increases mono- and transparency, because all decision rules can be prespecified tonically with dose levels and implicitly assume the same for prior to trial initiation. We conduct simulation studies to efficacy to justify dose escalation. ACT usually induces rapid investigate the operating characteristics of the TEPI design responses, and the monotonic dose–response assumption is and compare it to existing methods. In summary, the TEPI unlikely to hold due to its immunobiologic activities. We design is a novel method for ACT dose finding, which possesses propose a toxicity and efficacy probability interval (TEPI) superior performance and is easy to use, simple, and transparent. design for dose finding in ACT trials. This approach incorpo- Clin Cancer Res; 23(1); 13–20. 2016 AACR.
Introduction prespecified target rate, say 30%. The commonly used methods are the rule-based designs, such as the 3 þ 3 design (1), and the In the past few years, promising antitumor effects have been model-based designs, such as the modified toxicity probability seen in patients with late-stage cancer when they were treated interval (mTPI) design (2–4) and the continual reassessment with adoptive cell therapy (ACT), including chimeric antigen method (CRM; ref. 5). receptor (CAR) T cells, T-cell receptors (TCR), and tumor-infil- Traditional designs considering the DLT data implicitly assume trating lymphocytes (TIL). Early data continue to warrant the a monotonically increasing relationship between dose and expedited development of these therapies as a treatment option response efficacy; otherwise, there is no justification to escalate for patients with various cancers. ACT is a highly personalized the dose level when it is safe. Monotone efficacy may be a cancer therapy that harnesses a patient's own immune cells reasonable assumption for cytotoxic agents. However, this (specifically, T lymphocytes) to recognize cancer-specific abnor- assumption may not hold for ACTs. In ACT trials, the MTD is malities, enabling them to target and attack malignant cells not always optimal, as clinical response correlates less with dose throughout the body. However, challenging questions remain level. For example, recent phase I trials of CAR T cells targeting regarding the immunobiology and development of these person- CD19 suggest that a range of dose levels is generally safe alized therapies, such as the design of an appropriate phase I dose- and effective, and there is no clear correlation between T-cell escalation study that takes into consideration the unique prop- dose and clinical response (6). In particular, two patients in the erties of ACTs. CAR T-cell trials reported in refs. 7 and 8 exhibited inferior Traditional phase I dose-finding oncology trials aim to identify efficacy outcomes to another patient despite receiving a 10- the maximum tolerated dose (MTD), which is the highest dose fold higher T-cell dose. A TIL study for metastatic cancer (9) that has a dose-limiting toxicity (DLT) rate less than or close to a found no correlation between the number of cells administered and the likelihood of a clinical response. Therefore, traditional algorithmic and model-based designs based on binary mea- 1 2 Juno Therapeutics, Seattle, Washington. Juno Therapeutics, Seattle, Washing- sures of toxicity may be inappropriate for identifying the 3 4 ton. Laiya Consulting, Inc., Wilmette, Illinois. NorthShore University Health- optimal dose of ACTs. Given the complexity of an ACT product, System, Chicago, Illinois. preliminary dose exploration should aim to capture effective Note: Supplementary data for this article are available at Clinical Cancer biologic activity rather than dose-limiting toxicity alone (10). Research Online (http://clincancerres.aacrjournals.org/). Emerging data suggest that CAR T-cell therapy in B-cell hema- Corresponding Author: Yuan Ji, NorthShore University HealthSystem/ tologic malignancies may induce rapid responses. Toxicity and The University of Chicago, 1001 University Place, Evanston, IL 60201. Phone: efficacy of a biomarker (e.g., cell expansion) may, therefore, be 224-364-7312; Fax: 224-364-7319; E-mail: [email protected] measured in the same time frame (6, 11, 12). Model-based doi: 10.1158/1078-0432.CCR-16-1125 methods have been developed to model toxicity and efficacy data 2016 American Association for Cancer Research. jointly in order to determine the acceptable dose level (13); these
www.aacrjournals.org 13
Downloaded from clincancerres.aacrjournals.org on September 27, 2021. © 2017 American Association for Cancer Research. Published OnlineFirst October 14, 2016; DOI: 10.1158/1078-0432.CCR-16-1125
Li et al.
methods are powerful and effective, although they may require be treated at the next higher dose level. In order to formulate substantial investigation from trial statisticians to ensure proper this table, it is required to have determined: (i) the maximum implementation and calibration. To this end, we developed a tolerated toxicity rate, pT, and (ii) the minimum acceptable practical dose-finding design for an ACT that (i) incorporates efficacy rate, qE, at which the clinician is willing to treat future both toxicity and efficacy data; (ii) provides the same adaptive patients at the current dose level. feature as model-based designs; and (iii) most importantly, is Table 1 is elicited for an ACT trial based on pT ¼ 0:4, qE ¼ 0:2, transparent to clinicians and as simple to implement as the 3 þ 3 historical data for the ACT, and clinician input. For other trials, the and mTPI designs. We propose a toxicity and efficacy probability preset table can be modified accordingly. A more detailed descrip- interval (TEPI) design, which is based on a clinician-elicited tion of Table 1 is given in Supplementary Material A. decision table in terms of efficacy and toxicity probability inter- vals. This design is motivated for the conduct of a phase I CD19- Dose-finding algorithm targeted CAR T-cell therapy dose-finding trial. Building upon the preset table, we set up a "local" decision- theoretic framework and derive a Bayes rule. Here, local means Materials and Methods that the framework focuses on the optimal decision to be made for the current dose instead of the trial. We show that the Bayes rule is Elicited decision table equivalent to computing the joint unit probability mass (JUPM) Consider d ascending doses in a single-agent ACT phase I trial. for the toxicity and efficacy probability intervals. For a given Due to ethical considerations, it is always assumed that the region A, the JUPM is defined as the ratio between the probability toxicity probability p increases with dose level i. However, the i of the region and the size of the region (2, 14). Considering the efficacy probability q may increase initially and then reach a i two-dimensional unit square ð0; 1Þ ð0; 1Þ in the real space, the plateau from which minimal improvement or even decreasing JUPM for each interval combination ða; bÞ ðc; dÞ is efficacy may be seen with increasing dose. For this reason, we assume that qi is not monotone with i,andthatpi and qi are ; ; ; ðc;dÞ Prfg pi 2ða bÞ qi 2ðc dÞjD ; < < < ; < < < independent. Suppose that dose i is currently used in the trial JUPM ; 0 a b 1 0 c d 1 ða bÞ ðb aÞ ðd cÞ and ni patients have already been allocated to dose i,withxi and yi patients experiencing toxicity and efficacy outcomes. Aggre- Here, the numerator, Prfpi 2ða; bÞ; qi 2ðc; dÞjDg,isthe gating across all the doses, the trial data are denoted as posterior probability of pi and qi falling in the interval (a, b) D ¼fðni; xi; yiÞ; i ¼ 1; ...; dg. and (c, d), respectively. Similar to mTPI, we partition the unit intervals (0, 1) for pi and Assume the prior for each pi follows an independent qi into subintervals. Denoting (a, b)and(c, d) a subinterval in the betaðap; bpÞ, and the prior for each qi follows an independent partition for pi and qi, respectively. The interval combination betaðaq; bqÞ, where betaða; bÞ denotes a beta distribution with a; b c; d fi ð Þ ð Þ forms the basis for dose- nding decisions, with mean a=ða þ bÞ. The rationale of using independent priors fol- each combination corresponding to a specific decision, such as lows the same argument in (2). Assume xijpi : Binðni; piÞ and dose escalation or de-escalation. A dose-finding decision table yijqi : Binðni; qiÞ, where Binðn; qÞ denotes a binomial distribution can then be elicited with clinicians for all interval combinations. with n trials and q probability of success. Then, the likelihood An example of such a two-dimensional table is given in Table 1. function for the observed toxicity data ðxi; niÞ; i ¼ 1; ::: ; d,isa fi Q We call this the "preset table" for the TEPI design, which is xed d xi ni xi product of binomial densities lðpÞ¼ p ð1 piÞ , and and elicited prior to the trial. As illustrated in Table 1, there are i¼1 i the likelihood function for the efficacy data ðyi; niÞ; i ¼ 1; :::; d four subintervals for toxicity (rows) and efficacy (columns), the Q l q d qxi 1 q ni xi intersection of which forms 16 interval combinations. Each of is a product of binomial densities ð Þ¼ i¼1 i ð iÞ . the 16 combinations corresponds to a specific dosing decision. Thus, the posterior distributions for pi and qi are Decision "E" denotes escalation (i.e., treating the next cohort of betaðap þ xi; bp þ ni xiÞ and betaðaq þ yi; bq þ ni yiÞ, respec- patients at the next higher dose). Decision "S" denotes staying at tively. Based on the posterior distributions, there exists a "win- the current dose for treating the next cohort of patients. Decision ning" interval combination ða ; b Þ ðc ; d Þ that achieves the "D" denotes de-escalation (i.e., treating the next cohort of maximum JUPM among all the combinations in Table 1, and patients at the next lower dose). These reflect practical clinical the corresponding decision for that combination is selected for actions when the particular combination of toxicity and efficacy treating the next cohort of patients. It can be shown that the data are observed at a certain dose level. For example, Table 1 decision is the Bayes rule under a balanced loss function shows that the interval combination ð0; 0:15Þ ð0; 0:2Þ for pi and (Supplementary Material B). qi corresponds to an action of "E"—escalation. This means that if The basic dose-finding concept of TEPI is as follows. Assume the observed toxicity rate for a dose falls in (0, 0.15) and the that the current patient cohort is treated at dose i. After the current observed efficacy rate is in (0, 0.2), the next patient cohort will cohort completes DLT and response evaluation, compute the
Table 1. An example of a probability decision table based on pT ¼ 0:4 and qE ¼ 0:2 Efficacy rate Low Moderate High Superb (0, 0.2) (0.2, 0.4) (0.4, 0.6) (0.6, 1) Toxicity rate Low (0, 0.15) E E E E Moderate (0.15, 0.33) E E E S High (0.33, 0.4) D S S S Unacceptable (0.4, 0.1) D D D D NOTE: "E," "S," and "D" denote escalation, stay, and de-escalation, respectively.
14 Clin Cancer Res; 23(1) January 1, 2017 Clinical Cancer Research
Downloaded from clincancerres.aacrjournals.org on September 27, 2021. © 2017 American Association for Cancer Research. Published OnlineFirst October 14, 2016; DOI: 10.1158/1078-0432.CCR-16-1125
Tox–Eff Interval Dose-Finding Design
Table 2. Dose-finding table of the TEPI design Number of responders Number of patients treated at current dose level Number of DLTs 01–3 30 EE 1DS 2DD 3DUT DUT 01–45–6 60 EUE E 1EUES 2–3DUE DS 4DUE DD 5–6DUT DUT DUT 01 2–67–9 90–1EUEEE 2EUEES 3–4DUE DS S 5–6DUE DD D 7–9DUT DUT DUT DUT 0–12 3–78–12 12 0–1EUEEE 2EUEES 3–5DUE DS S 6DUE DD D 7–12 DUT DUT DUT DUT 0–12 3–910–15 15 0–2EUEEE 3–4EUEES 5–7DUE DS S 8–9DUE DD D 10–15 DUT DUT DUT DUT
Number of responders Number of patients treated at current dose level Number of DLTs 0–23 4–11 12–18 18 0–2EUEEE 3–5EUEES 6–9DUE DS S 10 DUE DD D 11–18 DUT DUT DUT DUT 0–23 4–13 14–21 21 0–2EUEEE 3–6EUEES 7–10 DUE DS S 11–12 DUE DD D 13–21 DUT DUT DUT DUT 0–34 5–15 16–24 24 0–3EUEEE 4–6EUEES 7–12 DUE DS S 13 DUE DD D 16–24 DUT DUT DUT DUT 0–34–56–17 18–27 27 0–3EUEEE 4–7EUEES 8–13 DUE DS S 14 DUE DD D 16–24 DUT DUT DUT DUT NOTE: The letters are computed based on the preset decisions in Table 1 and represent different dose-finding actions during the trial based on the observed toxicity and efficacy data. Decisions "D," "S," and "E" correspond to actions of de-escalate, stay (retain), and escalate the dose level, respectively. Decisions "DUT" and "DUE" correspond to actions of de-escalate the dose level due to high toxicity or low efficacy, respectively, and to mark the current dose unacceptable for future use. Decision "EU" is to escalate the dose level and mark the current dose unacceptable for future use.
JUPMs for all the interval combinations in Table 1. The TEPI comes as binomial counts, for any toxicity and efficacy counts that design recommends "E," "S," or "D" corresponding to the com- can be observed in the trial, the TEPI dose-finding decisions can be bination with the largest JUPM value. Because for a given trial precalculated. For our ACT trial, based on Table 1, all the decisions there are a finite number of possible toxicity and efficacy out- have been precalculated and presented in Table 2. Similar to the
www.aacrjournals.org Clin Cancer Res; 23(1) January 1, 2017 15
Downloaded from clincancerres.aacrjournals.org on September 27, 2021. © 2017 American Association for Cancer Research. Published OnlineFirst October 14, 2016; DOI: 10.1158/1078-0432.CCR-16-1125
Li et al.
Safety utility function Efficacy utility function
1.0 1.0
0.8 0.8
0.6 0.6 f(p) f(q) 0.4 0.4 Figure 1. Safety and efficacy utility functions.
0.2 0.2
0.0 0.0
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 p q
© 2016 American Association for Cancer Research
decision table for mTPI, this table enables clinicians to conduct and efficacy can be constructed based on pT and qE through the trial with transparency. discussions with clinicians. For example, Fig. 1 (left) shows the In practice, the TEPI design needs to be calibrated according utility function f1(p) for safety, where p denotes the toxicity rate. to physicians' needs. This is transparent and requires little effort. Utility f1(p) is set to 1 if p 15%, 0 if p > 40%, and linearly The tuning is for the intervals in Table 1 so that the decisions decreasing with p if p 2ð15%; 40%Þ: Utility f2(q) is set in a similar in Table 2 are satisfactory to the clinicians. Specifically, by mod- fashion. Figure 1 shows both utility functions. We assume a ifying the interval combinations in Table 1, a new decision monotonic constraint on priors for pi's while selecting the table can be generated in the form of Table 2. The tuning is best dose (i.e., p1 p2 pd). The utility score function is completed once the desirable decisions are obtained. defined as Uðp; qÞ¼f1ðpÞf2ðqÞ, where p denotes the toxicity rate, To enable ethical constraints, we introduce two additional rules and q denotes the efficacy rate. Both f1ð Þ and f2ð Þ are truncated as part of the dose-finding algorithm. One is to exclude any dose linear functions, given by 8 8 with excessive toxicity, and the other is to exclude any dose with ; ; ; ; ; ; < 1 p 2ð0 p1 < 0 q 2ð0 q1 fi p p q q unacceptable ef cacy. 1 ; ; 1 ; ; f1ðpÞ¼ 1 ; p 2ðp p Þ f2ðqÞ¼ ; q 2ðq q Þ : p2 p1 1 2 : q2 q1 1 2 * Safety rule:IfPrðpi>pTjDÞ>h for a h close to 1 (say, 0.95), ; ; ; ; ; ; 0 p 2½p2 1Þ 1 q 2½q2 1Þ exclude dose i; i þ 1; ; d from future use for this trial (i.e., these doses will never be tested again in the trial) and treat the where p 's and q 's are prespecified cutoff values. For each dose i, i 1 next cohort of patients at dose . This corresponds to a we use a simple numerical approximation approach to compute dosing action of "DU "—de-escalate due to unacceptable T the posterior expected utility, E½Uðpi; qiÞjD . We generate a total of high toxicity. T random samples from the posterior distributions. For each * Futility rule:IfPrðqi > qEjDÞ< j for a small j (say 0.3), then t t t t t t sample t, we generate p ¼ðp1; ...; p Þ and q ¼ðq1; ...; q Þ as a exclude dose i from future use in the trial. This corresponds to d d random sample of d probabilities of toxicity and efficacy, respec- a dosing action of "EU"—escalate and never return due to tively. We perform the isotonic transformation (2, 14) on pt to unacceptable low efficacy—or "DU "—de-escalate and never E obtain p^t ¼ðp^t ; ...; p^t Þ where p^t p^t if i