<<

Bounded and the choice of jury selection procedures

Martin Van der Linden January 12, 2017

Abstract A peremptory challenge procedure allows the parties to a jury trial to dismiss some prospective jurors without justification. Complex challenge procedures offer unfair advantage to parties who are better able to strategize. I introduce a new measure of strategic complexity based on “level-k” thinking and use this measure to compare challenge procedures commonly used in practice. In applying this measure, I overturn some commonly held beliefs about which jury selection procedures are “strategically simple”.

1 Introduction

It is customary to let the parties involved in a jury trial dismiss some of the potential jurors without justification. The procedures for dismissal are known as peremptory challenge procedures, and are the status-quo in many common-law countries, including the United States.1 Side-stepping the lively on the value of such procedures, it is natural to ask whether particular challenge procedures are more desirable than others.2 Fairness is an important issue in jury selection. One feature of procedures which impacts fairness is strategic complexity. If a procedure is complex, par- ties with better strategic skills are likely to secure more favorable juries. This

1 In Swain v. Alabama, the Supreme Court affirmed that “the [peremptory] challenge is one of the most important of the rights secured to the accused.” (LaFave et al., 2009). Following Batson v. Kentucky, a party can disqualify a peremptory challenge by an op- ponent if she can prove that the challenge was based on a set of characteristics including race or gender. However, Batson v. Kentucky is notoriously hard to implement and judges rarely rule in favor of Batson challenges (Marder, 2012; Daly, 2016). 2 See, e.g., Horwitz (1992) and Keene (2009) for a defense and Broderick (1992) and Smith (2014) for an attack on peremptory challenges.

1 is particularly relevant in jury selection where the parties put significant re- sources into adopting an effective and jury selection consultancy has become a well-established industry.3 Using strategically simple procedures limits the impact on the selected jury of differences in the parties’ ability to strategize, or in their financial means to hire jury consultants. Experienced judges share this concern and have developed procedures which attempt to limit the parties’ ability to strategize. In a report on judges’ practices regarding peremptory challenges, Shapard and Johnson (1994, p. 6) write: “Some judges require that peremptories be exercised [following procedure X] [...]. This approach [...] makes it more difficult to pursue a strategy prohibited by Batson (or any other strategy). [...] A more extreme approach to the same end is [procedure Y] [...]. This approach imposes maximum limits on counsel’s ability to employ peremptories in a strategic manner.”4 In this paper, I develop a new measure of strategic complexity based on level- k thinking (see the survey in Crawford et al., 2013) and use this measure to compare the complexity of some challenge procedures commonly used by judges. The comparison of jury selection procedures presents two challenges. First, jury selection procedures are indirect mechanisms in which the par- ties’ actions do not consist in revealing their .5 Second, in some procedures commonly used in practice, the parties submit their challenges simultaneously, which induces games of incomplete . These two difficulties make it impossible to apply measures of strategic complexity pre- viously developed in the literature (Pathak and S¨onmez, 2013; de Clippel et al., 2014; Arribillaga and Mass´o, 2015). I overcome these difficulties by introducing the concept of a rationality threshold. Given a model of the behavior of her opponent, a party has a straightforward strategy if one of her strategies is a to any

3 As evidenced by the existence of the American Society of Trial Consultants, and its publication “The Jury Expert: The Art and Science of Litigation Advocacy”. Jury consultant Roy Futterman writes: “Caditz argues that [...] jury selectors pay [...] little to no attention to the strategic use of strikes. [...] it is a bit of a reach to say that strategy is barely utilized. In my experience, [...] [jury selection] comes closer to a long battle of stealth, counter-punches, misdirection, and hand-to-hand combat than a lofty academic experience.” (Excerpt from Roy Futterman’s answer to David Caditz’s August 20, 2014 post on http://www.thejuryexpert.com/). 4 See footnote 1 regarding Batson v. Kentucky. 5 A procedure based on direct revelation would go against the idea of allowing the parties to challenge jurors without justification.

2 strategy of her opponent that is consistent with this model. The rationality threshold measures the complexity of the model of her opponent that a party needs in order to have a straightforward strategy. For example, a rationality threshold of 1 corresponds to the parties having dominant strategies. In this case, a party has a straightforward strategy given any model of her opponent. When the rationality threshold is 2, a party only needs to know that her opponent is a best responder in order to have a straightforward strategy.6 Using the rationality threshold as a measure of strategic complexity pro- vides new insights and overturns some commonly held beliefs about jury selection procedures. Shapard and Johnson (1994, p. 6) write:

“Other judges, for the same purposes [limiting the parties’ abil- ity to strategize], allow all peremptories to be exercised after all challenges for cause, but with the parties making their choices ‘blind’ to the choices made by opposing parties (in contrast to alternating “strikes” from a list of names of panel members).”7

I show that, contrary to the judges’ intuition, procedures in which challenges are sequential tend to be strategically simpler than procedures in which chal- lenges are simultaneous: By generating incomplete information games, simul- taneous procedures increase the amount of guesswork needed to determine optimal strategies. I identify theoretical conditions under which this is true and show that these conditions are relevant for jury selection using simula- tions as well as field data. I also study the design of “maximally simple” jury selection procedures. I show that it is impossible to construct a “reasonable” procedure that allows the parties to challenge jurors and always has a dominant strategy. Hence, the lowest achievable rationality threshold is 2. Such a lowest rationality threshold is attained by a procedure that I call sequential one-shot in which the parties sequentially submit a single list of jurors that they want to chal- lenge. Although the focus of this paper is the study of jury selection procedures, the rationality threshold is a general measure of strategic complexity. Unlike previous measures in the literature, the rationality threshold can be used to compare any type of game, including indirect games and games of incomplete

6 Ho et al. (1998) previously introduced the term “rationality threshold” to refer to a similar concept. They used their concept of a rationality threshold to explain some aspects of behavior in beauty contest games. The novelty here is to use the rationality threshold as a measure of strategic complexity. 7 Unlike peremptory challenges, challenges for cause must be based on recognized by law, such as being a direct relative of one of the parties.

3 information. The rationality threshold is the first such measure of strategic complexity proposed in the literature. Related Literature.— This paper differs from the previous game theoretic literature on jury selection procedures in at least two ways (see Flanagan (2015) for a recent review). First, the literature has focused on per- fect equilibrium as a .8 Subgame perfection requires a high level of strategic sophistication, especially in complex procedures. By relying on the concept of a rationality threshold, this paper accounts for the possi- bility of boundedly rational parties. I show how the rationality threshold, which measures the “amount” of common and rationality needed to reach an equilibrium, can be used to measure the strategic complexity of a procedure. Second, here, jury selection is studied from the point of view of and .9 Most of the literature focuses on the characterization and properties of equilibria in different procedures.10 When the performance of procedures is compared, it is typically in terms of their effects on the composition of the jury. This approach has yielded few policy recommen- dations.11 By contrast, this paper adopts a traditional mechanism design approach and compares procedures with respect to the standard objective of limiting the parties’ ability to strategize. This later approach enables a clear comparison of some of the procedures used by judges. The paper is organized as follows. Section 2 introduces the model as well as several examples of jury selection procedures and a general class thereof. In Section 3, I show that most “reasonable” jury selection procedures do not have dominant strategies. Section 4 formally introduces the concept of a rationality threshold. The rationality threshold is then applied to comparing the strategic complexity of jury selection procedures in Sections 5 and 6. Omitted proofs can be found in the Appendix.

2 Model and procedures

I focus on struck procedures. In addition to peremptory challenges which require no justification, the parties can raise challenges for cause which must be based on some recognized by law, such as being a direct relative of

8 Two exception are Bermant (1982) and Caditz (2015). 9 In this respect, the closest paper is de Clippel et al. (2014), which takes a mechanism design perspective but studies the selection of a single arbitrator. 10 See Roth et al. (1977), Brams and Davis (1978), DeGroot and Kadane (1980), Kadane et al. (1999), Alpern and Gal (2009), and Alpern et al. (2010). 11 See, however, Bermant (1982) and Flanagan (2015, Section 4.2).

4 one of the parties. As explained by Bermant and Shapard (1981, p. 92), the defining feature of a struck procedure “is that the judge rules on all challenges for cause before the parties claim any peremptories. Enough potential jurors are examined to allow for the size of the jury plus the number of peremptory challenges allotted to both sides. In a federal felony trial, for example, the jury size is twelve; the prosecution has six peremptories, and the defense has ten. Under the struck jury method, therefore, 28 potential jurors are cleared through challenges for cause before the exercise of peremptories.” This contrasts with strike and replace procedures in which challenges for cause and peremptory challenges are intertwined. In a strike and re- place procedure, prospective jurors who are challenged (either for cause or peremptorily) are replaced by new jurors from the pool and “to one degree or another, counsel exercise their challenges without knowing the character- istics of the next potential juror to be interviewed” (Bermant and Shapard, 1981, p. 93). Struck procedures are commonly used in federal courts. In a 1977 survey of judges’ practices regarding the exercise of peremptory challenges, 55% of federal district judges reported using a struck procedure (Bermant and Shapard, 1981), as opposed to a strike and replace procedure. Today, the use of a struck procedure is, for example, recommended by law as the preferred method for criminal cases other than first-degree murder in Minnesota.12 Struck procedures are used in other court circuits as well.13 However, the details of a procedure are often left to the discretion of the judge and may vary inside a single court circuit or state (Bermant and Shapard, 1981; Cohen and Cohen, 2003), which makes it hard to determine the procedures used in practice without surveying judges.

2.1 The model The set of prospective jurors left after all challenges for cause have been raised is N = {1,...,n}. The defendant is d and the plaintiff is p. The defendant and the plaintiff are allowed cd and cp peremptory challenges, respectively. Out of N, a jury J of b jurors must be selected. The jurors in J are the impaneled jurors. As explained above, when struck procedures are considered, n = b + cd + cp in order to allow the parties to challenge up to cd and cp jurors.

12 Minnesota Court Rules, Criminal Procedure, Rule 26.02, Subd.4.(3)b). Minnesota Court Rules, however, recommend using a strike and replace procedure for first-degree murders (Minnesota Court Rules, Criminal Procedure, Rule 26.02, Subd.4.(3)d)). 13New Jersey Court Rule 1:8-3(e) suggests the use of a struck procedure.

5 Let J be the set of juries containing b jurors and ∆J be the set of lotteries on J . The parties have expected preferences Rd and Rp on ∆J , respectively, with corresponding Bernoulli utility functions ud and up on J . A pair of preferences (Rp, Rd) is called a (preference) profile and a quintuple (Rd, Rp,cd,cp, b) a (jury selection) problem. Example 1. If π(J) is the probability that jury J convicts the defendant, then party i’s preference could be represented by a utility function of the form ui = vi(π(J)), where vp is increasing and vd decreasing. Throughout, I assume that preferences on juries are separable, i.e., if replacing juror h by juror j in jury J is an improvement according to ui, then the same is true when h is replaced by j in any other jury J ′. Formally, for any i ∈ {d,p} and any J, J ′ ∈ J , h ∈ J ∩ J ′ and j ∈ N\(J ∪ J ′), ′ ′ ui(J ∪{j}\{h}) ≥ ui(J) implies that ui(J ∪{j}\{h}) ≥ ui(J ). Separability is mostly assumed for simplicity. Separability eases the expo- sition because it implies that the preferences Rd and Rp induce well-defined preferences on the jurors in N. Below, I explain how some of the results in this paper can be extended when the separability assumption is relaxed. For simplicity, it is also assumed that the preferences on jurors induced by Rd and Rp are strict. Slightly abusing the notation, let Ri serve to denote i’s preference on jurors. An extreme kind of profiles are juror inverse profiles. A profile is juror inverse if Rd and Rp induce inverse preferences on jurors (i.e., for all j, h ∈ N, j Rp h if and only if h Rd j). Unlike separability which is assumed throughout the paper, juror inverse profiles are only considered as a special case.

2.2 Procedures As attested to by Bermant and Shapard (1981), a wide variety of struck pro- cedures are used by judges. One common type of struck procedure are pro- cedures that I call one-shot. In a one-shot procedure, each party i ∈{d,p} submits a list of up to ci jurors in N that i wants to challenge. Depending on the procedure, the parties submit their lists simultaneously (one-shotM ) or sequentially (one-shotQ). The impaneled jurors are the jurors in N who have not been challenged. If more than b jurors are left unchallenged, the b impaneled jurors are drawn (uniformly) at random from the unchallenged jurors. The use of one-shotM is documented by Bermant (1982, Step. 5, Com- ments by Judges Feikens and Voorhees). I have not found direct evidence of the use of one-shotQ, although Bermant (1982, Step. 5, Comments by Judge Enright) shows that a procedure in which the parties alternate challenges

6 twice has been used in practice, with each party allowed to challenge up to ci 14 2 jurors in each round. Another common type of struck procedure are the procedures that I call alternating. Alternating procedures proceed through a succession of rounds in which the parties can challenge as many jurors in N as they have challenges left. Again, an alternating procedure can be either simultaneous (alternatingM ) or sequential (alternatingQ) depending on whether chal- lenges are submitted simultaneously or sequentially. In alternatingM , if both parties challenge the same jurors in a given round, both parties are charged with the challenge and can challenge one less juror. Alternating procedures stop when neither of the parties has challenges left, or when both parties abstain from challenging a juror in a single round. The impaneled jurors are the jurors left unchallenged in N, or a random draw of b of these jurors if more than b jurors are left unchallenged. AlternatingQ is, for example, recommended by law as the preferred method for criminal cases other than first-degree murder in Minnesota (see footnote 12). I have not found direct evidence of the use of alternatingM . However, the use of simultaneous challenges in sequential procedures is not unheard of. Simultaneous challenges are, for example, used in the alternating strike and replace procedure for civil cases in Tennessee.15 One-shot and alternating procedures are part of the class of N-struck procedures in which parties take turns challenging jurors from N for a number of rounds.16 Formally, every N-struck procedure consists of a max- imum of f ≥ 1 rounds, where f differs between procedures. Each round r r ∈ {1,...,f} is characterized by a maximum number of challenges xi ≥ 1 f r for each party, with Pr=1 xi ≥ ci. The number of challenges party i has left r 1 in round r is ℓi , with ℓi = ci. In each round r :

r r (a) The parties can challenge up to min{xi ,ℓi } jurors among the jurors in 14 In the prior game theoretic literature on jury selection procedures, one-shot proce- dures have sometimes been misleadingly called struck procedures. In line with the law literature (see the quote from Bermant and Shapard, 1981, at the beginning of this sec- tion), it is more appropriate to use the term “struck procedures” to denote the class of struck procedures that includes one-shot procedures. 15 Tennessee Court Rules, Rules of Civil Procedure, Rule 47.03. The imposition of a uniform procedure for peremptory challenges in Tennessee came with a reform started in late 1997 and described in detail in Cohen and Cohen (2003). The adoption of simultaneous challenges was part of the reform’s effort to “permit peremptory challenges to be exercised in a way that does not disclose to potential jurors which lawyer exercised the challenge.” (Cohen and Cohen, 2003, pp. 25–26) 16 The name “N-struck procedure” emphasizes the fact that, in each round, the parties can challenge any juror in N that has not been challenged yet. This is not the case in every struck procedure. See, e.g., the procedure described in footnote 19.

7 N who have not yet been challenged. Challenges are sequential if the procedure is sequential, and simultaneous if the procedure is simultane- ous.

(b) For each party i ∈{d,p}, the number of challenges left is decreased by r+1 the number of jurors that the party challenged in (a) (i.e., ℓi equals r ℓi minus the number of jurors that the party challenged in (a)). The procedure terminates when no party has challenges left, when round f is reached, or when both parties abstain from challenging any juror in a single round. The jurors left unchallenged when the procedure terminates are the impaneled jurors. If more than b jurors are left unchallenged when the procedure terminates, the b impaneled jurors are drawn at random from the unchallenged jurors. 1 One-shot procedures are N-struck procedures with f = 1 and xi = ci for both parties i ∈{d,p}. Alternating procedures are N-struck procedures with r xi = ci for both i ∈ {d,p} and all r ∈ {1,...,f}, and f = 2 maxi∈{d,p} ci. Besides one-shot and alternating procedures, the class of N-struck procedures includes, for example, the two-round procedure described above. From a game theoretic point of view, a (jury selection) procedure is an extensive game form Γ: Sd × Sp → ∆J that associates any pair of strategies (sd,sp) in some strategy space Sd × Sp with a lottery on juries in J .

3 Impossibility results

Given preference Ri, a best response for party i to some strategy s−i of her opponent is a strategy ti(s−i) such that

′ ′ Γti(s−i),s−i Ri Γsi,s−i for all si ∈ Si. (1)

When −i plays s−i and i plays ti(s−i), party i best responds to −i. Given some model S−i ⊆ S−i of her opponent, a strategy si ∈ Si is straightfor- ward for i if si is a best response to every strategy s−i ∈ S−i. A dominant ∗ strategy is a strategy si ∈ Si that is a best response for i to any strat- egy s−i ∈ S−i. In other words, a dominant strategy is a strategy that is straightforward for i given any model of her opponent.17

17 Conversely, i has a straightforward strategy given model S−i ⊆ S−i if i has a dominant ′ ′ strategy in the reduced game Γ : Si × S−i → ∆J defined by Γ (si,s−i) = Γ(si,s−i) for all (si,s−i) ∈ Si × S−i. In this sense, the concept of a straightforward strategy generalizes that of a dominant strategy.

8 Given some domain of preferences, a dominant strategy procedure is a procedure in which both parties have a dominant strategy for every profile in the domain. Dominant strategy procedures are strategically simple because each party can determine an optimal strategy independently of any guess about the strategy of her opponent.18 Dominant strategy procedures guarantee a form of equality among equals: Two parties having the same preference but different abilities to guess how their opponent will play should be able to secure similar outcomes. It is useful to relate dominant strategies with level-k thinking (see the survey in Crawford et al., 2013). In the level-k terminology, an L0 party is a non-strategic party who could potentially play any strategy. An L1 party assumes that her opponent is L0, makes a guess about the L0 strategy s0 that her opponent will employ, and best responds to s0. Similarly, an Lk party assumes that her opponent is Lk−1, makes a guess about the Lk−1 strategy sk−1 that her opponent will employ, and best responds to sk−1. Observe that, because an L0 strategy can be any strategy, i has a dom- inant strategy if and only if i has an L1 strategy that is a best response to every L0 strategy of her opponent. In the language of level-k thinking, a dominant strategy procedure limits the impact of differences in strategic skills because i can determine an optimal strategy independently of her belief k−i about her opponent’s level of rationality k−i, or her guess about which L strategy her opponent will employ. Unfortunately, most reasonable procedures that permits challenges do not have a dominant strategy. Consider one-shotM . In one-shotM , i’s only best response to any strategy s−i is to challenge her ci worst jurors among the jurors that −i does not challenge in s−i. As illustrated in Example 2, such a best response is very dependent on the challenges chosen by −i. Hence, one-shotM is not a dominant strategy procedure.

Example 2. Suppose that each juror has four challenges (cd = cp = 4) and one juror has to be selected (b = 1). A set of nine prospective jurors N = {1,..., 9} will therefore remain after all challenges for cause have been raised. Let d’s preference on these nine jurors be 1 Rd 2 Rd ... Rd 9. If p challenges the circled jurors in (2), then d’s best response is to challenge the squared jurors in (2).

Rd : 1 2 3 4 5 6 7 8 9 (2)

18 Requiring the existence of dominant strategies in a procedure is somewhat similar to ∗ requiring strategy-proofness in direct mechanisms. One difference is that si need not be a simple function of i’s true preference.

9 On the other hand, if p challenges the circled jurors in (3), then d’s best response is to challenge the squared jurors in (3).

Rd : 1 2 3 4 5 6 7 8 9 (3)

Clearly, the challenge of the squared jurors in (3) is not a best response for d to p challenging the circled jurors in (2), which shows that one-shotM is not a dominant strategy procedure. As shown in Proposition 1, the preceding example generalizes to the whole class of N-struck procedures and to any problem. Intuitively, in any N-struck procedure, if −i does not challenge any jurors, then i’s best response is to challenge her ci worst jurors. On the other hand, if −i challenges one of the ci worst jurors of i, say w, then i is better off not challenging w and challenging her other ci worst jurors. Recall that a (jury selection) problem is a quintuple (Rd, Rp,cd,cp, b). Proposition 1. For any problem, (i) one of the parties does not have a dominant strategy in one-shotQ and (ii) neither party has a dominant strategy in any N-struck procedure different from one-shotQ.

Note that one-shotM is an N-struck procedure different from one-shotQ. Hence, Proposition 1 shows that, for every problem, neither party has a dominant strategy in one-shotM . One-shotQ is the exception among N-struck procedures: It is the only N-struck procedure in which one of the parties — the second party to challenge — has a dominant strategy, although the other party does not (see the proof of the proposition). Of course, N-struck procedures are only a small subset of all possible jury selection procedures. Other procedures used in practice include the strike and replace procedures, as well as other struck procedures.19 It is therefore natural to ask whether there exists dominant strategy procedures for jury selection outside of the class of N-struck procedures. The next proposition shows that if such procedures exist, then they must either deprive a party from her right to challenge at least one juror in N, or be so intricate that they are unlikely to be used in practice. A procedure satisfies finiteness if the set of decision nodes is finite for both parties and for Nature. A procedure satisfies minimal challenge if

19 For example, Bermant (1982, Step. 5, Comments by Judge Atkins) describes a struck procedure in which a first set of b jurors is selected from N and the parties take turns challenging jurors in that set, with every challenged juror being replaced by another juror from N that has not been selected yet. This procedure is not an N-struck procedure because juror from N cannot be challenged before they have been picked to replace one of the b jurors initially selected.

10 for every prospective juror j ∈ N, both parties i ∈ {d,p} have a strategy j j 20 si ∈ Si such that j is never part of the chosen jury when i plays si . Every N-struck procedure satisfies both finiteness and minimal challenge (strategy j si can, for example, involve challenging juror j and only juror j in the first round).

Proposition 2. On the domain of separable preferences, no dominant strat- egy procedure satisfies both finiteness and minimal challenge.

In the Appendix, I show that Proposition 2 is true even for smaller do- mains of profiles, including the domain of additive profiles. The proof of Proposition 2 makes use of Theorem 2 in Van der Linden (2015). That theo- rem implies that, in order to be a dominant strategy procedure, a procedure satisfying minimal challenge must generate a direct mechanism that has a range of possible outcomes which is in some sense “dense”. This, in turn, im- plies that the parties sometimes have a continuum of actions which conflicts with finiteness.

4 A measure of strategic complexity

Propositions 1 and 2 show that most procedures are not strategically simple in the sense that both parties cannot always follow the simple recommen- dation of playing a dominant strategy. This does not mean, however, that judges should give up on the idea of using procedures that are as simple as possible. This section and the next show that, although procedures generally fail to feature dominant strategies, not all procedures are equal in terms of strategic complexity.

4.1 Motivating example Brams and Davis (1978, p. 969) have argued that, when the parties have juror inverse preferences, one-shot procedures raise “no strategic questions of timing: given that each side can determine those veniremen it believes least favorably disposed to its cause, it should challenge these up to the limit of its peremptory challenges.”. This may come as a surprise given Example 2 and Proposition 1. Certainly, one-shotM is not a dominant strategy procedure. How can we then make sense of Brams and Davis’ claim? The next examples hints at one possible answer.

20 j That is, the probability that j is chosen given that i plays si is zero for all s−i.

11 Example 3. Consider one-shotM with cd = cp = 2 and b = 5. Let d have preference 1 Rd ... Rd 9. Also, suppose that the parties have juror inverse preferences. If d believes that p is best responding to some of her strategies, then d knows that p will challenge two of the circled jurors in (4).

R : 1 2 3 4 5 6 7 8 9 d (4) Rp :9 8 7 65 4 3 2 1

Indeed, a best response by p involves challenging her two worst jurors among the seven jurors that she believes d will not challenge. This can never lead to p challenging a juror in {5,..., 9}. Thus, a best response by d to the minimal belief that p is a best responder always consists in challenging her two worst jurors (squared in (4)). By symmetry, the same is true for p.

In Example 3, one-shotM “raises no strategic question” because a party only needs to know that her opponent is a best responder in order to have a straightforward best response. For each party i, challenging her ci worst jurors is a best response to any strategy of party −i that is itself a best response to some of i’s strategies. In this sense, each party has a straight- forward strategy given a minimal model of the strategic behavior of her opponent. In the rest of this section, I generalize this logic to obtain a measure of strategic complexity. This measure is then applied in the next two sec- tions to compare struck procedures for different assumptions on the problem (Rd, Rp,cd,cp, b).

4.2 The rationality threshold As argued above, first-best procedures are dominant strategy procedures in which the parties have a straightforward strategy whatever their model of their opponent. It is then natural for second-best procedures to be procedures in which the parties have a straightforward strategy given a minimal model of their opponent. As suggested by Example 3, a meaningful concept of minimal model is for a party to assume that her opponent will play a best response to some of her strategies. In the language of level-k thinking, a procedure is second-best if each party i has an L2 strategy that is a best responds to every L1 strategy of her opponent. Such second-best procedures limit the impact of differences in strategic skills because i’s optimal strategy depends minimally on her model

12 Si S−i Si S−i Si S−i 1 1 1 1 b b s 1 b b s 1 b b s si −i si −i si −i 2 2 2 2 b b s 2 b b s 2 b b s si −i si −i si −i 3 3 3 3 b b s 3 b b s 3 b b s si −i si −i si −i 4 4 4 4 b b s 4 b b s 4 b b s si −i si −i si −i 5 5 5 5 b b s 5 b b s 5 b b s si −i si −i si −i 6 6 6 6 b b s 6 b b s 6 b b s si −i si −i si −i

0 0 0 L2 L−i 1 L−i 1 L−i 1 −i L−i L−i L−i (a) First-best procedure (b) Second-best procedure (c) Third-best procedure

Figure 1: Representation of first-, second-, and third-best procedures. An arrow from strategy si to strategy s−i means that si is a best response to s−i. In first-best procedures represented in (a), each party i has a strategy — 6 si in the figure — that is a best response to every strategy of her opponent 0 (i.e., to every L−i strategy). In second-best procedures represented in (b), 4 each party i has a strategy — si in the figure — that is a best response to every strategy of her opponent that is itself a best response (i.e., to every 1 4 0 L−i strategy). However, si does not need to be a best response to every L−i 4 1 strategy. For example, in the figure, si is not a best response to s−i. The same logic extends to third-best procedures represented in (c). of −i: i only needs to assume that −i is L1 to have a straightforward strategy. The difference between first-best and second-best procedures is illustrated in Figure 1(a) and (b). A second-best procedure guarantees a form of second-best equality among equal parties. Consider two defendants with the same preference who both believe that the plaintiff is L1 and best responds to one of their strategies. The two defendants might differ in other strategic aspects, such as their ability to guess which of their strategies the plaintiff best responds to. In a second-best procedure, these differences have no impact: the two defendants play equivalent strategies and secure the same . Similarly, third-best procedures feature straightforward best responses given a model that is minimally stronger than in second-best procedures. A natural candidate for such a minimally stronger model is for i to assume that −i is L2 (see Figure 1(c)). This logic extends to higher level reasoning. In procedures with multiple rounds, it is important to ensure that best responses be enforced throughout the game tree. Therefore, the measure

13 of strategic complexity defined below relies on iterated conditional best re- sponses, rather than iterated best responses. Iterated conditional best re- sponses are obtained by iterative elimination of strategies that are never best responses in every subgame of an extensive game.21

Definition 1 (Iterated conditional best response). For any procedure Γ and any profile (Rd, Rp), the process of iterated conditional best response is defined as follows:

0 Round 0. For each i ∈ {d,p}, the set of CLi (conditionally level 0) strategies is Si.

0 Round 1. For each i ∈{d,p}, eliminate from CLi the strategies si for which there exists a subgame γ of Γ such that the restriction si|γ of si to γ is not a best response to any s−i|γ in γ. 1 1 The remaining set of strategies is denoted by CLi . Any si ∈ CLi is called 1 a CLi (conditionally level 1) strategy for i. . .

k−1 Round k. For each i ∈ {d,p}, eliminate from CLi the strategies si for which there exists a subgame γ of Γ such that the restriction si|γ of si to γ k−1 is not a best response to s−i|γ for any s−i ∈ CL−i . k k The remaining set of strategies is denoted CLi . Any si ∈ CLi is called a k CLi (conditionally level k) strategy for i.

0 Observe that the sets of conditionally level k strategies are nested (CLi ⊇ 1 CLi ⊇ ... ). Observe also that, for every procedure Γ that satisfies finiteness, the set of conditionally level k strategies is non-empty for every k.22 The argument at the beginning of this section suggests using the following concept of a rationality threshold as a measure of strategic complexity.23

21 The use of “conditional” to qualify a process of iterated elimination of strategies which applies to every subgame is taken from Fudenberg and Tirole (1991). 22 When Γ satisfies finiteness, for any i ∈ {d, p} and any strategy s−i, the set {Γ(si,s−i) | si ∈ Si} is finite because Si is finite. Hence, there must exist a strategy ′ ′ 1 ti(s−i) such that Γti(s−i),s−i Ri Γsi,s−i for all si ∈ Si and CLi is non-empty. The k non-emptiness of CLi then follows by induction. 23 The term “rationality threshold” is often used in relation to the process of iterated elimination of undominated strategies (e.g., Mas-Colell et al., 1995; Ho et al., 1998). Here I use the same term to refer to the iterated elimination of strategies that are never best responses. In general, the two processes are not equivalent, see the Conclusion.

14 Definition 2 (Rationality threshold). For any procedure Γ and any profile ∗ (Rd, Rp), the rationality threshold is the smallest integer r such that, for r∗ ∗ each i ∈ {d,p}, there exists a CLi strategy si that is a best response to r∗−1 every CL−i strategy. If there exists no such integer, then the rationality threshold of Γ is ∞, i.e., the procedure cannot be solved by iterated conditional best response. Note that if the rationality threshold r∗ is finite, then there exists a pair of r∗ r∗ strategies (si,s−i) ∈ CLi × CL−i is a subgame perfect equilibrium. Any such equilibrium is, in fact, a straightforward subgame perfect equilibrium in the sense that each player i can be∗ viewed as playing a straightforward (r −1) 24 strategy given some model S−i ⊆ CL−i of the other player.

5 One-shot procedures

In this section, I show that one-shotQ is strategically simpler than one-shotM in the following sense.

Proposition 3. (i) For every problem, the rationality threshold of one-shotM is no smaller than the rationality threshold of one-shotQ. (ii) For some problems, the rationality threshold of one-shotM is larger than the rationality threshold of one-shotQ.

In the rest of this section, I prove and illustrate Proposition 3. The proof of Proposition 3(ii) relies on problems the profiles of which are not juror inverse. I show that such problems are common using simulations and field data.

5.1 One-shotQ is always maximally simple The next example illustrates how to compute the rationality threshold of one-shotQ for a particular problem.

Example 4. The example is illustrated in Figure 2. Suppose that cd = cp = b = 1 and d is the first party to challenge. Also suppose that the parties have aligned preferences 1 Rd 2 Rd 3 and 1 Rp 2 Rp 3. Because preferences on jurors are strict, p has a unique dominant strategy ∗ sp which consists in (a) challenging juror 3 if d did not challenge 3 and (b) challenging juror 2 if d did challenge juror 3 (dashed blue branches in the

24 The integer r∗ is the smallest integer for which such an equilibrium exists.

15 2 b {1} 3 p b 1 b {2}

∅ b L{1,2} 3 b {1} 2 p b 1 b {3}

∅ b L{1,3} 1 2 b CLp and CLp d 3 b {2} 1 p CL2 b 2 b {3} d

∅ b L{2,3}

3 b L{1,2}

∅ 2 b p b L{1,2}

1 b L{2,3}

∅ b L{1,2,3} Figure 2: Computing the rationality threshold in Example 4. In the figure, LT for T ⊂ N represents a lottery in which two jurors are drawn (uniformly) at random from T . The labels on the branches of the tree indicate the juror who is challenged in the corresponding action.

∗ 1 figure). Strategy sp is, therefore, the unique CLp strategy. It directly follows ∗ 1 from uniqueness that sp is a best response to all CLd strategies. 1 ∗ 2 Because there is a unique CLp strategy sp, any CLd strategy that best ∗ responds to sp (either of the red dashed branches in the figure) is a best 1 response to all CLp strategies. Hence, the rationality threshold of one-shotQ is at most 2 for this problem. But by Proposition 1, because one-shotM is an N-struck procedure, the rationality threshold of one-shotQ is at least 2 for every problem. Thus, the rationality threshold of one-shotQ is 2 for this problem.

It is not hard to see how the argument in Example 4 generalizes to any problem. In general, the party −i who challenges second in one-shotQ has a ∗ ∗ unique dominant strategy s−i. Then, any best response to s−i by i is a best 1 response to every CL−i strategy.

Proposition 4. For any problem, the rationality threshold of one-shotQ is 2.

As can be seen from the proof in Appendix, Proposition 4 does not rely on the separability assumption. Rather, the proof relies on the fact that

16 preferences on the outcomes of the procedure are strict. The proposition also extends to situations in which (which is implicit in the definition of a rationality threshold of 2) is relaxed. Consider Example 4. In order to have a straightforward best response, d only needs to know that p will challenge juror 3 if she challenges juror 2. Hence, d only needs to know which juror is p’s worst juror in order to have a straightforward best response (as opposed to knowing all of p’s preference on jurors). By Proposition 1, because one-shotM is an N-struck procedure, the ra- tionality threshold of one-shotM is at least 2 for every problem. Together with Proposition 4, this implies that the rationality threshold of one-shotM is never smaller than the rationality threshold of one-shotQ, which proves Proposition 3(i).

5.2 One-shotM is often complex: one-common profiles

I now show that one-shotM is more complex than one-shotQ when the profile is not juror inverse and preferences on jurors satisfy some “commonality at the bottom”. One such profile is presented in Example 5. In the example, both parties agree that juror 3 is the worst juror. Therefore, any best re- sponding party would challenge juror 3 if her opponent did not. But each party also prefers a situation in which her opponent challenges 3, and she challenges her second worst juror. That is, each party would like to make a credible threat not to challenge juror 3 and free ride on her opponent’s chal- lenge of juror 3. But because the procedure is simultaneous, such a credible threat is impossible. As explained in detail in Example 5, the impossibility for the parties to commit to leaving juror 3 unchallenged makes the ratio- nality threshold of one-shotM larger than 2 for this problem. Together with Proposition 4, Example 5 therefore proves Proposition 3(ii). In fact, the rationality threshold in Example 5 is ∞, which shows just how complex one-shotM can become when the profile is not juror inverse. As the example illustrates, for some profiles that are not juror inverse, one- shotM cannot be solved by iterated conditional best response. For these profiles, the parties’ of each others’ rationality, prefer- ences, knowledge of each others’ preferences, etc., is not sufficient to provide the parties with straightforward strategies and make the game strategically simple. Even for “high levels of common knowledge”, the game induced by one-shotM remains akin to a game of chicken in which each party prefer to swerve (i.e., challenge some of her worst jurors) if her opponent stays straight (i.e., does not challenge some of her worst jurors), but prefers to stay straight if her opponent swerves.

17 CL0 CL1 CL2 CL3 p d p d ... strategies strategies strategies strategies Juror 3 2 3 2 ... challenged 1 3 1 3 ...

Figure 3: For the profile of preferences and the values of cd, cp and b in Example 5, the rationality threshold of one-shotM is ∞.

Example 5. The example is illustrated in Figure 3. Suppose that b = cd = cp = 1. Also suppose that the parties’ preferences are 1 Rd 2 Rd 3 and 2 Rp 1 Rp 3. 0 Both challenging juror 3 and challenging juror 1 are CLp strategies. Chal- lenging juror 2 is d’s best response to p challenging juror 3, and challenging juror 3 is d’s best response to p challenging juror 1. Hence, both challenging 1 juror 2 and challenging juror 3 are CLd strategies. But no strategy of p is 1 a best response to both of these CLd strategies. Therefore, the rationality threshold of one-shotM is at least 3 for this problem. 1 Party p’s best responses to these two CLd strategies are to challenge juror 3 (p’s best response to d challenging juror 2) and to challenge juror 1 (p’s best response to d challenging juror 3). Thus, both challenging juror 3 and 2 2 challenging juror 1 are CLp strategies. But these two CLp strategies are the 0 two CLp strategies considered at the beginning of the example. The argument therefore extends by induction to show that the rationality threshold of one- shotM is ∞ for this problem.

5.2.1 One-common profiles

Profiles for which the rationality threshold of one-shotM is ∞ are not rare. Given cd and cp, a profile is one-common if a juror w that is among the cd worst jurors of d is also among the cp worst jurors p. Intuitively, the rationality threshold of one-shotM is ∞ for one-common profiles because the free rider problem described in Example 5 extends to one-common profiles. When the profile is one-common, each party would like to make a credible threat not to challenge juror w and free ride on her opponent’s challenge of juror w. But if her opponent does not challenge w, each party prefers challenging w herself than leaving w unchallenged.

Proposition 5. If the profile is one-common, then the rationality threshold of one-shotM is ∞. Although Proposition 5 relies more directly than Proposition 4 on the

18 .20 .46 .67 .81 .2 .7 .11 .13 .15

.26 .55 .75 .87 .4 .11 .14 .16 .18

.40 .70 .85 .93 .10 .17 .20 .21 .22 Number of jury members (b) Number of jury members (b) 2 4 6 8 10 12 2 4 6 8 10 12

2 4 6 8 5 10 15 20 Number of challenges per party (c) Number of challenges per party (c)

(a) (b)

Figure 4: Proportion of one-common profiles relative to the set of (a) all profiles and (b) almost juror inverse profiles. All proportions are proportions of profiles of preferences on jurors. Also, the proportions are for the case cd = cp = c. Alternatively, the proportions are lower bounds for c = min{cd,cp}. See the Appendix for details on the computation of these proportions.

separability assumption,25 the intuition behind Proposition 5 applies when separability is relaxed. Regardless of the assumptions on preferences, if for some juror j, both parties have best responses that include challenging j, then the rationality threshold is larger than 2 in one-shotM . The proposition also extends to situations of imperfect information in which the parties only know that they have a common juror w at the bottom of their ranking of jurors (but do not know each other’s complete preference on juror). One-common profiles arise in a number of natural jury selection situa- tions. For example, both parties might dislike a juror who they view as too unpredictable. Both parties may also dislike “devil advocate” or “irresolute” jurors who are likely to induce hung juries forcing retrials of the case. Finally, d may dislike juror j because of its foreseen position some charges, while p dislike juror j because of its foreseen position a different charges. As shown in Figure 4(a), the proportion of one-common profiles relative to the set of all profiles is high. (In the figure and henceforth, proportions of

25 One-common profiles are not well-defined without the separability assumption.

19 profiles refer to proportions of profiles of preferences on jurors.) Even among profiles that are close to being juror inverse, the proportion of one-common profiles can be significant when c is high relative to b. Figure 4(b) shows the proportion of one-common profiles among almost juror inverse profiles. A profile is almost juror inverse if it can be constructed from a juror inverse profiles by changing the ranking of a single juror in the preference of one of the parties.26 This is illustrated in Example 6.

Example 6. Suppose that cd = cp = 4 and b = 1. The following profile is almost juror inverse because it is constructed from a juror inverse profile by changing the ranking of a single juror — namely juror 7 — in the preference of p.

R :123456 7 8 9 d (5) Rp :9865432 7 1

The above profile is also one-common because 7 is among the four worst jurors of both d and p, and cd = cp = 4.

Procedures in which c is high relative to b exist in practice. In the United States, the number of peremptory challenges tends to increase with the grav- ity of the charges. For example, in federal cases for which the death penalty is sought by the prosecution, b = 12 and cd = cp = 20. In this case, Proposi- tion 5 together with the proportion in Figure 4(b) show that the rationality threshold is ∞ for 15% of almost juror inverse profiles.

5.2.2 One-common profiles in the field To obtain further evidence of the prevalence of one-common profiles, I con- sider real-world arbitration cases from the New Jersey Public Employment Relations Commission (Bloom and Cavanagh, 1986; de Clippel et al., 2014). From 1985 to 1996, the Commission used a veto-rank mechanism to select an arbitrator in cases involving a union and an employer. In the veto-rank mechanism used by the Commission, the union and the employer are presented with seven potential arbitrators. The union and the employer simultaneously challenge three potential arbitrators and rank order the remaining arbitrators. The chosen arbitrator is the unchallenged arbitrator with the lowest combined rank. Except for the way an arbitrator

26 ∗ ∗ Formally, profile (Rd, Rp) is almost juror inverse if there exists a juror inverse profile ∗ ∗ (Rd, Rp), a party i ∈ {d, p} and a juror j ∈ N such that (a) for all j, k ∈ N, j R−i k if ∗ ∗ and only if j R−i k, (b) for all j, k ∈ N with j, k 6= j , j Ri k if and only if j Ri k, and ∗ ∗ ∗ ∗ ∗ ∗ ∗ (c) for some j ∈ N with j 6= j , j Ri j and j Ri j or j Ri j and j Ri j .

20 is selected when challenges overlap, this procedure is equivalent to one-shotM 27 with b = 1 and cd = cp = 3. Out of 750 cases, de Clippel et al. (2014) report that the frequency of overlaps in the challenges was as indicated in Table 1.

Number of common challenges Proportion 0 13% 1 50% 2 34% 3 3%

Table 1: Overlap in challenges in the 750 arbitration cases.

Let a party be truthful if she challenges her three worst arbitrators (regardless of her reported ranking over the remaining arbitrators). If both parties were truthful in each of the 750 cases, then the data in Table 1 would imply that the profile of preferences on arbitrators was one-common in 87% of the 750 cases. However, because truthfulness is not a dominant strategy in the veto-rank mechanism, this is not an accurate estimate of the proportion of one-common profiles in the 750 cases. To obtain a more realistic estimate of this proportion, I consider the laboratory experiment on the veto-rank mechanism described in de Clippel et al. (2014). In the experiment, participants play the veto-rank mechanisms with five arbitrators (i.e., b = 1 and cd = cp = 2). The participants are randomly assigned to four different profiles of preferences, denoted Pf1, Pf2, Pf3, and Pf4 (see de Clippel et al., 2014). For each profile, de Clippel et al. (2014) observe 350 instances of the game being played. Based on the experimental data from de Clippel et al. (2014), I compute for each profile the proportion of plays in which both parties were truthful. I also compute this proportion across the four profiles. These proportions are reported in Table 2. I propose to use the values in Table 2 as estimates of the proportion of the 750 New Jersey cases in which both parties were truthful. Whichever estimate x from Table 2 is used, x − 13% is a lower bound on the proportion of one-common profiles. This lower bound is obtained by assuming that both parties were truthful in all 13% of cases in which the challenges did not overlap.

27 de Clippel et al. (2014) report that after 1996, the Commission started selecting the arbitrator at random from the list of unchallenged arbitrators, which means the Commis- sion effectively used one-shotM after 1996.

21 Proportion of plays in which Profile both parties challenged their two worst arbitrators Pf1 65% Pf2 45% Pf3 38% Pf4 24% Across the four profiles 43%

Table 2: Proportion of experimental plays of the veto-rank mechanism in de Clippel et al. (2014) in which both players challenged their two worst arbitrators.

This lower bounds is illustrated in Figure 5 for different values of the truthfulness estimate. Even under the most conservative truthfulness esti- mate (i.e., 24%), the lower bound on the proportion of one-common profiles is 11%. Using the truthfulness estimate based on the average across the four profiles, the lower bound on the proportion of one-common profiles is 30%. Overall, the results in this section contrasts with the judges’ intuitions that “blind” (i.e., simultaneous) procedures leave less room for the parties to strategize than sequential ones.28 Contrary to the judges’ intuition, the rationality thresholds suggest that one-shotQ is strategically simpler than one-shotM : By making past actions observable, one-shotQ allows the parties to make credible threats about the jurors they challenge, which reduces the amount of guesswork involved in determining an appropriate strategy. The next section shows that similar results hold for other N-struck procedures.

6 Alternating and other N-struck procedures

In general, it is unclear how alternatingM and alternatingQ compare. How- ever, extending the logic of Proposition 5, it is possible to obtain a partial comparison for a significant subset of profiles. For this subset of profiles, the rationality threshold of any simultaneous N-struck procedure (including alternatingM ) is infinite, whereas the rationality threshold of any sequential N-struck procedure (including alternatingQ) is finite. If preferences on the outcomes of a sequential N-struck procedure are strict (including preferences on lotteries), then the procedure always has a

28 See the last quote from Shapard and Johnson (1994) in the Introduction.

22 80 Across the four profiles 70 Pf4 Pf3 Pf2 Pf1

60

50

40

30

20

10

0

Lower bound on the proportion of the cases

in which the profile was one-common (in percents) 0 10 20 30 40 50 60 70 80 Proportion of the cases in which both parties challenged their three worst jurors (in percents) Figure 5: Lower bounds on the proportion of one-common profiles in the 750 arbitration cases (blue line) as a function of the proportion of cases in which both parties were truthful. Estimates for the later proportion using experimental data from de Clippel et al. (2014) are shown by the dashed vertical lines (see Table 2).

finite rationality threshold. This follows from the fact that, with no indiffer- ences on outcomes, sequential N-struck procedures induce games of that can be solved uniquely by .29 Then, the number of rounds of backward induction required to solve the game is an upper bound for the rationality threshold. Proposition 6. For any sequential N-struck procedures, if preferences on the outcomes of the procedure are strict, then the rationality threshold is finite and smaller than the depth of the game tree.

Recall that one-shotM has an infinite rationality threshold when the pro- file is one-common because each party would like to free ride on her oppo- nent’s challenge of one of the jurors they both dislike (see Example 7). This

29 More precisely, multiple strategy profiles can survive backward induction, but each of these profiles must yield the same outcome.

23 idea generalizes to the class of simultaneous N-struck procedures as a whole. Below, I identify for each simultaneous N-struck procedure Γ a set of Γ-one- common profiles. In Proposition 7, I show that any Γ-one-common profiles induces an infinite rationality threshold in Γ. In particular, together with Proposition 6, Proposition 7 implies that, whenever the profile is alternatingM -one-common (and preferences on out- comes are strict), the rationality threshold of alternatingQ is lower than the rationality threshold of alternatingM . That is, Proposition 3 extends par- tially to alternating procedures. Informally, given a simultaneous N-struck procedure Γ, a profile is Γ-one- common if, in one of the last of Γ, the set of jurors that remain unchallenged gives rise to the free rider problem described above. Formally, given Γ, a profile is Γ-one-common if there exists a subgame γ of Γ such γ that (a) both parties can still challenge jurors in γ (i.e., ℓi ≥ 1 for both i ∈ {d,p}), (b) the first round of γ is the last round of γ in which both parties can challenge jurors,30 and (c) among the unchallenged jurors, one of γ γ the ℓd worst jurors according to Rd is also one of the ℓp worst jurors according to Rp. For example, consider alternatingM -one-common profiles. The only sub- game of one-shotM is one-shotM itself. Hence, in the case of one-shotM , (a), (b) and (c) boil down to requiring that among N, one of the cd worst jurors according to Rd is also one of the cp worst jurors according to Rp, which is the definition of a one-common profile. Because the sets of alternatingM - one-common and one-common profiles are identical, the next proposition generalizes Proposition 5. Proposition 7. For any simultaneous N-struck procedure Γ, if the profile is Γ-one-common, then the rationality threshold of Γ is ∞. In general, the Γ-one-common and the one-common conditions are not equivalent. For example, there exist alternatingM -one-common profiles that are not one-common.

Example 7. Consider alternatingM and any problem in which cd = cp = 2, b = 1, and the preferences on jurors are,

R :12345 d (6) Rp :24513

The profile is not one-common because {4, 5}∩{1, 3} = ∅.

30 This could arise because the first round of γ is the terminal round of Γ, or because both parties only have one challenge left in γ.

24 However, consider the subgame γ∗ that follows from d challenging ju- ror 4 and p challenging juror 5 in the first round. Subgame γ∗ satisfies (a) and (b) in the definition of an alternatingM -one-common profile. Also, both players have the same least prefer jurors among {1, 2, 3}, the set of unchal- lenged jurors at the beginning of γ∗. Hence, condition (c) in the definition of an alternatingM -one-common profile is also satisfied, and profile (6) is alternatingM -one-common. To see why the rationality threshold is infinite in subgame γ∗, observe that in γ∗, each party wants to free-ride on her opponent’s challenge of juror 3. This induces an infinite rationality threshold for the same reason that the rationality threshold is infinite in Example 5. The procedure used to reach subgame γ∗ in Example 7 can be generalized. In alternatingM , for any set of jurors T containing b + 2 jurors, there exists a subgame γ satisfying (a) and (b) such that T is the set of unchallenged jurors 31 and each party has one challenge left. Hence, a profile is alternatingM -one- common if there exists a set T containing b + 2 jurors such that Rp and Rd have the same worst juror among the jurors in T . This sufficient condition can be used to prove the following result.

Proposition 8. Every one-common profile is alternatingM -one-common.

Hence, the proportion of alternatingM -one-common profiles is no smaller than the proportion of one-common profiles, and the proportions in Figure 4 are therefore lower bounds for the proportions of alternatingM -one-common profiles.32 The lower bounds on the proportions of one-common profiles in the arbitration data in Figure 5 are also lower bounds on the proportion of alternatingM -one-common profiles in the same data.

7 Conclusion

This paper shows how jury selection procedures can be compared in terms of their strategic complexity by computing their rationality thresholds, i.e., the number of rounds of elimination of strategies that are not best responses required for the parties to have a dominant strategy. The results in this paper notably show that procedures in which challenges are made sequen- tially tend to be strategically simpler than procedures in which challenges are simultaneous. 31 Such a subgame is reached, for example, after d alone challenges jurors in N\T for cd − 1, followed by p alone challenging remaining jurors among N\T for cp − 1 rounds. 32 In fact, together with Example 7, Proposition 8 implies that the proportion of alternatingM -one-common profiles is strictly larger than that of one-common profiles.

25 Si S−i 1 1 b b s si −i 2 2 b b s si −i 3 3 b b s si −i 4 4 b b s si −i 5 5 b b s si −i 6 6 b b s si −i

0 S2 0 S2 Si 1 i S−i 1 −i Si S−i Figure 6: An arbitrary hierarchical model for a given some preference profile (Rd, Rp).

The rationality threshold offers a new method to compare the strategic complexity of mechanisms. Unlike previous methods in the literature (Pathak and S¨onmez, 2013; de Clippel et al., 2014; Arribillaga and Mass´o, 2015), it allows for comparisons even when the mechanisms at stake are indirect or induce games of incomplete information. More generally, the rationality threshold shows how hierarchical models can be used to compare the strategic complexity of mechanisms. As illus- trated in Figure 6, for any profile (Rd, Rp), a hierarchical model specifies 0 m 0 m 0 a pair ({Sd ,...,Sd }, {Sp,...,Sp }) of nested strategy sets, i.e., Si ⊆ Si ⊆ m k ···⊆ Si (m could be infinite). As k increases, the sets Si represent increas- ingly restrictive models of the strategies that i could potentially play. 0 This paper studies the (conditional) level-k hierarchical model ({CLd,..., m 0 m CLd }, {CLp,...,CLp }) defined in Section 4. Given a profile (Rd, Rp), I define the rationality threshold as the smallest hierarchical level r∗ for which ∗ each party i has a strategy si ∈ Si that is a best response to every strategy r∗−1 in CL−i . I then use the rationality threshold as a measure of strategy complexity. Clearly, this logic is not specific to the level-k hierarchical model. A natural alternative would be to use the “undominated” hierarchical model UD defined by the process of iterated undominated strategies. One could then define an alternative UD-rationality threshold,33 and perform analysis similar to those of this paper.

33 ∗ ∗ I.e., the smallest hierarchical level rUD for which each party i has a strategy si ∈ Si ∗ that is a best response to every strategy of her opponent that survives rUD − 1 rounds of iterated undominated strategies.

26 Another option is to use a hierarchical model in which the level of sophis- tication of the parties is fixed, say to level-1, and the parties’ information about each others’ preferences is refined in each iteration of the model. The ∗ corresponding I-rationality threshold could, for example, be the smallest rI ∗ such that each party i has a strategy si ∈ Si that is a best response to every 1 ∗ CL−i strategy for any preference R−i that has the same rI worst jurors as e ∗ R−i. That is, i only needs to know the rI worst jurors according to R−i and the fact that −i is a best responder in order to have a dominant strategy.34 In general, there is no logical relation between the UD-rationality thresh- old and the level-k rationality threshold of a game, henceforth “CL-rationality threshold”. However, some of the results in this paper extend to the UD- rationality threshold. First, it is not hard to see that the UD-rationality threshold of one-shotQ is 2. Second, it can be shown that for every problem, the UD-rationality threshold of one-shotM is at least as large as the CL-rationality threshold of 35 one-shotM . Hence, the results in Sections 5.2 extend to the UD-rationality threshold, and the UD-rationality threshold of one-shotM is larger than 2 for a significant set of problems.Whether the results of Section 6 also extend to the UD-rationality threshold is left as an open question.

Acknowledgments

I am grateful to Benoit Decerf, Hanna Frank, Greg Leo, Clayton Master- man, Claudia Rei, Roberto Serrano, Yves Sprumont, Rodrigo Velez and John Weymark for helpful discussions and comments. I also want to thank Paul Edelman, Eun Jeong Heo, Myrna Wooders as well as the participants to pre- sentations at Universit´eCatholique de Louvain, Vanderbilt University, the 13th meeting of the Society for Social Choice and Welfare and the fifth world congress of the Society for their questions and suggestions. I thank CORE and CEREC for their hospitality during visits. Support from the Kirk Dornbush summer research grant and the National Science Foun- dation grant IIS-1526860 is gratefully acknowledged.

34 ∗ Observe that rI = 1 in one-shotQ for the problem described in Example 4. 35 This follows from the fact that, by the assumption that preferences on jurors are strict, best responses are unique in one-shotM . Thus, every best response is also an undominated strategy, and the set of strategies that survive k rounds of iterated conditional best re- sponse is a subset of the set of strategies that survive k rounds of iterated undominated strategies.

27 Appendix A Omitted proofs

For any (separable) preferences (Rd, Rp) and any integer z, let R(z) be the z-th juror according to (Rd, Rp)’s ranking of jurors. Proof of Proposition 1. Consider any problem. (i). Let i ∈ {d,p} be the party who challenges w jurors first and let w be i’s worst juror. Let s−i be the strategy in which −i challenges only w (provided i did not already challenge w), and −i challenges 0 no other juror otherwise. Also, let s−i be the strategy in which −i does not challenge any jurors. Because i’s preference on jurors is strict, i’s unique best w response ti(s−i) consists in challenging her ci worst jurors among N\{w}. 0 Differently, party i’s unique best response ti(s−i) consists in i challenging her ci worst jurors among N, which includes challenging juror w. Thus, no w 0 strategy of i is a best response to both s−i and s−i, and i does no have a dominant strategy. w (ii). The proof of (ii) is similar to the proof of (i). Lets ˜−i be the strategy in which, in each round, −i challenges as many as possible of i’s worst jurors 0 among the jurors that remain unchallenged. Also, lets ˜−i be the strategy in which −i never challenges any juror. w In any best response ti(˜s−i), i never challenges any of her c−i worst jurors and challenges her ci other worst jurors over the course of the procedure. 0 Differently, in any best response ti(˜s−i), i challenges her ci worst jurors among N over the course of the procedure, which includes challenging some of her w c−i worst jurors. Thus, no strategy of i is a best response to boths ˜−i and 0 s˜−i, and i does no have a dominant strategy. Proof of Proposition 2. Let D be the set of separable preferences on ∆J . In order to derive a contradiction, assume that there exists a challenge procedure Γ satisfying finiteness and minimal challenge such that, for every profile (Rd, Rp) ∈D×D, both parties have a dominant strategy. ∗ Let si (Ri) be one of i’s dominant strategies when i’s preference are Ri. Consider the direct mechanism M Γ : D×D→ ∆J constructed from Γ by setting

∗ ∗ M(Rd, Rp) := Γ(sd(Rd),sp(Rp)), (7)

∗ ∗ ∗ ∗ where Γ(sd(Rd),s (Rp)) is the lottery that obtains when (sd(Rd),s (Rp)) is played in Γ.

28 ∗ Because si (Ri) is a dominant strategy given Ri, for all i ∈{d,p} and all Ri ∈D, ∗ ∗ ∗ ′ ∗ ′ Γ(si (Ri),s−i(R−i)) Ri Γ(si (Ri),s−i(R−i)) for all Ri ∈D and R−i ∈D.

But then, by construction of M, for all i ∈{d,p} and all Ri ∈D, ′ ′ M(Ri, R−i) Ri M(Ri, R−i) for all Ri ∈D and R−i ∈D, and M is strategy-proof on the domain of separable profiles. Notice that the domain of additive profiles Dadd ×Dadd is a subset of the domain of separable profiles.36 By Van der Linden (2015, Example 3), any domain of profiles that contains the domain of additive profiles is a negative leximin domain (see Van der Linden, 2015, Domain Property 3). But then, M contradicts Van der Linden (2015, Corollary 3) which states that, on a negative leximin domain, no mechanisms constructed from a procedure satisfying finiteness and minimal challenge as in (7) is strategy-proof. As stated in the body of the paper, this proof shows that Proposition 2 is true on any negative leximin domain of profiles, which includes the domain of additive profiles. Proof of Proposition 4. Consider any problem. Let i be the first party ∗ to challenge. For −i, the unique best response s−i to any strategy by i is to challenge her c−i worst jurors among the jurors that i did not challenge (uniqueness follows again from preferences on jurors being strict). This strat- 1 egy is, therefore, the unique CL−i. It directly follows from uniqueness that ∗ 1 s−i is a best response to all CLi strategies. 1 ∗ 2 Because there is a unique CL−i strategy s−i, any CLi strategy that best ∗ 1 responds to s−i is a best response to all CL−i strategies. Hence, the rational- ity threshold of one-shotQ is at most 2 for this problem. But by Proposition 1, because one-shotM is an N-struck procedure, the rationality threshold of one-shotQ is at least 2 for every problem. Thus, the rationality threshold of one-shotQ is 2 for this problem. Proof of Proposition 5. The proof generalizes Example 5 and is illus- trated in Figure 7. Consider any problem in which the profile is one-common. Let w ∈ N be the juror that is among the cd worst jurors of d and among the cp worst jurors of p. For any set N ⊆ N containing at least ci jurors, let e ti(N) be the set of the ci worst jurors in N according to Ri. e e Induction basis. For each i ∈{d,p}, both challenging jurors ti(N) and 0 challenging jurors ti(N\{w}) are CLi strategies of one-shotM . Observe that w ∈ ti(N) and w∈ / ti(N\{w}).

36 ′ Preferences Ri are additive if there exists ui : N → R such that, for all L,L ∈ J , ′ ′ L Ri L ⇔ PJ∈J L(J) Pt∈J ui(t) ≥ PJ∈J L (J) Pt∈J ui(t).

29 0 1 2 3 CL−i CLi CL−i CLi strategies strategies strategies strategies t (N) t (t (N)) t (t (t (N))) Set of −i i −i −i i −i ... ∋ w 6∋ w ∋ w jurors t (N\{w}) t (t (N\{w})) t (t (t (N\{w}))) challenged −i i −i −i i −i ... 6∋ w ∋ w 6∋ w

Figure 7: For any one-common profile of preferences, the rationality threshold of one-shotM is ∞.

Induction step. For any k ∈ N, suppose that for each i ∈ {d,p}, k k the set of CLi strategies of one-shotM contains a strategy si in which w is k challenged, and a strategys ˜i in which w is not challenged. Then, for each k k i ∈ {d,p}, we have w∈ / ti(s−i) and w ∈ ti(˜s−i). Hence, for each i ∈ {d,p}, k+1 k+1 k the set of CLi strategies of one-shotM contains a strategy si = ti(˜s−i) k+1 k in which w is challenged, and a strategys ˜i = ti(s−i) in which w is not challenged. k k By the induction step, si ands ˜i are well-defined for each i ∈{d,p} and N N k k for all k ∈ . But observe that, for every k ∈ , w∈ / ti(s−i) and w ∈ ti(˜s−i) k k k k which implies that ti(s−i) =6 ti(˜s−i). Because ti(s−i) and ti(˜s−i) are unique k−1 best responses, no strategy of i is a best response to every CL−i strategy of one-shotM . Hence, the rationality threshold is at least k + 1. Because this is true for all k ∈ N, this concludes the proof. Proof of Proposition 6. Let Γ be any sequential N-struck procedure. Lete ¯ be the depth of the game tree. It is convenient to locate subgames of Γ in terms of the height of their root node, where the height of a node is the length of the longest path from that node to a leaf node. A subgame of Γ the root node of which has height h is denoted γh. Induction basis. Consider any subgame γ1 of Γ. Because preferences on outcomes are strict, (a) the outcome of γ1 is the same under any level-1 profile, and (b) the rationality threshold of γ1 is 1. Induction step. For any h ∈{1,..., e¯}, suppose that for any subgame γh−1 of Γ, (a’) the outcome of γh−1 is the same under any level-(h−1) profile, and (b’) the rationality threshold of γh−1 is no larger than h − 1. Consider any subgame γh of Γ. Let i be party who moves at the root node of Γ. By nestedness, any level-h profile is a level-(h − 1) profile. Hence, by (a’), for any subgame γh−1 directly following γh, the outcome of γh−1 is the same under any level-h profile. Thus, if the outcomes of γh differ under two level-h profiles, it must be that i’s action at the root node of γh lead to

30 two subgames γh−1 and γh−1 with different level-(h−1) outcomes. But then, because preferencesb on outcomese are strict, i’s action at the root node of γh cannot be part of a level-h strategy. Therefore, (a”) the outcome of γh must be the same under any level-h profile. h−1 It follows from (a”) that for each i ∈{d,p}, any best response to a CL−i h−1 strategy is a best response to all CL−i strategies. Hence, (b”) the rationality threshold of γh is no larger than h. Proof of Proposition 7. For any simultaneous N-struck procedure Γ, consider the subgame γ described in the definition of a Γ-one-common profile. γ Let j ∈ N be (one of) the unchallenged juror(s) in γ that is among the ℓd γ worst jurors according to Rd and among the ℓp worst jurors according to Rp. The proof is then a straightforward of the proof of Proposition 5 γ with ci replaced by ℓi and the challenges described in the proof of Proposition 5 occurring in the first round of γ. Proof of Proposition 8. Consider an arbitrary one-common profile 1 2 (Rd, Rp). Let Ni := {Ri(1),...,Ri(b + c−i)} and Ni := {Ri(b + c−i+1),...,Ri(n)}. 2 2 By assumption, there exits a juror w ∈ N such that w ∈ Nd ∩ Np . Observe 1 2 that #Ni = b+c−i and #Ni = ci, where for any set T , #T is the cardinality of T . 2 2 1 1 1 1 Because w ∈ Nd ∩Np , we have w∈ / Nd ∩Np . Hence, if #(Nd ∩Np ) ≥ b+1, 1 1 then #(Nd ∩ Np ) ∪{j} ≥ b + 2 and w is the worst juror for both Rd and 1 1 Rp among (Nd ∩ Np ) ∪{j}, making (Rd, Rp) an alternatingM profile by the sufficient condition identified in the text. 1 1 Hence, in order to derive a contradiction, suppose that #(Nd ∩ Np ) ≤ b. 1 1 In words, for some i ∈ {d,p}, at most b of the jurors in Ni belong to N−i. 1 2 1 Because N−i and N−i partition N, all the jurors in Ni that do not belong 1 2 1 to N−i must belong to N−i. Hence, because there are b + c−i jurors in Ni at 1 1 most b of which belong to N−i, at least c−i of the jurors in Ni must belong to 2 2 1 2 N−i. Recall that w ∈ Ni by assumption. Thus, because Ni and Ni partition 1 1 2 N, w∈ / Ni , and w cannot be one of the c−i jurors in Ni that belong to N−i. 2 But this implies that there are at least c−i + 1 jurors in N−i, a contradiction.

B Computing the proportions in Figure 4

Because the one-common property is preserved under relabeling of the jurors, let us, without loss of generality, fix

b +2c R−i b +2c − 1 R−i ... R−i 2 R−i 1.

Ri Ri Ri Also, for any Ri, let the c worst jurors according to Ri be {j1 , j2 ,...,jc } Ri Ri with j1 Ri ... Ri jc .

31 Figure 4a). The proportion of one-common profiles is then equal to the proportion of Ri that induce a one-common profile given the fix R−i. Clearly, this proportion can be treated as a probability, where the set of outcomes is the set of preferences Ri, and the probability of drawing any particular Ri is 1 (b+2c)! . We are interested in the probability that one of the c worst juror of i is among the c worst jurors of −i, i.e.,

P Ri Ri jc ∈{1,...,c}∪···∪ j1 ∈{1,...,c}, This probability is equal to

P Ri jc ∈{1,...,c}+ P Ri Ri jc−1 ∈{1,...,c} ∩ jc ∈{/ 1,...,c}+ . (8) . P Ri Ri Ri j1 ∈{1,...,c} ∩ jc ∈{/ 1,...,c}∩···∩ j2 ∈{/ 1,...,c}. The elements of the sum in (8) can be computed recursively. In step r = 1, P Ri we initiate the recursion by computing jc ∈{1,...,c}. To do so, observe that P Ri P Ri Ri jc ∈{1,...,c} = jc =1 ∪···∪ jc = c. Ri Ri Hence, because jc =1,...,jc = c are mutually exclusive, we have c c P jRi ∈{1,...,c} = P jRi = i)= . c  X c b +2c i=1 Then, in each step r ∈{2,...,c}, Bayes’ rule implies that

P Ri Ri Ri jr ∈{1,...,c} ∩ jc ∈{/ 1,...,c}∩···∩ jr−1 ∈{/ 1,...,c} P Ri Ri Ri = jr ∈{1,...,c} jc ∈{/ 1,...,c}∩···∩ jr−1 ∈{/ 1,...,c} (9)

P Ri Ri jc ∈{/ 1,...,c}∩···∩ jr−1 ∈{/ 1,...,c}. For all r ∈{2,...,c}, the first term on the right-hand side of (9)

P Ri Ri Ri jr ∈{1,...,c} jc ∈{/ 1,...,c}∩···∩ jr−1 ∈{/ 1,...,c} c (10) = . b +2c − (r − 1) The second term on the right-hand side of (9)

P Ri Ri jc ∈{/ 1,...,c}∩···∩ jr−1 ∈{/ 1,...,c} P Ri Ri =1 − jc ∈{1,...,c}∪···∪ jr−1 ∈{1,...,c},

32 and

P Ri Ri jc ∈{1,...,c}∪···∪ jr−1 ∈{1,...,c} r−1 (11) P Ri Ri Ri = X ji ∈{1,...,c} ∩ jc ∈{/ 1,...,c}∩···∩ ji ∈{/ 1,...,c}, i=1 where the terms inside the sum on the right-hand side of (11) have been obtained in steps {1,...,r − 1}. Figure 4b). Again, computing the proportion in Figure 4b) is equivalent to computing the probability that (Ri, R−i) is one-common when Ri is drawn uniformly among the preferences that make (Ri, R−i) almost juror inverse. That is, Ri is drawn uniformly at random among the preferences that differs ∗ ∗ ∗ ¯Ri from Ri with 1 Ri ... Ri b +2c by the ranking of a single juror, say j . Ri For (Ri, R−i) to be one-common, ¯j must be one of the c worst jurors in R−i and in Ri. That is, the relevant probability is

P ¯Ri ¯Ri Ri Ri j ∈{1,...,c} ∩ j ∈{j1 ,...,jc }. Applying Bayes’ rule, this is equal to

P ¯Ri Ri Ri ¯Ri ¯Ri j ∈{j1 ,...,jc } j ∈{1,...,c}P j ∈{1,...,c}.

∗ Because Ri is drawn at random among the preferences that differ from Ri ¯Ri c by the ranking of a single juror, P (j ∈{1,...,c})= b+2c . Also, c P ¯Ri Ri Ri ¯Ri (j ∈{j1 ,...,jc } j ∈{1,...,c})= , b +2c − 1 where b+2c−1 is the number of (equally likely) positions in which ¯j could be ranked,37 and c is the number of these positions that induce a one-common ranking given ¯jRi ∈{1,...,c}). 38 Hence,

c2 P ¯jRi ∈{1,...,c} ∩ ¯jRi ∈{jRi ,...,jRi } = . 1 c  (b +2c)(b +2c − 1) 37 ∗ ¯ By definition of an almost juror inverse profile, Ri 6= Ri , Because j is the only juror ∗ ¯ the ranking of which changes between Ri and Ri , this implies that j must be ranked ∗ differently in Ri and Ri . 38 Ri ∗ Ri Indeed, observe that given ¯j ∈{1,...,c}, because (R , R−i) is juror inverse, ¯j ∈/ ∗ ∗ i Ri Ri {j1 ,...,jc }.

33 References

Alpern, S., Gal, S., 2009. Analysis and design of selection committees: A game theoretic secretary problem. International Journal of Game Theory 38, 377–394. Alpern, S., Gal, S., Solan, E., 2010. A sequential selection game with vetoes. Games and Economic Behavior 68, 1–14. Arribillaga, R.P., Mass´o, J., 2015. Comparing generalized median voter schemes according to their manipulability. Theoretical 11, 547– 586. Bermant, G., 1982. Jury Selection Procedures in United States District Courts. volume 25. Federal Judicial Center, Washington D.C. Bermant, G., Shapard, J., 1981. The voir dire examination, juror challenges, and adversary advocacy, in: Sales, B.D. (Ed.), The Trial Process. Springer, Berlin, pp. 69–114. Bloom, D., Cavanagh, C.L., 1986. An analysis of the selection of arbitrators. American Economic Review 76, 408–422. Brams, S.J., Davis, M.D., 1978. Optimal jury selection: A game-theoretic model for the exercise of peremptory challenges. Operations Research 26, 966–991. Broderick, R.J., 1992. Why the peremptory challenge should be abolished. Temple Law Review 65, 369. Caditz, D.M., 2015. Selection under veto with limited foresight. Mimeo . Cohen, N.P., Cohen, D.R., 2003. Jury reform in Tennessee. University of Memphis Law Review 34, 1–71. Crawford, V.P., Costa-Gomes, M.A., Iriberri, N., 2013. Structural Models of Nonequilibrium Strategic Thinking: Theory, Evidence, and Applications. Journal of Economic Literature 51, 5–62. Daly, M., 2016. Foster v. Chatman: Clarifying the Batson Test for discrimi- natory peremptory strikes. Duke Journal of Constitutional Law and Public Policy Sidebar 11, 148–162. de Clippel, G., Eliaz, K., Knight, B., 2014. On the selection of arbitrators. American Economic Review 104, 3434–3458. DeGroot, M.H., Kadane, J.B., 1980. Optimal challenges for selection. Oper- ations Research 28, 952–968. Flanagan, F.X., 2015. Peremptory challenges and jury selection. Journal of Law and Economics 58, 385–416. Fudenberg, D., Tirole, J., 1991. Game Theory. MIT Press, Cambridge MA. Ho, T.H., Camerer, C., Weigelt, K., 1998. Iterated dominance and iterated best response in experimental “p-beauty contests”. American Economic Review , 947–969.

34 Horwitz, B.L., 1992. Extinction of the peremptory challenge: what will the jury system lose by its demise. U. Cin. L. Rev. 61, 1391–1440. Kadane, J.B., Stone, C.A., Wallstrom, G., 1999. The donation paradox for peremptory challenges. Theory and Decision 47, 139–155. Keene, D.L., 2009. Fairness, justice and true understanding: The benefits of peremptory strikes. The Jury Expert 2, 24–25. LaFave, W., Israel, J., King, N., Kerr, O., 2009. Criminal Procedure. West Academic Publishing, St. Paul, MN. 5th edition. Marder, N.S., 2012. Batson Revisited - Batson Symposium. SSRN Scholarly Paper ID 2165561. Research Network. Rochester, NY. Mas-Colell, A., Green, J.R., Whinston, M.D., 1995. Microeconomic Theory. Oxford University Press, Oxford. Pathak, P.A., S¨onmez, T., 2013. School admissions reform in Chicago and England : Comparing mechanisms by their vulnerability to manipulation. American Economic Review 103, 80–106. Roth, A., Kadane, J.B., Degroot, M.H., 1977. Optimal peremptory chal- lenges in trials by juries: A bilateral sequential process. Operations Re- search 25, 901–919. Shapard, J., Johnson, M., 1994. Memorandom on a Survey of Active Jurdges Regarding their Voir Dire Practices. Technical Report. Federal Judicial Center, Research Division. Washington D.C. Smith, A., 2014. A call to abolish peremptory challenges by prosecutors. Georgetown Journal of Legal Ethics 27, 1163–1186. Van der Linden, M., 2015. Impossibilities for strategy-proof veto in selection problems. Working paper, Department of Economics, Vanderbilt Univer- sity . Cases Batson v. Kentucky, 476 U.S. 79 (1986). Swain v. Alabama, 380 U.S. 202 (1965).

35