“Soft” Affirmative Action and Minority Recruitment∗

Daniel Fershtman† Alessandro Pavan‡

July 2020

Abstract

We study search, , and selection of candidates of unknown quality for a position. We examine the effects of “soft” affirmative action policies increasing the relative percentage of minority candidates in the candidate pool. We show that, while meant to encourage minority hiring, such policies may backfire if the evaluation of minority candidates is noisier than that of non-minorities. This may occur even if minorities are at least as qualified and as valuable as non-minorities. The results provide a possible explanation for why certain soft affirmative action policies have proved counterproductive, even in the absence of (implicit) .

Keywords: affirmative action, recruitment, sequential evaluations, learning from en- dogenous consideration sets

∗We thank Mariacristina De Nardi, Joseph Ferrie, Matthew Notowidigdo, Wojciech Olszewski, Nicola Persico, Analia Schlosser, Marciano Siniscalchi, and anonymous referees for useful comments and sugges- tions. The usual disclaimer applies. †Eitan Berglas School of Economics, Tel-Aviv University. Email: [email protected] ‡Department of Economics, Northwestern University, and CEPR. Email: [email protected]

1 1 Introduction

In 2003, the National Football League established the “Rooney Rule,” a policy requiring teams to interview minority candidates for head vacancies.1 This policy, versions of which have been applied across various industries2, is an example of “soft” affirmative action (SAA), a term used to refer to policies designed to change the composition of the candidate pool, rather than the criteria used during the hiring process. Contrary to “hard” affirmative action requiring direct consideration of minority status as a part of the hiring decision (e.g., quotas), such policies involve taking steps to increase the share of minority candidates considered for a position, but subsequently treating candidates impartially (Schuck, 2002).3 This paper studies the effects of SAA on minority recruitment. Indeed, while some SAA policies have proved successful (Heilman, 1980), others, such as the Rooney Rule, have been deemed ineffective, or even counterproductive. One possible explanation is the presence of implicit bias. We show that, even in the absence of bias, SAA policies, while meant to increase the likelihood of hiring minority candidates, may in fact lead to the opposite outcome. This may occur even if qualified minority candidates are at least as valuable as qualified non-minority ones, and minority candidates are ex-ante at least as likely to be qualified. We consider SAA policies tilting the search technology in favor of minorities, while maintaining the discretion over when to expand the candidate pool based on the sequen- tial evaluation of candidates already under consideration. Our results are driven by the assumption that the evaluation of minority candidates is noisier than that of non-minority ones. This assumption is a central feature in the literature on statistical discrimination4 building on Phelps’ (1972) seminal contribution (e.g., Aigner and Cain, 1977, Borjas and Goldberg, 1979, Lundberg and Straz, 1983, Cornell and Welch, 1996, and more recently Chambers and Echenique, 2018, and Bardhi et al., 2020).5 The literature has offered various explanations for such differences in evaluation, including: (a) the more limited

1See DuBois (2015) for an empirical assessment of the Rooney Rule’s impact. 2Facebook recently adopted a similar policy (https://money.cnn.com/2018/05/31/technology/facebook- board-/index.html). In 2014, Senate Resolution (511) was introduced (though ultimately not enacted) to encourage companies to voluntarily establish policies to identify and interview qualified minority candidates for managerial openings. 3SAA policies are also common in academia (see, e.g., Columbia University’s “Guide to Best Practices in Faculty Search and Hiring”). 4In contrast to taste-based theories of discrimination (Becker, 1957), statistical discrimination theories explain group inequality without assuming prejudice or preference bias. 5An important exception is Arrow’s (1973) theory of statistical discrimination, which relies on coordi- nation failures rather than differences in the evaluation of particular groups. See Fang and Moro (2011) for an overview of the statistical discrimination literature.

2 experience that recruiters have in evaluating minorities6; (b) differences in “background” between minority candidates and recruiters, which may impede the assessment of intan- gible qualities which are informal but nevertheless potentially relevant for the position (Arrow, 1972, Cornell and Welch, 1996); and (c) differences in “language” (broadly de- fined) between recruiters and minority candidates.7 The literature on affirmative action has studied its potential unintended effects with regard to incentives for human capital investment, productivity stereotyping, differ- entials, and occupational social status (see, e.g., Fryer and Loury, 2005).8 In an influential article, Coate and Loury (1993) show that a “patronizing equilibrium” may arise in which minorities’ incentives to invest in skills may be reduced when affirmative action is in place. This paper complements this literature by illustrating the possible negative effects of SAA directly on the dimension it is intended for – the enhancement of minority recruitment. The mechanism we identify by which SAA policies may have an adverse effect on minority recruitment is the following. Such policies, by tilting the search technology in favor of minorities, also alter the desirability of expanding the candidate pool vis-a-vis a more thorough examination of candidates already under consideration. Because minority candidates are assumed to be more difficult to evaluate, they may suffer the most from the change in the relative attractiveness of search. Importantly, this may happen both when SAA increases the relative attractiveness of search, as when it decreases it. Our results thus suggest that, without taking steps to improve the evaluation of mi- nority candidates, attempts to shift the composition of the candidate pool in favor of minorities may simply amount to “checking a box,” or even prove detrimental. Such steps may include creating a sufficiently diverse recruiting committee and ensuring evaluation is based on objective criteria with predetermined weights. In our model, if differences in the evaluation of candidates were entirely eliminated – as in the case of “blind” auditions

6Aigner and Cain (1977) suggest that majority-group candidates face “a more homogeneous set of environmental determinants of quality,” resulting in a lower variance in qualification, and hence less noisy evaluation. It may be easier to assess a candidate’s history if it followed a well-known path, or to interpret a candidate’s references when they come from familiar letter writers. Certain tests used in recruitment were initially designed with a specific group in mind; e.g., it has long been argued that the SAT is more informative about the abilities of White students than African-American students (Linn, 1973, Fleming and Garcia, 1998). Further, recruiters often (implicitly) use recent hires with similar backgrounds as benchmarks for evaluating new candidates. If most recent hires were majority-group candidates, such benchmarking may be less effective for evaluating minority candidates. 7Lang (1986) proposes that noisier evaluation of minorities results from differences in “language” im- peding communication. Recent research also suggests women tend to use language quite different from men in interviews (Leaper and Robnett, 2011). 8The literature on the economics of affirmative action includes Welch (1976), Lundberg and Straz (1983), Chung (2000), Moro and Norman (2003), and Fryer and Loury (2013), and is too broad to be succinctly discussed here. For evidence on the effectiveness of affirmative action, see Holzer and Neumark (2000).

3 – SAA would be guaranteed to increase the probability of hiring a minority candidate. If differences in candidates’ evaluation cannot be sufficiently reduced, “hard” affirmative action may be necessary to foster minority recruitment.

2 Model

The recruitment problem described below applies to a variety of environments. For con- creteness, we focus on a university committee recruiting faculty, or students, for an avail- able position. The recruitment of a candidate is subject to university approval standards. There are two categories of candidates, A and B (race/gender/field of study). Candidates’ j j qualifications θ ∈ {L, H} are unknown and independent ex-ante. Let p0 = Pr(θ = H) denote the prior that a category-j candidate is qualified. The value of hiring a qualified category-j candidate is vj > 0, whereas the value of hiring any non-qualified candidate is zero. That is, the school wishes to hire only qualified candidates, but candidates of one category may be preferred. Candidates can be recruited only if they are in the school’s candidate pool. We assume that at the outset there are only two candidates in the pool, one from each category.9 At each period t = 0, 1, ..., the committee either chooses a candidate to evaluate among those in the pool, or decides to expand the pool by searching for additional candidates. The evaluation of a candidate generates new information, formally captured by a signal about the candidate’s qualification. The evaluation of a qualified category-j candidate yields a j j signal realization s = 1 with probability qH ≡ Pr(s = 1|θ = H), whereas the evaluation of a non-qualified category-j candidate yields a signal realization s = 0 with probability j j j j qL ≡ Pr(s = 0|θ = L), with qH ≥ 1 − qL. Conditional on a candidate’s type, the signals are iid draws from the above Bernoulli distribution.

Given a history σ = (s1, s2, ...) of signal realizations, the posterior probability that a category-j candidate is qualified will be denoted by pj(σ). The null history is denoted by σ = ∅. Each time the committee searches for new candidates, it identifies a category-j candidate with probability µj, and no candidate with probability 1 − µA − µB. Beyond the opportunity cost of not being able to evaluate one of the candidates in the pool while searching, search entails no direct cost. Likewise, the only cost of evaluating a candidate in the pool is the cost of postponing search and the evaluation of other candidates.10

9This assumption simplifies some of the derivations, but is not essential for the results. 10Most of the assumptions introduced above can be relaxed. For example, the results extend to more general search technologies yielding a random number of candidates from each category. Likewise, the assumption that the cost of each action is the time cost of postponing other actions can easily be amended by introducing additional costs. Finally, that signal realizations take only two values is also not essential.

4 The entire recruiting process ends when the committee finds a candidate “above the bar,” i.e., whose (category-adjusted) expected quality is large enough. Formally, for any j = A, B, there exists a threshold P¯j ∈ (0, 1] such that each category-j candidate is given the slot at history σ if and only if pj(σ) ≥ P¯j. The threshold P¯j can be thought of as reflecting standards imposed by the university, which may, but need not, coincide with j ¯j the preferences of the recruiting committee. To avoid trivialities, assume p0 < P , so that each candidate must be evaluated at least once to be recruited.11 A recruitment rule specifies in each period either a candidate to evaluate among those in the pool, or search for a new candidate. A recruitment rule is optimal if it maximizes the T  expected discounted payoff E δ v˜T over all feasible recruitment rules, where δ ∈ (0, 1) is the discount factor, T the (stochastic) time at which a selection is made, andv ˜T the value of the selected candidate. Maintained assumptions. Throughout, we assume the following:

(i) The evaluation of A-candidates is Blackwell less informative than that of B-candidates: A B qk ≤ qk , k = H,L, with at least one inequality strict; (ii) A-candidates are more valuable to the : vA ≥ vB;

A B (iii) A-candidates are more likely to be qualified: p0 ≥ p0 ;

(iv) A-candidates have a lower acceptance threshold: P¯A ≥ P¯B.

Condition (i) means A is the minority category. Conditions (ii)-(iv) guarantee that the potentially negative effects of SAA on the recruitment of minority candidates are not driven by any bias in the evaluation process favoring non-minority candidates: A-candidates are strictly better than B-candidates in all dimensions but the effectiveness of their evaluation.

3 Myopic Recruitment Rule

We start by assuming the recruitment process is conducted under a myopic rule. Given a history of signal realizations σ, denote by λj(σ) the probability that a category-j can- didate with history σ is recruited after a single additional evaluation (λj(σ) = Pr(s : pj(σ, s) ≥ P¯j|j, σ)). The myopic value the committee attaches to an additional eval- uation of a category-j candidate with history σ is equal to uj(σ) = λj(σ)vj. Like- wise, the myopic value it attaches to an additional search is equal to the expected value

What matters is that signals about minority candidates are noisier, as shown below. 11That the selection of a candidate is governed by a threshold rule guarantees that the optimal evaluation and search policy is indexable. Allowing for arbitrary selection rules would (in most cases) preclude a tractable representation of the optimal policy.

5 uS = δ µAuA(∅) + µBuB(∅) of bringing a “blank-slate” candidate to the pool. Under a myopic rule, in each period, the committee selects the alternative (evaluation of a candi- date from the pool, or search) with the highest myopic value. To avoid trivialities, assume uA(∅), uB(∅) > uS, so that, once added to the pool, each candidate has a greater myopic value than search. A B A B A B A A B B ¯ Given v ≡ (v , v ), p0 ≡ (p0 , p0 ), µ ≡ (µ , µ ), q ≡ (qL , qH , qL , qH ), and P ≡ A B j (P¯ , P¯ ), denote by γ (v, p0, µ, q, P¯) the ex-ante probability a category-j candidate is selected under a myopic rule. Our first result pertains to SAA policies promoting the expansion of the candidate pool. Formally, such policies can be captured by an increase in µA and µB (with µA increasing more than µB). To ease the exposition, and without any important implication for the results, we assume that, prior to the introduction of the policy, µ = (0, 0). To further simplify the derivations, suppose that

A A B B qH ∈ (0, 1) and qL = qH = qL = 1. (1)

That is, the evaluation of B-candidates is perfectly revealing, whereas the evaluation of A- B B candidates takes the typical“no-news-bad-news”form. Under this technology, λ (∅) = p0 , A A A A A A λ (∅) = qH p0 and λ (0) = qH p (0), with

A A A (1 − qH )p0 p (0) = A A A . (1 − qH )p0 + 1 − p0

Status-quo (µ = (0, 0)). Assume the A-candidate is evaluated first:

A A A B B p0 qH v > p0 v . (2) | {z } | {z } uA(∅) uB (∅)

A A B B A  Then γ (v, p0, (0, 0), q, P ) = p0 1 − p0 + p0 qH . That is, the minority candidate is se- lected if and only if she is qualified and either (a) the non-minority candidate is unqualified, or (b) she is qualified and the initial evaluation of the minority candidate yields a positive result. Search under SAA ( µ = (µA, µB) with µA ≥ µB = 1 − µA). Further assume

B B A A A A B B B p0 v > δ µ p0 qH v + µ p0 v . (3) | {z } uS

6 and

A A A A A A A A B B B (1 − qH )p0 qH v δ µ p0 qH v + µ p0 v > A A A . (4) (1 − qH )p0 + 1 − p0 | {z } uA(0)

Jointly, (1)-(4) imply that evaluation of any “blank-slate” candidate is preferred to search, whereas search is preferred to a second evaluation of an A-candidate whose initial evalu- ation yielded a negative result. Let

A A A A A B B A γS = µ λ (∅) + (1 − λ (∅))γS + µ (1 − p0 )γS denote the probability an A-candidate is selected after search is carried-out.12 Rearranging and using µA + µB = 1, we have that

µAλA(∅) γA = . S A A B B µ λ (∅) + µ p0

Hence, the ex-ante probability of selecting a minority candidate under SAA is

 µAλA(∅)  γA(v, p , µ, q, P ) = λA(∅) + (1 − λA(∅))(1 − pB) . 0 0 A A B B µ λ (∅) + µ p0

A B A B A Comparison. Under (1)-(4), for any µ = (µ , µ ) with µ +µ = 1, γ (v, p0, µ, q, P ) < A γ (v, p0, (0, 0), q, P ) if and only if

qA µBpB H < 0 . (5) A A A 1 − qH µ (1 − p0 )

The left-hand side of (5) is a measure of the effectiveness of the evaluation of minority candidates, whereas the right-hand side is the ratio between the probability that search brings a qualified non-minority candidate and an unqualified minority one.

Proposition 1. Suppose Conditions (1)-(5) hold.13 SAA policies promoting the expansion of the candidate pool reduce the ex-ante probability that minority candidates are selected: A A A B A B A γ (v, p0, µ, q, P¯) < γ (v, p0, (0, 0), q, P¯), for any µ = (µ , µ ) with µ ≥ µ = 1 − µ . 12Note that, once search is launched, candidates already in the pool are never evaluated again. 13These conditions, together with the maintained assumptions (i)-(iv), hold for a non-empty open set of A B A B A parameters. For example, they hold when δ = 0.9, p0 = 0.8, p0 = 0.7, v = 1.5, v = 1, qH = 0.6, and µA = 2/3 = 1 − µB .

7 Proof. The result follows from the arguments preceding the proposition.

When µA > µB , under such SAA policies, the expected number of minority candidates considered for a position, and their share relative to non-minority candidates, increase. However, such policies also reduce the length of the evaluation of candidates whose early evaluation yields negative results. As minority candidates are more difficult to evaluate, they suffer more from the truncation in the evaluation process, which may lead to a reduction in the ex-ante probability of selecting a minority candidate. Note that the probability of selecting A-candidates is reduced not only relative to the probability of selecting B-candidates, but overall; that is, promoting the expansion of the candidate pool may reduce the ex-ante probability of recruiting minority candidates, despite increasing the overall probability of filling the slot.

Next, consider SAA policies tilting the search technology in favor of minorities at the expense of non-minorities (formally captured by an increase of ζ > 0 in µA with an equal reduction in µB). Let

A B B A qL = qL = 1 and 1 > qH > qH > 0. (6)

Under (6), the evaluation of either type of candidate takes the familiar“no-news-bad-news” j j j j j j j j j j j form. Consequently, λ (∅) = p0qH , p (0) = (1 − qH )p0/(1 − qH p0), and λ (0) = qH p (0). Status-quo (µ = (µA, µB) with µA = 1 − µB < 1). Suppose that

uB (∅) uA(∅) uS z }| { z }| { z }| { λB(∅)vB > λA(∅)vA > δ µAλA(∅)vA + µBλB(∅)vB (7) A A A B B B (1 − qH )λ (∅)v (1 − qH )λ (∅)v > A A , B B . (8) 1 − qH p0 1 − qH p0 | {z } | {z } uA(0) uB (0)

Conditions (6)-(8) imply the evaluation of any blank-slate candidate is preferred to search, whereas the latter is preferred to the evaluation of any candidate whose first evalua- tion yields a negative result. The probability of selecting an A-candidate after search is launched is then equal to

A A A A A  B B  A γS (0) = µ λ (∅) + (1 − λ (∅))γS (0) + µ 1 − λ (∅) γS (0),

8 or

µAλA(∅) γA(0) = . S µAλA(∅) + µBλB(∅)

The ex-ante probability of selecting an A-candidate under the status-quo search protocol is then equal to

  µAλA(∅)  γA v, p , (µA, µB), q, P  = (1 − λB(∅)) λA(∅) + (1 − λA(∅)) . 0 µAλA(∅) + µBλB(∅)

Search under SAA (µ = (µA + ζ, µB − ζ), with ζ > 0). Assume that

B B B A A A (1 − qH )λ (∅)v A A A B B B (1 − qH )λ (∅)v B B > δ (µ + ζ)λ (∅)v + (µ − ζ)λ (∅)v > A A . (9) 1 − qH p0 1 − qH p0

Condition (9) implies that, after a single negative evaluation, search is preferred to a second evaluation if the candidate is an A-candidate, whereas the opposite is true for A B-candidates. Denote by γS (ζ) the probability of selecting an A-candidate after search is launched, under the search protocol promoted by SAA. Conditions (6)-(9) imply that

A A A A A  B B B A γS (ζ) ≤ (µ + ζ) λ (∅) + (1 − λ (∅))γS (ζ) + (µ − ζ)(1 − λ (∅))(1 − λ (0))γS (ζ), where the inequality follows from the fact that a B-candidate may be evaluated more than twice before search is launched. Rewriting this inequality, we have that

(µA + ζ)λA(∅) γA(ζ) ≤ . S (µA + ζ)λA(∅) + (µB − ζ)(λB(∅) + λB(0) (1 − λB(∅)))

Therefore, the ex-ante probability of selecting an A-candidate under the search protocol promoted by SAA satisfies

A A B  γ v, p0, (µ + ζ, µ − ζ), q, P B A A B A  ≤ (1 − λ (∅)) λ (∅) + (1 − λ (∅))(1 − λ (0))γS (ζ)  (1 − λB(0))(1 − λA(∅))(µA + ζ)  = (1 − λB(∅))λA(∅) 1 + . (µA + ζ)λA(∅) + (µB − ζ)(λB(∅) + λB(0) (1 − λB(∅)))

A A B  Comparison. Given the arguments above, we have that γ v, p0, (µ + ζ, µ − ζ), q, P <

9 A A B  γ v, p0, (µ , µ ), q, P if

µA (1 − λB(0))(µA + ζ) > . (10) µAλA(∅) + µBλB(∅) (µA + ζ)λA(∅) + (µB − ζ)(λB(∅) + λB(0) (1 − λB(∅)))

Proposition 2. Suppose Conditions (6)-(10) hold.14 SAA policies tilting the search tech- nology in favor of minorities reduce the ex-ante probability of selecting a minority candi- A A B  A A B  date: γ v, p0, (µ + ζ, µ − ζ), q, P < γ v, p0, (µ , µ ), q, P , for any ζ > 0.

Proof. The result follows from the arguments preceding the proposition.

Under such SAA policies, A-candidates are more likely to be included in the candidate pool at the expense of B-candidates. However, because A-candidates are more difficult to evaluate, such policies, contrary to those examined above, reduce the overall attractiveness of search relative to a lengthier evaluation of existing candidates. The committee may therefore substitute search primarily with the evaluation of B-candidates, as they are easier to evaluate. When strong enough, the latter effect may trigger a reduction in the ex-ante probability that minority candidates are selected.

4 Optimal (forward-looking) Rule

We now show that the effects identified above are not a mere consequence of the committee following a myopic rule.

4.1 Preliminaries

We first establish that the optimal rule is an index rule. That is, each candidate is assigned an index V j that depends only on the candidate’s category, j, and the candidate’s history of signal realizations, σ. The indices take the form

h φj  τ j −φj  j i E δ 1 − δ 1{φj <τ j }v˜ |j, σ V j(σ) ≡ sup , (11)  τ j  τ j >0 1 − E δ |j, σ where τ j is a (stochastic) stopping-time, φj is the (stochastic) time at which pj(σ) exceeds the acceptance threshold P¯j for the first time, andv ˜j ∈ {0, vj} denotes the candidate’s

14Conditions (6)-(10), along with the maintained assumptions (i)-(iv), hold for a non-empty open set of A B A B A parameters. For example, they hold when δ = 0.9, p0 = 0.8, p0 = 0.64, v = 1.5, v = 1, qH = 0.2, B A B qH = 0.75, µ = 0.9 = 1 − µ , and ζ = 0.04.

10 value to the organization.15 The indices defined by (11) are forward-looking, and have an intuitive interpretation. They represent the maximal average expected discounted payoff (averaging over discounted time) from the sequential evaluation of the candidates, ignoring the existence of the other candidates. Next, let

P j h φj  τ j −φj  j i δ j∈{A,B} µ E δ 1 − δ 1{φj <τ j }v˜ |j, ∅ V S ≡ sup (12) P j  τ j  τ A,τ B >0 1 − j∈{A,B} µ E δ |j, ∅ be the index of search, where τ j, φj andv ˜j are as defined above. This index represents the maximal average expected discounted payoff from the sequential evaluation of the first candidate that arrives as the result of search. Under the optimal rule described below, the value the committee assigns to the ad- ditional evaluation of any candidate in the pool takes into account the effects of this evaluation on the desirability of all further evaluations. Likewise, the value the committee assigns to search accounts for the fact that the new candidates may be evaluated multiple times. Note that V S is directly linked to the indices of the candidates expected to be identified by search. Specifically, the optimal stopping-time in (12) is the first time at which the index of any candidate that arrives as a result of search drops below V S.

Lemma 1. The optimal recruitment rule consists in evaluating in each period one of the candidates in the pool with the highest index, as defined in (11), provided this index is greater than V S, and searching for new candidates otherwise.

Proof. Consider the following fictitious environment in which, contrary to the problem under consideration, the recruitment of a candidate is reversible. Let the flow payoff from the evaluation of each candidate for whom pj(σ) < P¯j be equal to zero (recall that these are candidates who, given the University’s rules, are not acceptable yet for the position). Let the flow payoff from hiring a candidate for whom pj(σ) ≥ P¯j be equal to pj(σφj )vj(1 − δ), where φj is the first time at which the candidate is found acceptable, and where σφj is the history of signal realizations at that time. In accordance with the primitive environment, assume that, after the φj-th evaluation, no further information is gathered about the candidate’s qualification. Contrary to the primitive environment, however, assume that, at any point in time, the committee can reverse its decision. That is, once an acceptable

15The stopping decision is measurable with respect to the filtration induced by the process governing the arrival of information about the candidate’s qualification. Because no new information arrives after j φj , any stopping occurring after φj must be based on the information σφ available in period φj .

11 candidate is found (i.e., for whom pj(σ) ≥ P¯j), the committee is not obliged to hire that candidate indefinitely, and the temporary recruitment of such a candidate (which yields a flow payoff pj(σφj )vj(1 − δ)) does not preclude subsequent evaluation and recruitment of other candidates. The committee’s problem in this fictitious environment is a multi-armed bandit prob- lem with an endogenous set of arms, satisfying all the conditions in Fershtman and Pavan (2020) (see also the literature on branching bandits, e.g., Weiss, 1988, and Weber, 1992). Fershtman and Pavan (2020) show that the optimal rule for such problems is an index rule, with a special index for search. The latter index is the maximal expected average discounted payoff from searching for new arms and pulling any of the new arms that arrive as the result of search, where the maximization is over both a stopping-time and a rule that chooses among search and the pull of the new arms that arrive as the result of search. In the fictitious environment described above, the index of search is equal to

π,τ hPτ−1 s πi δ δ rs S E s=1 V = sup π,τ τ , (13) τ,π 1 − E [δ ] where τ is a stopping-time, π is a rule governing the alternation between the evaluation π of one of the new candidates brought in by search and further search, and rs is the flow payoff under the rule π. The latter flow payoff is equal to zero when π selects search or one of the candidates that are not acceptable yet (i.e., for whom pj(σ) < P¯j) and is equal to pj(σφj )vj(1−δ) when π selects an acceptable candidate (i.e., one for whom pj(σ) ≥ P¯j).16 Importantly, note that the index of search is independent of any information regarding the candidates already in the pool. Fershtman and Pavan (2020) also show that the optimal rule π in (13) is in fact the same index rule characterizing the solution to the entire decision problem, and that the optimal stopping-time τ in (13) is the first time at which the search index and the index of all newly arrived arms fall below the index of search when the latter was launched. In the context of the fictitious environment described above, because the search technology is stationary, τ coincides with the first time at which the index of the first candidate identified through search falls below the index of search. Given these features, (13) can be rewritten as

P j hPτ j −1 s j φj j i δ j∈{A,B} µ E s=φj δ p (σ )v (1 − δ)|j, ∅ V S = sup , P j  τ j  τ A,τ B >0 1 − j∈{A,B} µ E δ |j, ∅

16Recall that, after the φj -th evaluation, there is no further information that can be gathered about the candidate.

12 which coincides with the formula in (12). Fershtman and Pavan (2020) also show that, in multi-armed bandit problems with an endogenous set of arms, the index of any arm other than search coincides with the standard Gittins index (Gittins and Jones, 1974). In the fictitious problem, the arms are candidates and the Gittins index of each candidate is

hPτ j −1 s j j φj i j δ v p (σ )(1 − δ)|j, σ j E s=φ V (σ) = supτ j >0  j  , 1 − E δτ |j, σ which coincides with the formula in (11). Hence, the optimal rule in the fictitious envi- ronment consists in selecting in each period the candidate in the pool for whom V j(σ) is the highest, provided that such index is greater than V S, and in expanding the pool by searching for new candidates if V S is greater than the index of any candidate in the pool. Now consider the primitive problem where recruitment is irreversible. In general, index policies are not optimal in the presence of irreversible decisions. However, this is not the case for the specific problem under consideration. To see this, observe that in the fictitious environment, once an arm is pulled for which pj(σ) ≥ P¯j, its index V j(σ) = pj(σφj )vj(1 − δ) remains the same at all subsequent periods. Because the indices of the other arms and of search also remain the same, under the optimal rule in the fictitious environment, once the evaluation of a candidate yields a posterior pj(σ) exceeding the acceptance threshold P¯j, that candidate is selected in each subsequent period. Because the fictitious environment is a relaxation of the primitive one, it follows that the optimal rule in the primitive environment coincides with the one in the fictitious environment.

4.2 Soft affirmative action

We now show that the possible negative effects of SAA policies on the recruitment of minorities may also arise under optimal (forward-looking) policies. j Consider first SAA policies akin to those in Proposition1. Let Γ (v, p0, µ, q, P ) denote the ex-ante probability of selecting a category-j candidate under the optimal rule. Consider the same “no-news-bad-news” evaluation technology as in (1). Using the results in the previous subsection, the index of a blank-slate j-candidate is equal to

λj(∅)vj V j(∅) = . (14) 1 − δ(1 − λj(∅))

This is because, under the assumed evaluation technology, the optimal stopping-time τ j in the definition of the index in (11) specifies stopping immediately (τ j = 1) following a negative signal s = 0, and never stopping (τ j = ∞) following a positive signal s = 1,

13 B B j = A, B. Also recall that, under the assumed evaluation technology, λ (∅) = p0 and A A A B A B j λ (∅) = p0 qH . Hence, V (0) = 0 and V (1) = V (1) = v . It is also easy to see that, after a single negative evaluation, the index of any A-candidate is equal to

λA(0)vA V A(0) = . (15) 1 − δ(1 − λA(0))

Status-quo (µ = (0, 0)). In this case, the search index equals V S = 0, as search does not bring candidates. Assume the B-candidate is evaluated first:

B B A A A p0 v p0 qH v B ≥ A A . (16) 1 − δ(1 − p0 ) 1 − δ(1 − p0 qH ) | {z } | {z } V B (∅) V A(∅)

A B A It follows that, under Conditions (1) and (16), Γ (v, p0, (0, 0), q, P ) = (1 − p0 )p0 . Search under SSA ( µ = (µA, µB) with µA ≥ µB = 1 − µA). From Lemma1, the optimal stopping-time τ in the definition of the search index (12) is the first time at which the index of the candidate identified through search drops below V S. Now suppose that

δ µApAqA vA + µBpBvB V A(∅),V B(∅) > 0 H 0 > V A(0), (17) 2 2 A A A B B 1 − δ + δ µ p0 qH + µ p0 with V j(∅), j = A, B, and V A(0) as in (14) and (15), respectively. We argue that the search index is then equal to

δ µApAqA vA + µBpBvB V S = 0 H 0 . (18) 2 2 A A A B B 1 − δ + δ µ p0 qH + µ p0

To see this, recall that the optimal stopping-times τ j in (12) are the first times at which the index of the arriving candidate drops weakly below V S. Condition (17), along with the fact that V B(0) = 0, guarantee that τ j coincides with the first time at which an evaluation yields a negative signal realization s = 0, and is otherwise equal to τ j = +∞. That the search index is equal to (18) then follows from these properties along with the definition of the search index in (12). Given Lemma1 and the stationarity of the search index, if search is preferred to the evaluation of a candidate at some history, this continues to hold at all future periods. Let A ΓS denote the probability of selecting an A-candidate once search is launched. Given the

14 above conditions,

A A A A A B B A ΓS = µ λ (∅) + (1 − λ (∅))ΓS + µ 1 − p0 ΓS , or

µAλA(∅) ΓA = . S A A B B µ λ (∅) + µ p0

Therefore, under Conditions (1), (16), and (17), for any µ = (µA, µB) with µA+µB = 1, the ex-ante probability of selecting an A-candidate under SAA is equal to

  µAλA(∅)  ΓA(v, p , µ, q, P ) = (1 − pB) λA(∅) + (1 − λA(∅)) . 0 0 A A B B µ λ (∅) + µ p0

Comparison. Under Conditions (1), (16)-(17), for any µ = (µA, µB) with µA + µB = 1, A A Γ (v, p0, µ, q, P ) < Γ (v, p0, (0, 0), q, P ) if and only if (5) holds. Hence,

Proposition 3. Suppose Conditions (1), (5), (16), and (17) hold.17 SAA policies pro- moting the expansion of the candidate pool reduce the ex-ante probability that minority A A A B candidates are selected: Γ (v, p0, µ, q, P ) < Γ (v, p0, (0, 0), q, P ), for any µ = (µ , µ ) with µA ≥ µB = 1 − µA.

Proof. The result follows from the arguments preceding the proposition.

The intuition is similar to the one for the myopic rule. Qualified minority candidates whose initial evaluation yields negative results due to noise in the evaluation process may not have the opportunity to prove themselves when search is an attractive alternative to further evaluation. When this is the case, SAA policies promoting improvements in the search technology may reduce the ex-ante probability that a position is given to a minority candidate. Importantly, this may occur even if such improvements bring more minority candidates to the pool than non-minority ones (that is, even if µA > µB).

Next, consider SAA policies tilting the search technology in favor of minorities at the expense of non-minorities (formally, an increase in µA along with a reduction in µB). Consider the same evaluation technology as in (6). Arguments identical to those for SAA policies promoting improvements in the search technology imply that V j(∅) =

17These conditions, along with the maintained assumptions (i)-(iv), hold for a non-empty open set of A B A B A parameters. For example, they hold when δ = 0.9, p0 = 0.75, p0 = 0.7, v = 1.2, v = 1, qH = 0.19, and µA = 0.52 = 1 − µB .

15 j j j  j j j j  j j j λ (∅)v / 1 − δ(1 − λ (∅)) and V (0) = λ (0)v / 1 − δ(1 − λ (0)) , with λ (∅) = p0qH and

j j j j j j (1 − qH )p0qH λ (0) = p (0)qH = j j , 1 − qH p0 j = A, B. Assume the B-candidate is evaluated first:

B B B A A A p0 qH v p0 qH v B B > A A . (19) 1 − δ(1 − p0 qH ) 1 − δ(1 − p0 qH ) | {z } | {z } V B (∅) V A(∅)

Status-quo (µ = (µA, µB) with µA = 1 − µB < 1). As in the case of SAA policies promoting improvements in the search technology, assume

δ µApAqA vA + µBpBqB vB V A(∅),V B(∅) > 0 H 0 H > V A(0),V B(0), (20) 2 2 A A A B B B  1 − δ + δ µ p0 qH + µ p0 qH and observe that this condition implies that the optimal stopping-time in the formula for V S is the first time at which an evaluation yields a negative realization s = 0, which in turn implies the search index is equal to

δ µApAqA vA + µBpBqB vB V S = 0 H 0 H . 2 2 A A A B B B  1 − δ + δ µ p0 qH + µ p0 qH

Search under SAA (µ = (µA + ζ, µB − ζ), ζ > 0). Now assume

δ(µA + ζ)λA(∅)vA + δ(µB − ζ)vB λB(∅) + δ(1 − λB(∅))λB(0) V A(∅),V B(0) > (21) 1 − δ2 ((µA + ζ) (1 − λA(∅)) + (µB − ζ)(1 − λB(∅))(1 − λB(0))δ) > V A(0),V B(0, 0), (22) where V A(∅) and V B(0) are as defined above and where

λB(0, 0)vB V B(0, 0) = , 1 − δ(1 − λB(0, 0))

B B B with λ (0, 0) = p (0, 0)qH and

(1 − qB )pB(0) pB(0, 0) = H . B B 1 − qH p (0)

16 Conditions (21)-(22) imply that it takes one negative signal realization s = 0 for the index of an A-candidate to drop below

δ(µA + ζ)λA(∅)vA + δ(µB − ζ)vB λB(∅) + δ(1 − λB(∅))λB(0) (23) 1 − δ2 ((µA + ζ) (1 − λA(∅)) + (µB − ζ)(1 − λB(∅))(1 − λB(0))δ) and two negative signal realizations s = 0 for the index of a B-candidate to drop below the value in (23). These properties in turn imply the search index is equal to the value in (23).18 Comparison. Now observe that when (10) holds in addition to the conditions assumed above, A A B  A A B  Γ v, p0, (µ , µ ), q, P > Γ v, p0, (µ + ζ, µ − ζ), q, P .

Proposition 4. Suppose Conditions (6), (10), and (19)-(22) hold.19 SAA policies tilting the search technology in favor of minorities reduce the ex-ante probability of selecting a A A B  A A B  minority candidate: Γ v, p0, (µ + ζ, µ − ζ), q, U < Γ v, p0, (µ , µ ), q, U , for any ζ > 0.

Proof. The result follows from the arguments preceding the proposition.

The mechanism behind this result is similar to the one under the myopic rule. Such policies reduce the overall attractiveness of search relative to a more careful evaluation of the candidates already in the pool. Because minority candidates are more difficult to evaluate, the committee may then respond to such policies by substituting search with the evaluation of non-minority candidates at the expense of minority ones. When this effect is strong enough, such policies may thus have the unintended effect of reducing the ex-ante probability of selecting a minority candidate.

5 Discussion

The Rooney rule was adopted in 2003. Yet in 2020, there are only three African-American head coaches, the same number as in 2003, prompting criticism viewing the rule as merely a means of “checking a box”.20 Similar hesitations about the effectiveness of other SAA

18The arguments are similar to those for the SAA policies promoting improvements in search technology. 19These conditions, along with the maintained assumptions (i)-(iv), hold for a non-empty open set of A B A B A parameters. For example, they hold when δ = 0.9, p0 = 0.69, p0 = 0.68, v = 1.01, v = 1, qH = 0.4, B A B qH = 0.8, µ = 0.15 = 1 − µ , and ζ = 0.01. 20See the Washington Post, Jan 5, 2020: “The dearth of black coaches in the NFL is a problem that somehow still hasn’t been fixed”.

17 policies have recently been raised in contexts ranging from academic recruitment to CEO hiring. Why are many SAA policies unsuccessful? This paper suggests a possible explanation based on differences in the effectiveness of evaluating minority and non-minority candi- dates, which does not presume any bias in decision making. SAA policies promoting the expansion of the candidate pool or tilting the search technology in favor of minorities (while retaining discretion over when to expand the pool) do increase the percentage of minority candidates considered for a position, but also alter the desirability of searching for new candidates relative to evaluating those already under consideration. The purpose of this article is to show that such policies may lead to a reduction in the probability a position is assigned to a minority candidate. Unless accompanied by steps to reduce difficulties in the evaluation of minorities, such SAA policies thus do not guarantee their desired effects; they may indeed amount to “checking a box,” or even prove counterpro- ductive. In such circumstances, “hard” affirmative action may be necessary to enhance minority recruitment.

References

[1] Aigner, D. and G. Cain (1977). “Statistical Theories of Discrimination in Labor Markets,” Industrial and Labor Relations Review, 30 (2), 175-187.

[2] Arrow, K. (1972). “Models of Job Discrimination,” In in Economic Life.

[3] Arrow, K. (1973). The Theory of Discrimination, in Orley Ashenfelter and Albert Rees, eds., Discrimination in Labor Markets, Princeton, NJ: Princeton University Press, pp. 3-33.

[4] Bardhi, A., Guo, Y. and B. Strulovici (2020). “Spiraling or Self-Correcting Discrimination: A Multi-Armed Bandit Approach,” Working paper.

[5] Becker, G. S. (1957). The Theory of Discrimination, University of Chicago Press.

[6] Borjas, G. J. and M. S. Goldberg, (1979). “Biased Screening and Discrimination in the Labor Market,” American Economic Review, 68 (5), 918-22.

[7] Chambers, C. P. and F. Echenique (2018). “A Characterization of “Phelpsian” Statistical Discrimination,” Working paper, arXiv preprint arXiv:1808.01351.

18 [8] Chung, K.-S. (2000). “Role Models and Arguments for Affirmative Action,” The American Economic Review, 90, No. 3, 640-648.

[9] Coate, S. and G. C. Loury, (1993). “Will Affirmative-Action Policies Eliminate Negative ?,” The American Economic Review, 83 (5), 1220-1240.

[10] Cornell, B. and I. Welch (1996). “Culture, Information, and Screening Discrimi- nation,” Journal of Political Economy, 104 (3), 542-571.

[11] DuBois, C. (2015). “The Impact of “Soft” Affirmative Action Policies on Minority Hiring in Executive Leadership: The Case of the NFL’s Rooney Rule,” American Law and Economics Review, 18 (1), 208-233.

[12] Fang, H. and A. Moro (2011). “Theories of Statistical Discrimination and Affir- mative Action: A Survey,” In Handbook of Social Economics (Vol. 1, pp. 133-200). North-Holland.

[13] Fershtman, D. and A. Pavan (2019). “Sequential Learning with Endogenous Con- sideration Sets,” Working paper, Tel Aviv University and Northwestern University.

[14] Fleming, J. and N. Garcia (1998). “Are Standardized Tests Fair to African Amer- icans? Predictive Validity of the SAT in Black and White Institutions,” The Journal of Higher , 69 (5), 471-495.

[15] Fryer, R. and G. C. Loury (2005).“Affirmative Action and Its Mythology,”Journal of Economic Perspectives 19 (3): 147–62.

[16] Fryer, R. and G. C. Loury (2013). “Valuing Diversity,” Journal of Political Econ- omy 121 (4), 747-774.

[17] Gittins, J. and D. Jones (1974). “A Dynamic Allocation Index for the Sequen- tial Design of Experiments,” In J. Gani (Ed.), Progress in Statistics, pp. 241-266. Amsterdam, NL: North- Holland.

[18] Heilman, M. E. (1980). “The Impact of Situational Factors on Personnel Decisions Concerning Women: Varying the Sex Composition of the Applicant Pool,” Organiza- tional Behavior and Human Performance 26, 286–295.

[19] Holzer, H. and D. Neumark (2000). “Assessing Affirmative Action,” Journal of Economic Literature, 38 (3), 483-568.

19 [20] Lang, K. (1986). “A Language Theory of Discrimination,” The Quarterly Journal of Economics, 101 (2), 363-382.

[21] Leaper, C. and R. D. Robnett (2011). “Women Are More Likely than Men to Use Tentative Language, Aren’t They? A Meta-Analysis Testing for Gender Differences and Moderators,” Psychology of Women Quarterly, 35 (1) 129-142.

[22] Linn R. L. (1973). “Fair Test Use in Selection,” Review of Educational Research, Vol. 43, No. 2, pp. 139-61.

[23] Lundberg, S. and R. Startz (1983). “Private Discrimination and Social Interven- tion in Competitive Labor Market,” The American Economic Review, 73 (3), 340-347.

[24] Moro, A. and P. Norman (2003). “Affirmative Action in a Competitive Economy,” Journal of Public Economics, 87 (3-4), 567-594.

[25] Phelps, E. (1972). “The Statistical Theory of Racism and ,” The American Economic Review, 62 (4), 659-661.

[26] Schuck, P. H. (2002). “Affirmative Action: Past, Present, and Future,” Yale Law & Policy Review, 20, 1.

[27] Weber, R. (1992). “On the Gittins Index for Multiarmed Bandits,” The Annals of Applied Probability, 2 (4), 1024-1033.

[28] Weiss, G. (1988). “Branching Bandit Processes,” Probability in the Engineering and Informational Sciences, 2 (3), 269-278.

[29] Welch, F. (1976). “Employment Quotas for Minorities,” Journal of Political Econ- omy, 84 (4, Part 2), S105-S141.

20