Received: 20 June 2018 | Revised: 10 January 2019 | Accepted: 24 January 2019 DOI: 10.1111/ecpo.12130

ORIGINAL ARTICLE

Kleptocracy and evasion under resource abundance

Hamid Mohtadi1,2 | Michael L. Ross3 | Uchechukwu Jarrett4 | Stefan Ruediger5

1University of Wisconsin, Milwaukee, Wisconsin Abstract 2University of Minnesota, Minneapolis, Evidence has shown that petroleum wealth is associated Minnesota with less transparency and at the same time less tax collec- 3 UCLA, Los Angeles, California tion. In this paper, we find that the two issues are linked 4 University of Nebraska, Lincoln, Nebraska through the citizens’ behavior. We develop a 5 Arizona State University, Tempe, Arizona model to explain this link and conduct extensive empirical

Correspondence tests of its validity. The explanation is that officials tradeoff Hamid Mohtadi, University of Wisconsin, greater transparency to improve tax compliance against less Milwaukee, WI. transparency to increase gains from corruption. Oil wind- Email: [email protected] falls diminish needs, causing officials to opti- Funding information mize on less transparency. Seeing this, citizens optimize on Economic Research Forum a lower level of tax compliance. At equilibrium, both de- cline with a positive oil shock. We also study the alternative channel in which tax compliance responds to enforcement. Transparency is found to be the more robust channel. Ignoring citizens’ strategic behavior would lead to predict- ing suboptimal investment in state capacity for tax enforce- ment. Using giant oil discoveries data combined with oil price data, we develop a dynamic composite instrument and estimate the model with a dynamic panel system general- ized method of moments. We find robust support for our explanation and the model's deep structure for 130+ coun- tries and the 1980–2010 period.

JEL CLASSIFICATION Q34, H3, H26, O17, O11

Economics & Politics. 2019;1–51. wileyonlinelibrary.com/journal/ecpo © 2019 John Wiley & Sons Ltd | 1 2 | MOHTADI et al.

KEYWORDS corruption, oil, public policy, resource curse, tax evasion, taxation, transparency

1 | INTRODUCTION

Does a kleptocratic state produce less compliant citizenry? This paper provides a framework to answer this question with reference to a kleptocratic state's tax policy and its citizen tax evasion behavior, both in general and under natural resource abundance. Natural resource wealth is linked to a range of ad- verse economic and political outcomes.1 Focusing on the state's capacity to collect , and adopting the fiscal capacity framework of Besley and Persson (2009) and Cárdenas, Ramírez, and Tuzemen (2011) show that natural resource dependence reduces the state's capacity to collect taxes. When Besley and Persson (2010, 2011) introduced natural resources in their framework, they too predict that natural resource dependence leads to weaker state capacity. Crivelli and Gupta (2014) found evidence of resource revenues having a negative effect on income taxes, while Jensen (2011) suggested that “resource intensification weakens state-­building by impeding the state's fiscal capacity.”2 The approach taken in these papers implicitly assumes that citizens automatically increase com- pliance in response to greater tax enforcement as a part of the state's investments in capacity. This implies a more or less passive citizenry and overlooks their active participation in a tax evasion game. Ignoring citizens’ strategic behavior may predict suboptimal investment in state capacity for tax en- forcement when agents’ (citizens’) behavior is not anticipated. Critically, tax compliance may respond to other factors in addition to enforcement, such as the transparency of state policy. Overlooking this issue will lead to a second suboptimality of investments in state capacity for tax enforcement. We show that optimal level of enforcement (when it exists) depends on how much transparency is provided and find that (under some mild conditions) more transparency means less enforcement is needed. Specifically, we analyze a game in which tax payers comply/evade optimally, given the state's enforcement and transparency policies. In turn, the state chooses its policies, anticipating the tax pay- ers’ optimal compliance/avoidance response. Although this aspect of the model, i.e., the interaction between transparency and tax enforcement, is general and independent of natural resource wealth, the focus on resource rich economies brings in an additional dimension to consider. Specifically, resource windfalls diminish tax revenue needs and this causes corrupt officials to optimize on less transparency and less enforcement, and citizens to “optimally” comply less. Empirically, we find that (a) oil wealth does reduce both transparency and tax enforcement (or its proxy) and consequently the level of citizen tax compliance; and (b) that transparency is a more robust substitute for tax enforcement even in the presence of oil wealth. In short, the model and its deep structure is supported by extensive empirical evidence. For oil wealth, we use exogenous oil discoveries and for method, we use a dynamic gen- eralized method of moments (GMM) technique that addresses endogeneities. In effect, we show that natural resource abundance alters, not only the nature of institutions but the behavior of the citizens. Interestingly, tax evasion/tax compliance is discussed primarily in the taxation literature and mostly for developed economies (Andreoni, Erard, & Feinstein, 1998; Cuccia, 1994; Jackson & Milliron,

1For summaries of this voluminous literature, see Frankel (2010), Ross (1999, 2015), Stevens and Dietsche (2008) and Wick and Bulte (2009). 2Juan Pardinas, the General Director of the Mexican Institute for Competitiveness stated in 2013 that, “We collect few taxes because we have oil… That allows us to pay less in taxes…” (New York Times, September 9, 2013). MOHTADI et al. | 3 1986; Kinsey, 1986; Long & Swingen, 1991; Richardson & Sawyer, 2001) and, at any rate, almost never in the context of a natural resource windfall. Transparency matters. It allows the tax payers to observe how public funds are spent. Without such observation, tax payers lose trust which increases their incentive to evade taxes. In addition, because rentier states prefer less transparency to hide extraction of resource rents for personal use, more re- source rent implies even more rentier behavior, providing the tax payers with added incentive to evade. Petroleum wealth is indeed associated with reduced transparency, meaning fewer public disclosures about government policies, institutions, and activities,3,4 as shown in Figure 1. We explore why re- source wealth reduces transparency. Figure 2 also confirms the negative association of oil wealth with tax revenues. We employ several rich datasets. First, we exploit plausibly exogenous cross-­country variations on new giant oil field discoveries (e.g., Arezki, Ramey, & Sheng, 2017) complied by Horn (2014), combining it with exogenous temporal variations from oil prices to construct a composite dynamic in- strument for oil. We also use high quality data on taxation, meticulously complied by the International Center for Tax and Development (ICTD, 2014), and oil values complied by Ross (2013) both of which remedy shortcomings of other alternatives as detailed in the data section, along with datasets on transparency and accountability by Williams (2011, 2015). With the aid of this and other data, the deep structure of the model and its implications are tested for the period 1980–2010 and for 130–150 countries using a dynamic panel system GMM approach. Results strongly support the model and its implications. This paper is related to several strands of the literature. First, its key channel besides enforce- ment, i.e., the link between tax compliance and government transparency, is inspired by the

FIGURE 1 Oil and transparencySource: authors

3A prevailing definition of transparency used by the International Monetary Fund (2012) is: “the clarity, reliability, frequency, timeliness, and relevance of public reporting and the openness to the public of the government's fiscal policy-making­ process.” 4Several international initiatives are based on the belief that this outcome is not inevitable. They include the Extractive Industries Transparency Initiative, the IMF's Fiscal Transparency Code, the UNDP's Strategy for Supporting Sustainable and Equitable Management of the Extractive Industries, and the mandatory disclosure laws adopted by the US and EU for extractive industry firms operating abroad. 4 | MOHTADI et al.

FIGURE 2 Oil and taxation taxation-­representation arguments in which the government's need for direct taxation induced greater political accountability both in the nineteenth century Europe (Hoffman & Norberg, 1994; North, 1990; North & Weingast, 1989) and in contemporary societies (Brautigam, Fjeldstad, & Moore, 2008; McGuirk, 2010; Ross, 2004). Viewed in this perspective, the paper examines whether the presence of large oil revenues may compromise this historical social contract and, if so, the way in which this happens. Second, this paper is also related to the literature on the effect of natural resources on institutions. Several papers find that the “economic resource curse”, i.e., the adverse effect of natural resources on growth, operates only conditionally, i.e., only if institutions are weak to begin with.5 More recent work suggests that institutions are themselves affected by resource dependence. This paper is related to latter breed. Specifically, it is related to the existing evidence on the negative association between oil or natural resource wealth and transparency (Mohtadi, Ross, & Ruediger, 2014; Ross, 2011; Williams, 2011).6 Since the lack of transparency is likely to promote corruption, this paper is also related to the liter- ature on government corruption and tax evasion. For example, Tanzi and Zee (2000) linked citizens’ tax evasion to their perception of government corruption. Mohtadi and Polasky (2016) developed a model in which corrupt governments operate under information asymmetry, leading citizens to evade taxes when uncertain about efficacy of government spending.

5Studies indicating that the impact of natural resources on income depends on the quality of institutions are found in Boschini, Pettersson, and Roine (2007), Lane and Tornell (1996), Mehlum, Moene, and Torvik (2006), Robinson, Torvik, and Verdier (2006), and Tornell and Lane (1999). 6Williams (2011) found a correlation between resource wealth and reduced transparency in a smaller panel of countries over a briefer span of time. Ross (2011) reported a cross-­national correlation between oil wealth and lower scores on the Open Budget Index, but only among authoritarian states. Mohtadi et al. (2014) estimate the relationship between petroleum wealth and trans- parency, finding that oil is associated with less transparency when it constitutes a significant share of national income (repre- senting a country's dependence on oil), but not when it merely contributes to a higher average wealth of the country (an indica- tor of oil abundance). Consistent with this finding, the present paper also uses oil as a fraction of national income both in its model and in its empirical formulation. MOHTADI et al. | 5 Finally, if we view the government's incorporation of citizen tax evasion in its policy decision as a tacit acknowledgment of such a behavior and thus a form of implied subsidy, this paper is related to the rentier state literature where regimes use resource revenues to subsidize pressure groups that might otherwise demand accountability (e.g., Ross, 2001). In this case, paying less tax is akin to receiving funds. Intuitively, corruption requires some degree of opacity to conceal corrupt acts, so that it makes sense for corruption and transparency to move in opposite directions. But tax enforcement is more complex. On one hand, a corrupt state might prefer less enforcement since enforcement is costly, requiring elaborate collection structure and may reveal other forms of corruption; on the other hand, corrupt states might have an interest in enforcing taxes more vigorously in order to extract more reve- nues. We allow for both possibilities in the model. But empirically we do not find this latter outcome. By identifying each channel separately, we find (a) that oil wealth reduces both transparency and tax enforcement; (b) that tax payers’ compliance responds positively to both transparency and tax enforce- ment, and therefore falls, given the drop in both that is associated with a positive oil shock; and (c) that transparency is a more robust channel, both as the government's preferred policy, and in terms of its impact on tax compliance/avoidance behavior. A broader implication of our findings is that resource abundance not only affects the structure of the state and its institutions but also the behavior of the citizens. In the remainder of this paper, Section 2 presents the model; Section 3 presents the linkages be- tween the model and the empirical methodology by mapping theoretical propositions to empirically testable hypotheses; Section 4 discusses the data and choice of the instruments; Section 5 discusses the estimation strategies, the econometric model and additional data. Section 6 presents the results and Section 7 offers concluding remarks.

2 | THE MODEL

To capture the key ideas, a simple stylized model of the economy is developed. In this framework, the private economy is made of firms and households. Firms are perfectly competitive and produce a single final good with labor (l), capital (k) and a non-­excludable public input provided by the govern- ment which is external to the agents (e.g., infrastructure) Gn. This represents net spending on public goods (hence subscript), i.e., net of costs of building capacity on tax enforcement (L) and transparency (T). Specifically, transparency and tax enforcement are both costly to provide. An example of the first is investments in statistical capacity and measurements, and of the second is investments in legal institutions. We make the reasonable assumption that this cost is a fraction c(T, L) of gross spending G so that Gn = (1 − c(T, L))G where c(T, L) is additive in T and L so that c(T, L) = θ(T) + ϕ(L). It is also reasonable to assume that these costs rise with the amount of T and L that are provided so that θT(T) > 0, ϕL(L) > 0. Although having public goods in the production function is commonly done, going back at least to Barro (1990), its use in the context of a kleptocratic state requires some explanation. For example, why not steal all of Gn? Why provide any public goods at all? There are at least two reasons for this. First, the extraction of rents takes place through the budgetary process—which means through the re- lation between revenues and expenditures which we will model shortly below. Second, this extraction, cannot be for the entire revenues earmarked for Gn, as this would pose another problem: Without the provision of any Gn, output would collapse and there would be no income base from which to tax and extract rents. This is in line with Mancur Olson's (1993) well-known­ argument that a kleptocratic ruler (referred to as a “stationary bandit”) provides a minimal supply of public goods to facilitate a 6 | MOHTADI et al. steady stream of rents. In this view as in our model self-­interest, rather than the threat of revolution, is what compels the government to provide some public goods. Without loss of generality, we assume a representative firm. This means:

1−  1−  Y = A(k l )Gn = A(k l )[(1−c)G] , ∈ (0,1) (1) where A is the usual scale factor representing the level of technology. Demand for labor and cap- ital by the firms are given by their marginal products, derived from maximization of profits, α 1−α β πY = A(k l )G − wl − rk, where Y is the numéraire good with price of 1. Labor is fully employed with measure 1, so that l* = 1 where l* is demand for labor. We assume that capital markets are open so that return to capital, r, is exogenous, given by the international capital markets: r = r̄. Profit maxi- mization then yields aggregate output Y and labor demand, given G. G (and of course capacity cost c) is exogenous to the agents but subject to balanced budget equation (below). Thus total output, labor, and capital income are,

Y c G 1− =Ω[(1− ) ] (2)

 ∗ ∗ Y w l w c G 1− l = = = (1− )Ω[(1− ) ] (3)

훽 ∗ Y rk̄ 훼 c G 1−훼 k = = Ω[(1− ) ] (4) 훼 1 훼 1−훼 Ω ≡ A 1−훼 where r̄ which is a function of the constants of the economy. Households own factors of production so that Yl and Yk constitute their pretax gross income.

2.1 | Public goods and government budget Public goods G are financed by natural resource revenue R and income tax revenue X. (We reserve the notation T for transparency introduced later.) Gross income is taxed at the rate t. But taxes are subject to citizen underreporting, the extent of which is determined later by the outcome of a game between the government and citizens. Note that this game will occur only in the tax compliance space once income is earned. This means that through the game, the citizens can affect only their disposable income, while their pretax income is determined outside of the game by supply and demand for labor and capital. For simplicity, but no loss of generality, we assume that capital and labor incomes are taxed at the same rate and are subject to the same level of underreporting. This is represented by a fraction, 1 − μ, where μ is the reported fraction of total income. This simplification allows us to aggregate both types of income so that total reported income, Yr, is proportional to aggregate income:

Yr = Y (5)

Resource revenue is also expressed as a fraction ρ of total income, Y as:

R = Y (6)

It is important to note that although the model is set in the context of natural resource rich econ- omy, R can represent any form of nontax revenue that is proportional to income and in fact can be zero. MOHTADI et al. | 7 In this sense, the model has greater generality than with respect only to natural resources.7 We allow for “leakage” of some revenue to the outside of the transparent system. This is represented by a frac- tion b for total government revenue and represents the degree of extractive (corrupt) behavior by government officials. With this background, gross government spending on public goods is given by, G = (1 − b)(R + X). Since X is the government's aggregate income tax revenue, we have X = tμY. So, the balanced budget equation can also be expressed in term of tax compliance:

G = (1−b)( +t)Y (7)

 Substituting for Y from Equation (2) into seven yields, G = (1−b)( +t)ΩG 1−. Solving for G, we find:

1− G = [1−c(T,L)] 1−− [(1−b)(+t)Ω] 1−− (8)

Note that even though G in 8 is the gross, not the net, level of public goods with respect to the cost of capacity building c, the term 1 − c still shows up in 8. This is of course due to the negative impact of c on output Y (Equation 2) which is then carried over from the tax base Y into the expression for G. Next, we focus on the tax evasion behavior of the citizens following an outline of the game.

2.2 | Game outline Citizen follow the government's lead. Observing government transparency and enforcement policies, they choose their level of tax compliance in response. In turn, the government chooses its transparency and enforcement policies taking into account citizen response. Everything is common knowledge.

2.3 | Tax evasion/compliance technology Citizens decide what fraction of their earned income to hide. We use the terms underreporting, evasion, and noncompliance interchangeably. First, we show how rational tax payers evade taxation in response to both transparency and tax enforcement policies of the government. Let Cu denote the cost/penalty of evasion, measured by underreporting a fraction 1 − μ of income. We assume that tax payers who are caught for underreporting with some probability (see below), face a convex cost for tax evasion, i.e., they receive progressively higher penalty for higher rates of tax evasion. This assumption is needed to produce an interior solution. For analytical tractability, we assume that a quadratic form reflects this convexity.  C (1− ) = (1− )2 with (9) u 2 e where δ is some constant. Let b denote the perceived probability of being caught and penalized for underreporting. Without loss of generality we assume that, if caught, the individual is always penal- ized, thus ignoring the possibility of bribing the functionary to avoid penalty. While it is easy to incorporate this aspect, for example, by considering μ to be net of the bribe rate, the paper's focus is on sovereign or grand corruption, rather than bribery and functionary corruption as in Polinsky and Shavell (2001). The distinguishing feature is that high officials who divert resources for personal use are also the ones who make policy decisions and this is what we are interested in.

7Later, in the model when we focus on exogenous changes in ρ we will interpret this as a positive natural resource shock. Also, our empirics are geared to the testing of the model in the context of natural resources. 8 | MOHTADI et al.

In light of the above, a risk averse taxpayer facing convex costs considers E[Cu(1 − μ)], i.e., the expected penalty for tax evasion per unit of income, in her calculations.8 This is given by: 1 E[C (1− )] = e(1− )2 u 2 b (10) Citizens choose their compliance ratio by balancing the benefits and the costs of noncompliance. They do so by maximizing their expected disposable incomes from labor and capital, given their gross labor and capital income discussed earlier (which is made outside of the game). Since we have assumed, for simplicity, similar tax evasion behavior from both sources, this amounts to maximizing expected total income as follows:

Max [E(Y) Ḡ ] = Max E[(1−휇t)−Cu(1−휇)]Y Ḡ = Max [E(1−휇t)Y Ḡ −E(Cu(1−휇)Y Ḡ ] {휇} {휇} {휇} (11) We now relate E(1−휇t)Y Ḡ to transparency, relying on the uncertainty reducing property of trans- parency. Before we do so, we must discuss why greater transparency might be expected to improve tax compliance. The argument is that if tax payers better understood where their taxes went they might comply more since greater transparency will be associated with greater accountability among other things. It must therefore be the case that there is some understanding of the connection between taxes and public services, albeit with some uncertainty (the extent of which will decline with transparency as we will see), and from there to their income Y (see Equation 2). Let ɛ ∼ N(0, σ2). Then, our argu- ment can be formulated as follows: a E(1−휇t)Y Ḡ = (1−휇t)(1+휀) Y Ḡ (12) where ɛ captures the noise and σ2 the associated uncertainty that tax payers perceive in how their tax evasion might impact their own income (via less public goods) and a expresses the sensitivity (elasticity) of tax evasion to this uncertainty. Taylor series expansion around ɛ yields a a a a (1 + ε) ≃ ε + aε −1 + [a(a − 1)/2]ε −2+…]. We shall assume for analytical simplicity and trac- 2 2 tability that a = 2 in which case the approximation becomes exact equality: (1+ ) = 1+2 + . 2 2 This amounts to having a relatively sensitive (elastic) response. Since E(ɛ) = 0 and E( ) =  , we find:

2 E(1−휇t)Y Ḡ = (1−휇t)(1+휎 )Y Ḡ (13)

Substituting from 10 and 13 into 11, citizens’ best response to government transparency is given by,9

2 ∗ t(1+ ) (t,T,L) = 1− e b (14)

Since it is reasonable, as we have argued above, to assume that greater transparency reduces the de- gree of uncertainty (captured by σ2 here), denoting transparency by T, we define some monotonically 2 increasing function of transparency γ(T) with γ′ > 0, and let (T) = 1∕(1+ ) so that σ2 = [1/γ(T)] − 1 2 � 2 and d(휎 )∕dT =< −훾 ∕훾 < 0, i.e., uncertainty falls with transparency.

8 E[C (1−휇)] = 훾 휋e(1−휇)2 > C [E(1−휇)] = 훾 [휋e(1−휇)]2 This is because, u 2 b u 2 b 9 t T e T e First-order­ condition is, − + ( )b(1−) = 0. Second-­order condition is, −훾( )휋b < 0. MOHTADI et al. | 9 e Next, we focus on the probability of being caught. Citizens estimate b from experience and casual e observation. In the aggregate, (a) expectations are realized so that b = b; and (b) the probability of e being caught increases with greater enforcement of the tax laws, denoted by L, so that b = b = f (L) with f > 0 and f < 1. Putting these together, optimum compliance becomes10 : t 휇∗(t,T,L) = 1− with 훾�, f � > 0 훾(T)f (L) (15)

It follows that tax compliance increases with better transparency and tax enforcement:

휕휇∗ 휕휇∗ 휕휇∗ > 0, > 0, < 0 휕T 휕L 휕t (16)

Proposition 1a Citizen tax compliance increases in transparency T.

Proposition 1b Citizen tax compliance increases in tax enforcement L.

In addition, if 훾 < 0 and f < 0 tax compliance becomes concave in T and L.11 This concavity con- dition turns out to be sufficient for the government (as the first mover) to optimize in T and possibly L. This is discussed in the next section. Both the increasing and the concavity of compliance μ will be empirically tested later.12 The empirical section will also have to wrestle with what we mean by trans- parency. Is transparency just access to information as in Williams’ (2011), or should transparency be associated with a channel through which the government can be held accountable as in Williams (2015)? We will later visit this issue and the empirical challenges that either measure introduces.

2.4 | Government 2.4.1 | Budgetary versus political considerations First we focus on fiscal budgetary issues. Incorporating citizens’ best response, i.e., Equation (15), in Equation (8), government spending is found to be:

1− ∗ G = [1−c(T,L)] 1−− [(1−b)(+t (t,T,L))Ω] 1−− (17)

Define gas the ratio of spending to GDP, g ≡ G/Y. It will be useful to express G in terms of g and continue the discussion in terms of g which is free of scale, rather than G which is scale dependent. In addition, in the estimation section, data will be in terms1− of g not G. To do this, divide both sides of 2 by − 1−( +) G to find 1∕g =ΩG 1− or G = (1−c) 1−− (Ωg) 1−− . Substituting into 17 and solving for tμ* we get,

  ∗ − 1− − t (t,T,L) = g⋅(1−b) 1− − (18)

L L 10In Equation (15), since μ∗ ≤ 1, f and γ have lower bounds, i.e., 훾(T = 0) = 훾 > 0 and f (L = 0) = f > 0. In the case of γ, a lower bound means σ2 is finite. Since ɛ is normally distributed, its variance is always finite and the condition is satisfied.

11 ∗ t f ⋅ �� −2 2 −3 We show this in the case of T: 휇TT = ( ∕ ) (훾 훾 −2훾 훾 ) < 0, if 훾 < 0. ∗ 12 t We do not have systematic data on to test the sign of t . But the literature suggests that compliance does decline with the tax rate (e.g., Friedman, Johnson, Kauffman, & Zoido-Lobaton, 2000). As we will see, at equilibrium, t itself will depend on T, L, and G. 10 | MOHTADI et al. Note that tμ* which is just tax revenue as a share of national income (i.e., X/Y) does not depend on cost of capacity building c. This is because c comes out of the amount that is spent on public goods not the amount that is raised. We assume that government spending decision on g is exogenous vis-a-­ ­vis its transparency (T) and tax enforcement (L) strategies.13 Although we will empirically test the validity of this assumption in a later section, there are several a priori reasons why this may be a realistic assumption. First, the decision of how strongly to enforce the laws in general or tax laws in particular, or how much transparency to allow, are strategic and political decisions that influence citizens’ behavior, as seen in the model. On the other hand, decisions on g are mainly driven by fiscal considerations. It could be argued, however, that if g can be used as a tool by an extractive state to elicit greater loyalty from its constituency, as argued for example, in the rentier literature, would this tool not allow the government to trade more g with less T or L (in which case g, T, and L would be interdependent)? As far as transparency is concerned, we observe that T is a pure , available equally to all constituencies, whereas g would have to be an excludable quasi-public­ good, available to some but not others in order to enable an extractive state to buy-­off one constituency as against another. This argument can also be extended to the enforcement of taxes, L as well. While tax enforcement can be differentially administrated to different groups, such a scheme would be very costly to the government and hence probably impractical. Finally we point out that in our model, g, is also modeled as a pure public good (e.g., roads, bridges, etc.) since it directly enters the production function. So it cannot be denied of one constituency or the other. All of this suggest that a higher (lower) g may be a rather poor substitute for a lower (higher) T and L and vice versa. In the empir- ical section, we find support for the argument that g is not a substitute for either T or L.14 Other factors could of course influence g. For example, the distinct political economic structures in democracies versus non-­democracies imply that there may be systematic effects on government spending. In the empirical section, we will control for this effect by including a polity variable. We will show that the main findings of this paper remain robust despite all these considerations. Finally, more oil revenue might influence the amount of spending. We will take up this issue shortly below. The assumption of exogenous g means that variations in spending g drive variations in the tax rate, t, to satisfy the budget constraint. Since direct data on tax compliance is not available, this assumption allows us to identify compliance in the empirical section of the paper: Because tax revenue as a share of national income, tμ, is made up of tax rate t and compliance rate μ, tax revenue tμ can serve as a proxy for tax compliance μ if we can control for variations in t. The latter is possible if variations in g are a proxy for variations in t, thus allowing us to isolate the compliance component μ from tμ. While nominally published tax rates for each country do not vary greatly from year to year, effective average tax rates probably do, for example, due to changes in the threshold taxable in- come, in the income bracket at which marginal rates change, or in the “enforcement coverage” zones. Moreover, effective tax rates encompass subsidies, i.e., negative taxes, so that we have an additional source of variation in t. The assumption of exogenous g will be empirically tested by allowing it to be endogenous in a GMM setting with its lagged values as instruments versus entering it without lags. The results are nearly identical.15

13Even so, changes in g may still influence T and L indirectly through the balanced budget Equation (18), coming from the accompanying changes in the tax rate (t) to keep the budget balanced. This will be discussed later. 14There is also the possibility that g is subject to political economy forces. For example, interest groups lobbying the govern- ment for more spending (e.g., Mohtadi & Roe, 1998, 2003) or voters choose the types of government associated with different levels of spending (e.g., Persson, Roland, & Tabellini, 2007). As long as citizens’ voting or lobbying behavior is independent of their tax evasion behavior, our results will hold. 15This also rules out reverse causality if, for example, anticipating more tax revenues, via higher the tax rate, were to drive spending decisions. It also rules out other potential endogeneity factors such as if tax revenues and spending are driven by the same unobserved factor(s). MOHTADI et al. | 11 Given this discussion, we express the tax rate t as the implicit solution to Equation (18):

t = h(g T,L,b, ) (19)

16 It is easy to show that tg|T,L,b,ρ > 0 as long as the tax rate is bounded from above. We also observe from 18 that tax revenue as a share of national income tμ falls with more oil revenue, ρ. This result does not depend on how an increase in g or in ρ affects the compliance behavior μ indirectly via T, L, or t channels. It simply says that any change on the right side of 18 has to match an equal change on the left hand side. What if oil producers do spend more on all their citizens? In that case g depends on ∗ 17 ρ: g = g(ρ) with gρ > 0. Then, d(t휇 )∕d휌<0 still holds, so long as gρ has an upper bound. Notice, however, if gρ > 0, then: ∗ d(t휇 )∕d휌 g=ḡ < d(t휇)∕d휌 g=g(휌) < 0 (20)

Inequality 20 allows us to test whether or not oil producing governments spend more on their cit- izens. We will test this in the empirical section by including and excluding g from our controls and compare the coefficients of oil and find support for this finding. We can put this in the form of the following proposition:

Proposition 2 An increase in oil revenue reduces tax revenue either if government spending g is independent of oil revenues ρ or its increase (Δg) in response to a positive oil shock (Δρ) has an upper bound (given above). In the latter case, more oil revenue will mean less of a reduction in tax revenues than would otherwise be the case.

2.4.2 | Strategic considerations Government moves first by anticipating citizens’ level of tax compliance in response to setting the level of transparency, T, and possibly enforcement of tax laws, L. We now study T and L policies. In choosing its level of T and L, the government takes into account, not only the effect of its action on compliance directly via μ* but also indirectly via the implicit dependence of the tax rate t on T and L as seen in 18. Therefore, before the government's decision is discussed, this relationship must be analyzed and under- stood. Focusing on T first, we make the reasonable assumption that extra-budgetary­ diversions b are easier to hide when there is some opacity, i.e., ∂b/∂T ≤ 0. This leads to the following lemma.

Lemma 1 Under reasonable upper bound on the tax rate, t, ∂b/∂T ≤ 0 implies that 휕t∕휕T < 0

Proof See Appendix A.

The fact that more democratic societies which are also more transparent may also experience higher tax rates is because such societies may also spend a higher fraction of their GDP on public spending

1−(+) 16 ∗ ∗ 1− To see this, totally differentiate 18 in g: tg( +t t ) = (1−b) . Now, μ + tμt > 0 if there is an upper bound on the tax rate: i.e., if t < γ(T)f(L)/2. (This is the same condition as for Lemmas 1 and 2 below.) If this holds, then, tg > 0.

1−(훼+훽) 17 1−훼 + g휌 < (1−b) . Note also that 훼 훽<1 or 훽<1−훼. Evidence supports this assumption, with 1 − α as the labor share typically known to be about 60% and β as the share of public goods, typically known to be about 20%. 12 | MOHTADI et al. (g) which would raise the tax rate we hold exogenous here. To avoid possible confounding between democracy and g, in the empirical section we control for variations in both democracy and in g. The application of Lemma 1 to tax enforcement is somewhat more complex. On one hand, corrupt officials may be interested in extracting more taxes from the citizen to generate more revenues in order to divert more rents in which case 휕b∕휕L > 0; on the other, better tax collection is a part of better rule of law institutions in which case there would be less corruption so that 휕b∕휕L < 0. This leads to the second lemma:

Lemma 2 Under reasonable upper bound on the tax rate, t, ∂b/∂L ≤ 0 implies that 휕t∕휕L < 0 while 휕b∕휕L > 0 implies that ∂t/∂L ⪌ 0.

Proof See Appendix B.

We will take advantage of both lemmas in determining the government's transparency and tax en- forcement policy. Let the government payoff depend on how much it diverts for personal use, b. This yields the following utility function: ̄ ⋅ ∗ ̄ ⋅ UG(T,L) =VoG +b(T,L) [휌+t휇 (t,T,L)]Y = VoG +b(T,L) 훽 훽 1−훼−훽 ⋅ ∗ 1−훼−훽 (21) [1−c(T,L)] [휌+t휇 (t,T,L)]Λg with bT ≤ 0 and bL ⪋ 0

1− where Λ=Ω1− − .The second equality in 21 comes from substituting for Y in terms of G from Equation (2) and G in terms of g from the earlier discussion on Equation (17). As stated earlier, we assume that the government's choice of transparency and rule of law are independent of its spending decision, g. Hence ̄ for our purposes, g is exogenous to L and T. In the above equation, VoG is some base level government ̄ utility independent of L and T. In a democracy, VoG is determined by the median voters and b is small, ̄ in which case variations in UG(T) would come from VoG. But more generally this will not be the case. ̄ Our control for variations in democracy in the empirical sections also controls for variations in VoG. The government, in its own self-interest,­ optimizes over the choice of T and L. The idea that the gov- ernment optimizes over the choice of transparency is found in monetary and public policy discussions based on the costs and benefits of transparency (e.g., Cornand & Heinemann, 2008), but to our knowl- edge this is its first application to the case where a corrupt government optimizes over the choice of trans- parency based on tax payer behavior. In this context, the transparency decision involves clear tradeoffs; on one hand, 휕b∕휕T < 0 as already discussed; on the other hand, the fact that citizens tax compliance ∗ increases with transparency, i.e., 휕휇 ∕휕T > 0 (see Equation 16), gives the government a larger income base from which to extract rent. In this case we expect an interior solution for optimum transparency: ∗ T = arg max UG(T,.) g {T} (22) The first-­order condition for T* demonstrates the balance of costs and benefits to the government, from a transparency policy:

 ∗ ∗ ∗ ∗ ∗  −1 ∗ T :b(t +t t ) =−b (+t )−bt + (1−c) 1−− b(+t ) g T t T T T 1− − T (23)

The left side of 23 shows the marginal payoff of greater transparency to the government, via larger ∗ tax base due to better compliance (Proposition 3a t휇T > 0) and the compliance response to lower tax MOHTADI et al. | 13 ∗ rate (Lemma 1 t휇t tT > 0); the right-hand­ side is the marginal cost of transparency via lower rent seek- ∗ ing (−bT > 0), the marginal loss due to the reduction in the tax rate (tT 휇 < 0) and the marginal cost of building transparency capacity (θT weighted by the term in front of it). Note that even though g is ex- ogenous, its variation still influences the government's decision on T through the balanced budget Equation (18) coming from the relation between transparency (T) and tax rate (t). In other words, Equation (18) acts as a constraint: for a given g and ρ on the right hand side of 18, a change in T or L is accompanied by a change in t.18 Using the condition bT < 0 and Lemma 1, Appendix C shows that the second-­order condition for optimal T* holds if ∂2μ/∂T2 < 0. In short, governments limit transparency to their citizens. Their choice of this limit depends on how tax payers behave and, soon we will learn, also on the available natural resource revenue. The empirical section examines the validity of this result which we summa- rize as follows:

2 2 Proposition 3a If 휕휇∕휕T > 0 per Proposition 1a but at a decreasing rate (∂ μ/∂T < 0), an T∗ interior exists (assuming bT < 0 per Lemma 1).

Note that μ is never maximized with respect to T. Proposition 3a only requires that ∂2μ/∂T2 < 0 for a T* policy to exist. The requirement that ∂2μ/∂T2 < 0 is what is needed to tie the behavior of the citizens and the government in the Stackelberg game. We now turn to the enforcement of tax laws, L* = arg max UG(L,.)|g as an alternative mechanism by which a government may improve tax compliance. The first-order­ conditions for optimum en- forcement, L*, is identical to Equation (23) with L replacing T. But now the sign of bL plays a crucial role. If bL < 0, the result is identical to the above and an optimum level of enforcement of taxes will exist. But if bL > 0 it is possible that tL > 0 by Lemma 2. In such a case, no interior solution and thus no optimum enforcement will exist (see Appendix D). This outcome arises because there can be no tradeoff if stronger tax enforcement leads to more rent extraction by the officials while increasing the government's coffers. In this case, a corner solution will exist where tax extraction is at its maximum. We can summarize this result as follows:

2 2 Proposition 3b If 휕휇∕휕L > 0 per Proposition 1b but at a decreasing rate (∂ μ/∂L < 0), an interior L* may or may not exist. It exists if bL < 0. It may not exist If bL > 0, i.e., if enforcing taxes exacerbates government's rentier behavior.

2.5 | Equilibrium Citizens react to the strategic variables, T and L, chosen by the government. But the tax rate, t, is also a variable entering into the government decision (see for example, the use of Lemmas 1 and 2 in Propositions 4a and 4b). This then means that at equilibrium t must enter the tax payer's post op- timization value of μ*. This point has a number of important implications that we will now explore.

2.5.1 | Compliance Focusing on transparency and compliance first, at equilibrium transparency affects compliance in two ways:

18 g is not the only driver of t. In Equation (18) tg > 0 is conditional on other variables. Changes in these also affects t. 14 | MOHTADI et al. ∗ d휇 ∗ ∗ T=T∗(eq) = 휇T T=T∗(re) +휇t tT gov.budget > 0 dT (24) ∗ where T T=T∗(re) is the slope of citizens’ optimal response at T*, denoted by subscript “re”. This first effect is the direct effect and is positive. However, since an increase in transparency also reduces the tax rate (second term on right), it follows that an increase in tax payer compliance at equilibrium (de- noted by subscript “eq” on the left) may be either due to citizen's greater participation in a more trans- parent economy or a reduction in the tax rate. The second differential, via the transparency and the taxation paths, is: d2휇∗ = 휇∗ +2휇∗ < 0 if 휇∗ < 0 (25) dT2 TT Tt TT

Empirically, systematic observations on the tax rate t are not available. Thus, we can only estimate the total effects of dμ/dT and d2μ/dT2 but will not know how much is due to the first term and how much is due to the second term in 24. However, a positive total differential and a negative total sec- ond differential are consistent with rational tax payers optimizing over compliance. Similar argument holds for tax enforcement.

2.5.2 | Tax revenue and tax compliance Appendix E shows that so long as compliance responds positively to transparency, tax collections per capita tμ* will also increase with transparency, at equilibrium. This is significant in two ways. First theoretically, we have found that tax rates fall with transparency (tT < 0) but compliance increases ∗ with it (휇T > 0). So it is not obvious that tax collections tμ* should increase with T. Second, as we have discussed before, while controlling for g empirically is supposed to pin down the tax rate t in the tμ* expression so as to “force” variations in the tax revenue to reflect variations in compliance, the fact that tax collections (tμ*) increase in transparency in the absence of controlling for g, provides further evidence that the underlying compliance must be responding positively to transparency. Of course, we will not be satisfied with this indirect evidence and will actually use compliance data to test the compliance hypothesis directly, albeit for a much more limited period, as a robustness check. Similar results hold for tax enforcement. To summarize,

Proposition 4 If tax compliance increases with transparency or tax enforcement, so do overall tax collections. That is, if d휇∗∕dT > 0 (or d휇∗∕dL > 0), it will be the case that d(t휇∗)∕dT > 0 (or d(t휇∗)∕dL > 0).

2.6 | Tax enforcement and transparency In the Introduction, we argued that transparency can influence the state's capacity to collect taxes. In this subsection, we show how higher levels of transparency will allow the state to put less effort in tax enforcement at the optimum. We will show how this effect arises before we actually test this empiri- cally later. Since in the Introduction we suggested that this result is independent of natural resource wealth, we can set = 0 for simplicity only for this part of our analysis. In addition, our findings will be independent of the cost of capacity building and would hold even if transparency institutions were as costly as tax enforcement institutions. For this reason and for further analytic simplicity we will set (T) = (L) = 0 so that c(T, L) = 0. Our results should be stronger, if analytically more complex, MOHTADI et al. | 15 when tax enforcement institutions were costlier to set up than transparency institutions. With these considerations, the first-­order condition for optimum enforcement, if it exists (see Proposition 3b), is given by, d d L∗ = arg max U (L,.) → (bt ∗) = 0 → b t ∗ +b (t ∗) = 0 g G g dL L dL (26)

Since L* is the solution to 26, we make use of the implicit function rule to implicitly differentiate L* with respect to T. This yields (after collecting terms): d2 dL∗ d d b t ∗ +b (t ∗) +b (t ∗)+b (t ∗) = 0 LT dLdT dT L dT T dL (27) 

We now make use of the first-order­ conditions for enforcement and transparency, Equations (23) and (26), to substitute respectively for the 3rd and 4th terms in 27. After simplification this yields, d2 dL∗ 2b b b t ∗ +b (t ∗) = L T t ∗ LT dLdT dT T∗ b (28)   We shall assume that corruption is separable in the effect of tax enforcement versus transparency, b(T, L) = bT (T)+bL(L) b for example 2 . This implies that∗ LT = 0. Under this assumption Appendix F d (t휇∗) < 0 dL < 0 shows that dLdT unambiguously. In that case dT

Proposition 5 If corruption falls with better tax enforcement (i.e., bL < 0) so that an optimal enforcement level exists (Proposition 3b) and if transparency and tax enforcement separately affect corruption, then less enforcement of taxes is needed at the optimum. Thus, transparency acts as a substitute for tax enforcement.

This result is particularly interesting. The result would be probably reinforced if capacity costs entered the analysis since tax enforcement is likely to be costlier than transparence at the margin (ϕL > θT). Thus, it is likely that the results from Equation (28) and Proposition 5 are understated and would be strengthened under these conditions, allowing for a more efficient outcome in the sense of lower tax enforcement costs. We have ignored this situation for analytic simplicity. We now consider capacity building costs.

2.7 | Capacity building costs Recall that c(T,L) = (T)+(L). We first focus on cost of transparency capacity θ(T). Reexpress the first order condition 23 in its general form (from the original objective function 21) as,

d ∗ [1−c(T,L)] 1−− b(T,L)⋅[+t (t,T,L)] = 0 dT (29)

This simplifies to, d  d b(T,L)⋅[ +t∗(t,T,L)] = (1−c)−1[ +t∗(t,T,L)] dT 1− − dT (30) 16 | MOHTADI et al. Let H(T) ≡ b(T, L)·[ρ + tμ*(t, T, L)]. Then if the left hand of 30 is set to zero, i.e., if dH(T)∕dT = 0 we simply have the first-­order condition for optimal transparency when capacity costs are zero, yield- ∗ ∗ ing T =0. However, the full equation in 30 yields T 휃>0. From this it is clear that the slope of the ∗ ∗ ∗ curve H(T) which peaks at T =0 is positive at T 휃>0, i.e., H(T) is still on its ascendance at T 휃>0. It ∗ ∗ follows that T 휃>0 < T 휃=0. Same argument holds for L * subject to its existence. Thus, Proposition 6 Higher costs of capacity building lead to lower optimal levels of transparency or tax enforcement (when the latter exists).

An interesting implication of this proposition, in light of Proposition 5, is that if tax enforcement is costlier than transparency capacity, relying on transparency would be cost saving. We now return to the case where natural resource wealth is nonzero.

2.8. | Resource rents, transparency, and enforcement Our key results are how rents from natural resources impact the government's transparency policy (though in the process we address tax enforcement as well). To see this, rewrite the first-­order condi- ∗ tion in Equation (23) as Ψ(T , ) = 0 (suppressing the dependence on L). Collecting all the terms in 23 to the right, we see that Ψ휌(T,휌) < 0. Similarly, the second-order­ condition established in Appendix C ∗ can be written as ΨT (T,휌) T∗ < 0. Totally differentiating the first-order­ condition Ψ(T , ) = 0 in ρ we have, dT Ψ (T, ) ∗ ⋅ +Ψ (T, ) ∗ = 0 T T d T (31)

from which we obtain,

dT∗ Ψ휌(T,휌) T∗ (−) =− =− < 0 d휌 ΨT (T,휌) T∗ (−) (32) We summarize this result: Proposition 7a Given a positive resource/oil shock, and conditional on a T* existing, a klep- tocratic state always opts for less transparency.

Similar result holds for tax enforcement:

Proposition 7b Given a positive resource/oil shock and conditional on an L* existing, a klep- tocratic state always opts for less tax enforcement.

∗ ∗ Note that in the case of Proposition 7a, observing 휇T > 0, 휇TT < 0 is sufficient to guarantee an ∗ optimum T* will exist from which dT ∕d휌<0 would result. But in the case of Proposition 7b, ob- ∗ ∗ serving 휇L > 0, 휇LL < 0 is not sufficient. In addition, we must also have bL < 0 as a sufficient (but not ∗ ∗ ∗ necessary) condition for dL ∕d휌<0. In the empirical section we are able to test for 휇T > 0, 휇TT < 0 ∗ and dT ∕d휌<0. Confirming these signs is consistent with the inference that a self-­interested state rationally chooses/limits transparency to suite its self-­interest. In the case of tax enforcement, again ∗ ∗ ∗ ∗ ∗ we can test for 휇L > 0 휇LL < 0 as well as for dL ∕d휌<0. Should we confirm that 휇L > 0, 휇LL < 0 and ∗ dL ∕d휌<0, we are then able to make the same inference for L* as we did for T*. In this instance, the ∗ role of bL is mute since establishing dL ∕d휌<0 is an indication of the underlying rational self-interest.­ MOHTADI et al. | 17 3. | FROM MODEL TO HYPOTHESIS TESTING

Before we empirically test the theory, we must be able to map the propositions of the model into em- pirically testable hypotheses. We begin with Proposition 1 rather than 2.

H1: For a given level of public spending, more resource revenue means less income tax revenue.

Testing H1 is a “reduced form” test since it says nothing about the structure of the model. Other propositions connect different aspects of the model so testing them can be viewed as “structural.” For example, Propositions 3a and 3b link the tax payers’ compliance behavior to the rentier government's transparency and tax enforcement policy. They also relate the curvature of this compliance behavior as a way of testing whether a rational self-interested­ rentier government “optimizes” in the policy space to benefit itself. Combining Propositions 1a and 3a and Propositions 1b and 3b in pairs we have the following pair of hypotheses:

H2a: Tax compliance increases in transparency at a diminishing rate.

H2b: Tax compliance increases in enforcement at a diminishing rate.

Testing these hypotheses sheds light on both the tax payer behavior and the rentier government be- havior. Given the possibility that, in principle, corrupt governments may have the incentive to enforce taxes more vigorously to extract more rents, i.e., that bL may be >0 (besides the more intuitive possi- bility of bL < 0), hypothesis H2b requires some elaboration. Specifically, if the “diminishing rate” part of H2b is rejected, then no self-­serving optimal enforcement of taxes exists by a rentier government. Intuitively, this implies corrupt governments might prefer to enforce tax rules maximally to get more taxes because bL > 0. In this case, tax enforcement behaves differently from transparency: instead of a tradeoff, we have positive feedback from corruption to tax enforcement. The empirical section will examine these hypotheses. In addition, Proposition 4 provides a check on the relation between transparency, or tax enforce- ment, to tax revenue collections, as a reflection of the underlying relation between these two drivers and tax compliance. In hypothesis format we can write,

H3: Tax collections should increase with transparency or tax enforcement.

Finally, mapping Propositions 7a and 7b, with the caveat discussed at the end of Section 1, we have:

H4a: Transparency falls with resource revenue as a share of GDP.

H4b: Tax enforcement falls with resource revenue as a share of GDP.

Figure 3 presents a conceptualization of these hypotheses and the relationship among them. For ex- ample, one can view hypothesis H1 as signifying the “demand side” for tax revenues (by the government) while hypotheses H3 and H4 as signifying the “supply side” (by the citizens). 18 | MOHTADI et al. 4. | DATA

We employ several datasets, beginning with the transparency data. First we discuss our approach to measuring transparency. We follow Williams (2015) and consider transparency, at large, to consist of both informational transparency and accountability. Our theoretical model on compliance indicates that both the degree of knowledge (information) about the government's budget and the degree of confidence by the citizens that it is spent on public goods are important. The former is better captured by an information transparency index, first developed by Williams (2009) called release of informa- tion (RI), then later used in Williams (2011), and further refined in Williams (2015), while the latter is better captured by an accountability index per Williams (2015). Accountability and transparency capture somewhat different things. Discussing accountability Williams (2015, p. 811) states, “this is a different form of transparency, in that access to this information by the public has less meaningful economic value unto itself, but is designed to provide a check on the behavior of the government, and to therefore promote accountability.” Williams (2009) Information transparency index, which was updated on Williams’ website to 2010, counts the frequency and extent of annual data released by governments and published in World Bank's World Development Indicators (WDI) and the IMF's International Financial Statistics. Its main advantage is coverage: covering every country listed in WDI from 1970 to 2010, though the common time period between this dataset and a second dataset on taxation and government revenue (below), limits us to the 1980–2010 period.19 Although we will use both the transparency and the accountability indices, the accountability index turns out to be highly correlated with the proxy that we must use for tax enforcement, i.e., rule of law20 (correlation of about 0.7321). This makes sense since a large part of accountability of any government is abiding by rule of law. While we will still opt to use both measures and present both sets of results, because

Oil

H4 Transparency or Tax Enforcement H1 Supply of H2 Taxes (by the cizens) Demand for taxes (by government) Tax Compliance for given level of spending

FIGURE 3 Hypotheses and model structure

19In Williams (2015), the information transparency and accountability indices are broken down into their subcomponents but for our purposes using the overall measure for each is more appropriate. 20The WDI index of rule of law is composite index which includes the enforcement of taxes as one of its components. However, WDI does not provide information on the specific components of this index. For this reason we use the full index as our best proxy for tax enforcement. 21Naturally, this correlation is for non-missing­ observations in both variables. MOHTADI et al. | 19 of the collinearity with the rule of law we present the accountability results as a robustness test for the main regressions, rather than as part of the main regressions. Besides Williams’ data, Hollyer, Rosendorff, and Vreeland (2014) have developed a transparency dataset similar to Williams (2009) but based on a Bayesian methodology. However, their transparency measure contains 1,500 fewer observations. Moreover, the correlation between the two datasets is very high (82%). Thus, we opted for the dataset by Williams which has the added advantage of compatibil- ity with his accountability index. While simply counting the amount of information is not an accurate measure of the quality of information released by governments, Williams (2011) shows that the RI index is highly correlated with World Bank's Statistical Capacity index and thus also casts light on the quality of information released. This dataset allows us to use panel estimation techniques and hence examine within-country­ effects over time. The index ranges from −10 to +10 which we normalize to (0,1). For taxation data, we use a dataset from the International Center for Tax and Development (ICTD, 2014) for 1980–2010. The ICTD dataset significantly improves on existing data in quality, standard- ization, and coverage.22 Of the many tax revenue variables available from ICTD, we choose revenue from personal income taxes because this variable matches the theory more closely. Since no direct and reliable international data exists on tax compliance,23 tax revenue is used as a proxy for tax compli- ance based on the earlier theoretical discussion (see discussion preceding Equation 19) in which by using government spending g to control for the variations in the tax rate, t, component of tax revenue tμ one should be able to isolate and identify the compliance component, μ. We carefully examine this assumption in detail in our results section (Section 6.3). Our findings there are consistent with this assumption. We also carry out a robustness test of our results, using a limited dataset available on tax compliance from the Fraser Chain Linked index (Gwartney, Lawson, & Hall, 2013) for 2004–2010. The third main dataset is on oil wealth. There are several problems with using the data from the WDI. First, while the measure of rents nets out the production costs of a resource and therefore in theory seems a suitable measure, in practice costs are measured as a rough estimate taken at a single point in time and are heavily dependent on accounting assumptions; they are hence both uncertain and do not capture actual variations in extraction costs over time, which can be very large. Second, using the WDI “rents” measures create a serious endogeneity problem: The measure relies on countries’ self-­reporting of their resource data to the World Bank and since our measure of transparency is based on the frequency and extent of reporting of data by countries (see next section), any missing observa- tions may be an indication of low transparency rather than being randomly distributed. Third, there are many missing observations, which may bias our estimate of oil's impact. Fourth, due to missing observations the scope of the coverage of WDI's data is more limited, especially when it is combined with other variables. To circumvent these problems, we will use an alternative measure, namely “oil and gas value” available from Ross (2013) which is simply the quantity of oil and gas extracted in a given year

22Wilson, Cobham, and Goodall (2014) described the new ICTD data as follows: “The dataset meticulously combines data from several major international databases, as well as drawing on data compiled from all available International Monetary Fund (IMF) Article IV reports. It achieves marked improvements in data coverage and accuracy, including a standardized approach to revenue from natural resources.” 23Efforts to measure tax evasion in developing countries have always involved some measure of the shadow or underground economy. To construct such measures a common variable that has been used is overall tax revenues themselves. Thus, rather than considering how compliance impacts total tax revenues, which is our focus, such studies assume instead that taxation leads to departures from the formal economy. Our concept of compliance here is closer to the measures calculated by the Internal (IRS) and widely discussed in academic circles. Unfortunately, such measures are not available internationally. 20 | MOHTADI et al. multiplied by the per-unit­ world price. Ross cleaned and compiled data from the U.S. Geological Survey, the U.S. Energy Information Administration's International Energy Statistics, the World Bank, and the BP Statistical Review. This dataset covers 172 countries, of which 95 have produced oil, from 1932 to 2011. Due to its approximate independence from the self-­reporting aspect of the pure WDI measure, this measure largely circumvents the above endogeneity issue and yet is highly correlated with the WDI measure. Furthermore, by avoiding the thorny challenge of estimating extraction costs, this measure covers nearly all countries. Most of the other variables are from WDI, but democracy data is from Polity IV (2013). The data coverage is 1980–2010 for approximately 150 countries. The latter varies slightly depending on the variables used. Appendix G provides information on variables, data sources, and descriptive statistics. Finally, we need data to instrument for oil. Despite its many merits above, oil value still represents revenue and much has been written about its potential endogeneity to institutions. We will cover this issue later. Here we just mention the remedy and the data: We will combine two instruments for oil value; oil prices (in 2,000 constant prices) and oil reserves from the giant oil field discoveries data by Horn (2014) to create a single dynamic instrument. This approach has a number of advantages that we will discuss later. The appropriateness of using oil reserves is extensively discussed in Cassidy (2016). This type of data is also used in a recent paper by Arezki et al. (2017).

5. | ESTIMATION STRATEGY

We use GMM to estimate the structure of the model. Endogeneity concerns are of particular impor- tance and an entire subsection is devoted to them. Three types of equations are estimated with their different variants: The first type are equations that test the direct effect of oil revenues on taxes. We may call these reduced form equations since they precede testing any structural mechanism that underlie this relation. They estimate the effect of oil revenues on taxation and are basically a test of hypothesis H1:

� (t휇) = a +b 휌 +c g +d̃ ⋅X̃ +u +v +휀 it 1 1 it 1 it 1 it 1i 1t 1it (33)

The residuals, u1i and v1t are country and time fixed effects and ɛ1it is the regression error that in- X⃗ cludes unobserved heterogeneities; it is the column vector of controls (see below). Based on hypoth- esis H1 we would expect that b1 < 0. In addition if spending drives taxes then c1 > 0. Here, if either g or ρ are endogenous, the estimates of both b1 and c1 will be biased. This will be taken up shortly in a separate section dedicated to the discussion of endogeneity, the remedies for it, and the instruments used. The second and third types of equations test the structural mechanism underlying the reduced form relation. They join hypotheses H2 and H4, linking the behavior of the tax payers and the government in a Stackelberg game. Thus, for transparency and tax enforcement policies (hypotheses H4a and H4b) we have, T = a +b 휌 +c̃� ⋅X̃ +u +v +휀 it 2 2 it 12 it 2i 2t 2it (34)

L = a +b 휌 +c̃� ⋅X̃ +u +v +휀 it 3 3 it 13 it 3i 3t 3it (35) MOHTADI et al. | 21

in which we expect b2, b3 < 0; for tax payer response to policies (hypotheses H2a and H2b), we have,

� ln (t휇) = a +[b ln T̂ ]+[c ln L̂ ]+d g +f̃ ⋅X̃ +u +v +휀 it 4 4 it 4 it 4 it 4 it 4i 4t 4it (36)

Equations (34) and (35) are each tied to Equation (36) by the fact that the predicted values of L and T on the right hand side of 36 come from 34 and 35, following the theoretical model in which tax payers in Equation (36) “react” to the government's choice of T and L in Equations (34) and (35). The ̂ ̂ logarithmic specification for tμ, T and L in 36 which is equivalent to estimating the generic form m Y = Z has two distinct advantages over the more common practice of using a quadratic specification. ̂ First the estimates of b4 and ĉ4 in 36 are free of units of measurement. This is particularly important for the measure of T and L where the choice of units would otherwise affect the size of the coefficients. Second, per hypothesis, tax payer compliance always increases in T and L (albeit at a diminishing rate) but never decreases. A quadratic specification would pose additional burden to support (or reject) this 24 hypothesis. In accordance with this, we expect 0 < b4, c4 < 1. The brackets in 36 is meant to indicate variations in the regression model with only one, or both, of the two variables, T or L, to be included in the regression. The presence of git in 36 is meant to pin down the tax rate t in tμ (recall that g drives t) so that tμ really signifies compliance not revenue. To check this we estimate 36 both with and without git. In the empirical section, we will show that even if controlling for government expenditures does not control ̂ for the variations in tax rate, t, it would still be the case that the condition 0 < b4,ĉ4 < 1 in Equation (36) would constitute the necessary and sufficient condition in support of the tax compliance hypothesis.

5.1 | Controls GDP per capita is our first control. In their questioning of the validity of the resource curse hypoth- esis, Alexeev and Conrad (2009) have argued that a positive resource shock leads to a boost in GDP per capita, artificially biasing subsequent growth rates downward. Similarly, they argue, such a boost places oil producers in the class of richer countries with better institutions so that by comparison re- source rich countries appear as though they have worse institutions, a result less due to oil curse than a boost in national income. This argument would implicate our transparency, rule of law and taxation equations, the latter because tax revenue is also measured relative to GDP and suffers the same fate as growth rate. Alexeev and Conrad's proposed instruments are static (e.g., geographic factors such latitude) that would drop out in any fixed effects panel model. In the absence of a strong dynamic instrument for GDP per capita, the GMM method comes to our aid where lagged GDP per capita can be used to instrument for current GDP per capita. Polity2, which measures democratic freedoms, is our second control. More democratic regimes tend to be more transparent. But as the components of the measure of Polity2 suggest, Polity2 is about more than transparency and covers many other aspects of society and polity. Controlling for Polity2 allows for the residual variations in transparency that are unrelated to polity but may be potentially related to a positive oil shock to be examined. The inclusion of Polity2 also controls for “the Norway factor”, i.e., for non-­idiosyncratic variations due to countries’ high quality institutions. (The further

24In a quadratic specification, with the coefficients of the linear term and quadratic terms positive and negative respectively, to establish that tax payer compliance never falls with T and L one would have to establish that the peak of the resulting parabola occurs outside of the range of support of T and L, requiring one to compare the size of the linear term and quadratic terms. These additional calculations are not needed in our case. 22 | MOHTADI et al. possibility that Polity2 may be subject to reverse causality from transparency is addressed later in the GMM estimation). Finally, should we suspect that democratically elected governments may be more responsive to their citizens by spending differently (or more) on public goods than otherwise, Polity2 can control for variations in the variable g that are unrelated to the role of public spending in our model. A variable called Trade taxes is our next control. This variable addresses the potential influence that the presence of a weak income tax environment might have on a government's decision to, say, increase its extraction of oil reserves to earn more revenue. Since trade taxes are easier to impose and collect in many developing economies with weak income tax environments, trade taxes would be expected to also be higher in such an environment. Our selection of a strong instrument for oil means that variations in oil revenues will be unrelated to any other source of variation in the model including trade taxes. Nonetheless, this issue will be examined. By including and excluding trade taxes in our regressions, we will be able to see whether or not trade taxes actually influence the size of the coeffi- cients of the oil revenues.

5.2 | Endogeneity issues In what follows, we first discuss some of the potential endogeneity issues that may arise in different parts of the model and offer remedies. In all cases, except one, the remedies are based on using GMM methodology. The appropriateness of using GMM in these cases is extensively discussed later. In the case of one variable, oil revenue, we construct a unique dynamic composite instrument that takes advantage of exogenous variations in oil reserves (Horn, 2014) and oil prices in an IV estimation embedded in the GMM methodology.

5.2.1 | Oil revenues Possible endogeneity of oil revenues is the most critical issue, both in our paper and in the literature. In our paper, the specific points where this issue can possibly arise are as follows: In the relation- ship between oil and taxation, it is possible to imagine that the need for more tax revenues drives the decision to extract more oil so that taxation could also drive oil revenues. In the equation that relates oil revenues to transparency a potential endogeneity of oil revenue arises since countries that are more transparent may provide a better environment for foreign investments in oil (and other natural resources), creating a reverse causality from transparency to oil revenues. Bohn and Deacon (2000) found that democratic institutions and political stability positively affect investments in oil explora- tion. Cust and Harding (2015) showed that when oil is potentially located on a national border, 95% more exploration occurs in the country with relatively better institutions. As stated, we address the endogeneity of oil in both equations by the above mentioned composite instrument. We will discuss this instrument in greater detail shortly below.

5.2.2 | Public spending In the oil-­taxation equation, public spending g appeared as a variable. The model assumed that g was exogenous to, and a driver of, taxation tμ (Equation 18). This assumption may be questioned. For example, one can imagine that the expectation of higher tax revenues may drive officials to spend more. To maintain consistency with the theory, we will first estimate our model with an exogenous g. We then follow this with a robustness test by letting g to be endogenous. Our main results turn out to remain robust in either case. MOHTADI et al. | 23 In the transparency or tax enforcement versus compliance equation, there may be another source of endogeneity of g. Recall that in deriving the relation between the government's choice of T* and L* and the citizens’ tax compliance, government level of spending g was assumed to be exogenous to its decision on T* and L*. But in the fiscal capacity model of Besley and Persson (2009, 2010, 2011) when institutions are relatively more “cohesive,” a “common-­interest” state is more likely to emerge. Such a state probably spends more on public goods, g. If greater cohesion also implies greater transparency because such a state is less threatened by disclosures due to greater public trust, then the unobserved variable “cohesion” is correlated with both g and T, leading to the endogeneity of g. Once again, we address this issue by allowing g to be endogenous in a robustness test of the oil-­transparency equation, similar to the oil-­taxation equation above. Further to this point, if g were to influence the government's decision of T* and L* such that more g would buy the government less transparency (or rule of law), controlling for g in the tax compliance equation should lead to a smaller effect of T* and L*. To test this, we can simply include and exclude g and compare the results. We will find evidence against g being a “strategic” variable in the above sense. Finally, the theory also allowed for g to either depend on oil revenues ρ or not (Equation 20). On this point the literature is conflicted. For example, Cassidy (2016) argued that resource windfall reduces the marginal utility of public goods to policy-makers,­ causing them to invest less in fiscal capacity, though he does not provide evidence while Ross (2012) finds than more oil revenue may lead to increased public spending. Empirically, in our test, this issue is not an endogeneity problem but merely a colinearity issue. This is addressed by including and excluding variable g in the empirical test of the relation between oil and taxation. While colinearity does lead to estimation bias, the direction of this bias is to underestimate the negative effect of oil on taxation, if more oil drives more spending as Ross (2012) suggests, as can be also seen from Equation (20) in the model. We find this to be the case.

5.2.3 | Polity2 The taxation-representation­ argument discussed in the introduction implies that the need for taxation by the government may actually drive more open political systems so that taxation may also influ- ence the Polity2 variable. There is also a possibility of reverse causality in the relation between polity and transparency: one might ask whether it is democratic institutions that influence transparency, or that greater transparency induces more democratic institutions. To address both of these concerns, throughout the estimation, the variable Polity2 is allowed to be endogenous.

5.2.4 | Per capita income The use of per capita income as a control must be carefully examined. Several studies suggest that oil revenues boost income (Alexeev & Conrad, 2009; Arezki et al., 2017; Smith, 2015). While static geography instruments such as absolute latitude and regions of the world have been used for example by Alexeev and Conrad (2009) such instruments amount to fixed effects and drop out in any dynamic or fixed effects estimation technique. As we discuss below, The GMM procedure offers an alternative.

5.3 | Remedies 5.3.1 | Instrument for oil We begin with the most critical endogeneity issue that of oil revenues. One approach to this concern has been to use oil price fluctuations as an identification strategy (Andersen, Johannesen, Lassen, & 24 | MOHTADI et al. Paltseva, 2014; Brückner, Ciccone, & Tesei, 2012; Carreri & Dube, 2015; Caselli & Tesei, 2016; Dube & Vargas, 2013). While Price does offer the possibility of exogeneity (except perhaps Saudi Arabia), it lacks cross-country­ variations since the use of price shock ignores revenue variability which depends on quantity of oil extracted and does not reflect the countries’ differences in endow- ments. Cassidy (2016) uses geologically determined oil basins to generate cross-­country variation in oil wealth. We also use this variable. But this instrument lacks a time dimension. To address both problems, we combine both instruments to create a single instrument which we then use in an instru- mental variable estimate (IV) embedded in the GMM procedure (see below).

5.3.2 | The GMM method One major problem that plagues the use of 2SLS Instrumental Variable (IV) method is the validity of exogenous instruments. Given the nature of what we investigate, it is very difficult to identify instru- ments that satisfy the valid instrument criteria such that they can be used in the two-step­ procedure of the IV method. The only valid dynamic exogenous instrument we can make a case for is the instru- ment for oil reserve value, as discussed above. To remedy this issue, we adopt a static and a dynamic panel System GMM approach that is suitable to address the above challenge. The methodology is a culmination of advances by Arellano and Bond (1991), Arellano and Bover (1995), Blundell and Bond (1998), and Holtz-Eakin,­ Newey, and Rosen (1988). The system GMM approach is ideal given the potential endogeneity of oil value and other variables as discussed. It assumes that the only available instruments are “internal”, i.e., based on lags of the instrumented variables while also simultaneously allowing for the introduction of external instruments in an embedded IV procedure (i.e., our composite oil reserve instrument), so we have the best of both worlds. The flexible nature of GMM allows us to simultaneously correct for a number of econometric shortcomings embedded in an analysis such as this, including heteroske- dasticity, autocorrelation, finite samples, and the presence of country fixed effects. This approach also readily provides tests for serial correlation, overidentification, and under-identification­ of the instrument sets used. Suppose, � yi,t = yi,t−1 +Xi,t +i,t with

i,t = i +i,t

Eliminating the fixed effects by the usual differencing methods exacerbates the problem of miss- ing variables, drastically reducing a dataset with missing data. Arellano and Bover (1995) suggested an alternative method of transformation that minimizes the loss in data called forward orthogonal deviations. In place of taking the first difference, the forward orthogonal deviations transformation subtracts the average of all future available observations of a variable so no matter how many gaps exist in the dataset, observations will exist for all entries except the last observation for each individual using this transformation, while still eliminating the fixed effects. This way, data loss is minimized and because lagged variables are not part of the transformation process, they are still valid instruments (Roodman, 2009). If ω is a variable, then its transformation is,

trans 1 휔it+1 = ci,t 휔i,t − wi,s Ti,t s>t   MOHTADI et al. | 25 c T T where i,t = i,t∕ i,t+1 and where Ti,t is the number of available future observations. The choice of ci,t ω here preserves√ the properties of i,t after transformation (Roodman, 2009). Denoting the orthogonally transformed variables by “*”, the following equation is estimated: ∗ ∗ ∗� ∗ yi,t = yi,t−1 +Xi,t  +i,t

To estimate the equation above, we use the xtabond2 command with the robust option and one step estimation in STATA which corrects for heteroskedasticity and auto correlation; the “small” option which corrects for finite sample regressions and generates t-­stat and F-­stats instead of z and chi square statistics; the “orthogonal” option which uses forward orthogonal deviations transforms instead of first differencing while simultaneously taking care of fixed effects across countries and finally, the IV option to use our only exogenous instrument: reserve value (estimated reserves × oil price). The GMM option is also used to instrument for other variables using lags as is the standard case in GMM regressions. The AR(1) and AR(2) tests for serial correlation with p stats greater than 0.05 suggest no serial correlation at lags 1 and 2. Note that when we introduce the lag of the dependent variables in the regressions, the specification indicates serial correlation of order (1) so the test should show p values for AR(1) less than 0.05 (Roodman, 2009). Finally, tests of overidentification are carried out for each regression using the Hansen Statistic as this is known to be the more appropriate test for our system GMM estimation. The null hypothesis associated with this statistic is that the instruments are valid so that large p values (e.g., greater than 0.05), suggest that we cannot reject the null, i.e., instruments are valid.

6. | RESULTS 6.1. | The direct effect of oil on taxation Table 1 shows the direct effect of oil on tax revenue per Equation (33) with the first four columns representing static estimation and columns 5 through 8 representing dynamic panel estimation, i.e., with lagged dependent variable added, as shown here:

� � � d́ �� ⋅X̃ � � � (t휇)it = a1 +l1(t휇)it−1 +b1휌it +c1git + 1 it +u1i +v1t +휀1it (37)

̂ ̂ Without exception, Table 1 corroborates the first hypothesis (H1): i.e., b1 < 0 and b1 < 0 and sig- nificant through different permutations in all the columns of Table 1. This is in line with the literature cited earlier that oil revenues negatively impact tax revenue collection. In addition, in three out of four specifications, higher public spending drives greater tax collection as expected: i.e., ĉ1 > 0, ĉ1 > 0. As for the role of government spending itself, we have mentioned that Ross (2012) found that oil producers spend more (not less) perhaps to elicit greater constituent loyalty, while Cassidy (2016) argued the reverse. Table 1 finds support for the Ross view. This also verifies the derivative in Equation (20) and Proposition 2. Specifically, Equation (20) showed that if gρ > 0, i.e., if oil produc- ∗ ∗ ers spend more to elicit greater constituent loyalty then d(t휇 )∕d휌 g=ḡ < d(t휇 )∕d휌 g=g(휌) < 0, that is, controlling for g should mean a more negative effect of oil revenue on taxation. Indeed this is the case: when the difference in the estimates comes from controlling versus not controlling for g, the static model in Table 1 (column 3) shows significantly more negative oil coefficient than the respective coefficient in column 2; in the dynamic model of the same table, the difference is even more stark: the oil coefficient in column 7 is twice as large as its counterpart in column 6. This indicates that gρ > 0. 26 | MOHTADI et al. 0.398 2,126 134 2,244 1 0 (0.145) (0.003) 0.001 (0.003) 0.005 (0.006) −0.010 −0.005* (0.124) (0.021) −0.396*** 0.973*** (0.010) 0.039* (8) 0.353 1,981 138 2,328 1 0 (.) (0.003) 0.005** (0.003) 0 −0.004 (0.110) (0.014) −0.445*** 0.970*** (0.01) 0.035** (7) 1,678 138 2,365 0.389 1 0 (0.142) −0.116 (0.003) −0.002 (0.083) −0.261*** (0.013) 0.975*** (0.008) 0.028** (6) 1,338 145 2,482 0.471 1 0 (.) 0 (0.069) −0.235*** (0.013) Dynamic estimation (with first lag) 0.978*** (0.008) 0.023* (5) 1,963 134 2,394 0.065 1 0.256 (1.685) −0.028 0.125*** (0.033) (0.065) −9.868*** (0.045) 0.031 (1.635) −6.543*** (0.227) 1.507*** (4) 1,705 138 2,488 0.323 1 0.290 (1.810) 0.109*** (0.032) −8.438*** (0.046) 0.066 (1.681) −6.109*** (0.248) 1.351*** (3) 1,335 138 2,531 0.135 0.237 1 (1.960) −6.986*** (0.049) 0.108** (2.085) −4.580** (0.276) 1.336*** (2) G. *,**,***Significant at 10%, 5%, and 1%. Standard errors in parentheses. All regressions were carried out using dynamic panel system GMM methodology 929 145 2,656 0.328 0.459 1 (2.382) −8.537*** (2.406) −6.018** (0.323) 1.609*** Static estimation (no lag) Tax revenue (1) Oil and tax revenue 2nd order 1st order test ( p values) taxes No of instr. Countries Observations Serial correlation ( p values) Hansen specification Gov. cons International trade Constant Polity2 Oil value GDP PC Lag of tax revenue TABLE 1 Notes . For variable measures see Appendix (xtabond2 command on stata), errors are corrected for heteroskedasticity and autocorrelation all columns include time fixe d effects. Hansen overidentification p values greater than 0.05 imply that we cannot reject the null hypothesis which states that instruments used are valid. The first four columns capture direct effect of oil value on tax revenue, last do same but int roduce dynamics by having the lag of dependent variable as part regressors. For all variations, we see that oil has a consi stently strong and significantly negative effect on tax revenue generation, con - firming other works in the literature. MOHTADI et al. | 27 TABLE 2 Indirect effect stage 1: From oil to transparency and rule of law

Transparency (matching Transparency Rule of law sample)

(1) (2) (3) (4) (5) (6) GDP PC 0.0520*** 0.0473*** 0.0858*** 0.0883*** 0.0501*** 0.0424*** (0.00605) (0.00633) (0.00834) (0.00757) (0.00870) (0.00936) Oil value −0.170*** −0.184*** −0.273*** −0.309*** −0.143** −0.166*** (0.0563) (0.0655) (0.0963) (0.0940) (0.0657) (0.0553) Polity2 0.00774*** 0.00649*** 0.00755*** 0.00564** 0.00922*** 0.00743*** (0.00131) (0.00110) (0.00247) (0.00251) (0.00222) (0.00222) Gov. cons −0.00239*** −0.00206*** 0.00198* 0.00261*** −0.00237** −0.00222*** (0.000621) (0.000771) (0.00102) (0.000981) (0.000975) (0.000804) International −0.00433** −0.000146 −0.00409 trade taxes (0.00213) (0.00182) (0.00284) Hansen 1 1 1 1 1 1 specification test (p values) Serial correlation (p values) 1st order 0 0 0.630 0.342 0.191 0.0146 2nd order 0.139 0.285 0.153 0.134 0.786 0.945 No of Instr. 1,888 2,170 1,093 1,187 1,096 1,184 Countries 154 149 156 150 154 148 Observations 4,075 3,111 1,776 1,424 1,752 1,406

Notes. For variable measures see Appendix G. *,**,***Significant at 10%, 5%, and 1%. Standard errors in parentheses. For columns 1, 2, 5, and 6, the addition of lags of the dependent variable does not eliminate serial autocorrelation. We thus cluster the errors by the panel to generate more efficient estimates despite the presence of autocorrelation (see Wooldridge 2002). All regressions were carried out using dynamic panel system GMM methodology (xtabond2 command on stata), errors are corrected for heteroskedasticity and au- tocorrelation and all columns include time fixed effects. Hansen overidentification p values greater than 0.05 imply that we cannot reject the null hypothesis which states that instruments used are valid. All columns highlight the first stage of the indirect effect of oil on tax revenue by capturing the effect of oil revenue on transparency and rule of law. Columns 5 and 6 repeat the regressions of columns 1 and 2 on transparency but match the observations in the rule of law sample as closely as possible to rule out sample effects. We see the nega- tive effect of oil on each variable of interest as hypothesized in the paper.

6.2 | Oil-­transparency and oil-­enforcement effects In light of the above evidence on the negative effect of oil revenues on taxation, we now test the explanation that this effect arises because oil abundance represses the government's desire for trans- parency and/or rule of law (path 1) which in turn represses the citizens’ tax compliance (path 2). We thus allow for equal treatment for the alternative mechanism of oil acting to reduce tax enforcement. First we test the validity of path 1. Results are given in Table 2. Because there is no direct systematic data on tax enforcement, we use the Rule of Law variable from WDI as a proxy. This may not be a major limitation because tax enforcement is in fact used as a components of the Rule of Law variable in WDI, and in any case the two are likely to be highly correlated. Thus, results on tax enforcement must be understood with this caveat in mind. Results for the oil coefficient in first four columns of Table 2 are negative and significant at 1%, supporting the view that oil subverts both transparency 28 | MOHTADI et al. ̂ ̂ and tax enforcement (as proxied by rule of law), i.e., b2,b3 < 0. This is consistent with the expectation in Equations (34) and (35) and the hypotheses H4a and H4b. Due to sample size differences between the sample that includes rule of law and the one that includes transparency, care is exercised to assure coefficient differences are not due to sample differences. For this reason, columns 5 and 6 repeat the transparency regressions of columns 1 and 2 for the sample that matches the observations of the rule of law. Results from the matched sample remain consistent with the original sample but lose some size and some significance, possibly due to loss of many observations in the matched sample. Also,

TABLE 3 Indirect effect stage 2: From transparency and rule of law to tax compliance (test of strategic policy on both variables)

Log of tax revenue

(1) (2) (3) (4) (5) (6) Lag of tax 0.943*** 0.946*** 0.959*** 0.959*** 0.947*** 0.943*** revenue (0.0123) (0.0119) (0.0226) (0.0224) (0.0256) (0.0256) GDP PC −0.0225 −0.0182 −0.0491*** −0.0354*** −0.0469** −0.0317* (0.0140) (0.0132) (0.0137) (0.0117) (0.0204) (0.0163) Log of 0.418*** 0.458*** 0.625*** 0.574*** transparency (0.0961) (0.114) (0.239) (0.204) Log of rule of 0.349*** 0.265*** law (0.0583) (0.0411) Polity2 −0.00702*** −0.00715*** −0.00662** −0.00436 −0.0105** −0.00788** (0.00230) (0.00241) (0.00281) (0.00276) (0.00462) (0.00356) Gov. cons. 0.00499*** 0.00449** 0.00112 0.000971 0.00664** 0.00597** (0.00189) (0.00184) (0.00103) (0.000973) (0.00302) (0.00257) International 0.00109 0.00779*** −0.000367 0.000309 0.000148 0.00551** trade taxes (0.00261) (0.00281) (0.00228) (0.00224) (0.00251) (0.00274) Hansen 1 1 1 1 1 1 specification test (p values) Serial correlation (p values) 1st order 0 0 0 0 0 0 2nd order 0.105 0.0807 0.126 0.125 0.188 0.0849 No of Instr. 2,103 2,103 984 984 984 984 Countries 133 133 128 128 128 128 Observations 2,215 2,215 1,030 1,030 1,030 1,030

Notes. For variable measures see Appendix G. *,**,***Significant at 10%, 5%, and 1%. Standard errors in parentheses. All regressions were carried out using dynamic panel system GMM methodology (xtabond2 command on stata), errors are corrected for heteroskedas- ticity and autocorrelation and all columns include time fixed effects. Hansen overidentification p values greater than 0.05 imply that we cannot reject the null hypothesis which states that instruments used are valid. We use the lag of tax revenue here because the initial specification without lags fails the serial correlation test at the second lag. This resolves the second-order­ serial correlation for all col- umns reported. All columns capture the second path, i.e., the effects of transparency and rule of law on tax revenue as a proxy for tax compliance (see the text) with each column using estimates generated from corresponding columns of Table 2 (e.g., column 1 uses es- timates of transparency from column 1 of Table 2, etc.). This is done to test for the tax payers’ optimal response to the government's chosen level of transparency or tax enforcement (in turn impacted by oil revenue). We see that the coefficients of the logs of both transparency and rule of law are between 0 and 1 confirming concave (thus optimal) response of tax payers as per theory. MOHTADI et al. | 29 ( Continues ) 1 0.00 (0.00257) 0.00696*** (0.00164) 0.00788*** (0.00244) −0.00727*** (0.0123) −0.0301** (0.0441) (0.0230) 0.726*** (0.0951) −0.120*** 0.940*** (6) 1 0.00 (0.00245) −0.0000661 (0.00156) 0.00716*** (0.00258) −0.00654** (0.0130) −0.0314** (0.0550) 0.565*** (0.0762) −0.0911 (0.0235) 0.946*** (5) 1 0.00 (0.00230) 0.0000660 (0.000941) 0.000706 (0.00294) −0.00396 (0.0107) −0.0347*** (0.0393) −0.0167 (0.161) 0.261*** (0.0209) 0.960*** (4) 1 0.00 (0.00240) −0.000671 (0.00103) 0.000985 (0.00298) −0.00655** (0.0124) −0.0493*** (0.0578) −0.0728 (0.137) 0.354*** (0.0211) 0.960*** (3) 1 0.003 (0.00267) 0.00498* (0.00207) 0.00563*** (0.00279) −0.00668** (0.0176) 0.134 (0.103) −0.0547*** 0.515*** (0.127) (0.0207) 0.940*** (2) 1 0.003 (0.00274) 0.000329 (0.00216) 0.00614*** (0.00287) −0.00700** (0.0171) 0.159 (0.104) −0.0572*** 0.466*** (0.107) (0.0202) 0.939*** Log of tax revenue (1) Indirect effect stage 2: From transparency and rule of law to compliance (robustness test): Testing strategic choice transpa rency (rule law) with 1st order test ( p values) taxes Serial correlation ( p values) Hansen specification International trade Gov. cons. Polity2 Rule of law Transparency GDP PC Log of transparency Log of rule law Lag of tax revenue (transparency) as control) TABLE 4 30 | MOHTADI et al. 3, 1,030 128 1,001 0.1 (6) order serial correlation for all columns reported. 1,030 128 1,001 0.114 (5) 0.05 mean we cannot reject the null hypothesis which states that instruments > 1,017 126 990 0.130 (4) 2 since including rule of law as a control already reduces the sample). Unlike Table 2, etc.). This is done to test for the tax payers’ optimal response government's chosen level of transparency 1,017 126 990 0.138 (3) 1,030 128 985 0.914 (2) G. *,**,***Significant at 10%, 5%, and 1%. Standard errors in parentheses. All regressions use dynamic panel system GMM (xta bond2 command on stata), 2.) We see that the coefficients of logs transparency and rule law fall between 0 1 confirming concavity. For col umns 5 6, estimates transpar - 1,030 128 985 0.893 Log of tax revenue (1) (Continued) 2 (e.g., column 1 uses estimates of transparency from Table 2nd order Observations Countries No of instr. TABLE 4 Notes . For variable measures see Appendix errors correct for heteroskedasticity and autocorrelation; all columns include time fixed effects. Hansen overidentification p are valid. We use the lag of tax revenue because initial specification without lags fails serial correlation test at sec ond lag. This resolves second- ­ All columns capture the second path, i.e., effects of transparency or rule law on tax revenue as a proxy for complia nce (see text) with each column using estimates generated from corresponding columns of Table or tax enforcement (in turn impacted by oil revenue). (We do not use the “matching sample” columns of Table ency and rule of law from stage 1 are both used to see which effect dominates when optimally chosen by the government. We that transparency clearly dominate s as its coefficient is between 0 and 1 significant. however, columns 1 and 2 have rule of law as a control for transparency 3 4 law. (In such cases, the takes its value from actual data, rather than its predicted value in Table MOHTADI et al. | 31 note that greater democracy, as captured by Polity2, is associated with more transparency and more rule of law.

6.3 | Effects of transparency and rule of law on tax compliance We now examine the hypothesis that tax payers’ compliance behavior is an optimal response to government transparency and/or enforcement of taxes (the latter indicated by rule of law). Results are reported in Tables 3 and 4 with transparency and the rule of law showing up in log forms, consistent with Equation (36) (or Equations 24 and 25 from the model) and the “increasing but diminishing returns” behavior of tax payers’ response to T and L. The difference between the two tables is that (a) in Table 3, T and L show up separately, whereas in Table 4 (last two columns), they also show up together and (b) in the first 4 columns of Table 4 the log form of each vari- able is accompanied by the non-log­ form of the other variable. This is to “control” for the role of one variable when examining the optimal response of the other. Because compliance is supposed to capture tax payer response to the government's chosen levels of transparency or rule of law (first mover), it is the predicted values of transparency and rule of law from Table 2 that are used Tables 3 and 4. Thus, each column uses the values of transparency or rule of law as predicted from the corresponding columns of Table 2. Results indicate that the response of tax payer compliance to transparency and rule of law is highly significant, positive and diminishing, confirming hypoth- esis H2. This means that our estimates of the corresponding coefficients in Equation (36) are in ̂ fact as expected: 0 < b4,ĉ4 < 1. However, notice that despite both variables, transparency and rule of law, being normalized and having same scale, transparency is the stronger and the more robust ̂ ̂ of the two: First, b4 > b3 in all cases, i.e., the taxpayers respond more strongly to transparency than to they do to rule of law. Second, when both variables are introduced simultaneously (columns 5 and 6 of Table 4), rule of law loses all significance and falls outside of the range (0,1). The fact that the these results are obtained with Polity2 being controlled for throughout different variations, gives us greater confidence that we are actually capturing something about transparency and not democracy per se. Table 5 allows us to examine an important assumption of the model. It repeats the specification of Table 3 but excludes g: To explain, we have assumed that government spending decisions drive the “tax rate,” broadly defined (see theory section for details). If so, this would mean that by controlling for the variations in g in our regressions, we in effect control for the variations in t in the expression tμ*(t, T, L). This way, our dependent variable, which is tax revenue as a share of national income in Tables 3 and 4, would be a good proxy for underlying compliance. (In fact Proposition 4 showed that ∗ a positive d(tμ*)/dT indicates a positive d휇 ∕dT > 0.) We will now see if this argument really holds. First note that in the absence of g, the variations in the implied tax rate t in the compliance expression μ*(t, T, L) are no longer controlled for. Further, note that tax rate t and compliance rate μ move in opposite directions (see Equation 15). Focusing on transparency first, we have d(tμ*)/dT = t·dμ*/dT + μ*·dt/dT. Since dt∕dT < 0 (Lemma 1), it follows that d(tμ*)/dT < t·dμ*/dT. If controlling for g is a proxy for controlling for t, the left hand side of this inequality would be the coefficient of the trans- parency variable when g is not controlled for, while the right hand side is the slope of the compliance variable μ* times t when g is controlled for. Following this logic, Table 5, which does not control for g, should provide the estimates of d(tμ*)/dT while Table 3, which controls for g, would give estimates of dμ*/dT. The difference is of course the tax rate t. But we also know that t·dμ*/dT < dμ*/dT. Putting the two inequalities together, we find, d(tμ*)/dT < dμ*/dT. This last inequality therefore constitutes a necessary (but not a sufficient) condition for our argument (i.e., its reverse would violate our argu- ment). Comparison of the respective coefficients of ln(T) between the two tables strongly confirms 32 | MOHTADI et al. TABLE 5 Re-­estimate of Table 3 without government expenditures

Log of tax revenue

(1) (2) (3) (4) (5) (6)

Trans_1 Trans_2 Rol._3 Rol._4 Trans_5 Trans_6 Lag of tax 0.958*** 0.956*** 0.950*** 0.951*** 0.962*** 0.958*** revenue (0.0104) (0.0102) (0.0234) (0.0233) (0.0218) (0.0233) GDP PC 0.000892 0.00341 −0.0409*** −0.0243* −0.0144 −0.000484 (0.00784) (0.00815) (0.0124) (0.0127) (0.0128) (0.0136) Log of 0.172*** 0.184** 0.204*** 0.205*** transparency (0.0493) (0.0704) (0.0776) (0.0759) Log of rule of 0.315*** 0.236*** law (0.0437) (0.0296) Polity2 −0.00435** −0.00370** −0.00519** −0.00358 −0.00220 −0.00356 (0.00170) (0.00181) (0.00240) (0.00257) (0.00250) (0.00287) International 0.00263* 0.00485*** 0.00112 0.00208 0.00238 0.00546* trade taxes (0.00156) (0.00170) (0.00273) (0.00285) (0.00266) (0.00305) Hansen 1 1 1 1 1 1 specification test (p values) Serial correlation (p values) 1st order 0 0 0 0 0 0 2nd order 0.133 0.135 0.135 0.0965 0.117 0.119 No of instr. 1,930 1,929 933 933 933 933 Countries 133 133 128 128 128 128 Observations 2,215 2,215 1,030 1,030 1,030 1,030

Notes. For variable measures see Appendix G. Standard errors in parentheses. *,**,***Significant at 10%, 5%, and 1%. Standard errors are in parentheses. All regressions were carried out using dynamic panel system GMM methodology (xtabond2 command on stata), errors are corrected for heteroskedasticity and autocorrelation and all columns include time fixed effects. Hansen overidentification p values greater than 0.05 imply that we cannot reject the null hypothesis which states that instruments used are valid. This table replicates all columns in Table 3 above but omits government expenditure on the right hand side. We use the lag of tax revenue here because the initial specification without lags fails the serial correlation test at the second lag. This resolves the second-­order serial correlation for all columns reported. As in Table 3, this table is meant to capture the “second path” (effects of transparency and rule of law on tax revenue) using estimates from Table 2 as a way of testing for the tax payers’ optimal response. (See Table 3 for more detail.) We see that the coef- ficients of the log of transparency and log of rule of law are between 0 and 1 indicating concavity. The coefficients are lower than their counterparts in Table 3 when government expenditure was included. that this condition is satisfied: the coefficients in Table 3 which represent dμ*/dT are anywhere from 1.5 times to 3 times larger than those in Table 5 which represent d(tμ*)/dT. Focusing next, on the rule of law, L, we find the same pattern except that the differences are much smaller. For robustness check of the above, a comparison of Tables 6 and 4 repeats the comparison of Tables 5 and 3. Thus, Table 6 repeats the pattern of Table 4 but without government spending g. The differences in the respective coefficients of transparency between the two tables are even more dramatic with the coefficients in Table 4 ranging anywhere from 2.5 times to nearly 4 times those in Table 6. Similar differences in the coefficients of the two tables are found with respect to the rule of law, but again the differences are somewhat smaller. MOHTADI et al. | 33 TABLE 6 Re-­estimate of Table 4 without government expenditures

Log of tax revenue

(1) (2) (3) (4) (5) (6) Lag of tax 0.955*** 0.955*** 0.954*** 0.952*** 0.959*** 0.957*** revenue (0.0206) (0.0220) (0.0223) (0.0221) (0.0221) (0.0214) Log of 0.185*** 0.213*** 0.205*** 0.252*** transparency (0.0482) (0.0601) (0.0348) (0.0465) Log of rule of law 0.296*** 0.227*** 0.0858 0.0294 (0.0650) (0.0362) (0.0654) (0.0476) Rule of law 0.304*** 0.294** (0.113) (0.114) Transparency −0.092 −0.059 (0.068) (0.07) GDP PC −0.048*** −0.044** −0.0335** −0.0243** −0.0230 −0.0120 (0.018) (0.019) (0.0138) (0.0115) (0.0164) (0.0146) Polity2 −0.004 −0.005* −0.00495* −0.00284 −0.00478* −0.00385 (0.003) (0.003) (0.00258) (0.00267) (0.00270) (0.00236) International 0.002 0.004 0.00182 0.00213 0.00279 0.00556** trade taxes (0.003) (0.003) (0.00306) (0.00296) (0.00286) (0.00272) Hansen specifica- 1 1 1 1 1 1 tion test (p values) Serial correlation (p values) 1st order 0.001 0.003 0.00 0.00 0.00 0.00 2nd order 0.726 0.733 0.142 0.0868 0.146 0.146 No of instr. 940 940 975 975 985 985 Countries 128 128 126 126 128 128 Observations 1,030 1,030 1,017 1,017 1,030 1,030

Notes. For variable measures see Appendix G. Standard errors in parentheses. Errors corrected for MENA clustering and year-fixed­ effects. *,**,***Significant at 10%, 5%, and 1%. All regressions were carried out using dynamic panel system GMM methodology (xtabond2 command on stata), errors are corrected for heteroskedasticity and autocorrelation and all columns include time fixed effects. Hansen overidentification p values greater than 0.05 imply that we cannot reject the null hypothesis which states that instruments used are valid. This table replicates all columns in Table 4 above but omits government expenditure on the right hand side. We use the lag of tax revenue here because the initial specification without lags fails the serial correlation test at the second lag. This resolves the second-­order serial correlation for all columns reported. As in Table 4, this table is meant to capture the “second path” (effects of trans- parency and rule of law on tax revenue) using estimates from Table 2 as a way of testing for the tax payers’ optimal response (see Table 4 for more detail). Also, as in Table 4, in columns 5 and 6 neither regressor acts a “control.” This is done to see which effect domi- nates when both are optimally chosen by the government. Results from these two columns show that transparency seems to be the clear “winner” as it remains highly significant, whereas rule of law loses its significance entirely. Further, note that the coefficients for transparency are lower here than their counterparts in Table 4 when government expenditure is included.

We argued in the theory section that the government choice of g does not affect its strategic decisions on T* and L*, which we have proxied by rule of law. We now examine this assumption. Before we do, observe that if more public spending were to be used to solicit citizen loyalty, it would act as a substitute for the government offering less transparency or rule of law, especially 34 | MOHTADI et al. the former. Many rich oil producers seem to spend lavishly on their economy but with little transparency. This would mean a positive correlation between g, on one hand, and T and L on the other so that controlling for the level of g by including it in the regressions should yield smaller coefficients of the effect of T and L on tax compliance that excluding g. Comparison of Tables 3 and 5 indicate the precise opposite. Thus, while more spending does occur with more oil reve- nue, verifying Ross (2012), g does not seem to be a strategic substitute to “buy” citizen goodwill.

6.4 | Endogenous g The design and discussion of the empirical model in Section 5.1 led to two different issues involving g and thus two different ways of treating g. One set of issues was addressed by including and excluding g from a regression model as we have done. The second set of issues involved possible endogene- ity of g. So far we have assumed g to be exogenous in loyalty to the theoretical model. Now we will relax that assumption. An appendix, available from the authors reestimates the above tables with g as endogenous. None of our results are affected by this. The fact that allowing g to be endogenous main- tains the structural integrity of the results and by implication the model, suggests that a more general model in which g is endogenous would also work. We have avoided the additional complexities of such a model for greater parsimony.

6.5 | Other effects 6.5.1 | Per capita income and Polity2 Richer and more democratic countries experience higher tax revenue share of their national incomes (Table 1) as well as greater transparency and rule of law (Table 2). Neither outcome is surprising. What is interesting is that both these effects are actually captured in the transparency and the rule of law variables so that when the predicted values of the latter are used in Table 3, Polity2 and per capita income exhibit a reverse effect: the residual effect of being rich and more democratic on tax compliance behavior of their citizens is actually negative, among countries that enjoy similar levels of transparency and the rule of law. (Tables 4–6 which replicate Table 3 patterns in different ways, show similar but slightly weaker tendency).

6.5.2 | Trade taxes We had argued that the inclusion of trade taxes addresses the potential influence that a weak income tax environment can lead to increased resource extraction by the government. Several implications follow: First the selection of a strong instrument for oil should alleviate concerns about the endo- geneity of oil revenues from this source. Hence, the inclusion or the exclusion of trade taxes should not influence the size of the coefficients of oil revenue too much. In Table 1, a comparison of col- umns 3 and 4, as well as 7 and 8, and in Table 2 comparison of columns 1 and 2, columns 3 and 4, and columns 5 and 6 verify that the size of the coefficients of oil revenues do not change very much whether trade taxes are included or excluded. The second and third implications of including trade taxes have to do with the direct influence of trade taxes themselves. These are interesting to ex- amine. Since trade taxes are easier to impose and collect in most developing economies with weak income tax environments, they would be expected to be higher in such environments. Thus, one should expect (a) negative correlation between trade taxes and income taxes; and (b) negative corre- lation between trade taxes and other factors that influence income tax compliance, i.e., transparency MOHTADI et al. | 35 and rule of law. Our findings are rather weak in both regards: Table 1 (columns 4 and 8) shows no significant effect of expectation (a) while Table 2 (column 2) verifies expectation; and (b) with respect to transparency only. Interestingly Table 3 (columns 2 and 6) tells us that once institutions (e.g., transparency) are controlled for, trade taxes do not act as a substitute but as a complement to income taxes. On the whole, however, the sporadic nature of these last sets of results should serve as a caution in overinterpreting them.

6.6. | Robustness test 1: Direct tax compliance Two types of robustness tests are conducted. First, we use a limited dataset that is available on tax compliance from the Fraser Chain Linked index (Gwartney, et al. 2013) for 2004–2010 to examine tax compliance behavior directly, rather than by inferring it from observed tax revenue data. The tradeoff is of course the very limited direct data on tax compliance, hence the reason for presenting it as robustness test rather than as our main results. Table 7 combines the structure of Table 3 in its first four columns with some aspects of Table 4 in its next four columns. We see that the coefficients of the log of transpar- ency in columns 1–2 are between 0 and 1 which shows nondecreasing concavity as the theory required, but the same cannot be said for rule of law (columns 3–4). Columns 5 and 6 introduce both variables as strategic policy choices but neither variable meets the criteria for optimality. But when columns 7 and 8 introduce a dynamic specification, we see that transparency exhibits the hypothesized concavity consistent with government optimal decision, but rule of law shows a negative coefficient. In short, the result from direct tax compliance behavior tends to vindicate our previous treatment of tax revenue as a proxy for tax compliance, both with respect to the sign and the size of the coefficients and the concavity that underlies government's optimal decision, especially with respect to transparency.

6.7 | Robustness test 2: Government accountability plus compliance In our second robustness test, we use data from Williams (2015) by replacing his Release of Information measure with his new accountability measure. We report accountability results as a robustness check rather than as main regressions in part because, as previously mentioned, accountability turns out to be highly correlated with the rule of law indicator (correlation coefficient about 0.73), understand- ably because to a large extent accountability almost by definition means abiding by the rule of law. It therefore appears that accountability may not be truly distinct from the rule of law. For this reason using accountability for our main results may undermine the case for rule of law when both vari- ables are present. The second reason for using accountability as a robustness check and not as main results, is because we observed a very high degree of persistence in this index (but not the transpar- ency index) over the years and especially from the mid 1990s on. Given this degree of persistence in the accountability measure we borrowed from the growth literature and opted for 3-­year averages for each country to create a dataset with more variability. But this also meant a large drop in the number of observations. For both these reasons, we have opted to report accountability results as robustness check rather than as main results. Results are reported in Tables 8–10. Table 8 is a rerun of the original Table 1 for the new dataset. The table confirms the adverse effect of oil wealth on tax revenue for this data as well. Tables 9 and 10 are the heart of the robustness check: Table 9 confirms the adverse role of oil wealth on both ac- countability and rule of law (as a proxy for tax enforcement). Table 10 examines the effects of both accountability and rule of law on tax revenue as well as on tax compliance. Results confirm the in- creasing and concave effects of accountability on either tax revenue or tax compliance, thus verifying the robustness of the previous findings with respect to our new accountability measure and our new 36 | MOHTADI et al. 1 (0.000351) 0.00876*** (0.000403) 0.00395*** (0.0004) −0.008*** −0.134*** (0.0295) 0.654*** (0.0471) (0.00255) −0.02*** 0.889*** (0.00361) (8) 508 110 0.419 445 0.818 1 (0.000215) 0.00312*** (0.000595) 0.00362*** (0.000753) −0.009*** −0.142** (0.0554) 0.702*** (0.0737) (0.00522) −0.03*** 0.878*** (0.00544) (7) 508 110 0.406 445 0.829 3 and 4 results with tax compliance replacing revenue. 1 (0.000743) −0.0122*** (0.000794) −0.0110*** (0.000643) −0.0089*** (0.0680) 2.004*** −1.384*** (0.0633) (0.0108) −0.151*** (6) 623 118 0.298 554 0.201 are valid. All columns here capture the second path, effect of transparency and rule 1 (0.000241) 0.00164*** (0.000788) −0.009*** (0.000900) −0.0159*** (0.0571) 1.805*** −0.992*** (0.0651) (0.00579) −0.126*** (5) 623 118 0.315 554 0.193 2 above, e.g., transparency in the first column represents natural log of estimates obtained 1 (0.000300) −0.005*** (0.000346) 0.00599*** (0.000512) −0.0163*** (0.0214) 1.093*** (0.00561) −0.127*** (4) 623 118 0.296 547 0.200 1 (0.000280) −0.005*** (0.000272) 0.00565*** (0.000989) −0.0251*** (0.0340) 1.238*** (0.00600) −0.145*** (3) 623 118 0.280 547 0.179 1 (0.000447) 0.00351*** (0.000208) 0.00549*** (0.000768) −0.005*** (0.0278) 0.280*** (0.00335) 0.0280*** (2) 623 118 0.335 547 0.244 G. *,**,***Significant at 10%, 5%, and 1%. Standard errors are in parentheses. This is a robustness check on Tables 1 (0.000224) 0.00246*** (0.000295) 0.00609*** (0.00116) −0.007*** (0.0551) 0.396*** (0.00515) 0.0212*** Log of tax compliance (1) 623 118 0.309 547 0.204 values greater than 0.05 imply that we cannot reject the null hypothesis which states instruments used p 2 above to highlight the fact that resulting concavity is coming from oil component part of transparency. We see t he coefficients log transparency in columns 1–2 are between decreasing concavity as the theory required, but same cannot be said for rule of law (columns 3–4). Columns 5 and 6 introdu ce both variables strategic policy choices neither variable Robustness test 1: Replacing tax revenue data with actual compliance . For variable measures see Appendix 1st order 2nd order test ( p value) taxes compliance Serial correlation ( p values) Hansen specification International trade Gov. cons. Polity2 Log of rule law Log of transparency GDP PC Lag of tax Observations Countries No of instr. All regressions were carried out using dynamic panel system GMM methodology (xtabond2 command on stata), errors are corrected f or heteroskedasticity and autocorrelation all columns include time fixed effects. Hansen overidentification TABLE 7 Notes of law on tax compliance with each column using estimates generated from corresponding columns in Table from column one in Table 0 and 1 which shows non- ­ meets the criteria for optimality. Columns 7 and 8 introduce a dynamic specification here we see that only transparency sho ws hypothesized concavity but rule of law shows negative coefficient. MOHTADI et al. | 37 TABLE 8 Robustness test 2: Repeating Table 1 for the new dataset comprised of accountability and tax compliance data

(1) (2) (3) (4)

Tax revenue GDP PC 1.702*** 1.399*** 1.380*** 1.538*** (0.0318) (0.00968) (0.0257) (0.0427) Oil value −4.362*** −3.442*** −4.180*** −3.924*** (0.122) (0.0443) (0.0999) (0.136) Polity2 0.0657*** 0.0549*** 0.0464*** (0.00173) (0.00272) (0.00380) Gov. cons 0.0926*** 0.0787*** (0.00238) (0.00298) International trade taxes 0.130*** (0.00774) Hansen specification test 0.304 0.654 1 1 (p value) Serial correlation (p values) 1st order 0.683 0.940 0.971 0.863 2nd order 0.973 0.821 0.817 0.556 No of instr. 99 143 187 231 Countries 145 138 138 135 Observations 963 920 904 872

Notes. For variable measures see Appendix G. *,**,***Significant at 10%, 5%, and 1%. Standard errors in parentheses. All regressions were carried out using dynamic panel system GMM methodology (xtabond2 command on stata), errors are corrected for heteroskedas- ticity and autocorrelation and all columns include time fixed effects. Hansen overidentification p values greater than 0.05 imply that we cannot reject the null hypothesis which states that instruments used are valid. This is a rerun of Table 1: The regressions had to be rerun given that the data has been converted into three year averages to accommodate the peculiarity in the accountability measure (see text). These columns capture the direct effect of oil value on tax revenue. For all variations, we see that oil has a consistently strong and sig- nificantly negative effect on tax revenue generation, confirming other works in the literature. tax compliance measure. What is interesting is that rule of law now loses its robustness, showing up as an insignificant driver of tax revenue and negatively significant driver of tax compliance. The robustness of transparency, whichever way it is measured, now shines even more by the contrast with the non-­robust behavior of its kin, the rule of law.

6.8 | An oil shock experiment We now ask a different question. Suppose that countries experienced a 10% positive oil shock. We would like to know how much their transparency, accountability and tax revenues decline. In Table 11 the left and right panels show the results for “high” and “low” oil dependent economies defined, respectively, as those with oil revenue of at least 10% and at most 2% of their national in- comes. The first three columns focus on changes in transparency, the second three columns on changes in accountability, and the last two columns on changes in the tax revenues due to the oil shocks via transparency and accountability, respectively. Note that, with a few exceptions, countries in the high oil dependent group generally experience larger percentage declines in accountability 38 | MOHTADI et al. TABLE 9 Robustness test 2: Repeating Table 2 with accountability data: from oil to accountability and rule of law (Indirect effect stage 1)

Accountability Rule of law

(1) (2) (3) (4) GDP PC 0.0335*** 0.0360*** 0.111*** 0.111*** (0.00115) (0.000950) (0.00124) (0.000920) Oil value −0.114*** −0.0479*** −0.206*** −0.220*** (0.000948) (0.00192) (0.00110) (0.00139) Polity2 0.0276*** 0.0289*** 0.0121*** 0.0114*** (0.000145) (0.000143) (0.000117) (0.000125) Gov. cons 0.00233*** 0.000975*** 0.00226*** 0.00243*** (0.0000284) (0.0000446) (0.0000280) (0.0000340) International trade taxes 0.00102*** 0.00253*** (0.000144) (0.000126) Hansen specification test 0.22 0.95 0.267 0.977 (p values) Serial correlation (p values) 1st order 0.062 0.107 0.302 0.964 2nd order 0.988 0.686 0.294 0.168 No of instr. 151 187 151 187 Countries 157 150 157 150 Observations 887 764 892 769

Notes. For variable measures see Appendix G. *,**,***Significant at 10%, 5%, and 1%. Standard errors in parentheses. All regressions were carried out using dynamic panel system GMM methodology (xtabond2 command on stata), errors are corrected for heteroskedas- ticity and autocorrelation and all columns include time fixed effects. Hansen overidentification p values greater than 0.05 imply that we cannot reject the null hypothesis which states that instruments used are valid. This is a rerun of Table 2: The columns capture the first stage effects, i.e., effect of oil on accountability and rule of law. For all variations, we see that oil has a consistently strong and signifi- cantly negative effect both. Thus, replacing transparency of Table 2 with accountability here does not change the stage 1 results, point- ing to their robustness. than in transparency while those in the low oil dependent group experience about the same decline in both. In terms of the ranges, transparency declines least in Norway by −2.33% (from 0.71 to 0.69) and most in the newly established country of Timor-­Leste by −5% (from 0.15 to 0.14). The decline in transparency in Saudi Arabia from 0.471 to 0.458 brings it closer to Cameroon with a pre-oil­ shock transparency of 0.454. The change in accountability is somewhat similar with Norway show- ing least decline (−2.15%) and Syria showing the largest (−8.98%).25 Similar drops are observed for tax revenues: For “less-transparency-­ induced”­ drop in tax revenues, we see a range from −2.35% for Norway to −9.20% for Timor-­Leste. For “less-­accountability-induced”­ drop, we see a range from −1.83% for Norway to −11.33% for Congo. The increase in US oil production due to the recent fracking revolution is also instructive. Here, a 10% increase in the US oil production is associated with a drop of −2.49% in transparency, bringing the US approximately to pre-shock­ level of UK, and a drop of −2.32% in accountability, bringing the US to slightly above the pre-­shock level of Portugal.

25Our data end at 2010 which precedes the Syrian uprising. Thus, the decline in this index is not caused by the uprising, though one could speculate whether it indicated an unequal application of the law among the emerging dissenting voices. MOHTADI et al. | 39 ( Continues ) 0.292 0.243 (0.0007) −0.003*** (0.00069) 0.00416*** (0.00196) 0.0137*** (0.0591) −0.245*** (0.0155) 0.130*** (8) 0.289 0.194 (0.0005) −0.005*** (0.00060) 0.00665*** (0.00146) 0.00681*** (0.0387) −0.128*** (0.0109) 0.0936*** (7) 0.305 0.332 (0.0004) −0.008*** (0.000418) 0.00317*** (0.00281) −0.0394*** 0.742*** (0.0419) (0.00732) 0.00607 (6) 0.292 0.271 (0.0004) −0.003*** (0.0004) 0.0033*** (0.00405) −0.059*** 0.920*** (0.0529) (0.0101) −0.0109 (5) Tax compliance 0.552 0.001 1 (0.00083) 0.0145*** (0.00026) −0.0052*** (0.00180) 0.00305* (0.0611) −0.0509 (0.0142) 0.105*** (0.00570) 0.798*** (4) 0.528 0.001 1 (0.00086) 0.0102*** (0.000252) −0.004*** (0.00200) 0.0000529 (0.0600) −0.0228 (0.0139) 0.0852*** (0.00563) 0.822*** (3) 0.829 0.001 1 (0.00091) 0.0101*** (0.000152) −0.004*** (0.00325) −0.0151*** 0.249*** (0.0447) (0.00345) 0.0493*** (0.00495) 0.807*** (2) 0.776 0.02 1 (0.00090) 0.0103*** (0.000157) −0.006*** (0.00297) −0.0457*** 0.673*** (0.0510) (0.00424) 0.0375*** (0.00868) 0.808*** Tax revenue (1) Robustness test 2: From accountability and rule of law to tax revenue compliance (Indirect effect stage 2) 2nd order 1st order test ( p values) taxes Serial correlation ( p values) Hansen specification International trade Gov. cons Polity2 Log of rule law Log of accountability GDP PC Lag of tax revenue TABLE 10 40 | MOHTADI et al. order serial correla - 299 122 95 (8) 299 122 95 (7) 299 122 95 (6) 9 above, e.g., the log of accountability in the first column represents the first column e.g., the log of accountability in 9 above, 299 122 95 (5) Tax compliance 732 134 282 (4) 9 above to highlight the path from oil to accountability and rule of law and from accountability and rule of law to tax to law of rule and accountability from and law of rule and accountability to oil from path the highlight to above 9 732 134 282 (3) 568 131 227 (2) G. *,**,***Significant at 10%, 5%, and 1%. Standard errors in parentheses. All regressions were carried out using dynamic panel system GMM methodology order serial correlation for those columns. All columns here capture the second path, effect of accountability and rule law on tax revenue as a proxy compliance 568 131 227 Tax revenue (1) (Continued) Observations Countries No of instr. TABLE 10 Notes . For variable measures see Appendix (xtabond2 command on stata), errors are corrected for heteroskedasticity and autocorrelation all columns include time fixe d effects. Hansen overidentification p values greater than 0.05 imply that we cannot reject the null hypothesis which states that instruments used are valid. Lag of tax revenue was reintroduced for first four columns as some specifications failed first- ­ as well as a direct measure of tax compliance (see the text). Each column uses estimates generated from corresponding columns in Table compliance (see the text). of tax direct measure as a as well tion test. This resolved the second- ­ Table in one column from obtained accountability of estimates of log natural the revenue and tax compliance. We see that the coefficients of log accountability is between 0 1 which shows nondecreas ing concavity as theory required, but same cannot be said for rule of law. MOHTADI et al. | 41 ( Continues ) −3.68 −2.09 −5.78 −3.30 −1.91 −2.74 −8.54 −8.32 −2.85 −1.83 −4.80 −6.73 −7.92 −9.68 −11.33 −6.40 −6.91 −8.54 −5.06 Accountability (%) From −4.04 −2.53 −4.76 −3.31 −2.98 −9.20 −4.47 −4.12 −3.31 −2.35 −4.69 −4.10 −4.29 −4.63 −5.22 −4.75 −4.67 −4.73 −4.07 Transparency (%) Effect on tax revenue From −2.83 −2.34 −4.42 −2.52 −2.35 −1.93 −8.98 −6.76 −2.35 −2.15 −3.71 −5.05 −5.93 −9.04 −5.42 −8.70 −6.32 −6.33 −5.00 % 0.619 0.750 0.324 0.580 0.666 0.609 0.152 0.207 0.622 0.774 0.363 0.301 0.254 0.171 0.227 0.189 0.252 0.207 0.304 To 0.637 0.768 0.339 0.595 0.682 0.621 0.167 0.222 0.637 0.791 0.377 0.317 0.27 0.188 0.24 0.207 0.269 0.221 0.32 From Effect on accountability −3.24 −2.57 −3.04 −2.45 −2.47 −5.41 −3.39 −2.76 −2.45 −2.25 −2.86 −3.15 −3.04 −3.79 −3.16 −4.04 −3.74 −3.25 −3.10 % 0.537 0.683 0.414 0.558 0.593 0.14 0.428 0.458 0.558 0.694 0.407 0.461 0.447 0.406 0.368 0.404 0.437 0.387 0.469 To 0.555 0.701 0.427 0.572 0.608 0.148 0.443 0.471 0.572 0.71 0.419 0.476 0.461 0.422 0.38 0.421 0.454 0.4 0.484 Effect on transparency From Effects of positive oil shock. (a) High dependent group. (b) Low Botswana (b) Belgium Yemen Venezuela Trinidad Timor- ­ Leste Syria S. Arabia Russia Norway Nigeria Kazakhstan Iran Egypt Congo Chad Cameroon Azerbaijan (a) Algeria Country TABLE 11 42 | MOHTADI et al. −2.00 −2.02 −1.98 −1.94 −1.95 −3.14 −1.65 −2.02 −2.93 Accountability (%) From −2.36 −2.50 −2.23 −2.61 −2.25 −2.96 −2.14 −2.53 −3.41 Transparency (%) Effect on tax revenue From −2.32 −2.32 −2.17 −2.43 −2.26 −3.07 −2.13 −2.32 −2.58 % 0.759 0.757 0.766 0.723 0.777 0.568 0.827 0.759 0.680 To 0.777 0.775 0.783 0.741 0.795 0.586 0.845 0.777 0.698 From Effect on accountability −2.49 −2.56 −2.28 −2.66 −2.41 −2.86 −2.24 −2.56 −2.97 % 0.705 0.684 0.729 0.659 0.728 0.611 0.743 0.684 0.588 To 0.723 0.702 0.746 0.677 0.746 0.629 0.76 0.702 0.606 Effect on transparency From (Continued) USA UK Switzerland Portugal Luxembourg Korea, Rep. Ireland Finland Costa Rica Country TABLE 11 Notes . The table show the impact of a positive oil shock (an increase in value to GDP ratio 10% points). first three col umns capture on transparency, second columns the impact on accountability, last two columns show impact, tax revenues, coming from transparency and accounta bility. See text for discussion insights. MOHTADI et al. | 43 7 | CONCLUSION

The abundance of petroleum wealth is associated both with less transparency and less tax revenue. We explain this outcome and elucidate the mechanism that produces it. The key is the behavior of tax payers. We develop a simple analytic game which produces both outcomes. An extractive (rentier) government moves first, choosing the level of transparency and the enforcement of taxes. The trade- off to the government arises because, on one hand, transparency is costly to a rentier state by making it harder to divert resources and, on the other hand, it is needed to elicit citizen greater tax compli- ance. Citizens, observing limited transparency, comply only partly with taxation since they cannot tell where their taxes go. A subgame-perfect­ equilibrium emerges in which the government is opaque and citizens are tax evasive. Besides transparency as an inducement, we also model the role of tax enforcement as a force. Testing the model with extensive data, we find strong support for the model and somewhat greater support for transparency. Economic historians have suggested that the need for in early modern Europe even- tually forced governments to become more accountable to their citizens (Hoffman & Norberg, 1994; North, 1990; North & Weingast, 1989). Seen in this light, our paper extends this insight in reverse in that windfall revenues from oil (and possibly other natural resources) diminish the incentive of gov- ernments to make their policies and institutions more transparent. The relation between resource wealth and transparency is particularly salient in oil-rich­ Middle East and North Africa (MENA) region. But rigorous empirical analysis of this question is constrained by the paucity of taxation data for this region. Further, by removing the MENA group from the sample and rerunning the regressions to test the MENA effect by its absence, one runs into another difficulty; namely removing the key source of variability, i.e., oil revenues. However, by simply comparing mean values, we find that the oil-­rich MENA countries have exceptionally low taxes and low transparency, while the non-­oil MENA countries have higher tax and transparency levels similar to the rest of the developing world. From this, we infer that low transparency in the MENA region is largely a function of the region's concentration of oil wealth, rather than cultural or historical traits.26 If more oil revenue means less accountability, less taxation and less citizen tax compliance, will the recent collapse in oil prices, if it becomes permanent, lead to a change in reverse by ushering a new era of greater accountability by governments, reliance on taxation, and more tax compliance coming from greater citizen participation? The answer would be affirmative, if the empirical findings were symmetric in either direction. But we cannot know this until more time has passed and more data has been gathered. What can be said about the policy relevance of these findings? In a world in which the government does act in public interest (for instance if new government comes to power that is less burdened by rentier behavior), the policy relevance of our findings is tied to several proposals (Devarajan, Ehrhart, Le, & Raballand, 2011; Sala-­i-Martin­ & Subramanian, 2003) in which the transfer of oil revenues di- rectly to the public is a way of improving government accountability through greater citizen scrutiny. That more accountability is associated with more efficiency and also improved taxation is not in doubt and is consistent with our model as well. But the difficulty is that first, a rentier and therefore opaque state would resist such proposals since adopting them may be self-­defeating; and second, that citizen scrutiny and public action relies on better information which the government may refuse to provide. Citing the Roman poet, Juvenal, the late Nobel laureate Leo Hurwicz, famously asked Hurwicz (2008) “But Who Will Guard the Guardians?”

26The analysis is available from the authors but not included here for brevity. 44 | MOHTADI et al. ACKNOWLEDGMENTS Financial support was provided by the Economic Research Forum (ERF). Prior versions were pre- sented at the Universities of Wisconsin and Minnesota, Pontifica Universidad Catolica de Chile, Santiago, Chile, the Economic Research Forum, Cairo, Egypt, the University of Southern California, Los Angeles, CA, and the 2016 Midwest International Economic Development Conference, Minneapolis, MN. We thank Scott Adams, Niloy Bose, Xi Chen, Scott Drewianka, Ibrahim Elbadawi, John Heywood, Jason Kerwin, Jeffrey Nugent, Hashem Pesaran, Kamiar Mohaddes, Hadi Salehi-­ Esfehani, Raimundo Soto and three ERF internal reviewers of the project for their helpful comments. We also thank two anonymous reviewers for their most insightful comments.

ORCID Hamid Mohtadi http://orcid.org/0000-0002-9497-787X

REFERENCES Alexeev, M., & Conrad, R. (2009). The elusive curse of oil. The Review of and Statistics, 91(3), 586–598. https://doi.org/10.1162/rest.91.3.586 Andersen, J. J., Johannesen, N., Lassen, D. D., & Paltseva, E. (2014). Petro rents, political institutions, and hidden wealth: Evidence from offshore bank accounts. Working paper. Andreoni, J., Erard, B., & Feinstein, J. (1998). Tax compliance. Journal of Economic Literature, 36, 818–860. Arellano, M., & Bond, S. (1991). Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations. The Review of Economic Studies, 58(2), 277–297. https://doi.org/10.2307/2297968 Arellano, M., & Bover, O. (1995). Another look at the instrumental variable estimation of error-components­ models. Journal of Econometrics, 68(1), 29–51. https://doi.org/10.1016/0304-4076(94)01642-D Arezki, R., Ramey, V. A., & Sheng, L. (2017). News shocks in open economies: Evidence from giant oil discoveries. Quarterly Journal of Economics, 32(1), 103–155. Barro, R. (1990). Government spending in a simple model of endogenous growth. Journal of Political Economy 98(5) (part 2), S103–S125. https://doi.org/10.1086/261726 Besley, T., & Persson, T. (2009). The origins of state capacity: Property rights, taxation, and politics. American Economic Review, 99(4), 1218–1244. https://doi.org/10.1257/aer.99.4.1218 Besley, T., & Persson, T. (2010). State capacity, conflict, and development. Econometrica, 78(1), 1–34. Besley, T., & Persson, T. (2011). Pillars of prosperity: The political economics of development clusters. Princeton, NJ: Princeton University Press. Blundell, R., & Bond, S. (1998). Initial conditions and moment restrictions in dynamic panel data models. Journal of Econometrics, 87(1), 115–143. https://doi.org/10.1016/S0304-4076(98)00009-8 Bohn, H., & Deacon, R. T. (2000). Ownership risk, investment, and the use of natural resources. American Economic Review, 90(3), 526–549. https://doi.org/10.1257/aer.90.3.526 Boschini, A. D., Pettersson, J., & Roine, J. (2007). Resource curse or not: A question of appropriability. Scandinavian Journal of Economics, 109(3), 593–617. https://doi.org/10.1111/j.1467-9442.2007.00509.x Brautigam, D., Fjeldstad, O. H., & Moore, M. (Eds.) (2008). Taxation and state-building in developing countries: Capacity and consent. New York, NY: Cambridge University Press. Brückner, M., Ciccone, A., & Tesei, A. (2012). Oil price shocks, income, and democracy. Review of Economics and Statistics, 94(2), 389–399. https://doi.org/10.1162/REST_a_00201 Cárdenas, M., Ramírez, S., & Tuzemen, D. (2011). Commodity dependence and fiscal capacity. Working paper, Brookings Institution. Carreri, M., & Dube, O. (2015). Do natural resources influence who comes to power, and how?Working paper. Caselli, F., & Tesei, A. (2016). Resource windfalls, political regimes, and political stability. Review of Economics and Statistics, 98(3), 573–590. https://doi.org/10.1162/REST_a_00538 MOHTADI et al. | 45 Cassidy, T. (2016). The long-run effects of oil wealth on democracy and fiscal capacity. Working Paper, University of Michigan. Cornand, C., & Heinemann, F. (2008). Optimal degree of public information dissemination. The Economic Journal, 118(528), 718–742. https://doi.org/10.1111/j.1468-0297.2008.02139.x Crivelli, E., & Gupta, S. (2014). Resource blessing, revenue curse? Domestic revenue effort in resource-rich countries. Working paper, IMF. Cuccia, A. D. (1994). The economics of tax compliance: What do we know and where do we go? Journal of Accounting Literature, 13, 81–116. Cust, J., & Harding, T. (2015). Institutions and the location of oil exploration. Working paper. Devarajan, S., Ehrhart, H., Le, T. M., & Raballand, G. (2011). Direct redistribution, taxation, and accountability in oil- rich economies: A proposal. CGD Working Paper 281. Washington, DC: Center for Global Development. Retrieved from http://www.cgdev.org/content/publications/detail/1425822 Dube, O., & Vargas, J. F. (2013). Commodity price shocks and civil conflict: Evidence from Colombia. Review of Economic Studies, 80, 1384–1421. https://doi.org/10.1093/restud/rdt009 Frankel, J. (2010). The natural resource curse: A survey. NBER Working Paper 15836. Friedman, E., Johnson, S., Kauffman, D., & Zoido-Lobaton, P. (2000). Dodging the grabbing hand: The determinants of unofficial activity in 69 countries. Journal of Public Economics, 87, 354–358. Gwartney, J., Lawson, R., & Hall, J. (2013). Economic Freedom Dataset. In Economic Freedom of the World: 2013 Annual Report. Vancouver: Fraser Institute. Hoffman, P. T., & Norberg, K. (Eds.) (1994). Fiscal crises, liberty, and representative government (pp. 1450–1789). Stanford, CA: Stanford University Press. Hollyer, J., Rosendorff, P., & Vreeland, J. (2014). Measuring transparency. Political Analysis, 22, 413–434. https://doi. org/10.1093/pan/mpu001 Holtz-Eakin, D., Newey, W., & Rosen, H. S. (1988). Estimating vector autoregressions with panel data. Econometrica, 56(6), 1371–1395. https://doi.org/10.2307/1913103 Horn, M. (2014). Giant oil and gas fields of the world. Retrieved from http://www.datapages.com/AssociatedWebsites/ GISOpenFiles/HornGiantFields.aspx. Hurwicz, L. (2008). But who will guard the guardians? The American Economic Review, 98, 577–585. https://doi. org/10.1257/aer.98.3.577 ICTD (2014). International centre for tax and development government revenue dataset. Brighton, UK: International Centre for Tax and Development. International Monetary Fund (2012). Fiscal transparency, accountability, and risk. Fiscal Affairs and Statistics Department. Jackson, B., & Milliron, V. (1986). Tax compliance research: Findings, problems and prospects. Journal of Accounting Literature, 5, 125–165. Jensen, A. (2011). State-building­ in resource-rich­ economies. Atlantic Economic Journal, 39, 171–193. https://doi. org/10.1007/s11293-011-9269-z Kinsey, K. (1986). Theories and models of tax cheating. Criminal Justice Abstracts, 2, 403–425. Lane, P. R., & Tornell, A. (1996). Power, growth, and the voracity effect. Journal of , 1, 213–241. https://doi.org/10.1007/BF00138863 Long, S., & Swingen, J. (1991). Taxpayer compliance: Setting new agendas for research. Law and Society Review, 25, 637–683. https://doi.org/10.2307/3053731 McGuirk, E. (2010). The illusory leader: Natural resources, taxation, and accountability. Dublin, Ireland: Trinity College. Mehlum, H., Moene, K., & Torvik, R. (2006). Institutions and the resource curse. The Economic Journal, 116, 1–20. https://doi.org/10.1111/j.1468-0297.2006.01045.x Mohtadi, H., & Polasky, S. (2016). To choose or not to choose free trade? Role of corruption and tax evasion. Working Paper, University of Wisconsin and University of Minnesota. Mohtadi, H., & Roe, T. (1998). Growth, lobbying and public goods. European Journal of Political Economy, 14(4), 453–573. https://doi.org/10.1016/S0176-2680(98)00017-2 Mohtadi, H., & Roe, T. (2003). Democracy, rent seeking, public goods and economic growth. Journal of Public Economics, 87(3–4), 445–466. https://doi.org/10.1016/S0047-2727(01)00135-9 Mohtadi, H., Ross, M., & Ruediger, S. (2014). Do natural resources inhibit transparency? Economic research forum working paper 906; Working Paper, University of Wisconsin and UCLA. New York Times (2013). President of Mexico proposes tax overhaul September 9. 46 | MOHTADI et al. North, D. (1990). Institutions, institutional change, and economic performance (p. 49). New York, NY: Cambridge University Press. https://doi.org/10.1017/CBO9780511808678 North, D., & Weingast, B. (1989). Constitutions and commitment: The evolution of institutions governing public choice in seventeenth-century­ England. Journal of , 49(4), 803–832. https://doi.org/10.1017/ S0022050700009451 Olson, M. (1993). Dictatorship, democracy, and development. The American Political Science Review, 87(3), 567–576. https://doi.org/10.2307/2938736 Persson, T., Roland, G., & Tabellini, G. (2007). Electoral rules and government spending in parliamentary democracies. Quarterly Journal of Political Science, 2(2), 155–188. https://doi.org/10.1561/100.00006019 Polinsky, A. M., & Shavell, S. (2001). Corruption and optimal law enforcement. Journal of Public Economics, 81, 1–24. https://doi.org/10.1016/S0047-2727(00)00127-4 Polity IV PROJECT (2013). Monty G. Marshall Center for Systemic Peace and Societal-Systems Research Inc by Gurr, T. and K. Jaggers. Richardson, M., & Sawyer, A. (2001). A taxonomy of the tax compliance literature: Further findings, problems and prospects. Australian Tax Forum, 16, 137–320. Robinson, J. A., Torvik, R., & Verdier, T. (2006). Political foundations of the resource curse. Journal of Development Economics, 79, 447–468. https://doi.org/10.1016/j.jdeveco.2006.01.008 Roodman, D. (2009). How to do xtabond2: An introduction to difference and system GMM in stata. Stata Journal, 9(1), 86–136. https://doi.org/10.1177/1536867X0900900106 Ross, M. (1999). The political economy of the resource curse. World Politics, 51(2), 297–322. https://doi.org/10.1017/ S0043887100008200 Ross, M. (2001). Does oil hinder democracy? World Politics, 53(3), 325–361. https://doi.org/10.1353/wp.2001.0011 Ross, M. (2004). Does taxation lead to representation? British Journal of Political Science, 34, 229–249. https://doi. org/10.1017/S0007123404000031 Ross, M. (2011). Mineral wealth and budget transparency. Working paper, UCLA, Department of Political Science. Ross, M. (2012). The oil curse: How petroleum wealth shapes the development of nations?. Princeton, NJ: Princeton Univ. Press. Ross, M. (2013). Oil & gas data, 1932–2011. Retrieved from http://thedata.harvard.edu/dvn/dv/mlross Ross, M. (2015). What have we learned about the resource curse? Annual Review of Political Science, 18, 239–259. https://doi.org/10.1146/annurev-polisci-052213-040359 Sala-i-Martin, X., & Subramanian, A. (2003). Addressing the natural resource curse: An illustration from Nigeria. NBER Working Paper 9804. Smith, B. (2015). The resource curse exorcised: Evidence from a panel of countries. Journal of Development Economics, 116, 57–73. https://doi.org/10.1016/j.jdeveco.2015.04.001 Stevens, P., & Dietsche, E. (2008). Resource curse: An analysis of causes, experiences and possible ways forward. Energy Policy, 36(1), 56–65. https://doi.org/10.1016/j.enpol.2007.10.003 Tanzi, V., & Zee, H. (2000). Interview: Developing countries face range of challenges in devising effective tax strategy. IMF Survey 29(13), 209 and 217–221. Tornell, A., & Lane, P. R. (1999). The voracity effect. The American Economic Review, 89(1), 22–46. https://doi. org/10.1257/aer.89.1.22 Wick, K., & Bulte, E. (2009). The curse of natural resources. Annual Review of Resource Economics, 1(1), 139–156. https://doi.org/10.1146/annurev.resource.050708.144219 Williams, A. (2009). On the release of information by governments: Causes and consequences. Journal of Development Economics, 89(1), 124–138. https://doi.org/10.1016/j.jdeveco.2008.08.001 Williams, A. (2011). Shining a light on the resource curse: An empirical analysis of the relationship between natu- ral resources, transparency, and economic growth. World Development, 39(4), 490–505. https://doi.org/10.1016/j. worlddev.2010.08.015 Williams, A. (2015). A global index of information transparency and accountability. Journal of Comparative Economics, 43, 804–824. https://doi.org/10.1016/j.jce.2014.10.004 Wilson, P., Cobham, A., & Goodall, A. (2014). The ICTD government revenue dataset. ICTD Working paper 19. Wooldridge, J. (2002). Econometric analysis of cross section and panel data. Cambridge, UK: The MIT Press. MOHTADI et al. | 47

How to cite this article: Mohtadi H, Ross ML, Jarrett U, Ruediger S. Kleptocracy and tax evasion under resource abundance. Econ Polit. 2019;00:1–51. https://doi.org/10.1111/ ecpo.12130

APPENDIX A Proof of Lemma 1

Statement of Lemma 1 : Under reasonable upper bound on the tax rate, t, ∂b/∂T ≤ 0 implies that 휕t∕휕T < 0

Totally differentiate 18 in T:

 − 1−( +) 1− ( ∗ +t ∗)t =−t ∗ −g (1−b) ⋅(−b ) t T T 1− T

We have already argued why evidence supports 훼 +훽<1. Since bT ≤ 0 by assumption and since ∗ ∗ ∗ 휇T > 0, it follows that RHS < 0. Using optimum μ* from 15 in the expression +t t on the LHS, we ∗ ∗ ⋅ ∗ ∗ find +t t = 1−2t∕[(T) f (L)]. Thus, 휇 +t휇t > 0 if t < γf/2 which puts an upper bound on the tax rate. Assuming this upper bound, it follows that, → bT ≤ 0 tT < 0

APPENDIX B Proof of Lemma 2

Statement of Lemma 2 Under reasonable upper bound on the tax rate, t, ∂b/∂L ≤ 0 implies 휕t∕휕L < 0 but 휕b∕휕L > 0 implies ∂t/∂L ⪌ 0.

Repeating the above procedure this time for L we have,

 1−( +) − ( ∗ +t ∗)t =−t ∗ −g (1−b) 1− (−b ) t L L 1− L

∗ As with the case of T, here assuming same upper bound on tax rate, and also noting that 휇L > 0, we find that bL ≤ 0 would imply the RHS < 0 so that tL < 0 and the response of tax payers is similar to transparency. However, if bL > 0, then RHS can have any sign and hence tL ⪌ 0.

APPENDIX C Proof of second-­order condition for compliance Differentiating the first-­order condition, Equation (23), in T and rearranging the terms, we have:

≡ ∗ ∗ ∗ ∗ ∗ ∗ ∗ 2 ∗ ∗ Q bT (tT +t T +t t tT )+bT (tT +t T )+b[2tT T +t TT +tT t +t tT tT ]

Use the first-order­ condition Equation (23) once again to substitute for the first two terms ∗ ∗ bT (tT +t T ) in the above equation. We have: 48 | MOHTADI et al.

b2 Q =−2 T ⋅( +t∗)−t∗t +tb ∗t +b[2t ∗ +t∗ +t2 ∗ +t∗ t ] b t T T t T T T TT T t tT T

∗ ∗ In this equation the first term (with minus sign) is negative; the second term, −t t tT is <0 as t and ∗ tT are both <0. In addition, since bT < 0 the third term tbT t tT is also <0. As for the last term in the ∗ ∗ ∗ brackets, we know from optimum μ* that 휇T > 0, 휇t < 0 and 휇tT > 0 (Equation 15) Moreover, tT < 0 by ∗ Lemma 1. Thus, it is sufficient that 휇TT < 0 in order that Q < 0 and second-order­ condition be satis- 27 ∗ fied. Note, however, that 휇TT < 0 is not necessary. For example, a more relaxed condition such as ∗ ∗ 휇TT ∕휇T < −tT ∕t will also satisfy second-­order condition.

APPENDIX D Proof of non-optimality­ of law enforcement if tL > 0 The first-­order condition for L, if it existed, would derive from differentiating the government's utility function, in L. This would yield: U G = b(∗ +t∗)t +bt∗ +b (+t∗) L t L L L

∗ ∗ Recall from Lemma 2 that 휇 +t휇t > 0 under reasonable upper bound on tax rate. Also recall from t > 0 휇∗ > 0 that lemma that the case of L can only arise from bL > 0. It follows that if L in above equation, then UG will always be increasing in L i.e., ∂UG/∂L > 0, and there is no optimum enforcement. Such a government would always benefit from maximum enforcement.

APPENDIX E Proof of Proposition 4 Using the expression for optimal compliance, μ* = 1 − t/[γ(T)π(L)], we have, d �t2 2t (t ∗) = −t 1+ dT 2 T  

Now use the expression for μ* above again to reexpress this equation in terms of μ*: d �t (t ∗) = (1− ∗)−t [1+2(1− ∗)] dT  T

Since μ* < 1 and since tT < 0, it follows that, d (t휇∗) > 0 if 훾� > 0 dT

APPENDIX F Proof of Proposition 5 Using the expression for d(tμ*)/dT in Appendix E, the second cross differential d2(tμ*)/dLdT becomes,

27 For analytic simplicity, two simplifying linearity conditions are assumed: τTT = bTT = 0. MOHTADI et al. | 49

d2 � (t ∗) = [t (1− ∗)−t ∗]−t [1+2(1− ∗)]+2t∗ ∗ dLdT  L L TL T L In this expression, the first bracketed term is <0; the sign of the second term depends on the sign of tTL; and the last term is <0. We shall now show that tTL > 0 if bTL = 0. To find tTL differentiate the equation of Lemma 1 under Appendix A with respect to L. This yields,

(∗ +t∗)⋅t +(∗ +t∗)t =−t ∗ −t∗ +gb ⋅ 1−− ⋅ −  L t T t TL L T TL T 1− 1−   − −1 − (1−b) 1− (−b )+gb ⋅ 1−− ⋅(1−b) 1−  L TL 1−

� 휕 (휇∗ +t휇∗) =− 2 (t −t 휋 ) > 0 Note that the first term on the left-­hand side involves 휕L t 훾휋 L 휋 since tL < 0. t Therefore,� �that entire term is <0 since T < 0. Further, the second term on the right-hand­ side involves 휇∗ =−훾 휋 < 0 TL 훾2휋2 . Since the third term is positive (bL < 0, bT < 0), it follows that the right hand side is 2 d (t ∗) positive if bTL = 0. Thus, tTL > 0. With this result, the right hand side of the equation for dLdT is 2 negative. It follows that d (t휇∗) < 0. dLdT 50 | MOHTADI et al. (Continues) calculations giant oil discoveries dataset information index by Williams (2009, 2011) WDI WDI Authors own Horn (2014) Ross (2013) Ross (2013) ICTD Release of Source unit world price unit price of oil GDP value of the fraction recoverable crude oil from underground reservoirs with current petroleum extraction technologies per- ­ divided by GDP and gas extracted in a given year multiplied by the per- ­ Include taxes on total income, capital gains and profit 0 to 1. (The index is linearly transformed without being compacted so that all values are positive) Government consumption divided by GDP per capita in constant 2,005 US$ Where oil reserves is the estimated Where oil price is the average annual Where oil value is the quantity of Gov. tax revenue from individuals. An index of transparency ranging from Variable description 164.70 11.98 16.5 11.47 4.35 1.82 26.26 0.87 Max 1.38 3.91 0 4.37 2.61 0 0 0.01 Min 8.19 1.64 5.52 1.90 0.48 0.13 4.62 0.17 SD 16.65 8.01 3.93 7.68 3.48 0.06 4.21 0.48 Mean 186 199 203 50 169 167 165 184 Countries 4,976 5,480 6,510 1,550 5,239 4,743 2,968 5,326 Obs. Summary statistics per GDP (in percent) (const. 2,005 US$) reserves) of barrels) US$) per GDP) per GDP (in percent) Gov’t consumption Log of GDP per capita Log of (oil price × oil Oil reserves (millions Oil price (const. 2,000 Log of (1 + oil value Taxes on individuals Actual transformed variable, if any, used in regressions Gov. cons GDP per capita Oil reserve value Oil reserves Oil price Oil Tax revenue Variable names as shown in tables Transparency APPENDIX G names, and measures variable statistics, Descriptive MOHTADI et al. | 51 (2015) Final transparency Indices scores (2013) Fraser chain linked index PROJECT (2013) Williams Gwartney et al. WDI ICTD Polity IV Source ency ranging from 0 to 1. (The index is linearly transformed without being compacted so that all values are positive) the World Bank’s doing business data on the time required per year for a business to prepare, file, and pay taxes on corporate income, value added or sales taxes, and taxes on labor” (Gwartney et al., 2013) to 1. (The index is linearly trans - formed without being compacted so that all values are positive) transactions. Including import and export taxes measuring on a scale from −10 to +10 the polity of a country An Index of accountability transpar - An index of tax compliance “based on An Index of rule law ranging from 0 Taxes on international trade and Updated combined polity score; Variable description 1 9.87 1 39.18 10 Max 0 0 0 −0.03 −10 Min 0.27 2.29 0.21 3.65 7.24 SD 0.52 6.54 0.56 3.10 1.68 Mean 184 146 203 182 165 Countries 4,896 952 2,396 3,880 4,669 Obs. Summary statistics trade and transac - tions per GDP (in %) (Continued) Taxes international Actual transformed variable, if any, used in regressions Accountability Tax compliance Rule of law Inter. taxes Variable names as shown in tables Polity2 APPENDIX G