<<

The origins and consequences of kin networks and practices

by Duman Bahramirad

M.Sc., University of Tehran, 2007 B.Sc., University of Tehran, 2005

Thesis Submitted in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy

in the Department of Economics Faculty of Arts and Social Sciences

c Duman Bahramirad 2018 SIMON FRASER UNIVERSITY Summer 2018

Copyright in this work rests with the author. Please ensure that any reproduction or re-use is done in accordance with the relevant national copyright legislation. Approval

Name: Duman Bahramirad

Degree: Doctor of Philosophy (Economics)

Title: The origins and consequences of kin networks and marriage practices

Examining Committee: Chair: Nicolas Schmitt Professor

Gregory K. Dow Senior Supervisor Professor

Alexander K. Karaivanov Supervisor Professor

Erik O. Kimbrough Supervisor Associate Professor Argyros School of Business and Economics Chapman University

Simon D. Woodcock Supervisor Associate Professor

Chris Bidner Internal Examiner Associate Professor

Siwan Anderson External Examiner Professor Vancouver School of Economics University of British Columbia

Date Defended: July 31, 2018

ii Ethics Statement

iii

iii Abstract

In the first chapter, I investigate a potential channel to explain the heterogeneity of kin networks across societies. I argue and test the hypothesis that female inheritance has historically had a posi- tive effect on in-marriage and a negative effect on female premarital relations and economic partic- ipation. In the second chapter, my co-authors and I provide evidence on the positive association of in-marriage and corruption. We also test the effect of ties on in a bribery experi- ment. The third chapter presents my second joint paper on the consequences of kin networks. Taking a bigger-picture approach, we define a intensity index based on basic elements of kinship such as marriage practices, residence patterns, and organizations. Combining data on 20 psychological outcomes, we show that a significant portion of the existing psychological vari- ation around the world originates in the differences of kin networks. Using historical measures of Church exposure, we also show that the variation in these differences arose historically from the ’s marriage and family policies. Keywords: Female inheritance; Kin networks; Gender inequality; marriage; ; Corruption;

iv Dedication

To my wonderful wife, Mahsa.

v Acknowledgements

I would like to thank Greg Dow, Alexander Karaivanov, Erik Kimbrough, and Simon Woodcock for their guidance and support.

vi Table of Contents

Approval ii

Ethics Statement iii

Abstract iv

Dedication v

Acknowledgements vi

Table of Contents vii

List of Tables ix

List of Figures x

1 Introduction 1

2 Keeping It in the Family: Female Inheritance, In-marriage, and the Status of Women 3 2.1 Introduction ...... 3 2.2 Inheritance as a determinant of kinship pattern ...... 7 2.2.1 Classification of inheritance systems ...... 8 2.2.2 Origins and persistence of inheritance systems ...... 9 2.2.3 Patterns of inmarriage and female economic participation ...... 12 2.3 Conceptual framework ...... 13 2.3.1 Female inheritance and inmarriage ...... 15 2.3.2 Female inheritance and the status of women ...... 18 2.4 Empirical strategies and results using historical data ...... 20 2.4.1 Pre-industrial societies ...... 21 2.4.2 Mid-twentieth-century Italy ...... 25 2.5 Empirical strategies and results using contemporary data from developing countries 27 2.5.1 Individual-level analysis using Indonesian data ...... 28 2.5.2 Difference in differences analysis using Hindu Succession (Amendment) Act of ...... 33

vii 2.6 Conclusion ...... 38

Bibliography Keeping It in the Family: Female Inheritance, In-marriage, and the Status of Women 40

3 Kinship, Fractionalization and Corruption 56 3.1 Introduction ...... 56 3.2 Theory and hypotheses ...... 63 3.2.1 A basic model of bribery ...... 63 3.2.2 Inclusive fitness, kin altruism, and corruption ...... 65 3.2.3 Consanguineous marriage, sub-ethnic fractionalization, and corruption . . . 67 3.3 Empirical Strategy and Results ...... 69 3.3.1 Cross-country analysis ...... 69 3.3.2 Within-country analysis (Italy) ...... 75 3.3.3 Laboratory Experiments ...... 81 3.4 Conclusion ...... 88

Bibliography Kinship, Fractionalization and Corruption 89

4 Origins of WEIRD Psychology 105 4.1 Theory ...... 106 4.2 Methods ...... 108 4.3 Results ...... 112 4.3.1 Psychological variation across countries ...... 113 4.3.2 Psychological Variation within Europe ...... 119 4.3.3 The children of immigrants ...... 123 4.4 Discussion ...... 124

Bibliography Origins of WEIRD psychology 125

Appendix A Keeping It in the Family: Female Inheritance, In-marriage, and the Status of Women 127

Appendix B Kinship, Fractionalization and Corruption 164

Appendix C Origins of Weird Psychology 217

viii List of Tables

Table 2.1 Female inclusion and partibility of inheritance in the EA ...... 21 Table 2.2 Set of control variable used in the regression analyses with the EA data . . . 22 Table 2.3 First set of regression analyses with the EA data ...... 23 Table 2.4 Second set of regression analyses with the EA data ...... 24 Table 2.5 Regression analyses with data on Italian provinces ...... 27 Table 2.6 Ethnic-level regression analyses with IFLS data ...... 30 Table 2.7 Individual-level regression analyses with IFLS data ...... 31 Table 2.8 Diff-in-diff regression analyses with NFHS data ...... 36 Table 2.9 Diff-in-diff regression analyses with NFHS data, counterfactual trends . . . 37

Table 3.1 Misspecification due to omission of sub-ethnic fractionalization ...... 59 Table 3.2 Misspecification due to conflation of ethnic and sub-ethnic fractionalization . 59 Table 3.3 Religious attitudes to consanguineous marriage ...... 68 Table 3.4 Cross-country regression analysis of the relationship between and corruption ...... 71 Table 3.5 Cross-country regression analysis of the relationship between consanguinity and corruption: potential confounds...... 73 Table 3.6 Regression analysis of the relationship between consanguinity and corruption in Italy ...... 78 Table 3.7 Regression analysis of the relationship between consanguinity and corruption in Italy controlling for climate and geography...... 78 Table 3.8 Reduced form regressions with active years of archdioceses ...... 79 Table 3.9 Active years of archdioceses as an instrument for consanguinity ...... 80 Table 3.10 Relative frequency of bribery and corruption by treatment ...... 85 Table 3.11 Summary of relative frequency of bribery and corruption ...... 87

Table 4.1 Psychological and behavioral measures ...... 112 Table 4.2 Baseline cross-country regressions for psychological outcomes ...... 119 Table 4.3 Regression of psychological outcomes on exposure to the medieval Western Church ...... 120 Table 4.4 Regression of psychological outcomes for the children of immigrants in Europe123

ix List of Figures

Figure 2.1 Average rates in India ...... 6 Figure 2.2 Worldwide distribution of pre-industrial inheritance systems ...... 9 Figure 2.3 Cousin marriage and female economic participation rates around the world 12 Figure 2.4 Cousin marriage and female economic participation rates in Italy . . . . . 13 Figure 2.5 Female inheritance and cousin marriage ...... 17 Figure 2.6 Cousin marriage rates among Hindu women across Indian states...... 33

Figure 3.1 A bribery game between strangers and relatives with inclusive fitness . . . 64 Figure 3.2 Corruption and consanguinity around the world ...... 70 Figure 3.3 Corruption and consanguinity in Italy ...... 77 Figure 3.4 Bribery game in the experiment ...... 82

Figure 4.1 Kinship Intensity Index for ethno-linguistic populations around the globe . 109 Figure 4.2 Western Church exposure across European regions ...... 110 Figure 4.3 Individualism and independence, kinship intensity and Church exposure . 114 Figure 4.4 Conformity and obedience, kinship intensity and Church exposure . . . . 115 Figure 4.5 Impartiality, kinship intensity and Church exposure ...... 116 Figure 4.6 Impersonal cooperation and trust, kinship intensity and Church exposure . 117 Figure 4.7 Relationships between regional estimates of cousin marriage and psycho- logical measures ...... 122

x Chapter 1

Introduction

The thesis includes three chapters on the origins and consequences of kin networks, marriage prac- tices, and gender norms. In the first chapter, I study historical origins of existing differences in marriage practices and gender norms across societies. I argue that in patrilineal societies that man- date female inheritance, such as Islamic societies, cousin marriage and endogamy have developed to preserve property in the male lineage, prevent land fragmentation, and limit conflicts over inheri- tance. In these societies, female inheritance has also encouraged restrictions on women’s premarital sexual freedom in order to arrange cousin and avoid out-of-wedlock children as poten- tial heirs. These restrictions, such as through veiling and segregation from men, are incompatible with female participation in agriculture and have further influenced the historical gender division of labor. Using data on pre-industrial societies, Italian provinces, and Indonesian individuals, I find evidence consistent with these hypotheses: female inheritance is associated with higher cousin mar- riage, endogamy, and , and lower female economic participation and premarital sexual freedom. Finally, I use a 2005 reform of Indian inheritance laws to obtain causal estimates of the effect of female inheritance on the cousin marriage rate and the female premarital sex rate. In the second chapter, along with my co-authors Mahsa Akbari and Erik Kimbrough, I study consequences of marriage practices for quality of formal institutions. By shaping patterns of related- ness and interaction, marriage practices influence the relative returns to norms of nepotism/favoritism versus norms of impartial cooperation. Cousin marriage yields a relatively closed society of related individuals and thereby encourages favoritism and corruption. Out-marriage creates a relatively open society with increased interaction between non-relatives and strangers, thereby encouraging impartiality. We report a robust association between in-marriage practices and corruption across countries and across Italian provinces. A stylized corruption experiment comparing subjects from two countries with divergent marriage patterns provides complementary evidence that the degree of impartiality varies with marriage patterns. In the last chapter, along with my co-authors Jonathan Schulz, Jonathan Beauchamp, and Joseph Henrich, I study consequences of kinship patterns for social psychology. Recent research not only confirms the existence of substantial psychological variation around the globe but also highlights the peculiarity of populations that are Western, Educated, Industrialized, Rich and Democratic

1 (WEIRD). We propose that much of this variation arose as people psychologically adapted to dif- fering kin-based institutions—the set of social norms governing descent, marriage, residence and related domains. We further propose that part of the variation in these institutions arose historically from the Catholic Church’s marriage and family policies, which contributed to the dissolution of Europe’s traditional kin-based institutions, leading eventually to the predominance of nuclear fam- ilies and impersonal institutions. By combining data on 20 psychological outcomes with historical measures of both kinship and Church exposure, we find support for these ideas in a comprehensive array of analyses across countries, among European regions and between individuals with different cultural backgrounds.

2 Chapter 2

Keeping It in the Family: Female Inheritance, In-marriage, and the Status of Women

“This is what the LORD commands . . . Every daughter who inherits land in any Israelite tribe must marry someone in her father’s tribal , so that every Israelite will possess the inheritance of his fathers." (The , Numbers 36)

2.1 Introduction

Modern economic literature widely assumes that female inheritance empowers women, increases their autonomy, and promotes gender equality. This is true in the modern world, where women’s in- heritance rights enable them to control and exploit their property and therefore improve their social and economic prospects. However, in a patrilineal traditional society, female inheritance transmits property through women, not to women. Here, women function mainly as carriers of property from father to husband, and on to children, rather than as active managers of wealth [113, 157, 159, 147]. Therefore, under female inheritance, a woman’s marriage determines whom her family will have to share their land or herds with. Thus, her male relatives have the incentive to arrange her mar- riage to a cousin in order to keep her share of property among themselves. Arranging her marriage also requires controlling her premarital romantic and sexual relationships. These arrangements and controls are likely to negatively impact the woman’s role and participation in society. Despite inheritance being an important economic institution, the above paradigm from the social sciences has received little attention from economists. I attempt to fill this gap in the field. I propose that under female inheritance, patrilineal societies encourage inmarriage—in the form of cousin marriage or endogamy (marriage within the limits of a local community such as a clan or village)— and control over women’s sexuality to keep property within the male lineage, prevent property fragmentation, and limit conflicting claims on the estate. Using different data sources, I confirm that female inheritance is associated with more cousin marriage, endogamy and arranged marriage, as

3 well as with less female economic participation and premarital sexual freedom. I also provide causal evidence that female inheritance has a positive effect on the cousin marriage rate and a negative effect on the female premarital sex rate. I present a conceptual framework to analyze inmarriage and the status of women in both patrilin- eal pre-industrial societies and contemporary developing countries. In this framework, the objective of a family head in a patrilineal society is to preserve the patrilineal succession, that is, to minimize the probability of patrilineal extinction in order to perpetuate his male lineage. At the same time, the production technology exhibits increasing returns to scale up to a point, due to the size of the land required for plowing and harrowing, or to maintain a base of local political power (such as a feudal estate).1 I develop the conceptual framework based on three assumptions. First, I take patrilineal bias—in succession of names, rights, titles, properties, and so on—as given based on the fact that patrilineal systems have historically been widespread [103]. As Adam Smith noted, patrilineal systems incor- porate the patrilineal bias in order to preserve male lineages [236, 60] as their basic social units. Sec- ond, capital markets are imperfect, and most land is acquired through inheritance as a non-market mechanism of land transmission between close kin. This assumption characterizes pre-industrial agricultural societies and many contemporary developing countries [60, 213, 36, 152]. Third, I as- sume female inheritance was introduced exogenously to some patrilineal societies but not others. As I discuss in section 2.2, the literature suggests that the differences in traditional inheritance sys- tems across societies are deep-rooted in geographical characteristics and the subsistence economy of the regions. Moreover, and legal systems also contributed to the persistence of inher- itance systems throughout history. For example, have had to follow Islamic inheritance law because it is stated explicitly in the text of the Qur’an. The centuries-long legal recognition of entails in many European countries is another example. By an entail, the founder of an estate could prevent its sale or division by an indefinite and automatic chain of succession specified at his will. Because of its deep-rooted origins and its persistence through religious or secular laws or traditions, following the literature, we can reasonably consider inheritance an exogenous determinant of more flexible social institutions such as marriage practices and gender norms. Using insights from anthropological studies such as [113], I propose two hypotheses. The first hypothesis is that when women are included in inheritance, inmarriage is more frequent. Several possible mechanisms are involved. Cousin marriage keeps the land within the male lineage and therefore decreases the probability of patrilineal extinction. Inmarriage also provides a way to de- crease property fragmentation: a man can marry a woman within the kin or community to pool land parcels and capital goods, which decreases land and capital fragmentation. Finally, cousin marriage decreases rivalry and conflict between siblings over inheritance by creating overlapping interests

1I base my arguments on inheritance of land only, both for the sake of clarity and because land was historically the most important form of property, factor of production, and source of wealth. However, my arguments and hypotheses may well apply to other property such as herds or commercial property.

4 among their offspring; a sister’s offspring will benefit from the property of a brother when it passes to his offspring. The second hypothesis is that under female inheritance, more restrictions are imposed on women’s premarital sexual behavior and their economic participation. Two potential mechanisms are in- volved. The first mechanism is through inmarriage. Female inheritance promotes cousin and endog- amous marriages, which are usually arranged by parents. Arranging marriage requires controlling premarital courtship and sexual relationships through gender segregation and restrictions on con- tact between opposite sexes. Under these restrictions, young people are less likely to form romantic attachments with outsiders, which makes it easier to arrange marriage with an intended cousin. Fur- ther, under such restrictions, are among the few people of the opposite sex whom young people are likely to meet and form a romantic relationship with. However, restrictions on contact between opposite sexes tend to disadvantage women more than men, because women’s sexual be- havior can more easily be screened (through virginity and unwanted pregnancy). Gender segregation is also likely to lead to women being secluded at home and wearing the veil, which are incompati- ble with participation in agriculture and may intensify the pre-existing division of labor within the agricultural family. The second mechanism is through maternity certainty.2 Female sexual freedom is a threat to a family’s property because out-of-wedlock children are connected to the family due to maternity certainty and therefore are considered potential heirs. Avoiding illegitimate children by requiring women’s virginity before marriage protects the kin group from dispersion of property by illegitimate children and also limits the possibility of conflicting claims on the estate in which a woman has rights. In the empirical section, I use three different data sets (ethnicity-level data from the Ethno- graphic Atlas, province-level data from Italy, and individual-level data from Indonesia) to test the correlations predicted by the hypotheses. I also use individual-level data from India to provide evi- dence for the causal relationships predicted by the hypotheses. Regression analysis on the Ethnographic Atlas data confirms that female inheritance is asso- ciated with lower female participation in agriculture and higher cousin marriage, endogamy, and female premarital sex prohibitions. The reduced female participation in agriculture associated with female inheritance is of a comparable magnitude to that of plow agriculture. This is an important finding considering the famous study by [16] on the negative impact of plow agriculture on women’s participation in agriculture. As they argued, such a negative impact may carry over to beliefs about the role and participation of women in society generally. I also test the effects of traditional inheritance systems in Italy on cousin marriage and female economic participation rates. The regression analyses show that Italian provinces with an egalitarian inheritance system that included women in inheritance have had higher cousin marriage rates and lower female economic participation rates.

2It is always known who a child’s mother is, and the mother knows her children, since she produces them.

5 I also examine contemporary data from developing countries. Using the Indonesia Family Life Survey (IFLS) and the ethnographic data on inheritance practices of Indonesian ethnicities, I show that females of ethnicities that traditionally included women in inheritance are more likely to marry endogamously within the village, are more likely to be in arranged marriages, and are less likely to be self-employed. I also find that Indonesian women who inherit property—even after marriage— are more likely to be in endogamous and arranged marriages. This effect is obtained from a re- gression analysis controlling for individual characteristics and village-level, , and ethnicity fixed effects. This shows not only that different traditional inheritance systems have different con- sequences for marriage practices and social norms across societies—as established in previous em- pirical strategies—but also that different expectations of inheritance may create different marriage outcomes for individuals living in the same society. In my last empirical strategy, I use the amendment of the Hindu Succession Act in 2005—which substantially improved Hindu women’s inheritance rights on land—in a difference in differences approach to provide evidence of the causal impact of female inheritance on inmarriage and the status of women. The Hindu Succession Act applies only to (including Sikhs, Jains, and Buddhists), and explicitly exempts Muslims, Christians, , and Jews. For this analysis, I use data from the Indian National Family Health Survey (NFHS). I show that the cousin marriage rate was significantly higher and the premarital sex rate was significantly lower in the treated group, Hindu women married in or after 2005. I also provide tests to confirm the identical counterfactual trends of the two outcome variables for treatment and control groups (see Figure 2.1 for a visual inspection).

Figure 2.1: The average cousin marriage rate of women married in the five-year period (2000–2004) before 2005 amendment of the Hindu Succession Act compared with those married after the amendment increased significantly among Hindu women (from 7.93% to 8.72%, Wald test p-value<0.001), while it did not change significantly among Muslims and Christians (Wald test p-value=0.417 and 0.371 respectively).

These findings contribute to our knowledge of marriage and the status of young women in developing countries and highlight a potential unintended consequence of exogenously introduced policies to improve female inheritance in these countries. They also suggest that female inheritance affects marriage practices and the status of women even in the short term. However, this short- term mechanism has operated for a long time and for a larger share of the population in some

6 patrilineal societies that traditionally practiced female inheritance. The long-term practice of female inheritance in a patrilineal society may create persistent cultural traits and beliefs regarding marriage and the status of women that affect people—even those who do not receive inheritance—in an era in which, through industrialization, inheritance is no longer the only source of wealth and means of production. This has an important implication for the of norms encouraging inmarriage and seclusion of women in Islamic societies where Sharia has mandated female inheritance. In the Qur’an, there is no specific guidance that encourages cousin marriage [43] and no explicit prescription on the veiling of women [5]. But, of all the economic rules in the Qur’an, the most detailed are those of inheritance [163]. The Qur’an, the main source of Islamic law, explicitly states the Islamic inheritance rules in such detail (the Qur’an 4:11) that no space is left for different in- terpretations regarding female inheritance. Islamic religious authorities often paid great attention to the observance of female inheritance, while similar legal rights for women did not exist in the West until the nineteenth century [159]. , in fact, may be the only religion that formally specifies women’s inheritance rights.3 In line with my arguments in the hypotheses, this may explain why cousin marriage (with mean 32%), gender segregation, seclusion of women inside homes, and the veiling of women are most common, and female economic participation (with mean 27%) is lowest, in the Middle East and North Africa (see Figure 2.3). The paper is organized as follows. Section 2.2 reviews the literature, introduces different in- heritance systems, and discusses their origins and persistence. Section 2.3 develops the conceptual framework for the effect of female inheritance on inmarriage and the status of women. Section 2.4 and 2.5 describe the empirical strategies and present results using historical data, and contemporary data from developing countries respectively. Finally, section 2.6 presents my conclusions.

2.2 Inheritance system as a determinant of kinship pattern

Social scientists discuss marriage practices and the status of women in the larger context of kin- ship patterns, for which inheritance is considered an important determinant.4 Anthropological stud- ies clearly emphasize that inheritance can affect marriage patterns, residence arrangements, family structures, , courtship and sex, , and so on.5 To understand why the link between inheritance and kinship is important, we need to consider the institutional environment of pre-industrial societies and many contemporary developing coun-

3For example, according to the Bible, property is to be inherited by sons, and daughters inherit property only in the absence of sons (the Bible, Numbers 27).

4Max Weber perceived a kin group as “a group of expectant heirs" [257, p.365]. Lewis Morgan argued that the family grew out of the development of a knowledge of property and its transmission by inheritance, and that even in the face of other factors, “with more effective power the rights of property might influence the system of relationship" [189, p.14]. Jack Goody noted that inheritance is an institution “in which interpersonal relationships are structured" [113, p.1]. David Sabean suggested that “there is no system of obligations and duties" that is not mediated through property [225, p.171].

5See, e.g., [189, 76, 40, 116, 181, 182, 237, 232, 110, 159, 140, 234].

7 tries. In pre-industrial agricultural societies, which were characterized by imperfect capital markets [60], land as the basic source of wealth and means of production was universally transmitted be- tween close kin by the process of inheritance [112, 237]. Even today in many developing coun- tries, land sales are rare, and most land is acquired through inheritance as a non-market mechanism [213, 152]. It is not surprising that in such a world, “kinship and property are closely interlocked" [112, p.70]. I follow the literature and consider the inheritance system “as the independent (or exogenous) variable" [113, p.20] to inmarriage and the status of women. In the next subsections, after introduc- ing different inheritance systems, I discuss their origins and persistence, which reflect the chain of causation suggested in the literature.

2.2.1 Classification of inheritance systems

It is important first to define what I refer to as "inheritance systems". I classify different inheri- tance systems using combinations of possible modes of property transmission. The first dimension involves lineal versus lateral inheritance systems. In lineal inheritance systems, property is trans- mitted vertically to children. In lateral inheritance systems, property is transmitted horizontally to siblings or indirectly vertically to siblings’ children. The second dimension involves impartible ver- sus partible inheritance systems. In impartible inheritance systems, a land parcel is preserved intact from generation to generation, and only one lineal or lateral heir inherits property. The examples are (inheritance by a senior child, sibling, or sibling’s child) and ultimogeniture (inheri- tance by the junior child, sibling, or sibling’s child). In partible inheritance systems, the land parcel is not preserved intact. Instead, each parcel is divided up lineally or laterally, among some or all of the children, siblings, or siblings’ children. The third dimension involves female inclusion versus female exclusion in inheritance. To see how different inheritance systems can be characterized by the combinations of these three categories, consider the following historical examples. Primogeniture, in practice carried out by preference given to the senior son—found in , Korea, and Northwest Europe [198, 109, 245, 213, 186, 36]—, can be characterized as lineal and impartible, with female exclusion. The inheri- tance system of equal division of land property among sons—common in Eastern Europe, , , and South Asia [109, 213, 156]—can be characterized as lineal and partible, with female exclusion. Dividing inheritance among all sons and daughters—common in Mediterranean Europe, Latin America, and Islamic societies [109, 186, 163]—can be characterized as lineal and partible, with female inclusion. Lateral inheritance system was common in Africa south of the Sahara. In theory, this system could be partible or impartible, and with female inclusion or exclusion. But in practice, lateral inheritance in sub-Saharan Africa excluded women [113]. The mode of property

8 transmission can also be characterized by the absence of any rules of inheritance or any private property rights.6

Figure 2.2: Inheritance systems of 820 Ethnographic Atlas societies, distinguishing impartible and partible inheritance systems, and for the latter, distinguishing female inclusion and female exclusion in inheritance.

It might be helpful at this point to see the global distribution of inheritance systems based on the suggested classification. Using data on pre-industrial societies from the Ethnographic Atlas (see section 2.4.1), Figure 2.2 displays traditional inheritance systems around the world. The distribution of inheritance systems in the figure coincides with the historical examples mentioned above. The Ethnographic Atlas data focuses as much as possible on the characteristics of the sample of societies prior to European contact. Hence it obviously provides scant observations for Western Europe. However, based on detailed studies and data on Europe, we know that impartible inheritance was more common in Northwest Europe than any other region in [245, 248, 213, 36, 163]. See also maps from [245] and [248] in Figure A.2.1 of Appendix A.2.

2.2.2 Origins and persistence of inheritance systems

The differences between inheritance systems are thought to be deep-rooted in agricultural and po- litical organization. Capital-intensive (e.g., involving plows), open-field, and manorial agriculture might favor impartible inheritance due to economies of scale. However, agricultural organization itself is determined by geographic factors and political organization. For example, heavy soils usu- ally required large plows pulled by several horses, which were expensive and practical only on large

6For example, in many hunting and gathering societies (such as Native American societies) individuals had little property except personal equipment, which was often destroyed at death [113]. Other societies had communal ownership of land, in which individuals inherited from their parents a general right of access to the whole of the community’s resources that continued to exist after the head of the family passed away. Communal inheritance could be patrilineal (from father to sons), such as the Russian peasantry, or matrilineal (from mother to daughters), such as among the Minang in Indonesia.

9 land holdings, while sandy light soils could be cultivated by handheld tools like the hoe and digging stick on small family farms. In terms of political organization, manorial agriculture, for example, was closely linked to feudalism.7 Despite the influence of agriculture, the literature suggests that inheritance systems are best ex- plained by the political organization of societies. In societies with impartible inheritance, such as Japan or Northwest Europe, lands were controlled by powerful whose interests were best served by maintaining their holdings intact through impartible inheritance because the political and military functions associated with the estate were indivisible [236, 213, 36]. Also, in the European countries, large estates came with seats on parliamentary bodies. Therefore, property became indi- visible because the office was indivisible [36]. On the other hand, a necessary condition for partible inheritance was a strong central govern- ment [18]. In regions such as China, India, Russia, and the Mediterranean, inheritance rules were subject to the legislation of strong central bureaucracies with an interest in restricting the develop- ment of powerful landholding by fragmenting their properties through partible inheritance [258, 109, 213, 163]. But contrary to patrilineal partible inheritance in China, India, and Russia, partible inheritance in the Mediterranean region (including the Middle East) included both sons and daughters. Inclusion of women in inheritance in the Mediterranean region had Roman-Byzantine roots [156]. Again, it seems that geographic factors had a role here. For example, [258] suggests that in regions such as the Middle East, states had centralized power by controlling large-scale irrigation systems essential to the agriculture. [37] provide general evidence on this account. [184, 185] ar- gue and present evidence that a centralized Islamic state featuring redistributive principles such as partible inheritance emerged to address economic inequalities resulting from geographical features of the region—this is, unequal agricultural potential with few fertile places and a large share of arid lands—and their interaction with the diversion of trade routes in seventh-century Arabia. Consid- ering the incompatibility of female inheritance with a patrilineal system, the mere fact that female inheritance has been present within the patrilineal context of Islam and Islamic societies itself sug- gests that it was most likely introduced to Islam by an exogenous source. Whatever the deep-rooted sources are, once they determine the form of an inheritance system, its development “very much tends to follow the track that has been laid down, and is relatively inde- pendent of changing socio-economic conditions" [36, p.82]. One can find a strong continuity and a systematic pattern through all changes [116, 36]. Two important aspects of inheritance contributed to its persistence and path dependency. First, inheritance practices cannot be understood as purely individual decisions. Rather, they are regulated by secular or religious institutions and laws. Inher- itance laws frequently “continue in force long after the circumstances which first gave occasion to them" [236, p.305]. Second, inheritance is a non-market institution [213, 36].

7See [213] for a comprehensive discussion on different agricultural organizations and the involved political and geo- graphic factors.

10 For example, primogeniture was legally recognized through entails8 which were “respected through the greater part of Europe" [236, p.384]. By entail, the testator not only determined the heir, but also decides to whom the land must be bequeathed after the death of the heir. If real property was entailed, it could not be sold by the heir, and it had to be passed on automatically from generation to generation according to the succession determined by the founder. Entails prevented the division of property through sale or inheritance. Therefore, an entailed property was removed from the market process. Aside from enjoying legal recognition for centuries—until 1780 in the United States, 1848 in France, 1919 in Germany, and 1925 in Britain [36]—, supporters of primogeniture collected more than a dozen biblical verses9 to give it a Christian foundation [163]. Under strict manorial controls in Europe, even peasants had no right to divide or alienate the land [213, 156]. On the other hand, inheritance has been subject to the partible Qur’anic inheritance law10 in Is- lamic societies. Islamic inheritance law clearly subordinated personal preferences and strengthened the inheritance rights of women. The law took shape in the Mediterranean Middle East region, in Syria and , which were already accustomed to partible inheritance practices and inclusion of women. However, by entering into the text of the Qur’an, the law became a path-dependent institu- tion for all Muslim populations. Although in some Muslim populations women were excluded from inheritance, they were still more likely to inherit than their counterparts in Christian societies [49] because the Qur’an left no space for different interpretations regarding female inheritance. Finally, communal and joint property persists in many developing countries and even Europe.11 Under these “archaic regimes" [207], access to land is possible only through membership in a com- munal assembly or a joint family composed of several generations, and a single member cannot transfer or alienate their membership right. Thus, land sales and partitions are rare [207, 152, 55]. Due to its deep-rooted origins and its persistence, inheritance is a process critical to the repro- duction of the social system itself. It is true that the differences in inheritance systems were a marked feature of the pre-industrial era. But, as [116] argued, “whatever the reasons, these differences have consequences for the position of women, the structure of social roles, the behaviour of kin, and the strategies of family organization" (p.35).

8Fideikommiss (in German), Substitutions and Majorats (in French).

9E.g. ’s first-born son, , sold his “birthright" to his younger brother, , for a bowl of stew (The Bible, Genesis 25).

10Verses 11, 12, and 176 in the fourth chapter of the Qur’an, Surah An-Nisa. Islamic inheritance law limits an indi- vidual’s power of testamentary disposition to one-third of his estate, and two-thirds of the estate passes to the legal heirs of the deceased under the compulsory rules of inheritance. Legal heirs include children, spouses, parents, and siblings of both sexes. The females among these relatives only take half the share of the male relative of the same degree of relation to the decedent. However, a female must have her firm share of inheritance in all types of property left by her father.

11See, e.g., [4] for joint property in India and [188] for communal lands in some Italian towns.

11 2.2.3 Patterns of inmarriage and female economic participation

Because the authority of religious institutions over marriage is very common, religious attitudes explain a large share of the variation in inmarriage rates around the world today. Figure 2.3 (left) shows cousin marriage rates up to and including second cousins.12

Figure 2.3: Left: cousin marriage rates (up to and including second cousins) around the world [42]. Right: Labor force participation rate of women above 15 years old, 2010–2016 estimates by the International Labour Organization, with darker colors indicating lower percentages.

Christian countries have the lowest rates of cousin marriage, as they have a long religious history of discouraging this practice. The Catholic Church has prohibited cousin marriage since the sixth century, at times extending the prohibitions as far as third cousins in the eighth century and sixth cousins in the eleventh century.13 The highest cousin marriage rates can be observed in Islamic countries, where pre-Islamic Arab culture and the Islamic Sunnah (the deeds of the Prophet) might have encouraged cousin marriages [67, 43]. Figure 2.3 confirms that religious history has been an important determinant of marriage patterns in terms of cousin marriages. However, to better introduce the first hypothesis of this study, we need to take a look within a religiously homogeneous country. Although the Church banned cousin marriage, it was still possible to marry a cousin by requesting a dispensation letter from a diocese. Figure 3.3 (left) shows the cousin marriage rates up to first cousins14 (based on the number of issued dispensation letters) within an almost 100% Catholic country: Italy from 1945 to 1965. As seen in the figure, cousin marriage rates varied considerably across the Italian provinces. According to my first hypothesis, female

12This also includes marriage between first cousins once removed, and uncle-niece/aunt-nephew marriages.

13Prohibitions on cousin marriages were relaxed to third cousins in 1215, second cousins in 1917, and first cousins in 1983 [115, 43]. The Church’s family and marriage regulations might have partly formed and evolved in response to inheritance rules. According to [115], an edict of a Roman emperor allowed the Church to possess properties that did not belong to any individual. Therefore, the Church could only benefit by leaving a deceased without eligible heirs. It does not seem accidental that the Church would have condemned , cousin marriage, , adoption, and . By providing heirs, these practices could deprive the Church of property. The Church prohibited cousin marriages within seven degree of kinship because it was the degree considered for the purpose of inheritance by . Therefore, one could no longer marry anyone from whom one could have inherited. [121] also agrees that the Church’s marriage laws were “self-serving" (p.308).

14This also includes uncle-niece/aunt-nephew marriages.

12 inheritance could be the cause of the residual variation in cousin marriage rates after controlling for the religion (i.e., within the same religious denomination, as in the case of Italy).15

Figure 2.4: Left: Cousin marriage rates within Italy 1945-1965 [57]. Middle: Traditional inheritance systems within Italy [248]. Right: The rate of economically active women above 15 years old, 2011 census, the Italian National Institute of Statistics (ISTAT), with darker colors indicating lower percentages.

Figure 3.3 (middle) displays traditional inheritance systems for Italian provinces. Consistent with the hypothesis, regions in Italy characterized by egalitarian type, where women were more likely to receive property, were also historically characterized by high cousin marriage rates.16 The same correlation can also be observed between cousin marriage rates for world regions (Figure 2.3, left) and the distribution of Ethnographic Atlas societies in the regions practicing female inheritance, that is, partible inheritance by both sexes (Figure 2.2). Interestingly, regions of the world that traditionally practiced female inheritance have also ex- perienced lower female economic participation (see Figure 2.3, right). These are mostly Muslim countries also with norms favoring gender segregation, women wearing the veil, and seclusion of women in homes. The pattern also holds for Italy. Southern Italy has traditionally been associated with lower female participation and freedom than has the north17 (see also Figure 3.3, right). This correlation between female inheritance and female economic participation and freedom is a predic- tion of my second hypothesis.

2.3 Conceptual framework

An implicit assumption of studies on inheritance, such as in Adam Smith’s The Wealth of Nations, is an overlapping generations perspective, where the objective of the family head is to preserve the patrilineal succession and perpetuate his male lineage.18 This objective is embedded in the nature of

15As argued by [113], “in Europe, to some extent, the desire for like to marry like runs against the rules of the Catholic church against marrying close kin" (p.14).

16One piece of historical evidence that female inheritance can promote inmarriage is the increased cousin marriage rates among Italians with the introduction of the Napoleonic civil code in 1810, which emphasized equal inheritance rights between men and women. See, e.g., [95] (p.86), [63] (p.154), [57] (p.159).

17See, e.g., [157] (p.18) and [175] (p.51–2).

18Adam [236] suggested that primogeniture was introduced in Europe “to preserve a certain lineal succession [. . . ] and to hinder any part of the original estate from being carried out of the proposed line" (p.384). Similarly, [64] describe

13 patrilineal societies, where male lineages are the basic social and economic units, and the succession of names, lordly rights, titles, and other valuables tends to be passed on in the male line. Patrilineal systems have been historically widespread, such as in Western cultures [103].19 Many religions, specifically Abrahamic religions, offer an extensive demonstration of patrilineal relationships [see, e.g., 246]. Of course, a family head would also like to support all his sons and daughters, and their families. However, in a disorderly time (such as the pre-industrial era) or poor environment (such as in some developing countries), to divide the land among all children is to expose every parcel and family to extinction. On the other hand, leaving large land parcels for few children increases their families’ chances of survival and therefore continuation of the lineage for another generation. Three factors contribute to this trade-off between supporting all children and preserving the lineage: economies of scale, high mortality rates, and imperfect capital markets. First, division of land might decrease the total production of the offspring due to economies of scale in land size. Some forms of agricultural organization require larger land parcels. For example, plowing and harrowing can take place only on large parcels. Moreover, larger land parcels are asso- ciated with more political and bargaining power. For example, the size of a feudal estate determined the political and military power of the lord who owned it. Political power, such as a parliamentary seat, and military functions and obligations of lords were indivisible by nature [213, 36]. As Adam [236] noted, “to divide it [i.e. the land] was to ruin it, and to expose every part of it to be oppressed and swallowed up by the incursions of its neighbours" (p.383). Second, since land is the basic means of production, under high mortality rates a larger land parcel provides a family with more production and higher income which means better nutrition, hygiene, and health—in short, a better chance of survival. Third, imperfect capital markets imply rigid intergenerational mobility of income groups. Since land is the main source of wealth, a child who gets a large land parcel would stands a better chance of staying rich or even moving up the social ladder, increasing the family’s chance of survival. Emphasizing the role of high mortality rates and imperfect capital markets, [60] develops an economic model and simulations to show that even in the absence of economies of scale, the optimal strategy of a family head seeking to primogeniture as an attempt for “perpetuation of the family line" (p.176). [198] note that the purpose of primogeniture in Japan is to “keep the succession line firm" (p.11). [157] notes that in Medieval Italy, “the transmission of property through sons was in harmony with the desire to preserve the family estate intact, whereas property given to daughters was thought to be lost from the family of origin" (p.17). [239] reports that not having a son in an Indian villiage “meant the end of the lineage, a disaster that no one liked even to contemplate" (p.112). [3] presents historical examples of the “justifiable fear of extinction" of the family name among English aristocrats and the strategies that were devised to procure continuity, such as transmitting land to men prepared to trade their family names (p.43). In the case of female inheritance, strategies such as cousin marriage are described as the means “to preserve house and holdings in the male line" [63, p.154], to prevent “plots of land from escaping the control of an identifiable patrilineage" [95, p.86], and to “keep parcels of land intact and within families" [234, p.S169], which all appears to be linked to the ideology of the “family line" [232, p.64].

19“A patrilineal bias can coexist with cognatic descent groups and bilateral patterns of affiliation" [46, p.19].

14 perpetuate his lineage is primogeniture. However, he also acknowledges that economies of scale alone are enough to rationalize primogeniture.20 Therefore, a family head seeking to protect the welfare of all his children and preserve his lineage might find himself with insufficient resources to fulfill both goals. In this case, sacrificing the welfare of some of his children is the only way to reach the objective of perpetuating his lineage. Since in a patrilineal society “the male sex is universally preferred to the female" [236, p.383] and lineal succession “refers to the preservation of a family name by sons" [60, p.83], sons are preferred to daughters in the succession of land. Patrilineal primogeniture manifests the extent of the sacrifice of families to preserve male lineages: family heads attempt to prevent the extinction of the male lineage by leaving the land intact for only one son, knowing that this “beggars all the rest of the children" [236, p.384]. In imperial China, the whole clan, not just immediate family members, often pooled their money to subsidize the education of just one child, hoping that he would pass the civil service examination, and bring honor and prestige to the clan by becoming an official [60]. The patrilineal bias in succession persists even today in family farms of rural Europe [see, e.g., 211, Ch.5] and many developing countries [see, e.g., 152, for India].

2.3.1 Female inheritance and inmarriage

Bequeathing land to sons only (such as in primogeniture) is not always an option. As discussed earlier, female inheritance is mandated in some patrilineal societies—not to empower women, but most likely to restrict the development of local powers. In this section, I will describe three mech- anisms through which inmarriage arises in this specific institutional environment characterized by the intersection of a patrilineal system and female inheritance. First, preserving land within the male lineage. Where female inheritance is mandated, mar- riage of a granddaughter to an outsider means that her share of land will eventually leave the male lineage. As anthropologists have noted,21 inmarriage provides a way to avoid this problem. If the granddaughter marries a son’s son (a first-cousin marriage), her share of inheritance will still serve to perpetuate the male lineage of the family head by increasing the chance of survival of the grand- son’s family. If there is no eligible partner among the family head’s descendants, marriage of his grand- daughter to his brother’s son (a first-cousin-once-removed marriage) or his brother’s son’s son (a second-cousin marriage)—who are carriers of the same family name—could be arranged to keep

20Similarly, in an evolutionary model, [222] shows that under poor environment or positive correlation between earned and inherited wealth, the optimal strategy is to limit the number of heirs and maximize the wealth inherited by them. [135] and [228] argue that heritable wealth has reproductive value independent of number of offspring because it increases the reproductive chances of offspring by improving their quality as measured by status, health, nutrition, and so on. Therefore, a strategy, such as primogeniture, favoring offspring quality may better serve to enhance inclusive fitness in the long run. In other words, inheritance of wealth in humans alters the equilibrium favored by solely genetic inheritance of reproductive strategies.

21See, e.g., [112, 146, 137, 159, 140, 234].

15 the parcels of lands under the family name and within the higher-level segment of the male lineage (all male descendants of the family head’s deceased father). Attempts to keep the land within the larger segments of the lineage (all male descendants of a remote common ancestor) lead to mar- riage between remote (and even unidentifiable) cousins. This creates endogamy within the lineage or clan. Of all first-cousin marriages, marriage of a daughter to her father’s brother’s son is the most straightforward union to keep her inheritance within the male lineage. Her marriage to her father’s sister’s son or her maternal cousins will serve her father’s lineage only if those cousins are also members of the male lineage of the father.22 It is not surprising that marriage of a daughter to her father’s brother’s son—or equivalently marriage of a son to his father’s brother’s daughter— is the most preferential form of cousin marriage in Islamic countries, where female inheritance is mandated [146, 159].23 Following the arguments above, cousin marriage might even arise under impartible inheritance (such as primogeniture) or partible inheritance by males only, if daughters inherit property in the absence of male offspring. In his study of cousin marriage in Japan, [229] notes that a family head who has only daughters faces a problem because a wife takes the family name of her husband. “Pride in family name" and “attempts to ensure the perpetuation of the family name" leads him to “select her spouse from among her male relatives having the same family name, in which case the headship does not leave the family" (p.295). [229] presents statistics which show cousin marriage is higher among families with no male offspring. Using data from rural Bangladesh, [234] also report that women without brothers are more likely to be in cousin marriages. This calls to mind the biblical account about female inheritance in the absence of sons (the Bible, Numbers 27), in which the daughter is compelled to marry someone from the same clan as her father (the Bible, Numbers 36). Second, decreasing land fragmentation. Partible inheritance could fragment the land until it is no longer viable because of decreased economies of scale [213].24 Under the constraint of partible inheritance by both sexes, cousin marriage emerges as a solution to decrease land fragmentation. Avoiding fragmentation of land by the means of inmarriage is well documented in the literature.25 Cousin marriage provides the possibility for pooling farms and resources, and the continual recom-

22For example, if the father’s sister herself is married to a first cousin within the male lineage, her son is also considered a member of the same male lineage—although not through his mother and as a first cousin of the bride, but through his father and as her second cousin.

23Interestingly, it has been suggested that the practice of marrying a father’s brother’s daughter [159] and the veiling of women [5] were introduced into Islamic societies through the contact of with regions such as Syria and Palestine, the same regions that shaped Islamic inheritance law [163].

24There is supporting evidence in fact that partible inheritance fragments the land and that land fragmentation sacrifices economies of scale, for example in nineteenth-century France [36, p.45], and today’s China [200, 255, 242], Indonesia and India [261], and Africa [24].

25See, e.g., [116, 159, 140, 57, 234].

16 bining of portions by adding the claims of the groom and the bride in new conjugal estates. In the case of double cousin marriage—between a brother and sister of one family to a cousin sister and brother of another family—in fact no land changes hands at the marriage, which avoids land fragmentation with zero transaction costs. The same arrangement of exchanging daughters between unrelated but neighbor farmers within the same village leaves both lineages with unfragmented lands [210]. This creates endogamy within the village.

(a) (b) (c) (d)

Figure 2.5: Triangles, circles, lines, and “=" represent males, females, descent bonds, and marriage respectively. The first generation starts with two brothers, and thereafter, every family has one son and one daughter. Black triangles and circles represent anyone within the male lineage. Chart (a): Impartible inheritance: primogeniture retains the family land intact, only at the cost of creating many landless offspring. Chart (b): Partible inheritance by both sexes, and outmarriage: female offspring marrying outsiders rapidly fragments the land and also diffuses land parcels out of the male lineage, since thereafter next generations are carrying a different family name. Chart (c): Partible inheritance by both sexes, and double cousin marriage: inmarriage decreases land fragmentation and, contrary to impartible inheritance, does so without leaving landless offspring. Chart (d): Partible inheritance by both sexes, double cousin marriage, and : higher fertility (four sons and four daughters) creates the same land fragmentation in the third generation as the case with outmarriage (chart c), but still all land parcels remain under the family name.

Figure 2.5 shows that compared with outmarriage in chart (b), double cousin marriage in chart (c) keeps land parcels within the male lineage and decreases land fragmentation. To highlight the former mechanism, population growth is added in chart (d). Note that the dimension of inheritance systems relevant for inmarriage is inclusion of women, not partibility. Under partible inheritance by sons only, keeping land parcels within the male lineage is irrelevant, and there is no possibility to decrease land fragmentation by inmarriage. It is also important to recall the assumption of imperfect capital markets. Under this assumption, pooling land parcels (through markets) with landed outsiders is costly. Moreover, outmarriage might be associated with negative externalities for the community, such as the diminished stock of knowl- edge over time and conflict over inheritance (see below). Therefore, a marriage partner with a plot of land within the community is a better candidate to preserve the property of the male lineage than an outsider with the same size and quality of land plot. For the same reasons, other landed commu- nities also prefer inmarriage. Thus, outmarriage most likely attracts landless outsiders and creates

17 immigration pressure [55] such as by non-inheriting children from communities under impartible inheritance [63]. Third, decreasing conflict over inheritance. Cousin marriage also reduces the rivalries and conflicts over inheritance. In fact, the “solution of inheritance fights" was one of the situations that could be used to support a dispensation request from the Catholic Church to marry a relative [57, p.37]. By excluding outsiders, cousin marriage reduces the number of claimants on property and increases biological and cultural ties among those involved in the division of land after the death of the family head. By cousin marriage, men also take advantage of existing relationships. They know each other and have a sense of each other’s personalities and how to work together, which is an advantage for the male lineage as a group of cooperators [234]. In contrast, outmarriage might diminish local skills and the stock of knowledge over time and across generations [41]. Cousin marriage also decreases the potential for conflict among siblings over inheritance by creating overlapping interests and doubly relating them to each other through the young couple and their grandchildren [234]. A sister’s offspring will benefit from a brother’s property when it passes to his offspring. In the case of double cousin marriage, siblings’ conflict over inheritance becomes irrelevant because in the next generation, the land will be reallocated only between their grandchildren (see Figure 2.5, chart (c)). Therefore, where women are legally entitled to a share of an inheritance—such as in Islamic societies—cousin marriage offers a way to deny them inheritance but at the same time keep them content to not take legal action, such as through Sharia courts. Even if a Muslim community disinherits women in practice—despite Islamic law for female inheritance— cousin marriage could at least alleviate the potential for the conflict stemming from the mandatory law.

2.3.2 Female inheritance and the status of women

An important line of research has recently developed to study how contemporary differences in gender norms are determined by various historical factors.26 This study aims to contribute to this literature by highlighting the role of female inheritance as another pre-industrial characteristic in- fluencing gender norms. Some potential mechanisms are as follows. Arranged marriages. As discussed earlier, female inheritance promotes cousin marriage and endogamy. If not controlled, love and premarital sexual relationships might lead to marriage with outsiders because ties of descent are not central in these relationships [137, 187]. Therefore, cousin and endogamous marriages are generally arranged by parents controlling courtship and premarital sexual relationships of their children through gender segregation and restrictions on contact between opposite sexes. Under such restrictions, not only are young people less likely to form romantic relationships, have premarital sex, and have out-of-wedlock children—which makes arranging their

26Such as subsistence technology of agriculture and herding [215, 16, 102, 133, 34], language [227], geography [54], family structures [13, 250, 11], religion [127, 35, 203], historical shocks [203, 125, 264, 244, 52], and pre-industrial societal characteristics such as [107, 111, 174], modes of residence after marriage [170], and versus [25]. For more references see [103].

18 marriages easier—but they are also much more likely to meet and thus form romantic attachments to their cousins, who are among the few young people of the opposite sex with whom it is appropriate for them to socialize [113, 234]. Therefore, in cousin marriage, men marry by guarding their sisters’ ’honor’ and staying on the right side of their uncles. Women are reduced to male property and their conformity and clan loyalty are prized [80]. Restrictions on contact between opposite sexes tend to disadvantage women for several reasons. First, sexual behaviors of women can easily be screened (through virginity and unwanted pregnancy) and punished. Therefore, male relatives restricting a woman’s premarital sexuality is a means of arranging a good marriage—one that would allow access to her inheritance. Female premarital sex is usually controlled by encouraging early marriage, stressing virginity at marriage, veiling, and chaperoning post-pubertal daughters [112, 137]. Second, men have higher productivity in traditional agriculture [16, 133]. Therefore, gender segregation likely leads to women’s seclusion and more involvement in activities within the home because men are needed outside the home in the fields. In other words, gender segregation intensifies the pre-existing division of labor within the agricultural family. Moreover, whenever women appear outside the home, restrictions on contact between opposite sexes require them to wear veils, which is also incompatible with strenuous manual work in agriculture. Illegitimate children. Female inheritance also exposes the kin group to another threat from fe- male sexuality: any out-of-wedlock children of female members are undeniably tied to the kin group due to maternity certainty and therefore are considered potential heirs. Thus, maternity certainty creates unequal gender norms regarding premarital sexual contact. Historical evidence confirms that societies gave “the father the option of refusing to acknowledge an illegitimate child [which] was intended to protect the property of the legitimate paternal kin from the claims of illegitimate children" [36, p.102]. Until the early twentieth century, in France, Germany, and the United States, illegitimate children had the right to acknowledgement only in relation to the mother, took their mother’s name, and “were granted inheritance rights from the maternal side of the family much ear- lier than from the paternal side" [36, p.102]. This is still the law today in many Islamic countries.27 Any children born outside of regulated sexual relationships is to attribute free membership to the lineage [257]. Therefore, controlling women’s sexuality is a means of controlling arbitrary distribu- tion of property. Conflict and fights over inheritance play an important role here as well. Avoiding illegitimate children by controlling female premarital sex also limits the possibility of conflicting claims on the estate in which a woman has rights [112]. [112] concluded that the link between fe- male inheritance, , and veiling that marks the Mediterranean world cannot be separated from the position of women as carriers of property. Anthropological evidence suggests that within

27According to Sunni law, illegitimate children do not bear the name of the father, but that of the mother’s family. They also inherit only from the mother’s side of the family and not from the father’s. According to Shia law, illegitimate children do not inherit from either parent. However, a child who is denied paternity by the father (who is married at the time to the mother) inherits from its mothers’ family [150].

19 the same community, women with lower status and without access to land have more sexual freedom [137, 210]. Female mobility and entrepreneurship. One famous argument about the effect of different inheritance systems on economic development is that impartible inheritance stimulated the devel- opment of new economic opportunities by encouraging mobility, emigration, and entrepreneurship among landless offspring. People who did not acquire landed property had to emigrate to cities to join the industrial labor force [213, 36]. Also, the lack of any future inheritance encouraged the need for education, training, mobility, and entrepreneurship [248, 78]. The same argument can apply to landless versus landed women. All else being equal, a woman with no prospect of inheritance had more incentive to be educated and independent, to migrate and work in industrial settings, to be an entrepreneur and create her own business. In fact, women in Europe migrated more often than men because men inherited land more often and had to stay in their home village [57]. Women’s informal learning, as servants or teachers, is considered a signifi- cant contributor to the intergenerational transmission and accumulation of human capital during the industrial revolution [90]. These factors, as possible consequences of women’s exclusion from inheritance, might have contributed to women’s higher social and economic participation in Europe. In the Middle East, on the other hand, women as carriers of property were immobile, engaged in cousin marriages, and under the strict control of their male relatives. The arguments presented above on the relationship of female inheritance with inmarriage and the status of women can be summarized in two hypotheses. Hypothesis 1: Female inheritance has a positive effect on cousin marriage, endogamy, and arranged marriage. Hypothesis 2: Female inheritance has a negative effect on female economic participation and premarital sexual freedom.

2.4 Empirical strategies and results using historical data

In this section, I provide historical and big-picture evidence on the correlations predicted by my two hypotheses. In section 2.4.1, I describe historical data on pre-industrial societies from the Ethno- graphic Atlas to test the association of female inheritance with outcome variables. I show that female inheritance is associated with higher cousin marriage and endogamy (as stated in hypothesis 1) and lower female participation in agriculture and premarital sexual freedom (as stated in hypothesis 2). In section 2.4.2, I use mid-twentieth-century Italian province-level data to show the correlation of egalitarian inheritance system with higher cousin marriage rates and lower female economic partic- ipation rates.

20 2.4.1 Pre-industrial societies

The Ethnographic Atlas28 includes data on 1,291 pre-industrial societies distributed globally and mostly sampled between 1800 and 1950, ranging from societies with complex agricultural economies and political systems to small hunter-gatherer groups. Table B.3.1 of Appendix B.3 shows a detailed description of all variables used in this section. If a variable is defined in previous empirical studies using the Ethnographic Atlas, I have used exactly the same definitions to construct the variable, in which case the related study is also mentioned in the table. Inheritance systems. Earlier, I defined female inheritance as partible inheritance by both sexes. Therefore, I construct indicator variables for inheritance systems using entries in the Ethnographic Atlas on female inclusion in inheritance (EA074) and partibility of inheritance ( EA075). With the available data, we can clearly characterize the inheritance systems of 820 Ethnographic Atlas societies29 as summarized in Table 2.1.

EA074 (1) (2) (3) & (6) (4) & (5) (7) Absence of Younger Sons and EA075 private property Sisters’ sons brothers daughters Sons only Total Percent

(1) Equally distributed 0 10 20 86 191 307 %37.44

(2) Best qualified 0 0 9 3 8 20 %2.44

(3) Ultimogeniture 0 0 2 1 19 22 %2.68

(4) Primogeniture 0 11 107 8 122 248 %30.24

(9) Absence of 223 0 0 0 0 223 %27.20 private property

Total 223 21 138 98 340 820

Percent %27.20 %2.56 %16.83 %11.95 %41.46 %100.00

Table 2.1: Intersection of entries EA074 and EA075 of the Ethnographic Atlas data.

Based on the two entries, I construct dummy variables for the following categories: impartible inheritance (categories 2, 3, and 4 of EA075), partible inheritance by males only (intersection of category 1 of EA075 and categories 2, 3, 6, and 7 of EA074), partible inheritance by both sexes (intersection of category 1 of EA075 and categories 4 and 5 of entry EA074), and absence of private property (category 1 of entry EA074, or equivalently category 9 of entry EA075). Figure 2.2 seen earlier followed this categorization to portray the distribution of inheritance systems globally for Ethnographic Atlas societies. Dependent variables. As a measure of cousin marriage in the Ethnographic Atlas societies, I use entry EA023 on the rules governing cousin marriage. I construct a variable for cousin marriage that takes on integer values ranging from 1 to 4, where higher values indicate tighter cousin marriage culture. The highest value is assigned if first-cousin marriage is allowed with a father’s brother’s daughter, since this is the ideal marriage to keep the property within the male lineage. Value 3 is

28[196, 33, 120, 160, 45, 158]

29EA074 is available only for 856 societies, and EA075 is available only for 820 societies.

21 assigned if marriage with any first cousins except a father’s brother’s daughter is allowed. Value 2 is assigned if only second-cousin marriages are allowed. Finally, value 1 is assigned if no first- or second-cousin marriages are allowed. To construct a variable for endogamy, I use entry EA015, which classifies the prevalence of local endogamy, agamy, and . I construct a variable for endogamy, which takes on integer values ranging from 1 to 3, where higher values indicate higher endogamy: value 1 is assigned for exogamous communities, value 2 is assigned for societies with no marked tendency toward endogamy or exogamy, and value 3 is assigned for endogamous communities. Entry EA078 classifies prevailing standards of sexual behavior for unmarried women. I construct an indicator variable for female premarital sex prohibition, which takes value 1 if premarital sex of unmarried women is precluded or prohibited, and value 0 if it is permitted. Finally, to construct a variable for female participation in agriculture, I follow [16] and construct a variable that takes on integer values ranging from 1 to 5, where higher values indicate more participation of women in agriculture. Control variables. Table 2.2 lists all control variables used in the regression analyses. To test hypotheses with the Ethnographic Atlas data, I report three sets of regressions, which in addition to dummy variables for inheritance systems include the following set of controls: first, the regression specification from [16] as the specification with minimal controls; second, a large set of ethno- graphic and geographic controls, including those from [16]; third, the set of robustness controls, including those mentioned as “historical controls" by [16]. See Table B.3.1 of Appendix B.3 for a detailed description of all dependent and control variables.

Minimal controls Traditional plough use, Settlement , Political hierarchies, Presence of large animals, Tropical climate, Suitability for agriculture Maximal controls Ethnographic controls: Traditional plough use, Non-irrigated intensive agriculture, Irrigated intensive agriculture, Settlement complexity, Political hierarchies, Presence of large animals, Year society sampled, Patrilineal descent, Matrilineal descent, Dowry, Brice price Geographic controls: Latitude, Mean temperature, Temperature predictability, Mean precipitation, Precipitation predictability, Tropical climate, Suitability for agriculture, Distance to coast, Slope, Ruggedness, Elevation Robustness controls Population, Proportion of subsistence from herding, Proportion of subsistence from hunting, Proportion of subsistence from gathering, Patrilocal marriages, Matrilocal mar- riages, , Nuclear family

Table 2.2: Set of control variable used in different specifications for the Ethnographic Atlas data analysis.

The set of control variables controls for possible factors that can affect inheritance systems, inmarriage, and the status of women, including agricultural organization, economic and political development, subsistence economy, kinship structure, transfers at marriage, and a large set of geo- graphic variables. Results. Table A.2.1 of Appendix A.2 reports descriptive statistics for all dependent and control variables based on available observations in the full sample. The number of observations varies

22 across regressions due to missing observations in the outcome variables. Tables A.2.2–A.2.9 of Appendix A.2 report descriptive statistics for the sample of societies in regression analyses of each outcome variable, as well as full regression results with minimal, maximal, and robustness controls. Since the results are robust to inclusion of robustness controls, here I report only the mini- mal and maximal regression results. Table 2.3 reports the results for cousin marriage, endogamy, and female premarital sex prohibition. The omitted category in the regressions is impartible inher- itance. Consistent with the hypothesis, partible inheritance by both sexes has a significant positive correlation with cousin marriage, endogamy, and female premarital sex prohibition. The estimated coefficients of the dummy variable for partible inheritance by both sexes are large considering the standard deviations of dependent variables, and also comparing with the coefficient of the dummy variable for traditional plough—which is considered as an important determinant of pre-industrial characteristics of societies. The effect of partible inheritance (by males only) is insignificant in the regressions, confirming that the dimension of inheritance systems relevant for these outcomes is female inheritance, not partibility of inheritance.

Female premarital Cousin marriage Endogamy sex prohibition (SD=0.990) (SD=0.580) (SD=0.499) VARIABLES (1) (2) (3) (4) (5) (6)

Partible inheritance by males only 0.046 0.145 0.011 0.013 0.032 0.005 (0.098) (0.096) (0.056) (0.056) (0.062) (0.064) Partible inheritance by both sexes 0.284* 0.376** 0.365*** 0.233*** 0.258*** 0.238*** (0.151) (0.150) (0.078) (0.079) (0.081) (0.083)

Absence of private property yes yes yes yes yes yes

Minimal controls yes yes yes Maximal controls yes yes yes

Observations 651 651 686 686 427 427 R-squared 0.088 0.218 0.067 0.144 0.138 0.171

Table 2.3: OLS estimates are reported with robust standard errors in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

One concern with estimations based on Ethnographic Atlas data is spatial auto-correlation (across nearby units). Table A.2.10 of Appendix A.2 reports OLS estimates for maximal regression specification with [66] standard errors for spatial dependence with cutoffs of 60 decimal degrees. The coefficients remain significant at 1% and 5% levels. Since the outcome variables are ordinal and not cardinal, I also report regression results with maximal specifications using binary and ordered logit estimations in Table A.2.11 of Appendix A.2. The coefficients remain significant at 1% and 5% levels. Finally, I control for region fixed effects to address concerns about possible confounding with religion and culture. Christianity has a long history of discouraging inmarriage, which happens to coincide with patrilineal inheritance. In contrast, Islam has long encouraged both inmarriage and female inheritance. Therefore, all reported correlations above might just be the result of such historical coincidences. Based on the Ethnographic Atlas data, I created 17 subcontinent regions,

23 importantly including 7 regions within the Islamic world: 2 in its core (Middle East and Northern Africa) and 5 in its periphery (Caucasus, Middle Asia, Indian Subcontinent, Malesia and Papuasia, and Northern Tropical Africa) where Islam might be mixed with other religions or local traditions. I also defined 4 regions within the Christian world: Northwest Europe, Southeast Europe, Eastern Europe, and Siberia. Table A.2.12 of Appendix A.2 includes a map of the regions and reports the regression results using region fixed effects with standard errors clustered in the region level. Despite region fixed effects washing away most of the variation across societies, partible inheritance by both sexes is still significant in all regressions at 5% or 10% levels.

Female participation in agriculture (SD=1.018) (SD=1.036) (SD=0.985) (SD=0.985) (SD=0.985) (1) (2) (3) (4) (5) VARIABLES Alesina et al. (2013) Updated data Restricted sample and Inheritance Updated controls

Partible inheritance by males only -0.244** -0.190** (0.097) (0.092) Partible inheritance by both sexes -0.570*** -0.278** (0.128) (0.129) Traditional plough use -0.883*** -0.848*** -0.751*** -0.615*** -0.397*** (0.114) (0.107) (0.120) (0.119) (0.146)

Absence of private property yes yes

Minimal controls yes yes yes yes Maximal controls yes

Observations 660 713 489 489 489 R-squared 0.135 0.135 0.143 0.183 0.326

Table 2.4: OLS estimates are reported with robust standard errors in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

In Table 2.4, I focus on female participation in agriculture, which is highlighted in the famous study by [16]. They confirm the hypothesis that traditional plough use has decreased female partic- ipation in agriculture. Regression 1 of Table 2.4 replicates the regression result by [16] using the same specification and data.30 The resulting coefficient for plough agriculture is the same as in [16]. In column 2, I estimate the same regression specification, but using the version of the Ethnographic Atlas data used in this study.31 The coefficient is almost the same. In column 3, the sample is re- stricted to observations with available inheritance data. Plough agriculture remains significant, and the coefficient is not significantly different than column 2 (Wald test p-value=0.105). Column 4 introduces the controls for inheritance systems. Both plough use and partible in- heritance by both sexes are significant. This decreases the coefficient on plough use (Wald test

30See regression 1 of Table 4 in [16]. The data is available online at http://scholar.harvard.edu/nunn/home.

31The new version of the Ethnographic Atlas used in this study includes more societies. Also, the data is improved by linking each data point to one or more of the 3,502 ethnographic sources. Moreover, data sources and radius around societies used for tropical climate and suitability for agriculture are different than those in [16]. With available data from [16], it is possible to run the same regressions with the minimal controls. Table A.2.14 of Appendix A.2 presents the regressions, indicating that results are robust across two versions of the Ethnographic Atlas.

24 p-value=0.000) relative to column 3. Moreover, the coefficient of partible inheritance by both sexes is not significantly different than the coefficient of plough use (Wald test p-value=0.814). Contrary to regressions on other outcome variables, here I am cautious not to attribute the full magnitude of the coefficient of partible inheritance by both sexes to the described mechanism in the concep- tual framework. This is because partible inheritance by males only is also significant—although smaller than partible inheritance by both sexes (Wald test p-value=0.013) and insignificant when region dummies are included (see Table A.2.12 of Appendix A.2). This could indicate that a dif- ferent mechanism also partly contributes to the difference in coefficients of impartible and partible inheritances.32 In regression 5, where I use maximal controls, the coefficient of partible inheritance by both sexes is still significant and not significantly different from the coefficient of plough use (Wald test p-value=0.566). For full regression results with minimal, maximal, and robustness con- trols, estimates with Conley standard errors, ordered logit model, and region fixed effects, see Tables A.2.8–A.2.13 of Appendix A.2. The results are robust. 33

2.4.2 Mid-twentieth-century Italy

Data on cousin marriage rates and inheritance systems of Italian provinces is available for 1950– 60s, as presented in Figure 3.3. For inheritance systems of Italian provinces, I use the most detailed data available for European regions, which is provided by [248] mostly based on censuses from Western European countries in the 1950–1960s. According to Todd, family types in Italian provinces can be classified as communitarian, incomplete stem, or egalitarian nuclear. While inheritance in communitarian and incomplete stem family types was patrilineal, in the egalitarian nuclear family type common in south and north west Italy, inheritance was equally divided among children, and women—at least in southern Italy—were likely to inherit.34 Table B.3.2 of Appendix B.3 presents a full description of the data and variables used in this section.

32One possible mechanism described by [250] is that under impartible inheritance, usually two generations live together (stem family). This gives wives more time to work on the farm because co-residence with the mother-in-law reduces the burden of work, freeing up the wife’s time for non-domestic work. This requires the assumption that on the other hand, under partible inheritance, all sons start their own independent .

33In the above analyses, I have not distinguished between lineal and lateral inheritance systems. For example, impartible inheritance category also includes primogeniture by sister’s sons and younger brothers. In Table A.2.15 of the appendix, I define separate indicator variables for lateral inheritance systems—that is, inheritance by matrilineal heirs (such as sister’s sons) and inheritance by patrilineal heirs (such as younger brothers). Regression analyses report qualitatively similar results for inheritance by both sexes.

34The Neapolitans are one pre-industrial society recorded in the Ethnographic Atlas within the borders of today’s Italy. Neapolitan was and still is the language of much of southern Italy, with 6 million native speakers today mostly in the Campania and regions [172]. The Ethnographic Atlas indicates that the Neapolitans practiced partible inheritance of land by both sexes. [109] also describe the inheritance system of southern Italy as bilateral inheritance (i.e., by both sexes). The Ancient Romans are the other Ethnographic Atlas society within the borders of today’s Italy. They also practiced partible inheritance by both sexes—in which the prevalence of female inheritance in the Mediterranean region was rooted [156].

25 To test whether traditional female inheritance can explain the variation of cousin marriage rates in Italy—as stated in hypothesis 1—I run a series of regressions controlling for economic, de- mographic, and geographic factors. Cousin marriage data is from [57] based on records of Papal dispensations kept in the Vatican archives. The data is available for all provinces between 1945 and 1964. Since the available historical demographic and economic data belongs to 1960s, I use only the average of the last five-year period (1960–64). I control for two variables as indicators of the economy of Italian provinces at the same time period: GDP per capita and share of agriculture in the GDP for the regions they belong to, both averaged over 1963 and 1964. I also control for the population of provinces using the available data on population of provinces in 1961. Similarly to the Ethnographic Atlas analysis, I also control for a wide range of geographic controls such as ab- solute latitude, mean temperature, mean precipitation, an indicator for tropical climate, suitability for agriculture, distance to coast, slope, ruggedness, and elevation. To make sure that the effect of inheritance systems is not due to confounding cultural and histor- ical factors, I also control for two key variables. First, a well-known fact about Italy is the cultural differences between the south and north [29, 214]. Therefore, I include a dummy variable in the regressions for the provinces in so-called Mezzogiorno regions: southern Italy and the islands. In- cluding this dummy variable assures a stringent test of the hypothesis because southern Italy and the islands also happen to have the highest cousin marriage rates (mean cousin marriage rate of 1.377 compared to 0.306 in other regions, mean-comparison test p-value<0.001, N=101) and the egalitarian inheritance system. Second, regions in central Italy with the communitarian family type are also characterized by historical domination of the Papal States. The Papal States were territo- ries under the sovereign direct rule of the pope, from the eighth century until Italian unification in 1871. Therefore, it is plausible to expect stricter historical emphasis on cousin marriage bans and lower cousin marriage rates in the Papal States (0.412 compared to 0.976 in other regions, one-sided mean-comparison test p-value=0.063, N=101). This might confound with inheritance by sons in the communitarian family type in the region, which allows outmarriage, according to the conceptual framework. Columns 1 and 2 of Table B.5.13 show regression results on cousin marriage percentage. The egalitarian inheritance system is highly significant in both specifications, and it is associated with almost 0.4 higher percentage points of cousin marriage (by 35% of the standard deviation) in the full regression specification. The magnitude of the coefficient is not significantly different from the dummy variable for the Papal States (Wald test p-value=0.129). This confirms the hypothesis that where women are more likely to inherit, inmarriage is more likely. Columns 1 and 2 of Table B.5.13 show regression results on female participation percentage. Using data from census 2011, the variable is defined as the percentage of currently economically active women (either employed or unemployed) aged 15 or older from the total population of a province. For these regressions, I use province-level data on GDP per capita and share of agricul- ture averaged over 2000–2013. In the full regression specification, the egalitarian inheritance system is associated with 3.75 percentage points lower female participation (by 68% of the standard devia-

26 Cousin marriage percentage Female participation percentage (SD=1.149) (SD=5.529) VARIABLES (1) (2) (3) (4)

Egalitarian nuclear family 0.782*** 0.407*** -4.731*** -3.753*** (0.148) (0.124) (0.937) (0.676) Incomplete stem family -0.061 0.461** 1.986* -1.367 (0.103) (0.195) (1.118) (0.955) GDP per capita -0.299*** -0.028 12.260** 9.929** (0.061) (0.073) (5.708) (4.229) Share of agriculture -1.031 2.058 -60.338*** -12.286 (1.865) (2.182) (15.308) (12.620) South and Islands -0.875** -2.477 (0.408) (1.555) Papal states -0.711*** -0.746 (0.220) (1.357)

Geographic controls yes yes

Observations 101 101 101 101 R-squared 0.592 0.874 0.412 0.822

Table 2.5: OLS estimates are reported with robust standard errors in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels. Descriptive statistics and full regression results are reported in Table B.5.14 of Appendix A.2. tion). This is consistent with the historical accounts on the links between female inheritance, culture of honor, and lower female participation in southern Italy and the Mediterranean Europe in general [157, 112].

2.5 Empirical strategies and results using contemporary data from developing countries

In developed countries, economic empowerment of women increases their control over decisions and bargaining power within the household, and it creates positive outcomes such as reduced do- mestic violence against women [see, e.g., 6, 21]. However, it is not clear that this relationship exists in all developing countries. Some studies suggest that policies designed to empower women in de- veloping countries, such as female inheritance, create positive outcomes for women [178, 69, 134, 224, 141, 19, 22]. However, others report unintended negative consequences, such as increased fe- male child mortality [223], domestic violence and suicides [23, 126], and so on [for other studies see, e.g., 176]. In their patrilineal nature, many developing countries resemble the pre-industrial world and fit the conceptual framework of my study. The effect of this patrilineal bias may be even more signifi- cant for women’s premarital and marital outcomes—the focus of my study—than their postmarital outcomes. This is because women are likely to exhibit different household bargaining allocations depending on their roles as spouses or daughters. As spouses, women who inherit and own prop- erty might gain higher bargaining power against their husbands for household decisions such as children’s education [75]. However, women’s premarital and marital outcomes are formed in their

27 parental households when they are very young.35 As daughters, they either have not received in- heritance yet—because their fathers are still alive—or they have received inheritance but have not much opportunity to gain bargaining power in premarital and marital decisions made for them in the paternal household. In this section, I present findings on women’s premarital and marital outcomes using individual- level data from Indonesia and India. In section 2.5.1, using data from Indonesian Family Life Sur- veys, I examine the association of female inheritance with endogamy and arranged marriage—as stated in hypothesis 1—and economic participation—as stated in hypothesis 2. Then, in section 2.5.2, I use the national amendment of the Hindu Succession Act in 2005 for a difference in dif- ferences analysis of the effect of the amendment on cousin marriage and premarital sex rates—as predicted by hypotheses 1 and 2 respectively—among Indian women.

2.5.1 Individual-level analysis using Indonesian data

The Indonesian Family Life Survey (IFLS) is a longitudinal survey conducted between 1993 and 2014 in Indonesia. The sample is representative of about 83% of the Indonesian population and con- tains over 30,000 individuals of 7,224 households living in 13 of the 26 provinces, which captures the cultural and socioeconomic diversity of Indonesia. I use data on male and female adults (household heads, their spouse, and selected seniors in the household) to construct outcome variables.36 The IFLS has no information on cousin marriages, but the data allows creation of a proxy for endogamy. In the case of female inheritance, we should expect women to marry their relatives or neighbours more often, which usually means marrying from the same village. In contrast, where women are excluded from inheritance, men inherit the land and stay in the same village, but landless women are likely to outmarry and join their husbands in other villages. Therefore, I construct a variable for endogamy that indicates staying within the same village after marriage. The constructed indicator variable for endogamy takes value 0 if the respondent moved to another village after the in the latest marriage, and takes value 1 otherwise. I dropped a few (less than 0.02%) observations where the respondents didn’t start living with their spouse after the wedding.37 The second outcome variable indicates whether the respondent’s marriage was an arranged mar- riage (versus a ). Respondents are asked who chose their spouse in their first marriage. I create a dummy variable for arranged marriage that takes value 1 if parents or family members chose the respondent’s spouse, and takes value 0 if the respondent chose their spouse by them-

35In my study samples, most women in Indonesia and India married before age 20.

36Panel respondents were not asked the related questions in the IFLS (2000) if they had been asked in the IFLS (1997). Therefore, I had to merge IFLS (2000) with IFLS (1997) to have a full sample.

37In unreported regressions on endogamy, to make sure that moving to the new residence after wedding was not temporary, I dropped some observations (around 11%) where the respondent stayed less than a year in their first residence (whether in another village or within the same village) after the wedding. The results do not change.

28 selves. It is positively correlated with the endogamy variable (Spearman’s r=0.048, p-value=0.000, N=13819). While 23% of women are engaged in arranged marriages, this percentage for men is only 13%. This difference—which also shows up in the coefficients of female inheritance in the re- gression analyses—reflects another aspect of unequal gender norms. Men are more likely to choose their wives and ask their parents to arrange the marriage. Therefore, what is a choice for the husband can be perceived as an arranged marriage by the wife. Another aspect of unequal gender norms in arranged marriages is the underage marriage (below age 16 in Indonesia). More than 90% of un- derage marriages involve girls, of which 37% are arranged marriages; among women married at the legal age, this percentage is only 20%. For economic participation as my third outcome variable, I create three indicator variables to capture any economic activity by the respondent. The first variable takes value 1 if the respondent is self-employed, and takes value 0 otherwise. The second variable takes value 1 if she is employed in public or private sectors (working for others), and takes value 0 otherwise. The third variable takes value 1 if she works unpaid for her family. To define these variables, I dropped observation for those above 60 years old.

Indonesian ethnic groups with female inheritance tradition: Here, I investigate the effect of traditional inheritance systems of Indonesian ethnicities on the outcome variables. Following the strategy in [25], I use ethnographic data from the Ethnographic Atlas and Ethnic Groups of Insular Southeast Asia [169] to link individuals to their traditional inheritance systems through their self- reported ethnicity in the IFLS. Then I construct an indicator variable for female inheritance, which takes value 1 if the ethnicity to which the individual belongs has traditionally included women in inheritance.38 In the regression analysis, following [25], I control for education (five categories), quadratic in marriage age, and ethnicity control variables, and I cluster standard errors at the ethnicity level. In addition to their ethnicity controls (traditional presence of female-dominated agriculture and tradi- tional matrilinearity), I also control for bride price practice from their study.39 Moreover, I control for religion fixed effects (Muslim, Protestant, Catholic, Hindu, and other), quadratic in age, type of residence (urban or rural), and province fixed effects. The latter is relevant because ethnicities that traditionally excluded women from inheritance have a majority in three provinces only. Table 2.6 reports regression results for the sample of female and male adults separately. Table A.2.17–A.2.20

38The IFLS (2000) contains information on 23 ethnicities. As described in Table B.3.3 of Appendix B.3, 14 of the ethnicities (around 75% of the sample) can be directly matched to the ethnic groups for which the inheritance information is available from ethnographic records, meaning that the name of the society in ethnographic records and the name of the ethnicity in the IFLS are the same, and based on the coordinates in ethnographic records, the society is located in the same regions that the IFLS surveys took place.

39Following [25], female-dominated agriculture is a dummy variable that takes value 1 if female participation in agri- culture is higher than male participation. The matrilineal dummy variable takes value 1 for Minang ethnicity (the only ethnicity in IFLS sample with matrilineal inheritance). The dummy variable for bride price takes value 1 if the ethnicity practices bride price.

29 of Appendix A.2 show the descriptive statistics of variables, full regression results, and logistic estimation with qualitatively similar results.

Economic participation Endogamy Arranged marriage Self Public/Private Family (SD=0.457) (SD=0.424) (SD=0.441) (SD=0.405) (SD=0.357) VARIABLES (1) (2) (3) (4) (5)

Panel A: Women’s sample Female inheritance 0.111*** 0.063*** -0.086*** 0.018 0.005 (0.014) (0.018) (0.011) (0.016) (0.036) Education -0.049*** -0.029** -0.018*** 0.035*** -0.018*** (0.004) (0.011) (0.004) (0.008) (0.003) Urban -0.098*** -0.053*** -0.005 0.049*** -0.132*** (0.009) (0.014) (0.021) (0.014) (0.016)

Observations 7,575 7,705 6,645 6,645 6,645 R-squared 0.058 0.231 0.074 0.063 0.089

Panel B: Men’s sample Female inheritance -0.018 0.023* -0.022 0.041 -0.006 (0.019) (0.012) (0.025) (0.029) (0.003) Education -0.041*** -0.019*** -0.056*** 0.056*** -0.002* (0.003) (0.005) (0.010) (0.011) (0.001) Urban -0.114*** -0.035*** -0.176*** 0.176*** -0.012** (0.009) (0.004) (0.019) (0.015) (0.004)

Observations 6,045 6,116 5,180 5,180 5,180 R-squared 0.083 0.130 0.123 0.126 0.016

Quadratic in age yes yes yes yes yes Quadratic in marriage age yes yes yes yes yes Ethnicity Controls yes yes yes yes yes Religion FE yes yes yes yes yes Province FE yes yes yes yes yes

Table 2.6: OLS estimates are reported with robust standard errors, clustered at the ethnicity level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

Consistent with the hypotheses, women from ethnicities with traditional female inheritance are more likely to be engaged in endogamous and arranged marriages. The coefficients are significant and large considering the standard deviation of dependent variables and also compared with coef- ficients of other important variables such as Catholicism, education, and urban residence. Women from ethnicities with traditional female inheritance are also less likely to be self-employed. In the men’s sample, female inheritance is insignificant in all regressions except for arranged marriage. However, the coefficient of arranged marriage is smaller than the one in the women’s sam- ple (Wald test p-value=0.086), likely due to unequal power of sexes in their choice of spouse—as discussed before. Insignificance of the coefficient in the endogamy regression for the men’s sample might imply that an ethnicity with female inheritance is different from an ethnicity without female inheritance in migration rates of brides (not grooms) across villages. Since men inherit land under both regimes and stick to their land in the village, female inheritance should not affect the average endogamy rate of men across regimes.

30 Indonesian women receiving inheritance: Here, instead of traditions of the ethnic groups, I focus on the actual inheritance individuals received. In the first survey of the IFLS, adults are asked if either of their parents died, and if so, whether and what the parent bequeathed to them: house, land, livestock, jewellery, or money. Using this information, I construct an indicator variable for receiving inheritance that takes value 1 if the respondent inherited house, land, and livestock (properties subject to economies of scale) from either parents, and takes value 0 otherwise. Of the sample, 36% reported that they received an inheritance in the form of house, land, and livestock. To capture the effect of received inheritance on individual-level outcomes, I follow a strict strategy. The smallest administrative division in Indonesia (and in the IFLS data) is community (desa/kelurahan). Therefore, controlling for fixed effects of 312 IFLS communities, I make sure there is no confounding due to economic, geographic, and other differences across communities. In other words, the regressions capture only the variation of the inheritance indicator within com- munities, not across communities. Using fixed effects of religions and ethnicities, I also control for variation due to different religious and ethnic attitudes. In addition, I include controls from previous regressions: education, quadratic in age, and quadratic in marriage age. I also cluster standard errors at the community level.

Economic participation Endogamy Arranged marriage Self Private/Public Family VARIABLES (1) (2) (3) (4) (5)

Panel A: Women’s sample Inheritance dummy 0.050*** 0.035** 0.029 -0.006 0.007 (0.015) (0.015) (0.019) (0.013) (0.012) Education -0.034*** -0.050*** -0.029*** 0.054*** -0.020*** (0.011) (0.008) (0.009) (0.011) (0.006)

Observations 4,231 4,289 3,656 3,656 3,656 R-squared 0.179 0.382 0.161 0.194 0.338

Panel B: Men’s sample

Inheritance dummy 0.065*** 0.020 0.041** -0.038** 0.002 (0.015) (0.014) (0.019) (0.019) (0.003) Education -0.032*** -0.026*** -0.095*** 0.091*** -0.001 (0.009) (0.007) (0.010) (0.010) (0.001)

Observations 3,702 3,776 3,059 3,059 3,059 R-squared 0.180 0.301 0.348 0.346 0.121

Quadratic in age yes yes yes yes yes Quadratic in marriage age yes yes yes yes yes Religion FE yes yes yes yes yes Community FE yes yes yes yes yes

Table 2.7: OLS estimates are reported with robust standard errors, clustered at the community level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels. See Tables A.2.21 and A.2.23 for the regressions including ethnicity fixed effects.

Since the ethnicity information was collected in 2000 for the first time, there is a rate of attrition. Table 2.7 presents results for both male and female adults without ethnicity fixed effects. The re- sults are unchanged when ethnicity fixed effects are controlled for. Tables A.2.21–A.2.24 Appendix

31 A.2 show the descriptive statistics of variables, full regression results, including those with ethnic- ity fixed effects, and logistic estimations with qualitatively similar results. Unreported regression analyses show that the results are also unchanged if standard errors are clustered at the province level. Receiving inheritance from parents is highly significant in endogamy and arranged marriage regressions in the women’s sample, and still comparable with the coefficient of education. The coefficient of inheritance in the regression on endogamy in the men’s sample is not significantly different than in the women’s sample (Wald test p-value=0.456 and 0.861). This makes sense, since men and women are sampled from the same communities, and endogamy entails staying within the same community after marriage. However, the coefficients in arranged marriage regressions in the men’s sample are smaller and insignificant. Again, since both men and women are sampled from the same communities, this reflects the different perception men and women have about the nature of their marriages. Men have more freedom choosing the spouses who are arranged to marry them. Inheritance is insignificant in regressions on all three economic participation variables in the women’s sample. While inheritance is associated with higher self-employment and lower private/public sector employment in the men’s sample, it doesn’t show a positive association with self-employment in the women’s sample. Recall that in the ethnic-level regression analyses, we found a negative asso- ciation between female inheritance practice and self-employment. Therefore, higher self-employment in the men’s sample may be considered the positive effect of inheriting property in a developing world, which is missing for women, likely due to the patrilineal restrictions on women’s empower- ment.

Inheritance expectations: There is a possibility of reverse causality in the above analyses. That is, those who marry within the village or by their parents’ choice are more likely to inherit property from their parents just because they are closer to them physically or emotionally. To address this concern, I split the inheritance indicator into two separate variables: an indicator variable called post-marriage inheritance that takes value 1 if the respondent inherited house, land, and livestock after their marriage, and an indicator variable for pre-marriage inheritance which takes value 1 if the respondent inherited house, land, and livestock before their marriage. These categories are defined by comparing the respondent’s date of marriage and the date of death of the parent who left the inheritance, and are based on the assumption that the inheritance is transferred upon the death of the parent. Around 54% of respondents received inheritance post-marriage. Table A.2.25 of Appendix A.2 reports regression results for the full sample (including both men and women), first with the inheritance variable as defined in earlier analyses, then with pre- and post-marriage inheritance variables.40 Just as the inheritance variable, both post- and pre-marriage

40When pre- and post-marriage inheritance categories are used for male and female samples separately, I end up with around 700 observations for each inheritance-sex combination, which is not enough to get robust results. Therefore, here I ignore the male and female distinction, and instead focus on the pre- and post-marriage distinction.

32 inheritance indicators are significant in the regressions. The Wald test results indicate that the dif- ference of the coefficients is not significant. This implies that expecting to inherit property has the same association with marriage outcomes as actually inheriting the property. The potential for fu- ture inheritance can be predicted based on ethnicity, religion, and local customs, and whether the parent has a property to bequest. Those with high potential to receive an inheritance are subject to the same marriage restrictions as they would be if they actually received the inheritance.

2.5.2 Difference in differences analysis using Hindu Succession (Amendment) Act of India

Since 1956, property rights for Hindus in India (including Sikhs, Jains, and Buddhists) have been governed by the Hindu Succession Act (HSA). The Act applies to all states except Jammu and Kashmir, and it explicitly exempts Muslims, Christians, Parsis, and Jews. As in traditional Hindu law, under the HSA women had no rights to joint family property (including land and other ancestral assets). Since 1956, some states have amended the Act so that both sons and daughters have a right to joint family property ( in 1976; in 1986; in 1989; and in 1994). However, these amendments applied only to women who were not yet married at the time of the reform in their state. In the other 29 states,41 men remained the sole joint heirs of family property until 2005. Under the Hindu Succession (Amendment) Act of 2005, all daughters, including married daughters, are also joint heirs in family property such as agricultural land.42

Figure 2.6: Cousin marriage rates among Hindu women across Indian states.

41Excluding Jammu and Kashmir, which was exempt from the Hindu Succession Act, and , which was part of Andhra Pradesh until 2014.

42See [4] and [23] for more details on the HSA and its amendments in different states. The Hindu Succession (Amend- ment) Act was passed on September 5, 2005, and applied to any disposition, alienation, and partition of property that had taken place after December 20, 2004.

33 The states that passed amendments to the HSA before 2005 are the farthest southern states, whose traditional schools of law (the Madras and Bombay sub-schools) were more gender equal, and they agreed to include female inheritance rights when the HSA was passed in 1956. However, the northern states dismissed the idea by a majority vote, and the traditional laws of female exclusion in joint property were maintained until 2005 [23]. Interestingly, Hindus in southern India have his- torically experienced much higher cousin marriage rates compared with Hindus in the other states (see Figure 2.6); in the sample of those married in or after 2000, the rate was 24% in the states that passed local amendments in the past versus 6% in other 29 states. This historical difference in the marriage pattern of the southern and northern states might be a consequence of their different tradi- tional attitudes to female inheritance. Therefore, there might be concerns regarding the endogeneity of the amendments in the five states. For three reasons, here I focus on other states and the 2005 amendment only. First, endogene- ity of the amendments in the five states is possible, as discussed above. Second, under the past amendments in the five states, only unmarried women were eligible to inherit. Therefore, timing of marriages might have responded to the amendments. With the anticipation of the amendments, fam- ilies who did not want their daughters to be eligible married them just before the law passed. This is not a concern for the 2005 amendment under which both married and unmarried women were eligi- ble for inheritance. Third, past amendments not only were different from the 2005 amendment, but also might have been different from each other. For example, the amendment of the state of Kerala removed the legal status of the joint family altogether. I also drop some observations of respondents with unknown religions or who do not identify as Hindu (including Sikhs, Jains, and Buddhists), Muslim, or Christian. To estimate the impact of the 2005 amendment—which substantially improved female inheritance— on two outcome variables, I exploit a difference in differences approach using data collected from adult women (15–49 years old) by National Family Health Survey of India (repeated cross-sectional surveys conducted in four rounds between 1992 and 2016). The first outcome variable is cousin mar- riage, which takes value 1 if a woman’s husband in her first marriage is a blood relative (first cousin, second cousin, or other blood relatives such as uncle), and takes value 0 otherwise. The second outcome variable is premarital sex, which takes value 1 if a married woman had her first sexual intercourse before her first union with her first husband, and takes value 0 if she had her first sexual intercourse at or after her first union or if she has never had sex with her first husband.43 The first difference I use is the religion of the respondent. The amendment should have had an impact on Hindu women (Hindus, Sikhs, Jains, or Buddhists) but not women from the exempted

43Contrary to the cousin marriage variable, the sample for the premarital sex variable includes only those women who are currently married. This is because the age at first union was not asked of other groups, such as those who are widowed, divorced, or separated. I have created the premarital sex variable using information on age at first union and age at first sexual intercourse. The latter includes a category labelled as "at first union". In addition, whenever age at first union and age at first sexual intercourse are the same, because of strong norms against premarital sex, I considered it as sex at first union with husband. However, it is possible for age at first union and age at first sexual intercourse to be the same, but for first sexual intercourse to have occurred before the marriage.

34 religions (Muslims, Christians, Parsis, or Jews). The second difference I use is exposure to the amendment, as measured by the year of the first marriage. The decision to marry a relative could have been affected by the amendment only if the marriage took place in or after 2005.44 Therefore, the marriage decision of those Hindu women who married before 2005 could not have been affected by the amendment. Similarly, the premarital sex attitude of those Hindu women who married before 2005 could not have be affected by the amendment because by definition premarital sex takes place before the first marriage. I assume that an outcome y for woman i of religion r married for the first time in year t in state s is a function with the following form:

yirts = α + β1Tirt + β2γr + β3δt + β4θs + β5(θs × δt) + β6(θs × γr) + β7Xirts + eirts

Tirt captures a woman’s treatment status and takes value 1 if she is a Hindu (i.e., Hindu, Sikh,

Jain, or Buddhist) and married for the first time in or after 2005. The coefficient of interest is β1, which identifies the effect of exposure to the amendment. Religion dummy (γr) takes value 1 if a woman is a Hindu, and takes value 0 if she is a Muslim, Christian, Parsi, or Jew. It captures time- invariant characteristics of Hindus. Marriage year fixed effects (δt) control for time-series changes of the outcome variable. State fixed effects (θs) control for time-invariant characteristics of the states.

State-marriage year fixed effects (θs × δt) control for state-specific changes over time. Religion-state

fixed effects (θs × γr) control for time-invariant characteristics of Hindus across the states. Finally, a vector of observable characteristics (Xirts) controls for respondent’s education (four categories); wealth quintile; dummy variables for whether she lives in an urban region, whether she is a member of a scheduled , and whether she is a member of a scheduled tribe; and finally fixed effects for rounds in which respondents are surveyed. Table 2.8 presents regression results for cousin marriage and premarital sex with a subset or all of the controls in the full specification. Being a Hindu is negatively associated with cousin marriage, since Muslims—which constitute 59% of the dropped category—have much higher cousin marriage rates (with 17.1% versus 4.7% among Hindus). Among Hindu women, exposure to the amendment significantly increased the likelihood of marrying with blood relative and decreased the likelihood of having premarital sex. The coefficients are large compared with the coefficients of individual- level controls and considering the mean of dependent variables for the Hindu sample. Table A.2.26 of Appendix A.2 presents the descriptive statistics and description of variables in the regressions. In Table A.2.27 of Appendix A.2, I run two sets of robustness regressions. First, instead of primary sampling units, I cluster the standard errors at the state level. Second, in addition to year and state-year fixed effects based on first marriage year, I also control for those based on birth year. I have not included birth year fixed effects in the baseline regressions because later I use them to create

44Here, I assume that before the amendment, the respondent (and her family) did not know that she would become a joint heir in the future. Later, I will address the potential endogeneity concern.

35 Cousin marriage Premarital sex (Hindu sample mean=0.047, SD=0.212) (Hindu sample mean= 0.090, SD=0.286) VARIABLES (1) (2) (3) (4) (5) (6)

Subject to amendment 0.013*** 0.012*** 0.012*** -0.035*** -0.012*** -0.012*** (0.003) (0.004) (0.004) (0.003) (0.003) (0.003) Hindu -0.119*** -0.099*** -0.099*** 0.003 0.002 0.004 (0.003) (0.009) (0.009) (0.002) (0.007) (0.007)

Caste 0.000 -0.000 (0.001) (0.001) Tribe 0.001 -0.000 (0.002) (0.002) Education -0.001 -0.005*** (0.000) (0.001) Wealth 0.001 -0.003*** (0.000) (0.001) Urban 0.000 -0.007*** (0.001) (0.002) Survey round FE yes yes

Marriage year FE yes yes yes yes yes yes State FE yes yes yes yes yes yes State × Marriage year FE yes yes yes yes State × Hindu FE yes yes yes yes

Observations 461,746 461,746 461,746 419,478 419,478 419,478 R-squared 0.043 0.056 0.056 0.061 0.070 0.071 Number of clusters 22751 22751 22751 25131 25131 25131

Table 2.8: OLS estimates are reported with robust standard errors, clustered at the level of the primary sampling unit, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels. instrumental variables. The results are robust to these specifications. In Table A.2.28 of Appendix A.2, I run two sets of regressions by changing the sample. First, I restrict the sample to only those five states that locally passed similar amendments in the past. The results indicate no significant effect on Hindu women married after 2005 in the five states. Second, I run the regressions with the full sample from all Indian states. The coefficients remain highly significant and in the predicted directions. The identifying assumption of the difference in differences approach is the identical counterfac- tual trends in treatment and control groups. Here, I provide some robustness and falsification tests on this account. In the first column of Table 2.9, I include an additional variable called "near-treated Hindus", which takes value 1 for Hindu women married for the first time within the two-year span (2004 and 2003) just before the amendment. Its coefficient is insignificant in the regressions on cousin marriage and premarital sex, meaning that there is no evidence that the trend in outcome variables for Hindu women had diverged from non-Hindu women even before the amendment. Next, I conduct a placebo test in time. This involves re-estimating the difference in differences model over the pre-treatment period, but with the assumption that the treatment took effect at an earlier date. Column 2 and 3 show the regression results assuming that the amendment took place in 2003 and 2002 respectively. The difference in difference estimator is statistically insignificant implying zero placebo effect. In the next regressions, I conduct a placebo test in space using un- treated groups. In column 4, I falsely pretend that Christian women have been treated, although they

36 Hindus before Amendment Amendment Christians Muslims the amendment in 2003 in 2002 treated treated VARIABLES (1) (2) (3) (4) (5)

Panel A: Cousin marriage Subject to amend. 0.012*** 0.003 0.003 0.006 -0.006 (0.004) (0.007) (0.006) (0.009) (0.009) Near-treated Hindus 0.004 (0.007) Hindu -0.099*** -0.109*** -0.109*** (0.009) (0.010) (0.010) Christian -0.029 (0.032) Muslim 0.029 (0.032)

Observations 461,746 302,565 302,565 80,768 80,768 R-squared 0.056 0.060 0.060 0.103 0.103 Number of clusters 22751 22684 22684 9584 9584

Panel A: Premarital sex Subject to amend. -0.012*** -0.002 -0.002 0.004 -0.004 (0.003) (0.006) (0.005) (0.011) (0.011) Near-treated Hindus -0.002 (0.006) Hindu 0.004 0.004 0.004 (0.007) (0.007) (0.007) Christian 0.011 (0.031) Muslim -0.011 (0.031)

Observations 419,478 266,923 266,923 71,669 71,669 R-squared 0.071 0.070 0.070 0.092 0.092 Number of clusters 25131 24996 24996 10381 10381

Marriage year FE yes yes yes yes yes State FE yes yes yes yes yes State × Marriage year FE yes yes yes yes yes State × Hindu FE yes yes yes State × Christian FE yes State × Muslim FE yes Individual-level controls yes yes yes yes yes

Table 2.9: OLS estimates are reported with robust standard errors, clustered at the level of the primary sampling unit, in parentheses. Individual-level controls include caste and tribe dummy, education, wealth, and urban dummy variables, and survey round fixed effects. ***, **, and * indicate significance at the 1, 5, and 10% levels. have not been treated, and I use Muslims as their control group. In column 5, I use Muslims and Christians as treatment and control groups respectively. Again, the results show zero placebo effect. Unlike the past amendments in the five states, under the 2005 amendment both unmarried and married women were eligible for inheritance. Therefore, endogeneity of the year of marriage is less of a concern for the 2005 amendment. As column 1 of Table 2.9 shows, there is no evidence that variables of interest were affected by the anticipation of the 2005 amendment. However, to address any such concerns and to show the robustness of the results, here I follow two strategies. First, in a similar approach to [141], I use an instrumental variable approach where woman’s treatment status

(Tirt)—which is a function of religion-year of marriage cells—is instrumented by fixed effects for

37 each religion-year of birth cell, i.e. γr × λτ where γr is the Hindu dummy and λτ is year of birth fixed effects. Therefore, in this approach, exogenous variation in year of birth is used as a measure of exposure to the treatment. The second approach is similar to [77], [206], and [141]. I compare younger cohorts likely to be subject to the amendment with older cohorts who were likely to have been married by the time the amendment was passed and thus were probably not affected by the amendment in their marriage and premarital sex decisions. I define the treatment group to be Hindu women aged 14 or younger (the 10th percentile of the first marriage age distribution for females) in 2005—when the amendment was passed—and the control group to be women of all religion aged 24 or older (the 90th percentile of the distribution) in 2005. Defined based on the described age cohorts, 97.5% of the treatment group were actually exposed to the amendment, and 98% of the control group were not exposed to the amendment. Table A.2.29 of Appendix A.2 reports the results from the instrumental variable and age cohort comparison approaches. The direction and significance of the coefficients of the exposure to the amendment remain unchanged.

2.6 Conclusion

Theories on how inheritance systems shape family relationships, marriage patterns, and the status of women have been advanced over a century by social scientists. Following this literature, and stand- ing on the shoulders of giants such as Adam Smith, Lewis Morgan, , and Jack Goody, I developed two hypotheses with important implications for today’s cultural differences across soci- eties. Due to deep-rooted differences in their geography, subsistence economy, and agricultural and political organization, patrilineal societies ended up with different prevailing inheritance systems. I argue that patrilineal societies that traditionally included women in inheritance have developed prac- tices encouraging inmarriage and controlling women’s sexuality in order to preserve the property within the patrilineage, prevent its fragmentation, and limit conflicting claims on the estate. The findings of the study show that ethnicities, regions, and women whose ancestors practiced female inheritance have experienced higher cousin marriage, endogamy, and arranged marriages, and lower female economic participation and sexual freedom. I also find that women who receive inheritance are more likely to be in arranged and endogamous marriages. To show that these corre- lations can at least in part be attributed to the causal impact of female inheritance, I use a difference in differences approach to estimate the impact of a reform of inheritance regulations in India that substantially improved women’s rights on land. Consistent with the causal direction stated by the hypotheses, the results indicate that women exposed to the reform had higher cousin marriage and lower premarital sex rates. These findings have several important implications. First, female inheritance imposed by Sharia law might be a major historical factor explaining why today’s Islamic countries experience the highest cousin marriage rates and the lowest female participation rates in the world, and sustain their tribal social organization.

38 Second, there is growing evidence on how tight kinship systems impair the development of an individualistic social psychology; undermine social trust, large-scale cooperation, and democratic institutions; and encourage corruption and conflict.45 Cousin and arranged marriages, as means of creating and maintaining kin-based groups such as and tribes [121, 187], are considered as important elements of kinship patterns. Therefore, I also add to the literature by highlighting how inheritance systems contribute to the heterogeneity of kinship patterns across societies. For example, it has been suggested that the Catholic Church’s prohibitions on cousin marriages and promotion of consensual (or love) marriages played an important role in dismantling tribes and clans in Europe and stimulating its divergent development [115, 143, 82, 160, 121, 187, 231]. It is true that cousin and arranged marriage rates have been historically low in Christian countries. However, these low cousin and arranged marriage rates are also consistent with the fact that by putting no emphasis on female inheritance, European inheritance systems did not create economic incentives for cousin or arranged marriages, and provided an incentive-compatible institutional environment for the Church’s marriage policies. Finally, under patrilineal restrictions on women’s empowerment in developing countries, the mere enacting of female inheritance might create unintended consequences for young women’s sexual lives and marital choices.

45See, e.g., [266], [85], [14], [122], [7], [230], [84], [190], [231], [193]

39 Bibliography

[1] Klaus Abbink and Heike Hennig-Schmidt. Neutral versus loaded instructions in a bribery experiment. Experimental Economics, 9(2):103–121, 2006.

[2] Klaus Abbink, Bernd Irlenbusch, and Elke Renner. An experimental bribery game. Journal of Law, Economics, and Organization, 18(2):428–454, 2002.

[3] Mary Abbott. Family ties: English families 1540-1920. Routledge, 2013.

[4] Bina Agarwal. Gender and legal rights in agricultural land in india. Economic and Political Weekly, pages A39–A56, 1995.

[5] Leila Ahmed. Women and gender in Islam: Historical roots of a modern debate. Yale University Press, 1992.

[6] Anna Aizer. The gender wage gap and domestic violence. American Economic Review, 100(4):1847–59, 2010.

[7] Mahsa Akbari, Duman Bahrami-Rad, and Erik O Kimbrough. Kinship, fractionalization and corruption. SSRN Working Paper, 2847222, 2016.

[8] Seyed Mohammad Akrami and Zahra Osati. Is consanguineous marriage religiously encour- aged? islamic and iranian considerations. Journal of Biosocial Science, 39(02):313–316, 2007.

[9] Vivi Alatas, Lisa Cameron, Ananish Chaudhuri, Nisvan Erkal, and Lata Gangadharan. Sub- ject pool effects in a corruption experiment: A comparison of indonesian public servants and indonesian students. Experimental Economics, 12(1):113–132, 2009.

[10] Alberto Alesina, Yann Algan, Pierre Cahuc, and Paola Giuliano. Family values and the regulation of labor. Journal of the European Economic Association, 2015.

[11] Alberto Alesina, Benedetta Brioschi, and Eliana La Ferrara. Violence against women: A cross-cultural analysis for africa. NBER Working Paper, 21901, 2016.

[12] Alberto Alesina, Arnaud Devleeschauwer, William Easterly, Sergio Kurlat, and Romain Wacziarg. Fractionalization. Journal of Economic growth, 8(2):155–194, 2003.

[13] Alberto Alesina and Paola Giuliano. The power of the family. Journal of Economic growth, 15(2):93–125, 2010.

[14] Alberto Alesina and Paola Giuliano. Family ties and political participation. Journal of the European Economic Association, 9(5):817–839, 2011.

40 [15] Alberto Alesina and Paola Giuliano. Family ties. In Philippe Aghion and Steven N. Durlauf, editors, Handbook of Economic Growth, volume 2A, pages 177–215. North Holland: Nether- lands, 2014.

[16] Alberto Alesina, Paola Giuliano, and Nathan Nunn. On the origins of gender roles: Women and the plough. The Quarterly journal of economics, 128(2):469–530, 2013.

[17] Alberto Alesina and Eliana La Ferrara. Ethnic diversity and economic performance. Journal of Economic Literature, 43:762–800, 2005.

[18] Lee J Alston and Morton Owen Schapiro. Inheritance laws across colonies: Causes and consequences. The Journal of , 44(02):277–287, 1984.

[19] Sofia Amaral. Do improved property rights decrease violence against women in india? SSRN Working Paper, 2504579, 2017.

[20] AALL American Association of Law Libraries and Arturo A Flores. Foreign law: Current sources of codes and basic legislation in jurisdictions of the world. Littleton, Colo.: FB Rothman, 1989.

[21] Dan Anderberg, Helmut Rainer, Jonathan Wadsworth, and Tanya Wilson. Unemployment and domestic violence: Theory and evidence. The Economic Journal, 126(597):1947–1979, 2016.

[22] Siwan Anderson. Legal origins and female hiv. American Economic Review, forthcoming, 2018.

[23] Siwan Anderson and Garance Genicot. Suicide and property rights in india. Journal of Development Economics, 114:64–78, 2015.

[24] Kenneth RM Anthony. Agricultural change in tropical Africa. Cornell University Press, 1978.

[25] Nava Ashraf, Natalie Bau, Nathan Nunn, and Alessandra Voena. Bride price and female education. NBER Working Paper, 22417, 2018.

[26] Quamrul Ashraf and Oded Galor. The’out of africa’hypothesis, human , and comparative economic development. The American Economic Review, 103(1):1, 2013.

[27] St. Augustine. The City of God Against the Pagans (trans: R. W. Dyson). Cambridge Univer- sity Press, 1998.

[28] Drew H Bailey, Kim R Hill, and Robert S Walker. Fitness consequences of spousal related- ness in 46 small-scale societies. Biology letters, 10(5):20140160, 2014.

[29] Edward C Banfield. The moral basis of a backward society. New York: Glencol, 3:4, 1958.

[30] Abigail Barr and Danila Serra. The effects of externalities and framing on bribery in a petty corruption experiment. Experimental Economics, 12(4):488–503, 2009.

[31] Abigail Barr and Danila Serra. Corruption and culture: An experimental analysis. Journal of Public Economics, 94(11):862–869, 2010.

41 [32] David B Barrett. World Christian encyclopedia: a comparative study of churches and reli- gions in the modern world, AD 1900-2000. Oxford University Press, USA, 1982.

[33] Herbert Barry. Ethnographic atlas xxviii. Ethnology, 19(2):245–263, 1980.

[34] Anke Becker. Herding and male . SSRN Working Paper, 3069684, 2017.

[35] Sascha O Becker and Ludger Woessmann. Luther and the girls: Religious denomination and the female education gap in nineteenth-century prussia. The Scandinavian Journal of Economics, 110(4):777–805, 2008.

[36] Jens Beckert. Inherited wealth. Press, 2008.

[37] Jeanet Sinding Bentzen, Nicolai Kaarsen, and Asger Moll Wingender. Irrigation and autoc- racy. European Economic Association. Journal, 2015.

[38] Joyce Berg, John Dickhaut, and Kevin McCabe. Trust, reciprocity, and social history. Games and economic behavior, 10(1):122–142, 1995.

[39] Theodore C Bergstrom. The algebra of assortative encounters and the evolution of coopera- tion. International Game Theory Review, 5(03):211–228, 2003.

[40] Lutz K Berkner. The stem family and the developmental cycle of the peasant household: An eighteenth-century austrian example. The American Historical Review, 77(2):398–418, 1972.

[41] Chris Bidner and Mukesh Eswaran. A gender-based theory of the origin of the caste system of india. Journal of Development Economics, 114:142–158, 2015.

[42] Alan Bittles and M. H. Black. Global patterns and tables of consanguinity. 2015.

[43] Alan H Bittles. Consanguinity in context, volume 63. Cambridge University Press, 2012.

[44] Keith Blackburn, Kyriakos Neanidis, Maria Paola Rana, et al. Organized crime, corruption and growth: Theory and evidence. Technical report, Economics, The Univeristy of Manch- ester, 2015.

[45] , Alexander Kazankov, , and . Ethno- graphic atlas xxxi: Peoples of easternmost europe. Ethnology, pages 261–289, 2005.

[46] Robert Borofsky. Making history: Pukapukan and anthropological constructions of knowl- edge. Cambridge University Press, 1987.

[47] Samuel Bowles and Dorrit Posel. Genetic relatedness predicts south african migrant workers’ remittances to their families. Nature, 434(7031):380–383, 2005.

[48] S.I. Bruk and V.S. Apenchenko, editors. Atlas Narodov. Moscow: Miklukho-Maklai Eth- nological Institute at the Department of Geodesy and Cartography of the State Geological Committee of the Soviet Union, 1964.

[49] Ulf Brunnbauer. Descent or territoriality: inheritance and family forms in the late ottoman and early post-ottoman . Distinct Inheritances. Property, family and Community in a Changing Europe, pages 181–205, 2003.

42 [50] Paolo Buonanno and Paolo Vanin. Social closure, surnames and crime. Quaderni-Working Paper DSE, 2015.

[51] Lisa Cameron, Ananish Chaudhuri, Nisvan Erkal, and Lata Gangadharan. Propensities to en- gage in and punish corrupt behavior: Experimental evidence from australia, india, indonesia and singapore. Journal of Public Economics, 93(7):843–851, 2009.

[52] Pamela Campa, Michel Serafinelli, et al. Politico-economic regimes and attitudes: Female workers under state-socialism. 2016.

[53] J Edgardo Campos, Donald Lien, and Sanjay Pradhan. The impact of corruption on invest- ment: Predictability matters. World development, 27(6):1059–1067, 1999.

[54] Eliana Carranza. Soil endowments, female labor force participation, and the demographic deficit of women in india. American Economic Journal: Applied Economics, 6(4):197–225, 2014.

[55] Marco Casari and Maurizio Lisciandra. Gender in property rights: six cen- turies of commons governance in the alps. The Journal of Economic History, 76(02):559– 594, 2016.

[56] Luigi Luca Cavalli-Sforza, Paolo Menozzi, and Alberto Piazza. The history and geography of human . Princeton university press, 1994.

[57] Luigi Luca Cavalli-Sforza, Antonio Moroni, and Gianna Zei. Consanguinity, , and in Italy, volume 39. Princeton University Press, 2004.

[58] Nicholas Charron, Lewis Dijkstra, and Victor Lapuente. Regional governance matters: qual- ity of government within european union member states. Regional Studies, 48(1):68–90, 2014.

[59] Nicholas A Christakis and James H Fowler. Friendship and natural selection. Proceedings of the National Academy of Sciences, 111(Supplement 3):10796–10801, 2014.

[60] CY Cyrus Chu. Primogeniture. Journal of Political Economy, 99(1):78–99, 1991.

[61] Swee-Hoon Chuah, Reema Fahoum, and Robert Hoffmann. Fractionalization and trust in india: A field-experiment. Economics Letters, 119(2):191–194, 2013.

[62] EUA CIA. The world factbook (published online). Central Intelligence Agency, Washington, DC, 1996.

[63] Nevill Colclough. Invisible patrimonies–capitoli matrimoniali in ancien régime ascoli. Dis- tinct inheritances: property, family and community in a changing Europe, pages 145–158, 2003.

[64] John W Cole and Eric R Wolf. The hidden frontier: ecology and ethnicity in an alpine valley. Univ of California Press, 1999.

[65] Robert K Colwell. Predictability, constancy, and contingency of periodic phenomena. Ecol- ogy, 55(5):1148–1153, 1974.

43 [66] Timothy G Conley. Gmm estimation with cross sectional dependence. Journal of economet- rics, 92(1):1–45, 1999.

[67] Youssef Courbage and Emmanuel Todd. A convergence of : The transformation of Muslim societies around the world. Columbia University Press, 2014.

[68] Donald Cox and Marcel Fafchamps. Extended family and kinship networks: economic insights and evolutionary directions. Handbook of development economics, 4:3711–3784, 2007.

[69] Klaus Deininger, Aparajita Goyal, and Hari Nagarajan. Women’s inheritance rights and inter- generational transmission of resources in india. Journal of Human Resources, 48(1):114–141, 2013.

[70] Alfredo Del Monte and Erasmo Papagni. Public expenditure, corruption, and economic growth: the case of italy. European Journal of Political Economy, 17(1):1–16, 2001.

[71] Alfredo Del Monte and Erasmo Papagni. The determinants of corruption in italy: Regional panel data analysis. European Journal of Political Economy, 23(2):379–396, 2007.

[72] Alfredo Del Monte and Erasmo Papagni. The determinants of corruption in italy: Regional panel data analysis. European Journal of Political Economy, 23(2):379–396, 2007.

[73] Adriana Di Liberto and Marco Sideri. Past dominations, current institutions and the italian regional economic performance. European Journal of Political Economy, 38:12–41, 2015.

[74] Oguzhan C Dincer. Ethnic and religious diversity and corruption. Economics Letters, 99(1):98–102, 2008.

[75] Matthias Doepke and Michele Tertilt. Does female empowerment promote economic devel- opment? Technical report, 2016.

[76] Gertrude E Dole. The lineage pattern of kinship nomenclature: Its significance and develop- ment. Southwestern Journal of , pages 36–62, 1965.

[77] Esther Duflo. Schooling and labor market consequences of school construction in indonesia: Evidence from an unusual policy experiment. American economic review, 91(4):795–813, 2001.

[78] Gilles Duranton, Andrés Rodríguez-Pose, and Richard Sandall. Family types and the persis- tence of regional disparities in europe. Economic Geography, 85(1):23–47, 2009.

[79] William Easterly and Ross Levine. Africa’s growth tragedy: policies and ethnic divisions. The Quarterly Journal of Economics, pages 1203–1250, 1997.

[80] Lena Edlund. Cousin marriage is not choice: Muslim marriage and underdevelopment. AEA Papers and Proceedings, 108:353–57, 2018.

[81] Charles Efferson, Rafael Lalive, and Ernst Fehr. The coevolution of cultural groups and ingroup favoritism. Science, 321(5897):1844–1849, 2008.

[82] Robert B Ekelund, Robert D Tollison, Gary M Anderson, FH Robert, Audrey B Davidson, et al. trust: the medieval church as an economic firm. Oxford University Press, 1996.

44 [83] Nasr G Elbahnasawy and Charles F Revier. The determinants of corruption: Cross-country- panel-data analysis. The Developing Economies, 50(4):311–333, 2012.

[84] Benjamin Enke. Kinship systems, cooperation and the evolution of culture. NBER Working Paper, 23499, 2017.

[85] John Ermisch and Diego Gambetta. Do strong family ties inhibit trust? Journal of Economic Behavior & Organization, 75(3):365–376, 2010.

[86] Ernst Fehr and Urs Fischbacher. The nature of human altruism. Nature, 425(6960):785–791, 2003.

[87] Carl H. Fellner and John R. Marshall. Kidney donors revisited. In Eds. J. Philippe Rushton and Richard M. Sorrentino, editors, Altruism and Helping Behavior: Social, Personality and Developmental Perspectives. Lawrence Erlbaum Associates: New Jersey, 1981.

[88] Nadia Fiorino, Emma Galli, and Ilaria Petrarca. Corruption and growth: Evidence from the italian regions. European Journal of Government and Economics, 1(2):126–144, 2012.

[89] Raymond Fisman and Edward Miguel. Corruption, norms, and legal enforcement: Evidence from diplomatic parking tickets. Journal of Political economy, 115(6):1020–1048, 2007.

[90] James Foreman-Peck and Peng Zhou. Late marriage as a contributor to the industrial revolu- tion in . The Economic History Review, 2017.

[91] Robin Fox. Kinship and Marriage. Penguin Books: , UK, 1967.

[92] Patrick Francois and Jan Zabojnik. Trust, social capital and economic development. Journal of the European Economic Association, 3(1):51–94, 2005.

[93] Feng Fu, Corina E Tarnita, Nicholas A Christakis, Long Wang, David G Rand, and Martin A Nowak. Evolution of in-group favoritism. Scientific Reports, 2(460), 2012.

[94] Francis Fukuyama. Trust: The Social Virtues and the Creation of Prosperity. New York: Free Press, 1995.

[95] Donna R Gabaccia and Franca Iacovetta. Women, gender and transnational lives: Italian workers of the world. University of Toronto Press, 2002.

[96] Simon Gächter and Jonathan F. Schulz. Intrinsic honesty and the prevalence of rule violations across societies. Nature, 531(7595):496–499, 03 2016.

[97] Vincenzo Galasso and Paola Profeta. When the state mirrors the family: the design of pension systems. CEPR Discussion Paper No. DP8723, 2011.

[98] John Luke Gallup, Jeffrey D Sachs, and Andrew D Mellinger. Geography and economic development. International Regional Science Review, 22(2):179–232, 1999.

[99] A. Gardner and A.W. Stuart. Spite. Current Biology, 16(17):R662–R664, 2006.

[100] Thomas R. Garth. Kinship terminology, marriage practices and behavior toward kin among the atsugewi. , 46(3):348–361, 1944.

[101] JH Gillespie. Population genetics: a concise guide. Johns Hopkins University Press, 2004.

45 [102] Paola Giuliano. The role of women in society: from preindustrial to modern times. CESifo Economic Studies, 61(1):33–52, 2015.

[103] Paola Giuliano. Gender: A historical perspective. NBER Working Paper, 23635, 2017.

[104] Paola Giuliano and Nathan Nunn. Ancestral characteristics of modern populations, 2013.

[105] Edward L Glaeser and Raven E Saks. Corruption in america. Journal of Public Economics, 90(6):1053–1072, 2006.

[106] Bernt Glatzer. The pashtun tribal system. Concept of tribal society, pages 265–282, 2002.

[107] Uri Gneezy, Kenneth L Leonard, and John A List. Gender differences in competition: Evi- dence from a matrilineal and a patriarchal society. Econometrica, 77(5):1637–1664, 2009.

[108] Miriam A Golden and Lucio Picci. Proposal for a new measure of corruption, illustrated with italian data. Economics & Politics, 17(1):37–75, 2005.

[109] Walter Goldschmidt and Evalyn Jacobson Kunkel. The structure of the peasant family. Amer- ican Anthropologist, 73(5):1058–1076, 1971.

[110] Melvyn C Goldstein. When brothers share a wife. Natural History, 96(3):109–112, 1987.

[111] Binglin Gong and Chun-Lei Yang. Gender differences in risk attitudes: Field experiments on the matrilineal and the patriarchal yi. Journal of economic behavior & organization, 83(1):59–65, 2012.

[112] Jack Goody. Inheritance, property, and marriage in africa and eurasia. , 3(1):55–76, 1969.

[113] Jack Goody. Production and reproduction: A comparative study of the domestic domain. Cambridge Studies in , 17, 1976.

[114] Jack Goody. Production and reproduction: a comparative study of the domestic domain. Number 17. Cambridge University Press, 1976.

[115] Jack Goody. The development of the family and marriage in Europe. Cambridge University Press, 1983.

[116] Jack R Goody, Joan Thirsk, and Edward Palmer Thompson. Family and inheritance. Past and Present Publications (USA), 1976.

[117] Philip Gounev and Tihomir Bezlov. Examining the links between organised crime and cor- ruption. Center for the Study of , 2010.

[118] Jesse Graham, Jonathan Haidt, and Brian Nosek. The Moral Foundations Quiz. www.yourmorals.org, 2008.

[119] J Patrick Gray. Ethnographic atlas codebook. World Cultures, 10(1):86–136, 1998.

[120] J Patrick Gray. A corrected ethnographic atlas. World Cultures, 10(1):24–85, 1999.

[121] Avner Greif. Family structure, institutions, and growth: the origins and implications of west- ern corporations. The American Economic Review, 96(2):308–312, 2006.

46 [122] Avner Greif and Guido Tabellini. The clan and the corporation: Sustaining cooperation in china and europe. SSRN Working Paper, 2565120, 2015.

[123] Avner Greif and Guido Tabellini. The clan and the corporation: Sustaining cooperation in china and europe. Available at SSRN 2565120, 2015.

[124] Veronika Grimm and Friederike Mengel. Cooperation in viscous populations—experimental evidence. Games and Economic Behavior, 66(1):202–220, 2009.

[125] Pauline Grosjean and Rose Khattar. It’s raining men! hallelujah? SSRN Working Paper, 2445285, 2015.

[126] Eleonora Guarnieri and Helmut Rainer. Female empowerment and male backlash. CESifo Working Paper, 7009, 2018.

[127] Luigi Guiso, Paola Sapienza, and Luigi Zingales. People’s opium? religion and economic attitudes. Journal of monetary economics, 50(1):225–282, 2003.

[128] EV Gunnemark. Countries, peoples, and their languages: The geolinguistic handbook. sum- institute of linguistics. Inc., Dallas, Texas, 1991.

[129] W. D. Hamilton. The genetical evolution of social behaviour. i. Journal of Theoretical Biology, 7(1):1–16, 1964.

[130] William D Hamilton. The genetical evolution of social behaviour. ii. Journal of Theoretical Biology, 7(1):17–52, 1964.

[131] William D Hamilton. Innate social aptitudes of man: an approach from evolutionary genetics. Biosocial anthropology, 133:155, 1975.

[132] Ross A Hammond and Robert Axelrod. The evolution of . Journal of Conflict Resolution, 50(6):926–936, 2006.

[133] Casper Worm Hansen, Peter Sandholt Jensen, and Christian Volmar Skovsgaard. Modern gender roles and agricultural history: the neolithic inheritance. Journal of Economic Growth, 20(4):365–404, 2015.

[134] Mariaflavia Harari. WomenâA˘ Zs´ inheritance rights and bargaining power: Evidence from kenya. Department of Economics of MIT, pages 1–47, 2014.

[135] Henry Harpending and Alan Rogers. Fitness in stratified societies. Ethology and sociobiol- ogy, 11(6):497–509, 1990.

[136] Henry C Harpending. Kinship and population subdivision. Population and Environment, 24(2):141–147, 2002.

[137] Stevan Harrell. Human families. Westview Press, 1997.

[138] IPDJ Harris, PD Jones, TJ Osborn, and DH Lister. Updated high-resolution grids of monthly climatic observations–the cru ts3. 10 dataset. International Journal of Climatol- ogy, 34(3):623–642, 2014.

47 [139] hbdchick. Big summary post on the hajnal line. https://hbdchick.wordpress.com/2014/03/10/big-summary-post-on-the-hajnal-line/, March 2014.

[140] Patrick Heady and Hannes Grandits, editors. Distinct inheritances: property, family and community in a changing Europe. LIT Verlag Münster, 2003.

[141] Heath and Xu Tan. Intrahousehold bargaining, female autonomy, and labor supply: Theory and evidence from india. , 2016.

[142] Joseph Henrich. The secret of our success: how culture is driving human evolution, domesti- cating our species, and making us smarter. Princeton University Press, 2015.

[143] David Herlihy. Medieval households. Harvard University Press, 1985.

[144] Benedikt Herrmann, Christian Thöni, and Simon Gächter. Antisocial punishment across societies. Science (New York, N.Y.), 319(5868):1362–1367, 2008.

[145] Ashley D. Hoben, Abraham P. Buunk, Corey L. Fincher, Randy Thornhill, and Mark Schaller. On the adaptive origins and maladaptive consequences of human inbreeding: Para- site prevalence, immune functioning, and consanguineous marriage. Evolutionary Psychol- ogy, 8(4):658 – 676, 2010.

[146] Ladislav Holy.` Kinship, honour, and solidarity: cousin marriage in the Middle East. Manch- ester University Press, 1989.

[147] Martha C Howell. Commerce before capitalism in Europe, 1300-1600. Cambridge University Press, 2010.

[148] David Hume. A Treatise of Human Nature. Oxford: Oxford University Press, (2003) edition, 1740.

[149] John Kenneth Hyde. Society and politics in medieval Italy: the evolution of the civil life, 1000-1350. Not Avail, 1973.

[150] Shabnam Ishaque. Islamic principles on adoption: examining the impact of illegitimacy and inheritance related concerns in context of a child’s right to an identity. International Journal of Law, Policy and the Family, 22(3):393–420, 2008.

[151] Lutfi Jaber, Gabrielle J Halpern, and Mordechai Shohat. The impact of consanguinity world- wide. Community Genetics, 1(1):12–17, 1998.

[152] Tarun Jain. Where there is a will fertility behavior and sex bias in large families. Journal of Human Resources, 49(2):393–423, 2014.

[153] Allen W Johnson and Timothy K Earle. The evolution of human societies: from foraging group to agrarian state. Stanford University Press, 2000.

[154] Noel D Johnson and Alexandra Mislin. How much should we trust the world values survey trust question? Economics Letters, 116(2):210–212, 2012.

[155] Doug Jones. Socially enforced nepotism: How norms and reputation can amplify kin altru- ism. PLoS ONE, 11(6):1–13, 06 2016.

48 [156] Karl Kaser. Power and inheritance: Male domination, property, and family in eastern europe, 1500–1900. Distinct Inheritances. Property, family and Community in a Changing Europe, pages 53–63, 2003. [157] David I Kertzer. The family in Italy from antiquity to the present. Yale University Press, 1993. [158] Kathryn R Kirby, Russell D Gray, Simon J Greenhill, Fiona M , Stephanie Gomes-Ng, Hans-Jörg Bibiko, Damián E Blasi, Carlos A Botero, Claire Bowern, Carol R Ember, et al. D-place: A global database of cultural, linguistic and environmental diversity. PLoS One, 11(7):e0158391, 2016. [159] Andrey Korotayev. Parallel-cousin (fbd) marriage, islamization, and . Ethnology, pages 395–407, 2000. [160] Andrey Korotayev, Alexander Kazankov, Svetlana Borinskaya, Daria Khaltourina, and Dmitri Bondarenko. Ethnographic atlas xxx: peoples of siberia. Ethnology, 43(1):83–92, 2004. [161] Andrey V Korotayev. Unilineal descent organization and deep christianization: A cross- cultural comparison. Cross-Cultural Research, 37(1):133–157, 2003. [162] Maurice Kugler, Thierry Verdier, and Yves Zenou. Organized crime, corruption and punish- ment. Journal of Public Economics, 89(9):1639–1663, 2005. [163] Timur Kuran. The long divergence: How Islamic law held back the Middle East. Princeton University Press, 2012. [164] Rafael La Porta, Florencio Lopez-de Silanes, Andrei Shleifer, and Robert Vishny. The quality of government. Journal of Law, Economics, and organization, 15(1):222–279, 1999. [165] Rafael La Porta, Andrei Shleifer, and RW Vishny. Law and finance. Journal of Political Economy, 106:1113–1155, 1998. [166] Johann Graf Lambsdorff. How corruption affects persistent capital flows. Economics of Governance, 4(3):229–243, 2003. [167] Johann Graf Lambsdorff. Behavioural and experimental economics as a guidance to anticor- ruption. In Danila Serra and Leonard Wantchekon, editors, New advances in experimental research on corruption, volume 15, pages 279–300. Emerald Group Publishing, 2012. [168] Iosif Lazaridis, Dani Nadel, Gary Rollefson, Deborah C. Merrett, Nadin Rohland, Swa- pan Mallick, Daniel Fernandes, Mario Novak, Beatriz Gamarra, Kendra Sirak, Sarah Con- nell, Kristin Stewardson, Eadaoin Harney, Qiaomei Fu, Gloria Gonzalez-Fortes, Eppie R. Jones, Songül Alpaslan Roodenberg, György Lengyel, Fanny Bocquentin, Boris Gasparian, Janet M. Monge, Michael Gregg, Vered Eshed, Ahuva-Sivan Mizrahi, Christopher Meik- lejohn, Fokke Gerritsen, Luminita Bejenaru, Matthias Blüher, Archie Campbell, Gianpiero Cavalleri, David Comas, Philippe Froguel, Edmund Gilbert, Shona M. Kerr, Peter Kovacs, Johannes Krause, Darren McGettigan, Michael Merrigan, D. Andrew Merriwether, Seamus O’Reilly, Martin B. Richards, Ornella Semino, Michel Shamoon-Pour, Gheorghe Stefanescu, Michael Stumvoll, Anke Tönjes, Antonio Torroni, James F. Wilson, Loic Yengo, Nelli A. Hovhannisyan, Nick Patterson, Ron Pinhasi, and David Reich. Genomic insights into the origin of farming in the ancient . Nature, 536(7617):419–424, 08 2016.

49 [169] FM LeBar and GN Appell. Ethnic groups of insular southeast asia. new haven. Human Relations Area Files Press, 1972.

[170] David Levine and Michael Kevane. Are investments in daughters lower when daughters move away? evidence from indonesia. World Development, 31(6):1065–1084, 2003.

[171] Alexandra Lewis. Security, Clans and Tribes: Unstable Governance in Somaliland, and the Gulf of Aden. Palgrave Macmillan: London, 2014.

[172] M. Pau Lewis, Gary F. Simons, and Charles D. Fennig. Ethnologue: Languages of the World, Nineteenth edition. Dallas, Texas: SIL International, 2016.

[173] Seymour Martin Lipset and Gabriel Salman Lenz. Corruption, culture, and markets. George Mason University, 1999.

[174] Sara Lowes. Kinship systems, gender norms, and household bargaining: Evidence from the matrilineal belt. Working Paper, Harvard mimeo, 2016.

[175] Katherine A Lynch. Individuals, families, and communities in Europe, 1200-1800: the urban foundations of Western society, volume 37. Cambridge University Press, 2003.

[176] Ramzi Mabsout and Irene Van Staveren. Disentangling bargaining power from individual and household level to institutions: Evidence on womenâA˘ Zs´ position in . World Development, 38(5):783–796, 2010.

[177] Elainie A Madsen, Richard J Tunney, George Fieldman, Henry C Plotkin, Robin IM Dun- bar, Jean-Marie Richardson, and David McFarland. Kinship and altruism: A cross-cultural experimental study. British Journal of Psychology, 98(2):339–359, 2007.

[178] Aparna Mathur and Sita Slavov. Escaping domestic violence: Empowering women through employment, earnings and wealth in india. AEI Economic Policy Working Paper Series, 2013.

[179] Paolo Mauro. Corruption and growth. The Quarterly Journal of Economics, pages 681–712, 1995.

[180] Paolo Mauro. Corruption and the composition of government expenditure. Journal of Public economics, 69(2):263–279, 1998.

[181] Hans Medick. The proto-industrial family economy: The structural function of household and family during the transition from peasant society to industrial capitalism. Social History, 1(3):291–315, 1976.

[182] Hans Medick and David Warren Sabean. Interest and emotion: essays on the study of family and kinship. Cambridge University Press, 1984.

[183] Stelios Michalopoulos. The origins of ethnolinguistic diversity. The American Economic Review, 102(4):1508, 2012.

[184] Stelios Michalopoulos, Alireza Naghavi, and Giovanni Prarolo. Islam, inequality and pre- industrial comparative development. Journal of Development Economics, 120:86–98, 2016.

[185] Stelios Michalopoulos, Alireza Naghavi, and Giovanni Prarolo. Trade and geography in the spread of islam. The Economic Journal, 2017.

50 [186] Michael Mitterauer. European kinship systems and household structures: medieval origins. Distinct inheritances. Property, family and community in a changing Europe, pages 35–52, 2003.

[187] Michael Mitterauer. Why Europe?: The Medieval Origins of Its Special Path. University of Chicago Press, 2010.

[188] Anna Momigliano. The town in italy where most women are barred from inheriting land. The Atlantic, 2016.

[189] Lewis Henry Morgan. Systems of consanguinity and affinity of the human family, volume 218. Smithsonian institution, 1871.

[190] Jacob Moscona, Nathan Nunn, and James A Robinson. Keeping it in the family: Lineage organization and the scope of trust in sub-saharan africa. American Economic Review Papers and Proceedings, 107(5):565–71, 2017.

[191] Jacob Moscona, Nathan Nunn, and James A Robinson. Keeping it in the family: Lineage organization and the scope of trust in sub-saharan africa. Technical report, National Bureau of Economic Research, 2017.

[192] Jacob Moscona, Nathan Nunn, and James A Robinson. Social structure and conflict: Evi- dence from sub-saharan africa. 2017.

[193] Jacob Moscona, Nathan Nunn, and James A Robinson. Social structure and conflict: Evi- dence from sub-saharan africa. NBER Working Paper, 24209, 2018.

[194] Siegfried Hermann Muller. The World’s Living Languages: Basic Facts of Their Structure, Kinship, Location, and Number of Speakers. Ungar, 1964.

[195] George Peter Murdock. World ethnographic sample. American Anthropologist, 59(4):664– 687, 1957.

[196] George Peter Murdock. Ethnographic atlas, installments i-xxvii. Ethnology, pages 1–10, 1962–1971.

[197] George Peter Murdock. Ethnographic atlas: a summary. Ethnology, 6(2):109–236, 1967.

[198] Chie Nakane et al. Kinship and economic organization in rural japan. Kinship and economic organization in rural Japan., 1967.

[199] Mark New, David Lister, Mike Hulme, and Ian Makin. A high-resolution data set of surface climate over global land areas. Climate Research, 21(1):1–25, 2002.

[200] Tin Nguyen, Enjiang Cheng, and Christopher Findlay. Land fragmentation and farm produc- tivity in china in the 1990s. China Economic Review, 7(2):169–180, 1996.

[201] William D Nordhaus. Geography and : New data and new findings. Pro- ceedings of the National Academy of Sciences of the United States of America, 103(10):3510– 3517, 2006.

[202] Martin A Nowak, Corina E Tarnita, and Edward O Wilson. The evolution of eusociality. Nature, 466(7310):1057–1062, 2010.

51 [203] Nathan Nunn, Emmanuel Akyeampong, Robert Bates, and James A Robinson. Gender and missionary influence in colonial africa. African Development in Historical Perspective, 2011.

[204] Nathan Nunn and Diego Puga. Ruggedness: The blessing of bad geography in africa. Review of Economics and Statistics, 94(1):20–36, 2012.

[205] Samir Okasha. Biological altruism. In Edward N. Zalta, editor, The Stanford Encyclopedia of Philosophy. Standford University Press, fall 2013 edition, 2013.

[206] Una Okonkwo Osili and Bridget Terry Long. Does female schooling reduce fertility? evi- dence from . Journal of development Economics, 87(1):57–75, 2008.

[207] Elinor Ostrom and Charlotte Hess. Private and common property rights. Encyclopedia of law and economics, 2:53–106, 2000.

[208] Trevor J Pemberton, Michael DeGiorgio, and Noah A Rosenberg. Population structure in a comprehensive genomic data set on human microsatellite variation. G3: Genes| Genomes| Genetics, pages g3–113, 2013.

[209] Trevor J Pemberton and Noah A. Rosenberg. Population-genetic influences on genomic estiamtes of the inbreeding coefficient: A global perspective. Human Heredity, 77(1-4):37– 48, 2014.

[210] Francis Pine. Reproducing the house: kinship, inheritance, and property relations in highland poland. Distinct Inheritances: Property, Family and Community in a Changing Europe, pages 279–295, 2003.

[211] Leendert Plas. Rural gender studies in Europe. Van Gorcum, 1994.

[212] Jean-Philippe Platteau. Institutions, social norms, and economic development, volume 1. Psychology Press, 2000.

[213] Jean-Philippe Platteau and Jean-Marie Baland. Access to land, rural poverty, and public action (Impartible Inheritance Versus Equal Division: a Comparative Perspective Centered on Europe and Subsaharan Africa). Oxford University Press, 2001.

[214] Robert D Putnam, Robert Leonardi, and Raffaella Y Nanetti. Making democracy work: Civic traditions in modern Italy. Princeton university press, 1994.

[215] Nancy Qian. Missing women and the price of tea in china: The effect of sex-specific earnings on sex imbalance. The Quarterly Journal of Economics, 123(3):1251–1285, 2008.

[216] Sohini Ramachandran, Omkar Deshpande, Charles C. Roseman, Noah A. Rosenberg, Mar- cus W. Feldman, and L. Luca Cavalli-Sforza. Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in africa. Proceedings of the National Academy of Sciences of the United States of America, 102(44):15942–15947, 2005.

[217] Navin Ramankutty, Jonathan A Foley, John Norman, and Kevin McSweeney. The global distribution of cultivable lands: current patterns and sensitivity to possible climate change. Global Ecology and biogeography, 11(5):377–392, 2002.

52 [218] Navin Ramankutty, Jonathan A Foley, John Norman, and Kevin McSweeney. The global distribution of cultivable lands: current patterns and sensitivity to possible climate change. Global Ecology and Biogeography, 11(5):377–392, 2002.

[219] Auke Rijpma and Sarah G Carmichael. Testing todd and matching murdock: Global data on historical family characteristics. Economic History of Developing Regions, 31(1):10–46, 2016.

[220] M Fernanda Rivas. An experiment on corruption and gender. Bulletin of Economic Research, 65(1):10–42, 2013.

[221] Janet Roberts. Sociocultural change and communication problems. Study of the role of second languages in Asia, Africa, and Latin America, pages 105–23, 1962.

[222] Alan R Rogers. Evolutionary economics of human reproduction. Ethology and Sociobiology, 11(6):479–495, 1990.

[223] Daniel Rosenblum. Unintended consequences of womenâA˘ Zs´ inheritance rights on female mortality in india. Economic Development and Cultural Change, 63(2):223–248, 2014.

[224] Sanchari Roy. Empowering women? inheritance rights, female education and dowry pay- ments in india. Journal of Development Economics, 114:233–251, 2015.

[225] David W Sabean. Young bees in an empty hive: relations between brothers-in-law in a south german village around 1800. Interest and Emotion: Essays on the Study of Family and Kin- ship. Cambridge: Cambridge University Press. p, pages 171–186, 1984.

[226] . Cousin marriage conundrum. In Tim Folger and Steven Pinker, editors, The Best American Science and Nature Writing 2004. Mariner Books, 2004.

[227] Assaf Sarid, Oded Galor, and Omer Ozak. Geographical origins and economic consequences of language structures. CESifo Working Paper, 6149, 2017.

[228] Walter Scheidel. Sex and Empire. New York: Oxford University Press, 2009.

[229] William J Schull. Empirical risks in consanguineous marriages: sex ratio, malformation, and viability. American journal of human genetics, 10(3):294, 1958.

[230] Jonathan F Schulz. The churches’ bans on consanguineous marriages, kin-networks and democracy. SSRN Working Paper, 2877828, 2016.

[231] Jonathan F Schulz, Duman Bahrami-Rad, Jonathan Beauchamp, and Joseph Henrich. The origins of weird psychology. SSRN Working Paper, 3201031, 2018.

[232] Martine Segalen. Historical anthropology of the family. Cambridge University Press, 1986.

[233] Danila Serra. Empirical determinants of corruption: A sensitivity analysis. Public Choice, 126(1-2):225–256, 2006.

[234] Mary K Shenk, Mary C Towner, Emily A Voss, and Nurul Alam. Consanguineous marriage, kinship ecology, and market transition. Current Anthropology, 57(S13):S167–S180, 2016.

[235] Tom Siegfried. A Beautiful Math:: John Nash, Game Theory, and the Modern Quest for a Code of Nature. National Academies Press, 2006.

53 [236] Adam Smith. An Inquiry into the Nature and Causes of the Wealth of Nations. Vol. II of The Glasgow Edition of the Works and Correspondence of Adam Smith, 1776.

[237] RM Smith. Land, kinship and life-cycle. Cambridge (UK) Cambridge Univ. Press, 1984.

[238] Enrico Spolaore and Romain Wacziarg. The diffusion of development. The Quarterly Journal of Economics, 124(2):469–529, 2009.

[239] Mysore Narasimhachar Srinivas. The remembered village. Univ of California Press, 1980.

[240] Guido Tabellini. Culture and institutions: Economic development in the regions of europe. Journal of the European Economic Association, 8(4):677–716, 2010.

[241] Guido Tabellini et al. The scope of cooperation: Values and incentives. The Quarterly Journal of Economics, 123(3):905–950, 2008.

[242] Shuhao Tan, Nico Heerink, and Futian Qu. Land fragmentation and its driving forces in china. Land use policy, 23(3):272–285, 2006.

[243] Vito Tanzi and Hamid Reza Davoodi. Roads to nowhere: how corruption in public investment hurts growth, volume 12. International Monetary Fund, 1998.

[244] Edoardo Teso. The long-term effect of demographic shocks on the evolution of gender roles: Evidence from the transatlantic slave trade. SSRN Working Paper, 2927221, 2016.

[245] Joan Thirsk. The european debate on customs of inheritance, 1500-1700. Family and inher- itance, pages 177–191, 1976.

[246] Todne Thomas, Asiya Malik, and Rose Wellman. New Directions in Spiritual Kinship: Sacred Ties Across the Abrahamic Religions. Springer, 2017.

[247] Emmanuel Todd. The explanation of ideology: Family structures and social systems. Black- well Oxford, 1985.

[248] Emmanuel Todd. L’invention de l’Europe. Seuil, 1990.

[249] Daniel Treisman. The causes of corruption: a cross-national study. Journal of public eco- nomics, 76(3):399–457, 2000.

[250] Ana Tur-Prats. Family types and intimate-partner violence: A historical perspective. Eco- nomics Working Papers 1486, Universitat Pompeu Fabra, 2015.

[251] Edward B Tylor. On a method of investigating the development of institutions; applied to laws of marriage and descent. The Journal of the Anthropological Institute of Great Britain and Ireland, 18:245–272, 1889.

[252] UN. Demographic Yearbook. United Nations: New York, 1995.

[253] Eric M Uslaner. Trust and corruption. In Johann Graf Lambsdorff, Markus Taube, and Matthias Schramm, editors, The New Institutional Economics of Corruption, pages 76–93. Routledge, 2005.

[254] Robert S Walker and Drew H Bailey. Marrying kin in small-scale societies. American Journal of Human Biology, 26(3):384–388, 2014.

54 [255] Guang H Wan and Enjiang Cheng. Effects of land fragmentation and returns to scale in the chinese farming sector. Applied Economics, 33(2):183–194, 2001.

[256] Max Weber. The protestant ethic and the spirit of capitalism, trans. Talcott Parsons (New York: Charles Scribner’s Sons, 1958), 182, 1958.

[257] Max Weber. Economy and society: an outline of interpretive sociology. Vol. 1. University of Calif. Press, 1978.

[258] Karl August Wittfogel. Oriental despotism: A comparative study of total power. Yale Uni- versity Press, 1959.

[259] Michael A. Woodley and Edward Bell. Consanguinity as a major predictor of levels of democracy: A study of 70 nations. Journal of Cross-Cultural Psychology, 44(2):263–280, 2013.

[260] C Geoffrey Woods, James Cox, Kelly Springell, Daniel J Hampshire, Moin D Mohamed, Martin McKibbin, Rowena Stern, F Lucy Raymond, Richard Sandford, Saghira Malik Sharif, et al. Quantification of homozygosity in consanguineous individuals with autosomal reces- sive disease. The American Journal of Human Genetics, 78(5):889–896, 2006.

[261] 1978 World Bank. World development report 1978. Oxford University Press, New York, 1978.

[262] WB World Bank. Helping countries combat corruption: the role of the world bank, 1997.

[263] Worldmark. Worldmark encyclopedia of the nations. Gale Cengage, 1995.

[264] Melanie Meng Xue. High-value work and the rise of women: The cotton revolution and gender equality in china. SSRN Working Paper, 2389218, 2016.

[265] Toshio Yamagishi, Karen S Cook, and Motoki Watabe. Uncertainty, trust, and com- mitment formation in the united states and japan 1. American Journal of Sociology, 104(1):AJSv104p165–194, 1998.

[266] Toshio Yamagishi and Midori Yamagishi. Trust and commitment in the united states and japan. Motivation and emotion, 18(2):129–166, 1994.

55 Chapter 3

Kinship, Fractionalization and Corruption

(with Mahsa Akbari and Erik Kimbrough)

“Now it appears, that in the original frame of our mind, the strongest attention is confin’d to ourselves; our next is extended to our relations and acquaintances; and ’tis only the weakest which reaches to strangers and indifferent persons. This partiality, then, and unequal affection, must not only have an influence on our behaviour and conduct in society, but even on our ideas of vice and virtue. . . " [David 148, A Treatise of Human Nature, Section 3.2.2.]

3.1 Introduction

Norms of solidarity, fidelity and self-sacrifice in favor of kin, tribe and clan have often been praised as virtues, but these virtues may become vices when they conflict with the abstract rules and formal institutions of the modern political and economic system. In particular, favoring kin at the expense of others may lead to corruption, disrupting or subverting impartial institutions and hampering eco- nomic development. In this paper, we provide evidence that a history of consanguineous mating practices generates fractionalization between local, sub-ethnic groups, increases incentives for local favoritism, and thus encourages corruption. Previous studies of corruption and its effects on growth have explored the idea that ethnic het- erogeneity (and concomitant fractionalization) may cause corruption when individuals favor mem- bers of their own . Mauro’s (1995) influential study used ethnic fractionalization as an instrumental variable for corruption. Since then, several studies have investigated whether ethnic heterogeneity causes corruption, with mixed results. In support, [79] found that ethnic fractionaliza- tion is positively correlated with corruption; [164], [249] and [12] also found that fractionalization has a reduced-form relationship with corruption but reported non-robust results when controlling for other variables such as per capita income. However, [233] and [83] found no significant effect of fractionalization on corruption. In addition to the cross-country studies, [105] and [74] found a

56 significant relationship between ethnic heterogeneity and corruption across US states. These con- tradictory results have encouraged skepticism [see e.g. 61]. A typical regression model from the cross-country empirical studies is as follows:

C = α + βEF + γX + u

where C is a corruption index, EF is an ethnic fractionalization index1 and X is a set of indepen- dent variables. The richest specifications in [164], and [12] include legal origins, religion, latitude, per capita income, country size and regional dummies. However, while the motivation for such analysis is intuitively appealing, there is no obvious theoretical justification for the model or for a causal effect of ethnic fractionalization on corruption. Why should individuals favor members of their ethnic group? We provide a framework, rooted in biological theory, that explicitly connects ethnicity, kinship and corruption. While the theory may be unfamiliar to economists, “the biologically based approach shares strengths in common with the best of economic theory; it is parsimonious, counter-intuitive, and falsifiable" [68, p. 3759]. The basic intuition comes from the biological notions of inclusive fit- ness and kin selection, which imply that genetic relatives, because they share genes with an interest in propagating themselves to the next generation, will sometimes be willing to incur costs to help one another [see e.g. 129, 130]. A model of kin selection implies that increases in relatedness can encourage corruption and favoritism. Since the relatedness of two randomly chosen co-ethnics is quite low, shared ethnicity per se is insufficient to foster corruption, helping explain the ambiguous findings noted above. Key to our argument is that different marriage practices will increase or decrease relatedness at a local level, thereby directly altering the incentives for corruption and favoritism. For instance, the offspring of a consanguineous marriage (e.g. marriage between two first cousins) share more genes with their parents, siblings and cousins than the offspring of a marriage between two unrelated individuals. As we illustrate in a simple model below (see Section 3.2), from the point of view of biological theory, all else equal, the former family has stronger incentives for favoritism and corruption at the expense of non-relatives than the latter. While the intuition is rooted in evolutionary theory, our argument is not that different groups have different genes and hence different behaviors.2 Instead, we assume that all humans possess

1As a measure of heterogeneity within countries, empirical studies have used fractionalization indices from two sources: (I) the ethnolinguistic fractionalization index (often referred to as ELF) from the Atlas Narodov Mira com- piled from sources in the former Soviet Union [48], and (II) the ethnic, linguistic, and religious fractionalization indices provided by [12]. Both sources define fractionalization as the probability that two randomly drawn individuals from a country’s population belong to two different groups.

2In biology, models of kin selection are typically used to explain the evolution of altruism or cooperation, by showing that “genes for" such behaviors may provide a selective advantage to their carriers. While it is possible, in principle, that persistent differences in marriage patterns between populations could eventually lead to genetic differences between those populations (increasing the relative frequency of “genes for" kin altruism due to differential selection pressures; see [131] for a theoretical treatment, and see hbdchick (2014) for a provocative argument that this might apply to human

57 the capacity for kin altruism as well as for cooperation with non-kin, and that societies differ in the degree to which operant social norms favor one or the other mechanism of cooperation. Since humans are social creatures reliant on cultural norms of cooperation and information-sharing for survival, selection pressures also operate on these norms, so that a society’s norms adapt to local conditions [142]. Our focus is on the effects of in- and out-marriage practices on norms related to corruption. Our view is that the incentives produced by different marriage practices have influenced the evo- lution of social norms. Endogamous and consanguineous marriage (in-marriage) directly increase local relatedness, increasing the relative returns to local favoritism and corruption; while exogamous marriage practices (out-marriage) reduce local relatedness, increasing the relative returns to impar- tiality. Social norms are constructed on top of this lattice of biological relatedness, and thus changes in relatedness due to marriage practices can directly influence the evolution of norms.3 Moreover, on top of these direct effects via the mechanism of relatedness, there are also intuitive indirect mech- anisms by which marriage practices can influence the relative returns to norms of favoritism and corruption vis-a-vis norms of impartial cooperation. In particular, different marriage patterns also typically result in different patterns of social in- teraction, reflecting complementarities between marriage practices and other social structures. Out- marriage encourages (indeed, requires) interaction with non-kin and strangers. As we discuss below, historical religious bans on consanguineous marriage were an important cause of migration, espe- cially in agricultural societies where one male child inherited the family’s land. Siblings without property, especially females, had to migrate to find eligible marriage partners [57], and we argue that this exchange of people across distances encouraged the development of norms of impartial cooperation with non-kin. On the other hand, in-marriage is associated with extensive interaction among local (and more closely related) in-group members (e.g. kin, clan, tribe), and relatively less interaction with strangers. Seen in this light, in-marriage practices generate another kind of fractionalization at a level of gran- ularity finer than the ethnic or linguistic group. We refer to this as sub-ethnic fractionalization, and we argue that patterns of increased local relatedness and concomitant intense local interaction di- populations), in practice, genetic change (or genetic difference) is not necessary to explain how marriage practices can alter incentives for kin altruism (and consequent corruption).

3Many groups extend altruistic norms to affines, friends and other less-related individuals by adopting cultural prac- tices that create fictive kinship, and it has been argued that these practices piggy-back on, or hijack, the evolved mecha- nisms for kin cooperation and extend them to non-kin [see 142, for a summary]. present another example of how social norms can harness biological mechanisms: “Incest taboos are social norms that evolved culturally to regulate sex and pair-bonding between non-close relatives by harnessing innate intuitions and emotional reactions that originally arose via genetic evolution to suppress sexual interest among close relatives, especially siblings. By harnessing innate in- cest aversion and labeling distant relatives as ‘brothers’ and ‘sisters,’ cultural evolution seized a powerful lever to control human behavior, since incest taboos can strongly influence mating and marriage, and kin-based altruism can be extended through social norms. If you control mating and marriage, you get a grip on much of the larger social structure, and even aspects of people’s cognition and motivation" [142, p. 153]. See also [155] on between social norms and kin altruism.

58 minished the impetus to develop norms of impartial cooperation, instead favoring the development of norms of local favoritism (which may manifest as corruption).4 To see how sub-ethnic fractionalization can help account for some puzzling observations on the relationship between ethnic (and linguistic) fractionalization and corruption, consider Table 3.1 derived from [12] which lists a few countries with low ethnic and linguistic heterogeneity, but rel- atively high levels of corruption.5 Although Yemen, Tunisia, Saudi Arabia and Bangladesh are rel- atively homogeneous in terms of ethnicity and language, they are highly fractionalized due to the presence of and competition between other close-knit kin-based and local groups such as extended families, tribes, clans, and religious groups [see e.g. 171, on clan structures in the Gulf of Aden]. We argue that distinctive family structures and mating patterns generate sub-ethnic fractionalization (e.g. the preference for in-marriage in many African and Asian countries) and can help account for corruption in many societies, even those that are ethnically homogeneous.

Fractionalization index 0-5% 5-15% 15-35% 35-55% 55-75% Yemen Saudi Arabia ...... Tunisia Bangladesh Corruption index 0-20% 20-40% 40-60% 60-80% 80-100% ...... Saudi Arabia Bangladesh Yemen Tunisia

Table 3.1: Misspecification due to omission of sub-ethnic fractionalization.

A second puzzle for the view that heterogeneity per se causes corruption can be seen in Table 3.2 [also derived from 12], which lists a number of countries, all of which are highly ethnically and linguistically fractionalized:

Fractionalization index 0-5% 5-15% 15-35% 35-55% 55-75% ...... Switzerland Canada ...... Belgium Luxembourg Iraq Iran Uzbekistan Corruption index 0-20% 20-40% 40-60% 60-80% 80-100% Canada Belgium Iran Iraq Luxembourg Pakistan Uzbekistan Switzerland

Table 3.2: Misspecification due to conflation of ethnic and sub-ethnic fractionalization.

Although, e.g., both Canada and Pakistan are ethno-linguistically heterogeneous, Canada has effective, impartial institutions; while Pakistan is quite corrupt. As above, the countries differ in the importance of sub-ethnic groups such as extended family, tribe and clan to social and political

4[247] make a similar point: while exogamous marriage in Europe served as a model for impersonal bureaucratic relations by creating links between unrelated individuals, endogamy impedes the creation of impersonal relationships by cutting “horizontally across the vertical edifice of the state, undermining the system and producing what in conventional administrative terms is called corruption" (p. 146).

5The fractionalization index in the table is the simple average of ethnic fractionalization and linguistic fractionalization from [12], and the range of the fractionalization index in the table is the same as [179]’s ethnolinguistic fractionalization table. For the corruption index in the table, we used the 2014 Corruption Perception Index provided by Transparency International (http://www.transparency.org/research/cpi/overview).

59 life. Pashtuns, one of the largest ethnic groups in and Pakistan “are said to having [sic] developed the world’s largest tribal society, . . . [with] sub-tribes, clans and sub-clans down to the local lineages and families" [106, p. 3]. Similar arguments contrast Switzerland, Luxembourg, and Belgium on one hand to Iran, Iraq, and Uzbekistan on the other.6 Viewed through the lens of marriage practice-driven sub-ethnic fractionalization, these observations can be understood. To make our argument about the incentives produced by in-marriage more precise, we introduce a stylized bribery model where a private agent offers a bribe to an official, and if the official accepts the bribe and makes a corrupt effort, a negative externality is imposed on third parties called citizens. This is a well-known model of corruption used in a number of laboratory studies.7 We employ a utility function that embeds the implications of inclusive fitness to illustrate how increases in relatedness (as would be produced by in-marriage) may encourage corruption. In a one-shot game, the subgame perfect equilibrium of the bribery game for payoff maximizing agents involves neither bribery nor corruption (nor the associated negative externalities), but when the private agent and the official are sufficiently related to one another, both bribery and corruption can be supported in equilibrium. The model thus shows how relatedness can influence the returns to corruption, and we use that intuition to motivate an empirical analysis of the relationship between in-marriage practices and corruption. Our empirical analysis combines data from population genetics, corruption, and comparative development studies to test the hypothesis that marriage practice-driven sub-ethnic fractionalization causes corruption using both cross-country and within-country regression analysis. As a measure of sub-ethnic fractionalization, we collect data on national and regional (within-Italy) rates of con- sanguineous (cousin) marriage. We find that consanguinity rates have a substantial and positive association with corruption, both across countries and within Italy, even after controlling for other “deep" determinants of comparative development. To complement our regression analyses and provide a test of the proposed mechanism, we also design a cross-cultural lab experiment comparing the bribery and corruption behavior of strangers, co-ethnics and kin in Canada and Iran, exploiting the fact that the two countries are both ethnically (and linguistically) fractionalized by standard measures but vary substantially in their degree of sub- ethnic fractionalization, due to cultural differences in family structure. 849 students from different ethnic origins in Canada and Iran participated in a bribery game, in which the first mover chooses whether to offer a bribe and the second mover chooses to accept or reject it. If he accepts the bribe, the second mover also decides whether to make a corrupt effort to benefit the first mover, thereby imposing a negative externality on a passive third player. Subjects play the three-player bribery game with one unrelated person and one co-ethnic (or sibling, in the Kin treatment). Three possible assignments of roles to two co-ethnics (kin) create three treatments through which we explore the effect of co-ethnicity (kinship) on corrupt acts.

6See also [226] on clans, corruption and state-building in Iraq in light of cousin marriage practices.

7E.g. see [2, 51, 9, 31, 220].

60 Our design allows us to compare the frequency of bribery and corruption in treatments with in-group members (kin or co-ethnics) as first and second movers to treatments with one in-group member as the first or second mover and the other as the passive third party. We can also test for dif- ferences in the strength of norms favoring each kind of in-group within-country and for differences across countries that reflect differences in norms (plausibly related to differences in sub-ethnic frac- tionalization). We find evidence of favoritism in both countries, but among co-ethnics the pattern is more pronounced in Iran. Robustness checks include treatments in which the first and second- movers are close friends and treatments varying the incentives; increased favoritism among friends in Iran compared to Canada provides further support for cross-country normative differences in the degree of in-group favoritism. Overall, our findings suggests that marriage practice-driven sub-ethnic fractionalization is an alternative but important channel through which history can explain variation in present day insti- tutional quality. Two further considerations suggest that using variation in consanguinity to study the effects of sub-ethnic fractionalization is a reasonable approach. First, consanguinity has a direct impact on local relatedness (and thus, the relative returns to norms of local favoritism) since the offspring of a consanguineous marriage will be more closely related to their kin than the offspring of a randomly mating pair. Second, although some of the variation in consanguineous marriage rates has been traced to variation in historical modes of subsistence, inheritance rules, geographical constraints, parasite threats, and etc., [see e.g. 115, 57, 145, 254], two other factors loom large as deep-rooted causes that explain a large portion of the observed variation: Christianization and the Arab conquests in the early centuries of Islam. The Catholic Church has restricted cousin marriage since 500AD, at times extending a ban as far as sixth cousins. However, the Church prohibited not only consanguineous marriages to blood relatives, but also to affinal kin, (e.g. a dead brother’s widow), to spiritual kin (e.g. godchildren) and to fictional kin (e.g. adoptees), “producing a vast range of people, often resident in the same locality, that were forbidden to marry" [115, p. 56]. Consanguineous marriages “had historically provided one means of creating and maintaining kinship groups–such as clans, lineages, and tribes" [121, p. 309]. The Church’s restrictions made in-marriage virtually impossible, fundamentally al- tering social organization by slowly dissolving clan, lineage, and tribe boundaries in Europe, and thereby created pressures favoring the development of impartial norms, large-scale cooperation, and eventually modern institutions (see [161] and [121] for further discussion and references). Going the other direction, a preference for (a particular kind of) cousin marriage seems to have spread with the Arab conquests at the dawn of Islam, despite cousin marriage being neither explicitly encouraged nor prohibited by the religion [159]. Indeed, Italy is a uniquely interesting case in that by the 20th century (when we observe consanguinity rates) it was a nearly 100% Catholic country, but some southern parts of Italy, where cousin marriage is most common in our data, were also part of the for more than a century from roughly 850AD to 1000AD. Therefore, consanguinity rates today also partly reflect historical military and cultural conquests which carried with them

61 different attitudes towards family and marriage patterns, and in our view, the impact of this variation in marriage practices on patterns of relatedness has helped shape the extent of corruption and the quality of institutions today. While we cannot allay all endogeneity concerns with our regression analyses, we also report instrumental variables estimates to provide some evidence that the relationship we identify may be causal. In particular, for our Italian data, we construct an instrument that captures province-level variation in exposure to the Catholic Church’s consanguinity bans: the number of years with an active Catholic archdiocese in each province. This variation has two sources: variation in the initial expansion of the Catholic Church’s dominion and suppression of archdioceses in some provinces in Southern Italy during centuries of Arab domination of the region. The IV estimates provide evidence consistent with our reduced form analysis, suggesting that in-marriage practices are an important potential cause of variation in corruption.8 Our argument is related to the literature that distinguishes between generalized and limited morality (or amoral familism) [29, 212, 241]. Limited morality is the extreme reliance on a narrow circle of family, friends or relatives; outside this circle, harming and cheating are allowed and frequent. In this narrow circle, people are raised to trust in-group members only. They are also taught to distrust people outside the circle, which hampers cooperation and exchange with strangers and outsiders, and as a result, impedes the development of formal institutions. Generalized morality is characterized by respect for abstract individuals and their rights, generalized trust and loyalty to general rules, which facilitates large-scale cooperation. The underlying mechanism that determines whether a society adopts limited morality has been attributed to strong versus weak family ties [85, 14, 15], collectivism versus individualism [266, 265], and clan versus corporation [123, 121]. We also contribute to this literature highlighting another mechanism: in-marriage practices raise the relative returns to limited morality and therefore encourage corruption. Finally, our work is also related to a growing literature on the political and economic conse- quences of historical variation in social and familial organization [e.g. 259, 230, 191, 192, 84]. [191, 192] provide evidence that other aspects of kinship and social organization help account for levels of inter-group conflict in Sub-Saharan Africa; in particular, they show that groups organized around segmentary lineages are prone to localized conflicts. The most closely related papers by [259] and [230] document a negative relationship between rates of consanguineous marriage and levels of democracy. Also related is a recent paper by [84], who develops a measure of historical “kinship tightness" along three dimensions: family structure (domestic organization, post-wedding residence), marriage patterns (cousin marriage, ), and descent systems (lineages, seg- mented communities and localized clans). He reports a negative correlation between his measure of kinship tightness and cooperation with and trust of outsiders using a variety of datasets. Our pa- per contributes to this literature by highlighting an important channel by which marriage practices

8In Appendix B.5.2 we also report an instrumental variables strategy for our cross-country analysis using an instru- ment based on differences in kin terminology, which also provides evidence consistent with our primary analyses.

62 and kinship structures directly alter the incentives to develop norms of impartiality, which underlie both the degree of corruption (our focus) and the extent of democracy.

3.2 Theory and hypotheses

Corruption can be defined as “abuse of public office for private gain" [262, p. 8]. Public office can be abused in hiring for governmental positions, manipulating government procurement or facilitat- ing/limiting access to basic goods or services in places like hospitals, schools, police departments, etc. Private gain is often realized through bribery, with gifts, money, or similar benefits offered in exchange for official actions. However, enforceable contracts for such exchanges are impossible because corruption is typically illegal. Therefore, bribery necessitates implicit contracts which rely on trust and cooperation.

3.2.1 A basic model of bribery Seen from a game-theoretic perspective, bribery is a social dilemma much like a trust game [see e.g. 38, 86] where (i) a sequential exchange takes place in the absence of enforceable contracts, (ii) both players are better off exchanging their goods or favors, and (iii) there is also a strong temptation to cheat, e.g. by accepting the bribe and failing to reciprocate. However, as noted by [2], the trust game lacks two essential components of bribery: the possibility of negative externalities and the risk of penalty. Figure 3.1a shows our bribery game inspired by [2]. Player 1 represents a private agent and player 2 represents a public official. Player 1 may offer a bribe (t) to player 2 in the hope that player 2 will misuse his office to benefit her (B). If player 1 offers a bribe, she also incurs a small cost (c) of initiating the relationship with the official. The private agent’s benefit from the official’s corrupt effort, B is high enough that B > t + c. The official, player 2, has the option of accepting t but making no effort or making a corrupt effort and incurring a cost (e). The effort cost is low enough that e < t. If the official chooses to make the corrupt effort, there is a small probability (e) of getting caught, where both private agent and official end up with zero payoffs. If the official is not caught, the negative externality of the official’s corrupt effort on citizens, who have no move in the game, is Xi, which is displayed below the payoff vectors whenever it occurs. Assuming ∑ Xi ≥ B, the game also captures another characteristic of corruption; it is inefficient.

63 Private Agent

No offer Offer t

Official   P1 P2 Reject Accept

Official   P1 − c P2 No effort Effort

Nature   P1 − c − t P2 + t 1 − e e

    P1 − c − t + B 0  P2 + t − e  0 -X ... -X

(a) All players unrelated.

Private Agent

No offer Offer t

Official   P1 P2 Reject Accept

Official   P1 − c P2 − rpoc No effort Effort

Nature   P1 − c − (1 − rpo)t P2 + (1 − rpo)t − rpoc 1 − e e

    P1 − c − (1 − rpo)t − rpoe + B 0 P2 + (1 − rpo)t − rpoc + rpo B − e 0 -X ... -X

(b) Private Agent and Official are relatives.

Figure 3.1: A bribery game between strangers and relatives, highlighting the implications of inclusive fitness.

In the unique subgame perfect equilibrium of the one-shot game, the official accepts the offer but makes no effort, and the private agent chooses not to offer a bribe to the official. However, from field observations and experimental studies, we know that “corruption exists, bribes are paid, and

64 favors are reciprocated" [167, p. 280]. One possible source of observed corruption is nepotism, and its antecedent biological or kin altruism.9

3.2.2 Inclusive fitness, kin altruism, and corruption Sequential social dilemmas such as the bribery game allow agents to engage in altruistic behavior: one party may incur a cost in order to provide a larger benefit to another (in this case at a cost to third parties). In general, costly, altruistic behaviors like self-sacrifice, non-reciprocal help and subordination of private interests for the good of the group are all commonly observed among kin. Costly altruism seems to contradict both models of individual self-interest and Darwinian natural se- lection, because behaving altruistically is disadvantageous for the altruist, by definition. Intuitively, individuals that incur costs in order to provide fitness benefits to others will have lower fitness than free-riders, and hence, prima facie, should have their numbers dwindle. However, [129]’s kin selec- tion theory provides a simple and empirically successful argument explaining how such altruistic behavior could evolve under natural selection. Hamilton solved the problem by focusing on selection at the level of the rather than the individual. We can imagine “a gene which causes its bearer to behave altruistically towards other organisms, e.g. by sharing food with them" [205]. We expect the altruist gene to be eliminated because it is disadvantageous for the altruist. But what if altruists share food only with those with whom they also share genes? Since there is a certain probability that the recipients of the food will also carry copies of that gene, the altruistic gene can in principle spread by natural selection. Thus altruistic behavior may increase the number of copies of the altruistic gene in the next generation, and thus “the incidence of the altruistic behavior itself" (Ibid.). Hamilton demonstrated that an altruistic gene will be favored by natural selection and will spread in the population when a certain condition, known as Hamilton’s rule, is satisfied. According to Hamilton’s rule, a donor provides an altruistic act if rB > C, where C is the cost of the altruistic act to the donor, B is the benefit of the act to the recipient, and r is the coefficient of relatedness between the donor and the recipient. This rule is based upon expected costs and benefits in terms of inclusive fitness which represents one’s own fitness10 plus the weighted sum of relatives’ fitness, where the weights are the coefficients of relatedness. Then from a gene’s eye view an individual benefits not only through personal reproduction, but also by helping the reproduction of others who share some of their genes [205, 68]. Therefore, if we assume that all people have some propensity toward kin altruism, all else equal, more closely related individuals have stronger incentives to behave altruistically towards one an- other. There is a lot of supporting evidence for this claim in other contexts: kinship patterns cor- relate with within-household violence, allocation of food, provision of childcare, and safeguards against infanticide, as well as migrant workers’ remittances to their families, willingness to murder political rivals and form stable alliances, taking sides in disputes, emotional and material support within social networks, cooperation under catastrophic circumstances, membership in cooperative labor units, organ donation rates, etc. [see e.g. 68, 177, 47, 87]. The relatedness of two individuals can be approximated by the coefficient of relatedness, that is, the expected fraction of identical by descent genes that are shared between two individuals in a

9Of course we do not claim it is the only source - repeat interaction, reciprocity and threats may also facilitate corrup- tion, even among non-kin. The purpose of the one-shot game described here is merely to provide a simple framework in which to highlight the role of kinship as another important potential causal factor.

10Fitness should be thought of as reproductive success, e.g. as the expected number of progeny.

65 randomly mating population. The value of the relatedness coefficient for identical twins is 1, for full siblings and fraternal twins 1/2, for parents and offspring 1/2, for grandparents and grandchildren 1/4, for first cousins 1/8, and so-on to a randomly chosen pair who have a relatedness coefficient of 0 [205].11 “J. B. S. Haldane once remarked, it would make sense to dive into a river to save two drowning siblings or eight drowning cousins" [235, p. 85]. See Appendix B.1 for further detail.

Bribery game with inclusive fitness Suppose the payoffs in the bribery game are in units of biological reproductive fitness.12 According to inclusive fitness theory, if players in the bribery game are related, their payoffs should include not only the fitness effects on themselves but also on the other parties involved. In particular the benefits and costs to others enter into in the players’ payoffs weighted by the coefficient r, of relatedness between them. Let rpo represent the relatedness of the private agent and the official. Also, let rpc = 0 be the sum of relatedness of the private agent to citizens, and let roc = 0 be the sum of relatedness of the official to citizens. Then, the payoffs to the bribery game are modified as shown in Figure 3.1b. In the bribery game with genetically related players, the subgame perfect equilibrium can be characterized as follows, by backward induction: (I) If accepting the offer, the official honors the trust of the private agent and makes a corrupt P +(1−r )t−r c effort on her behalf with a unique equilibrium strategy if: (1 − e) > 2 po po and he P2+(1−rpo)t−rpoc+rpo B−e accepts the offer if: (1 − e) > P2 . P2+(1−rpo)t−rpoc+rpo B−e (II) Assume that both aforementioned conditions hold so that the official accepts the offer and exerts the corrupt effort as his unique equilibrium strategy. The private agent foresees the optimal strategy of the official; therefore she places trust and offers t in a unique equilibrium strategy if: (1 − e) > P1 . P1−c−(1−rpo)t−rpoe+B Implication 1: All else equal, the official is more likely to accept a bribe and make a corrupt effort as rpo increases.

Implication 2: All else equal, the private agent is more likely to offer a bribe as rpo increases.

Note that while our example sets roc and rpc equal to 0, if we allow roc to vary, the official is less likely to accept a bribe and make a corrupt effort as roc increases, and similarly, if we allow rpc to vary, the private agent is less likely to offer a bribe as rpc increases. The analysis so far highlights the role of relatedness in shaping the incentives for corruption. In-group favoritism among co-ethnics was Mauro’s (1995) motivating example for using ethnic fractionalization as an instrumental variable for corruption: “bureaucrats may favor members of their same group"(Ibid., p.693). However, as we noted in the introduction, empirical estimates suggest

11In reality, even in a random mating population, a parent might share more than half of her genes with her offspring; “half those genes are surely identical because they came from the parent, while gene sharing with the other half of the child’s genome is just what is shared with any random member of the population." Hence, a more precise way to think of relatedness is to “think of gene sharing in excess of random gene sharing" [136, p. 142], and the coefficient of relatedness is more properly defined as r = (Q − Q)/(1 − Q), where Q is the relatedness of the two individuals, while Q is the average relatedness in the population [202, p. 1059].

12Of course, the analogy is imprecise in the sense that corruption is not transacted in units of fitness. However, in many cases, corruption influences the allocation of large quantities of resources (monetary and otherwise), which are correlated with reproductive success. In an extreme case, if a corrupt act results in one individual living to reproduce and another dying before reproduction, the effects are direct in fitness terms.

66 that relatedness among co-ethnics (relative to neighboring groups) is often not far above zero [56], which helps explain the weak association between ethnic fractionalization per se and corruption. Our focus is instead on sub-ethnic fractionalization, driven by marriage patterns (and associated variation in patterns of social interaction), which causes increased relatedness and thereby alters the returns to norms of favoritism and corruption.

3.2.3 Consanguineous marriage, sub-ethnic fractionalization, and corruption Underlying the coefficients of relatedness for kin reported above is a crucial assumption: random mating, “that mates are chosen with complete ignorance of their genotype (at the locus under con- sideration), degree of relationship, or geographic locality" [101, p. 13]. The key to our argument about the role of sub-ethnic fractionalization in corruption is that in many cases, the assumption of a randomly mating population is violated. When mating is non-random, so that members of some local groups are more likely to mate within the group than outside the group (e.g. consanguinity due to geography, culture, etc.), the expected relatedness of kin and group members is higher than under random mating.13 For example, two offspring of a first-cousin marriage have a relatedness higher than 1/2 (r = 1/2 + 1/2 × 1/8): with probability 1/2 they inherit a gene from the same parent at each locus, and with probability 1/2 they each inherent a gene from a different parent, in which case the prob- ability of gene sharing is just the relatedness between their parents, 1/8. If this pattern repeats over generations, local relatedness only grows [see 131, for a theoretical analysis and Appendix B.1 for some empirical evidence]. In a cross-cultural ethnographic tabulation due to [197] and [119], a total of 476 out of 1024 societies for which we have data either permitted or favored first and/or second-cousin marriage, and estimates suggest that roughly 10% of marriages around the world today are consanguineous [43].14,15 The wide diversity in attitudes towards consanguinity in human societies partially originates in religious beliefs due to the common jurisdiction of religious institutions over marriage. Table 3.3 shows some of the diversity of religious attitudes toward cousin marriage around the world, and Appendix B.2 summarizes the history of Christian and Islamic attitudes toward consanguinity. Of particular note are the fact that the Catholic Church has placed restrictions on cousin marriage since at least 500AD (sometimes extending these bans out to sixth cousins, as well as to fictive and affinal kin) and the fact that a persistent preference for cousin marriage can be seen in many Islamic countries, reflected in contemporary rates as high as 50% of marriages. In fact, marriage norms may persist, even as religious attitudes change. As one example, con- sanguineous marriage has long been prevalent in parts of Italy, despite it being an almost entirely

13Related literature suggests that assortative matching (and mating) can encourage the evolution of (local) cooperation and favoritism. See e.g. [39, 124] for theoretical and experimental evidence on cooperation, and also [132, 81, 93] on favoritism.

14While the negative health effects of consanguinity are well-known (e.g. increased risk of autosomal-recessive disor- ders), some believe that there are countervailing positive effects as well [151].

15In a small sample, genomic estimates of the “inbreeding coefficient", which measures “the proportion of a genome that is ‘autozygous’ - homozygous for alleles inherited identically by descent from a common ancestor," are correlated as expected with consanguineous marriage rates (r = 0.349, p-value = 0.04, N = 26). The correlation would likely be higher except that the genomic measures also capture background (so-called “cryptic") inbreeding due to geographical population division [209, p. 38]. See Appendix B.1 for more information.

67 Attitude toward Religion Sect Cousin Marriage

Judaism Ashkenzi Permitted Sephardi Permitted Christianity Coptic Orthodox Permitted Eastern Orthodox Proscribed Protestant Permitted Roman Catholic Diocesan approval req. Islam Sunni Permitted Shia Permitted Indo-European Proscribed Dravidian Permitted Permitted Proscribed Confucian/Taoist Partially permitted Zoroastrian/Parsi Permitted

Table 3.3: Religious attitudes to consanguineous marriage [43, p. 14].

Catholic country. [57] suggest this may be a result of persistent cultural norms imported during the Arab conquest of southern Italy over 1000 years ago. Going the other direction, majority-Protestant countries mostly legalized cousin marriage after centuries of living under the Catholic ban. Nev- ertheless, consanguinity remains rare in those countries. In the United States, cousin marriage is illegal in 25 states, though its frequency remains low even where it is legal. Of course, not all consanguineous marriage is driven by religious preference. For instance, con- sanguineous mating can be caused by population division due to geography. As populations mi- grated around the world historically, they became isolated from one another due to vast distances and geographic barriers such as mountains, deserts and oceans that were only recently broken down by transportation technologies. Due to isolation, some groups accumulated relatively high local relatedness. Others have argued that consanguinity is a cultural adaptation to social and ecological circum- stances. In his , [115] suggests that consanguinity may be a property and wealth-preserving response to gender-egalitarian inheritance rules, which would result in the dif- fusion of property under out-marriage. A few studies have examined the causes and consequences of consanguinity in small-scale societies using detailed to directly measure relatedness. [254] show that among forager peoples, such marriages are rare due to norms of exogamy and fis- sion/fusion dynamics that disperse kin across groups, but among agropastoralists, particularly those that practice polygyny, the practice is more common, with average spousal relatedness rising as high as r = 0.18 (almost 50% greater than first cousins, r = 0.125). Using log(surviving children) as a measure of fitness, estimates in [28] suggest that these marriage practices may be adaptive, with fit- ness maximized for moderate consanguinity among agropastoralists and with minimal consanguin- ity among foragers. Other evidence from [145] suggests that consanguinity may be more prevalent near the equator since it can raise the frequency of homozygosity for adaptive recessive mutations that defend against diseases and parasites, which are also more prevalent in warmer climes. Regardless of its origins, wherever it is practiced, consanguineous marriage directly increases local relatedness and encourages sub-ethnic fractionalization, thereby altering the returns to norms of favoritism and norms of impartial cooperation. Thus, variation in consanguinity rates facilitates a test of our main hypothesis: that sub-ethnic fractionalization causes corruption.

68 3.3 Empirical Strategy and Results

To provide evidence for a relationship between sub-ethnic fractionalization and corruption, we present data from cross-country regressions, within-country regressions and laboratory experiments. Each of the methods has limitations, and none of them can wholly address identification or endo- geneity concerns, but our goal is that by approaching the problem at various levels of granularity we can provide a set of robust, complementary tests of the main hypothesis: that sub-ethnic fractional- ization causes corruption. In our cross-country and within-country analyses, we employ data on consanguineous marriage as our measure of sub-ethnic fractionalization and we regress measures of corruption on consan- guinity. As discussed below and detailed in Appendix B.3, [42] have collected data on rates of con- sanguineous marriage (among 2nd cousins or closer relatives) at the country level from a multitude of sources, including surveys, public health studies, and church records. While our cross-country consanguinity sample is neither random nor representative, it covers the large majority of the global population. For our within-country analysis, the data on consanguinity has more complete coverage, as [57] have collected data on consanguinity in Italian provinces. Finally, we report the results of a laboratory corruption experiment in Iran and Canada where our treatments directly manipulate relatedness, bringing strangers, co-ethnics and kin (siblings) into the lab and varying their assignment to roles in the game. These two countries are similarly ethnically fractionalized, but Iran has much higher rates of consanguinity. The design allows us to test for the effects of relatedness by comparing across treatments within each country and for the effects of marriage patterns on social norms by comparing across countries. Robustness checks in which the corruption game is played with friends and/or the social and private cost of corruption is varied provide further support for an interpretation of cross-country differences in social norms rooted, at least in part, in distinct marriage patterns.

3.3.1 Cross-country analysis Figure 3.2 displays average ICRG institutional quality data from 1984-2011 alongside consanguin- ity data from [42]. ICRG indices are widely used in corruption studies since they capture many kinds of corruption and have wide coverage. Grey colored areas indicate missing data. Although we have consanguinity data for 72 countries, in our primary analyses, there are 67 countries for which we have the full set of covariates used in our main regression analyses. In this sample, we find a negative and highly significant correlation between institutional quality and consanguinity (Spearman’s ρ = −0.56, p-value < 0.001, N = 67). While the correlational evidence is strong, we also conducted a series of OLS regressions con- trolling for relevant confounds and alternative explanations.16 Our empirical strategy follows [164] and [12] who attempted to address the endogeneity of corruption by focusing on “(reasonably) ex- ogenous sources of variation" [164, p. 223] in the economic, geographical, political and cultural characteristics of countries. Hence, we do not include contemporary political variables or variables capturing public policy. Instead, our analysis focuses on “more fundamental, or at least historically predetermined" variables (LaPorta et al., 1999, p. 230; see also Treisman 2000, p. 409). The most obvious economic heterogeneity across countries that can affect corruption is eco- nomic development, but development is almost certainly endogenous to corruption. While there is evidence that poor countries are perceived to be more corrupt than rich ones, so that per capita in-

16Results are robust to using Tobit to account for the dependent variable being restricted to the interval [0,6].

69 (a) Corruption

1

6

(b) Consanguinity

65.8

0

Figure 3.2: Corruption and consanguinity around the world. come is a potential determinant of corruption, corruption itself can reduce per capita income [see e.g. 180, 243, 53, 166]. As noted by [249], one exogenous variable that is correlated with economic development but is unaffected by corruption is a country’s latitudinal distance from the equator (in- deed, log GNI per capita averaged over 1984-2011 correlates with latitude, Spearman’s ρ = 0.59, p-value < 0.001, N = 67).17 In addition to latitude, we also include regional dummies from [12] for Sub-Saharan Africa, East Asia and Pacific, Latin America and Caribbean and country size (popula- tion). Following the literature, we also include ethnic fractionalization and legal origins as sources of exogenous variation in country-level political characteristics, and we report heteroskedasticity robust standard errors. After reporting a basic model using the variables described above, our main analysis relies on cross-country variation in cultural traits. First we consider the effects of a cultural preference for (and prohibitions of) consanguineous mating practices by adding consanguinity rates to the basic model, and then, in a series of regressions we allow consanguinity to compete with alternative cultural traits believed to influence corruption in the literature.

17One possible indirect connection is through an effect of latitudinal distance from the equator on consanguinity rates. As noted above, [145] argue that relative parasite prevalence near the equator may raise the returns to consanguinity by raising the frequency of homozygosity for adaptive recessive mutations (e.g. parasite immunities). We find a high correlation between distance from the equator and consanguinity which provides further reason to control for latitude in our regressions (Spearman’s ρ = -0.45, p-value < 0.001, N = 67).

70 Main findings Table 3.4 displays our first set of regressions, with ICRG institutional quality as the dependent vari- able. A full description of the variables is presented in Appendix B.3, Table B.3.1, and the omitted legal origin dummy in the regressions is the British one. Column (1) presents our basic regression model, inspired by [164] and [12], which includes a set of historically predetermined and exoge- nous economic, geographical and political variables. In column (2), we run the same regression using only the sample of countries for which consanguinity data exists and find qualitatively similar results. In column (3), we include consanguinity rates to account for sub-ethnic fractionalization, and the estimated coefficient is significant at the 1% level, increases the R2 by 50%, and remains significant in alternative specifications using different measures of economic development. These estimates imply that a 1 standard deviation increase in consanguinity is associated with a reduction in quality of institutions (i.e. increase in corruption) by about 0.7 standard deviations.

(1) (2) (3) (4) (5) (6) (7) Basic model Basic model and Income instead Latitude as both Income without Income VARIABLES restricted sample Consanguinity of Latitude instrument and Latitude and Latitude

Consanguinity -4.463*** -3.664*** -2.513* -3.566*** -5.087*** (0.769) (0.905) (1.331) (0.884) (0.772) Ethnic fractionalization -0.070 -0.222 0.524 0.0742 -0.123 0.226 0.319 (0.470) (0.670) (0.514) (0.460) (0.417) (0.479) (0.555)

Additional controls yes yes yes yes yes yes yes Observations 134 67 67 67 67 67 67 R-squared 0.527 0.441 0.668 0.695 0.654 0.704 0.632 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table 3.4: Regression analysis of the relationship between consanguinity and corruption. Higher values of the dependent variable imply lower corruption. Additional controls include legal origins dummies, region dummies, latitude and log population. Full specification reported in Appendix Table B.5.1.

Latitude remains a significant determinant of corruption. This provides strong evidence that whatever the effect of corruption on growth, higher economic development is associated with lower corruption, as noted by [249]. When we include income per capita in the regression instead of lati- tude, in column (4), consanguinity is still highly significant. To address the endogeneity of income per capita more formally, we also used latitude as an instrument for income per capita in column (5) which yields similar results; here we cannot reject the null hypothesis of no endogeneity, suggesting the point estimates are consistent. From significance at 1%, latitude becomes insignificant when income per capita is also included in the regression in column (6). The most plausible interpretation is that latitude affects corruption only through income per capita which indicates that latitude is a reasonable proxy for income per capita in the regressions. Compared to those in common law countries, governments in countries of socialist legal origin are more interventionist, and thus the observed negative effect of socialist legal origin on institu- tional quality in all regressions is consistent with previous findings. While [12] note the difficulty of disentangling the independent effect of ethnic fractionalization from income per capita and latitude because all are highly correlated, their ethnic fractionalization index is insignificant in all specifications, even excluding controls for income per capita and latitude as in column (7). As another robustness check, we replicate two specifications from [12], restricting the sample to countries for which we also have consanguinity data (see Appendix B.4), and our results continue to hold. As the authors along with [249] note, ethnic fractionalization has a reduced form relationship with corruption and is not typically significant after controlling for per capita

71 income and latitude, while consanguinity remains significant in columns (3)-(7), with or without one or both of income per capita and latitude.

Alternative interpretations and confounds Given concerns about the endogeneity of income to corruption, we will retain latitude as our proxy for economic development in subsequent regressions. From here on, we build upon specification (3) which controls for historical and predetermined variables. To assure the robustness of our inter- pretation, we compare our preferred cultural metric of consanguinity against alternative theories of culture and also consider possible confounds.

Religion. Religion is the most important historical cultural factor affecting institutions. Re- ligion is potentially relevant to our analysis in two ways: through an indirect effect on corruption via consanguinity and through a direct effect on corruption as discussed in previous work [see 164, 12, 249]. A preference for consanguineous marriage is common in the Islamic world, and thus consan- guinity rates and the percent of the country practicing Islam are highly correlated (Spearman’s ρ = 0.73, p-value < 0.001, N=67).18 In contrast, Catholicism has imposed a long-standing ban on con- sanguineous marriage, and this is evinced by a strong negative correlation between the share of a country practicing Catholicism and consanguinity (Spearman’s ρ = -0.57, p-value < 0.001, N=67). Finally, while Protestant religions do not officially ban consanguineous marriage, the frequency is quite low, and we find a large negative correlation between a country’s share of Protestants and consanguinity (Spearman’s ρ = -0.53, p-value < 0.001, N=67). [164] and [249] offered reasons why may be a cultural deterrent to corruption, beyond its relationship to consanguinity. They argue that Protestantism is associated with attitudes such as [256]’s “Protestant work ethic" and a separation of church and state, both of which may have been conducive to growth and quality government, while countries that were predominantly Catholic or Muslim during this period were relatively more insular, hierarchical and interventionist [see 164, p. 229]. In column (2) of Table 3.5, we include variables indicating the share of each country’s popula- tion that practices Protestantism, Catholicism, and Islam; the excluded category is “other religions". We confirm [164] and Treisman’s (2000) finding that the proportion of Protestants in a country’s population is associated with lower corruption relative to other religious groups. Moreover, Islam and Catholicism are both associated with higher corruption. Note that we also controlled for consanguinity, and though the coefficient is smaller, it remains highly significant. The significant coefficients on religion in our regressions confirm previous find- ings and suggest an additional effect of religion that is independent of its influence on consanguinity traditions. [173] argue that “Protestantism reduces corruption, in part, because of its association with individualistic, non-familistic relations" [249, p. 428]. Family ties is the next cultural trait that we discuss.

Family ties. As [57] note, consanguinity “may be especially attractive where family values are especially important, the size of extended families is large, and social contacts are much more frequent with close relatives" (p. 287). This suggests that sub-ethnic fractionalization (measured

18Note, however, that we find a significant correlation between consanguinity and corruption even if we focus only on minority-Muslim countries (Spearman’s ρ = -0.43, p-value = 0.002, N=47).

72 as consanguinity) may simply reflect the relative importance of family ties across countries, since “strong and stable social relations (such as family ties and group ties) promote a sense of security within such relations but endanger trust that extends beyond these relations" [265, p. 166-8]. Several studies confirm the negative correlation of strong family ties with general trust [266, 265, 94, 85, 14]. Moreover, there is evidence that the strength of family ties contributes to the explanation of heterogeneity in corruption [among other macroeconomic variables, see e.g. 13, 15], and we find a large correlation between their measure of family ties and consanguinity (Spearman’s ρ = 0.58, p-value < 0.001, N=45). Nevertheless, in column (3) which includes data on family ties, as with religion, consanguinity remains a highly significant predictor of corruption, despite losing nearly 1/3 of our observations to missing data on family ties; moreover, the coefficient on family ties is also significant, suggesting that the variables’ effects on corruption are independent and do not necessarily capture the same underlying factors.

(1) (2) (3) (4) (5) (6) VARIABLES Basic model and Religion and Family ties and Trust and Genetic diversity and Geography

Consanguinity -4.463*** -2.411** -3.853*** -4.663*** -3.494*** -3.874*** (0.769) (1.176) (0.722) (0.842) (0.952) (0.999) Ethnic fractionalization 0.524 0.451 0.969 0.846* 0.274 -0.140 (0.514) (0.458) (0.667) (0.486) (0.516) (0.447) Protestant 1.553* (0.791) Catholic -0.904** (0.370) Muslim -1.492*** (0.459) Family ties -1.261** (0.528) General trust 1.562 (1.105) Genetic diversity 48.933 (75.326) Genetic diversity squared -40.988 (56.274)

Geographical variables yes Additional controls yes yes yes yes yes yes Observations 67 67 45 56 67 65 R-squared 0.668 0.762 0.783 0.725 0.689 0.770 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table 3.5: Regression analysis of the relationship between consanguinity and corruption: po- tential confounds. N varies due to missing data for some countries. See Appendix B.5.1 for analogous estimates of column (1) for each subsample. Additional controls include legal origins dummies, region dum- mies, latitude and log population.Full specification reported in Appendix Table B.5.2.

Trust. In contrast to the partiality that may be engendered by family ties, generalized trust is often considered to be a foundation of impartial institutions, and consistent with [265] and others, trust and family ties are negatively related in the sample for which we also have consanguinity data (Spearman’s ρ = -0.52, p-value < 0.001, N=45). As measured in the World and European Values Surveys, trust has been shown to correlate with institutional quality and economic development [253, 92, 240]. One possible concern is that consanguinity-driven sub-ethnic fractionalization is an endogenous response to lack of trust, creating binding ties to encourage and enforce cooperation in the absence of other means. Perhaps surprisingly, we find no significant relationship between gener-

73 alized trust and consanguinity (Spearman’s ρ = -0.19, p-value = 0.16, N=56).19 Nevertheless, when we control for trust in the regression analysis in column (4), it is insignificant; while consanguinity remains a highly significant predictor of corruption.

Genetic diversity. Empirically, the genetic diversity of indigenous populations around the world (measured as mean expected heterozygosity) decreases with geographic (great circle) distance from Ethiopia [216, 208]. This correlation has been argued to reflect the prehistoric “out of Africa" exodus of Homo sapiens to settlements around the world. These migrations, which happened over thousands of years, resulted in a “serial founder effect", in which small founding populations at each new settlement carried with them only a portion of the genetic diversity of the source population.20 [26] show that country-level predicted genetic diversity has an inverted U-shaped relationship with economic development, which suggests that we could include genetic diversity and diversity squared as an additional control for economic development. In column (5) of Table 3.5, we control for predicted genetic diversity and predicted genetic diversity squared, following [26] (see Appendix B.3 for details on the variables). Although consanguinity and predicted genetic diversity are highly correlated (Spearman’s ρ = 0.50, p-value < 0.001, N=67), the coefficient on consanguinity remains sizable and highly significant. In an unreported regression, we instead include migratory distance from East Africa as an instrument for genetic diversity, and the results are qualitatively unchanged.

Geographical factors. [57] report that “following the national trend" most consanguineous marriages were celebrated in the mountains “with a clear decreasing trend moving down" to the hills, in the plain and finally in the city (p. 37-38). This suggests that cross-country differences in consanguinity rates may be a consequence of heterogeneity in geographical barriers to migra- tion and exogamy. As a proxy for geographical barriers, we use the terrain ruggedness index from [204]. As explained in the data description in Appendix B.3, terrain ruggedness captures the aver- age elevation differences of adjacent lands in each country. Perhaps surprisingly, ruggedness is not significantly correlated with consanguinity (Spearman’s ρ = -0.14, p-value = 0.270, N = 67). In col- umn (6) we control for ruggedness and also include a variety of other geographic controls that are plausibly related to economic development, taken from [26]: soil suitability for agriculture, mean elevation, mean temperature, mean precipitation, percentage of the population living in tropical and subtropical zones, percentage of population living in temperate zones, and percentage of land near a waterway. Consanguinity again remains a highly significant predictor of corruption.

Additional robustness checks. To assuage concerns about the changing number of observa- tions in Table 3.5 as we include additional variables for which we have limited data, in Table B.5.5 in Appendix B.5.1, we compare the basic specification including only consanguinity from column (1) and the specification including each cultural variable, restricting to the sample for which we have data on both measures. Finally, since the data on consanguinity were collated from 448 studies

19Note that this insignificant relationship holds if we also restrict the sample to countries for which we have family ties data (p-value = 0.27). This may reflect an ambiguity in the question as it is asked in the WVS and EVS; in particular, while previous analyses have typically interpreted the question as referring to trust of strangers [see e.g. 154, for a discussion], the question as asked does not make this distinction, leaving it up to the respondent to determine the reference group. In societies with high levels of sub-ethnic fractionalization, “most people" [with whom you interact] may refer to a different reference set than in societies with low levels of sub-ethnic fractionalization.

20Evidence of subsequent admixture complicates this view; see e.g. [168].

74 over many decades, we report additional analysis in Table B.5.6 in the appendix that addresses data collection dates. Our results remain robust in both tables.

Caveats. While the evidence is compelling, such cross-country analyses can never allay all endogeneity concerns. Indeed, ethnic conflict resulting from ethnic fractionalization, greater in- tensity of local group interaction due to family ties, and in-group loyalty all might encourage consanguinity-driven sub-ethnic fractionalization (or vice versa) for various reasons. Moreover, cor- rupt, low-quality governance may cause all of these variables indirectly. In many countries with weak institutions “social safety nets are incomplete or nonexistent and households must cope with an unforgiving environment of severe poverty and shocks to economic and physical well-being [. . . ] especially against a backdrop of inadequate formal credit and insurance markets and a minimal wel- fare state" [68, p. 3714]. In such an environment, kin-based, tribal and ethnic networks may play an important role in helping households to manage shortages and uncertainty by supporting infor- mal exchanges including gifts, feasts, rotating saving, informal loans, intermarriage and arranged marriages. These networks rely on in-group trust and reciprocity to enforce informal contracts and provide families with risk-sharing, insurance, and information - all functions which, in developed countries, are typically carried out by markets. Kinship, tribal and ethnic networks also help to or- ganize the provision of public goods, a role that, in developed countries, is usually performed by government. But unlike governments, these networks do not have the power to tax or mobilize re- sources. Therefore, the provision of public goods relies on local, informal exchange of favors [see 68, 17, 153]. Thus, there is reason to believe that weaker institutions may also increase sub-ethnic fractionalization, raising the relative returns to kin-based, clan and tribal organization and concomi- tant norms of local favoritism.

Summary. Overall, our cross-country results are consistent with the idea that marriage-practice driven sub-ethnic fractionalization, by increasing the relative returns to norms of favoritism, is an important cause of corruption. Nevertheless, our cross-country regressions cannot establish causal- ity; in Appendix B.5.2 we report an instrumental variables analysis using a strategy developed by [230] that relies on linguistic differences in kin terminology. The evidence from the IV is consistent with that reported above, though since language and other cultural practices co-evolve, the assump- tion of exogeneity is a strong one. To further test our hypothesis, we next report analysis exploiting historical variation in consanguinity within a single developed country.

3.3.2 Within-country analysis (Italy) To further test the hypothesis that sub-ethnic fractionalization causes corruption, while controlling for country-specific institutional and cultural factors, we collected data on corruption and consan- guinity across Italian provinces. Our consanguinity data are from [57] and are based on records of Papal dispensations for consanguineous marriages kept in the Vatican archives. Because con- sanguinity was officially banned by the church, couples who wished to circumvent the ban had to request approval from their local diocese. Detailed records of these marriages were preserved by the church and compiled in province-level statistics. We report the consanguinity rate by province, measured as the share of all marriages in the province that were between first cousins, or closer rela- tives (e.g. uncle-niece marriages) for the period 1945-1964, since data from this period are available

75 for all provinces and do not include sample variation due to either World War.21 The underlying data, however, date back to 1911, and [57] provide evidence that heterogeneity in consanguinity rates across Italy at the start of their time period was unrelated to a large vector of demographic, social and economic variables (e.g. birth/death rates, , population density, immi- gration/emigration, , and industrialization). Moreover, their evidence suggests that trends in consanguinity in Italy were similar across the entire country, despite substantial differences in levels (see Appendix B.5.4). Data on actual corruption crimes in Italy are not available for the provinces. Therefore, ex- ploiting a known link between corruption and associative crime [i.e. criminal conspiracy and mafia association, see 88], we use the latter as a proxy for corruption. The link is straightforward: as- sociative crime reflects charges leveled at members of criminal organizations that seek to enrich themselves at the expense of others and of the government. Corruption is essential for such criminal organizations because it facilitates the operation of illegal markets for goods and services such as cigarettes, drugs, prostitution, gambling, as well as activities such as car-theft, extortion, tax eva- sion, etc. Corruption allows criminal organizations to obtain information about potential attempts to subvert their operations (by law enforcement or competitors), supports the deletion, falsification or destruction of incriminating evidence, and may be used to neutralize judges, prosecutors, police or experts who might interfere with their plans.22,23 Thus, we proxy for province-level corruption by using the number of associative crimes per 100,000 inhabitants of the province (via ISTAT). Our data on associative crimes and control vari- ables cover the period 2000-2013. All variables are described in detail in Appendix B.3, Table B.3.2. Because of missing data on consanguinity or corruption (partly due to changes in the number of provinces since the year 2000), our analysis is based on data from 101 provinces. Figure 3.3 displays our measures of corruption and consanguinity by province in Italy. We use the log transformation of consanguinity rates to bring the contrast into sharper relief for the fig- ure, but our statistical analysis uses the raw rate of consanguinity as in the cross-country analysis. Grey-colored regions in the figures have missing data. Overall, we find a positive and highly sig- nificant correlation between consanguinity rates and our measure of corruption in Italian provinces (Spearman’s ρ = 0.55, p-value < 0.001, N=101). As in the cross-country analysis, we regress our measure of corruption on a number of controls and include consanguinity rates as our measure of sub-ethnic fractionalization. In such analyses, per capita income and the relative size of the agricultural sector “are often used as proxy variables

21Due to high consanguinity rates in and Sardinia, the church relaxed its rules such that dispensations were not required for marriages between relatives more distant than first cousins in those regions, though dispensations for such marriages remained mandatory elsewhere. Thus to ensure comparability of the data across provinces, we restrict attention to first cousin marriages and closer. Note that the coefficient of relatedness between first cousins (in a randomly mating population) is r = 1/8 vs. r = 1/32 for second cousins.

22A model by [162] connects corruption and associative crimes, based on criminal organizations’ attempts to avoid punishment by bribing law enforcement and otherwise engaging in local corruption.

23The proposed connection is also borne out in the available data. Data on corruption crimes exist at the region level (N = 20), so we report region-level correlations between consanguinity and corruption crimes in Appendix B.5.4, and the re- sults are consistent with the province-level analysis. Moreover, aggregating our proxy measure of corruption, “associative crime", to the region level, the two measures are highly correlated, even with a small number of observations, suggest- ing that associative crime is a reasonable proxy for corruption (Spearman’s ρ = 0.49, p-value = 0.03). See also [117] who analyzed links between organized crime and corruption for The European Commission. Their analysis includes case studies on several European countries including Italy, where corruption and organized crime “are closely intertwined" (p. 157) and “criminal organizations such as the mafia are the most visible in terms of exercising power" (p. 163).

76 Figure 3.3: Corruption and consanguinity in Italy. for the level of development" [71, p. 390]. Again, due to the likely endogeneity of income per capita to corruption, we prefer to use the share of agriculture in the regression, though we report specifications including both. In fact, the two variables are highly correlated (Spearman’s ρ = -0.37, p-value < 0.001, N=101), reiterating that share of agriculture is a good proxy for per capita income. We also control for the population of the provinces. These variables are averaged over 2000-2013. In addition to our correlational estimates using consanguinity rates, we also attempt to address causality by exploiting plausibly exogenous variation in historical exposure to the marriage laws of the Catholic church due to Arab domination of southern Italy from the 7th-10th centuries. This historical episode led to suppression of the Catholic church in some regions and thus facilitates an instrumental variables approach that provides evidence of a causal relationship between consan- guinity and corruption.

Main findings Table 3.6 displays the results of our first set of regressions. Our baseline specification in column (1) reveals a large and significant coefficient on agricultural share of income, indicating that less devel- oped parts of Italy also exhibit lower institutional quality. In column (2), we introduce our measure of sub-ethnic fractionalization: the consanguinity rate. From the estimates in column (2), the effect of consanguinity on corruption is quite large: a 1 standard deviation increase in consanguinity rate is associated with a roughly 0.6 standard deviation increase in associative crimes per hundred thou- sand inhabitants in the province. Moreover, the difference in consanguinity between the least and the most consanguineous province is associated with an increase of roughly 2.8 associative crimes per hundred thousand, about 45% of the difference between the most and least-corrupt region. In columns (3) - (6), we show that our results are robust whether we control for share of agricul- ture or income, or both or neither. Contrary to the cross-country analysis, population, income and its proxy are not significant in the regressions. This suggests that between-province variability of pop- ulation or the degree of development is too low to capture its effect on corruption. Consanguinity is associated with significantly higher corruption in all specifications.

77 (1) (2) (3) (4) (5) (6) Basic model and Consanguinity Income instead of Share of agriculture both Income and without Income and VARIABLES Share of agriculture as instrument Share of agriculture Share of agriculture

Consanguinity 5.144*** 5.371*** 4.984*** 5.155*** 5.427*** (1.099) (1.117) (1.167) (1.115) (1.081) Share of agriculture 12.707*** 3.472 3.705 (4.408) (3.243) (3.550) Log population 0.069 0.022 -0.011 -0.161 0.035 0.011 (0.124) (0.115) (0.132) (0.220) (0.133) (0.116) Log value added per capita -0.034 -0.264 0.018 (0.083) (0.273) (0.092) Constant 0.451 0.763 1.162 2.274 0.662 1.000 (1.654) (1.505) (1.607) (2.074) (1.624) (1.521)

Observations 101 101 101 101 101 101 R-squared 0.106 0.430 0.424 0.396 0.430 0.423 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table 3.6: Regression analysis of the relationship between consanguinity and corruption in Italy.

Table 3.7 displays the results of our first set of regressions including additional geographic and climatic controls: latitude, mean annual temperature and precipitation, soil suitability for agricul- ture, distance to the coast, average elevation, average slope, and ruggedness.

(1) (2) (3) (4) (5) (6) Basic model and Consanguinity Income instead of Share of agriculture both Income and without Income and VARIABLES Share of agriculture as instrument Share of agriculture Share of agriculture

Consanguinity 4.451* 4.506* 4.440* 4.472* 4.470* (2.443) (2.421) (2.470) (2.460) (2.395) Share of agriculture 1.537 0.366 0.991 (3.469) (3.385) (3.631) Log population -0.020 -0.050 -0.032 -0.067 -0.021 -0.051 (0.123) (0.113) (0.124) (0.181) (0.132) (0.111) Log value added per capita 0.029 -0.024 0.042 (0.081) (0.228) (0.088) Latitude -0.359*** -0.271* -0.278* -0.269* -0.273* -0.273* (0.130) (0.147) (0.148) (0.148) (0.149) (0.146)

Additional controls yes yes yes yes yes yes Observations 101 101 101 101 101 101 R-squared 0.459 0.502 0.502 0.501 0.503 0.502 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table 3.7: Replication of Table 3.6 including controls for climate and geography.Additional con- trols include mean temperature, annual precipitation, soil suitability for agriculture, distance to the coast, elevation, slope, and ruggedness. Full estimates reported in Appendix Table B.5.11.

In column (1) of Table 3.7, which is analogous to column (1) of Table 3.6, we observe a negative and highly significant effect of latitude, consistent with the perception that northern Italy is less corrupt. This difference is usually attributed to cultural differences between the south and north [29, 214]. As shown in the table, the effect of latitude becomes smaller and its standard errors become larger once we include consanguinity in columns (2)-(6), which are analogous to the same columns in Table 3.6. In these specifications, the consanguinity rate remains a statistically significant determinant of corruption, though its magnitude is slightly smaller and the associated standard errors slightly larger than above, likely due to the correlation between latitude and consanguinity shown in Figure 3.3b.

IV Regressions. These findings suggest that some of the modern cultural differences between northern and southern Italy might be driven, in part, by historical differences in marriage patterns between the two regions, perhaps “as a remote consequence of Arab domination in Sicily and south- ern Italy in the eighth to the eleventh centuries" [57, p. 3]. The possibility of differences in mating

78 patterns due to a historical event such as Arab domination provides a potential exogenous variation that is worthwhile to explore. Since the Arab conquest also brought new religious authorities, it introduced exogenous variation in the exposure to the Catholic church (and its policies on legiti- mate marriage). There are 42 Roman Catholic ecclesiastical provinces in Italy. Each ecclesiastical province is served by a metropolitan archdiocese. Historical data on the all dioceses and archdio- ceses of Italy is available from their date of establishment. For each ecclesiastical province, we calculated the number of active years of its archdioceses from the date of establishment of the first archdiocese to the present (see the appendix for more details). This measure captures two sources of variation. First, the establishment dates of archdioceses vary from 100 AD to 1100 AD. Second, archdioceses in five ecclesiastical province (covering 10 administrative provinces today) in south- ern Italy were suppressed for more than 200 years between the 7th and 10th centuries, coinciding with the historical Arab domination of southern Italy. We matched the data on the number of active years of archdioceses in ecclesiastical provinces to today’s administrative provinces. This measure is strongly and negatively correlated with consanguinity rates in the 20th century (Spearman’s ρ = -0.390, p-value < 0.000, N=101). We use this variable as an instrument for cousin marriage, but we do so with the caveat that the Church clearly had other influences on social, economic and political life, and so even if we cannot reject exogeneity, there are reasons to remain skeptical of the causal claim.

(1) (2) (3) (4) Basic model Income instead of both Income and without Income and VARIABLES Share of agriculture Share of agriculture Share of agriculture

No. of active years -0.002*** -0.002*** -0.002*** -0.002*** (0.000) (0.000) (0.000) (0.000) Share of agriculture 10.365** 10.778** (4.181) (4.559) Log population 0.128 0.014 0.152 0.111 (0.120) (0.150) (0.156) (0.120) Log value added per capita -0.136 0.032 (0.111) (0.113) Constant 2.363 3.915** 2.212 3.458* (1.766) (1.823) (1.915) (1.765)

Observations 101 101 101 101 R-squared 0.186 0.129 0.186 0.119 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table 3.8: Reduced form regressions with active years of archdioceses.

Table 3.8 reports reduced form regression results showing that provinces with a higher num- ber of active years of archdioceses exhibit lower rates of corruption. Table 3.9 reports regression results using this measure as an instrumental variable for consanguinity and supports a causal inter- pretation; in all cases, we fail to reject the null hypothesis of exogeneity. In Table B.5.12 we also check the robustness of the results to including geographic and climatic controls. Controlling for geography, the results remain robust. Latitude is highly correlated with active years of archdioceses (Spearman’s ρ = 0.434, p-value < 0.000, N=101). When it is included in the regression, consan- guinity becomes insignificant. This provides further evidence that differences in mating patterns are mediated by differences in latitude, which makes sense; the Church’s influence was weaker in southern regions because of its geographic distance from Rome and due to historical Arab, Norman, and Spanish dominations of these regions.

Robustness checks. As in the cross-country analysis, there are a few other potential con- founds which, if addressed, would increase confidence in our results. In Appendix B.5.4, we report

79 (1) (2) (3) (4) Basic model Income instead of both Income and without Income and VARIABLES Share of agriculture Share of agriculture Share of agriculture

Consanguinity 8.306*** 8.576*** 8.658*** 8.035*** (2.396) (2.329) (2.538) (1.891) Share of agriculture -2.205 -1.731 (5.420) (5.552) Log population -0.006 0.066 0.045 0.001 (0.146) (0.172) (0.156) (0.145) Log value added per capita 0.102 0.077 (0.116) (0.094) Constant 0.955 0.291 0.530 0.811 (1.899) (2.160) (1.973) (1.913)

Observations 101 101 101 101 R-squared 0.307 0.287 0.281 0.326 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table 3.9: Active years of archdioceses as an instrument for consanguinity. further analyses based on column (2) of Tables 3.6 and 3.7 and including additional variables that have been suggested to mediate institutional quality (hence corruption); these include measures of civil society, civic , family types, family ties, and historical political domination by var- ious groups [see e.g. 214, 248, 15, 73]. The relationship between consanguinity rates and corruption remains robust; see Tables B.5.13 and B.5.14. Our results are also robust to altering cutoff dates for computing consanguinity rates from 1945-1964 to 1950-1964, 1955-1964, or 1960-1964; see Table B.5.15. Since associative crime is not a measure of actual corruption, but only of the crimes reported and detected by the police, it may underestimate the true phenomenon because of judicial ineffi- ciency. Moreover, the most corrupt regions may be those in which such crimes are least likely to be reported or detected, which further suggests that we may underestimate the relationship between consanguinity and corruption. Thus, in Appendix B.5.4, we show that our regression results are robust to using an alternative measure of corruption from [108] based on computing the ratio of the value of existing physical infrastructure stocks (in 1998) to the expected value of infrastructure given government expenditures over the period 1954-1997. “The intuition underlying this measure is that, all else equal, governments that do not get what they pay for are those whose bureaucrats and politicians are siphoning off more public monies in corrupt transactions" [108, p. 41]. The measure is available for 90 provinces for which we also have consanguinity data, and it is highly correlated with consanguinity (Spearman’s ρ = -0.63, p-value < 0.001, N=90). See Figure B.5.4 and Table B.5.16.

Summary. In our baseline specifications and controlling for a variety of possible climatic, ge- ographical, and historical confounds, consanguinity is a significant predictor of corruption. More- over, our IV results may provide some evidence that the relationship is causal. As in the cross- country analysis, these data are highly consistent with a model in which corruption is in part driven by sub-ethnic fractionalization, as reflected in consanguinity.24

24One further implication of our theory that our current data does not allow us to test is that local crime, in which neighbors are targets, should be less common in regions with high consanguinity rates. [50] provide related evidence consistent with our findings and with this hypothesis as well. Using the distribution of surname frequencies from Italian phonebooks, they show that municipalities with more surname concentration (i.e. with more in-marriage) exhibit both more tax evasion (of a federal tax) and less local crime.

80 3.3.3 Laboratory Experiments The third prong of our approach employs experiments to directly compare norms of favoritism and corruption in a stylized laboratory game conducted in two countries with similar levels of eth- nic fractionalization, but very different marriage practices. Subjects play a one-shot, three-person bribery game, in which Person A may attempt to bribe Person B, who can choose whether to incur a corruption cost, thereby helping A and harming Person C. In each triplet, there are two “related" people (either co-ethnics or kin) and one unrelated person. By varying assignment of people to roles (e.g. A related to B vs. A related to C), we can test how nepotism and corruption vary within-country across different relationships (as e.g. implied by Hamilton’s theory), and by comparing across coun- tries we can see whether norms vary with marriage practices as predicted. We conducted the experiments with students in two large, ethnically heterogeneous cities: Van- couver, Canada and Urmia, Iran.25 Importantly for our motivation, both Iran (0.67) and Canada (0.71) are similarly ethnically fractionalized according to the measure developed in [12]. Vancouver is the largest city in the province of British Columbia and has seen a large influx of people from East and South Asia in the last 30 years. Today, the two most common ethnic backgrounds are English and Chinese. Urmia is the capital of West Azerbaijan Province in Iran, and the city has been home to numerous ethnic groups during its long history. Today, Azeri Turks and Kurds are the two main ethnic groups in the city. While Vancouver and Urmia are both multi-ethnic metropolises with populations around 600,000, they differ sharply in the importance and structure of kin networks. Extended family is very impor- tant in Urmia, and clans and tribes continue to have influence in the region.26 Moreover, consan- guineous marriages are very common in Urmia; consanguinity rates for Azeri Turks and Kurds in Iran are 32% and 40%, while the rate for Iran as a whole is 32% [42]. On the contrary, family struc- ture in Vancouver is mostly nuclear and consanguinity is very rare; in Canada as a whole, the rate is 1.5% [42]. In other words, Urmia exhibits substantial sub-ethnic fractionalization; while Vancouver does not, possibly yielding differences in norms of favoritism and corruption. One might argue that experiment results in Vancouver and Urmia cannot be readily generalized to Canada and Iran, but nevertheless the cross-cultural comparison provides suggestive evidence that speaks to our hypotheses. Moreover, subsequent robustness checks conducted in Tehran (described below) indicate no difference across those cities in Iran.

The game. Subjects participated in the one-shot bribery game shown in Figure 3.4, which is a simplified version of the game described in Section 3.2. Unlike above, there is no risk of punishment. We remove this possibility to eliminate a source of noise, as the impact of punishment has been investigated in previous experimental studies [e.g. 2]. For simplicity, we also assume that only one citizen suffers the negative externality of corruption. We chose our parameters so that bribery (A choosing “Transfer") and corruption (B choos- ing “Accept/Right") do not occur in the subgame perfect Nash equilibrium with unrelated, payoff- maximizing agents. Our design also ensures that the payoffs resulting from successful bribery and corruption are both inequitable and inefficient relative to the status quo, so that distributional and

25Our experiments in Iran were conducted in collaboration with the Moaser Research Center which possesses a permit from the Ministry of Science, Research and Technology to conduct research in Economics and Management. The center took full responsibility for planning, ethical review and official approvals to run experiments in Iran.

26http://www.iranicaonline.org/articles/kurdish-tribes

81 social welfare preferences do not predict bribery/corruption. However, if A and B are sufficiently related, bribery and corruption can occur in equilibrium (via Hamilton’s rule). More broadly, if we observe bribery and corruption, these must be driven by factors other than selfish payoff maximization, (pro)social preferences or concerns for efficiency. Our preferred inter- pretation is that such behavior reflects background social norms related to favoritism and corruption. Moreover, if bribery and corruption rates are higher in Iran than in Canada, this would provide evi- dence of normative differences that are at least correlated with observed higher levels of sub-ethnic fractionalization in Iran. Because of the negative connotation attached to words like “bribe", “corrupt effort", and “nega- tive externality", we applied non-normative language in the experiment using words like “Transfer/Not- transfer", “Reject/Accept", “Right/Left", “payoff added/deducted".27 In order to make full use of our limited sample, we elicited Person B’s strategies using the strategy method, so that we know what he/she would have chosen, had Person A offered the bribe. Payoffs were shown in Experimental Currency Units, and our conversion rates were designed to assure that the stakes were purchasing power-equivalent across the societies. We used local pizza prices as our measure of students’ purchasing power since both cities have many pizza shops, and pizza is popular among students. The price of a medium pizza in Vancouver including tax is around 15 CAD and in Urmia around 15000 Tomans.28 In Vancouver, we paid $7 for arriving on time and converted ECU at a rate of 10 ECU = $1. In Urmia, we paid 7000 Tomans for arriving on time and paid 10 ECU = 1000 Tomans. At the conclusion of the experiment a subset of subjects completed a post-experiment questionnaire (see Appendix B.6.3).

Person A

Not transfer Transfer 40

Person B 100 100 Reject Accept 160 Person B  95  100 Left Right 160

 55  160 140 135 160 55

Figure 3.4: Bribery game in the experiment. Each terminal node shows the payoffs for Person A, Person B and Person C, from top to bottom.

27While [1] and [30] find no effect of framing in laboratory corruption games, their experiments were run with a single sample. To avoid risk of culture-specific framing effects, we erred on the side of caution.

28Although the Rial is the official currency of Iran, Iranians employ the term ‘Toman’, meaning 10 rials

82 Design and treatments We employ a one-shot, between-subject design. Our treatments vary 1) whether the related pair of subjects in each triplet are related by kinship (K) or co-ethnicity (C), and 2) the assignment of subjects to roles. This generates the following treatments, where S refers to the Stranger:

KKS/CCS. Persons A and B are kin/co-ethnics. A(B) knows that B(A) is kin/co-ethnic and also knows that B(A) knows that A(B) is kin/co-ethnic. No information regarding the ethnicity of Person C is given to A or B, and C has no information about the ethnicity of the other players or their being related by kin or co-ethnicity.

KSK/CSC. Persons A and C are kin/co-ethnics. A(C) knows that C(A) is kin/co-ethnic and also knows that C(A) knows that A(C) is kin/co-ethnic. No information regarding the ethnicity of Person B is given to A or C, and B has no information about the ethnicity of the other players or their being related by kin or co-ethnicity.

SKK/SCC. Person B and C are kin/co-ethnics. B(C) knows that C(B) is kin/co-ethnic and also knows that C(B) knows that B(C) is kin/co-ethnic. No information regarding the ethnicity of Person A is given to B or C, and A has no information about the ethnicity of the other players or their being related by kin or co-ethnicity.

We randomly assigned one of the three treatments at the kin/ethnic level to each triple of sub- jects. Subjects were matched in triplets with their kin/co-ethnic based on their self-reported kin- ship/ethnic origin in a pre-experiment questionnaire (see Appendix B.6.2). Pre-experiment ques- tionnaires were collected from subjects online or in paper, prior to the experiment. In the ques- tionnaire, in addition to ethnic origin, we collected demographic information such as age group, gender, degree, and field of study to avoid highlighting the aim of our research. Before subjects learn their roles and information about subjects in the other roles, we mentioned that “you might observe some background information from the pre-experiment questionnaire about participants in the other roles." Also, we always included age-group information for other players in addition to ethnic origin information. We chose 18-30 as the age group to present in the experiment because it covered all the subjects in our sample; therefore, age information was the same for all treatments. We were hoping that these cautions along with the between-subject design would minimize any possible experimenter effect.29 The instructions and more detailed procedures of the experiment are presented in Appendix B.6, including sample pages showing how we exchanged information between subjects in the three-player game.

Subject pool. For the co-ethnic treatments, our subjects in Vancouver were 180 Canadian- born undergraduate students with English or Chinese origins from the University of British Columbia and Simon Fraser University, both located in the Vancouver area. The subjects in the ethnic treat- ment in Urmia, Iran consisted of 180 Iranian-born undergraduate students with Azeri or Kurdish

29When we began our experiments in Vancouver, to present ethnic origin information, we used the word “ethnic origin" on the information page. Later, we dropped this word for the rest of the experiments in Vancouver and all the experiments in Iran, considering that it might affect subjects’ choices due to the salience of “ethnicity". However, the results of experiments in Vancouver indicate that using the word "ethnic origin" had no effect on behavior.

83 origins, taking courses at Urmia University during summer 2015. From each city, we collected data from 20 triplets in each of the three ethnic-level treatments (CCS, CSC, SCC). For the kin treatments, we collected data on all three matching schemes in Urmia and only one matching scheme in Canada (KKS) since recruiting subjects for the Kin treatment in Canada was extremely difficult. For these treatments, we asked students whether their sibling would like to participate in the experiment, and if they answered with “Yes", we also asked the occupation and age group of their sibling. Then we invited those pairs of siblings who both were 18-30 years old and students. For each pair of siblings, another randomly chosen student was invited to participate in the three-person game. In Urmia 180 subjects (60 sibling pairs + 60 others) participated in the three kin level treatments (KKS, KSK, SKK), with 20 triplets per treatment. In Vancouver, 39 subjects (13 sibling pairs + 13 others) participated in the KKS treatment.30 Finally, we conducted two robustness checks: 1) a “Friend" treatment in both countries, in which subjects were asked to bring a close friend to the laboratory and they participated together in the role of persons A and B. The Friend treatment is designed to test whether observed corruption among kin is driven by familiarity. In particular, the possibility in our design of ‘gains from exchange’ in corruption suggests that trusted friends might also be prone to cooperate at the expense of unknown 3rd parties; 2) a “High Cost" treatment involving variants of the Kin, Friend and Stranger treatments in which we increase the cost of corruption by player 2 to eliminate the mutual gains from corrup- tion (such that player 2 is worse off by engaging in corruption than if player 1 had not offered the bribe). By varying the cost of helping, we can test the strength of norms of favoritism within and across countries. In total we collected data on an additional 90 subjects in Canada (10 triplets in the FFS treatment and 20 triplets in the FFS_High treatments) and on 180 additional subjects in Tehran, Iran (12 triplets each in the SSS, SSS_High, FFS_High, KKS_High and KKS_High_Cousins treat- ments). We conducted the SSS treatments to check for baseline differences in corruption among strangers between Tehran and Urmia.

Hypotheses. One direct implication of the theory in Section 3.2 is that the frequency of offer- ing a bribe by Person A and the frequency of accepting the bribe and making the corrupt effort by Person B are positive functions of their relatedness to one another and negative functions of their relatedness to Person C. More important for our purposes, comparing behavior across countries we can test whether norms of favoritism and corruption are more prominent in Iran where we observe increased sub-ethnic fractionalization. While we would expect kin to engage in favoritism in both countries, there is scope for differences in the other treatments. In sum: Hypothesis 1a: Bribery is (weakly) increasing in relatedness of A to B and (weakly) decreasing in relatedness of A to C. Hypothesis 1b: Corruption is (weakly) increasing in relatedness of A to B and (weakly) decreasing in relatedness of B to C. Hypothesis 2: Bribery/Corruption among non-kin are higher in Iran than in Canada, due to in- creased sub-ethnic fractionalization.

30We exclude from the sample one father-daughter pair since all other kin observations were on siblings. Interestingly, the father played the role of person B and was one of very few subjects to reject the bribe; his explanation for his decision indicated that he planned to compensate his daughter for her loss outside of the experiment.

84 Experimental findings First we report within-country comparisons testing for the effects of relatedness on behavior, and then we report between-country comparisons, which provide suggestive evidence for the effects of sub-ethnic fractionalization. We conclude with a discussion of our Friend treatment, which high- lights our interpretation of the findings. The experimental results from our primary treatments are presented in Table 3.10. Each entry in the table shows the fraction of subjects choosing to engage in bribery or corruption, by treatment and matching scheme.

Relatedness. Let µk be the relative frequency of bribery in matching scheme k, with k ∈ {KKS, KSK, SKK} in the Kin treatment and k ∈ {CCS, CSC, SCC} in the Co-ethnic treatment. Similarly, let νk be the relative frequency of corruption under matching scheme k. From the discus- sion in Section 3.2 and Appendix B.1 it follows that kin are more related than co-ethnics and that co-ethnics are more related (in expectation) than strangers.

Iran Canada Kin Co-Ethnic Kin Co-Ethnic Bribery Corruption Bribery Corruption Bribery Corruption Bribery Corruption A & B related 18/20 19/20 17/20 16/20 13/13 12/13 7/20 9/20 A & C related 1/20 7/20 10/20 10/20 NA NA 6/20 7/20 B & C related 8/20 1/20 14/20 9/20 NA NA 9/20 6/20

Table 3.10: Relative frequency of bribery and corruption by treatment in Urmia, Iran and Vancouver, Canada.

Thus, hypothesis 1a has the following implication: µKKS ≥ µCCS ≥ µSKK/SCC ≥ µCSC ≥ µKSK. Using the data from Iran, a Cochran-Armitage test rejects the null hypothesis of equal relative frequency across the treatments in favor of the ordered alternative (technically, the alternative is that the ordering is weak, with at least one relationship strict, Z = 5.85, p-value < 0.001). With our Canadian data, we can only test a portion of the hypothesis, namely µKKS ≥ µCCS ≥ µSCC ≥ µCSC, but again a Cochran-Armitage test rejects the null hypothesis (Z = 3.19, p-value = 0.001). Similarly, hypothesis 1b implies that νKKS ≥ νCCS ≥ νKSK/CSC ≥ νSCC ≥ νSKK. Using Iranian data, a Cochran-Armitage test rejects the null hypothesis of equal relative frequency in favor of the ordered alternative (Z = 6.08, p-value < 0.001). Again, with the Canadian data, we can only test a portion of the hypothesis, namely νKKS ≥ νCCS ≥ νCSC ≥ νSCC, and a Cochran-Armitage test rejects the null hypothesis (Z = 3.28, p-value = 0.001).

Finding 1: In both countries, the data are consistent with the relatedness hypothesis (1a and 1b).

Cross-country comparisons. Table 3.10 reveals evidence of kin favoritism in both countries. We focus on the top row of the table since we have data in the kin treatment only in the KKS cell in Canada. Pooling over the decisions of Persons A and B in the KKS treatment, subjects took 25/26 (96%) corrupt actions in Canada and 37/40 (93%) corrupt actions in Iran; the degree of kin favoritism is virtually indistinguishable across countries (two-tailed proportions test, χ2 = 0.006, p-value = 0.94). However, we see some differences in the degree of ethnic favoritism. Again pooling over the decisions of Persons A and B, and focusing on the CCS treatment, we see subjects took 16/40 (40%) corruption actions in Canada and 33/40 (83%) corrupt actions in Iran. A two-tailed proportions test confirms that these differences are statistically significant (χ2 = 13.5, p-value < 0.001). Using rows 2 and 3 of the table, we can also test for differences in the willingness to harm, rather than

85 help, a co-ethnic by comparing bribery rates when A and C are co-ethnics and corruption rates when B and C are co-ethnics. Here we see 12/40 (30%) such actions taken in Canada and 19/40 (45%) such actions taken in Iran; the difference is not significant (two-tailed proportions test, χ2 = 1.90, p-value = 0.17). Thus, we see that Iranian subjects are substantially more willing to help and no more willing to harm co-ethnics than are Canadian subjects. Using the other cells of Table 3.10, we can also test for differences in the frequency of corrupt actions taken on behalf of strangers. In particular, the decision of Person A to offer a bribe the SCC/SKK treatments was made knowing nothing about the counterpart, as was the decision of Person B to accept and reciprocate (i.e. corruption) in the CSC/KSK. These decisions thus capture the rate of corrupt actions undertaken with strangers. Pooling over the decisions of A and B, we see 16/40 (40%) corrupt actions in Canada and 39/80 (49%) corrupt actions in Iran, and thus we have no evidence of significant differences in corrupt behavior when interacting with strangers (two-tailed proportions test, χ2 = 0.51, p-value = 0.48). Finding 2: Iranian subjects exhibit somewhat more ethnic favoritism than Canadians, though both countries show similar levels of kin favoritism and similar behavior toward strangers. This suggests there are normative differences between the countries; we explore these differ- ences further with subsequent treatments.

Robustness: the Friend and High Cost treatments. One limitation of our basic design is that the effect of kinship cannot be separated easily from that of familiarity. Kin, especially those of similar ages who are willing to attend a laboratory experiment together, are likely to have a good relationship built on reciprocity and generosity that might be reflected in their willingness to coop- erate with one another (at a third party’s expense). The Friend treatment allows us to highlight an important aspect of our interpretation of the findings: that differences in behavior reflect differences in norms of local favoritism, which we believe can be explained in part by difference in marriage practices. In the FFS treatment, conducted only in Canada, we observe bribery/corruption in 19/20 (95%) decisions, indicating that corrupt acts in these small stakes decisions are just as likely among friends as among kin and substantially more so than among co-ethnics (two-tailed proportions tests, χ2 = 0.00, p-value = 1 and χ2 = 14.4, p-value < 0.001, comparing FFS to KKS and CCS, respectively). In almost all societies, norms of favoritism exist among friends, and like marriage norms, these may constitute an instance in which the biological machinery for kin altruism is co-opted to support cooperation among non-relatives.31,32 The adage that “blood is thicker than water" may nevertheless apply to our bribery game, and thus we hypothesized that the behavior of kin and friends might differ if there were no gains from exchange in our experiment (i.e., if the payoffs to A and B after Transfer→Accept→Right were 160 and 90, respectively). To test this conjecture we also conducted the High Cost variants of the treatment where player A and B are related as friends in both countries (N = 20 triplets in Canada and N=12 in Iran). In Iran, we also conducted versions of the High Cost treatment where players A and B are related as siblings and as cousins (N = 12 for both).33 As seen in Table 3.11, we did not

31But there is evidence that friends are more closely related than random individuals; see Appendix B.1.

32This co-optation is reflected in the use of kinship words, e.g. ‘brother’ and ‘sister’, to refer to close friends.

33In Iran, these followup experiments were conducted in Tehran instead of Urmia. One reason to do this was to see if our basic results were unique to Urmia or generalized to the larger and more cosmopolitan city of Tehran. We ran a low

86 Kin (low cost) Friend (low cost) Stranger (low cost)a Bribery Corruption Bribery Corruption Bribery Corruption Canada 13/13 12/13 9/10 10/10 9/20 7/20 Iran 18/20 19/20 NA NA 30/52 22/52

Kin (high cost)b Friend (high cost) Stranger (high cost) Bribery Corruption Bribery Corruption Bribery Corruption Canada NA NA 13/20 9/20 NA NA Iran 24/24 22/24 11/12 10/12 1/12 1/12 aIncludes 40 pairs from Urmia in the SKK and SCC treatments. bIncludes 12 sibling and 12 cousin pairs.

Table 3.11: Summary of relative frequency of bribery and corruption when A & B are related. conduct a full factorial experiment due to difficulties recruiting subjects (we struggled to recruit kin in Canada) and to obviousness of the results (in case of the Low Cost FFS treatment in Iran, which we had no reason to think would differ from the High Cost FFS treatment). The new experiments are described in more detail in appendix B.6.4. Consistent with our conjecture, the key finding here is that in Canada, high effort cost signifi- cantly decreases the frequency of bribery and corruption among friends relative to the experiment with low effort cost among either kin or friends. Pooling over the decisions of A and B in Canada, the frequency of corrupt acts is 22/40 (55%) in the High Cost FFS treatment versus 19/20 (95%) in the Low Cost FFS treatment and 25/26 (96%) in the Low Cost KKS treatment (two-tailed propor- tions tests, χ2 = 8.1, p-value = 0.004, and χ2 = 11.1, p-value = 0.001, comparing High Cost FFS to Low Cost FFS and Low Cost KKS, respectively). Perhaps surprisingly, in Iran, the frequency of bribery and corruption among friends in our high effort cost treatment is not distinguishable from that observed among kin, in either the Low or High cost treatment. Pooling over the bribe/accept decisions in Iran, the frequency of corrupt acts in the High Cost FFS is 21/24 (88%) versus 37/40 (93%) in the Low Cost KKS and 46/48 (96%) in the High Cost KKS, which includes both siblings and cousins since they were indistinguishable (two- tailed proportions tests, χ2 = 0.05, p-value = 0.82 and χ2 = 0.67, p-value = 0.41, comparing High Cost FFS to Low Cost KKS and High Cost KKS, respectively). In fact, when we compare the High Cost FFS treatment across countries, we see a significantly higher rate of corrupt acts in Iran than in Canada among friends (two-tailed proportion test, χ2 = 5.79, p-value = 0.016). In sum, in our High Cost treatment, our Iranian subjects are equally willing to engage in corruption on behalf of kin and friends, but in Canada, the High Cost treatment significantly decreases the willingness to benefit friends, as compared to kin.

Finding 3: Iranian subjects exhibit more favoritism toward friends than Canadian subjects.

This suggests that, in a consanguineous society—where the gains from local altruism are higher— norms of favoritism among friends tend to be stronger too. cost SSS treatment in Tehran and the frequency of bribery (8/12) and corruption (5/12) were not significantly different from the SKK and SCC treatments conducted in Urmia (22/40 and 17/40, p-values > 0.7, two-tailed proportions tests). In addition, as seen in Table 3.11, experiments with siblings and cousins in the treatment with high effort cost in Tehran rules out the possibility of weak kin and relative ties in Tehran relative to Urmia.

87 3.4 Conclusion

Countries around the world exhibit vast differences in levels of corruption, and understanding the sources of these differences is crucial to improving governance. We provide evidence from cross- country, within-country and experimental data that in-marriage and concomitant sub-ethnic frac- tionalization is an important determinant of corruption. In regions with high sub-ethnic fractional- ization, corruption is relatively prevalent, even after controlling for previously studied deep deter- minants of corruption. Motivating our argument with the notion of inclusive fitness from biology, we argue that differences in mating practices and family structure provide the source of this cor- relation. In particular in-marriage practices, which increase relatedness between individuals at the local level, raise the relative returns to norms of kin altruism, nepotism, and favoritism; while, out- marriage practices raise the relative returns to norms of impartial, generalized cooperation. While we focus on the example of cousin marriage, we would expect the same logic to apply to other local in-marriage practices (e.g. locally endogamous marriage within a caste). Our findings suggest that historical differences in mating practices (due to religion, geography, and local circumstance) may have had a powerful influence on today’s norms.

88 Bibliography

[1] Klaus Abbink and Heike Hennig-Schmidt. Neutral versus loaded instructions in a bribery experiment. Experimental Economics, 9(2):103–121, 2006.

[2] Klaus Abbink, Bernd Irlenbusch, and Elke Renner. An experimental bribery game. Journal of Law, Economics, and Organization, 18(2):428–454, 2002.

[3] Mary Abbott. Family ties: English families 1540-1920. Routledge, 2013.

[4] Bina Agarwal. Gender and legal rights in agricultural land in india. Economic and Political Weekly, pages A39–A56, 1995.

[5] Leila Ahmed. Women and gender in Islam: Historical roots of a modern debate. Yale University Press, 1992.

[6] Anna Aizer. The gender wage gap and domestic violence. American Economic Review, 100(4):1847–59, 2010.

[7] Mahsa Akbari, Duman Bahrami-Rad, and Erik O Kimbrough. Kinship, fractionalization and corruption. SSRN Working Paper, 2847222, 2016.

[8] Seyed Mohammad Akrami and Zahra Osati. Is consanguineous marriage religiously encour- aged? islamic and iranian considerations. Journal of Biosocial Science, 39(02):313–316, 2007.

[9] Vivi Alatas, Lisa Cameron, Ananish Chaudhuri, Nisvan Erkal, and Lata Gangadharan. Sub- ject pool effects in a corruption experiment: A comparison of indonesian public servants and indonesian students. Experimental Economics, 12(1):113–132, 2009.

[10] Alberto Alesina, Yann Algan, Pierre Cahuc, and Paola Giuliano. Family values and the regulation of labor. Journal of the European Economic Association, 2015.

[11] Alberto Alesina, Benedetta Brioschi, and Eliana La Ferrara. Violence against women: A cross-cultural analysis for africa. NBER Working Paper, 21901, 2016.

[12] Alberto Alesina, Arnaud Devleeschauwer, William Easterly, Sergio Kurlat, and Romain Wacziarg. Fractionalization. Journal of Economic growth, 8(2):155–194, 2003.

[13] Alberto Alesina and Paola Giuliano. The power of the family. Journal of Economic growth, 15(2):93–125, 2010.

[14] Alberto Alesina and Paola Giuliano. Family ties and political participation. Journal of the European Economic Association, 9(5):817–839, 2011.

89 [15] Alberto Alesina and Paola Giuliano. Family ties. In Philippe Aghion and Steven N. Durlauf, editors, Handbook of Economic Growth, volume 2A, pages 177–215. North Holland: Nether- lands, 2014.

[16] Alberto Alesina, Paola Giuliano, and Nathan Nunn. On the origins of gender roles: Women and the plough. The Quarterly journal of economics, 128(2):469–530, 2013.

[17] Alberto Alesina and Eliana La Ferrara. Ethnic diversity and economic performance. Journal of Economic Literature, 43:762–800, 2005.

[18] Lee J Alston and Morton Owen Schapiro. Inheritance laws across colonies: Causes and consequences. The Journal of Economic History, 44(02):277–287, 1984.

[19] Sofia Amaral. Do improved property rights decrease violence against women in india? SSRN Working Paper, 2504579, 2017.

[20] AALL American Association of Law Libraries and Arturo A Flores. Foreign law: Current sources of codes and basic legislation in jurisdictions of the world. Littleton, Colo.: FB Rothman, 1989.

[21] Dan Anderberg, Helmut Rainer, Jonathan Wadsworth, and Tanya Wilson. Unemployment and domestic violence: Theory and evidence. The Economic Journal, 126(597):1947–1979, 2016.

[22] Siwan Anderson. Legal origins and female hiv. American Economic Review, forthcoming, 2018.

[23] Siwan Anderson and Garance Genicot. Suicide and property rights in india. Journal of Development Economics, 114:64–78, 2015.

[24] Kenneth RM Anthony. Agricultural change in tropical Africa. Cornell University Press, 1978.

[25] Nava Ashraf, Natalie Bau, Nathan Nunn, and Alessandra Voena. Bride price and female education. NBER Working Paper, 22417, 2018.

[26] Quamrul Ashraf and Oded Galor. The’out of africa’hypothesis, human genetic diversity, and comparative economic development. The American Economic Review, 103(1):1, 2013.

[27] St. Augustine. The City of God Against the Pagans (trans: R. W. Dyson). Cambridge Univer- sity Press, 1998.

[28] Drew H Bailey, Kim R Hill, and Robert S Walker. Fitness consequences of spousal related- ness in 46 small-scale societies. Biology letters, 10(5):20140160, 2014.

[29] Edward C Banfield. The moral basis of a backward society. New York: Glencol, 3:4, 1958.

[30] Abigail Barr and Danila Serra. The effects of externalities and framing on bribery in a petty corruption experiment. Experimental Economics, 12(4):488–503, 2009.

[31] Abigail Barr and Danila Serra. Corruption and culture: An experimental analysis. Journal of Public Economics, 94(11):862–869, 2010.

90 [32] David B Barrett. World Christian encyclopedia: a comparative study of churches and reli- gions in the modern world, AD 1900-2000. Oxford University Press, USA, 1982.

[33] Herbert Barry. Ethnographic atlas xxviii. Ethnology, 19(2):245–263, 1980.

[34] Anke Becker. Herding and male dominance. SSRN Working Paper, 3069684, 2017.

[35] Sascha O Becker and Ludger Woessmann. Luther and the girls: Religious denomination and the female education gap in nineteenth-century prussia. The Scandinavian Journal of Economics, 110(4):777–805, 2008.

[36] Jens Beckert. Inherited wealth. Princeton University Press, 2008.

[37] Jeanet Sinding Bentzen, Nicolai Kaarsen, and Asger Moll Wingender. Irrigation and autoc- racy. European Economic Association. Journal, 2015.

[38] Joyce Berg, John Dickhaut, and Kevin McCabe. Trust, reciprocity, and social history. Games and economic behavior, 10(1):122–142, 1995.

[39] Theodore C Bergstrom. The algebra of assortative encounters and the evolution of coopera- tion. International Game Theory Review, 5(03):211–228, 2003.

[40] Lutz K Berkner. The stem family and the developmental cycle of the peasant household: An eighteenth-century austrian example. The American Historical Review, 77(2):398–418, 1972.

[41] Chris Bidner and Mukesh Eswaran. A gender-based theory of the origin of the caste system of india. Journal of Development Economics, 114:142–158, 2015.

[42] Alan Bittles and M. H. Black. Global patterns and tables of consanguinity. 2015.

[43] Alan H Bittles. Consanguinity in context, volume 63. Cambridge University Press, 2012.

[44] Keith Blackburn, Kyriakos Neanidis, Maria Paola Rana, et al. Organized crime, corruption and growth: Theory and evidence. Technical report, Economics, The Univeristy of Manch- ester, 2015.

[45] Dmitri Bondarenko, Alexander Kazankov, Daria Khaltourina, and Andrey Korotayev. Ethno- graphic atlas xxxi: Peoples of easternmost europe. Ethnology, pages 261–289, 2005.

[46] Robert Borofsky. Making history: Pukapukan and anthropological constructions of knowl- edge. Cambridge University Press, 1987.

[47] Samuel Bowles and Dorrit Posel. Genetic relatedness predicts south african migrant workers’ remittances to their families. Nature, 434(7031):380–383, 2005.

[48] S.I. Bruk and V.S. Apenchenko, editors. Atlas Narodov. Moscow: Miklukho-Maklai Eth- nological Institute at the Department of Geodesy and Cartography of the State Geological Committee of the Soviet Union, 1964.

[49] Ulf Brunnbauer. Descent or territoriality: inheritance and family forms in the late ottoman and early post-ottoman balkans. Distinct Inheritances. Property, family and Community in a Changing Europe, pages 181–205, 2003.

91 [50] Paolo Buonanno and Paolo Vanin. Social closure, surnames and crime. Quaderni-Working Paper DSE, 2015.

[51] Lisa Cameron, Ananish Chaudhuri, Nisvan Erkal, and Lata Gangadharan. Propensities to en- gage in and punish corrupt behavior: Experimental evidence from australia, india, indonesia and singapore. Journal of Public Economics, 93(7):843–851, 2009.

[52] Pamela Campa, Michel Serafinelli, et al. Politico-economic regimes and attitudes: Female workers under state-socialism. 2016.

[53] J Edgardo Campos, Donald Lien, and Sanjay Pradhan. The impact of corruption on invest- ment: Predictability matters. World development, 27(6):1059–1067, 1999.

[54] Eliana Carranza. Soil endowments, female labor force participation, and the demographic deficit of women in india. American Economic Journal: Applied Economics, 6(4):197–225, 2014.

[55] Marco Casari and Maurizio Lisciandra. Gender discrimination in property rights: six cen- turies of commons governance in the alps. The Journal of Economic History, 76(02):559– 594, 2016.

[56] Luigi Luca Cavalli-Sforza, Paolo Menozzi, and Alberto Piazza. The history and geography of human genes. Princeton university press, 1994.

[57] Luigi Luca Cavalli-Sforza, Antonio Moroni, and Gianna Zei. Consanguinity, inbreeding, and genetic drift in Italy, volume 39. Princeton University Press, 2004.

[58] Nicholas Charron, Lewis Dijkstra, and Victor Lapuente. Regional governance matters: qual- ity of government within european union member states. Regional Studies, 48(1):68–90, 2014.

[59] Nicholas A Christakis and James H Fowler. Friendship and natural selection. Proceedings of the National Academy of Sciences, 111(Supplement 3):10796–10801, 2014.

[60] CY Cyrus Chu. Primogeniture. Journal of Political Economy, 99(1):78–99, 1991.

[61] Swee-Hoon Chuah, Reema Fahoum, and Robert Hoffmann. Fractionalization and trust in india: A field-experiment. Economics Letters, 119(2):191–194, 2013.

[62] EUA CIA. The world factbook (published online). Central Intelligence Agency, Washington, DC, 1996.

[63] Nevill Colclough. Invisible patrimonies–capitoli matrimoniali in ancien régime ascoli. Dis- tinct inheritances: property, family and community in a changing Europe, pages 145–158, 2003.

[64] John W Cole and Eric R Wolf. The hidden frontier: ecology and ethnicity in an alpine valley. Univ of California Press, 1999.

[65] Robert K Colwell. Predictability, constancy, and contingency of periodic phenomena. Ecol- ogy, 55(5):1148–1153, 1974.

92 [66] Timothy G Conley. Gmm estimation with cross sectional dependence. Journal of economet- rics, 92(1):1–45, 1999.

[67] Youssef Courbage and Emmanuel Todd. A convergence of civilizations: The transformation of Muslim societies around the world. Columbia University Press, 2014.

[68] Donald Cox and Marcel Fafchamps. Extended family and kinship networks: economic insights and evolutionary directions. Handbook of development economics, 4:3711–3784, 2007.

[69] Klaus Deininger, Aparajita Goyal, and Hari Nagarajan. Women’s inheritance rights and inter- generational transmission of resources in india. Journal of Human Resources, 48(1):114–141, 2013.

[70] Alfredo Del Monte and Erasmo Papagni. Public expenditure, corruption, and economic growth: the case of italy. European Journal of Political Economy, 17(1):1–16, 2001.

[71] Alfredo Del Monte and Erasmo Papagni. The determinants of corruption in italy: Regional panel data analysis. European Journal of Political Economy, 23(2):379–396, 2007.

[72] Alfredo Del Monte and Erasmo Papagni. The determinants of corruption in italy: Regional panel data analysis. European Journal of Political Economy, 23(2):379–396, 2007.

[73] Adriana Di Liberto and Marco Sideri. Past dominations, current institutions and the italian regional economic performance. European Journal of Political Economy, 38:12–41, 2015.

[74] Oguzhan C Dincer. Ethnic and religious diversity and corruption. Economics Letters, 99(1):98–102, 2008.

[75] Matthias Doepke and Michele Tertilt. Does female empowerment promote economic devel- opment? Technical report, 2016.

[76] Gertrude E Dole. The lineage pattern of kinship nomenclature: Its significance and develop- ment. Southwestern Journal of Anthropology, pages 36–62, 1965.

[77] Esther Duflo. Schooling and labor market consequences of school construction in indonesia: Evidence from an unusual policy experiment. American economic review, 91(4):795–813, 2001.

[78] Gilles Duranton, Andrés Rodríguez-Pose, and Richard Sandall. Family types and the persis- tence of regional disparities in europe. Economic Geography, 85(1):23–47, 2009.

[79] William Easterly and Ross Levine. Africa’s growth tragedy: policies and ethnic divisions. The Quarterly Journal of Economics, pages 1203–1250, 1997.

[80] Lena Edlund. Cousin marriage is not choice: Muslim marriage and underdevelopment. AEA Papers and Proceedings, 108:353–57, 2018.

[81] Charles Efferson, Rafael Lalive, and Ernst Fehr. The coevolution of cultural groups and ingroup favoritism. Science, 321(5897):1844–1849, 2008.

[82] Robert B Ekelund, Robert D Tollison, Gary M Anderson, FH Robert, Audrey B Davidson, et al. Sacred trust: the medieval church as an economic firm. Oxford University Press, 1996.

93 [83] Nasr G Elbahnasawy and Charles F Revier. The determinants of corruption: Cross-country- panel-data analysis. The Developing Economies, 50(4):311–333, 2012.

[84] Benjamin Enke. Kinship systems, cooperation and the evolution of culture. NBER Working Paper, 23499, 2017.

[85] John Ermisch and Diego Gambetta. Do strong family ties inhibit trust? Journal of Economic Behavior & Organization, 75(3):365–376, 2010.

[86] Ernst Fehr and Urs Fischbacher. The nature of human altruism. Nature, 425(6960):785–791, 2003.

[87] Carl H. Fellner and John R. Marshall. Kidney donors revisited. In Eds. J. Philippe Rushton and Richard M. Sorrentino, editors, Altruism and Helping Behavior: Social, Personality and Developmental Perspectives. Lawrence Erlbaum Associates: New Jersey, 1981.

[88] Nadia Fiorino, Emma Galli, and Ilaria Petrarca. Corruption and growth: Evidence from the italian regions. European Journal of Government and Economics, 1(2):126–144, 2012.

[89] Raymond Fisman and Edward Miguel. Corruption, norms, and legal enforcement: Evidence from diplomatic parking tickets. Journal of Political economy, 115(6):1020–1048, 2007.

[90] James Foreman-Peck and Peng Zhou. Late marriage as a contributor to the industrial revolu- tion in england. The Economic History Review, 2017.

[91] Robin Fox. Kinship and Marriage. Penguin Books: Middlesex, UK, 1967.

[92] Patrick Francois and Jan Zabojnik. Trust, social capital and economic development. Journal of the European Economic Association, 3(1):51–94, 2005.

[93] Feng Fu, Corina E Tarnita, Nicholas A Christakis, Long Wang, David G Rand, and Martin A Nowak. Evolution of in-group favoritism. Scientific Reports, 2(460), 2012.

[94] Francis Fukuyama. Trust: The Social Virtues and the Creation of Prosperity. New York: Free Press, 1995.

[95] Donna R Gabaccia and Franca Iacovetta. Women, gender and transnational lives: Italian workers of the world. University of Toronto Press, 2002.

[96] Simon Gächter and Jonathan F. Schulz. Intrinsic honesty and the prevalence of rule violations across societies. Nature, 531(7595):496–499, 03 2016.

[97] Vincenzo Galasso and Paola Profeta. When the state mirrors the family: the design of pension systems. CEPR Discussion Paper No. DP8723, 2011.

[98] John Luke Gallup, Jeffrey D Sachs, and Andrew D Mellinger. Geography and economic development. International Regional Science Review, 22(2):179–232, 1999.

[99] A. Gardner and A.W. Stuart. Spite. Current Biology, 16(17):R662–R664, 2006.

[100] Thomas R. Garth. Kinship terminology, marriage practices and behavior toward kin among the atsugewi. American Anthropologist, 46(3):348–361, 1944.

[101] JH Gillespie. Population genetics: a concise guide. Johns Hopkins University Press, 2004.

94 [102] Paola Giuliano. The role of women in society: from preindustrial to modern times. CESifo Economic Studies, 61(1):33–52, 2015.

[103] Paola Giuliano. Gender: A historical perspective. NBER Working Paper, 23635, 2017.

[104] Paola Giuliano and Nathan Nunn. Ancestral characteristics of modern populations, 2013.

[105] Edward L Glaeser and Raven E Saks. Corruption in america. Journal of Public Economics, 90(6):1053–1072, 2006.

[106] Bernt Glatzer. The pashtun tribal system. Concept of tribal society, pages 265–282, 2002.

[107] Uri Gneezy, Kenneth L Leonard, and John A List. Gender differences in competition: Evi- dence from a matrilineal and a patriarchal society. Econometrica, 77(5):1637–1664, 2009.

[108] Miriam A Golden and Lucio Picci. Proposal for a new measure of corruption, illustrated with italian data. Economics & Politics, 17(1):37–75, 2005.

[109] Walter Goldschmidt and Evalyn Jacobson Kunkel. The structure of the peasant family. Amer- ican Anthropologist, 73(5):1058–1076, 1971.

[110] Melvyn C Goldstein. When brothers share a wife. Natural History, 96(3):109–112, 1987.

[111] Binglin Gong and Chun-Lei Yang. Gender differences in risk attitudes: Field experiments on the matrilineal mosuo and the patriarchal yi. Journal of economic behavior & organization, 83(1):59–65, 2012.

[112] Jack Goody. Inheritance, property, and marriage in africa and eurasia. Sociology, 3(1):55–76, 1969.

[113] Jack Goody. Production and reproduction: A comparative study of the domestic domain. Cambridge Studies in Social Anthropology, 17, 1976.

[114] Jack Goody. Production and reproduction: a comparative study of the domestic domain. Number 17. Cambridge University Press, 1976.

[115] Jack Goody. The development of the family and marriage in Europe. Cambridge University Press, 1983.

[116] Jack R Goody, Joan Thirsk, and Edward Palmer Thompson. Family and inheritance. Past and Present Publications (USA), 1976.

[117] Philip Gounev and Tihomir Bezlov. Examining the links between organised crime and cor- ruption. Center for the Study of Democracy, 2010.

[118] Jesse Graham, Jonathan Haidt, and Brian Nosek. The Moral Foundations Quiz. www.yourmorals.org, 2008.

[119] J Patrick Gray. Ethnographic atlas codebook. World Cultures, 10(1):86–136, 1998.

[120] J Patrick Gray. A corrected ethnographic atlas. World Cultures, 10(1):24–85, 1999.

[121] Avner Greif. Family structure, institutions, and growth: the origins and implications of west- ern corporations. The American Economic Review, 96(2):308–312, 2006.

95 [122] Avner Greif and Guido Tabellini. The clan and the corporation: Sustaining cooperation in china and europe. SSRN Working Paper, 2565120, 2015.

[123] Avner Greif and Guido Tabellini. The clan and the corporation: Sustaining cooperation in china and europe. Available at SSRN 2565120, 2015.

[124] Veronika Grimm and Friederike Mengel. Cooperation in viscous populations—experimental evidence. Games and Economic Behavior, 66(1):202–220, 2009.

[125] Pauline Grosjean and Rose Khattar. It’s raining men! hallelujah? SSRN Working Paper, 2445285, 2015.

[126] Eleonora Guarnieri and Helmut Rainer. Female empowerment and male backlash. CESifo Working Paper, 7009, 2018.

[127] Luigi Guiso, Paola Sapienza, and Luigi Zingales. People’s opium? religion and economic attitudes. Journal of monetary economics, 50(1):225–282, 2003.

[128] EV Gunnemark. Countries, peoples, and their languages: The geolinguistic handbook. sum- mer institute of linguistics. Inc., Dallas, Texas, 1991.

[129] W. D. Hamilton. The genetical evolution of social behaviour. i. Journal of Theoretical Biology, 7(1):1–16, 1964.

[130] William D Hamilton. The genetical evolution of social behaviour. ii. Journal of Theoretical Biology, 7(1):17–52, 1964.

[131] William D Hamilton. Innate social aptitudes of man: an approach from evolutionary genetics. Biosocial anthropology, 133:155, 1975.

[132] Ross A Hammond and Robert Axelrod. The evolution of ethnocentrism. Journal of Conflict Resolution, 50(6):926–936, 2006.

[133] Casper Worm Hansen, Peter Sandholt Jensen, and Christian Volmar Skovsgaard. Modern gender roles and agricultural history: the neolithic inheritance. Journal of Economic Growth, 20(4):365–404, 2015.

[134] Mariaflavia Harari. WomenâA˘ Zs´ inheritance rights and bargaining power: Evidence from kenya. Department of Economics of MIT, pages 1–47, 2014.

[135] Henry Harpending and Alan Rogers. Fitness in stratified societies. Ethology and sociobiol- ogy, 11(6):497–509, 1990.

[136] Henry C Harpending. Kinship and population subdivision. Population and Environment, 24(2):141–147, 2002.

[137] Stevan Harrell. Human families. Westview Press, 1997.

[138] IPDJ Harris, PD Jones, TJ Osborn, and DH Lister. Updated high-resolution grids of monthly climatic observations–the cru ts3. 10 dataset. International Journal of Climatol- ogy, 34(3):623–642, 2014.

96 [139] hbdchick. Big summary post on the hajnal line. https://hbdchick.wordpress.com/2014/03/10/big-summary-post-on-the-hajnal-line/, March 2014.

[140] Patrick Heady and Hannes Grandits, editors. Distinct inheritances: property, family and community in a changing Europe. LIT Verlag Münster, 2003.

[141] Rachel Heath and Xu Tan. Intrahousehold bargaining, female autonomy, and labor supply: Theory and evidence from india. University of Washington, 2016.

[142] Joseph Henrich. The secret of our success: how culture is driving human evolution, domesti- cating our species, and making us smarter. Princeton University Press, 2015.

[143] David Herlihy. Medieval households. Harvard University Press, 1985.

[144] Benedikt Herrmann, Christian Thöni, and Simon Gächter. Antisocial punishment across societies. Science (New York, N.Y.), 319(5868):1362–1367, 2008.

[145] Ashley D. Hoben, Abraham P. Buunk, Corey L. Fincher, Randy Thornhill, and Mark Schaller. On the adaptive origins and maladaptive consequences of human inbreeding: Para- site prevalence, immune functioning, and consanguineous marriage. Evolutionary Psychol- ogy, 8(4):658 – 676, 2010.

[146] Ladislav Holy.` Kinship, honour, and solidarity: cousin marriage in the Middle East. Manch- ester University Press, 1989.

[147] Martha C Howell. Commerce before capitalism in Europe, 1300-1600. Cambridge University Press, 2010.

[148] David Hume. A Treatise of Human Nature. Oxford: Oxford University Press, (2003) edition, 1740.

[149] John Kenneth Hyde. Society and politics in medieval Italy: the evolution of the civil life, 1000-1350. Not Avail, 1973.

[150] Shabnam Ishaque. Islamic principles on adoption: examining the impact of illegitimacy and inheritance related concerns in context of a child’s right to an identity. International Journal of Law, Policy and the Family, 22(3):393–420, 2008.

[151] Lutfi Jaber, Gabrielle J Halpern, and Mordechai Shohat. The impact of consanguinity world- wide. Community Genetics, 1(1):12–17, 1998.

[152] Tarun Jain. Where there is a will fertility behavior and sex bias in large families. Journal of Human Resources, 49(2):393–423, 2014.

[153] Allen W Johnson and Timothy K Earle. The evolution of human societies: from foraging group to agrarian state. Stanford University Press, 2000.

[154] Noel D Johnson and Alexandra Mislin. How much should we trust the world values survey trust question? Economics Letters, 116(2):210–212, 2012.

[155] Doug Jones. Socially enforced nepotism: How norms and reputation can amplify kin altru- ism. PLoS ONE, 11(6):1–13, 06 2016.

97 [156] Karl Kaser. Power and inheritance: Male domination, property, and family in eastern europe, 1500–1900. Distinct Inheritances. Property, family and Community in a Changing Europe, pages 53–63, 2003. [157] David I Kertzer. The family in Italy from antiquity to the present. Yale University Press, 1993. [158] Kathryn R Kirby, Russell D Gray, Simon J Greenhill, Fiona M Jordan, Stephanie Gomes-Ng, Hans-Jörg Bibiko, Damián E Blasi, Carlos A Botero, Claire Bowern, Carol R Ember, et al. D-place: A global database of cultural, linguistic and environmental diversity. PLoS One, 11(7):e0158391, 2016. [159] Andrey Korotayev. Parallel-cousin (fbd) marriage, islamization, and arabization. Ethnology, pages 395–407, 2000. [160] Andrey Korotayev, Alexander Kazankov, Svetlana Borinskaya, Daria Khaltourina, and Dmitri Bondarenko. Ethnographic atlas xxx: peoples of siberia. Ethnology, 43(1):83–92, 2004. [161] Andrey V Korotayev. Unilineal descent organization and deep christianization: A cross- cultural comparison. Cross-Cultural Research, 37(1):133–157, 2003. [162] Maurice Kugler, Thierry Verdier, and Yves Zenou. Organized crime, corruption and punish- ment. Journal of Public Economics, 89(9):1639–1663, 2005. [163] Timur Kuran. The long divergence: How Islamic law held back the Middle East. Princeton University Press, 2012. [164] Rafael La Porta, Florencio Lopez-de Silanes, Andrei Shleifer, and Robert Vishny. The quality of government. Journal of Law, Economics, and organization, 15(1):222–279, 1999. [165] Rafael La Porta, Andrei Shleifer, and RW Vishny. Law and finance. Journal of Political Economy, 106:1113–1155, 1998. [166] Johann Graf Lambsdorff. How corruption affects persistent capital flows. Economics of Governance, 4(3):229–243, 2003. [167] Johann Graf Lambsdorff. Behavioural and experimental economics as a guidance to anticor- ruption. In Danila Serra and Leonard Wantchekon, editors, New advances in experimental research on corruption, volume 15, pages 279–300. Emerald Group Publishing, 2012. [168] Iosif Lazaridis, Dani Nadel, Gary Rollefson, Deborah C. Merrett, Nadin Rohland, Swa- pan Mallick, Daniel Fernandes, Mario Novak, Beatriz Gamarra, Kendra Sirak, Sarah Con- nell, Kristin Stewardson, Eadaoin Harney, Qiaomei Fu, Gloria Gonzalez-Fortes, Eppie R. Jones, Songül Alpaslan Roodenberg, György Lengyel, Fanny Bocquentin, Boris Gasparian, Janet M. Monge, Michael Gregg, Vered Eshed, Ahuva-Sivan Mizrahi, Christopher Meik- lejohn, Fokke Gerritsen, Luminita Bejenaru, Matthias Blüher, Archie Campbell, Gianpiero Cavalleri, David Comas, Philippe Froguel, Edmund Gilbert, Shona M. Kerr, Peter Kovacs, Johannes Krause, Darren McGettigan, Michael Merrigan, D. Andrew Merriwether, Seamus O’Reilly, Martin B. Richards, Ornella Semino, Michel Shamoon-Pour, Gheorghe Stefanescu, Michael Stumvoll, Anke Tönjes, Antonio Torroni, James F. Wilson, Loic Yengo, Nelli A. Hovhannisyan, Nick Patterson, Ron Pinhasi, and David Reich. Genomic insights into the origin of farming in the ancient near east. Nature, 536(7617):419–424, 08 2016.

98 [169] FM LeBar and GN Appell. Ethnic groups of insular southeast asia. new haven. Human Relations Area Files Press, 1972.

[170] David Levine and Michael Kevane. Are investments in daughters lower when daughters move away? evidence from indonesia. World Development, 31(6):1065–1084, 2003.

[171] Alexandra Lewis. Security, Clans and Tribes: Unstable Governance in Somaliland, Yemen and the Gulf of Aden. Palgrave Macmillan: London, 2014.

[172] M. Pau Lewis, Gary F. Simons, and Charles D. Fennig. Ethnologue: Languages of the World, Nineteenth edition. Dallas, Texas: SIL International, 2016.

[173] Seymour Martin Lipset and Gabriel Salman Lenz. Corruption, culture, and markets. George Mason University, 1999.

[174] Sara Lowes. Kinship systems, gender norms, and household bargaining: Evidence from the matrilineal belt. Working Paper, Harvard mimeo, 2016.

[175] Katherine A Lynch. Individuals, families, and communities in Europe, 1200-1800: the urban foundations of Western society, volume 37. Cambridge University Press, 2003.

[176] Ramzi Mabsout and Irene Van Staveren. Disentangling bargaining power from individual and household level to institutions: Evidence on womenâA˘ Zs´ position in ethiopia. World Development, 38(5):783–796, 2010.

[177] Elainie A Madsen, Richard J Tunney, George Fieldman, Henry C Plotkin, Robin IM Dun- bar, Jean-Marie Richardson, and David McFarland. Kinship and altruism: A cross-cultural experimental study. British Journal of Psychology, 98(2):339–359, 2007.

[178] Aparna Mathur and Sita Slavov. Escaping domestic violence: Empowering women through employment, earnings and wealth in india. AEI Economic Policy Working Paper Series, 2013.

[179] Paolo Mauro. Corruption and growth. The Quarterly Journal of Economics, pages 681–712, 1995.

[180] Paolo Mauro. Corruption and the composition of government expenditure. Journal of Public economics, 69(2):263–279, 1998.

[181] Hans Medick. The proto-industrial family economy: The structural function of household and family during the transition from peasant society to industrial capitalism. Social History, 1(3):291–315, 1976.

[182] Hans Medick and David Warren Sabean. Interest and emotion: essays on the study of family and kinship. Cambridge University Press, 1984.

[183] Stelios Michalopoulos. The origins of ethnolinguistic diversity. The American Economic Review, 102(4):1508, 2012.

[184] Stelios Michalopoulos, Alireza Naghavi, and Giovanni Prarolo. Islam, inequality and pre- industrial comparative development. Journal of Development Economics, 120:86–98, 2016.

[185] Stelios Michalopoulos, Alireza Naghavi, and Giovanni Prarolo. Trade and geography in the spread of islam. The Economic Journal, 2017.

99 [186] Michael Mitterauer. European kinship systems and household structures: medieval origins. Distinct inheritances. Property, family and community in a changing Europe, pages 35–52, 2003.

[187] Michael Mitterauer. Why Europe?: The Medieval Origins of Its Special Path. University of Chicago Press, 2010.

[188] Anna Momigliano. The town in italy where most women are barred from inheriting land. The Atlantic, 2016.

[189] Lewis Henry Morgan. Systems of consanguinity and affinity of the human family, volume 218. Smithsonian institution, 1871.

[190] Jacob Moscona, Nathan Nunn, and James A Robinson. Keeping it in the family: Lineage organization and the scope of trust in sub-saharan africa. American Economic Review Papers and Proceedings, 107(5):565–71, 2017.

[191] Jacob Moscona, Nathan Nunn, and James A Robinson. Keeping it in the family: Lineage organization and the scope of trust in sub-saharan africa. Technical report, National Bureau of Economic Research, 2017.

[192] Jacob Moscona, Nathan Nunn, and James A Robinson. Social structure and conflict: Evi- dence from sub-saharan africa. 2017.

[193] Jacob Moscona, Nathan Nunn, and James A Robinson. Social structure and conflict: Evi- dence from sub-saharan africa. NBER Working Paper, 24209, 2018.

[194] Siegfried Hermann Muller. The World’s Living Languages: Basic Facts of Their Structure, Kinship, Location, and Number of Speakers. Ungar, 1964.

[195] George Peter Murdock. World ethnographic sample. American Anthropologist, 59(4):664– 687, 1957.

[196] George Peter Murdock. Ethnographic atlas, installments i-xxvii. Ethnology, pages 1–10, 1962–1971.

[197] George Peter Murdock. Ethnographic atlas: a summary. Ethnology, 6(2):109–236, 1967.

[198] Chie Nakane et al. Kinship and economic organization in rural japan. Kinship and economic organization in rural Japan., 1967.

[199] Mark New, David Lister, Mike Hulme, and Ian Makin. A high-resolution data set of surface climate over global land areas. Climate Research, 21(1):1–25, 2002.

[200] Tin Nguyen, Enjiang Cheng, and Christopher Findlay. Land fragmentation and farm produc- tivity in china in the 1990s. China Economic Review, 7(2):169–180, 1996.

[201] William D Nordhaus. Geography and macroeconomics: New data and new findings. Pro- ceedings of the National Academy of Sciences of the United States of America, 103(10):3510– 3517, 2006.

[202] Martin A Nowak, Corina E Tarnita, and Edward O Wilson. The evolution of eusociality. Nature, 466(7310):1057–1062, 2010.

100 [203] Nathan Nunn, Emmanuel Akyeampong, Robert Bates, and James A Robinson. Gender and missionary influence in colonial africa. African Development in Historical Perspective, 2011.

[204] Nathan Nunn and Diego Puga. Ruggedness: The blessing of bad geography in africa. Review of Economics and Statistics, 94(1):20–36, 2012.

[205] Samir Okasha. Biological altruism. In Edward N. Zalta, editor, The Stanford Encyclopedia of Philosophy. Standford University Press, fall 2013 edition, 2013.

[206] Una Okonkwo Osili and Bridget Terry Long. Does female schooling reduce fertility? evi- dence from nigeria. Journal of development Economics, 87(1):57–75, 2008.

[207] Elinor Ostrom and Charlotte Hess. Private and common property rights. Encyclopedia of law and economics, 2:53–106, 2000.

[208] Trevor J Pemberton, Michael DeGiorgio, and Noah A Rosenberg. Population structure in a comprehensive genomic data set on human microsatellite variation. G3: Genes| Genomes| Genetics, pages g3–113, 2013.

[209] Trevor J Pemberton and Noah A. Rosenberg. Population-genetic influences on genomic estiamtes of the inbreeding coefficient: A global perspective. Human Heredity, 77(1-4):37– 48, 2014.

[210] Francis Pine. Reproducing the house: kinship, inheritance, and property relations in highland poland. Distinct Inheritances: Property, Family and Community in a Changing Europe, pages 279–295, 2003.

[211] Leendert Plas. Rural gender studies in Europe. Van Gorcum, 1994.

[212] Jean-Philippe Platteau. Institutions, social norms, and economic development, volume 1. Psychology Press, 2000.

[213] Jean-Philippe Platteau and Jean-Marie Baland. Access to land, rural poverty, and public action (Impartible Inheritance Versus Equal Division: a Comparative Perspective Centered on Europe and Subsaharan Africa). Oxford University Press, 2001.

[214] Robert D Putnam, Robert Leonardi, and Raffaella Y Nanetti. Making democracy work: Civic traditions in modern Italy. Princeton university press, 1994.

[215] Nancy Qian. Missing women and the price of tea in china: The effect of sex-specific earnings on sex imbalance. The Quarterly Journal of Economics, 123(3):1251–1285, 2008.

[216] Sohini Ramachandran, Omkar Deshpande, Charles C. Roseman, Noah A. Rosenberg, Mar- cus W. Feldman, and L. Luca Cavalli-Sforza. Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in africa. Proceedings of the National Academy of Sciences of the United States of America, 102(44):15942–15947, 2005.

[217] Navin Ramankutty, Jonathan A Foley, John Norman, and Kevin McSweeney. The global distribution of cultivable lands: current patterns and sensitivity to possible climate change. Global Ecology and biogeography, 11(5):377–392, 2002.

101 [218] Navin Ramankutty, Jonathan A Foley, John Norman, and Kevin McSweeney. The global distribution of cultivable lands: current patterns and sensitivity to possible climate change. Global Ecology and Biogeography, 11(5):377–392, 2002.

[219] Auke Rijpma and Sarah G Carmichael. Testing todd and matching murdock: Global data on historical family characteristics. Economic History of Developing Regions, 31(1):10–46, 2016.

[220] M Fernanda Rivas. An experiment on corruption and gender. Bulletin of Economic Research, 65(1):10–42, 2013.

[221] Janet Roberts. Sociocultural change and communication problems. Study of the role of second languages in Asia, Africa, and Latin America, pages 105–23, 1962.

[222] Alan R Rogers. Evolutionary economics of human reproduction. Ethology and Sociobiology, 11(6):479–495, 1990.

[223] Daniel Rosenblum. Unintended consequences of womenâA˘ Zs´ inheritance rights on female mortality in india. Economic Development and Cultural Change, 63(2):223–248, 2014.

[224] Sanchari Roy. Empowering women? inheritance rights, female education and dowry pay- ments in india. Journal of Development Economics, 114:233–251, 2015.

[225] David W Sabean. Young bees in an empty hive: relations between brothers-in-law in a south german village around 1800. Interest and Emotion: Essays on the Study of Family and Kin- ship. Cambridge: Cambridge University Press. p, pages 171–186, 1984.

[226] Steve Sailer. Cousin marriage conundrum. In Tim Folger and Steven Pinker, editors, The Best American Science and Nature Writing 2004. Mariner Books, 2004.

[227] Assaf Sarid, Oded Galor, and Omer Ozak. Geographical origins and economic consequences of language structures. CESifo Working Paper, 6149, 2017.

[228] Walter Scheidel. Sex and Empire. New York: Oxford University Press, 2009.

[229] William J Schull. Empirical risks in consanguineous marriages: sex ratio, malformation, and viability. American journal of human genetics, 10(3):294, 1958.

[230] Jonathan F Schulz. The churches’ bans on consanguineous marriages, kin-networks and democracy. SSRN Working Paper, 2877828, 2016.

[231] Jonathan F Schulz, Duman Bahrami-Rad, Jonathan Beauchamp, and Joseph Henrich. The origins of weird psychology. SSRN Working Paper, 3201031, 2018.

[232] Martine Segalen. Historical anthropology of the family. Cambridge University Press, 1986.

[233] Danila Serra. Empirical determinants of corruption: A sensitivity analysis. Public Choice, 126(1-2):225–256, 2006.

[234] Mary K Shenk, Mary C Towner, Emily A Voss, and Nurul Alam. Consanguineous marriage, kinship ecology, and market transition. Current Anthropology, 57(S13):S167–S180, 2016.

[235] Tom Siegfried. A Beautiful Math:: John Nash, Game Theory, and the Modern Quest for a Code of Nature. National Academies Press, 2006.

102 [236] Adam Smith. An Inquiry into the Nature and Causes of the Wealth of Nations. Vol. II of The Glasgow Edition of the Works and Correspondence of Adam Smith, 1776.

[237] RM Smith. Land, kinship and life-cycle. Cambridge (UK) Cambridge Univ. Press, 1984.

[238] Enrico Spolaore and Romain Wacziarg. The diffusion of development. The Quarterly Journal of Economics, 124(2):469–529, 2009.

[239] Mysore Narasimhachar Srinivas. The remembered village. Univ of California Press, 1980.

[240] Guido Tabellini. Culture and institutions: Economic development in the regions of europe. Journal of the European Economic Association, 8(4):677–716, 2010.

[241] Guido Tabellini et al. The scope of cooperation: Values and incentives. The Quarterly Journal of Economics, 123(3):905–950, 2008.

[242] Shuhao Tan, Nico Heerink, and Futian Qu. Land fragmentation and its driving forces in china. Land use policy, 23(3):272–285, 2006.

[243] Vito Tanzi and Hamid Reza Davoodi. Roads to nowhere: how corruption in public investment hurts growth, volume 12. International Monetary Fund, 1998.

[244] Edoardo Teso. The long-term effect of demographic shocks on the evolution of gender roles: Evidence from the transatlantic slave trade. SSRN Working Paper, 2927221, 2016.

[245] Joan Thirsk. The european debate on customs of inheritance, 1500-1700. Family and inher- itance, pages 177–191, 1976.

[246] Todne Thomas, Asiya Malik, and Rose Wellman. New Directions in Spiritual Kinship: Sacred Ties Across the Abrahamic Religions. Springer, 2017.

[247] Emmanuel Todd. The explanation of ideology: Family structures and social systems. Black- well Oxford, 1985.

[248] Emmanuel Todd. L’invention de l’Europe. Seuil, 1990.

[249] Daniel Treisman. The causes of corruption: a cross-national study. Journal of public eco- nomics, 76(3):399–457, 2000.

[250] Ana Tur-Prats. Family types and intimate-partner violence: A historical perspective. Eco- nomics Working Papers 1486, Universitat Pompeu Fabra, 2015.

[251] Edward B Tylor. On a method of investigating the development of institutions; applied to laws of marriage and descent. The Journal of the Anthropological Institute of Great Britain and Ireland, 18:245–272, 1889.

[252] UN. Demographic Yearbook. United Nations: New York, 1995.

[253] Eric M Uslaner. Trust and corruption. In Johann Graf Lambsdorff, Markus Taube, and Matthias Schramm, editors, The New Institutional Economics of Corruption, pages 76–93. Routledge, 2005.

[254] Robert S Walker and Drew H Bailey. Marrying kin in small-scale societies. American Journal of Human Biology, 26(3):384–388, 2014.

103 [255] Guang H Wan and Enjiang Cheng. Effects of land fragmentation and returns to scale in the chinese farming sector. Applied Economics, 33(2):183–194, 2001.

[256] Max Weber. The protestant ethic and the spirit of capitalism, trans. Talcott Parsons (New York: Charles Scribner’s Sons, 1958), 182, 1958.

[257] Max Weber. Economy and society: an outline of interpretive sociology. Vol. 1. University of Calif. Press, 1978.

[258] Karl August Wittfogel. Oriental despotism: A comparative study of total power. Yale Uni- versity Press, 1959.

[259] Michael A. Woodley and Edward Bell. Consanguinity as a major predictor of levels of democracy: A study of 70 nations. Journal of Cross-Cultural Psychology, 44(2):263–280, 2013.

[260] C Geoffrey Woods, James Cox, Kelly Springell, Daniel J Hampshire, Moin D Mohamed, Martin McKibbin, Rowena Stern, F Lucy Raymond, Richard Sandford, Saghira Malik Sharif, et al. Quantification of homozygosity in consanguineous individuals with autosomal reces- sive disease. The American Journal of Human Genetics, 78(5):889–896, 2006.

[261] 1978 World Bank. World development report 1978. Oxford University Press, New York, 1978.

[262] WB World Bank. Helping countries combat corruption: the role of the world bank, 1997.

[263] Worldmark. Worldmark encyclopedia of the nations. Gale Cengage, 1995.

[264] Melanie Meng Xue. High-value work and the rise of women: The cotton revolution and gender equality in china. SSRN Working Paper, 2389218, 2016.

[265] Toshio Yamagishi, Karen S Cook, and Motoki Watabe. Uncertainty, trust, and com- mitment formation in the united states and japan 1. American Journal of Sociology, 104(1):AJSv104p165–194, 1998.

[266] Toshio Yamagishi and Midori Yamagishi. Trust and commitment in the united states and japan. Motivation and emotion, 18(2):129–166, 1994.

104 Chapter 4

Origins of WEIRD Psychology

A growing body of research suggests that populations around the globe vary substantially along several important psychological dimensions, and that people from societies characterized as West- ern, Educated, Industrialized, Rich and Democratic (WEIRD) are particularly unusual (1–6). Often at the extremes of global distributions, people from WEIRD populations tend to be more individu- alistic, independent, analytically-minded and impersonally prosocial (e.g., trusting strangers) while revealing less conformity, obedience, in-group loyalty and nepotism (3, 5–13). While these patterns are now well documented, efforts to explain this variation from a cultural evolutionary and historical perspective have just begun (13–20). Here, we develop and test a cultural evolutionary theory that aims to explain a substantial portion of this psychological variation, both within and across nations. Not only does our approach contribute to explaining global variation and address why WEIRD soci- eties so often occupy the tail ends of global distributions, but it also helps explain the psychological variation within Europe—among countries, across regions within countries and between individuals with different cultural backgrounds within the same country and region. Our approach integrates three insights. The first, drawing on anthropology, reveals that the insti- tutions built around kinship and marriage vary greatly across societies (21–23) and that much of this variation developed as societies scaled up in size and complexity, especially after the origins of food production 12,000 years ago (22, 24–29). In forging the tightly-knit communities needed to defend agricultural fields and pastures, cultural evolution gradually wove together social norms governing marriage, post-marital residence and in-group identity (descent), leading to a diversity of kin-based institutions, including the organizational forms known as clans, lineages and kindreds (21, 27, 30). The second insight, based on work in psychology, is that people’s motivations, emotions, percep- tions, thinking styles and other aspects of cognition are heavily influenced by the social norms, social networks, technologies and linguistic worlds they encounter while growing up (31–38). In particular, with intensive kin-based institutions, people’s psychological processes adapt to the col- lectivistic demands and the dense social networks that they interweave (39–43). Intensive kinship norms reward greater conformity, obedience, holistic/relational awareness and in-group loyalty but discourage individualism, independence and analytical thinking (41, 44). Since the sociality of in- tensive kinship is based on people’s interpersonal embeddedness, adapting to these institutions tends to reduce people’s inclinations towards impartiality, universal (non-relational) moral principles and impersonal trust, fairness and cooperation. Finally, based on historical evidence, the third insight suggests that the branch of Western Christianity that eventually evolved into the Roman Catholic Church—hereafter, ’the Western Church’ or simply ’the Church’—systematically undermined the intensive kin-based institutions of Europe during the Middle Ages (45–52). The Church’s marriage policies and prohibitions, which we will call the Marriage and Family Program (MFP), meant that

105 by 1500 CE, and likely centuries earlier in some regions, Europe lacked strong kin-based insti- tutions, and was instead dominated by relatively weak, independent and isolated nuclear or stem families (49–51, 53–56). This made people exposed to Western Christendom rather unlike nearly all other populations. Integrating these insights, we propose that the spread of the Church, specifically through its transformation of kinship and marriage, was a key factor behind a cultural shift towards a WEIRDer psychology in Europe. This shift eventually fostered the creation of new formal institutions, in- cluding representative governments, individual rights, commercial law and impersonal markets (17, 57). This theory predicts that (1) societies with less intensive kin-based institutions should have a WEIRDer psychology and (2) historical exposure to the Church’s MFP should predict both less intensive kin-based institutions and, as a consequence, a WEIRDer psychology. To illuminate these relationships for diverse populations, we (1) developed measures of the intensity of kin-based institutions, (2) created historical databases to estimate the exposure of popu- lations to the Church (along with the MFP) and (3) compiled 20 different psychological outcomes, including laboratory experiments, validated scales, survey questions and ecologically-valid observa- tional data. We examine the predicted relationships from three complementary perspectives. Across countries, we can observe the broadest range of variation in the largest number of psychological out- comes. Across regions, we can track the historical Church as it lumbered across Europe and detect its footprints on the psychological patterns and marital arrangements of modern Europeans. Finally, by comparing second-generation immigrants in Europe based on their links to the kin-based insti- tutions of their ancestral communities around the world, we eliminate many alternative hypotheses for the relationships we’ve illuminated.

4.1 Theory

Our approach proposes that, by structuring and incentivizing the social environments that individu- als must navigate, kin-based institutions shape several important aspects of our psychology. Thus, to understand contemporary psychological variation, we need to understand the historical and cultural evolutionary forces that have shaped the intensity and diversity of kin-based institutions globally. In this section, we first explain kin-based institutions, sketch their intensification since the origins of food production, and then describe how they influence aspects of psychology. Lastly, we’ll sketch the historical process by which the Church systematically dismantled the intensive kin-based insti- tutions in Europe during the Middle Ages. Kin-based institutions are perhaps the most fundamental of human institutions, and have long represented the primary framework for organizing social life in most societies (21, 22, 29, 58). These institutions are composed of sets of culturally-transmitted norms that influence a broad range of social relationships by endowing individuals with a set of responsibilities, obligations and priv- ileges vis-a-vis others in their communities (Supplementary S1). Many of these norms are rooted in, or otherwise tap, aspects of our species’ evolved psychology, including those related to kin al- truism, incest aversion and pair bonding (29, 59, 60). Many kinship systems, for example, extend our species’ innate aversion to inbreeding (incest) with close kin to create taboos on marriages to more distant relatives, usually including particular types of cousins (27). Specifically, some soci- eties classify the children of one’s parents’ same-sex siblings as ’brothers’ and ’sisters’ (so, no sex or marriage with these ’parallel-cousins’) while normatively prescribing or at least permitting mar- riage to the children of one’s parent’s opposite-sex siblings (’cross-cousins’). By shaping patterns of marriage, residence and alliance formation, these norms organize social interactions and configure networks in ways that profoundly influence people’s minds and behavior (61, 62).

106 While all small-scale societies are organized primarily by kin-based institutions, evidence sug- gests that the character of these institutions is influenced in important ways by ecological, climatic, economic and geographic factors (63, 64). Among mobile hunter-gatherers, cultural evolution has responded to ecological risk by fostering social norms that favor extensive kin ties, which create sprawling relational networks that can be tapped when local disasters strike (65, 66). Among Kala- hari foragers, for example, first and second cousins are under incest taboos, so parents traditionally arranged marriages for their daughters with distant kinsmen (65, 67, 68). With the emergence of food production, however, cultural evolution increasingly favored intensive kin-based institutions that permitted communities to unify larger groups to control territories and organize production, storage and distribution (24, 25, 69, 70). Unlike foragers, food producers had to rely on longer-term investments in fields and pastures, which placed a premium on controlling territory and protecting stores. The development of these intensive kin-based institutions proceeded at different rates in dif- ferent regions, presumably due to a variety of ecological, climatic and geographic factors as well as the eventual formation of pre-modern states (48, 71). Cultural evolution thus responded to the emergence of food production by fostering the emer- gence of a diversity of new relationship-building institutions that all intensified group solidarity by constructing denser and more interdependent social networks. For example, instead of favor- ing marriages to distant kin, cultural evolution often favored some form of cousin marriage, which strengthened the existing bonds among families (21). Similarly, unilineal descent systems that create lineages or clans likely emerged because of how systems (common among mobile foragers (72)) create conflicts of interest between the two sides of a person’s family (21, 62). By tracking descent from a common ancestor, members of a clan or lineage tend to see themselves as equally related. This mitigates conflicts of interest within the clan or lineage and promotes stronger in-group loyalty. Complementing anthropological accounts, experimental work in both psychology and neuro- science suggests that aspects of our cognition, emotions, perceptions, and motivations adapt to the normative demands, reputational incentives, and implicit values of the dense networks spanned by kin-based institutions (3, 15, 41, 42, 73–79). Research, for example, has shown that people from societies with social norms favoring interdependence attend to scenes more holistically—based on eye-tracking and neural measures—and show better recall in memory tests for background objects (80–82). Since the cultural evolutionary forces driving the development of intensive kinship favored greater in-group solidarity, often with strict lines of command and control, we expect corresponding psychological patterns: kin-based institutions should foster greater conformity, obedience to author- ities, nepotism and in-group loyalty but less individualism, creativity and independence. Given the reliance of kin-based institutions on relationship-specific norms, these institutions should also favor a contextually-sensitive morality rooted in in-group loyalty over impartial standards and univer- sal principles. Kin-based institutions should also inhibit motivations toward prosociality, including trust, cooperation and fairness, towards strangers or impersonal organizations. The final aspect of our approach incorporates the role of religion and its influence on kin-based institutions. By the start of the Common Era, universalizing religions with powerful moralizing gods (or cosmic forces), universal ethical codes and contingent afterlife beliefs had emerged across the Old World. However, these competing religions varied greatly in how their religious beliefs and practices shaped kin-based institutions (83, 84). In Persia, for example, Zoroastrians glorified the marriage of close relatives, including siblings, and encouraged widespread cousin marriage. Later, Islam curbed polygynous marriage (limiting a man to no more than four wives) but also adopted inheritance customs that promoted a nearly unique form of cousin marriage in which a daughter marries her father’s brother’s son—patrilineal clan endogamy (45–47). Meanwhile, by Late Antiq-

107 uity, the Church had begun systematically dismantling the intensive kin-based institutions of West- ern Europe by banning or otherwise undercutting crucial practices. Prior to the Church’s efforts, the kin-based institutions of European populations looked much like other agricultural societies and included patrilineal clans, kindreds, cousin marriage, polygyny, ancestor worship and corporate ownership (47, 49, 52, 85–94). As documented in Supplementary S2, the Church’s Marriage and Family Program (MFP) began with targeted bans on certain marriage practices used to sustain mar- riage alliances between families (e.g., levirate marriage); however, by the , the Church had become obsessed with incest and had begun to expand the circle of forbidden relatives, eventually including not only distant cousins but also step-relatives, in-laws and spiritual-kin. Early in the new millennium, the ban was stretched out to encompass sixth cousins, including all affines (Table S2.1). At the same time, the Church promoted marriage ’by choice’ (no arranged marriages) and often required newly married couples to set up independent households (). The Church also forced an end to many lineages by eliminating legal adoption, remarriage and all forms of polygamous marriage as well as concubinage, which meant that many lineages began liter- ally dying out as they lacked legitimate heirs. In the 8th century, the Church found a common cause with the Frankish Kings, eventually leading the Carolingian Empire to put its secular power behind the Church’s MFP (Table S2.1) (47, 49, 52, 95). Although quantitative data are lacking for most of the Middle Ages, by 1500 CE European kinship had transformed into a virtually unique configuration, based on monogamous nuclear (or stem) households, bilateral descent, late marriage and neolocal residence (49–51, 53–55, 87, 96, 97). There were few cousin marriages (except among elites with Papal dispensations) and no clans, tribes or kindreds. Land was individually owned, and inheritance was often by testament. Substantial debates persist about why the Church adopted the MFP, which we summarize in Supplementary S2. Support for the full extent of the MFP policies certainly cannot be found in the Bible, and the other branches of Christianity never went so far. Nestorian and Coptic Christians, for example, continued marrying their cousins for at least another millennium. And, while the did adopt some of the same prohibitions as the Western Church, it never endorsed the Western Church’s broad taboos on cousin marriage, was slow to adopt many policies, and was generally unenthusiastic about enforcement. From a cultural evolutionary point of view, diverse religious communities may coalesce on their own beliefs and practices for idiosyncratic or histori- cally contingent reasons, but the broad diffusion of particular beliefs and practices depend on how they shape long-term success (83). Consistent with this, much evidence indicates that the immense financial success of the Church during the Middle Ages can be tied back to the MFP (47, 49, 98). While our approach views the Church’s MFP as a critical contributor to the dissolution of in- tensive kin-based institutions in Europe and to the formation of WEIRD psychology, it does not preclude the existence of other contributors. As noted, ecological, geographic and related factors likely also matter. Moreover, while the Church actively spread to wherever it could, and there was much stochasticity, the diffusion and and implementation of the MFP may have also been influenced by other factors, including by the existing kin-based institutions of the missionized populations. Our analysis addresses these issues in a variety of ways.

4.2 Methods

To test these ideas, we developed two measures of the intensity of kin-based institutions along with measures for the duration of exposure to the Church both at the country level and at the regional level within European countries. We combined these variables with 20 psychological outcome variables, captured at either the country or individual level, that we assembled from diverse sources.

108 To measure the intensity of kin-based institutions, we created two kinds of measures. First, as an omnibus measure of the overall strength of kin-based institutions, we created a kinship intensity index (KII) based on data from the Ethnographic Atlas (83, 84). The Atlas quantifies observations from 1,291 populations based largely on anthropological studies of societies prior to industrializa- tion. We constructed the KII by averaging, for each Atlas ethnicity, measures regarding (1) prefer- ences for cousin marriage (Figure S1.1), (2) polygamy (Figure S1.2), (3) co-residence of extended families (Figure S1.3), (4) community organization (Figure S1.4), and (5) presence of unilineal de- scent (Figure S1.5). Then, linking these Atlas measures via language phylogenies, we mapped our normalized KII values to 7,651 ethno-linguistic populations around the globe from the Ethnologue (101) (Figure 4.1, Supplementary S8). For country-level KII measures, we aggregated up from the ethno-linguistic populations, weighted by their population sizes, within each country (Figure S1.7, for a similar approach see (42)). The five measures that make up our KII each contribute to structuring societies in ways that in- tensify and multiply social bonds within kin groups (see Supplementary S1.1). The KII makes full use of the available Atlas measures that capture social structure. This omnibus measure of kinship intensity captures the key aspects that the Church’s MFP impacted. Because polygynous marriage may impact psychology through alternative mechanisms, related to the intensity of intrasexual com- petition (60), we show that our results hold when we remove polygyny from the KII and control for it separately (Tables S3.9 and S6.6).

Figure 4.1: The Kinship Intensity Index (KII) for 7,651 ethno-linguistic populations around the globe.

The KII is a historical measure of the intensity of kin-based institutions, with the average ob- servation occurring around 1900 CE. This means that KII measures pre-date our psychological measures by about a century. This is an advantage, since it eliminates the risk of contemporaneous reverse causality and permits us to potentially reveal enduring impacts of kin-based institutions on psychology. Analyses reported in Table S3.7 confirm that the observation years for the KII has no impact on our findings. As an alternative measure of kinship intensity, we also compiled estimates of the prevalence of cousin marriages in the 20th century, both across countries and within Europe (Supplementary S1.2). Cross-nationally, we augmented an existing database (102) of estimates of the prevalence of marriages between couples who were second cousins or closer relatives (hereafter, ’cousin marriage prevalence’; see Figure S1.8). Within Europe, we compiled estimates of the rates of first cousin

109 marriage for regions in Italy, and France during the 20th century based on requests for Papal dispensations (permission to marry cousins). Expanding this, we also incorporated rates of cousin marriage from contemporary Turkey. As a measure of kinship intensity, cousin marriage complements the KII in three ways. First, unlike the ethnographic observations used to fashion the KII, cousin marriage provides a statistic about what was actually happening in the 20th century. Second, cousin marriage is a particularly important aspect of intensive kinship: it creates thick family networks. Third, cousin marriage repre- sents perhaps the central MFP-related policy difference between the Western and Eastern Churches, with the Western Church implementing and enforcing its extensive prohibitions with greater verve. To assess the historical exposure of countries and regions to the Churches’ MFP, we calculated the duration of exposure to the Church in two ways. At the country level, we created a measure that captures the number of centuries each country was under the sway of either the Western or the Eastern Church prior to 1500 CE. We then adjusted this indicator for population movements that have occurred after the year 1500 (such as the European migration to the Americas; Supplementary S2.2). This gives us separate country-level measures for Western Church exposure (Figure S2.1) and Eastern Church exposure (Figure S2.2) based on contemporary population distributions.

Figure 4.2: Western Church exposure across European regions. The map shows regional Church exposure between 550 and 1500CE and contains the boundaries of the Carolingian Empire (blue line) and the divide between Western and Socialist Europe (green line; following Churchill’s speech on the “Iron Curtain", we included the former Yugoslavia in Socialist Europe, even though it was not part of the Warsaw pact).

Within Europe, we calculated a similar dosage measure for 440 regions based on the diffusion of bishoprics—regional administrative centers within the Western Church’s hierarchy—through space and time (Figure S2.3). To calculate regional Church exposure, we first divided Europe’s surface into pixels (0.125 × 0.125 decimal degrees or around 14km2). Then, for every pixel and for every half century from 550 to 1500 CE, we coded a binary variable as ’1’ if there was an active bishopric within 100km (about 2–3 days travel, (103)) of the pixel’s centroid. Finally, we calculated each region’s Church exposure by taking the mean across all the pixels in the region and across all half- centuries. Figure 4.2 plots Church exposures for European regions.

110 To connect our effort to important work on family ties (39, 40), we confirm that both of our historical measures of kin-based institutions, as well as Church exposure, predict the value that contemporary populations place on “family ties" (Figure S2.6 and Table S3.5). To capture the theoretically relevant psychological dimensions, we compiled data on 20 vari- ables, which we group in three packages: (1) individualism and independence, (2) conformity and obedience, and (3) impersonal prosociality. Table 4.1 briefly describes and lists the 20 variables by package. Here, we supplement the descriptions in Table 4.1 with more detail for some of the vari- ables, but unfamiliar readers can find complete descriptions in Supplementary S9. Our individualism and independence package includes two survey questions along with Hofstede’s classic measure of individualism and a task that uses triads to assess people’s reliance on analytic vs. holistic thinking. In that task, participants are asked in a series of triads to say whether a target, such as a rabbit, goes with a dog or a carrot. The pairings are either analytical, based on abstract categories (rabbits and dogs are both mammals), or relational/functional (rabbits eat carrots). Our conformity and obedience package includes three survey measures, one psychological scale (tightness) and one experiment (the Asch conformity task). In the Asch conformity experiment, undergraduates were asked to judge over a series of trials which of a set of lines was longest. During certain critical trials, participants heard their peers, who were actually experimental confederates, give the same incorrect judgment just prior to their turn. Because these judgments are easy—97% or more of people get them correct by themselves—the average percentage of incorrect responses in each country provides a country-level measure of conformity. Our impersonal prosociality package has been broken down into two subcategories, one focused on impartiality and the other on impersonal cooperation and trust. Of the impartiality measures, only the honesty dice game requires more explanation. In this experiment, students were isolated in a cubicle and asked to anonymously report their roll of a six-sided die. For rolls from one to five, they were paid in proportion to their reported roll, with ’5’ making the most money. A roll of ’6’, however, was worth zero. At the country level, the percentage of people claiming a ’high roll’ (a ’3’, ’4’, or ’5’) provides a measure of dishonesty, since we know that only 50% of rolls can be ’high’, on average. This measure correlates strongly with country-level measures of corruption and tax evasion (6). For impersonal cooperation and trust, we analyzed two laboratory experiments derived from versions of the public goods game (PGG), one real-world measure of public goods contributions based on voluntary blood donations, and three survey questions about trust and fairness. Since our theory suggests that the fault line created by intensive kinship lies between outgroups or strangers and the binding interpersonal ties of one’s in-group, we follow other researchers (42, 104) in con- structing a country-level measure of impersonal trust using six questions from the World Values Survey (WVS). To construct the out-ingroup trust measure, we first created an in-group measure of trust by averaging people’s reported trust towards (1) their families, (2) their neighbors and (3) people they know. Then, we created an outgroup trust measure based on people’s reports about their trust in (1) foreigners, (2) people they’ve met for the first time, and (3) people from other religions. Finally, we subtracted our outgroup trust measure from our ingroup measure (and standardized it). To measure impersonal trust within Europe, we used the commonly used generalized trust ques- tion, which asks whether “most people can be trusted" or “you can’t be too careful". However, detailed analyses suggest that while in Europe this question is interpreted as intended, to refer to trust in generic strangers, it is not consistently interpreted in this way everywhere (Supplementary S7). In places where people have embedded themselves both physically and relationally in dense networks of interpersonal ties, many report that most people can be trusted in response to the gener- alized trust question, while also reporting that they don’t trust foreigners, new people or those from

111 Table 4.1: Psychological and behavioral measures. ’std’ or ’%’ indicate that a variable has been standardized or is expressed in percent. ’WVS’: World Values Survey (112); ’ESS’: European Social Survey (113); ’WHO’: World Health Organization; ’PGG’: Public Goods Game; ’UN’: United Nations. other religions. For this reason, we use both the generalized trust and generalized fairness survey questions within Europe but drop both in our cross-country analyses (though the results are reported in Supplementary S3.3B).

4.3 Results

We present our findings in three major stages. In the first stage, we examine cross-country patterns. This allows us to tap the broadest range of psychological outcomes (16 variables) and illustrates the breadth of contemporary global variation. We examine the relationships between our outcomes and both our country-level measures of kinship intensity—the KII and the prevalence of cousin marriage—and Church exposure. We also confirm that greater Church exposure is associated with less intensive kinship. In the second stage, we zoom in on Europe and examine the relationships be- tween four survey-based outcomes and both regional Church exposure and cousin marriage. Again, as in the cross-country analysis, we confirm that longer exposure to the Western Church is associated with weaker kin-based institutions. Finally, in the third stage, we compare only the adult children of immigrants within European countries. We establish links between the kinship intensities of peo- ple’s immigrant parents—based on the parents’ native countries or their originating ethno-linguistic populations—and our four survey-based psychological measures. In addition, further confirming our findings, Supplementary S7 reports the results of analyses that relate psychological measures drawn from the WVS to variation in the kinship intensities of the ethno-linguistic populations within countries around the globe.

112 4.3.1 Psychological variation across countries Figures 3-6 show the relationships between our outcomes and (1) the KII, (2) cousin marriage prevalence and (3) Western and Eastern Church exposure. For our individualism and independence package, Figure 4.3 shows that less intensive kinship, measured by either the KII or cousin mar- riage, and more Church exposure are associated with a WEIRDer psychology—i.e., with greater individualism, increased creativity and more analytic thinking. All of these relationships go in the predicted directions. The Spearman correlations range in magnitude from 0.10 to 0.70. The only result that is not significant at the 5% level (on a one-sided test) is that between the KII and analytic thinking, which is not surprising given the limited variation in KII available in that small sample (N= 29) (Throughout, we report the significance of our estimates on one-sided tests because our theory makes clear predictions about the directions of the associations we examine). Figure 4.4 shows that intensive kinship is negatively correlated with a WEIRDer psychology in our conformity and obedience package: more intensive kinship is associated with greater psy- chological tightness, more conformity (Asch task), more importance attributed to obedience and increased devotion to tradition. Similarly, greater Church exposure is associated with less tightness, less conformity, lower rates of conformity and a reduced emphasis on obedience and tradition. All relationships go in the predicted direction, with Spearman correlations ranging in magnitude from 0.12 to 0.65. All are significant except the relationship between tightness and the KII, whose lack of significance may again be due to the limited variation in the KII in this small sample (N= 31). The fifth variable of our conformity and obedience package, proper behavior (not shown), reveals no relationship to any of our key theoretical predictors (Figure S3.1). For the impartiality sub-package, Figure 4.5 shows the 12 key relationships, all of which go in the predicted directions with Spearman correlations ranging in magnitude from 0.25 to 0.69. Based on laboratory and field measures, Figure 4.5A–4.5C show that greater kinship intensity and less exposure to the Western Church are associated with fewer accurate reports of die rolls, more parking tickets for diplomats under immunity, and a greater acceptance of lying in court to help a friend. Figure 4.5D shows that more intensive kinship and less Church exposure are associated with more nepotistic hiring practices. All associations are significant except that between particularism and the KII (P= 0.051), which is likely influenced by the limited variation in KII available in that small sample (N= 42).

113 Figure 4.3: Individualism and independence, kinship intensity and Church exposure. The figure shows the cross-country relationship between our psychological outcomes—individualism (row A), creativity (row B), and analytic thinking (row C)—and the KII (1st column), cousin marriage prevalence (2nd column) and both the Western and Eastern Church exposures (3rd column). Linear best-fit lines are displayed; in the third column, the solid and dashed lines plot the best- fit for the Western and Eastern Churches, respectively. Reported are Spearman’s Ï ˛Aand associated significance levels (one-sided).

114 Figure 4.4: Conformity and obedience, kinship intensity and Church exposure. The figure shows the cross-country relationship between our psychological outcomes—tightness (row A), Asch conformity (row B), obedience (row C) and tradition (row D)—with the KII (1st column), cousin marriage prevalence (2nd column) and both the Western and Eastern Church exposures (3rd column). Linear best-fit lines are displayed; in the third column, the solid and dashed lines plot the best-fit for the Western and Eastern Churches, respectively. Reported are Spearman’s ρ and associated significance levels (one-sided).

115 Figure 4.5: Impartiality, kinship intensity and Church exposure. The figure shows the cross-country relationship be- tween our psychological outcomes—high claims (%) in the honesty dice game (row A), UN diplomatic tickets (unpaid parking fines; row B), particularism (row C) and nepotism (row D)—and the KII (1st column), cousin marriage prevalence (2nd column) and both the Western and Eastern Church exposure (3rd column). Linear best-fit lines are displayed; in the third column, the solid and dashed lines plot the best-fit for the Western and Eastern Churches, respectively. Reported are Spearman’s ρ and associated significance levels (one-sided).

116 Figure 4.6: . The figure shows the cross-country relationship between our psychological outcomes—1st round PGG contributions (row A), average PGG contribution with punishment (row B), blood donations (row C), and out-ingroup trust (row D)—and the KII (1st column), cousin marriage prevalence (2nd column) and both the Western and Eastern Church exposure (3rd column). Linear best-fit lines are displayed; in the third column, the solid and dashed lines plot the best-fit for the Western and Eastern Churches, respectively. Reported are Spearman’s ρ and associated significance levels (one-sided).

117 Figure 4.6 shows the results from the impersonal cooperation and trust sub-package. Figures 4.6A and 4.6B reveal that less intensive kinship and more Church exposure are associated with higher contributions in both the first round of a standard PGG and across all rounds in a PGG with punishment. These experimental findings are further grounded in the real-world by Figure 4.6C, which reveals parallel patterns when blood donations are analyzed. Figure 4.6D shows that greater kinship intensity and less Western Church exposure are associated with a sharper distinction be- tween in-group (e.g., family and neighbors) and out-group members (e.g., foreigners, new people). All the relationships in Figure 4.6 go in the expected direction, with less intensive kinship and more Church exposure associated with a WEIRDer psychology, and with Spearman correlations ranging in magnitude from 0.40 to 0.98. All correlations are significant except that between Church expo- sure and average contributions in a PGG with punishment (Figure 4.6A, column 3), where although the relationship is strong (ρ= 0.53), the few data points (N=10) limit inference. Along with exposure measures for the Western Church, Figures 4.3-4.6 also show the relation- ships for each of our psychological outcomes and exposure to the Eastern Church. As expected given the lower intensity with which the Eastern Church implemented and enforced its MFP, the relationship with the Eastern Church often resembles that with the Western Church—in 9 of 13 cases—but remains weaker and is often not significant. (We note that fewer countries were exposed to the Eastern Church, which limits statistical power.) Our primary goal in presenting these cross-country relationships has been to reveal the extent of global psychological variation and show how it patterns in ways consistent with our theoreti- cal expectations. However, to further test the robustness of these relationships, we also estimated 24 additional regression models for each of the eight psychological outcomes for which we have sufficient sample sizes (N ≥ 40). In our baseline model, we regressed each outcome on (1) the KII, (2) cousin marriage prevalence or (3) the two Church exposures (both in the same regression), separately, and on four geographic control variables: agricultural suitability, absolute latitude, mean distance to waterways and average terrain ruggedness (Supplementary S3). Each of these controls has previously been associated with economic development, colonial expansion or productivity. Table 4.2 presents the results for our baseline models. For each outcome, the associated col- umn provides the four relevant coefficients from three separate regression models (distinguished by shading). The results show that, holding the four geographic controls constant, 23 of the 24 theoreti- cally relevant coefficients go in the predicted direction and 22 of the 24 coefficients are significantly different from zero. The effect sizes are moderate to large. For example, a one-standard-deviation increase in the KII is associated with a change of between a quarter and half of a standard deviation in tradition, creativity and out-ingroup trust (all in the predicted direction). Similarly, an additional 500 years of exposure to the Western Church is associated with a change of between 0.80 and 1.15 standard deviations in tradition, creativity and out-ingroup trust. In contrast, for the Eastern Church, the relationships between our psychological outcomes and Church exposure essentially disappear. In addition to these baseline models, Supplementary S3 explores the effect of separately adding control variables for (1) parasite stress and tropical regions, (2) irrigation potential, (3) suitability for oats and rye, (4) time since the origins of agriculture and genetic heterogeneity (expected het- erozygosity), (5) major religious traditions (Catholic, Protestant, Orthodox, other Christian, Islam, Hinduism and Buddhism), (6) religiosity and (7) continental fixed effects. Thus, for each of our psy- chological outcome variables, we estimated three simple correlations (Figures 3-6), three baseline models (Table 2) and 21 models that added each of these seven variable sets to our baseline model, giving us 27 models. For our individualism and independence package (two outcomes, shown in Table 2), the coefficients on our three key theoretical predictors were in the hypothesized direction in 52 out of 54 models and significant in 43 models. For our conformity and obedience package

118 Table 4.2: Baseline cross-country regressions for psychological outcomes. Country-level regressions of psychological outcomes with data from more than 40 countries on the KII, log % cousin marriage, and Western and Eastern Church exposure. Outcome variables were standardized (z-scores) unless otherwise indicated. All regressions include our set of baseline controls: ruggedness, mean distance to waterways, caloric suitability and absolute latitude. Robust standard errors are reported in parentheses. ∗P ≤ 0.05, ∗∗P ≤ 0.01, ∗∗∗P ≤ 0.005 (one-sided).

(two variables), the coefficients were in the predicted direction in 45 of 54 models and significant in 26 models. For the impersonal prosociality package (four variables), 105 out of 108 coefficients were in the predicted direction and 81 of these were significant. The detailed regression results are reported in Tables S3.1-S3.4. Countries, of course, do not represent truly independent observations, which means that our standard errors in these models may be biased. To address this, we re-estimated our main regres- sions using Conley standard errors (114), to account for similarities among countries induced by either spatial proximity or by a shared genetic/cultural history. These results confirm that the above findings are robust (Table S3.6). Focusing now on the link between Church exposure and intensive kinship, our cross-country analyses confirm that nations with more centuries of exposure to the Church have less intensive kin- based institutions and lower rates of cousin marriage (Table S3.10). Each additional 500 years under the Western Church is associated with a reduction of 1.2 standard deviations in the KII and a 91% decline in cousin marriage rates. These results are robust to the addition of the baseline geographic controls and hold across all seven of the supplementary specifications described above. Moreover, Church exposure accounts for a substantial part of the variation in the KII (R2: 40%) and cousin marriage rates (R2: 62%). Partial R2s underscore this: Church exposure explains about 20% of the variation in both the KII and cousin marriage rates beyond the variation already explained by the baseline geographic controls. While these cross-country analyses reveal patterns fully consistent with our hypotheses, such analyses are fraught with potential confounds, including differences in endogenous variables like national wealth, effective governments and the rule of law. To better map the causal pathways in- volved, we now zoom in on Europe.

4.3.2 Psychological Variation within Europe To explain the psychological variation within Europe, we combine four survey-based measures with estimates of cousin marriage rates and our measure of regional Church exposure. We first link our psychological outcomes to Church exposure, and then confirm their connection to cousin marriage rates in the 20th century. We also confirm that greater historical Church exposure is associated with

119 Table 4.3: Regression of psychological outcomes on exposure to the medieval Western Church. Individual-level OLS regression of ESS-based psychological outcomes on Church exposure. Outcome variables were standardized (z-scores). The basic controls are country and survey-wave fixed effects, individual characteristics (gender, age, and age squared) and basic geographic controls (agricultural suitability, absolute latitude, mean distance to the sea and average terrain ruggedness). Robust standard errors clustered at the regional level are reported in parentheses. ’No. regions’: number of regions. ∗P ≤ 0.05, ∗∗P ≤ 0.01, ∗∗∗P ≤ 0.005 (one-sided). less cousin marriage. Finally, we study the relationship between cousin marriage and voluntary blood donations in Italy. We regressed each of our four psychological outcomes on Church exposure across 11 different models, including a simple regression. 4.3 shows two of these models for each of our psychological outcomes. The first model includes our four baseline (geographic) controls, individual demograph- ics (age, age squared and sex) and both country and survey-wave fixed effects. The second model adds individual-level measures of religiosity and denomination (Roman Catholic, Protestant, Mus- lim, etc.). By absorbing all the variation between countries (i.e., the mean country differences), the country fixed effects allow us to effectively only compare individuals from different regions within the same countries. 4.3 shows that Europeans from regions that were under the sway of the Western Church for longer tend to have a WEIRDer psychology: they reveal higher individualism- independence, less conformity-obedience and both greater generalized trust and generalized fair- ness. Considering that Europe was fully Christianized by 1500 CE, these effects are large. An ad- ditional millennium under the Western Church is associated with increases in these psychological outcomes of roughly one-tenth of a standard deviation. Importantly, consistent with our theory that emphasizes the medieval Church’s MPF and not other religious factors, these effects hold or actually increase in magnitude when religiosity and denomination are statistically held constant. The results shown in Table 4.3 are robust in several ways. First, Table S4.1 explores the effects of including both additional regional and individual-level variables. At the regional level, these models include a battery of geographic, climatic and agricultural controls as well as variables that capture a region’s integration into the (based on Roman roads), population density in 500 CE (to capture economic development at the onset of the MFP), and incorporation within either the Carolingian Empire (814 CE) or the Soviet Bloc (1948 CE). These controls mitigate concerns that the Church may have been strategically moving along Roman roads or into ecologically or economically well-endowed regions. We also confirmed that these results hold after controlling for the presence of monasteries within each region and for the educational attainment at the individual level, which is important since the Church could be operating on contemporary psychology through its influence on schooling or monasteries instead of via kin-based institutions. Across 44 models, all of the coefficients on regional Church exposure are in the predicted direction and 42 are significant. Second, the results are robust to estimating the models using only the 191 regions in the formerly socialist countries of Europe (Figure 4.2 and Table S.4.2), which confirms our broader findings in a part of Europe with a very different history than Western Europe. Third, to verify that nothing hinges on the 100km circle of influence we drew around each bishopric to define our regional

120 Church exposure measure, we re-calculated it by assuming radii ranging from 50km to 200km. Table S4.3 demonstrates that our results do not hinge on the choice of 100km. Furthermore, to address concerns that our baseline regional Church exposure measure may give too much weight to unpopulated pixels within a region, we constructed a population-weighted measure (Supplementary 2.3) and verified that our results are robust to using that measure. To examine the relationship between intensive kin-based institutions and our four psychological outcomes, we turn to our sample of marriage rates among first cousins from Italy, Spain and France. Here, to highlight the uniform impact of intensive kinship across diverse populations with quite dif- ferent histories, we augmented this data with first cousin marriage rates from Turkey. As the bridge between Europe and the Middle East, Turkey is interesting because it was never under the Western Church, so rates of cousin marriage remain much higher than in Europe. Figure 4.7 shows the re- lationship between cousin marriage rates and our four psychological outcomes. These relationships are all in the expected direction and strong, with cousin marriage accounting for between 27% and 69% of the psychological variation. To further explore these relationships, we regressed each of our four psychological outcomes on rates of cousin marriage across ten additional specifications (Table S4.5), following the same approach used for linking Church exposure to our psychological outcomes, albeit now with a smaller sample. For generalized trust and fairness, the coefficients on cousin marriage were always in the predicted direction and significant at the 5% level (one-sided test) in 18 of 20 specifications. The two specifications showing non-significant coefficients control for presence of the Carolingian Empire and the density of monasteries. This is not surprising given the role of both Carolingian rule and the monastic orders in implementing the MFP. For the other two psychological outcomes, 17 of the 18 coefficients on the rate of cousin marriage go in the predicted direction, though they are not estimated with precision. The loss in precision is due to the inclusion of country fixed effects since most of the variation in cousin marriage occurs between countries. To further explore these relationships, we regressed each of our four psychological outcomes on rates of cousin marriage across ten additional specifications (Table S4.5), following the same approach used for linking Church exposure to our psychological outcomes, albeit now with a smaller sample. For generalized trust and fairness, the coefficients on cousin marriage were always in the predicted direction and significant at the 5% level (one-sided test) in 18 of 20 specifications. The two specifications showing non-significant coefficients control for presence of the Carolingian Empire and the density of monasteries. This is not surprising given the role of both Carolingian rule and the monastic orders in implementing the MFP. For the other two psychological outcomes, 17 of the 18 coefficients on the rate of cousin marriage go in the predicted direction, though they are not estimated with precision. The loss in precision is due to the inclusion of country fixed effects since most of the variation in cousin marriage occurs between countries. As a final check, instead of using Church exposure, we looked at whether a region was in the Carolingian Empire in 814 CE, the year of Charlemagne’s death. In the 8th century, the Carolingian rulers teamed up with the Church to enforce the MFP. Consistent with this, we find that, within countries, regions exposed to the Carolingian Empire exhibit lower 20th century cousin marriage and reveal greater individualism-independence, less conformity-obedience and both greater gener- alized trust and fairness. (Table S4.4 and Table S4.6). Zooming in on Italy, where we have more finely grained cousin marriage data for 92 provinces, we examined voluntary, unpaid blood donations (Figure S5.1). In a simple regression, cousin mar- riage rates explain about a third of the variation in provincial blood donations. A doubling of the rate of cousin marriages is associated with a reduction in blood donations of about 8 bags per 1000 people (Figure S5.2). This is substantial given the mean donation across provinces is 28 bags per

121 Figure 4.7: Relationships between regional estimates of cousin marriage and (A) conformity-obedience, (B) individualism-independence, (C) impersonal trust (based on generalized trust) and (D) impersonal fairness (based on generalized fairness). In each panel, solid linear best-fit lines are displayed. Reported are Spearman’s ρ and associated significance levels (one-sided). The shape of each data point indicates the corresponding region’s country (see legend).

1000 people. This effect either holds or increases when our geographic and ecological controls are included in the model, as well as when differences in formal schooling across provinces are con- trolled for (Table S5.1). This relationship is also robust to adding regional fixed effects as well as controls for the historical Kingdoms of Naples and Sicily, so we are not simply capturing a North- South difference. These patterns converge with other recent analyses linking cousin marriage rates in Italy to voter turnout, judicial efficiency, mafia activity and corruption at the provincial level (115, 116). As with UN diplomatic tickets and blood donations in our cross-country analyses, these results establish a real-world complement to the survey-based measures. Finally, confirming the link between Church exposure and kinship intensity, Table S4.6 shows that Church exposure explains about 75% of the variation in cousin marriage across the regions of Turkey, Spain, France, and Italy. If Turkey is dropped, Church exposure still captures about 37% of the variation in cousin marriage. The association between Church exposure and cousin marriage holds when we compare only regions within countries and when controlling for a wide range of

122 Table 4.4: The children of immigrants in Europe. Regression of psychological outcomes on the KII, cousin marriage and Church exposure. OLS regressions of our four ESS-based dependent variables on (mother’s origin country’s) KII, log percent cousin marriage, and Western and Eastern Church exposure (in 100 years). Each column contains results from three separate regressions. An observation is a respondent with an immigrant mother. All regressions control for standard demographic variables (age, age squared, gender), as well as ESS wave, residence-country fixed effects and a set of origin- country controls (absolute latitude, ruggedness, caloric suitability for agriculture and mean distance to waterways). Robust standard errors clustered at the resident-country level are reported in parentheses. ∗P ≤ 0.05, ∗∗P ≤ 0.01, ∗∗∗P ≤ 0.005 (one-sided). geographic and ecological variables as well as our measures of Roman roads and population density in 500 CE.

4.3.3 The children of immigrants For native-born European adults whose parents immigrated into Europe from around the globe— second-generation immigrants—we linked the same four survey-based psychological measures used in the previous section to the KII, cousin marriage, and Church exposure of their parents’ countries of origin (Supplementary 6). Then, by regressing each outcome on these measures of kinship inten- sity and Church exposure while controlling for resident-country fixed effects, we effectively com- pared only individuals who grew up in the same country. By analyzing the psychological impact of kin-based institutions only through second-generation immigrants, we can exclude the causal in- fluence of all factors except those operating through some form of inter-generational transmission; this eliminates many alternative causal pathways that might be responsible for the relationships documented above. Table 4.4 summarizes our baseline models, which link each respondent to his/her mother’s na- tive country, and individual-level demographics and both residence-country and wave fixed effects. The models also include our basic set of four geographic controls for the mother’s native country. All 12 of the key coefficients go in the predicted direction, with less intensive kinship associated with a WEIRDer psychology, and are significant. Given that we are comparing only native-born Europeans from the same country based on the originating countries of their mothers, the effects are large. For example, a one-standard-deviation increase in the KII is associated with a change of between 7% to 13% of a standard deviation across our four psychological measures. Similarly, an additional millennium under the Western Church is associated with a change of between 15% and 28% of a standard deviation across our four psychological measures. Table 4.4’s results are robust to five checks. First, additional controls: to these baseline models, we added continental fixed effects for the mother’s originating country, religious denomination and

123 religiosity and a slate of other individual variables, including marital status, educational attainment, employment and feelings of discrimination (Table S6.1). Second, alternative ancestral linking: in- stead of using the mother’s native country to assign the KII and cousin marriage prevalence variables to respondents, we re-estimated our main regressions using either the father’s country of origin (Ta- ble S6.2) or the average of both parents’ (Table S6.3). Third, two-way clustering: we clustered our standard errors at the level of both country of residence and mother’s country of origin (Table S6.4). Fourth, regional and community size fixed effects (e.g., big city, suburb, farm, etc.): this allowed us to effectively compare only individuals living in the same subnational regions and in similarly sized communities (Table S6.5). Fifth, ancestral links via language: instead of assigning the KII via parental countries of origin, we matched second-generation immigrants to the KII via the languages they reported speaking in their homes (Table S6.7). This allowed us to also include country-of-origin fixed effects (in addition to residence-country fixed effects) in the regressions, and thus to narrow comparisons to people who not only live in the same countries but whose parents came from the same country (but from different ethno-linguistic groups). These supplemental analyses all broadly support the results shown in Table 4.4.

4.4 Discussion

To begin to explain the psychological differences now documented around the globe, we have pro- posed a two-part theory. First, we hypothesize that, in adapting to the social worlds created by intensive kin-based institutions, human psychology shifts in ways that foster greater conformity, obedience and sensitivity to relational contexts but less individualism, analytic thinking and cooper- ation with strangers. Second, to account for part of the variation in kinship intensity, we hypothesize that Western Christianity, beginning around 500 CE, gradually implemented a set of policies about marriage and the family—the MFP—that was a critical contributor to the eventual dissolution of the intensive kin-based institutions of Europe. By 1500 CE, this left many regions of Western Eu- rope dominated by independent, monogamous, nuclear families—a peculiar configuration called the European Marriage Pattern (54, 55, 97). This two-part theory implies that the Church, through the MFP, inadvertently contributed to what psychologists have termed WEIRD psychology. We tested these hypotheses at three levels of analysis. Across countries, our analyses of 16 variables confirm that populations with less intensive kin-based institutions historically are psycho- logically WEIRDer today: they are more individualistic and independent but less nepotistic, con- formist, obedient and holistically-oriented. Socially, populations with weaker kin-based institutions reveal less in-group loyalty, diminished moral particularism and greater trust, fairness and coopera- tion with strangers. Then, zooming in on Europe, by tracking the diffusion of the MFP from 550 to 1500 CE, we show that the longer a regional population was exposed to the Church, the higher their measures of individualism-independence and generalized trust and fairness and the lower their mea- sure of conformity-obedience. Then, by tapping remnants of intensive kinship in Western Europe, we demonstrate that greater exposure to the Church is associated with less cousin marriage in the 20th century, which in turn is associated with stronger individualism, less conformity and greater impersonal prosociality. In Italy, we further demonstrate that higher rates of cousin marriage are as- sociated with fewer voluntary blood donations (a public good). Lastly, by linking second-generation immigrants in Europe back to the places where their parents originated, we demonstrate that the in- fluence of both intensive kinship and Church exposure can still be detected psychologically among the adult children of immigrants living in the same European countries. The psychological variation documented here may arise from the action of a combination of facultative, developmental and cultural-evolutionary mechanisms (37, 117–119) in response to the

124 incentives created by intensive kin-based institutions. It is also possible that stable institutions may favor particular genetic variants that promote success within these institutional environments (120). Our empirical findings are largely agnostic regarding the relevance of these different mechanisms. However, the fact that we can detect the influence of historical kin-based institutions on psychology even decades after those institutions—as practiced on the ground—have disappeared suggests a role for inheritance, either cultural or genetic. But, the fact that we can relate psychological differences among European populations to relatively recent historical events like the MFP, suggests a central role for cultural transmission. While our efforts lay a foundation that links psychological variation to kinship and kinship to the Church’s MFP, more research is needed. Historical shocks or other natural experiments, for example, might be exploited to confirm the causal nature of these relationships. Additionally, archeo-genetic evidence may permit us to observe the impact of the MFP on patterns of relatedness over time in different parts of Medieval Europe while analyses of digitized corpora may allow us to track corresponding shifts in kinship (e.g., via kin terms) or on our psychology. By integrating anthropological, psychological and historical evidence into a unified cultural evolutionary framework, our approach paves a new way of understanding contemporary behavioral variation. This is important because most efforts to understand human behavior presume either that little important psychological variation exists across populations or, if such variation does exists, that it represents merely shallow responses to current material incentives, governmental institutions or ecological conditions (121, 122). Our work, by contrast, suggests that contemporary psycholog- ical patterns, ranging from individualism and trust to conformity and analytic thinking, have been influenced by enduring family structures, particular religious practices and deep cultural evolution- ary processes. Beyond their scientific interest, these insights may have broader implications since some of these psychological differences have previously been deployed to explain global variation in (123–125), strength of formal institutions and corruption (6, 110, 115, 116, 126) and economic prosperity (16, 123, 124, 127, 128).

125 Bibliography

See Online References and Notes for Origins of WEIRD Psychology: https://psyarxiv.com/d6qhu/

126 Appendix A

Keeping It in the Family: Female Inheritance, In-marriage, and the Status of Women

A.1 Empirical data description

Table B.3.1: Description of the data on Ethnographic Atlas societies

Ethnographic Atlas, Cultural Data. Retrieved from D-PLACE [196, 33, 120, 160, 45, 158] Variable name Description Inheritance systems Entry EA074 classifies “the rule or practice governing the dis- position or transmission of a man’s property in land" in the fol- lowing categories: 1- no inheritance of real property (absence of individual property rights in land or of any rule of inheri- tance governing the transmission of such rights); 2- matrilineal by sister’s sons; 3- matrilineal by heirs (who take precedence over sisters’ sons); 4- children, less for daughters; 5- children; 6- patrilineal by heirs (who take precedence over sons); 7- pa- trilineal by sons. Entry EA075, following previous entry, “indicates how real property is distributed among several individuals of the same category" by the following categories: 1- equally distributed; 2- best qualified (adjudged either by the deceased or by his surviv- ing relatives); 3- ultimogeniture (the junior member); 4- primo- geniture (the senior member); 9- no inheritance of real property. For female inclusion and partibility of inheritance, four dummy variables are constructed by the intersection of categories from entry EA074 and EA075. 1- impartible inheritance (categories 2, 3, and 4 of EA075), 2- partible inheritance by males only (in- tersection of category 1 of EA075 and categories 2, 3, 6, and 7 of EA074), 3- partible inheritance by both sexes (intersec- tion of category 1 of EA075 and categories 4 and 5 of entry EA074), and 4- no inheritance of real property (category 1 of entry EA074, or equivalently category 9 of entry EA075).

127 Cousin marriage Entry EA024 classifies the rules governing the marriageabil- ity of a man’s first or second cousins in the following cate- gories: 1- cross-cousin; 2- paternal only; 3-maternal only; 4- fa- ther’s/mother’s brother’s daughter only; 5- father’s/mother’s sis- ter’s daughter only; 6- cross only; 7- no first/second cousins; 8- no first cousins; 9- patrilateral cross only; 10- any first cousins; 11- some second only; 12- only second cousins; 13- any first cousin except lineage mate. The constructed “cousin marriage" variable takes on integer values ranging from 1 to 4, where higher values indicate tighter cousin marriage range; value 1 is assigned for no first/second-cousin marriage al- lowed (category 7), value 2 for second-cousin marriage allowed (categories 8, 11, and 12), value 3 is assigned for first-cousin marriage allowed except with father’s brother’s daughter (cate- gories 1, 3, 5, 6, 9, and 13), value 4 is assigned for cousin mar- riage allowed with father’s brother’s daughter (categories 2, 4, and 10). Category 2 and 4 include 1 and 0 observations respec- tively, while category 10 includes 117 observations. Therefore, value 4 is practically the case if marriage with any first cousins is allowed. Endogamy Entry EA015 classifies “the prevalence of local endogamy, agamy, and exogamy" in the following categories; 1- demes (a marked tendency toward local endogamy); 2- segmented, no ex- ogamy; 3- agamous (without any marked tendency toward ei- ther local exogamy or local endogamy); 4- exogamous; 5- seg- mented, exogamy; 6- clans (each consisting of a single localized exogamous kin group). Following [113], only category 1 is con- sidered as endogamous. The constructed “endogamy" variable takes on integer values ranging from 1 to 3 where higher values indicate higher endogamy; value 1 is assigned for exogamous societies (categories 4, 5, and 6), value 2 is assigned for soci- eties with no marked tendency toward endogamy or exogamy (categories 2 and 3), and value 3 is assigned for endogamous societies (categories 1). Female premarital sex prohibition Entry EA078 classifies “prevailing standards of sex behavior for unmarried women" in the following categories: 1- precluded by early marriage; 2- prohibited, strongly sanctioned; 3- prohib- ited but weakly sanctioned; 4- permitted, sanctioned in the case of pregnancy; 5- trial marriage; 6- permitted, no sanctions. The constructed indicator variable for “female premarital sex pro- hibition" takes value 1 if premarital sex of unmarried women is precluded or prohibited (categories 1, 2, and 3), and 0 other- wise. Female participation in agriculture Entry EA054 classifies “specialization by sex in agriculture" in the following categories: 1- males alone; 2- both, males more; 3- differentiated but equal; 4- equal participation; 5- both, females more; 6- females alone; 7- sex irrelevant; 8- activity present, sex diff. unspecified; 9- activity is absent. Following [16], the con- structed “female participation in agriculture" variable takes on integer values ranging from 1 to 5, where higher values indicate more participation of women in agriculture; value 3 is assigned for both categories 3 and 4; and categories 7, 8, and 9 are con- sidered as missing data.

128 Traditional plough use Entry EA039 “indicates whether or not animals are employed in plow cultivation, and whether plow cultivation is aboriginal or dates to the post-contact period" by the following categories: 1- absent; 2- not aboriginal but present; 3- present. Following [16], the constructed indicator variable for “traditional plough use" takes value 1 if the plough is present (categories 1 and 2), and 0 otherwise. Non-irrigated/Irrigated intensive Entry EA028 classifies “intensity of cultivation" in the fol- agriculture lowing categories: 1- no agriculture, 2- casual; 3- exten- sive/shifting; 4- horticulture; 5- intensive; 6- intensive irrigated. The constructed indicator variable for “non-irrigated intensive agriculture" takes value 1 for category 5, and 0 otherwise. The constructed indicator variable for “irrigated intensive agricul- ture" takes value 1 for category 6, and 0 otherwise. Settlement complexity Entry EA030 classifies “settlement patterns" in the follow- ing categories: 1- nomadic; 2- seminomadic; 3- semiseden- tary; 4- impermanent; 5- dispersed homesteads; 6- hamlets; 7- village/town; 8- complex permanent. Following [16], the con- structed “settlement complexity" variable takes on integer val- ues ranging from 1 to 8, where higher values indicate higher settlement complexity. Political hierarchies Entry EA033 classifies “jurisdictional hierarchy beyond local community" in the following categories: 1- acephalous (e.g., autonomous bands and villages); 2- one level (e.g., petty chief- doms); 3- two levels (e.g., larger ); 4- three levels (e.g., states); 5- four levels (e.g., large states). Following [16], the constructed “political hierarchies" variable takes on integer values ranging from 1 to 5, where higher values indicate more political hierarchies. Presence of large animals Entry EA040 classifies “the predominant type of animals kept" in the following categories: 1- absence or near absence; 2- pigs; 3- sheep/goats; 4- equine; 5- deer; 6- camelids; 7- bovine. Fol- lowing [16], the constructed “presence of large animals" vari- able takes value 1 for categories 2–7 and value 0 for category 1. Year society sampled D-PLACE data are tagged with a “focal year" indicating the year in which Ethnographic Atlas societies were sampled. Focal year is before 1800 for 3% of societies, in the nineteenth century for 25%, between 1900 and 1950 for 69%, and after 1950 for 2%; 1% of the 1,291 societies are missing a focal year. Patrilineal/matrilineal descent Entry EA043 indicates “major mode of descent " in the follow- ing categories: 1- patrilineal; 2- duolateral; 3- matrilineal; 4- quasi-lineages; 5- ambilineal; 6- bilateral; 7- mixed. Following [25], the constructed indicator variable for “matrilineal descent" takes value 1 for category 3, and 0 otherwise. The constructed indicator variable for “patrilineal descent" takes value 1 for cat- egory 1, and 0 otherwise. Bride price/dowry Entry EA006 classifies "prevailing type of transfer or exchange at marriage" in the following categories: 1- bride wealth (or bride price); 2- ; 3- token bride wealth; 4- gift ex- change; 5- woman exchange; 6- insignificant; 7- dowry. Fol- lowing [25], the constructed indicator variable for "bride price" takes value 1 for categories 1, and 0 otherwise. The constructed indicator variable for "dowry" takes value 1 for category 7, and 0 otherwise. Population (in millions) A continuous variable indicating the population of each society as a whole.

129 Proportion of subsistence from Entries EA001, EA002, and EA004 indicate dependence on gathering/hunting/herding gathering, hunting, and animal husbandry respectively. Each en- try takes the values 0 to 9 respectively for 0–5%, 6–15%, 16– 25%, 26–35%, 36–45%, 46–55%, 56–65%, 66–75%, 76–85%, 86–100% dependence on the activity. Following [16], the me- dian value of dependence on the activity is used to construct variables for “prop. of subsist. from gathering", “prop. of sub- sist. from hunting" and “prop. of subsist. from herding". Patrilocal/matrilocal marriages Entry EA011 indicates “the prevailing pattern of transfer of res- idence at marriage" in the following categories: 1- wife to hus- band; 2- ambi/neo-local; 3- husband to wife; 9- separate. The constructed indicator variable for “patrilocal marriages" takes value 1 for category 1, and 0 otherwise. The constructed indi- cator variable for “matrilocal marriages" takes value 1 for cate- gory 3, and 0 otherwise. These variables are equivalent to [16]’s constructed variables based on another entry describing marital residence with kin from which entry EA011 is summarized. Extended/nuclear family Entry EA008 indicates “the prevailing form of domestic or familial organization" in the following categories: 1- nuclear, monogamous; 2- nuclear, limited polygyny; 3- polyandrous; 4- polygyny, atypical cowives pattern; 5- polygyny, typical cowives pattern; 6- minimal extended; 7- small extended; 8- large extended. Following [16], the constructed indicator vari- able for “nuclear family" takes value 1 for categories 1 and 2, and 0 otherwise. The constructed indicator variable for “ex- tended family" takes value 1 for categories 6–8, and 0 other- wise. Ethnographic Atlas, Geographic data Variable name Description Latitude D-PLACE data are tagged with a “revised latitude" indicating the corrected latitude data for Ethnographic Atlas societies. The constructed “latitude" variable is the absolute value of the re- vised latitude. Source: D-PLACE [158]. Mean temperature (in Celsius degrees)/ The means of the entire annual cycles of precipitation and tem- mean precipitation (in meters) perature are constructed for the time period between 1901 and 1950 (the time period in which the vast majority of Ethno- graphic Atlas societies were sampled) based on monthly global maps (0.5 by 0.5 degree cells) obtained from the CRU-TS 3.1 Climate Database, [138]. Source: D-PLACE [158]. Temperature/precipitation predictability The predictability measures of the entire annual cycles of pre- cipitation and temperature are constructed for the time pe- riod between 1901 and 1950 based on monthly global maps (0.5 by 0.5 degree cells) obtained from the CRU-TS 3.1 Cli- mate Database, [138]. Predictability was measured via [65]’s Constancy, Contingency and Predictability indexes. These in- dexes capture the extent to which yearly cycles vary among years in terms of onset, intensity and duration, ranging from 0 (completely unpredictable) to 1 (fully predictable). Source: D-PLACE [158]. Tropical climate The indicator variable for “tropical climate" takes value 1 if the point location of an Ethnographic Atlas society is classified as being either tropical or subtropical, and 0 otherwise. The data is constructed based on Thermal Climate Zones of the World, a global raster datalayer with a resolution of 5 arc-minutes, with each pixel containing a class value for the dominant thermal cli- mate found in the pixel. Source: FAO’s Food Insecurity, Poverty and Environment Global GIS Database (FGGD).

130 Suitability for agriculture Suitability for agriculture represents the fraction of each grid cell that is suitable to be used for agriculture. It is based on the temperature and soil conditions of each grid cell. The data is constructed based on the global map (0.5 by 0.5 degree cells) obtained from Suitability for Agriculture. Source: Atlas of the Biosphere, [217]. Distance to coast Distance of point locations of Ethnographic Atlas societies (in 100 kilometers) from the coast is constructed based on the coastline defined in the full-resolution Global Self-consistent, Hierarchical, High- resolution Geography Database. Source: D-PLACE [158]. Slope Mean incline in the terrain (unit of sample 0.5 by 0.5 degree (in degrees) cell) is constructed based on topographical data provided by the GTOPO30 data set. Source: D-PLACE [158]. Elevation Elevation is constructed based on the global map (30 by 30 arc- (in 100 meters) second cells) obtained from the Global 30 Arc-Second Eleva- tion data set. For the consistency of data used for Ethnographic Atlas societies, I have aggregated the elevation data (waters ex- cluded, and by taking mean) to a 0.5 by 0.5 degree resolution. Source: GTOPO30 data set. Ruggedness Ruggedness measures the elevation distance of each grid cell (in 100 meters) and its neighbors. The data is constructed based on the global map (30 by 30 arc-second cells) obtained from Grid-cell-level Data on Terrain Ruggedness. For the consistency of data used for Ethnographic Atlas societies, I have aggregated the Terrain Ruggedness data (waters excluded, and by taking mean) to a 0.5 by 0.5 degree resolution. Source: [204].

131 Table B.3.2: Description of the data on Italian provinces

Italian provinces Variable name Description Inheritance systems Compared to the Ethnographic Atlas, [248] provides more de- tailed data on inheritance systems of European societies. To ob- tain data on traditional inheritance systems for Western Europe (including Italy), [248] first used censuses from Western Euro- pean countries in the 1950s and 1960s while checking whether the patterns that he uncovered were reflected in historical mono- graphs. Some of these monographs go back more than 500 years, which suggests that the patterns have been lasting, stable, and persistent [78]. According to [248], inheritance systems of the traditional family types in Italy can be described as follow- ing. 1- Communitarian family type. Most common in central Italy, is an extended family in which all sons can get married, bring their wives to the family home and cohabit with their parents. In the communitarian family, land was communally owned, which can be characterized by equality among brothers in inheritance—as described by [248]—or absence of private property rights—as described in the Ethnographic Atlas. 2- Egalitarian nuclear family type. Common in south and north- west Italy, in which inheritance is equally divided among chil- dren. According to case studies and ethnographic evidence, women were likely to inherit land in regions with the egalitar- ian nuclear family type, at least in southern Italy [see, e.g., the Ethnographic Atlas for Neapolitan society, and 109]. 3- Incomplete stem family type. Most common in northeast Italy, is a mixture of complete stem and egalitarian nuclear fam- ily types. In the complete stem family type, inheritance is im- partible, and one child—generally the eldest son—marries and has children who remain in the household and inherit the house and the land. The incomplete stem family is the same as the stem family, but with more egalitarian inheritance rules. GDP per capita For the period 1963–1964, the average of region-level value- added per capita is used. Italy has 20 regions, and provinces in the same region take the same value for GDP per capita. Source: [72]. For the period 2000–2013, the average of province-level value-added per capita is used. Source: ISTAT. Share of agriculture For the period 1963–1964, the average of region-level share of agriculture in total value-added is used. Italy has 20 regions, and provinces in the same region take the same value for share of agriculture. Source: [72]. For the period 2000–2013, the av- erage of province-level share of agriculture in total value-added is used. Source: ISTAT. South and islands Dummy variable for South and Islands (one of four principal geographical areas in Italy as defined by the Italian National Institute of Statistics) including provinces in south (Abruzzo, Molise, Campania, Puglia, Basilicata, and Calabria regions) and islands (Sicilia and Sardegna regions). Source: ISTAT. Papal States Territories of the Papal States ruled over by the pope at the be- ginning of the fourteenth century with the current provincial borders, including provinces in Lazio, Umbria, and Marche re- gions. Source: [214].

132 Cousin marriage rates The data reported by [57] on cousin marriages, for five-year periods from 1910 to 1964, comes from the Vatican’s Secret Archives in which requests for dispensations from the cousin marriage impediment, sent by the Bishops to the Sacred Con- gregation of the Sacraments in Rome, were recorded with in- formation of the name of the diocese of marriage, the date, and the degree of relationship between the spouses. They grouped 280 Italian dioceses into the provinces present in year 1961, for which the number of total marriages was available from year to year. The cousin marriage data is available in all re- ported provinces only for five-year periods from 1945 to 1964. Four major degrees of consanguineous marriage are reported in the data; uncle-niece/aunt-nephew, first cousins, first cousins once removed, second cousins. However, the bishops of the is- lands (Sardinia and Sicily) were granted the privilege to ac- cord the dispensation for consanguineous unions beyond de- gree III, i.e., first cousins once removed and second cousins were not recorded in the Vatican Archives, and therefore are not reported for Sicily and are obtained from another source for Sardinia. Thus, only uncle-niece/aunt-nephew and first-cousin unions are considered to compute cousin marriage rates for Ital- ian provinces. The available economic and demographic data on Italian provinces are for the current borders, and for 1960s. Therefore, for newly created provinces, the cousin marriage rate of the province to which they belonged to in year 1961 is as- signed. Cousin marriage rates are the average of the last five- year period (1960–1964). Source: [57]. Latitude The absolute value of the latitude in the geographic center of each province. Geographic centroids of Italian provinces are generated using polygons map of Italian administrative divi- sions. Source: GADM database of Global Administrative Ar- eas. Mean temperature (in Celsius degrees)/ The average of mean temperature/precipitation in the geograph- mean precipitation (in meters) ical area of each province. The means of the entire annual cycles of precipitation and temperature are constructed for the time pe- riod between 1901 and 2014 based on monthly global maps (0.5 by 0.5 degree cells). Source: CRU-TS 3.23 Climate Database, [138]. Tropical climate The indicator variable for “tropical climate" takes value 1 if the geographic center of a province is classified as being either tropical or subtropical, and 0 otherwise. The data is constructed based on Thermal Climate Zones of the World, a global raster datalayer with a resolution of 5 arc-minutes, with each pixel containing a class value for the dominant thermal climate found in the pixel. Source: FAO’s Food Insecurity, Poverty and Envi- ronment Global GIS Database (FGGD). Suitability for agriculture The average of suitability for agriculture in the geographical area of each province. Suitability for agriculture represents the fraction of each grid cell that is suitable to be used for agricul- ture. It is based on the temperature and soil conditions of each grid cell. The data is constructed based on the global map (0.5 by 0.5 degree cells) obtained from Suitability for Agriculture. Source: Atlas of the Biosphere, [217]. Distance to coast Distance of the geographic center of each province from the (in kilometers) coast is constructed based on a coastline physical vector map in 1:10m resolution. Retrieved on October 3, 2016. Source: Natu- ral Earth.

133 Slope The average of slope in the geographical area of each province. (in percent) Slope measures the mean uphill slope percentage between each grid cell and its neighbors. The data is constructed based on the global map (30 by 30 arc-second cells) obtained from Grid-cell- level Data on Terrain Ruggedness. Source: [204]. Ruggedness The average of ruggedness in the geographical area of each (in 100 meters) province. Ruggedness measures the elevation distance of each grid cell and its neighbors. The data is constructed based on the global map (30 by 30 arc-second cells) obtained from Grid-cell- level Data on Terrain Ruggedness. Source: [204]. Elevation The average of elevation in the geographical area of each (in 100 meters) province. Elevation is constructed based on the global map (30 by 30 arc-second cells) obtained from Global 30 Arc-Second Elevation data set. Source: GTOPO30 data set.

134 Table B.3.3: Ethnic groups in IFLS sample, and ethnographic records on the inheritance of real property.

Ethnicity Female Ethnographic Atlas Ethnic Groups of Insular Southeast Asia inheritance Jawa 1 Sons and daughters Cultivable land, fruit trees, and domestic animals are inherited in equal shares by children of both sexes (p.51). Sunda 1 No match Cultivable lands are divided equally among the children of both sexes (p.56). Bali 0 Sons only One son only fully succeeds the father as head of the houseyard [. . . ] The remainder of the father’s property, especially his land, is divided equally among all sons (p.63). Batak 0 Sons only Patrilineal with preference to eldest and youngest sons (p.21). Bugis . No match Missing Tionghoa . No match Missing Madura . No match Missing Sasak 0 No match Land and buildings are inherited by males, while jewelry and households furnishings are passed down in the female line (p.67). Minang 1 Absence of private Ancestral property is considered uninheritable, since it belongs property to a corporate group, the matrilineage, which persists through time. Use rights to the main forms of ancestral property—land and houses—generally are divided among the senior women of a matrilineage and pass from mother to daughter (p.26). Banjar . No match No match Bima- 1 No match A man’s widow and his children, including his daughters, share Dompu his inheritance equally among them (p.70). Makassar . Missing Missing Nias 0 No match Sons receive the bulk of the inheritance, including real property, with the first-born having first claim (p.40). Palembang . No match No match Sumbawa 1 Missing A man’s widow and his children, including his daughters, share his inheritance equally among them (p.70). Toraja 1 Sons and daughters Sons and daughters have an equal chance to inherit their par- ents’ property (p.135). Betawi . No match No match Dayak . No match Missing Melayu- . No match No match Deli Komering . No match No match Ambon 0 Sons only Parcels of agricultural land are owned or controlled by clans [. . . ] Membership is restricted to male and unmarried female descendants of original or founder clans (p.117). Manado 1 No match Inheritance is equally divided among all children (p.126). Aceh 1 No match Inheritance follows Islamic law [. . . ] modified by the practice of giving women houses at the time of marriage. The result is to give the bulk of village land resources to the men. However, regardless of the ownership of the land, control is generally in the hands of the women, since the men are so often away (p.18).

135 A.2 Additional figures and regressions

Figure A.2.1: Inheritance systems in Western Europe illustrated by [245] (left) and [248] (right). In [248]’s map, absolute nuclear and stem family types are associated with patrilineal impartible inheritance (usually primogeniture); the incomplete stem family type is the same as the stem family, but with more egalitarian inheritance rules (in principle, but rarely in practice); the communitarian family type is associated with partible inheritance by sons only; and finally the egalitarian Nuclear family type is associated with partible inheritance. [248]’s map is taken from [78].

136 Table A.2.1: Descriptive statistics of full sample.

VARIABLES N mean sd Cousin marriage 1,042 2.232 0.973 Endogamy 1,102 1.747 0.583 Female premarital sex prohibition 598 0.537 0.499 Polygyny 1,257 0.469 0.499 Female participation in agriculture 735 2.970 1.053 Impartible inheritance 820 0.354 0.478 Partible inheritance by males only 820 0.270 0.444 Partible inheritance by both sexes 820 0.105 0.307 Partible inheritance 820 0.374 0.484 Absence of private property 820 0.272 0.445 Traditional plough use 1,182 0.149 0.356 Non-irrigated intensive agriculture 1,188 0.174 0.379 Irrigated intensive agriculture 1,188 0.106 0.308 Settlement complexity 1,187 5.116 2.218 Political hierarchies 1,155 1.944 1.106 Presence of large animals 1,182 0.727 0.446 Year society sampled 1,283 1,891 195 Patrilineal descent 1,274 0.463 0.499 Matrilineal descent 1,274 0.126 0.332 Dowry 1,272 0.034 0.181 Bride price 1,272 0.517 0.500 Latitude 1,291 21.193 17.376 Mean temperature 1,291 18.470 8.784 Temperature predictability 1,291 0.705 0.102 Mean precipitation 1,291 114.440 71.810 Precipitation predictability 1,291 0.625 0.097 Tropical climate 1,291 0.832 0.374 Suitability for agriculture 1,291 0.315 0.307 Distance to coast 1,291 4.370 4.308 Slope 1,291 2.117 2.537 Ruggedness 1,291 1.187 1.423 Elevation 1,291 6.850 6.936 Population 953 0.434 3.203 Prop. of subsist. from herding 1,290 16.747 17.344 Prop. of subsist. from hunting 1,290 15.500 15.054 Prop. of subsist. from gathering 1,290 11.743 15.098 Patrilocal marriages 1,267 0.680 0.467 Matrilocal marriages 1,267 0.081 0.273 Nuclear family structure 1,263 0.302 0.460 Extended family structure 1,263 0.473 0.499

137 Table A.2.2: Descriptive statistics for regression analysis of cousin marriage.

VARIABLES N mean sd Cousin marriage 651 2.220 0.990 Impartible inheritance 651 0.338 0.473 Partible inheritance by males only 651 0.252 0.434 Partible inheritance by both sexes 651 0.104 0.306 Absence of private property 651 0.306 0.461 Traditional plough use 651 0.178 0.383 Non-irrigated intensive agriculture 651 0.203 0.402 Irrigated intensive agriculture 651 0.114 0.318 Settlement complexity 651 5.026 2.272 Political hierarchies 651 2.034 1.171 Presence of large animals 651 0.765 0.424 Year society sampled 651 1,893 189 Patrilineal descent 651 0.465 0.499 Matrilineal descent 651 0.106 0.308 Dowry 651 0.043 0.203 Bride price 651 0.522 0.500 Latitude 651 22.387 17.705 Mean temperature 651 17.958 9.204 Temperature predictability 651 0.698 0.102 Mean precipitation 651 108.742 70.227 Precipitation predictability 651 0.616 0.097 Tropical climate 651 0.820 0.384 Suitability for agriculture 651 0.314 0.300 Distance to coast 651 4.219 4.132 Slope 651 2.206 2.656 Ruggedness 651 1.226 1.438 Elevation 651 7.147 7.249 Population 511 0.585 4.109 Prop. of subsist. from herding 651 17.566 17.742 Prop. of subsist. from hunting 651 16.340 17.047 Prop. of subsist. from gathering 651 12.288 15.554 Patrilocal marriages 647 0.699 0.459 Matrilocal marriages 647 0.057 0.232 Nuclear family structure 646 0.294 0.456 Extended family structure 646 0.481 0.500

138 Table A.2.3: Regression analysis of cousin marriage.

VARIABLES (1) (2) (3) (4) (5) (6) Partible inheritance by males only 0.046 0.145 0.154 0.160* 0.143 0.173 (0.098) (0.096) (0.107) (0.095) (0.097) (0.109) Partible inheritance by both sexes 0.284* 0.376** 0.363** 0.348** 0.435*** 0.427** (0.151) (0.150) (0.166) (0.150) (0.151) (0.165) Absence of private property 0.191 0.319** 0.426*** 0.348*** 0.368*** 0.508*** (0.130) (0.134) (0.157) (0.133) (0.135) (0.160) Traditional plough use 0.217 0.356** 0.267 0.251 0.365** 0.232 (0.135) (0.154) (0.179) (0.156) (0.152) (0.175) Non-irrigated intensive agriculture -0.247** -0.300** -0.269** -0.250** -0.332*** (0.113) (0.123) (0.116) (0.112) (0.124) Irrigated intensive agriculture 0.221 0.248 0.247* 0.222 0.261* (0.148) (0.163) (0.143) (0.147) (0.157) Settlement complexity 0.020 -0.007 -0.010 0.036 -0.010 0.027 (0.024) (0.026) (0.029) (0.032) (0.026) (0.037) Political hierarchies 0.106** 0.125*** 0.149*** 0.091** 0.121*** 0.115** (0.044) (0.045) (0.050) (0.044) (0.045) (0.050) Presence of large animals 0.218* 0.141 0.192 -0.136 0.176 0.003 (0.118) (0.122) (0.139) (0.130) (0.123) (0.153) Year society sampled -0.000*** -0.000 -0.000*** -0.000*** -0.000** (0.000) (0.000) (0.000) (0.000) (0.000) Patrilineal descent 0.430*** 0.370*** 0.367*** 0.360*** 0.250** (0.095) (0.108) (0.096) (0.104) (0.117) Matrilineal descent 0.657*** 0.624*** 0.667*** 0.640*** 0.606*** (0.113) (0.136) (0.111) (0.117) (0.140) Dowry -0.365* -0.312 -0.377* -0.375* -0.364 (0.208) (0.232) (0.213) (0.216) (0.244) Bride price 0.005 0.057 -0.071 -0.018 -0.011 (0.086) (0.100) (0.088) (0.088) (0.103) Latitude 0.011 0.015 0.013 0.010 0.016 (0.009) (0.010) (0.009) (0.009) (0.011) Mean temperature -0.010 -0.008 -0.002 -0.010 -0.003 (0.013) (0.015) (0.014) (0.014) (0.016) Temperature predictability 3.733*** 4.380*** 3.141** 3.646** 3.725** (1.420) (1.652) (1.440) (1.454) (1.718) Mean precipitation -0.002*** -0.003*** -0.002** -0.003*** -0.002** (0.001) (0.001) (0.001) (0.001) (0.001) Precipitation predictability 0.352 0.666 0.406 0.401 0.900 (0.690) (0.816) (0.678) (0.694) (0.822) Tropical climate 0.382*** 0.338* 0.523** 0.359* 0.352* 0.557** (0.118) (0.186) (0.220) (0.190) (0.188) (0.225) Suitability for agriculture -0.348*** -0.199 -0.187 -0.162 -0.165 -0.162 (0.131) (0.129) (0.146) (0.128) (0.130) (0.145) Distance to coast 0.010 0.023 0.008 0.009 0.023 (0.012) (0.015) (0.013) (0.013) (0.015) Slope 0.041 0.034 0.041 0.038 0.030 (0.035) (0.040) (0.035) (0.035) (0.041) Ruggedness -0.026 0.019 -0.040 -0.028 -0.001 (0.065) (0.074) (0.064) (0.065) (0.073) Elevation -0.029*** -0.031*** -0.026*** -0.027*** -0.028*** (0.009) (0.010) (0.009) (0.009) (0.010) Population 0.009 0.010 (0.007) (0.006) Prop. of subsist. from herding 0.014*** 0.013*** (0.004) (0.004) Prop. of subsist. from hunting 0.005 0.004 (0.004) (0.004) Prop. of subsist. from gathering -0.006 -0.002 (0.004) (0.004) Patrilocal marriages 0.222** 0.274** (0.108) (0.125) Matrilocal marriages 0.057 0.164 (0.148) (0.163) Nuclear family structure 0.066 0.029 (0.103) (0.118) Extended family structure 0.051 0.013 (0.095) (0.106) Constant 1.392*** -0.681 -2.070 -0.599 -0.768 -2.162 (0.209) (1.236) (1.462) (1.285) (1.256) (1.525)

Observations 651 651 511 651 642 503 R-squared 0.088 0.218 0.234 0.253 0.231 0.280 Adjusted R-squared 0.076 0.186 0.193 0.219 0.195 0.229 Robust standard errors in parentheses. *** p<0.01, ** p<0.05, * p<0.1

139 Table A.2.4: Descriptive statistics for regression analysis of endogamy.

VARIABLES N mean sd Endogamy 686 1.738 0.580 Impartible inheritance 686 0.341 0.474 Partible inheritance by males only 686 0.248 0.432 Partible inheritance by both sexes 686 0.111 0.314 Absence of private property 686 0.300 0.459 Traditional plough use 686 0.181 0.385 Non-irrigated intensive agriculture 686 0.207 0.405 Irrigated intensive agriculture 686 0.114 0.318 Settlement complexity 686 5.080 2.275 Political hierarchies 686 2.032 1.177 Presence of large animals 686 0.770 0.421 Year society sampled 686 1,899 138 Patrilineal descent 686 0.474 0.500 Matrilineal descent 686 0.101 0.301 Dowry 686 0.050 0.217 Bride price 686 0.525 0.500 Latitude 686 22.520 18.313 Mean temperature 686 17.717 9.384 Temperature predictability 686 0.697 0.103 Mean precipitation 686 110.481 69.474 Precipitation predictability 686 0.623 0.099 Tropical climate 686 0.808 0.394 Suitability for agriculture 686 0.321 0.301 Distance to coast 686 4.424 4.291 Slope 686 2.218 2.652 Ruggedness 686 1.233 1.466 Elevation 686 7.086 7.183 Population 536 0.562 4.013 Prop. of subsist. from herding 686 17.480 17.507 Prop. of subsist. from hunting 686 16.462 16.848 Prop. of subsist. from gathering 686 11.879 15.187 Patrilocal marriages 680 0.704 0.457 Matrilocal marriages 680 0.057 0.233 Nuclear family structure 681 0.289 0.454 Extended family structure 681 0.490 0.500

140 Table A.2.5: Regression analysis of endogamy.

VARIABLES (1) (2) (3) (4) (5) (6) Partible inheritance by males only 0.011 0.013 -0.005 0.013 0.013 -0.005 (0.056) (0.056) (0.061) (0.056) (0.056) (0.062) Partible inheritance by both sexes 0.365*** 0.233*** 0.285*** 0.230*** 0.171** 0.213** (0.078) (0.079) (0.085) (0.079) (0.080) (0.087) Absence of private property 0.071 -0.022 -0.022 -0.003 -0.060 -0.039 (0.074) (0.078) (0.087) (0.079) (0.080) (0.089) Traditional plough use 0.138* 0.009 -0.054 -0.011 0.002 -0.083 (0.075) (0.087) (0.108) (0.088) (0.088) (0.109) Non-irrigated intensive agriculture -0.048 0.004 -0.065 -0.053 -0.003 (0.067) (0.077) (0.069) (0.068) (0.078) Irrigated intensive agriculture 0.109 0.163* 0.098 0.078 0.128 (0.084) (0.093) (0.083) (0.086) (0.096) Settlement complexity 0.023 0.035** 0.054*** 0.032* 0.035** 0.049** (0.014) (0.015) (0.016) (0.017) (0.016) (0.019) Political hierarchies 0.023 0.010 0.015 0.002 0.021 0.020 (0.025) (0.027) (0.029) (0.027) (0.028) (0.030) Presence of large animals -0.098 0.087 0.049 0.038 0.079 0.013 (0.067) (0.075) (0.088) (0.079) (0.076) (0.098) Year society sampled 0.000 -0.000 0.000 0.000 -0.000 (0.000) (0.000) (0.000) (0.000) (0.000) Patrilineal descent -0.195*** -0.235*** -0.206*** -0.133** -0.169** (0.056) (0.062) (0.056) (0.059) (0.068) Matrilineal descent -0.053 0.050 -0.057 -0.089 -0.001 (0.075) (0.082) (0.074) (0.078) (0.087) Dowry -0.005 -0.072 -0.012 -0.001 -0.047 (0.102) (0.117) (0.101) (0.098) (0.111) Bride price -0.107** -0.080 -0.118** -0.073 -0.054 (0.052) (0.060) (0.054) (0.053) (0.061) Latitude 0.006 0.009 0.006 0.005 0.009 (0.005) (0.006) (0.005) (0.005) (0.006) Mean temperature 0.004 0.002 0.007 0.004 0.005 (0.008) (0.009) (0.008) (0.008) (0.010) Temperature predictability -0.573 -0.405 -0.837 -0.495 -0.444 (0.760) (0.894) (0.787) (0.768) (0.947) Mean precipitation -0.001 -0.000 -0.000 -0.001 -0.001 (0.001) (0.001) (0.001) (0.001) (0.001) Precipitation predictability 0.041 0.062 0.056 0.084 0.042 (0.382) (0.424) (0.393) (0.385) (0.447) Tropical climate -0.063 0.169 0.268** 0.175 0.183 0.300** (0.063) (0.119) (0.135) (0.123) (0.118) (0.142) Suitability for agriculture 0.022 0.025 0.061 0.037 0.016 0.057 (0.076) (0.075) (0.086) (0.076) (0.075) (0.086) Distance to coast -0.010 -0.011 -0.010 -0.010 -0.011 (0.006) (0.008) (0.007) (0.007) (0.008) Slope 0.008 0.006 0.009 0.008 0.012 (0.019) (0.022) (0.020) (0.020) (0.023) Ruggedness -0.025 -0.026 -0.030 -0.028 -0.036 (0.034) (0.038) (0.035) (0.035) (0.041) Elevation 0.002 0.001 0.004 0.003 0.003 (0.005) (0.006) (0.005) (0.005) (0.006) Population -0.004 -0.002 (0.009) (0.009) Prop. of subsist. from herding 0.001 -0.000 (0.002) (0.002) Prop. of subsist. from hunting -0.001 -0.001 (0.002) (0.003) Prop. of subsist. from gathering -0.002 -0.003 (0.002) (0.003) Patrilocal marriages -0.146** -0.186** (0.062) (0.075) Matrilocal marriages 0.113 0.132 (0.108) (0.120) Nuclear family structure 0.155** 0.113 (0.067) (0.074) Extended family structure 0.053 0.025 (0.060) (0.066) Constant 1.604*** 1.611** 1.418 1.859*** 1.506** 1.823 (0.112) (0.680) (1.150) (0.715) (0.693) (1.255)

Observations 686 686 536 686 675 526 R-squared 0.067 0.144 0.199 0.148 0.166 0.223 Adjusted R-squared 0.0545 0.112 0.158 0.112 0.128 0.171 Robust standard errors in parentheses. *** p<0.01, ** p<0.05, * p<0.1

141 Table A.2.6: Descriptive statistics for regression analysis of female premarital sex prohibition.

VARIABLES N mean sd Female premarital sex prohibition 427 0.536 0.499 Impartible inheritance 427 0.307 0.462 Partible inheritance by males only 427 0.244 0.430 Partible inheritance by both sexes 427 0.105 0.307 Absence of private property 427 0.344 0.476 Traditional plough use 427 0.204 0.403 Non-irrigated intensive agriculture 427 0.201 0.402 Irrigated intensive agriculture 427 0.124 0.330 Settlement complexity 427 4.953 2.329 Political hierarchies 427 2.094 1.245 Presence of large animals 427 0.756 0.430 Year society sampled 427 1,881 286 Patrilineal descent 427 0.431 0.496 Matrilineal descent 427 0.080 0.271 Dowry 427 0.054 0.226 Bride price 427 0.473 0.500 Latitude 427 24.741 19.504 Mean temperature 427 16.588 10.230 Temperature predictability 427 0.688 0.110 Mean precipitation 427 110.641 72.582 Precipitation predictability 427 0.626 0.097 Tropical climate 427 0.759 0.428 Suitability for agriculture 427 0.314 0.307 Distance to coast 427 4.331 4.263 Slope 427 2.286 2.695 Ruggedness 427 1.221 1.429 Elevation 427 7.133 7.195 Population 330 0.764 5.073 Prop. of subsist. from herding 427 17.025 17.735 Prop. of subsist. from hunting 427 17.329 18.396 Prop. of subsist. from gathering 427 12.107 14.701 Patrilocal marriages 423 0.662 0.474 Matrilocal marriages 423 0.052 0.222 Nuclear family structure 424 0.330 0.471 Extended family structure 424 0.450 0.498

142 Table A.2.7: Regression analysis of female premarital sex prohibition.

VARIABLES (1) (2) (3) (4) (5) (6) Partible inheritance by males only 0.032 0.005 0.024 -0.002 0.010 0.024 (0.062) (0.064) (0.074) (0.065) (0.066) (0.077) Partible inheritance by both sexes 0.258*** 0.238*** 0.243** 0.241*** 0.253*** 0.270** (0.081) (0.083) (0.104) (0.084) (0.083) (0.104) Absence of private property 0.152** 0.117 0.168* 0.110 0.133 0.185* (0.076) (0.081) (0.096) (0.084) (0.081) (0.100) Traditional plough use 0.184*** 0.123 0.146 0.144* 0.145* 0.201* (0.071) (0.080) (0.106) (0.080) (0.081) (0.108) Non-irrigated intensive agriculture -0.005 -0.024 -0.001 0.006 -0.006 (0.081) (0.098) (0.082) (0.082) (0.100) Irrigated intensive agriculture -0.037 -0.094 -0.037 -0.032 -0.079 (0.088) (0.108) (0.089) (0.089) (0.109) Settlement complexity -0.035*** -0.025 -0.014 -0.030 -0.021 -0.016 (0.013) (0.015) (0.018) (0.020) (0.016) (0.022) Political hierarchies 0.084*** 0.085*** 0.081** 0.096*** 0.080*** 0.084** (0.025) (0.028) (0.034) (0.028) (0.028) (0.034) Presence of large animals -0.135* -0.144* -0.185** -0.084 -0.151** -0.137 (0.069) (0.075) (0.087) (0.082) (0.076) (0.097) Year society sampled -0.000 0.000*** 0.000 -0.000 0.000*** (0.000) (0.000) (0.000) (0.000) (0.000) Patrilineal descent 0.038 0.023 0.062 0.011 0.018 (0.063) (0.074) (0.063) (0.067) (0.078) Matrilineal descent -0.112 -0.113 -0.114 -0.142 -0.129 (0.086) (0.099) (0.086) (0.095) (0.112) Dowry 0.026 0.113 0.029 0.003 0.090 (0.121) (0.144) (0.119) (0.121) (0.145) Bride price -0.007 0.033 0.008 -0.002 0.048 (0.058) (0.066) (0.058) (0.059) (0.066) Latitude -0.002 -0.000 -0.003 -0.000 0.003 (0.005) (0.006) (0.005) (0.005) (0.006) Mean temperature 0.003 0.004 0.001 0.004 0.003 (0.008) (0.010) (0.008) (0.008) (0.010) Temperature predictability -0.969 -0.986 -0.797 -0.720 -0.443 (0.798) (0.916) (0.829) (0.804) (0.958) Mean precipitation 0.001 0.001 0.000 0.001 0.000 (0.001) (0.001) (0.001) (0.001) (0.001) Precipitation predictability -0.686* -0.476 -0.701* -0.733* -0.623 (0.378) (0.442) (0.380) (0.378) (0.459) Tropical climate 0.068 0.037 0.094 0.034 0.036 0.091 (0.061) (0.114) (0.150) (0.116) (0.114) (0.156) Suitability for agriculture 0.300*** 0.276*** 0.274*** 0.264*** 0.276*** 0.278*** (0.077) (0.085) (0.100) (0.086) (0.086) (0.102) Distance to coast 0.002 0.004 0.002 0.001 0.003 (0.007) (0.009) (0.007) (0.007) (0.009) Slope 0.014 0.006 0.013 0.018 0.013 (0.023) (0.025) (0.023) (0.024) (0.027) Ruggedness 0.013 0.023 0.017 -0.006 -0.004 (0.040) (0.044) (0.040) (0.041) (0.047) Elevation -0.004 -0.002 -0.005 -0.002 -0.000 (0.006) (0.007) (0.006) (0.006) (0.007) Population 0.007** 0.006* (0.003) (0.003) Prop. of subsist. from herding -0.003 -0.003 (0.002) (0.003) Prop. of subsist. from hunting -0.001 -0.001 (0.002) (0.003) Prop. of subsist. from gathering 0.002 0.001 (0.003) (0.003) Patrilocal marriages 0.059 0.038 (0.063) (0.076) Matrilocal marriages 0.116 0.125 (0.117) (0.144) Nuclear family structure -0.047 -0.179** (0.074) (0.086) Extended family structure -0.087 -0.133* (0.067) (0.078) Constant 0.364*** 1.392** 0.763 1.334* 1.206* 0.530 (0.112) (0.647) (0.769) (0.683) (0.646) (0.813)

Observations 427 427 330 427 420 323 R-squared 0.138 0.171 0.172 0.180 0.177 0.193 Adjusted R-squared 0.119 0.119 0.101 0.122 0.116 0.101 Robust standard errors in parentheses. *** p<0.01, ** p<0.05, * p<0.1

143 Table A.2.8: Descriptive statistics for regression analysis of female participation in agriculture.

VARIABLES N mean sd Female participation in agriculture 489 2.881 0.985 Impartible inheritance 489 0.407 0.492 Partible inheritance by males only 489 0.325 0.469 Partible inheritance by both sexes 489 0.133 0.340 Absence of private property 489 0.135 0.342 Traditional plough use 489 0.252 0.434 Non-irrigated intensive agriculture 489 0.280 0.450 Irrigated intensive agriculture 489 0.155 0.363 Settlement complexity 489 5.980 1.581 Political hierarchies 489 2.344 1.235 Presence of large animals 489 0.908 0.289 Year society sampled 489 1,892 270 Patrilineal descent 489 0.552 0.498 Matrilineal descent 489 0.100 0.301 Dowry 489 0.063 0.244 Bride price 489 0.583 0.494 Latitude 489 16.947 14.591 Mean temperature 489 20.484 6.456 Temperature predictability 489 0.729 0.083 Mean precipitation 489 120.604 72.430 Precipitation predictability 489 0.626 0.095 Tropical climate 489 0.904 0.295 Suitability for agriculture 489 0.369 0.307 Distance to coast 489 4.675 4.397 Slope 489 2.042 2.538 Ruggedness 489 1.151 1.382 Elevation 489 7.245 7.740 Population 382 0.784 4.737 Prop. of subsist. from herding 489 19.788 13.977 Prop. of subsist. from hunting 489 10.042 9.414 Prop. of subsist. from gathering 489 7.138 9.821 Patrilocal marriages 487 0.760 0.428 Matrilocal marriages 487 0.060 0.237 Nuclear family structure 488 0.275 0.447 Extended family structure 488 0.482 0.500

144 Table A.2.9: Regression analysis of female participation in agriculture.

VARIABLES (1) (2) (3) (4) (5) (6) Partible inheritance by males only -0.244** -0.190** -0.207** -0.174* -0.172* -0.170* (0.097) (0.092) (0.100) (0.089) (0.091) (0.095) Partible inheritance by both sexes -0.570*** -0.278** -0.342** -0.259** -0.245* -0.301** (0.128) (0.129) (0.142) (0.127) (0.133) (0.141) Absence of private property 0.090 0.172 0.179 0.127 0.263* 0.199 (0.146) (0.144) (0.170) (0.133) (0.143) (0.150) Traditional plough use -0.615*** -0.397*** -0.304* -0.306** -0.315** -0.145 (0.119) (0.146) (0.173) (0.144) (0.145) (0.166) Non-irrigated intensive agriculture -0.217* -0.260** -0.080 -0.216** -0.104 (0.112) (0.125) (0.110) (0.109) (0.121) Irrigated intensive agriculture -0.290** -0.269 -0.188 -0.242* -0.087 (0.144) (0.168) (0.143) (0.142) (0.158) Settlement complexity -0.010 -0.016 -0.027 -0.045 -0.002 -0.041 (0.033) (0.034) (0.039) (0.033) (0.035) (0.039) Political hierarchies -0.038 0.013 -0.000 0.041 -0.006 0.009 (0.040) (0.042) (0.047) (0.041) (0.041) (0.045) Presence of large animals 0.367** -0.028 -0.127 0.204 -0.019 0.148 (0.157) (0.152) (0.170) (0.148) (0.150) (0.164) Year society sampled 0.000 0.000 0.000* 0.000 0.000 (0.000) (0.000) (0.000) (0.000) (0.000) Patrilineal descent 0.176* 0.080 0.210** 0.156 0.067 (0.105) (0.115) (0.101) (0.110) (0.112) Matrilineal descent 0.403*** 0.255 0.375*** 0.344** 0.157 (0.142) (0.170) (0.130) (0.141) (0.157) Dowry 0.386** 0.345 0.478*** 0.343** 0.417** (0.173) (0.215) (0.171) (0.171) (0.210) Bride price 0.219** 0.206** 0.337*** 0.193** 0.274*** (0.094) (0.105) (0.090) (0.097) (0.100) Latitude -0.034*** -0.040*** -0.032*** -0.028*** -0.032*** (0.009) (0.010) (0.009) (0.009) (0.010) Mean temperature -0.067*** -0.068*** -0.069*** -0.057*** -0.061*** (0.018) (0.020) (0.018) (0.018) (0.021) Temperature predictability 0.125 0.568 1.747 0.437 2.684 (1.566) (1.821) (1.573) (1.579) (1.824) Mean precipitation 0.001 0.000 -0.000 0.000 -0.001 (0.001) (0.001) (0.001) (0.001) (0.001) Precipitation predictability -0.169 -0.341 -0.753 -0.131 -1.012 (0.712) (0.833) (0.709) (0.739) (0.885) Tropical climate 0.057 -0.069 -0.401 -0.265 -0.111 -0.555* (0.178) (0.272) (0.339) (0.257) (0.274) (0.308) Suitability for agriculture -0.441*** -0.404*** -0.447*** -0.396*** -0.443*** -0.449*** (0.147) (0.149) (0.168) (0.142) (0.150) (0.160) Distance to coast 0.009 0.009 0.010 0.008 0.006 (0.012) (0.014) (0.011) (0.012) (0.013) Slope -0.045 -0.076* -0.051 -0.040 -0.075* (0.036) (0.041) (0.035) (0.037) (0.041) Ruggedness 0.207*** 0.225*** 0.213*** 0.187*** 0.216*** (0.059) (0.066) (0.060) (0.060) (0.070) Elevation -0.025** -0.019 -0.022** -0.022** -0.016 (0.010) (0.012) (0.010) (0.010) (0.012) Population -0.013** -0.016** (0.006) (0.006) Prop. of subsist. from herding -0.017*** -0.017*** (0.004) (0.006) Prop. of subsist. from hunting 0.018*** 0.024*** (0.005) (0.006) Prop. of subsist. from gathering -0.008 -0.012** (0.006) (0.005) Patrilocal marriages 0.084 0.131 (0.131) (0.148) Matrilocal marriages 0.209 0.106 (0.216) (0.218) Nuclear family structure -0.350*** -0.336*** (0.120) (0.123) Extended family structure -0.417*** -0.422*** (0.097) (0.104) Constant 3.104*** 4.786*** 5.512*** 4.019*** 4.419*** 4.270*** (0.320) (1.254) (1.468) (1.346) (1.289) (1.578)

Observations 489 489 382 489 486 379 R-squared 0.183 0.326 0.341 0.379 0.349 0.436 Adjusted R-squared 0.167 0.290 0.293 0.342 0.307 0.382 Robust standard errors in parentheses. *** p<0.01, ** p<0.05, * p<0.1

145 Table A.2.10: Regression analyses of maximal specifications with Conley standard errors adjusted for spatial correlation with cutoffs of 60 decimal degrees.

(1) (2) (3) (4) Cousin marriage Endogamy Female premarital Female participation VARIABLES sex prohibition in agriculture

Partible inheritance by males only 0.145 0.013 0.005 -0.190** (0.097) (0.058) (0.065) (0.093) Partible inheritance by both sexes 0.376*** 0.233*** 0.238*** -0.278** (0.137) (0.080) (0.090) (0.133)

Absence of private property yes yes yes yes

Maximal controls yes yes yes yes

Observations 651 686 427 489 R-squared 0.870 0.914 0.615 0.930 Adjusted R-squared 0.865 0.911 0.591 0.926 Conley standard errors adjusted for spatial correlation in parentheses. *** p<0.01, ** p<0.05, * p<0.1

Table A.2.11: Regression analyses of maximal specifications using ordered logistic model for re- gressions 1, 2, and 4, and binary logistic model for regression 3. Odds ratios are reported in the table.

(1) (2) (3) (4) Cousin marriage Endogamy Female premarital Female participation VARIABLES sex prohibition in agriculture

Partible inheritance by males only 1.346 1.025 1.003 0.651** (0.265) (0.216) (0.304) (0.137) Partible inheritance by both sexes 2.391** 2.494*** 3.344*** 0.506** (0.814) (0.787) (1.452) (0.149)

Absence of private property yes yes yes yes

Maximal controls yes yes yes yes

Observations 651 686 427 489 Pseudo R-squared 0.095 0.090 0.136 0.144 Robust standard errors in parentheses. *** p<0.01, ** p<0.05, * p<0.1

146 Table A.2.12: Regression analyses of maximal specifications including region fixed effects.

(1) (2) (3) (4) Cousin marriage Endogamy Female premarital Female participation VARIABLES sex prohibition in agriculture Partible inheritance by males only 0.067 -0.002 0.000 -0.166 (0.080) (0.048) (0.050) (0.101) Partible inheritance by both sexes 0.238* 0.180* 0.180* -0.246** (0.135) (0.089) (0.103) (0.109)

Absence of private property yes yes yes yes

Maximal controls yes yes yes yes Region fixed effects yes yes yes yes

Observations 651 686 427 489 R-squared 0.349 0.213 0.215 0.435 Adjusted R-squared 0.305 0.163 0.132 0.383 Number of clusters 17 17 17 17 Robust standard errors, clustered at the region level, in parentheses. *** p<0.01, ** p<0.05, * p<0.1

17 regions used in the regression analyses.

147 Table A.2.13: Model specification Table (1), column (1) from [16], updated with new sample, in- heritance systems, and maximal controls.

Alesina et al. (2013) Updated data Restricted sample VARIABLES N mean sd N mean sd N mean sd Female participation in agriculture 660 3.036 1.018 713 3.000 1.036 489 2.881 0.985 Traditional plough use 660 0.186 0.390 713 0.198 0.399 489 0.252 0.434 Settlement complexity 660 5.877 1.691 713 5.928 1.650 489 5.980 1.581 Political hierarchies 660 2.111 1.108 713 2.181 1.165 489 2.344 1.235 Presence of large animals 660 0.835 0.372 713 0.837 0.369 489 0.908 0.289 Suitability for agriculture 660 0.469 0.384 713 0.349 0.312 489 0.369 0.307 Tropical climate 660 0.917 0.266 713 0.902 0.298 489 0.904 0.295 (1) (2) (3) (4) (5) VARIABLES Alesina et al. (2013) Updated data Restricted sample and inheritance Updated controls

Partible inheritance by males only -0.244** -0.190** (0.097) (0.092) Partible inheritance by both sexes -0.570*** -0.278** (0.128) (0.129) Absence of private property 0.090 0.172 (0.146) (0.144) Traditional plough use -0.883*** -0.848*** -0.751*** -0.615*** -0.397*** (0.114) (0.107) (0.120) (0.119) (0.146) Non-irrigated intensive agriculture -0.217* (0.112) Irrigated intensive agriculture -0.290** (0.144) Settlement complexity -0.034 -0.048* -0.029 -0.010 -0.016 (0.028) (0.028) (0.033) (0.033) (0.034) Political hierarchies -0.019 -0.042 -0.024 -0.038 0.013 (0.041) (0.037) (0.041) (0.040) (0.042) Presence of large animals -0.048 -0.038 0.287* 0.367** -0.028 (0.111) (0.110) (0.162) (0.157) (0.152) Year society sampled 0.000 (0.000) Patrilineal descent 0.176* (0.105) Matrilineal descent 0.403*** (0.142) Dowry 0.386** (0.173) Bride price 0.219** (0.094) Latitude -0.034*** (0.009) Mean temperature -0.067*** (0.018) Temperature predictability 0.125 (1.566) Mean precipitation 0.001 (0.001) Precipitation predictability -0.169 (0.712) Tropical climate -0.563*** -0.439*** 0.009 0.057 -0.069 (0.176) (0.142) (0.177) (0.178) (0.272) Suitability for agriculture -0.029 -0.159 -0.386*** -0.441*** -0.404*** (0.106) (0.129) (0.149) (0.147) (0.149) Distance to coast 0.009 (0.012) Slope -0.045 (0.036) Ruggedness 0.207*** (0.059) Elevation -0.025** (0.010) Constant 4.011*** 4.030*** 3.172*** 3.104*** 4.786*** (0.223) (0.208) (0.294) (0.320) (1.254)

Observations 660 713 489 489 489 R-squared 0.135 0.135 0.143 0.183 0.326 Adjusted R-squared 0.127 0.128 0.132 0.167 0.290 Robust standard errors in parentheses. *** p<0.01, ** p<0.05, * p<0.1 148 Table A.2.14: Regression analyses of minimal specifications using data from [16].

(1) (2) (3) (4) Cousin marriage Endogamy Female premarital Female participation VARIABLES sex prohibition in agriculture

Partible inheritance by males only 0.017 0.003 0.033 -0.299*** (0.100) (0.057) (0.068) (0.102) Partible inheritance by both sexes 0.321** 0.353*** 0.189** -0.648*** (0.154) (0.084) (0.088) (0.128) Absence of private property 0.140 0.093 0.140* -0.008 (0.126) (0.073) (0.080) (0.154) Historical plough use 0.205 0.106 0.184** -0.636*** (0.136) (0.076) (0.074) (0.127) Settlement patterns 0.028 0.018 -0.029** -0.019 (0.025) (0.014) (0.014) (0.033) Jurisdictional hierarchy 0.171*** 0.026 0.100*** -0.035 (0.044) (0.026) (0.026) (0.044) Presence of large animals 0.147 -0.049 -0.106 0.298* (0.121) (0.067) (0.074) (0.163) Tropical climate 0.270** -0.032 0.021 -0.072 (0.126) (0.068) (0.074) (0.228) Suitability for agriculture -0.190* -0.157*** 0.033 -0.197* (0.102) (0.061) (0.069) (0.118) Constant 1.403*** 1.635*** 0.426*** 3.324*** (0.207) (0.112) (0.119) (0.345)

Observations 619 651 388 461 R-squared 0.116 0.068 0.105 0.165 Adjusted R-squared 0.103 0.0554 0.0832 0.148 Robust standard errors in parentheses. *** p<0.01, ** p<0.05, * p<0.1

149 Table A.2.15: Regression analyses distinguishing lineal and lateral inheritance systems.

Cousin marriage Endogamy Female premarital Female participation sex prohibition in agriculture VARIABLES (1) (2) (3) (4) (5) (6) (7) (8)

Inheritance by both sons and daughters 0.627*** 0.332** 0.355*** 0.223*** 0.138* 0.129* -0.423*** -0.112 (0.238) (0.140) (0.069) (0.073) (0.073) (0.077) (0.120) (0.123) Matrilineal heirs 0.868*** 0.441*** 0.056 -0.091 -0.225*** -0.220* 0.065 0.042 (0.194) (0.156) (0.065) (0.098) (0.078) (0.127) (0.140) (0.196) Patrilineal heirs 0.382* 0.299** -0.037 0.085 0.056 0.172 0.242 (0.197) (0.128) (0.076) (0.074) (0.079) (0.142) (0.149)

Absence of private property yes yes yes yes yes yes yes yes

Minimal controls yes yes yes yes Maximal controls yes yes yes yes

Observations 677 677 714 714 442 442 516 516 R-squared 0.208 0.145 0.138 0.165 0.169 0.327 Adjusted R-squared 0.102 0.176 0.0597 0.113 0.118 0.113 0.153 0.291

Figure shows inheritance systems of 856 Ethnographic Atlas societies, distinguishing lineal vs. lat- eral inheritance systems, and for the former, distinguishing female inclusion vs. female exclusion in inheritance. Lateral inheritance systems include inheritance by matrilineal or patrilineal heirs who take precedence over sons and daughters (indicator variables 3 and 4 below). For the regression analysis, I define the following indicator variables based on entry EA074: 1- in- heritance by both sons and daughters (categories 4 and 5); 2- inheritance by sons only (category 7); 3- inheritance by matrilineal heirs such as sister’s sons (categories 2, 3); 4- inheritance by patrilineal heirs such as younger brothers (category 6); and 5- no inheritance of real property (category 1). Note that the omitted category in the regressions is inheritance by sons only, including both impart- ible and partible forms. Female inheritance is insignificant in the regression on female participation in agriculture with maximal controls. As noted before, partibility of inheritance is likely to have an effect on female participation in agriculture, which is not captured by this categorization.

150 Table A.2.16: Descriptive statistics, and regression analysis of cousin marriage rates across Italian provinces.

VARIABLES N mean sd Cousin marriage percentage, 1960–1964 101 0.921 1.149 Female participation percentage, 2011 101 41.518 5.529 Communitarian family 101 0.317 0.468 Incomplete stem family 101 0.109 0.313 Egalitarian nuclear family 101 0.574 0.497 Population 101 0.462 0.461 GDP per capita 101 9.734 2.566 Share of agriculture 101 0.130 0.059 South and Islands 101 0.356 0.481 Papal states 101 0.109 0.313 Latitude 101 42.873 2.674 Mean Temperature 101 1.736 4.043 Mean precipitation 101 0.035 0.034 Tropical climate 101 0.634 0.484 Suitability for agriculture 101 0.782 0.217 Distance to coast 101 55.985 48.721 Slope 101 6.877 5.094 Ruggedness 101 2.256 1.619 Elevation 101 4.618 3.716

VARIABLES (1) (2) (3) (4)

Egalitarian nuclear family 0.782*** 0.407*** -4.731*** -3.753*** (0.148) (0.124) (0.937) (0.676) Incomplete stem family -0.061 0.461** 1.986* -1.367 (0.103) (0.195) (1.118) (0.955) GDP per capita -0.299*** -0.028 12.260** 9.929** (0.061) (0.073) (5.708) (4.229) Share of agriculture -1.031 2.058 -60.338*** -12.286 (1.865) (2.182) (15.308) (12.620) South and Islands -0.875** -2.477 (0.408) (1.555) Papal states -0.711*** -0.746 (0.220) (1.357) Latitude -0.375*** 1.244* (0.116) (0.634) Mean Temperature -0.040 0.126 (0.046) (0.228) Mean precipitation 3.769 35.987** (4.193) (15.869) Tropical climate -0.132 -0.229 (0.145) (1.125) Suitability for agriculture 0.244 0.943 (0.454) (2.097) Distance to coast 0.002* 0.010 (0.001) (0.009) Slope -1.905*** 2.363 (0.573) (3.191) Ruggedness 6.032*** -8.368 (1.778) (9.825) Elevation 0.008 0.423* (0.046) (0.217) Constant 3.526*** 16.264*** 45.818*** -10.391 (0.851) (5.185) (0.845) (28.395)

Observations 101 101 101 101 R-squared 0.592 0.874 0.412 0.822 Robust standard errors in parentheses. *** p<0.01, ** p<0.05, * p<0.1

151 Table A.2.17: Descriptive statistics and ethnic-level regression analyses with the IFLS female adults sample.

VARIABLES N mean sd Endogamy 7,575 0.703 0.457 Arranged marriage 7,705 0.234 0.424 Self-employed 6,646 0.265 0.441 Public/private sector employee 6,646 0.207 0.405 Family worker 6,646 0.149 0.357 Female inheritance 7,705 0.842 0.364 Education 7,705 1.108 1.205 Urban 7,705 0.443 0.497 Age 7,705 41.448 15.687 Marriage age 7,705 20.121 5.783 Muslim 7,705 0.881 0.324 Protestant 7,705 0.044 0.205 Catholic 7,705 0.017 0.130 Hindu 7,705 0.056 0.231 Buddhist 7,705 0.002 0.039 Matrilineal 7,705 0.070 0.255 Bride price 7,705 0.075 0.264 Female–dominated agriculture 7,705 0.201 0.401

Economic participation Endogamy Arranged marriage Self Public/Private Family VARIABLES (1) (2) (3) (4) (5)

Female inheritance 0.111*** 0.063*** -0.086*** 0.018 0.005 (0.014) (0.018) (0.011) (0.016) (0.036) Matrilineal -0.019 0.166*** 0.045** -0.006 0.006 (0.043) (0.017) (0.018) (0.057) (0.030) Bride price 0.020 0.110*** 0.054*** -0.134*** 0.100*** (0.024) (0.019) (0.016) (0.014) (0.027) Female–dominated agriculture -0.014 -0.067*** -0.042*** -0.023*** -0.032 (0.022) (0.008) (0.008) (0.005) (0.026) Protestant -0.007 -0.094** 0.004 0.048 0.065 (0.025) (0.034) (0.041) (0.031) (0.074) Catholic -0.054*** -0.047** -0.101*** 0.022 0.052 (0.013) (0.019) (0.028) (0.019) (0.044) Hindu 0.022 -0.035 -0.006 0.025 0.077*** (0.053) (0.029) (0.025) (0.039) (0.017) Other religions -0.153 0.034 -0.065 -0.042 0.085 (0.135) (0.118) (0.110) (0.120) (0.086) Education -0.049*** -0.029** -0.018*** 0.035*** -0.018*** (0.004) (0.011) (0.004) (0.008) (0.003) Urban -0.098*** -0.053*** -0.005 0.049*** -0.132*** (0.009) (0.014) (0.021) (0.014) (0.016) Age -0.001 0.009*** 0.035*** 0.023*** 0.005*** (0.001) (0.002) (0.003) (0.001) (0.001) Age2 0.000 -0.000 -0.000*** -0.000*** -0.000*** (0.000) (0.000) (0.000) (0.000) (0.000) Marriage age -0.004 -0.016*** -0.006* 0.009*** -0.008** (0.004) (0.003) (0.003) (0.002) (0.003) Marriage age2 0.000 0.000*** 0.000 -0.000 0.000 (0.000) (0.000) (0.000) (0.000) (0.000)

Province FE yes yes yes yes yes Observations 7,575 7,705 6,645 6,645 6,645 R-squared 0.058 0.231 0.074 0.063 0.089 Number of clusters 12 12 12 12 12 OLS estimates are reported with robust standard errors, clustered at the ethnicity level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

152 Table A.2.18: Ethnic-level logistic regression analyses with the IFLS female adults sample.

Economic participation Endogamy Arranged marriage Self Public/Private Family VARIABLES (1) (2) (3) (4) (5)

Female inheritance 0.114*** 0.088*** -0.083*** 0.012 0.021 (0.014) (0.027) (0.010) (0.018) (0.018) Matrilineal -0.007 0.183*** 0.041** 0.007 0.073** (0.037) (0.019) (0.019) (0.054) (0.034) Bride price 0.024 0.165*** 0.050*** -0.124*** 0.200*** (0.022) (0.027) (0.017) (0.008) (0.043) Female–dominated agriculture -0.016 -0.080*** -0.046*** -0.022*** -0.061*** (0.023) (0.005) (0.009) (0.004) (0.011) Protestant -0.007 -0.083*** 0.001 0.046 0.090* (0.021) (0.024) (0.039) (0.030) (0.048) Catholic -0.049*** -0.044*** -0.091*** 0.009 -0.007 (0.013) (0.013) (0.027) (0.020) (0.021) Hindu 0.019 -0.037 -0.006 0.018 0.051* (0.046) (0.053) (0.022) (0.042) (0.029) Other religions -0.162 -0.001 -0.051 -0.063 0.080 (0.142) (0.094) (0.106) (0.150) (0.064) Education -0.045*** -0.036*** -0.017*** 0.030*** -0.017*** (0.004) (0.009) (0.004) (0.008) (0.003) Urban -0.100*** -0.054*** -0.005 0.049*** -0.088*** (0.009) (0.008) (0.021) (0.017) (0.009) Age -0.001 0.014*** 0.043*** 0.025*** 0.004** (0.001) (0.001) (0.003) (0.002) (0.002) Age2 0.000 -0.000*** -0.000*** -0.000*** -0.000** (0.000) (0.000) (0.000) (0.000) (0.000) Marriage age -0.004 -0.012*** -0.004 0.012*** 0.003 (0.005) (0.002) (0.003) (0.002) (0.003) Marriage age2 0.000 0.000*** 0.000 -0.000*** -0.000* (0.000) (0.000) (0.000) (0.000) (0.000)

Province FE yes yes yes yes yes Observations 7,575 7,705 6,645 6,645 5,173 Pseudo R-squared 0.0473 0.236 0.0706 0.0653 0.151 Number of clusters 12 12 12 12 10 Average marginal effects of logit estimates are reported with robust delta method standard errors, clustered at the ethnicity level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

153 Table A.2.19: Ethnic-level regression analyses of the IFLS male adults sample.

Economic participation Endogamy Arranged marriage Self Public/Private Family (SD=0.464) (SD=0.340) (SD=0.498) (SD=0.450) (SD=0.157) VARIABLES (1) (2) (3) (4) (5)

Female inheritance -0.018 0.023* -0.022 0.041 -0.006 (0.019) (0.012) (0.025) (0.029) (0.003) Matrilineal -0.042* 0.081*** 0.022 -0.035 -0.004 (0.020) (0.018) (0.057) (0.076) (0.010) Bride price -0.011 0.056*** 0.078** -0.094** 0.004 (0.028) (0.012) (0.027) (0.031) (0.002) Female–dominated agriculture -0.017 -0.039*** -0.030*** 0.039*** -0.011*** (0.011) (0.004) (0.007) (0.006) (0.002) Protestant 0.017 -0.057** -0.017 -0.011 0.023** (0.019) (0.025) (0.027) (0.037) (0.008) Catholic -0.059* -0.039 -0.052 0.047 -0.002 (0.027) (0.042) (0.038) (0.041) (0.010) Hindu 0.138*** 0.001 -0.144*** 0.113* -0.004 (0.021) (0.068) (0.036) (0.052) (0.004) Other religions 0.114 -0.015 0.424** -0.361* -0.016** (0.111) (0.216) (0.185) (0.189) (0.007) Education -0.041*** -0.019*** -0.056*** 0.056*** -0.002* (0.003) (0.005) (0.010) (0.011) (0.001) Urban -0.114*** -0.035*** -0.176*** 0.176*** -0.012** (0.009) (0.004) (0.019) (0.015) (0.004) Age -0.003 0.003 0.017*** 0.005* -0.009*** (0.003) (0.002) (0.002) (0.002) (0.002) Age2 0.000 0.000** -0.000*** -0.000*** 0.000*** (0.000) (0.000) (0.000) (0.000) (0.000) Marriage age -0.008*** -0.003 0.006* -0.003 0.000 (0.001) (0.003) (0.004) (0.006) (0.001) Marriage age2 0.000*** 0.000 -0.000** 0.000 0.000 (0.000) (0.000) (0.000) (0.000) (0.000)

Province FE yes yes yes yes yes Observations 6,045 6,116 5,180 5,180 5,180 R-squared 0.083 0.130 0.123 0.126 0.016 Number of clusters 12 12 11 11 11 OLS estimates are reported with robust standard errors, clustered at the ethnicity level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

154 Table A.2.20: Ethnic-level logistic regression analyses of the IFLS male adults sample.

Economic participation Endogamy Arranged marriage Self Public/Private Family VARIABLES (1) (2) (3) (4) (5)

Female inheritance -0.029 0.042*** -0.020 0.039 0.001 (0.025) (0.016) (0.023) (0.028) (0.008) Matrilineal -0.034* 0.097*** 0.022 -0.032 -0.021** (0.019) (0.032) (0.056) (0.075) (0.010) Bride price -0.025 0.104*** 0.077*** -0.091*** 0.008 (0.036) (0.018) (0.025) (0.030) (0.020) Female–dominated agriculture -0.017* -0.053*** -0.032*** 0.040*** -0.007 (0.010) (0.004) (0.007) (0.006) (0.009) Protestant 0.028* -0.049*** -0.018 -0.016 0.034* (0.016) (0.018) (0.029) (0.041) (0.019) Catholic -0.052* -0.030 -0.054 0.048 (0.028) (0.025) (0.040) (0.042) Hindu 0.153*** 0.035 -0.143*** 0.112** 0.002 (0.018) (0.170) (0.030) (0.049) (0.011) Other religions 0.105 -0.002 0.413*** -0.354** (0.107) (0.182) (0.149) (0.149) Education -0.039*** -0.024*** -0.055*** 0.054*** -0.007 (0.005) (0.002) (0.010) (0.010) (0.006) Urban -0.111*** -0.034*** -0.176*** 0.176*** -0.035*** (0.009) (0.007) (0.018) (0.015) (0.004) Age -0.004 0.010*** 0.016*** 0.006*** -0.012*** (0.003) (0.002) (0.002) (0.002) (0.004) Age2 0.000 -0.000*** -0.000*** -0.000*** 0.000*** (0.000) (0.000) (0.000) (0.000) (0.000) Marriage age -0.008*** -0.002* 0.007* -0.003 0.007** (0.002) (0.001) (0.004) (0.006) (0.003) Marriage age2 0.000*** 0.000 -0.000*** 0.000 -0.000 (0.000) (0.000) (0.000) (0.000) (0.000)

Province FE yes yes yes yes yes Observations 6,045 6,116 5,180 5,180 3,443 Pseudo R-squared 0.0717 0.172 0.0942 0.0967 0.180 Number of clusters 12 12 11 11 9 Average marginal effects of logit estimates are reported with robust delta method standard errors, clustered at the ethnicity level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

155 Table A.2.21: Descriptive statistics and individual-level regression analyses with the IFLS female adults sample.

VARIABLES N mean sd Endogamy 4,231 0.741 0.438 Arranged marriage 4,289 0.381 0.486 Self-employed 3,656 0.272 0.445 Public/private sector employee 3,656 0.144 0.351 Family worker 3,656 0.140 0.347 Inheritance dummy 4,289 0.334 0.472 Muslim 4,289 0.869 0.337 Protestant 4,289 0.047 0.212 Catholic 4,289 0.023 0.150 Hindu 4,289 0.041 0.198 Other religions 4,289 0.020 0.139 Education 4,289 0.594 0.941 Urban 4,289 0.465 0.499 Age 4,289 45.717 13.686 Marriage age 4,289 18.994 5.895

Economic participation Endogamy Arranged marriage Self Private/Public Family VARIABLES (1) (2) (3) (4) (5) (6) (7) (8) (9) (10)

Inheritance 0.050*** 0.053*** 0.035** 0.036** 0.029 0.023 -0.006 -0.006 0.007 0.010 (0.015) (0.017) (0.015) (0.017) (0.019) (0.020) (0.013) (0.015) (0.012) (0.013) Protestant -0.126** -0.133** -0.081** -0.075* 0.016 0.042 -0.046 -0.046 0.079*** 0.062 (0.054) (0.066) (0.037) (0.044) (0.059) (0.069) (0.036) (0.041) (0.030) (0.041) Catholic -0.015 -0.051 -0.028 0.030 0.037 0.061 -0.122*** -0.103* 0.031 0.032 (0.071) (0.081) (0.056) (0.053) (0.071) (0.082) (0.043) (0.057) (0.036) (0.050) Hindu -0.163** -0.220*** -0.056 0.024 -0.023 0.102 -0.026 0.042 0.037 0.092 (0.068) (0.063) (0.078) (0.135) (0.123) (0.140) (0.072) (0.085) (0.027) (0.068) Other religions -0.099 -0.182*** 0.085 0.173** -0.016 -0.048 -0.117*** -0.062 0.170** 0.184** (0.070) (0.067) (0.067) (0.084) (0.058) (0.078) (0.044) (0.070) (0.068) (0.089) Education -0.034*** -0.028** -0.050*** -0.053*** -0.029*** -0.032*** 0.054*** 0.066*** -0.020*** -0.022*** (0.011) (0.012) (0.008) (0.009) (0.009) (0.010) (0.011) (0.012) (0.006) (0.007) Age 0.003 0.005 0.010*** 0.011*** 0.021*** 0.018*** 0.014*** 0.015*** 0.010** 0.010* (0.003) (0.004) (0.003) (0.004) (0.006) (0.006) (0.004) (0.005) (0.004) (0.005) Age2 -0.000 -0.000 -0.000 -0.000 -0.000*** -0.000** -0.000*** -0.000*** -0.000** -0.000** (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) Marriage age -0.007 -0.008* -0.016*** -0.014*** -0.002 -0.001 0.010** 0.008* -0.007* -0.007* (0.004) (0.005) (0.004) (0.005) (0.006) (0.007) (0.005) (0.005) (0.004) (0.004) Marriage age2 0.000 0.000 0.000** 0.000 -0.000 -0.000 -0.000 -0.000 0.000 0.000* (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000)

Community FE yes yes yes yes yes yes yes yes yes yes Ethnicity FE yes yes yes yes yes Observations 4,231 3,615 4,289 3,657 3,656 3,269 3,656 3,269 3,656 3,269 R-squared 0.179 0.202 0.382 0.394 0.161 0.175 0.194 0.211 0.338 0.350 Number of clusters 312 310 312 310 311 309 311 309 311 309 OLS estimates are reported with robust standard errors, clustered at the community level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

156 Table A.2.22: Individual-level logistic regression analyses with the IFLS female adults sample.

Economic participation Endogamy Arranged marriage Self Private/Public Family VARIABLES (1) (2) (3) (4) (5) (6) (7) (8) (9) (10)

Inheritance 0.053*** 0.058*** 0.036** 0.040** 0.030 0.023 -0.005 -0.005 0.015 0.010 (0.014) (0.016) (0.015) (0.018) (0.019) (0.017) (0.012) (0.012) (0.024) (0.010) Protestant -0.096** -0.108* -0.107** -0.099* 0.019 0.045 -0.039* -0.044 0.223** 0.072 (0.046) (0.057) (0.048) (0.057) (0.063) (0.068) (0.023) (0.027) (0.088) (0.050) Catholic -0.006 -0.034 -0.038 0.024 0.046 0.059 -0.088*** -0.089*** 0.105 0.024 (0.054) (0.062) (0.063) (0.064) (0.078) (0.074) (0.020) (0.021) (0.126) (0.061) Hindu -0.156** -0.244*** -0.067 0.035 -0.021 0.103 -0.009 0.116 0.162 0.066 (0.069) (0.078) (0.119) (0.447) (0.102) (0.127) (0.050) (0.143) (0.158) (0.100) Other religions -0.079 -0.161** 0.086 0.184** -0.013 -0.041 -0.111*** -0.091 0.417*** 0.275*** (0.062) (0.067) (0.065) (0.090) (0.057) (0.064) (0.026) (0.060) (0.095) (0.090) Education -0.028*** -0.023** -0.062*** -0.069*** -0.033*** -0.034*** 0.043*** 0.052*** -0.064*** -0.032*** (0.009) (0.009) (0.009) (0.011) (0.010) (0.009) (0.008) (0.007) (0.018) (0.008) Age 0.003 0.005 0.012*** 0.013*** 0.024*** 0.021*** 0.015*** 0.014*** 0.024** 0.010** (0.003) (0.004) (0.003) (0.004) (0.006) (0.006) (0.005) (0.005) (0.010) (0.004) Age2 -0.000 -0.000 -0.000 -0.000 -0.000*** -0.000*** -0.000*** -0.000*** -0.000*** -0.000*** (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) Marriage age -0.007 -0.009 -0.016*** -0.015*** -0.002 -0.001 0.012** 0.010** -0.014** -0.007** (0.005) (0.005) (0.004) (0.005) (0.006) (0.005) (0.005) (0.005) (0.007) (0.003) Marriage age2 0.000 0.000 0.000** 0.000* -0.000 -0.000 -0.000* -0.000 0.000* 0.000** (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000)

Observations 4,054 3,405 3,944 3,216 3,418 2,974 2,665 2,308 2,033 1,761 Community FE yes yes yes yes yes yes yes yes yes yes Ethnicity FE yes yes yes yes yes Pseudo R-squared 0.146 0.160 0.300 0.298 0.117 0.122 0.145 0.158 0.217 0.221 Number of clusters 297 290 279 262 280 270 212 205 153 143 Average marginal effects of logit estimates are reported with robust delta method standard errors, clustered at the community level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

157 Table A.2.23: Descriptive statistics and individual-level regression analyses with the IFLS male adults sample.

VARIABLES N mean sd Endogamy 3,702 0.737 0.441 Arranged marriage 3,776 0.240 0.427 Self-employed 3,059 0.524 0.500 Public/private sector employee 3,059 0.429 0.495 Family worker 3,059 0.006 0.079 Inheritance 3,776 0.399 0.490 Muslim 3,776 0.866 0.341 Protestant 3,776 0.051 0.219 Catholic 3,776 0.021 0.144 Hindu 3,776 0.045 0.207 Other religions 3,776 0.017 0.131 Education 3,776 0.952 1.131 Urban 3,776 0.470 0.499 Age 3,776 47.975 13.371 Marriage age 3,776 24.510 7.029

Economic participation Endogamy Arranged marriage Self Private/Public Family VARIABLES (1) (2) (3) (4) (5) (6) (7) (8) (9) (10)

Inheritance 0.065*** 0.058*** 0.020 0.009 0.041** 0.039* -0.038** -0.041* 0.002 0.001 (0.015) (0.018) (0.014) (0.016) (0.019) (0.022) (0.019) (0.021) (0.003) (0.003) Protestant -0.039 -0.061 -0.017 -0.008 0.180*** 0.171** -0.164*** -0.163** -0.001 -0.002 (0.056) (0.073) (0.039) (0.048) (0.055) (0.071) (0.058) (0.070) (0.003) (0.003) Catholic -0.101 -0.166* -0.065 -0.028 0.093 0.062 -0.107 -0.081 -0.002 -0.001 (0.079) (0.096) (0.056) (0.056) (0.064) (0.081) (0.069) (0.086) (0.002) (0.003) Hindu 0.050 0.190 -0.095 -0.086 0.025 0.188 -0.015 -0.064 0.000 0.002 (0.174) (0.155) (0.071) (0.099) (0.138) (0.268) (0.119) (0.204) (0.001) (0.002) Other religions 0.080 -0.006 -0.101 -0.078 0.189** 0.130 -0.253*** -0.138 -0.000 0.001 (0.062) (0.102) (0.066) (0.073) (0.093) (0.102) (0.091) (0.104) (0.001) (0.002) Education -0.032*** -0.029*** -0.026*** -0.022*** -0.095*** -0.103*** 0.091*** 0.103*** -0.001 -0.001 (0.009) (0.010) (0.007) (0.008) (0.010) (0.011) (0.010) (0.011) (0.001) (0.001) Age 0.002 0.003 0.007** 0.009** 0.011 0.008 0.008 0.010 -0.003 -0.003* (0.004) (0.005) (0.003) (0.004) (0.008) (0.009) (0.009) (0.009) (0.002) (0.002) Age2 -0.000 -0.000 0.000 -0.000 -0.000 -0.000 -0.000 -0.000 0.000* 0.000* (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) Marriage age -0.005 -0.004 -0.006 -0.010* 0.001 -0.003 -0.001 0.003 0.001* 0.001 (0.004) (0.005) (0.004) (0.005) (0.005) (0.006) (0.005) (0.006) (0.001) (0.001) Marriage age2 0.000 0.000 0.000 0.000 -0.000 0.000 0.000 -0.000 -0.000 -0.000 (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000)

Community FE yes yes yes yes yes yes yes yes yes yes Ethnicity FE yes yes yes yes yes Observations 3,702 3,018 3,776 3,073 3,059 2,612 3,059 2,612 3,059 2,612 R-squared 0.180 0.197 0.301 0.307 0.348 0.373 0.346 0.372 0.121 0.119 Number of clusters 312 311 312 311 312 311 312 311 312 311 OLS estimates are reported with robust standard errors, clustered at the community level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

158 Table A.2.24: Individual-level logistic regression analyses with the IFLS male adults sample.

Economic participation Endogamy Arranged marriage Self Private/Public Family VARIABLES (1) (2) (3) (4) (5) (6) (7) (8) (9) (10)

Inheritance 0.066*** 0.057*** 0.029* 0.015 0.046** 0.041** -0.040** -0.047** (0.014) (0.017) (0.017) (0.017) (0.019) (0.019) (0.019) (0.019) Protestant -0.028 -0.037 -0.001 -0.002 0.188*** 0.187*** -0.161*** -0.177*** (0.049) (0.061) (0.064) (0.071) (0.048) (0.061) (0.047) (0.056) Catholic -0.093 -0.154* -0.066 -0.007 0.085 0.069 -0.089 -0.086 (0.078) (0.092) (0.065) (0.067) (0.073) (0.083) (0.068) (0.083) Hindu 0.046 0.169* -0.135 -0.088 0.022 0.169 0.026 0.022 (0.151) (0.090) (0.107) (0.115) (0.146) (0.239) (0.132) (0.194) Other religions 0.068 0.007 -0.122 -0.095 0.187** 0.147* -0.235*** -0.167* (0.047) (0.088) (0.091) (0.101) (0.080) (0.084) (0.069) (0.087) Education -0.031*** -0.026*** -0.040*** -0.033*** -0.098*** -0.111*** 0.089*** 0.109*** (0.008) (0.009) (0.010) (0.010) (0.010) (0.009) (0.009) (0.009) Age 0.002 0.003 0.017*** 0.019*** 0.014 0.011 0.008 0.011 (0.004) (0.005) (0.005) (0.005) (0.009) (0.009) (0.009) (0.009) Age2 -0.000 -0.000 -0.000* -0.000** -0.000 -0.000 -0.000 -0.000* (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) Marriage age -0.005 -0.003 -0.008* -0.009* 0.001 -0.004 0.000 0.004 (0.004) (0.005) (0.004) (0.005) (0.005) (0.005) (0.005) (0.006) Marriage age2 0.000 0.000 0.000 0.000 -0.000 0.000 0.000 -0.000 (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000)

Community FE yes yes yes yes yes yes yes yes Ethnicity FE yes yes yes yes Observations 3,406 2,731 3,115 2,334 2,729 2,291 2,731 2,263 Pseudo R-squared 0.131 0.143 0.235 0.228 0.225 0.242 0.233 0.241 Number of clusters 283 274 243 216 267 256 269 255 Average marginal effects of logit estimates are reported with robust delta method standard errors, clustered at the community level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels. In regressions 9 and 10, many observations dropped because of collinearity in the logit estimation.

159 Table A.2.25: Individual-level regression analyses with the IFLS full sample using pre- and post- marriage inheritance indicators.

VARIABLES N mean sd Endogamy 7,933 0.739 0.439 Arranged marriage 8,065 0.315 0.465 Inheritance dummy 8,065 0.364 0.481 Post-marriage inheritance 8,065 0.198 0.399 Pre-marriage inheritance 8,065 0.166 0.372 Muslim 8,065 0.868 0.339 Protestant 8,065 0.049 0.216 Catholic 8,065 0.022 0.147 Hindu 8,065 0.043 0.202 Other religions 8,065 0.019 0.135 Education 8,065 0.762 1.050 Urban 8,065 0.467 0.499 Age 8,065 46.774 13.586 Marriage age 8,065 21.577 7.013 Male 8,065 0.468 0.499

Endogamy Arranged marriage VARIABLES (1) (2) (3) (4) (5) (6) (7) (8)

Inheritance 0.066*** 0.067*** 0.034*** 0.029*** (0.010) (0.012) (0.010) (0.011) Post-marriage inheritance 0.057*** 0.056*** 0.028** 0.025* (0.013) (0.015) (0.013) (0.014) Pre-marriage inheritance 0.077*** 0.080*** 0.042*** 0.035** (0.012) (0.014) (0.012) (0.014) Protestant -0.072** -0.087* -0.073** -0.088** -0.054* -0.051 -0.055* -0.051 (0.036) (0.044) (0.035) (0.044) (0.029) (0.032) (0.029) (0.032) Catholic -0.061 -0.090 -0.062 -0.090 -0.048 0.002 -0.049 0.001 (0.052) (0.063) (0.052) (0.063) (0.044) (0.032) (0.044) (0.032) Hindu -0.064 -0.034 -0.065 -0.035 -0.085 -0.040 -0.086 -0.040 (0.103) (0.081) (0.103) (0.082) (0.066) (0.101) (0.066) (0.101) Other religions -0.020 -0.076 -0.021 -0.077 0.002 0.064 0.002 0.064 (0.054) (0.062) (0.054) (0.062) (0.050) (0.059) (0.050) (0.059) Education -0.028*** -0.023*** -0.028*** -0.023*** -0.034*** -0.036*** -0.034*** -0.035*** (0.007) (0.007) (0.007) (0.007) (0.006) (0.006) (0.006) (0.006) Male 0.014 0.012 0.015 0.013 -0.112*** -0.114*** -0.112*** -0.114*** (0.014) (0.015) (0.014) (0.015) (0.010) (0.011) (0.010) (0.011) Age 0.002 0.004 0.003 0.004 0.010*** 0.010*** 0.010*** 0.010*** (0.002) (0.003) (0.002) (0.003) (0.002) (0.003) (0.002) (0.003) Age2 -0.000 -0.000 -0.000 -0.000 -0.000 -0.000 -0.000 -0.000 (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) Marriage age -0.008*** -0.007** -0.008*** -0.007** -0.015*** -0.016*** -0.015*** -0.016*** (0.002) (0.003) (0.002) (0.003) (0.003) (0.003) (0.003) (0.003) Marriage age2 0.000** 0.000 0.000** 0.000 0.000*** 0.000*** 0.000*** 0.000*** (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000) (0.000)

Community FE yes yes yes yes yes yes yes yes Ethnicity FE yes yes yes yes Observations 7,933 6,633 7,933 6,633 8,065 6,730 8,065 6,730 R-squared 0.117 0.127 0.117 0.127 0.336 0.342 0.336 0.342 Number of clusters 312 311 312 311 312 311 312 311 OLS estimates are reported with robust standard errors, clustered at the community level, in parentheses. ***, **, and * indicate significance at the 1, 5, and 10% levels.

160 Table A.2.26: Descriptive statistics of analyses with the NFHS female adults.

VARIABLES N mean sd Cousin marriage 461,746 0.060 0.237 Subject to amendment 461,746 0.279 0.449 Hindu 461,746 0.825 0.380 Muslim 461,746 0.098 0.297 Christian 461,746 0.077 0.267 First marriage year 461,746 1,997.6 12.182 Caste 461,746 0.177 0.381 Tribe 461,746 0.187 0.390 Education 461,746 1.124 1.033 Wealth 461,746 2.900 1.430 Urban 461,746 0.272 0.445 The sample includes ever married women. Dummy variable "Cousin marriage" takes value 1 if a woman’s husband in her first marriage is a blood relative, and takes value 0 otherwise. Dummy variable "Hindu" takes value 1 if the respondent is Hindu, Sikh, Jain, or Buddhist, and takes value 0 if the respondent is Muslim, Christian, Parsi, or Jewish. Dummy variable "Caste/Tribe" takes value 1 if the respondent is a member of a scheduled caste/tribe, and takes value 0 otherwise. Education takes values 1 to 4 for no education, primary education, secondary education, and higher education respectively. Wealth takes values 1 to 5 for the poorest to the richest quintile respectively. Dummy variable "Urban" takes value 1 if the respondent resides in a urban region, and takes value 0 if the respondent resides in a rural region.

VARIABLES N mean sd Premarital sex 419,478 0.096 0.294 Subject to amendment 419,478 0.298 0.457 Hindu 419,478 0.829 0.376 Muslim 419,478 0.099 0.299 Christian 419,478 0.072 0.258 First marriage year 419,478 1,999.8 9.813 Caste 419,478 0.187 0.390 Tribe 419,478 0.179 0.383 Education 419,478 1.200 1.035 Wealth 419,478 2.934 1.436 Urban 419,478 0.287 0.452 The sample includes currently married women. Dummy variable "Premarital sex" takes value 1 if a woman had her first sexual intercourse before her first union (with a husband), and takes value 0 if she had her first sexual intercourse at or after her first union or if she never had sex with her first husband. Dummy variable "Caste/Tribe" takes value 1 if the respondent is a member of a scheduled caste/tribe, and takes value 0 otherwise. Education takes values 1 to 4 for no education, primary education, secondary education, and higher education respectively. Wealth takes values 1 to 5 for the poorest to the richest quintile respectively. Dummy variable "Urban" takes value 1 if the respondent resides in a urban region, and takes value 0 if the respondent resides in a rural region.

161 Table A.2.27: Individual-level regression analyses with the NFHS female adults with standard errors clustered at the state level, and birth year fixed effects.

Cousin marriage Premarital sex VARIABLES (1) (2) (4) (5)

Subject to amendment 0.012*** 0.012*** -0.012** -0.013*** (0.004) (0.004) (0.005) (0.003) Hindu -0.099*** -0.096*** 0.004** -0.007 (0.001) (0.009) (0.002) (0.007)

Marriage year FE yes yes yes yes State FE yes yes yes yes State × Marriage year FE yes yes yes yes State × Hindu yes yes yes yes Individual-level controls yes yes yes yes Birth year FE yes yes State × Birth year FE yes yes

Observations 461,746 461,746 419,478 419,478 R-squared 0.056 0.061 0.071 0.114 Number of clusters 29 22751 29 25131 OLS estimates are reported with robust standard errors in parentheses, clustered at the state level in regressions 1 and 3, and clustered at the level of the primary sampling unit in regressions 2 and 4. Individual-level controls include caste and tribe dummy, education, wealth, and urban dummy variables, and survey round fixed effects. ***, **, and * indicate significance at the 1, 5, and 10% levels.

Table A.2.28: Individual-level regression analyses with the NFHS female adults using omitted states only, and full sample.

Cousin marriage Premarital sex VARIABLES (1) (2) (4) (5)

Subject to amendment -0.010 0.009*** -0.008 -0.012*** (0.008) (0.003) (0.007) (0.003) Hindu -0.044* -0.039 -0.001 -0.002 (0.027) (0.026) (0.023) (0.023)

Marriage year FE yes yes yes yes State FE yes yes yes yes State × Marriage year FE yes yes yes yes State × Hindu yes yes yes yes Individual-level controls yes yes yes yes

Observations 104,182 577,776 93,730 522,637 R-squared 0.069 0.119 0.056 0.069 Number of clusters 5262 28438 6086 31898 Estimates are reported with robust standard errors, clustered at the level of the primary sampling unit, in parentheses. Regression 1 and 3 report results for states that locally passed similar amendments in the past. Regressions 2 and 4 report the results for all Indian states. Individual-level controls include caste and tribe dummy, education, wealth, and urban dummy variables, and survey round fixed effects. ***, **, and * indicate significance at the 1, 5, and 10% levels.

162 Table A.2.29: Individual-level regression analyses with the NFHS female adults using instrumental variable and age cohort comparison approaches.

Cousin marriage Premarital sex VARIABLES (1) (2) (3) (4)

Subject to amendment 0.012*** 0.015*** -0.012*** -0.071*** (0.004) (0.003) (0.003) (0.003) Hindu -0.099*** -0.105*** 0.004 0.024*** (0.009) (0.010) (0.007) (0.008)

Marriage year FE yes yes yes yes State FE yes yes yes yes State × Marriage year FE yes yes yes yes State × Hindu FE yes yes yes yes Individual-level controls yes yes yes yes

Observations 461,746 323,895 419,478 278,846 R-squared 0.056 0.060 0.071 0.087 Number of clusters 22751 22712 25131 25051 Estimates are reported with robust standard errors, clustered at the level of the primary sampling unit, in parentheses. Regression 1 and 3 report results from instrumental variable estimations. Regression 2 and 4 report results from OLS estimations in which the treated group includes Hindu women aged 14 or younger in 2005, and the control group includes women (of all religions) aged 24 or older in 2005. ***, **, and * indicate significance at the 1, 5, and 10% levels.

163 Appendix B

Kinship, Fractionalization and Corruption

B.1 Quantitative description of genetic relatedness and distance

First let us explain some vocabulary. A “locus" is a place on a chromosome where an “allele" resides. A locus is not a tangible object, it is a map describing where to find an allele, which is the piece of DNA in that location. Some books use gene as a synonym for an allele. An individual has two alleles at a particular locus, one from the mother and the other from the father. Alleles are identical by descent if they share a common ancestor allele in a relatively short time in the past, say, the past 10 generations [101, p. 6-8]. Identity by descent is used as the basis of a quantitative description of relatedness. One simple measure is the “coefficient of kinship notated as fxy which is the probability that two alleles, one from individual X and one from individual Y, are identical by descent. This coefficient can be written as 1 1 fxy = p + p 4 1 2 2 where p1 and p2 are, respectively, the probabilities of sharing one and two identical by descent alleles (p0 is the probabil- ity of sharing 0 identical by descent alleles). The intuition is that there are two mutually exclusive ways that two chosen 1 alleles from X and Y might be identical by descent; they share exactly one allele or exactly two alleles. The term 4 p1 in the equation above is the probability that X and Y share exactly one identical by descent allele times the conditional 1 probability that these two alleles are chosen from X and Y. The term 2 p2 is the probability that X and Y share exactly two identical by descent allele times the conditional probability that two identical alleles are chosen from X and Y.

Relationship p0 p1 p2 Identical twin (IT) 0 0 1 Parent-offspring (PO) 0 1 0 Full sibs (FS) 1/4 1/2 1/4 First cousins (FC) 3/4 1/4 0

Table B.1.1: Identity by descent.

It follows that fPO = 1/4, fFS = 1/4 and fFC = 1/16. The “coefficient of relatedness", r, is one-half of the mean 1 1 number of shared alleles (p1 + 2p2), r = 2 r1 + r2 therefore, fxy = 2 r and rPO = 1/2, rFS = 1/2 and rFC = 1/8 [101, p. 121-123].

It follows that fxy is defined over the the range of [0, 0.5]. It is important that the coefficient of kinship not be confused with the coefficient of relatedness, r. In a random mating population, the relationship between the two coefficients is simple: the coefficient of relatedness is just twice the coefficient of kinship, therefore in the range of [0, 1]. The coefficient of relatedness could be interpreted as the expected fraction of alleles that are shared identical by descent between two individuals [136]. Table B.1.2 summarizes coefficients of relatedness for kin relationships.

164 Relationship to you Relatedness coefficient identical twin 1 fraternal twin, parent, child, sibling 1/2 grandparent, grandchild, aunt, uncle, 1/4 niece, nephew great-grandparent, great-grandchild, 1/8 great-aunt, great-uncle, great-niece, great-nephew, first-cousin second-cousin 1/32 nth cousin 1/22n+1 a perfect stranger 0

Table B.1.2: Expected relatedness of individuals under random mating.

The expected coefficients of relatedness computed from Table B.1.1 are valid only under the assumption of random mating. With consanguineous marriages, actual relatedness will exceed expected relatedness. For example, two offspring from a first-cousin marriage (drawn from a previously randomly mating population) have a relatedness higher than 1/2 (r = 1/2 + 1/2 × 1/8 = 0.5625): with probability 1/2 they inherit a gene from the same parent at each locus, and with probability 1/2 they each inherent a gene from a different parent, in which case the probability of gene sharing is just the relatedness between their parents, 1/8. Individuals born of consanguineous union have segments of their genomes that are homozygous as a result of inheriting identical-by-descent genomic segments through both parents. The extra term in the relatedness of offspring of a first-cousin marriage (i.e. 1/2 × 1/8) represents the expected excess homozygosity in the genome of the offspring of a union between first cousins. The extra term is (1/2 × 1/32) for offspring of second- cousins; (1/2 × 1/4) for offspring of double-first cousins; and (1/2 × 1/2) for the offspring of sibling or parent-offspring (incestuous) unions. Therefore, “offspring of second cousins are expected to have children with 1/64 of their genome homozygous; offspring of first cousins, 1/16; offspring of double-first cousins, 1/8; and offspring of incestuous union, 1/4" [260, p. 889]. [131] explores the consequences of extreme inbreeding, developing a model of endogamous colonies where the related- ness of all colony members can rise to the level of siblings under random mating (1/2). In such a world,

“siblings, parents, and offspring will still be the individual’s closest relatives. Owing to inbreeding, their relatedness will be above the value of 1/2 that applies under random mating. Thus an individual should be more altruistic than usual to his immediate kin. But other neighbors who are not immediate kin are now also closely related, and it is this reduced contrast between neighbors and close kin that will give what is probably the most striking effect: we expect less nepotistic discrimination and more genuine communism of behavior. At the boundary of the local group, however, there is a sharp drop in relatedness, [. . . ] this drop may be such as to promote active hostility between neighboring groups" [131, p. 340].

Empirically, persistent inbreeding has raised the relatedness of siblings as high as 7/10 in some samples (compared to 1/2 in a randomly mating population), and the average sibling relatedness among three persistently inbreeding groups was 6/10 [260]. Note that cousins in such a society share almost 75% more genes than expected in a randomly mating population; their relatedness is (0.6 × 0.6 × 0.6 = 0.216), compared to (0.5 × 0.5 × 0.5 = 0.125) in a randomly mating population. The expected percentage of excess homozygosity arising from consanguineous mating is known as the inbreeding coef- ficient of the individual with respect to the local sub-population, or FIS. For example, for the offspring of first cousins, FIS is 1/16. Inbreeding coefficient (FIS) of an individual is equal to kinship coefficient of the individual’s parents; as seen before, coefficient of kinship of the first cousin parents is also fFC = 1/16. In fact, inbreeding coefficient FIS mea- sures homozygosity in individual genomes excess to the expected frequency under random mating in the sub-population. H Therefore, inbreeding coefficient F can be calculated as 1 − obs,i , where H is the observed heterozygosity of an IS Hexp,i obs,i individual, and Hexp,i is expected heterozygosity within the sub-population which is equal to Hobs,p; the average of observed heterozygosity of individuals within the sub-population. Consanguineous mating is not the only way of producing excess homozygosity. If a sub-population is genetically iso- lated and thus “cryptically" inbred with respect to the total population, there will be excess average homozygosity in individual genomes of the sub-population compared to the expected frequency under random mating across the total population. This expected increase in homozygosity arising from genetic isolation of the sub-population is called FST; the average inbreeding coefficient of sub-populations relative to the total population. Pairwise FST measures popula-

165 tion differentiation, producing higher values when two populations have large between-population differences but small H within-population differences. Inbreeding coefficient F can be calculated as 1 − obs,p , where H is the mean of ST Hexp obs,p observed heterozygosity within the sub-population, and Hexp is expected heterozygosity in the total population. Thus, if the alleles in an individual are identical by descent in excess of the expected frequency in the total population, and due to population inbreeding, further inbreeding by consanguineous marriages can not increase homozygosity in those alleles. The “inbreeding coefficient" or Fhbd captures both FIS and FST, and measures the inbreeding of an Individual H relative to the total population. The inbreeding coefficient F can be calculated as 1 − obs,i , where H is the observed hbd Hexp obs,i heterozygosity of an individual, and Hexp is expected heterozygosity within the sub-population. This is the coefficient reported in footnote 15.

B.1.1 Ethnicity and relatedness

Genomic methods allow us to measure relatedness between and within populations. The of two popu- lations can be measured by FST known as the coancestor coefficient: “the probability that two alleles at a given locus selected at random from two populations will be different, [. . . ] FST is strongly related to how long two populations have been isolated from each other. When two populations split apart, their genes can start to change as a result either of random genetic drift or natural selection" [238, p. 481]. As shown by [136], for large populations, genetic distance between two populations implies genetic similarity within those populations. Therefore, FST also measures the coefficient of kinship between members of the same population; for a random mating population, FST is simply half of the coefficient of relatedness, r.1 Between some ethnic groups, empirical estimates suggest that relatedness is not far above zero, so that co-ethnics are unlikely to be sufficiently related for kin selection to substantially influence behavior. For example according to [56], the genetic distance between English and French populations is FST = 0.0024. Therefore, in a world consisting of only English people, the kinship of any randomly chosen pair is zero, but in a world consisting of both English and French populations, two random English (or French) people have a relatedness of only r = 0.0048 (in between the relatedness of 3rd and 4th cousins under random mating). Perhaps surprisingly, this is about how closely groups of friends are related to one another. [59] find that friends’ genotypes tend to be positively correlated, and the increase in similarity relative to strangers is at the level of fourth cousins. The authors were aware that some of the similarity in genotypes can be explained by “a simple preference for ethnically similar others" or “distant relatives" (Ibid., p. 10797). Therefore, they applied strict controls for such factors in their study. This suggests that there might be “some sort of kin detection system in humans [. . . ] such that, for each individual encountered, an unspecified system may compute and update a continuous measure of kinship that corresponds to the genetic relatedness of the self to the other individual" (Ibid., p. 10800).

1 An individual’s coefficient of kinship with someone randomly chosen from his own population is FST while his kinship with someone from the other population is −FST. “Negative relatedness implies that two individuals share fewer genes than average" [99, p. R663].

166 B.2 Religion and Consanguinity

As noted in the text and in Table 3.3, religious traditions have diverse relationships to cousin marriage practices. Below we summarize Christian and Islamic views on consanguinity.

Catholic and Orthodox Christianity. In the early Christian era, consanguineous marriages were common,2 and there are few direct biblical prohibitions on marriage among close kin.3 The laws and customs of the Roman empire with respect to domestic life, “conformed to patterns that were wide-spread throughout Mediterranean and Middle East" that permitted and even encouraged consanguinity. Thus the Christian religion emerged in a setting in which such prac- tices were tolerated. In other regions to which Christianity later spread such as and , the earlier presence of close marriage was yet more marked [115, p. 39]. However, in the centuries after the conversion of the Roman Empire to Christianity, radical changes occurred with respect to the issue of marriage to kin. For the first two-hundred years after Constantine, the legality of consanguinity was in flux.4 Roman Catholic authority stepped in to settle the issue. Pope Gregory I (AD 540-604) in a letter to Augustine, ‘the first Bishop and apostle of the English’ confirms that a certain secular law in the Roman state allows cousin marriage and advises that ‘it is necessary that the faithful should only marry relations three or four times removed, while those twice removed must not marry in any case’ [115, p. 36]. This letter was significant because through it, the Church asserted (implicitly) its jurisdiction over marriage and the family, a position that Christian churches still maintain today.

Biological relationship Genetic relationship Roman classification Germanic classification First cousin Third degree Fourth degree Second degree Second cousin Fifth degree Sixth degree Third degree Third cousin Seventh degree Eighth degree Fourth degree

Table B.2.1: Genetic and religious classification of consanguinity (Bittles 2012, p. 16).

Within the Roman Catholic Church, the stringency of the restrictions continued to increase over time. A canon attributed to various popes and embodied in a letter of Pope Gregory III (AD 731-741) forbade marriage to the seventh degree of consanguinity in AD 732 (i.e. to 3rd cousins, see Table B.2.1), and confusion resulting from differences in the Germanic and Roman systems of consanguinity evaluation eventually led to the ban being extended all the way to 6th cousins in 1076 (7th degree in the Germanic system). Not only were these extended prohibitions attached to blood ties, but they were also assigned to affinal kinship such as marriage to the dead brother’s widow, spiritual kinship such as marriage

2On one hand, it has been suggested that with the aim of favoring outbreeding, pre-Christian Roman law forbade all unions among people within the seventh degree of consanguinity, and the Church “initially followed the Roman" reg- ulations on consanguineous marriages [57, p. 29,31]. On the other hand, some authors argue that there is not enough evidence suggesting prohibition of consanguineous unions at the very earliest stage in Roman history [115, p. 53]. How- ever, it seems that in the early Christian era, consanguineous marriages were common either due to “relaxed" [57, p. 29] earlier prohibitions or because Roman “law had nothing to say against most forms of close marriage" [115, p. 39]. For example, the first Christian emperor, married his son and daughters to his half-brother’s children [115, p. 53].

3The only types of forbidden relationships in the Bible are found in the Levitical prohibitions in The . For example, a man could not marry his mother (Lev. 18:7), his sister (Lev. 18:12, 20:19), his father’s sister (Lev. 18:12, 20:19), his mother’s sister (Lev. 18:13, 20:19), or his son’s daughter (Lev. 18:10). A man could marry his niece [57, p. 29].

4Following the acceptance of Christianity as the official religion of the Eastern Roman Empire, Theodosius the Great (AD 378-395) condemned unions between first cousins in a law made in AD 384, although it was still possible to effect such marriages by imperial dispensation. In AD 405, his son, Arcadious (AD 378-408) legalized cousin marriages once again for the Eastern Roman Empire, however, his younger brother and the Western Roman emperor, Honorious (384- 423), who himself married his dead wife’s sister, permitted marriages among cousins in AD 409 only if the parties obtained an imperial dispensation. It was not until later that under civil legislation they became freely permissible once again. In AD 533, the validity of cousin marriage in the secular law of the Eastern Roman Empire, the institutes of Justinian (482-565), was recognized as perfectly legitimate (Goody, 1983, p. 55; Cavalli-Sforza et al., 2004, p. 29-30; and Bittles, 2012, p. 16).

167 to godchildren and fictional kinship such as adoption, “producing a vast range of people, often resident in the same locality, that were forbidden to marry" [115, p. 56]. While the bans were later loosened and dispensations have typically been available in specific cases, prohibitions on first cousin marriage remain in effect today.5 Similarly, within the Greek Orthodox Church, first-cousin marriages were prohibited by AD 692, a policy which remains in place. It is worthwhile to briefly discuss the possible reasons behind Church prohibitions of consanguineous marriages. From Gregory’s letter to Augustine, it is clear that he understood the potential negative health consequences of inbreeding - ‘the offspring of such marriages cannot thrive’ - and that he had a moral opposition to incest, derived from his reading of scripture [115, p. 36-37]. More importantly for our purposes, it seems that the potential impact on social behavior had not gone unnoticed by the Church. St. , writing in the early 5th century, noted that restrictions on consanguinity expand the scope for mutually beneficial cooperation: “for affection is now given its proper place, so that men, for whom it is beneficial to live together in honorable concord, may be joined to one another by the bonds of diverse relationships: not that one man should combine many relationships in his sole person, but that those relationships should be distributed among individuals, and should thereby bind social life more effectively by involving a greater number of persons in them" [27, Book 15, Ch. 16, italics added]. Similar arguments were later echoed by Thomas Aquinas, who worried that “consanguineous marriages would ’prevent people widening their circle of friends’" [115, p. 57].6 This suggests an awareness of the sorts of favoritism that might generate corruption. It seems that as the Church extended prohibitions to remoter degrees of consanguineous marriage, large kinship groups such as clans, lineages, and tribes gradually disappeared throughout Europe. There is a large and significant negative correlation between the spread of Christianity (for at least 500 years) and the absence of clans and lineage groups in Europe [121, p. 309] which may have facilitated the emergence of impartial economic and political institutions, nuclear families and individualistic culture.

Protestant and Anglican Christianity. As the Protestant emphasized a return to scripture, ’s (1483-1546) views on consanguinity were based on the Levitical prohibitions and did not include a ban on cousin marriages; similarly, (1509-1564) and his followers based their views on scripture, though they extended Levitical guidelines to a wife’s relatives so that affinity and consanguinity were treated equivalently. The Church of England grew directly out of a dispute between Henry VIII (1491-1547) and the Catholic Church about , and its creation led to changes in consanguinity law that allowed for first cousin marriage [115, p. 168-177]. In the emergent Protestant denominations, marriages up to and including first-cousin unions were permitted under Re- formed Church law.7 However, “many Protestant reformers discouraged marriages within the third degree. While Luther did not think they were positively harmful, he considered them to be ‘inexpedient on the ground that people would marry without love merely to keep property within family, while poor women would be left spinsters’" [115, p. 181-2]. In prac- tice, consanguinity rates remained low in the Protestant world, as seen in Figure 3.2 in Section 3.3 below, perhaps due to ingrained norms from centuries of prohibition.

Islam. Today, consanguinity rates are high in Islamic countries, though “contrary to widespread Western opinion, there is no specific guidance in the Qur’an to encourage consanguinity" [43, p. 22] and “marriage between cousins is not prescribed by the Qur’an" [67, p. 33]. The permitted degrees of consanguinity within Islam closely match the Levitical

5The prohibition on consanguineous marriages was reduced to 4th degree relationships (third cousin) in AD 1215. The ban also reduced to 2nd degree relationships (first cousin or closer) for South American Amerindians in 1537, later for the indigenous population of The , and for black populations in 1897. After 1917, the consanguinity prohibition was reduced in all populations, initially to second cousins or closer and in 1983 to first cousins or closer, which remains current law [57, 43, 115].

6[115] also suggests a number of other possible, complementary reasons for the bans, including attempts to increase fealty to the Church by diminishing the role of the family (rooted in ’ entreaties to deny the family, e.g. Matthew 10: 34-37) and Church desire to acquire property by leaving the deceased without eligible heirs. The latter argument finds support in the tendency of the Church to oppose any practice that expanded the number of kin, e.g. , polygamy, and adoption.

7“An exception to this generalization is provided by the State Lutheran Church of Sweden, which until 1680 refused to recognize first-cousin unions, and from 1680 to 1844 approval was required from the King of Sweden before a first-cousin union could proceed" [43, p. 20].

168 guidelines with one exception: a prohibition of uncle-niece marriage (Qur’an 4:23). In addition to the Qur’an, Muslims recognize two other sources of Islamic Law (Sharia) which bear on consanguinity: the (oral pronouncements of the Prophet Mohammad) and the Sunnah (the deeds of the Prophet).8 “The overall attitude to consanguineous marriage within Islam is somewhat ambiguous" [43, p. 22] since a Hadith of the Prophet stated: “Do not marry cousins as the offspring may be disabled at birth" [8, p. 314]; while, according to the Sunnah, two of Muhammad’s wives were his first cousins, and the Prophet Muhammad married his daughter, Fatima to his cousin .9 “Thus, [despite the content of the Hadith] for Muslims, the practice of cousin marriage could potentially be interpreted as following the example provided by the Sunnah" [43, p. 22]. In any case, evidence suggests that cousin marriage customs in Islamic countries “probably antedated the spread of Arabs" [57, p. 284] and “predated Islam" [67, p. 33]. A preference for consanguinity is reflected in “the well known Iranian proverb ‘the first cousin’s marriage contract has been recorded in heaven,’" though the preference “is merely a cultural and local custom rather than a religious belief" [8, p. 315]. Nevertheless, cousin marriage is not forbidden or clearly discouraged in Islam, and it is likely that “Islam served as a vehicle for the geographical spread of the endogamous practices [. . . ] the model was adopted not for religious reasons but because it was the practice of a prestigious group, the Arabs, bearers of the message of the Qur’an" [67, p. 33-4]. One explanation suggesting why the conversion to Islam may have encouraged the high prevalence of cousin marriages in Islamic countries is the Quranic law on the inheritance of property which entitled daughters to inherit half of the amount received by sons, and wives to receive a share from their husbands. A dowery (Mahr) also is specified in Islamic law as part of the marriage arrangement. Under these circumstances, consanguineous marriages could prevent part of the family wealth from leaving the clan [115, p. 32].

Other religions. In Table 3.3, we summarize other religions’ views on consanguineous marriage based on [43], p. 23-28. Suffice to say that there has historically been a great diversity of practices around the world, with many groups banning cousin marriage outright and certain groups favoring even closer marriages (e.g. Dravidian Hindu and Sephardic Jewish preference for uncle-niece marriages).

8In Shi’a Islam, law also derives from the oral pronouncements and deeds of the twelve Imams.

9The first Imam, in the Shi’a tradition, and the fourth of the Rashidun Caliphate according to Sunni tradition.

169 B.3 Empirical data description

Cross country regressions: Variable name Description and source Consanguinity As a working definition, unions contracted between persons bi- ologically related closer than second cousins are categorized as consanguineous. [42] provided a compilation of the propor- tion of consanguineous marriages from 262 journal papers and book chapters which report consanguinity percentages of 448 samples of different sizes in different locations of 72 countries based on household surveys, Roman Catholic Church dispensa- tions, parish records, civil registrations, marriage registrations and surveys on blood donors, obstetric inpatients, hospital out- patients, hospital births, etc that include information for around 9.8 million marriages. The sample of countries is non-random since the data were collected for other purposes (e.g. the public health studies disproportionately sample high consanguinity so- cieties since these are the societies with higher rates of genetic disorders), but the coverage is broad. Data collection periods vary from 1922 to 2013, with around 90% after 1950, 75% af- ter 1960, 50% after 1970, 40% after 1980, 30% after 1990 and 15% after 2000. We collected this data from www.consang.net in December 2015 and computed the mean percentage of con- sanguineous marriages for each country by weighting the indi- vidual estimates according to sample size. Note that (i) the data on Czechoslovakia in the period 1961-64 is used for both the Czech Republic and Slovakia; (ii) some sample studies were re- ported twice, once at the city level and once for the province, in which case we considered only the province level report; (iii) we also ignored the studies listed as “Minorities and Isolates" to avoid overweighting outliers. Source: [42]. Corruption Measures financial corruption in the form of demands for spe- cial payments and bribes connected with import and export li- censes, exchange controls, tax assessments, police protection, or loans, and actual or potential corruption in the form of exces- sive patronage, nepotism, job reservations, ‘favor-for-favors’, secret party funding, and suspiciously close ties between pol- itics and business. In our tables, this measure is averaged over 1984-2011. [164] and [12] used the average of the months of April and October in the monthly index between 1982 and 1995, and we did so in the replication of their tables. Source: Political Risk Services, International Country Risk Guide (2012). Ethnic fractionalization Measures ethnic fractionalization; the probability that two ran- domly selected individuals from a population belong to differ- ent ethnic groups. Source: [12].

170 Ethonolinguistic fractionalization Average value of five different indices of ethonolinguistic frac- tionalization. Its value ranges from 0 to 1. The five component indices are: (1) index of ethonolinguistic fractionalization 1960, which measures the probability that two randomly selected peo- ple from a given country will not belong to the same ethno- linguistic group (the index is based on the number and size of population groups as distinguished by their ethnic and linguis- tic status); (2) probability of two randomly selected individuals speaking different languages;( 3) probability of two randomly selected individuals do not speak the same language; (4) per- cent of the population not speaking the official language; and (5) percent of the population not speaking the most widely used language. The data is collected using [79]. The sources of the components of the average index are (1) [48]; (2) [194]; (3) [221]; (4) and (5) [128]. Source: [164]. Family ties Measures the strength of family ties by looking at three vari- ables from the World Value Survey (WVS) and the European Social Survey (EVS) which capture beliefs regarding the im- portance of the family in the respondent’s life, the duties and responsibilities of parents and children, and the love and respect for one’s own parents. The first question asks how important the family is in one person’s life and can be answered with (i) Very important; (ii) Rather important; (iii) Not very important; (iv) Not at all important, which in our measure of family ties, take values of 4 to 1, respectively. The second question asks whether the respondent agrees with one of the two statements (taking the values of 2 and 1, respectively): (i) Regardless of what the qualities and faults of one’s parents are, one must always love and respect them; (ii) One does not have the duty to respect and love parents who have not earned it. The third question prompts respondents to agree with one of the following state- ments (again taking the values of 2 and 1, respectively): (i) It is the parents’ duty to do their best for their children even at the expense of their own well-being; (ii) Parents have a life of their own and should not be asked to sacrifice their own well-being for the sake of their children. Following [13], we extract the first principal component from the whole data set with all individual responses for the original variables. Source: WVS (Six waves, 1981-2014) and EVS (four waves, 1981-2008). Trust Measures generalized trust by considering the following ques- tion from the World Value Survey (WVS) and the European So- cial Survey (EVS): “Generally speaking, would you say that most people can be trusted or that you can’t be too careful in dealing with people?" The answer could be either “Most people can be trusted" or “Can’t be too careful", which in our measure of trust, take values of 2 and 1, respectively. Source: WVS (Six waves, 1981-2014) and EVS (four waves, 1981-2008).

171 Cousin term measure The fraction of the population of a country that speaks a lan- guage in which there are words used for distinguishing (at least some) first cousins from each other. Based variable 27 of the Ethnographic Atlas, a dummy variable is created which takes value 1 if the language of the ethno-linguistic group in the Ethnographic Atlas fully or partially distinguishes first cousins from each other (Sudanese, Iroquois, Omaha, Crow), and takes value 0 otherwise; for ethnicities which the language does not distinguish any first cousins (Eskimo or Hawaiian). Following [16]’s methodology, the country-level cousin term measure is created by population weighted average of the dummy variable for all ethnic groups living within a country. Predicted genetic diversity The expected heterozygosity (genetic diversity) of a given coun- try as predicted by (the extended sample definition of) migra- tory distance from East Africa (i.e., Addis Ababa, Ethiopia). This measure is calculated by applying the regression coef- ficients obtained from regressing expected heterozygosity on migratory distance at the ethnic group level, using the world- wide sample of 53 ethnic groups from the HGDP-CEPH Hu- man Genome Diversity Cell Line Panel. The expected heterozy- gosities and geographical coordinates of the ethnic groups are from [216]. Expected heterozygosities are constructed by mea- suring actual heterozygosity within an ethnic group at a sample of selectively-neutral loci and averaging over the loci. Source: [26]. Geographical controls (i) Ruggedness Terrain ruggedness measures small-scale terrain irregularities. The ruggedness calculation takes a point on the earth’s surface and calculates the difference in elevation between this point and each of the points on the grid 30 arc-seconds (926 meters on a meridian) in each of the eight major directions of the compass (north, northeast, east, southeast, south, southwest, west, and northwest). The terrain ruggedness index at the central point is given by the square root of the sum of the squared differences in elevation between the central point and the eight adjacent points. Then by averaging across all grid cells in the country not covered by water, each cell weighed by its latitude-varying sea-level surface, the average terrain ruggedness of the coun- try’s land area is obtained. Ruggedness is measured in hundreds of meters of elevation difference for grid points 30 arc-seconds apart. Source: [204]. (ii) Soil suitability for agriculture The soil suitability component, based on soil carbon density and soil pH, of an index of land suitability for agriculture. The soil suitability data are reported at a half-degree resolution by [218] and are aggregated to the country level by [183] by averaging across grid cells within a country. For additional details on the soil suitability component of the land suitability index, the in- terested reader is referred to the definition of the land suitability variable above. Source: [26].

172 (iii) Mean elevation The mean elevation of a country in km above sea level, calcu- lated using geospatial elevation data reported by the G-ECON project [201] at a 1-degree resolution, which, in turn, is based on similar but more spatially disaggregated data at a 10-minute resolution from [199]. The measure is thus the average eleva- tion across the grid cells within a country. The interested reader is referred to the G-ECON project website for additional details. Source: [26]. (iv) Mean temperature The intertemporal average monthly temperature of a country in degrees Celsius per month over the 1961-1990 time period, cal- culated using geospatial average monthly temperature data for this period reported by the G-ECON project [201] at a 1-degree resolution, which, in turn, is based on similar but more spa- tially disaggregated data at a 10-minute resolution from [199]. The measure is thus the spatial mean of the intertemporal aver- age monthly temperature across the grid cells within a country. See the G-ECON project website for additional details. Source: [26]. (v) Mean precipitation The intertemporal average monthly precipitation of a country in mm per month over the 1961-1990 time period, calculated us- ing geospatial average monthly precipitation data for this period reported by the G-ECON project [201] at a 1-degree resolution, which, in turn, is based on similar but more spatially disaggre- gated data at a 10-minute resolution from [199]. The measure is thus the spatial mean of the intertemporal average monthly pre- cipitation across the grid cells within a country. The interested reader is referred to the G-ECON project web site for additional details. Source: [26]. (vi) Percentage of population living in tropi- cal/subtropical/temperate zones The percentage of a country’s population in 1995 that resided in areas classified as tropical/subtropical/temperate by the Köppen-Geiger climate classification system. This variable was originally constructed by [98] and is part of Harvard University’s CID Research Datasets on General Measures of Geography. Source: [26]. (vii) Percentage of land near a waterway The percentage of a country’s total land area that is located within 100 km of an ice-free coastline or sea-navigable river. This variable was originally constructed by [98] and is part of Harvard University’s CID Research Datasets on General Mea- sures of Geography. Source: [26]. Latitude The absolute value of the latitude of the country, scaled to take values between 0 and 1. The data is collected from [62]. Source: [164]. Log GNI per capita Logarithm of GNI per capita in current US dollars averaged over the period 1984-2011. Source: World Bank Development Indicators (WDI), Data retrieved Online in December 2015. Log GNP per capita Logarithm of GNP per capita in current U.S. dollars averaged over the period 1970-1995. The data is collected from WDI. Source: [164]. Log population Logarithm of population averaged over the period 1984-2011. Source: World Bank Development Indicators (WDI), Data re- trieved Online in December 2015.

173 Log population (1960) Logarithm of population in 1960. This is the variable used in [12] for the country size. Our data source might be different. Source: Penn World Table, Data retrieved Online in December 2015. Regional dummy variables Dummy variable for (1) Sub-Saharan Africa, (2) East Asia Pa- cific, and (3) Latin America and Caribbean. Source: World Bank (http://www.worldbank.org/en/country). Legal origin dummy variables Identifies the legal origin of the 212 Company Law or Commer- cial Code of each country. There are five possible origins: (1) English Common Law; (2) French Commercial Code; (3) Ger- man Commercial Code; (4) Scandinavian Commercial Code; and (5) Socialist/Communist Laws. The data is collected using [165], [20] and [62]. Source: [164] Religion dummy variables Identifies the percentage of the population of each country that belonged to the three most widely spread religions in the world in 1980. For countries of recent formation the data is avail- able for 1990-1995. The numbers are in percent (scale from 0 to 100). The three religions identified here are: (1) Roman Catholic; (2) Protestant and (3) Muslim. The residual is called "other religions". The data is collected using [32], [263], Statis- tical Abstract of the World (1995), [252], [62]. Source: [164].

Table B.3.1: Description of the data for cross-country analysis

174 Within country (Italy) regressions: Variable name Description and source Consanguinity The data reported by [57] on consanguineous marriages, for 5- year periods from 1910 to 1964, comes from the Vatican’s Se- cret Archives in which requests for dispensations from the con- sanguinity impediment, sent by the Bishops to the Sacred Con- gregation of the Sacraments in Rome, were recorded with in- formation of the name of the diocese of marriage, the date and the degree of relationship between the spouses. They grouped 280 Italian dioceses into the provinces present in 1961, for which the number of total marriages was available from year to year. They report four major degrees of consanguinity; uncle- niece/aunt-nephew, first cousins, first cousins once-removed, second cousins. However, the bishops of the islands (Sardinia and Sicily) were granted the privilege to accord the dispensation for consanguineous unions beyond degree III, i.e. first cousins once-removed and second cousins were not recorded in the Vat- ican Archives, and therefore are not reported for Sicily and are obtained from another source for Sardinia. Thus, we have only considered uncle-niece/aunt-nephew and first cousin unions to compute consanguinity rates for Italian provinces. We have also chosen 5-year periods from 1945 to 1964 for which the data is available in all reported provinces. We computed the consan- guinity rate for each province as the average of consanguin- ity percentages of four 5-year periods (1945-1949, 1950-1954, 1955-1959, 1960-1964) weighted by the number of marriages. For newly created provinces, we used the consanguinity rate of the province they belonged to in year 1961. This gave us con- sanguinity rates of 108 Italian provinces. Source: [57]. Corruption Province-level number of associative crimes reported by the po- lice to the court per 100,000 inhabitants averaged over 2000- 2013. Associative crimes include criminal association (article 416: when three or more persons associate together in order to commit more than one crime) and Mafia-type association (arti- cle 416-bis: participating in a Mafia-type unlawful association including three or more persons). The data is available for 103 Italian provinces. Source: ISTAT. Share of agriculture Province-level share of agriculture in total value-added aver- aged over 2000-2013. Source: ISTAT. Log value added per capita Logarithm of province-level total value added (at current prices millions Euros) averaged over 2000-2013. Source: ISTAT. Log population Logarithm of province-level population averaged over 2000- 2013. Source: ISTAT. Civic involvement An integer index ranging from 1 to 9 which combines five vari- ables observed in the late 19th century and early 20th century; (i) Membership in mutual aid societies (1873-1904); (ii) Mem- bership in cooperatives (1889-1915); (iii) Strength of the mass parties (1919-1921); (iv) Turnout in the few relatively open elections (1919-1921) before Fascism brought authoritarian rule to Italy; (v) The longevity of local associations founded before 1860. Source: [214]. Dominations A categorical variable that identifies, for each province, the ad- ministration that presided during the period of the Spanish dom- ination in Italy, 1560-1659; Spanish, Papal, Austrian, Venetian, Sabaudian and Independent. Source: [73].

175 Family types A categorical variable that identifies, for each province, [248]’s classification of family types which is argued to be very simi- lar to what the geography of family types would have been in the Middle Ages. The three family types common in Italy were the following; (i) incomplete stem family (characterized by an extended family with several generations living under one roof and the inheritance of the house and the land by one son – gener- ally, the eldest – who stays at home); (ii) Communitarian family (characterized by an extended family in which all the sons can get married and bring their wives to the family home and equal division of inheritance among children); (iii) Egalitarian nuclear (characterized with total emancipation of children in adulthood to form independent families and equal division of inheritance among children). Source: [78]. Voluntary organizations Region-level number of voluntary organizations established be- fore 1965 per 100,000 inhabitants at year 2001. The variable takes the same value for all provinces within a region. Source: ISTAT. Family ties Region-level fraction of youth aged 18-34 living with at least one parent averaged over 2002-2009. The variable takes the same value for all provinces within a region. Source: ISTAT. Number of active years of Number of active years of Catholic archdioceses in Italian archdioceses provinces. The raw data is for 42 Roman Catholic ecclesias- tical provinces in Italy. Each ecclesiastical province is served by a metropolitan archdiocese. For each ecclesiastical province, we calculated the number of active years of its archdioceses from the date of establishment of the first archdiocese to the present, subtracting the number of years which there were no active archdiocese in the province (the archdiocese was sup- pressed). Some of archdioceses are branched from pre-existing archdioceses, in which case we considered the date of the es- tablishment of the pre-existing archdioceses. Some archdioce- ses lost territory to other dioceses which were promoted as the new archdioceses, in which case we considered the archdiocese of the ecclesiastical province being active. We matched the data on ecclesiastical provinces to today’s administrative provinces. Source: www.gcatholic.org. Alternative corruption metrics (i) Corruption crimes (region level, N=20) Region-level number of corruption crimes convicted of felony by final judgment per 100,000 inhabitants averaged over 2000- 2011. Corruption crimes include the following types of crimes defined under the title offenses against public administration: crimes of peculation, malversation, bribery, and corruption. Source: ISTAT.

176 (ii) Infrastructure (province level, N=92, and region level, N=20) The difference between a measure of the value of existing phys- ical quantities of public infrastructure and the cumulative price government has paid for public capital stocks. Where the differ- ence is larger between the monies spent and the existing phys- ical infrastructure, more money is being siphoned off to mis- management, fraud, bribes, kickbacks, and embezzlement; that is, corruption is greater. The measure is created for Italy’s 92 provinces and 20 regions as of the mid-1990s, controlling at the regional level for possible differences in the costs of public con- struction. Inspecting the data, the province-level ratios reported in the appendix appear to be the inverse of the ratios reported in the text for regions, so we have taken the inverse to align the in- terpretation of the province and region-level measures. Source: [108]. (iii) European Quality of Governance Index (EQI 2010) (region level, N=20) This measure is constructed from recent surveys that elicit perceptions of and experience with governmental corruption, as well beliefs about impartiality and quality in the provision of government services. This research was funded by the EU Commission for Regional Development. Source: [58], avail- able here (https://nicholascharron.wordpress.com/european- quality-of-government-index-eqi/). (iv) Institutional performance (region level, N=20) This measure is an index constructed by combining measures of policy processes and internal operation of government (Cab- inet Stability, Budget Promptness, Statistical and Information Services), policy content (Reform Legislation, Legislative In- novation), and policy implementation (Daycare Centers, Fam- ily Clinics, Industrial Policy Instruments, Agricultural Spend- ing Capacity, Local Health Unit Expenditures, Housing and Ur- ban Development, Bureaucratic Responsiveness) over the pe- riod 1978-1985. Source: [214], see Ch. 3. Latitude The absolute value of the latitude in the geographic center of each province. Geographic centroids of Italian provinces are generated using polygons map of Italian administrative divi- sions. Source: GADM database of Global Administrative Ar- eas. Mean temperature (in Celsius degrees)/ The average of mean temperature/precipitation in the geograph- Mean precipitation (in meters) ical area of each province. The means of the entire annual cycles of precipitation and temperature are constructed for the time pe- riod between 1901 and 2014 based on monthly global maps (0.5 by 0.5 degree cells), CRU-TS 3.23 Climate Database. Source: [138]. Tropical climate The indicator variable for “tropical climate" takes value 1 if the geographic center of a province is classified as being either trop- ical or subtropical, and zero otherwise. The data is constructed based on Thermal Climate Zones of the World, a global raster datalayer with a resolution of 5 arc-minutes, with each pixel containing a class value for the dominant thermal climate found in the pixel. Source: FAO’s Food Insecurity, Poverty and Envi- ronment Global GIS Database (FGGD).

177 Suitability for agriculture The average of suitability for agriculture in the geographical area of each province. Suitability for agriculture represents the fraction of each grid cell that is suitable to be used for agricul- ture. It is based on the temperature and soil conditions of each grid cell. The data is construed based on the global map (0.5 by 0.5 degree cells) obtained from Suitability for Agriculture, Atlas of the Biosphere. Source: [217]. Distance to coast Distance of the geographic center of each province from the (in kilometers) coast is constructed based on a coastline physical vector map in 1:10m resolution. Source: Natural Earth. Slope The average of slope in the geographical area of each province. (in percent) Slope measures the mean uphill slope percentage between each grid cell and its neighbours. The data is constructed based on the global map (30 by 30 arc-second cells) obtained from Grid- cell-level Data on Terrain Ruggedness. Source: [204]. Ruggedness The average of ruggedness in the geographical area of each (in 100 meters) province. Ruggedness measures the elevation distance of each grid cell and its neighbours. The data is constructed based on the global map (30 by 30 arc-second cells) obtained from Grid- cell-level Data on Terrain Ruggedness. Source: [204]. Elevation The average of elevation in the geographical area of each (in 100 meters) province. Elevation is constructed based on the global map (30 by 30 arc-second cells) obtained from Global 30 Arc-Second Elevation data set. Source: GTOPO30 data set.

Table B.3.2: Description of the data for within-country analysis (Italy)

178 B.4 Replication and Extension of Alesina et al. (2003)

Here we report the results of regression analysis using the data from [12] in concert with our data from [42]. The analysis tested the robustness of the claim that ethnic fractionalization causes corruption. We first replicate their findings and then extend them by including our measure of consanguinity. First, we note that there is a sizable correlation between ethnic fractionalization and consanguinity (Spearman’s ρ = 0.50, p- value < 0.001, N = 72). [12] report find a significant correlation between their ethnic fractionalization index and corruption, even after controlling for legal origins, which we replicate in regression (1) of Table B.4.1. In column (2), we replicate their analysis including only those countries for which we have consanguinity data. Including consanguinity in specifications (3) and (4) causes the effect of fractionalization to disappear while consanguinity is signif- icant. Columns (5) and (6) show that the effect of consanguinity on corruption is robust to using the cousin term measure (our IV from Section 3.3.1) in reduced form and as an instrument for consanguinity. When we introduce additional control variables in Table B.4.2 and include both measures in regression (3) or replace ethnic fractionalization with consanguinity in regression (4) of the table, consanguinity is still significant de- spite including additional control variables; the same is true when we use our IV in reduced form or as an instrument for consanguinity.10

(1) (2) (3) (4) (5) (6) Alesina (2003) With With Consanguinity Cousin term Cousin term VARIABLES Alesina (2003) restricted sample Consanguinity Without EF(2003) reduced form as instrument

Ethnic fractionalization -2.498** -2.777* -0.183 -2.581** -0.235 (0.981) (1.394) (1.196) (1.027) (1.433) Consanguinity -10.38*** -10.49*** -10.71*** (1.579) (1.274) (2.540) Cousin term measure -1.175** (0.484) Log population (1960) 0.0764 0.203 -0.259 -0.253 -0.0752 -0.297 (0.260) (0.340) (0.244) (0.249) (0.252) (0.258) Africa -0.902 0.202 1.475** 1.423** -0.758 1.547* (0.569) (1.061) (0.709) (0.598) (0.525) (0.825) East Asia -1.650** -1.484 -2.249* -2.280* -1.822** -2.268* (0.661) (1.355) (1.244) (1.220) (0.702) (1.263) Latin America -2.127*** -1.101** -2.888*** -2.921*** -2.690*** -2.962*** (0.496) (0.523) (0.578) (0.523) (0.532) (0.703) Constant 7.295*** 6.441*** 9.112*** 9.045*** 8.499*** 9.358*** (1.052) (1.485) (0.996) (0.995) (1.048) (1.162)

Observations 122 64 64 64 118 61 R-squared 0.278 0.172 0.519 0.519 0.342 0.508 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.4.1: Model specification Table (13), column (2) from Alesina et al. (2003), including consanguinity.

10Our replications of previous studies give slightly different results for [12] in regression (1) of Table B.4.1 and Table B.4.2, possibly because we used different data sources for population and regional dummies.

179 (1) (2) (3) (4) (5) (6) Alesina (2003) With With Consanguinity Cousin term Cousin term VARIABLES Alesina (2003) restricted sample Consanguinity Without EF(2003) reduced form as instrument

Ethnic fractionalization -1.029 -1.635 -0.225 -0.858 0.242 (0.760) (0.982) (0.963) (0.852) (1.159) Consanguinity -6.076*** -6.251*** -7.746** (2.092) (1.909) (3.420) Cousin term measure -0.884** (0.435) Log GNP per capita (1970-95) 1.028*** 1.273*** 0.879*** 0.868*** 0.918*** 0.777** (0.176) (0.198) (0.276) (0.274) (0.191) (0.378) Log population (1960) 0.609*** 0.796*** 0.347 0.346 0.508** 0.240 (0.207) (0.287) (0.284) (0.281) (0.212) (0.397) Africa 1.224** 2.371** 2.344** 2.272*** 1.064* 2.345** (0.576) (0.988) (0.924) (0.820) (0.575) (0.972) East Asia -0.375 -0.1000 -0.983 -1.040 -0.568 -1.218 (0.616) (1.049) (1.153) (1.138) (0.685) (1.268) Latin America -0.672 0.304 -1.230* -1.285* -1.194** -1.627 (0.444) (0.467) (0.701) (0.646) (0.507) (1.109) Socialist legal origin 1.065** 0.942 0.546 0.578 1.000* 0.478 (0.521) (0.677) (0.848) (0.839) (0.585) (1.009) French legal origin -0.0753 -0.331 -0.181 -0.175 0.00392 -0.132 (0.373) (0.548) (0.489) (0.482) (0.375) (0.529) German legal origin 0.103 -1.008 -0.0499 0.0336 -0.0173 0.246 (0.638) (0.686) (0.609) (0.513) (0.616) (0.748) Scandinavian legal origin 2.157*** 1.482** 1.409*** 1.468*** 2.662*** 1.437** (0.472) (0.642) (0.507) (0.456) (0.561) (0.549) Constant -3.978** -6.617*** -1.137 -1.092 -2.397 0.194 (1.764) (2.412) (3.411) (3.371) (2.099) (4.994)

Observations 120 62 62 62 116 59 R-squared 0.564 0.623 0.695 0.695 0.583 0.682 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.4.2: Model specification Table (13), column (3) from Alesina et al. (2003), including consanguinity.

180 B.5 Additional analysis

B.5.1 Additional cross-country analysis

While in the paper we use the ICRG index as our measure of corruption, the basic pattern of correlation between corruption and consanguinity rates that we observe is similar using alternative measures of corrup- tion. For instance, Transparency International’s 2013 Corruption Perception Index11 is highly correlated with consanguinity (Spearman’s ρ = −0.49, p-value < 0.001, N = 67) as is the measure “Prevalence of Rule Violations" from [96] (Spearman’s ρ = 0.59, p-value < 0.001, N = 66). In fact, consanguinity rates are also correlated with the pre-treatment number of parking tickets per UN diplomat in NYC, reported in [89] (Spearman’s ρ = 0.37, p-value = 0.002, N = 65). And despite the small sample size, consanguinity is highly correlated with the recent cross-country laboratory measure of cheating from [96] (Spearman’s ρ = 0.72, p-value = 0.003, N = 15).

Full Tables Reported in Section 3.3.1

(1) (2) (3) (4) (5) (6) (7) Basic model Basic model and Income instead Latitude as both Income without Income VARIABLES restricted sample Consanguinity of Latitude instrument and Latitude and Latitude

Consanguinity -4.463*** -3.664*** -2.513* -3.566*** -5.087*** (0.769) (0.905) (1.331) (0.884) (0.772) Ethnic fractionalization -0.070 -0.222 0.524 0.0742 -0.123 0.226 0.319 (0.470) (0.670) (0.514) (0.460) (0.417) (0.479) (0.555) Log population -0.106 -0.132 -0.175 0.105 0.297 0.0423 -0.132 (0.105) (0.167) (0.129) (0.125) (0.235) (0.130) (0.127) Latitude 3.001*** 3.586*** 1.898** 1.026 (0.861) (1.255) (0.878) (0.847) Log GNI per capita 0.712*** 1.288** 0.592** (0.256) (0.567) (0.269) Africa -0.272 0.068 0.388 0.765* 1.209** 0.766* 0.217 (0.266) (0.487) (0.376) (0.449) (0.586) (0.451) (0.361) East Asia -0.022 0.340 -0.440 -0.592 -0.275 -0.364 -0.983** (0.310) (0.460) (0.472) (0.394) (0.408) (0.416) (0.441) Latin America -0.184 0.451 -0.807** -0.788** -0.365 -0.604* -1.310*** (0.254) (0.326) (0.327) (0.303) (0.437) (0.317) (0.279) Socialist legal origin -1.347*** -0.903*** -1.212*** -0.706** -0.470 -0.871** -0.998*** (0.277) (0.329) (0.312) (0.333) (0.361) (0.347) (0.360) French legal origin -0.250 -0.550* -0.239 -0.0574 -0.0114 -0.135 -0.114 (0.180) (0.302) (0.206) (0.223) (0.247) (0.225) (0.232) German legal origin 0.571 0.434 0.446 -0.163 -0.619 -0.0436 0.401 (0.355) (0.408) (0.342) (0.276) (0.478) (0.295) (0.366) Scandinavian legal origin 1.123*** 0.690 0.787** 1.131*** 0.984*** 0.878*** 1.313*** (0.330) (0.416) (0.327) (0.260) (0.269) (0.313) (0.321) Constant 3.239*** 3.313** 4.643*** 0.495 -3.269 1.005 5.147*** (0.856) (1.464) (1.018) (1.755) (3.904) (1.789) (0.882)

Observations 134 67 67 67 67 67 67 R-squared 0.527 0.441 0.668 0.695 0.654 0.704 0.632 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.1: Regression analysis of the relationship between consanguinity and corruption. Higher values of the dependent variable imply lower corruption.

Table B.5.1 reports the full regression table underlying Table 3.4 in Section 3.3.1.

11See http://www.transparency.org/research/cpi/overview.

181 (1) (2) (3) (4) (5) (6) VARIABLES Basic model and Religion and Family ties and Trust and Genetic diversity and Geography

Consanguinity -4.463*** -2.411** -3.853*** -4.663*** -3.494*** -3.874*** (0.769) (1.176) (0.722) (0.842) (0.952) (0.999) Ethnic fractionalization 0.524 0.451 0.969 0.846* 0.274 -0.140 (0.514) (0.458) (0.667) (0.486) (0.516) (0.447) Protestant 1.553* (0.791) Catholic -0.904** (0.370) Muslim -1.492*** (0.459) Family ties -1.261** (0.528) General trust 1.562 (1.105) Genetic diversity 48.933 (75.326) Genetic diversity squared -40.988 (56.274) Log population -0.175 -0.284** -0.329 -0.257* -0.223* -0.126 (0.129) (0.133) (0.216) (0.143) (0.133) (0.182) Latitude 1.898** 1.695** 1.138 1.531 2.296** -4.349** (0.878) (0.681) (1.299) (1.001) (0.894) (2.092) Africa 0.388 -0.232 -0.037 0.680 0.639 0.446 (0.376) (0.340) (0.357) (0.570) (0.397) (0.365) East Asia -0.440 -0.497 -0.464 -0.526 -0.582 -0.643 (0.472) (0.469) (0.536) (0.502) (0.517) (0.433) Latin America -0.807** -0.790** -0.776** -0.631* -1.305** -0.895** (0.327) (0.331) (0.366) (0.338) (0.558) (0.369) Socialist legal origin -1.212*** -1.115*** -1.440*** -1.054*** -1.084*** -0.989*** (0.312) (0.234) (0.430) (0.371) (0.351) (0.318) French legal origin -0.239 0.156 -0.259 -0.140 -0.130 -0.047 (0.206) (0.202) (0.286) (0.301) (0.225) (0.227) German legal origin 0.446 0.087 0.008 0.491 0.151 0.122 (0.342) (0.277) (0.572) (0.336) (0.398) (0.388) Scandinavian legal origin 0.787** -0.780 0.255 0.400 0.802** 0.916* (0.327) (0.664) (0.431) (0.444) (0.359) (0.458) Constant 4.643*** 5.805*** 6.127*** 3.224** -9.197 9.296*** (1.018) (1.076) (1.674) (1.496) (25.046) (1.833)

Geographical variables yes Observations 67 67 45 56 67 65 R-squared 0.668 0.762 0.783 0.725 0.689 0.770 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.2: Regression analysis of the relationship between consanguinity and corruption: potential confounds. N varies due to missing data for some countries.

Table B.5.2 reports the full regression table underlying Table 3.5 in Section 3.3.1.

182 (1) (2) (3) (4) (5) (6) VARIABLES Basic model and Religion and Family ties Trust and Genetic diversity and Geography

Cousin term measure -0.733*** -0.323* -0.706** -1.039*** -0.519*** -0.551** (0.181) (0.190) (0.317) (0.247) (0.183) (0.230) Ethnic fractionalization 0.044 0.283 0.295 0.010 0.020 0.120 (0.453) (0.402) (0.548) (0.440) (0.427) (0.507) Protestant 0.639 (0.520) Catholic 0.367 (0.255) Muslim -0.895*** (0.250) Family ties -0.580 (0.620) General trust 1.540* (0.898) Genetic diversity 23.798 (57.826) Genetic diversity squared -24.128 (41.920) Log population -0.128 -0.143 -0.242 -0.251** -0.140 -0.138 (0.101) (0.093) (0.161) (0.126) (0.099) (0.105) Latitude 3.012*** 2.413*** 3.207** 2.821*** 3.076*** -0.172 (0.734) (0.648) (1.286) (0.807) (0.634) (1.601) Africa -0.192 -0.638*** -0.060 0.216 0.034 -0.265 (0.224) (0.227) (0.540) (0.342) (0.218) (0.266) East Asia -0.051 -0.297 0.136 -0.187 -0.406 -0.103 (0.337) (0.326) (0.587) (0.440) (0.339) (0.315) Latin America -0.529** -1.136*** -0.168 -0.154 -1.469*** -0.488* (0.253) (0.266) (0.377) (0.313) (0.387) (0.270) Socialist legal origin -1.413*** -1.386*** -1.719*** -1.516*** -1.232*** -1.393*** (0.250) (0.216) (0.364) (0.287) (0.227) (0.269) French legal origin -0.119 -0.021 -0.240 -0.190 0.001 -0.066 (0.160) (0.157) (0.311) (0.225) (0.151) (0.163) German legal origin 0.337 0.213 -0.067 0.120 0.461 0.193 (0.334) (0.251) (0.484) (0.299) (0.351) (0.362) Scandinavian legal origin 1.450*** 0.769 0.852 1.024** 1.420*** 1.352*** (0.307) (0.475) (0.597) (0.475) (0.279) (0.364) Constant 3.620*** 3.888*** 4.476*** 2.762** -1.029 4.905*** (0.771) (0.760) (1.502) (1.310) (20.055) (1.702)

Geographical variables yes Observations 128 127 69 86 127 122 R-squared 0.619 0.702 0.726 0.728 0.652 0.703 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.3: Regression analysis of the relationship between cousin term measure and corrup- tion (reduced form). N varies due to missing data for some countries.

Table B.5.3 reports the full regression table underlying Table B.5.7 in Section 3.3.1.

183 (1) (2) (3) (4) (5) (6) VARIABLES Basic model and Religion and Family ties Trust and Genetic diversity and Geography

Consanguinity -5.048*** -0.814 -3.498*** -5.610*** -4.503** -5.804*** (1.157) (4.631) (1.219) (1.313) (1.896) (1.860) Ethnic fractionalization 0.586 0.193 0.869 1.057* 0.406 -0.136 (0.532) (0.604) (0.570) (0.527) (0.587) (0.542) Protestant 1.996 (1.571) Catholic -0.715 (0.602) Muslim -1.722** (0.835) Family ties -1.097* (0.586) General trust 1.513 (1.127) Genetic diversity -6.551 (90.365) Genetic diversity squared 1.878 (68.444) Log population -0.185 -0.305** -0.186 -0.253* -0.210 -0.174 (0.130) (0.128) (0.212) (0.141) (0.130) (0.196) Latitude 1.976* 2.151** 2.281 1.453 2.098 -5.262** (1.106) (0.910) (1.720) (1.501) (1.359) (2.327) Africa 0.526 -0.362 0.278 0.833 0.617 0.622 (0.423) (0.711) (0.448) (0.673) (0.474) (0.568) East Asia -0.467 -0.250 -0.350 -0.628 -0.504 -0.822* (0.582) (0.630) (0.570) (0.638) (0.584) (0.487) Latin America -0.947* -0.699* -0.503 -0.856 -1.352** -1.304** (0.483) (0.397) (0.506) (0.566) (0.628) (0.527) Socialist legal origin -1.252*** -1.059*** -1.307*** -1.070** -1.182*** -1.089*** (0.340) (0.324) (0.441) (0.396) (0.402) (0.336) French legal origin -0.120 0.237 -0.126 -0.007 -0.064 -0.009 (0.232) (0.184) (0.319) (0.344) (0.238) (0.278) German legal origin 0.473 0.163 0.131 0.563 0.358 0.097 (0.361) (0.320) (0.592) (0.355) (0.425) (0.441) Scandinavian legal origin 0.734** -1.105 0.281 0.436 0.776** 0.915* (0.323) (1.008) (0.437) (0.432) (0.346) (0.536) Constant 4.713*** 5.596*** 4.420** 3.291* 8.544 10.310*** (1.195) (1.287) (2.019) (1.664) (29.887) (2.042)

Geographical variables yes Observations 64 64 44 53 64 63 R-squared 0.667 0.792 0.801 0.709 0.687 0.747 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.4: Regression analysis of the relationship between consanguinity and corruption, with cousin term measure as an instrument for consanguinity. N varies due to missing data for some countries.

Table B.5.4 reports the full regression table underlying Table B.5.8 in Section 3.3.1.

184 Re-analysis using restricted samples

Table B.5.5 report direct comparisons of our baseline regression and those regressions computing the effects of various cultural variables on corruption, restricting the sample to only those countries for which data exists on both metrics. Note that this is not necessary for the specification including genetic diversity, since we have genetic diversity data on all 67 countries in our baseline regression. The results are consistent with those reported in the body of the paper.

(1) (2) (3) (4) (5) (6) (7) VARIABLES Basic model restricted sample and Family ties restricted sample and Trust restricted sample and Geography

Consanguinity -4.463*** -4.533*** -3.853*** -4.470*** -4.663*** -4.049*** -3.874*** (0.769) (0.760) (0.722) (0.882) (0.842) (0.769) (0.999) Ethnic fractionalization 0.524 0.797 0.969 0.837 0.846* 0.561 -0.140 (0.514) (0.637) (0.667) (0.503) (0.486) (0.501) (0.447) Family ties -1.261** (0.528) General trust 1.562 (1.105) Log population -0.175 -0.210 -0.329 -0.202 -0.257* -0.168 -0.126 (0.129) (0.223) (0.216) (0.133) (0.143) (0.134) (0.182) Latitude 1.898** 2.200* 1.138 2.172* 1.531 2.522*** -4.349** (0.878) (1.160) (1.299) (1.092) (1.001) (0.902) (2.092) Africa 0.388 0.053 -0.037 0.517 0.680 0.448 0.446 (0.376) (0.459) (0.357) (0.554) (0.570) (0.376) (0.365) East Asia -0.440 -0.548 -0.464 -0.375 -0.526 -0.247 -0.643 (0.472) (0.598) (0.536) (0.522) (0.502) (0.465) (0.433) Latin America -0.807** -0.723* -0.776** -0.641* -0.631* -0.596* -0.895** (0.327) (0.366) (0.366) (0.369) (0.338) (0.326) (0.369) Socialist legal origin -1.212*** -1.274*** -1.440*** -1.211*** -1.054*** -1.195*** -0.989*** (0.312) (0.414) (0.430) (0.357) (0.371) (0.318) (0.318) French legal origin -0.239 -0.305 -0.259 -0.291 -0.140 -0.237 -0.047 (0.206) (0.303) (0.286) (0.238) (0.301) (0.222) (0.227) German legal origin 0.446 0.462 0.008 0.547 0.491 0.507 0.122 (0.342) (0.471) (0.572) (0.363) (0.336) (0.348) (0.388) Scandinavian legal origin 0.787** 0.652 0.255 0.759** 0.400 0.705** 0.916* (0.327) (0.395) (0.431) (0.352) (0.444) (0.327) (0.458) Constant 4.643*** 4.798*** 6.127*** 4.649*** 3.224** 4.245*** 9.296*** (1.018) (1.749) (1.674) (1.124) (1.496) (1.142) (1.833)

Geographical variables yes Observations 67 45 45 56 56 65 65 R-squared 0.668 0.747 0.783 0.705 0.725 0.677 0.770 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.5: Regression analysis of the relationship between consanguinity and corruption: sample size restrictions. Higher values of the dependent variable imply lower corruption. N varies due to missing data for some countries.

The issue of data collection dates

Another concern is the heterogeneous data collection dates for our measure of consanguinity. As explained in the data description in Appendix B.3, our measure of consanguinity is the weighted average of 448 sample studies in different locations from 72 countries, with the data collection date varying from 1922 to 2013. To analyze the data in more detail, we categorize the 448 sample studies into five overlapping groups: studies conducted after 1922, after 1950, after 1975, after 1980, and after 1984. Nine studies are excluded from this categorization because their exact data collection dates are not reported in [42]. 67 countries are covered by both studies after 1922 and after 1950, and the average consanguinity rate of 67 countries in the first group of studies (after 1922) is 13.651 and drops to 13.508 in the group of studies after 1950. 34 countries are covered by both studies after 1922 and after 1975, and the average consanguinity rate of 34 countries in the first group of studies (after 1922) is 23.23 and drops to 22.847 in the group of studies after 1975. 31 countries are covered by both studies after 1922 and after 1980, and the average consanguinity rate of 34 countries in

185 the first group of studies (after 1922) is 24.360 and drops to 23.698 in the group of studies after 1980. 26 countries are covered by both studies after 1922 and after 1984, and the average consanguinity rate of 26 countries in the first group of studies (after 1922) is 25.056 and increases to 25.683 in the group of studies after 1984. This information reveals that countries with relatively high consanguinity rates have been studied later. In fact, the correlation between average data collection date weighted by sample size and consanguinity rate is 0.48. This is consistent with the later studies being motivated by an increasing awareness of the genetic sources of certain health problems and attempts to determine whether consanguinity was increasing the frequency of deleterious recessive genes. Perhaps surprisingly, the trend of consanguinity is flat or weakly decreasing over- all. The correlation between data collection date and consanguinity rate within countries that were sampled in multiple studies over the years is positive for some, and for others negative, but the average of the correlations for all countries is -0.14. As noted by [42], “past predictions of a rapid decline in the overall prevalence of consanguineous unions have proved to be largely incorrect. In fact, the recorded numbers of consanguineous unions appear to have grown at least in step with increasing national and regional populations, and in some economically less developed countries the proportion of marriages contracted between close biological kin has expanded. The simplest explanation for this observation is that as greater numbers of children survive to , the traditional social preference for consanguineous unions can be more readily accommo- dated" [42, Summary]. Therefore, it seems that consanguinity is quite stable over time and could be among the drivers of institutional differences across countries. [15] shows that the same is true about family values. Since the overall consanguinity trend is weakly decreasing, if consanguinity data is collected earlier for relatively corrupt and low development countries, we might end up with spurious correlation between con- sanguinity and corruption. However, the correlation of average data collection date of countries weighted by sample size with average income per capita of the countries for 1984-2011 is -0.10 and with average cor- ruption index of countries for 1984-2011 is -0.24. It follows that the consanguinity data is collected later for countries with relatively high corruption, low economic development and high consanguinity. This can only underestimate the consanguinity rates of relatively high-corruption countries and overestimate the con- sanguinity rates of relatively low-corruption countries and would lead us to underestimate the effect of the consanguinity on corruption in the regressions. Nevertheless, to further investigate the robustness of the results, we also include the variable of average data collection date of countries weighted by sample size to regression (3) of Table 3.4 (including basic regressors and consanguinity). The result is displayed in column (1) of Table B.5.6 and shows that consanguinity is significant even after controlling for the data collection dates and the variable of data collection dates is not significant. Also, we have run the same regression with different consanguinity rates obtained from the five groups of studies (sample studies after 1922, after 1950, after 1975, after 1980 and after 1984) and the results support a highly significant effect of consanguinity.

186 (1) (2) (3) (4) (5) Full sample Data collection date Data collection date Data collection date Data collection date VARIABLES 1922-2013 1950-2013 1975-2013 1980-2013 1984-2013

Consanguinity -5.169*** -5.108*** -7.166*** -6.422*** -5.113*** (0.621) (0.641) (1.153) (0.872) (1.121)

Consanguinity data -0.002 collection date (0.002)

Ethnic fractionalization 0.913 0.704 2.331** 1.956*** 0.842 (0.548) (0.528) (0.886) (0.669) (0.692) Latitude 1.656* 1.915** 1.522 0.849 1.183 (0.866) (0.830) (1.082) (1.011) (1.757) Log population -0.181 -0.150 -0.209 -0.109 -0.217 (0.140) (0.134) (0.176) (0.162) (0.203) Africa -0.507** -0.546*** (0.229) (0.200) East Asia -0.702 -0.538 -1.530*** -1.608*** -0.991 (0.443) (0.459) (0.454) (0.437) (0.708) Latin America -1.001*** -0.873*** -0.525 -0.486 -0.215 (0.337) (0.323) (0.573) (0.494) (0.721) Socialist legal origin -0.849* -1.387*** 0.332 0.220 0.574 (0.467) (0.299) (0.503) (0.455) (0.693) French legal origin -0.362* -0.359* -0.561* -0.715** -0.613 (0.210) (0.211) (0.322) (0.333) (0.357) German legal origin 0.408 0.333 0.304 0.191 (0.365) (0.362) (0.364) (0.279) Scandinavian legal origin 0.753** 0.658** 0.569 0.807 0.615 (0.357) (0.319) (0.574) (0.555) (0.629) Constant 8.677* 4.583*** 5.083*** 4.665*** 5.411** (4.490) (1.015) (1.276) (1.243) (1.841)

Observations 58 62 30 27 22 R-squared 0.727 0.710 0.779 0.840 0.802 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.6: Regression analysis of the relationship between consanguinity and corruption, controlling for date of data collection. Higher values of the dependent variable imply lower corruption. N varies due to missing data for some countries.

187 B.5.2 An Instrumental Variables Approach

To attempt to provide preliminary evidence for a causal relationship between sub-ethnic fractionalization and corruption, we seek an instrument that is correlated with consanguineous marriage practices, but unlikely to be correlated with corruption. A valuable recent approach uses historical ethnographic data from the Ethno- graphic Atlas to aggregate information about the historical cultural characteristics of resident ethnic groups into country-level measures. We follow the method introduced by [16] to produce such measures. As an estimate of the geographic distribution of ethnicities across the globe, we use Language locations from WLMS (World Language Mapping System) 19th version (2017) which corresponds to the 19th edition (2016) ISO 639-3 standard, and the 16th Edition of the Ethnologue (Languages of the World). WLMS provides a shape file that divides the world into polygons indicating the locations around the world where 7650 languages are spoken. Each of these languages are matched to one of the 1291 ethnic groups included in the extended Ethnographic Atlas provided by D-PLACE.12 We also use the LandScan 2014 database which reports the world’s population in 2014 for 30 arc-second by 30 arc-second grid cells globally to construct the country-level population weighted averages of measures from the Ethnographic Atlas. See [16] for more details of the methodology used in the construction of the country-level data. Below, we replicate their measure of historical plow use, using the new data; see Figure B.5.2. The key for our study is to find a measure that is related to consanguineous marriage (and hence sub-ethnic fractionalization) but unlikely to be directly related to corruption. While the Ethnographic Atlas contains some measures of consanguineous (cousin) marriage practices, path dependence in the development of norms and institutions might make such measures subject to the same endogeneity concerns that arise in using contemporary marriage practices. We discuss other issues with the marriage practice variables below. We adopt a clever strategy employed by [230], that exploits a connection between kinship terminology and marriage patterns. Anthropologists distinguish six systems of kin terminology into which languages can be classified, following [189]. Eskimo (which includes English) and Hawaiian kin terminologies do not distin- guish first cousins from one another.13 Sudanese, Iroquois, Omaha, and Crow terminologies either fully or partially distinguish cousins from one another. Anthropologists have long argued that differentiated cousin- terms are associated with the prevalence of consanguineous marriage, since kin terms may serve a “clas- sificatory" function, dividing (real and fictive) kin into those who are and are not eligible for marriage [189, 251, 100, 91, 114]. Variable 27 in the Ethnographic Atlas records kinship terminology used to refer to cousins and thus pro- vides a measure that can be used as an instrument for cousin marriage rates. We created a dummy variable which equals 1 if the language spoken in a society fully or partially distinguishes first cousins from each other (Sudanese, Iroquois, Omaha, Crow), and equals 0 otherwise (Eskimo or Hawaiian). However, while the instrument employed by [230] is drawn from [219], we construct our cousin terms instrument using [16]’s method explained above; first we assign this measure to all ethnic groups (see the left panel of Figure B.5.1). Then, we construct a population-weighted average of the dummy variable for all ethnic groups living in a

12The extended version of the Ethnographic Atlas by D-PLACE (Database of Places, Language, Culture, and Envi- ronment) [158] has several advantages over last version [196, 120] used in previous studies such as [16]; 1- Original Ethnographic Atlas includes 1267 societies with three duplicate observations of which two were discovered and removed by [16]. D-PLACE added data for 27 societies in Eurasia, recently coded by [160, 45], which fill important regional gaps in northern Eurasia; 2- Each data point in Ethnographic Atlas is linked to one or more of the 3502 ethnographic sources that were consulted in coding the data; 3- Errors in the years in which societies were sampled were identified and replaced; 4- The latitude-longitude data of societies were corrected and improved.

13Eskimo terminology further distinguishes siblings from their cousins. According to [189] and [114], accumulation of property and its inheritance by individuals (i.e. direct inheritance) were crucial in the development of such “individu- alizing" kin terms and the “isolation of the nuclear family" [114] in Eskimo terminology.

188 country. Therefore, our country-level measure is the fraction of the population that speaks a language which differentiates among cousins (see the right panel of Figure B.5.1).

Figure B.5.1: Kin terms partially or fully distinguishing first cousins (Descrip- tive/Crow/Iroquois/Omaha/Sudanese) versus not distinguishing any first cousins (Es- kimo/Hawaiian), by ethnic/linguistic group and country.

Results. Tables B.5.7 and B.5.8 employ the country-level cousin terms measure in both reduced-form and IV analysis of the specifications reported in Table 3.5. In all specifications in Table B.5.7, we see a negative and significant relationship between the share of the population using kin terms that distinguish cousins and institutional quality. This relationship holds controlling for the set of confounds introduced in the last section. In all specifications we cannot reject the null hypothesis of exogeneity. We report additional robustness checks in below. Similarly, when cousin terminology is used as an IV for consanguinity in Table B.5.8, we observe a large, negative impact of consanguinity rates on institutional quality in the baseline specification. This result is robust to including our other controls, except in the case of column (2), which introduces controls for religion. This is unsurprising since the Church’s influence on family structure also led, e.g. the English language to shed excess kin terms [187, p. 69]. For this reason we are skeptical of the validity of kin terms as an instrument, since language is part of a broader coevolutionary process with norms and institutions.

189 (1) (2) (3) (4) (5) (6) VARIABLES Basic model and Religion and Family ties Trust and Genetic diversity and Geography

Cousin term measure -0.733*** -0.323* -0.706** -1.039*** -0.519*** -0.551** (0.181) (0.190) (0.317) (0.247) (0.183) (0.230) Ethnic fractionalization 0.044 0.283 0.295 0.010 0.020 0.120 (0.453) (0.402) (0.548) (0.440) (0.427) (0.507) Protestant 0.639 (0.520) Catholic 0.367 (0.255) Muslim -0.895*** (0.250) Family ties -0.580 (0.620) General trust 1.540* (0.898) Genetic diversity 23.798 (57.826) Genetic diversity squared -24.128 (41.920)

Geographical variables yes Additional controls yes yes yes yes yes yes Observations 128 127 69 86 127 122 R-squared 0.619 0.702 0.726 0.728 0.652 0.703 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.7: Regression analysis of the relationship between cousin term measure and corrup- tion (reduced form). N varies due to missing data for some countries. Additional controls include legal origins dummies, region dummies, latitude and log population. Full estimates reported in Appendix Table B.5.3.

(1) (2) (3) (4) (5) (6) VARIABLES Basic model and Religion and Family ties Trust and Genetic diversity and Geography

Consanguinity -5.048*** -0.814 -3.498*** -5.610*** -4.503** -5.804*** (1.157) (4.631) (1.219) (1.313) (1.896) (1.860) Ethnic fractionalization 0.586 0.193 0.869 1.057* 0.406 -0.136 (0.532) (0.604) (0.570) (0.527) (0.587) (0.542) Protestant 1.996 (1.571) Catholic -0.715 (0.602) Muslim -1.722** (0.835) Family ties -1.097* (0.586) General trust 1.513 (1.127) Genetic diversity -6.551 (90.365) Genetic diversity squared 1.878 (68.444)

Geographical variables yes Additional controls yes yes yes yes yes yes Observations 64 64 44 53 64 63 R-squared 0.667 0.792 0.801 0.709 0.687 0.747 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.8: Regression analysis of the relationship between consanguinity and corruption, with cousin term measure as an instrument for consanguinity. N varies due to missing data for some countries. Additional controls include legal origins dummies, region dummies, latitude and log population. Full estimates reported in Appendix Table B.5.4.

190 Alternative Measures of Consanguinity from the Ethnographic Atlas?

Here we explain why we use the kin terminology variable rather than other EA variables regarding marriage. Variables 23-26 in the Ethnographic Atlas which all are constructed based on column 25 of [197]’s codebook [158] provide measures related to historical consanguineous marriage practices - in particular, cousin mar- riages. In column 25, Murdock listed types of cousins14 a man is permitted to marry. He also indicated with an extra letter those types which are “preferred rather than just permitted" [120, 158]. To discuss possible measures, we categorized variables 24 and 26 of the Ethnographic Atlas, as shown in Table B.5.9.15

Preferred (1) (2) A type of first/second No preferred Permitted cousin marriage preferred cousin marriage Total

(1) Four of four first cousins 55 62 117

(2) Three of four first cousins 6 19 25

(3) Two of four first cousins 140 75 215

(4) One of four first cousins 30 19 49

(5) No first cousins (some or 14 340 354 all second cousins) (6) No first/second cousins 0 282 282

Total 245 797 1024

Table B.5.9: Intersection of "cousin marriages permitted" and "Cousin marriages preferred" from the Ethnographic Atlas.

First, let’s consider the cousin marriage permitted. This variable doesn’t capture cousin marriage frequency especially in the country level. The reason is that some populations such as New Englanders permitted mar- riage with all four first cousins, same as the Middle Eastern populations of Syrians, Turks and Iranians. This is probably just because Protestantism allowed cousin marriages. This variable doesn’t capture the fact that Protestantism historically discouraged cousin marriages. This difference between permitting cousin marriage, and frequencies of cousin marriage is likely to be true about some other populations as well. However, New Englanders is the most problematic for our country level analyses, because it is the only English society with available data on cousin marriage permitted. Therefore, countries with majority English populations such as UK, US, Australia, New Zealand, Canada, and also English regions in many countries (such as Ireland and South Africa) pick the cousin marriage measure from New Englanders’ society in the Ethnographic Atlas, while according to historical records and consanguinity rates [42], there is a big difference between consan- guinity rates of these countries, and those of the Middle Eastern and North African countries. Now, let’s consider the cousin marriage preferred. This variable also doesn’t capture cousin marriage fre- quencies across societies, and in the country level. First, consider that there is no report of cousin marriage

14(i) Mother’s brother’s daughter, (ii) Father’s sister’s daughter, (iii) Mother’s sister’s daughter, (iv) Father’s brother’s daughter, (v) second cousins. Matrilateral cousin refers to (i) and (iii); Patrilateral cousin refers to (ii) and (iv); Cross cousin refers to (i) and (ii); Parallel cousin refers to (iii) and (iv).

15Based on variable 24, we constructed a variable for cousin marriages permitted. The variable takes on integer values ranging from 1 to 6 where higher values indicate wider cousin marriage culture; value 1 is assigned if there no first or second cousin marriages are allowed (category 8), value 2 is assigned if only second cousin marriages are allowed (categories 5, 6 and 7, considering that categories 5 and 6 are not distinguishable from category 7), and values 3 to 6 is assigned if one, two, three, and four of four cousins are marriageable receptively (categories 4 to 1 respectively). Also based on variable 26, we constructed another variable for cousin marriages preferred. The variable takes value 1 if a first or second cousin is preferred spouse (categories 1-5), and takes value 0 if there is no preferred cousin marriage.

191 preferred if it is not permitted (see table B.5.9). Therefore cousin marriage preferred is a subset of cousin marriage permitted. Second, there are 245 societies that permit cousin marriages but has no preference for any types of cousins. While 72% of the Ethnographic Atlas societies permitted cousin marriage, only 24% of societies are reported to have a marriage preference with a cousin. The reason seems to be that cousin marriage preferred is not defined versus no preference for cousin marriage, but versus indifference among permitted cousins for marriage. Therefore, no report of a cousin marriage preferred for a society does not necessarily imply cousin marriages to be uncommon.16 Instead, it might have some implications about the nature of the descent group. According to Murdock, “the worldwide incidence of such preferences is so low", but still reveals that anthropologists “are correct in ascribing matrilateral preferences primarily to patrilineal societies and patrilateral preferences to matrilineal societies" [195, p.687]. Third, while 175 societies permit marriage with first cousins but has no preferred cousin marriage, there are 14 societies which do not permit any first cousin marriages but has a preference to marry a second cousin. If cousin marriage preferred is used as cousin marriage measure, it is not obvious why the latter societies should have higher cousin marriage frequencies, while the earlier societies provide more possibility for cousin marriage.

B.5.3 Giuliano and Nunn’s (2013) Data RGN FGNE ROLES GENDER OF ORIGINS

FIGURE II Traditional Plough Use among the Ethnic/Language Groups Globally

487

at Simon Fraser University on April 2, 2016 2, April on University Fraser Simon at http://qje.oxfordjournals.org/ from Downloaded

Figure B.5.2: Traditional plough use across ethnic/linguistic groups.

To compare our matching of languages to the Ethnographic Atlas societies with [16], we replicated the ethnic- level map of traditional plough use. The result shows very high similarity except for the newly added polygons in Australasia, central America, and south America (since we used new versions of the data and polygons).

Robustness checks. In our country-level cousin marriage measure, we ignored the ethnicities with miss- ing data. In regressions (1) and (2) of Table B.5.10, we replicated the basic regression for both reduced form

16For example, New Englanders, Turks, Syrians, and Iranians all reported to permit marriage with all four first cousins. However, New Englanders and Turks has no preferred cousin marriage, while Syrians and Iranians prefer marriage with father’s brother’s daughter. This simply implies that Turks were indifferent among their cousins when marrying one, but it doesn’t imply that they had no preference for cousin marriage in general same as New Englanders (as we know from [42]’s consanguinity rates).

192 and instrumental variable estimation using the newly released data by [104]. Our measure of cousin terms is highly correlated with theirs for the countries for which we also have corruption data (Spearman’s ρ = 0.78, p-value < 0.001, N = 129). Moreover, in regressions (3) and (4) we dropped countries for which share of ethnicities with missing information is more than 50%. Regressions (5) and (6) includes countries for which share of ethnicities with missing information is zero. Therefore, our results are robust with respect to using alternative data from [104], or whether we drop countries with missing information for cousin term measure on some or all ethnicities.

(1) (2) (3) (4) (5) (6) VARIABLES Reduced form Instrumental variable Reduced form Instrumental variable Reduced form Instrumental variable

Cousin term measure -0.849*** -0.823*** -1.139*** (0.169) (0.190) (0.289) Consanguinity -4.845*** -4.556*** -6.053*** (1.164) (1.255) (1.749) Ethnic fractionalization 0.251 0.677 0.310 0.594 0.762 1.954 (0.448) (0.533) (0.477) (0.603) (0.638) (1.393) Log population -0.200* -0.243 -0.253** -0.208 -0.571*** -0.497 (0.106) (0.163) (0.120) (0.198) (0.194) (0.430) Latitude 2.824*** 1.771* 2.678*** 1.860* 2.574** 2.029 (0.716) (0.991) (0.724) (1.049) (0.963) (1.330) Africa -0.301 0.348 -0.336 0.307 0.311 -1.124 (0.241) (0.433) (0.278) (0.424) (0.447) (0.778) East Asia -0.107 -0.468 -0.142 -0.486 -0.194 -0.924 (0.346) (0.566) (0.352) (0.581) (0.497) (0.618) Latin America -0.673** -0.927** -0.794*** -0.910* -1.230*** -1.342** (0.260) (0.457) (0.277) (0.459) (0.355) (0.476) Socialist legal origin -1.397*** -1.318*** -1.407*** -1.287*** -1.533*** -1.930*** (0.241) (0.350) (0.251) (0.345) (0.321) (0.658) French legal origin -0.139 -0.279 -0.083 -0.236 -0.110 -0.316 (0.161) (0.249) (0.188) (0.263) (0.316) (0.466) German legal origin 0.345 0.462 0.376 0.420 0.415 0.838 (0.320) (0.368) (0.325) (0.421) (0.398) (0.718) Scandinavian legal origin 1.017*** 0.718** 1.018*** 0.720** 0.852** 0.543 (0.297) (0.351) (0.302) (0.350) (0.406) (0.656) Constant 4.221*** 5.255*** 4.659*** 4.957*** 6.987*** 6.910** (0.846) (1.215) (0.935) (1.364) (1.397) (2.699)

Observations 128 63 112 57 59 29 R-squared 0.624 0.663 0.633 0.666 0.718 0.705 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.10: Regression analysis of the relationship between consanguinity and corruption, with cousin term measure in reduced form and as the instrument for consanguinity. N varies due to missing data for some countries. Regressions (1) and (2) ignore the ethnicities with missing data. Regressions (3) and (4) drop countries for which share of ethnicities with missing information is more than 50%. Regressions (5) and (6) includes countries for which share of ethnicities with missing information is zero. The data on cousin marriage measure in these specifications comes from the aggregation of the Ethnographic Atlas as reported in [104].

In addition, we replicated Alesina et al. (2003)’s regressions in Tables B.4.1 and B.4.2 using the cousin term measure in both reduced form and instrumental variable estimates; the connection between consanguineous marriage and corruption remains robust. See Appendix B.4.

B.5.4 Additional within-country analysis (Italy)

Table B.5.11 reports the full regression table underlying Table 3.7 in Section 3.3.2. Table B.5.12 reports robustness checks for the specifications reported in Table 3.9 in Section 3.3.2. Here we see that the effect of the instrument remains significant controlling for geography, but when we control for latitude (which is highly correlated with the instrument), standard errors become large and the coefficient is no longer significant.

193 (1) (2) (3) (4) (5) (6) Basic model and Consanguinity Income instead of Share of agriculture both Income and without Income and VARIABLES Share of agriculture as instrument Share of agriculture Share of agriculture

Consanguinity 4.451* 4.506* 4.440* 4.472* 4.470* (2.443) (2.421) (2.470) (2.460) (2.395) Share of agriculture 1.537 0.366 0.991 (3.469) (3.385) (3.631) Log population -0.020 -0.050 -0.032 -0.067 -0.021 -0.051 (0.123) (0.113) (0.124) (0.181) (0.132) (0.111) Log value added per capita 0.029 -0.024 0.042 (0.081) (0.228) (0.088) Latitude -0.359*** -0.271* -0.278* -0.269* -0.273* -0.273* (0.130) (0.147) (0.148) (0.148) (0.149) (0.146) Mean Temperature -0.125* -0.120* -0.121** -0.119* -0.122** -0.120** (0.063) (0.060) (0.060) (0.063) (0.061) (0.060) Mean precipitation 0.697 -4.032 -4.075 -3.967 -4.143 -4.017 (6.040) (4.367) (4.373) (4.428) (4.381) (4.350) Suitability for agriculture 0.437 0.362 0.349 0.375 0.339 0.363 (0.524) (0.526) (0.526) (0.524) (0.538) (0.521) Distance to coast -0.001 -0.002 -0.002 -0.002 -0.002 -0.002 (0.002) (0.002) (0.002) (0.002) (0.002) (0.002) Elevation 0.018 0.026 0.027 0.026 0.026 0.026 (0.054) (0.054) (0.055) (0.054) (0.055) (0.054) Slope -0.449 0.580 0.629 0.583 0.575 0.604 (0.788) (0.859) (0.872) (0.851) (0.870) (0.851) Ruggedness 1.387 -1.896 -2.051 -1.907 -1.877 -1.973 (2.426) (2.666) (2.706) (2.640) (2.701) (2.641) Constant 17.200** 13.886* 14.120** 13.953** 13.771* 14.030** (6.607) (7.026) (7.002) (6.961) (7.087) (6.941)

Observations 101 101 101 101 101 101 R-squared 0.459 0.502 0.502 0.501 0.503 0.502 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.11: Replication of Table 3.6 including controls for climate and geography.

194 (1) (2) (3) (4) (5) (6) Basic model and geography and latitude both Income and and geography and latitude VARIABLES Share of agriculture

Consanguinity 8.306*** 9.535* 7.634 8.658*** 10.407* 8.787 (2.396) (5.551) (6.498) (2.538) (5.820) (6.663) Share of agriculture -2.205 0.227 -0.472 -1.731 0.622 0.035 (5.420) (4.007) (3.748) (5.552) (4.246) (3.984) Log population -0.006 -0.064 -0.071 0.045 -0.038 -0.042 (0.146) (0.135) (0.124) (0.156) (0.148) (0.138) Log value added per capita 0.077 0.048 0.053 (0.094) (0.104) (0.096) Latitude -0.208 -0.189 (0.202) (0.207) Mean Temperature -0.067 -0.116* -0.072 -0.117* (0.054) (0.060) (0.055) (0.061) Mean precipitation -6.465 -7.412 -7.736 -8.752 (7.998) (7.485) (8.260) (7.471) Suitability for agriculture 0.065 0.308 0.042 0.259 (0.569) (0.553) (0.599) (0.590) Distance to coast -0.006*** -0.003 -0.006*** -0.004 (0.002) (0.003) (0.002) (0.003) Elevation 0.083 0.031 0.080 0.033 (0.062) (0.064) (0.065) (0.068) Slope 0.950 1.316 1.206 1.569 (1.926) (1.616) (2.006) (1.672) Ruggedness -3.209 -4.243 -4.012 -5.049 (6.046) (5.128) (6.300) (5.305) Constant 0.955 2.407 11.516 0.530 2.274 10.530 (1.899) (1.917) (8.738) (1.973) (1.988) (8.884)

Observations 101 101 101 101 101 101 R-squared 0.307 0.435 0.480 0.281 0.417 0.462 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.12: Active years of archdioceses as an instrument for consanguinity.

195 Time series of consanguinity across Italy

As described in Appendix B.3 Table B.3.2, we computed the consanguinity rate of Italian provinces using consanguinity percentages reported by [57] for the period 1945-1964. Although, consanguinity was measured for regions in the late 1940s, 1950s and early 1960s, if the regions share a common trend, it can be argued that consanguinity rates plausibly capture provincial differences in subsequent decades as well.

70 FRIULI VENEZIA GIULIA UMBRIA EMILIA ROMAGNA 60 MARCHE VENETO VALLE D'AOSTA 50 TOSCANA TRENTINO ALTO ADIGE PIEMONTE 40 LOMBARDIA LAZIO

30 PUGLIA ABRUZZO LIGURIA Consanguinity % Consanguinity

20 CAMPANIA BASILICATA SARDEGNA

10 MOLISE CALABRIA SICILIA 0

1910- 1915- 1920- 1925- 1930- 1935- 1940- 1945- 1950- 1955- 1960- 1914 1919 1924 1929 1934 1939 1944 1949 1954 1959 1964

Year

Figure B.5.3: Consanguinity trend for Italian regions.

As noted by [57], trends of consanguinity in the 20 Italian regions shown in Figure B.5.3 reveal

“first, an increase, at first slow, then faster, which ends around 1915-1925 when a peak of consanguinity is reached, at a similar time for all Italian regions, and, second, a rapid descent begins, [. . . ] some of the possible explanations of the first phase, which are, in order of time: (1) changes of laws of inheritance; (2) decreased influence of the Roman Catholic Church; and (3) increase in relative abundance of cousins due to increase in population size, which increases disproportionately the abundance of relatives, though with a delay of one generation for first cousins and two for second cousins, [. . . ] The second phase of the phe- nomenon, the decrease of consanguinity, must be the same as that called the breakdown of isolates [. . . ] where it began in the second half of the nineteenth century. It is essentially due to an increase in individual mobility, tied to increased means and speed of transportation, and increased opportunities of work in spe- cific industrial areas, favoring relocation of workers, [. . . ] The difference in the peak consanguinity values between north and south agrees with the lower growth rate of the northern populations, [. . . However] the difference in the time at which the peak appears seems relatively small in the various parts of the country, indicating that the breakdown of isolates took place at similar times all over Italy and was a national rather than a local phenomenon." [57, p. 238-41].

Therefore, the assumption of a common negative consanguinity trend across Italian provinces appears plau- sible. From post-war highs, consanguinity rates declined by an average of almost 50% across regions by 1964. Moreover, in unreported regression analysis, when we regress consanguinity rates as a percentage of the region-level average from 1945-1964 on region dummies and a region×year interaction, we find support for a negative and significant slope in each region over the period 1945-1964. However, a Wald test rejects the null hypothesis that the slope is the same in all regions (p-value < 0.01).

Province-level data: additional robustness checks

This section reports additional regression analyses of the relationship between consanguinity and corruption in Italy, controlling for potential confounds. For clarity, we report our baseline specifications drawn from

196 column (2) of Tables 3.6 and 3.7 side by side in columns (1) and (2) of Table B.5.13. As we add controls for other confounding variables, we report specifications with and without controls for climate and geography.

Civil society. One cultural difference across Italian regions that may influence the quality of institutions and the level of corruption is the extent of civil society (civicness). The intuition is that more civically active areas may have better developed a capacity for effective government as citizens acquire habits, skills and values through participation in non-governmental organizations. [214] contrasts cooperation rooted in civil society to that based on Banfield’s (1958) notion of ‘amoral familism’ in which heavy reliance on kinship ties results in lack of general trust and community cohesion. Our first measure of civicness is [214]’s civic involvement index, which is an aggregation of five indicators measured between 1860 and 1921, on a scale of 1 to 9. This measure of civicness is negatively correlated with consanguinity (Spearman’s ρ = -0.73, p-value < 0.00, N=101). This might suggest that consanguinity is just a product of lack of civicness. However, for civic involvement in columns (3) and (4) of Table B.5.13, consanguinity remains highly significant, although the results also confirm [214]’s hypothesis in column (3); civicness exhibits a modest but significant negative relationship with corruption. As a second measure of civicness, in columns (5) and (6) we use the number of voluntary organizations (established before 1945) per 100,000 inhabitants, with qualitatively similar results.

(1) (2) (3) (4) (5) (6) Basic model and geography Civic involvement and geography Voluntary organizations and geography VARIABLES (1860-1921) (established before 1965)

Consanguinity 5.144*** 4.451* 3.848*** 4.317* 4.972*** 4.470* (1.099) (2.443) (1.462) (2.457) (1.138) (2.450) Civic involvement -0.092** -0.025 (0.045) (0.057) Voluntary organizations -0.009** -0.008 (0.004) (0.005) Share of agriculture 3.472 0.366 1.829 0.339 3.614 0.834 (3.243) (3.385) (3.613) (3.409) (3.271) (3.388) Log population 0.022 -0.050 0.004 -0.051 0.039 -0.033 (0.115) (0.113) (0.114) (0.113) (0.117) (0.118) Latitude -0.271* -0.240 -0.258* (0.147) (0.167) (0.148) Mean Temperature -0.120* -0.115* -0.119* (0.060) (0.060) (0.061) Mean precipitation -4.032 -3.702 -4.148 (4.367) (4.386) (4.389) Suitability for agriculture 0.362 0.336 0.272 (0.526) (0.525) (0.536) Distance to coast -0.002 -0.002 -0.003 (0.002) (0.002) (0.002) Elevation 0.026 0.020 0.033 (0.054) (0.055) (0.053) Slope 0.580 0.511 0.560 (0.859) (0.885) (0.866) Ruggedness -1.896 -1.669 -1.837 (2.666) (2.756) (2.688) Constant 0.763 13.886* 1.764 12.749* 0.605 13.174* (1.505) (7.026) (1.613) (7.654) (1.520) (7.109)

Observations 101 101 101 101 101 101 R-squared 0.430 0.502 0.455 0.503 0.435 0.504 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.13: Regression analysis of the relationship between consanguinity and corruption in Italy: controlling for civil society.

Family types, family ties. Consanguinity may be most attractive when extended families are close-knit and there are frequent interactions with kin, and strong family ties have been implicated as a source of cor- ruption. Thus, the observed effect of consanguinity may instead simply proxy for family ties. Consistent with this idea, a number of studies have shown that medieval family types as classified by [248] are associated

197 with current regional or cross-country differences in development and institutions [e.g. 78, 97, 10]. Todd’s classification of family types comes from the patterns between generations within families (nu- clear or extended) and inheritance practices (equal or unequal division of assets among children). [248] uses his classification of family types “to explain relative levels of diffusion or resistance to important societal changes such as Protestantism, secularism, or political ideology" [15, p. 180], where e.g. nuclear family structures encourage children to the leave the home, weakening the influence of extended family on norms and behavior. According to [248], Italy had three family types; incomplete stem family (extended family, unequal inher- itance), communitarian family (extended family, equal inheritance), and egalitarian nuclear family (nuclear family, equal inheritance). In columns (11) and (12) of Table B.5.14, we controlled for family types where the omitted dummy is the incomplete stem family. Consanguinity remains highly significant in the regression. The communitarian family type, characterized by both cohabitation with extended family and egalitarian in- heritance, is also significantly associated with higher corruption. As a second and more recent measure of family ties, following [10] we used the fraction of youth aged 18-34 living with at least one parent, averaged over the period 2002-2009 (from ISTAT). Controlling for this measure of family ties in columns (9) and (1), consanguinity remains significant.

South and islands. A well-known fact about corruption in Italy is that the corruption level is higher in the south. This difference is usually attributed to cultural differences between the south and north [29, 214]. Figure 3.3b shows high consanguinity rates in southern Italy. Therefore, the impact on corruption that we attribute to consanguinity might instead result from some other feature of southern Italy related to corruption. Therefore, we also included a dummy variable in the regressions for the provinces in so-called Mezzogiorno regions: southern Italy (Abruzzo, Molise, Campania, Puglia, Basilicata, Calabria) and the islands (Sicilia, Sardegna). Including this dummy variable assures a stringent test of our hypothesis because southern Italy and the islands also happen to have the highest consanguinity rates. Nevertheless, including a dummy for the south and islands, the relationship between consanguinity and corruption remains robust.

Political domination. A historical path dependency in Italy may have arisen from colonization and polit- ical domination to which different parts of Italy were subjected in the Middle Ages, which carried with them cultural, political and institutional baggage that could influence outcomes today. Most relevant to our study is that the Papal States were territories under the sovereign direct rule of the pope, from the 8th century until Italian unification in 1871. Therefore, it is plausible to expect stricter historical emphasis on consanguinity bans, and this is consistent with the fact that we observe lower consanguinity rates in the Papal states than in the other provinces (means are 6% and 13%, respectively, t-test, two-sided p-value < 0.01, N=101). More- over, southern territories under the Spanish or Norman rule have had lower institutional quality and higher corruption [214, 73]. These are also the regions with the highest consanguinity rates. On the other hand, northern territories were historically under Austrian administration which was “usually portrayed as a good administrator that did not implement exploiting or extracting policies" [73, p. 13]. To capture the effect of these historical differences, we use dummy variables based on the map constructed by [73] that identify, for each province, the administration that presided continuously during the period 1560- 1659: Spanish, Papal, Austrian, Venetian, Sabaudian and Independent.17 In columns (13) and (14) of Table B.5.14, we include these variables as additional controls; the omitted category is for provinces under Spanish domination. As expected, medieval Austrian administration in northern Italy is associated with lower corrup- tion today, and being in the Papal states at that time is not associated with higher corruption than southern territories under the Spanish rule. However, consanguinity remains significant in the regressions.

17The Papal and Spanish dominations in [73]’s map are roughly the same regions with the Papal states and the Kingdom of Sicily in [214]’s map. However, contrary to [214]’s map, there are no missing regions in [73]’s map. In addition, [73]’s mapping of the missing regions in [214]’s map, and of the Papal states is consistent with the sources used for [214]’s map, such as [149].

198 (7) (8) (9) (10) (11) (12) (13) (14) South and and geography Family ties and geography Family types and geography Dominations and geography VARIABLES islands (Middle Ages) (Middle Ages)

Consanguinity 4.267** 4.670* 4.746*** 4.411* 5.508*** 5.099* 5.209*** 5.383* (1.681) (2.527) (1.227) (2.536) (1.227) (2.687) (1.397) (2.756) South and Islands 0.348 -0.247 (0.352) (0.399) Family ties 0.032 -0.005 (0.029) (0.040) Communitarian family 0.427** 0.459 (0.197) (0.332) Egalitarian nuclear family 0.091 0.405 (0.196) (0.359) Papal 0.381 0.232 (0.269) (0.329) Austrian -0.636*** -0.810* (0.240) (0.483) Venetian -0.121 -0.108 (0.253) (0.309) Sabaudian -0.125 0.129 (0.275) (0.333) Independent 0.065 0.109 (0.249) (0.260) Share of agriculture 2.325 0.425 2.149 0.415 3.608 0.828 3.534 0.944 (3.622) (3.361) (3.741) (3.531) (3.204) (3.151) (3.348) (3.169) Log population 0.017 -0.051 0.032 -0.051 0.024 -0.019 0.021 -0.028 (0.116) (0.114) (0.113) (0.114) (0.117) (0.122) (0.118) (0.119) Latitude -0.326** -0.285 -0.217 -0.248 (0.160) (0.197) (0.160) (0.181) Mean Temperature -0.128** -0.121* -0.131** -0.127* (0.061) (0.061) (0.065) (0.069) Mean precipitation -4.594 -4.306 -2.925 -3.706 (4.294) (5.131) (4.574) (5.136) Suitability for agriculture 0.387 0.380 0.481 0.453 (0.537) (0.502) (0.563) (0.643) Distance to coast -0.002 -0.002 -0.004 -0.003 (0.002) (0.002) (0.003) (0.003) Elevation 0.030 0.026 0.033 0.032 (0.056) (0.054) (0.052) (0.058) Slope 0.721 0.621 0.649 0.919 (0.905) (0.886) (0.862) (0.887) Ruggedness -2.348 -2.022 -2.114 -2.953 (2.816) (2.744) (2.685) (2.752) Constant 0.855 16.355** -1.154 14.800 0.500 10.638 0.739 12.441 (1.525) (7.566) (2.017) (10.818) (1.562) (7.923) (1.612) (8.436)

Observations 101 101 101 101 101 101 101 101 R-squared 0.440 0.505 0.444 0.502 0.451 0.512 0.455 0.517 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.14: Regression analysis of the relationship between consanguinity and corruption in Italy: controlling for family structure and political history.

Data collection dates. In Table B.5.15 we vary the cutoff dates for defining the consanguinity rate in Italy. The results are robust to restricting attention to more recent consanguinity estimates.

Alternative corruption metric. Figure B.5.4 displays Golden and Picci’s (2005) measure of corruption based on infrastructure at the province level alongside the consanguinity rate (see Appendix B.3 for details on the data). Here a higher score is associated with lower corruption (in contrast to our baseline measure which uses the number of crimes). This alternative measure of corruption is even more highly correlated with consanguinity than our primary measure (Spearman’s ρ = −0.63, p-value < 0.001, N = 90). Moreover, Table B.5.16 reports replications of our province-level regressions from Tables 3.6 and 3.7 in the body of the paper using this alternative measure of corruption. Consanguinity remains a robust predictor of corruption in all specifications. However, as [108] note, their province-level ratios are not cost-adjusted due to lack of data (the region-level ratios are cost-adjusted), and this is why we focus our main analysis on associative crime. Finally, note also that our measures of share of agriculture in GDP and population come from the period 2000-2013 (and hence from a period after Golden and Picci’s data on corruption). We do not have access to earlier province-level statistics on GDP or population, so the reliability of these specifications depends on

199 (1) (2) (3) (4) Consanguinity sample Consanguinity sample Consanguinity sample Consanguinity sample VARIABLES of basic model (1945-1964) restricted to 1950-1964 restricted to 1955-1964 restricted to 1960-1964

Consanguinity 5.144*** 5.504*** 5.701*** 5.950*** (1.099) (1.178) (1.222) (1.308) Share of agriculture 3.472 3.195 3.014 2.749 (3.243) (3.280) (3.323) (3.336) Log population 0.022 0.021 0.023 0.022 (0.115) (0.113) (0.110) (0.110) Constant 0.763 0.820 0.822 0.872 (1.505) (1.479) (1.444) (1.446)

Observations 101 101 101 101 R-squared 0.430 0.436 0.440 0.438 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.15: Robustness check for date of data collection, Italy. the assumption that provincial GDP (and share of agriculture therein) and population in 2000-2013 is highly correlated with the same in 1954-1997, which seems plausible.

Figure B.5.4: Alternative measure of corruption from Golden and Picci (2005) and consanguin- ity in Italy: province level. Darker colored regions are more corrupt. Grey colored areas indicate missing data.

200 (1) (2) (3) (4) (5) (6) Basic model and Consanguinity Income instead of Share of agriculture both Income and without Income and VARIABLES Share of agriculture as instrument Share of agriculture Share of agriculture

Consanguinity -2.283*** -2.323*** -2.307*** -2.312*** -2.216*** (0.359) (0.343) (0.388) (0.363) (0.318) Share of agriculture -4.646* 1.058 -0.239 (2.352) (2.649) (2.985) Log population -0.121 -0.062 -0.110 -0.103 -0.113 -0.068 (0.079) (0.072) (0.084) (0.119) (0.091) (0.071) Log value added per capita -0.066 -0.056 -0.068 (0.067) (0.138) (0.076) Constant 2.850*** 2.162** 2.583** 2.536** 2.620** 2.268** (1.052) (0.967) (1.015) (1.207) (1.091) (0.942)

Observations 90 90 90 90 90 90 R-squared 0.042 0.275 0.282 0.281 0.282 0.274 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1 (1) (2) (3) (4) (5) (6) Basic model and Consanguinity Income instead of Share of agriculture both Income and without Income and VARIABLES Share of agriculture as instrument Share of agriculture Share of agriculture

Consanguinity -1.703** -1.685** -1.770** -1.749** -1.411* (0.853) (0.823) (0.835) (0.834) (0.799) Share of agriculture 2.209 3.326 1.049 (2.880) (3.020) (3.493) Log population -0.125 -0.099 -0.196** -0.219* -0.181* -0.120* (0.079) (0.073) (0.083) (0.118) (0.093) (0.071) Log value added per capita -0.119* -0.156 -0.107 (0.065) (0.135) (0.078) Latitude 0.182** 0.158** 0.157** 0.161** 0.160** 0.141* (0.073) (0.074) (0.076) (0.078) (0.077) (0.076) Mean Temperature -0.003 -0.002 0.000 0.001 0.000 -0.003 (0.035) (0.034) (0.035) (0.036) (0.035) (0.035) Mean precipitation 1.939 4.317* 4.588* 4.838* 4.673* 3.786* (2.399) (2.313) (2.342) (2.433) (2.353) (2.214) Suitability for agriculture -0.016 -0.099 -0.084 -0.079 -0.085 -0.100 (0.443) (0.451) (0.434) (0.433) (0.440) (0.444) Distance to coast 0.001 0.002 0.002 0.002 0.002 0.002 (0.002) (0.002) (0.002) (0.002) (0.002) (0.002) Elevation 0.049 0.037 0.034 0.032 0.034 0.042 (0.047) (0.049) (0.053) (0.053) (0.052) (0.049) Slope -0.774* -1.158** -1.096** -1.127** -1.137** -0.994** (0.405) (0.453) (0.438) (0.441) (0.446) (0.439) Ruggedness 2.247* 3.483** 3.275** 3.377** 3.410** 2.947** (1.217) (1.375) (1.332) (1.337) (1.353) (1.332) Constant -5.214 -4.504 -3.509 -3.555 -3.855 -3.361 (3.474) (3.462) (3.471) (3.485) (3.610) (3.484)

Observations 90 90 90 90 90 90 R-squared 0.374 0.394 0.407 0.405 0.408 0.384 Robust standard errors in parentheses *** p<0.01, ** p<0.05, * p<0.1

Table B.5.16: Replication of Tables 3.6 and 3.7 using the corruption measure from Golden and Picci (2005).

201 Region-level data

To provide further evidence for Italy, we collected data on corruption and consanguinity in 20 regions of Italy. We use multiple available measures of corruption for Italian regions: (i) number of corruption crimes (such as peculation, malversation and bribery) per 100,000 inhabitants used in previous studies [e.g. see 70, 71, 88, 44], (ii) the ratio of existing public infrastructure (in 1998) to expected infrastructure given past government spending (from 1954-1997) computed by [108], (iii) the European Quality of Government Index (EQI) computed from survey data by [58], and (iv) a measure of regional institutional performance from [214]. Note that the latter three measures assign higher scores to less corrupt regions, in contrast to the measures based on crime statistics. Overall we find a highly significant correlation between consanguinity and each of our corruption measures for Italy (i) crimes (Spearman’s ρ = 0.59, p-value = 0.001, N=20) (ii) infrastructure (Spearman’s ρ = -0.79, p-value < 0.001, N=20), (iii) EQI (Spearman’s ρ = -0.81, p-value < 0.001, N=20), and (iv) institutional performance (Spearman’s ρ = -0.91, p-value < 0.001, N=20). Moreover, each of these measures is strongly and negatively correlated with province-level associative crime, providing further evidence that associative crime is a reasonable proxy for corruption (Spearman’s ρ < −0.43, all p-values ≤ 0.05).

202 B.6 Experiment materials and procedures

B.6.1 Experiment instructions

You are now participating in a decision making experiment. At the end of the experiment, you will be paid in cash based on your decisions. Please read the instructions carefully so you understand clearly how your payoff is determined. Please do not talk to other participants. If you have any questions, raise your hand and the experimenter will answer them privately.

This experiment consists of only one round where you will make a decision in a three-person scenario. You will be assigned one of three possible roles in this scenario: A, B and C. Your role will be determined randomly.

In addition to your $7 show up payment, you will earn Experimental Currency Units (ECU) during the experiment based on your decision. 10 ECU is worth $1. At the end of the experiment, we will convert your earnings to from ECU to dollars.

Figure 1

You will see a graph similar to Figure 1 during the experiment. The graph will help you to see the possible outcomes of the three-person scenario based on your and other participants’ decisions.

Little blue circles show which person is making a choice.

Two black lines exiting from each circle show the choices available to the person choosing. For example, the top circle and its lines show that Person A can choose Choice 1 or Choice 2.

203 Little blue squares show all possible outcomes of the scenario and the payoffs for each person are shown by A= ... B= ... C= ...

Payoffs are determined based on the decisions of Person A and Person B. As you can see Person C has no decision to make.

The scenario:

You will participate in a three-player scenario. This scenario is shown in the figure. Participants in the roles of A and B start with an initial endowment of 100 ECU and C starts with 160 ECU. The scenario works as follows:

Stage 1

Stage 1: At the first stage, A decides whether to choose "Transfer" or "Not-transfer".

- If A chooses "Not-transfer", the round ends immediately, and all participants’ final payoffs will be equal to their initial endowment (i.e. A=100, B=100, C=160).

- If A chooses to "Transfer" 40 ECU to B, his/her endowment will be reduced by 5 ECU. Whether 40 ECU will actually be transferred to B or not and the final payoffs will depend on choices made by B at the next stages.

204 Stage 2

Stage 2: Assuming that A has transferred 40 ECU, B chooses "Accept" or "Reject".

- If B chooses "Reject", the round ends immediately. The final payoff of A will be 100-5=95, and the fi- nal payoff of B and C will be their initial endowments.

- If B chooses "Accept", 40 ECU will be deducted from A’s endowment and will be added to B’s endowment. Then, the experiment moves to stage 3.

Stage 3: B who has accepted the transfer, now decides whether to choose "Right" or "Left".

- If B chooses "Left", the round ends immediately and the payoffs of participants are as follows:

A receives the initial endowment minus 5 ECU, minus the amount of transfer = 100 - 5 - 40 = 55 ECU. B receives the initial endowment plus the transfer = 100 + 40 = 140 ECU. C receives the initial endowment = 160 ECU.

- If B chooses "Right", 5 ECU will be deducted from B, 105 ECU will be deducted from C, but 105 ECU will be added to A. Then the round ends, and the payoffs of participants are as follows:

A receives the initial endowment minus 5 ECU, minus the amount of the transfer, plus 105 = 100 - 5 - 40 + 105 = 160 ECU. B receives the initial endowment plus the transfer, minus 5 = 100 + 40 - 5 = 135 ECU. C receives the initial endowment minus 105 = 160 - 105 = 55 ECU.

205 Stage 3

206 B.6.2 Recruitment survey

Vancouver

Our online pre-experiment questionnaire in Vancouver included the following questions. Pre-experiment Questionnaire

Please enter your email:

1. How old are you? ◦ 18-22 ◦ 23-25 ◦ 26-30 ◦ more than 30

2. What is your field of study?

3. What is your gender? ◦ Male ◦ Female

4. Which country are you born in?

5. Are you a Canadian Citizen? ◦ Yes ◦ No

6. What is your PRIMARY ethnic origin? Aboriginal, African, Arab, Caribbean, Chinese, Dutch, English, French, German, Indian, Iranian, Irish, Italian, Jewish, Norwegian, Polish, Portuguese, Russian, Scottish, Spanish, Swedish, Ukrainian, Welsh, Other

If you chose "other", please specify:

7. Please specify time slots (as many as you can) when you are available this semester for the experiment at [lab address at UBC/SFU].

8. Would you like to participate in our experiment with a parent or sibling (18 or older) in a public location (such as Starbucks, Tim Hortons, McDonalds, etc) around your neighbourhood? ◦ Yes. ◦ No

If “Yes", which family member of yours (parent or sibling) will participate in the ex- periment? How old is she/he?

and please specify the name and postal code of a public location.

207 Urmia

Our in-paper pre-experiment Persian questionnaire in Iran, titled “Pre-research Questionnaire", included the following questions. After we finished collecting data for our kin treatments, we just kept questions 1-7. Pre-research Questionnaire

1. Choose your age group? ◦ 18-22 ◦ 23-25 ◦ 26-30 ◦ more than 30 2. Choose your gender: ◦ Male ◦ Female 3. Choose your occupation group: ◦ Student ◦ Private sector ◦ Government sector ◦ Unemployed 4. Choose your ethnicity: (written in Persian alphabetical order) ◦ Beluch ◦ Azeri Turk ◦ Arab ◦ Persian ◦ Kurd ◦ Gilak ◦ Lur ◦ Other 5. What is your latest degree (or current level of study, if student)? ◦ High school diploma ◦ Kardani [2-year university or college degree] ◦ Karshenasi (B.A) ◦ Karshenasi Arshad (M.A) 6. Which university did/do you study? ——————— 7. What is your field of study? ——————— 8. Would your brother or sister also like to participate in the research? (in this case, both you and your sibling will be separately paid around 20000 Tomans for your participation in the research session) ◦ No ◦ Yes. My brother would like to participate ◦ Yes. My sister would like to participate If your answer to question 7 is "Yes", please enter your brother or sister’s age group: ◦ 18-22 ◦ 23-25 ◦ 26-30 ◦ more than 30 If your answer to question 7 is "Yes", please enter your brother or sister’s occupation group: ◦ Student ◦ Private sector ◦ Government sector ◦ Unemployed 9. To be informed of location, dates and hours of the research sessions, please enter your cell phone number. ———————

208 B.6.3 Post-experiment questionnaire

The following are the list of questions from post-experiment questionnaire. Questions 1 and 2 are the “Harm" and “Ingroup" questions from the Moral Foundations questionnaire [118]. Questions 3-5 are used to measure “family ties" and are drawn from the World Values Survey. Questions 11, 12, 13, 15 and 16 were not included in our experiments in Iran. The variables were coded numerically as indicated below.

Q1. When you decide whether something is right or wrong, to what extent are the following considera- tions relevant to your thinking? Please rate each statement using this scale:

0 1 2 3 4 5 not at all relevant not very relevant slightly relevant somewhat relevant very relevant extremely relevant

• Whether or not someone suffered emotionally • Whether or not someone cared for someone weak or vulnerable • Whether or not someone was cruel • Whether or not someone was good at math • Whether someones action showed love for his or her country • Whether or not someone did something to betray his or her group • Whether or not someone showed a lack of loyalty

Q2. Please read the following sentences and indicate your agreement or disagreement:

0 1 2 3 4 5 Strongly Disagree Moderately Disagree Slightly Disagree Slightly Agree Moderately Agree Strongly Agree

• Compassion for those who are suffering is the most crucial virtue. • One of the worst things a person could do is hurt a defenseless animal. • It can never be right to kill a human being. • I am proud of my country’s history. • People should be loyal to their family members, even when they have done something wrong. • It is better to do good than to do bad. • It is more important to be a team player than to express oneself.

Q3. How important is family in your life? (3) Very important (2) Rather important (1) Not very important (0) Not at all important

Q4. Which of two statements do you agree more? • (1) Regardless of what the qualities and faults of one’s parents are, one must always love and respect them. • (0) One does not have the duty to respect and love parents who have not earned it.

Q5. Which of two statements do you agree more? • (1) It is the parents’ duty to do their best for their children even at the expense of their own well-being. • (0) Parents have a life of their own and should not be asked to sacrifice their own well-being for the sake of their children.

Q6. Which of two statements do you agree more?

209 • (1) Regardless of what the qualities and faults of one’s elderly relatives (grand-father and -mother, uncles and aunts) are, one must always love and respect them. • (0) One does not have the duty to respect and love elderly relatives who have not earned it.

Q7. Generally speaking, would you say that most people can be trusted or that you need to be very care- ful in dealing with people? (1) Most people can be trusted (0) Need to be very careful

Q8. How much do you trust . . .

(3) Trust completely (2) Somewhat (1) Not very much (0) No trust at all

• your extended family? • people of same ethnicity? • people you meet for the first time? • your family?

Q9. To which of these groups would you say you belong first? (0) Your country (1) Your ethnicity

Q10. How important is for you that your sibling would not marry someone of a different ethnicity? (3) Very important (2) Rather important (1) Not very important (0) Not at all important

Q11. How important is for you to not have people of a different ethnicity as neighbor? (3) Very important (2) Rather important (1) Not very important (0) Not at all important

Q12. Do you agree that ethnic diversity erodes a country’s unity? (1) Agree (0) Disagree

Q13. How many of your friends have the same ethnicity as yours? (4) All of them (3) Most of them (2)About half of them (1) A few of them (0) None of them

Q14. People have different views about themselves and how they relate to the world. How strongly do you agree or disagree with each of the following statements about how you see yourself?

(3) Strongly agree (2) Agree (1) Disagree (0) Strongly disagree • I see myself as a/an [ethnicity]. • I see myself as a/an [Nationality]. • I see myself as a world citizen.

Q15. In your opinion, what are the odds that someone in Vancouver pay a bribe to get a job? (2) more than 50 percent (1) 50 percent (0) less than 50 percent

Q16. In your opinion, what are the odds that someone in Vancouver gets punished if he/she pays a bribe to get a job? (0) more than 50 percent (1) 50 percent (2) less than 50 percent

Q17. Some people have a stronger sense of belonging to some things than others. How strong is your sense of belonging to . . .

Not strong at all Very strong

210 1 2 3 4 5

• your family? • your extended family? • your ethnicity? • your country?

Q18. Do you know any cousins who are married to each other? (1) YES (0) NO

Q19. Suppose your parent hit a pedestrian while exceeding maximum speed. You are the only witness and police knows that. The only way to save your parent from the serious consequences is that you lie that he/she was not exceeding maximum speed. Would you lie to the police? (1) YES (0) NO

Survey results

For completeness, below we report the survey results from the post-experiment questionnaire, indicating whether there are significant differences between the responses in the two countries via Wilcoxon Rank-Sum tests. Note the sharp difference in the familiarity with cousin marriage (Q18), as well as differences in the WVS questions underlying studies of family ties (Q3 - Q5).

Question Canada Iran p-value Question Canada Iran p-value Q1.1 3.37 3.1 0.021 Q8.2 1.67 1.84 0.005 Q1.2 3.15 2.79 0.004 Q8.3 1.41 1.06 <0.001 Q1.3 3.47 3.32 0.372 Q8.4 2.72 2.91 <0.001 Q1.4 1.17 1.98 <0.001 Q9 0.5 0.33 0.248 Q1.5 1.61 3 <0.001 Q10 0.49 1.22 <0.001 Q1.6 3.15 3.79 <0.001 Q11 0.29 Q1.7 3.21 3.6 0.009 Q12 0.19 Q2.1 3.53 3.7 0.026 Q13 2.31 Q2.2 3.72 4.09 <0.001 Q14.1 1.86 1.89 0.444 Q2.3 3.06 3.9 <0.001 Q14.2 2.23 2.52 <0.001 Q2.4 3.15 3.82 <0.001 Q14.3 2.17 2.44 <0.001 Q2.5 3.03 3.81 <0.001 Q15 0.33 Q2.6 4.46 4.57 0.029 Q16 1.16 Q2.7 2.74 3.75 <0.001 Q17.1 3.34 3.18 0.03 Q3 2.53 2.7 0.007 Q17.2 2.21 2.06 0.129 Q4 0.56 0.91 <0.001 Q17.3 2.17 2.06 0.327 Q5 0.68 0.45 <0.001 Q17.4 2.59 2.86 0.002 Q6 0.5 0.72 <0.001 Q18 0.04 0.86 <0.001 Q7 0.49 0.15 <0.001 Q19 0.45 0.36 0.072 Q8.1 2.07 2.05 0.929 N Obs. 199 188 199 188

Table B.5.16: Responses to post-experiment survey questionnaire, by country.

B.6.4 Experiment procedure in Urmia and Tehran

Our experiments in Iran were conducted with collaboration of the Moaser Research Center18 which possesses a permit from the Ministry of Science, Research and Technology to conduct research in Economics and Management. The center took the full responsibility of planning, ethical review and official approvals to run

18mem.ac.ir

211 experiments in Iran. We also had ethics approval for the Iranian experiment from Simon Fraser University’s Office of Research Ethics.

Kin treatments in Urmia

Recruiting: We recruited from undergraduate students who were registered in summer semester courses at various colleges and universities in Urmia. We hired local research assistants (a teacher and some stu- dents) to help us with the recruiting for kin treatments. Our research assistants went to different colleges and universities throughout the city and invited students to participate in the experiments and earn cash. Those who were interested to participate in the experiment were given forms titled “pre-research questionnaire" in Persian, translated in Appendix B.6.2. Based on their self-reported background, we chose subjects who were i) students, ii) less than 30 years old, iii) Azeri or Kurdish. Then our assistants contacted and asked them to choose one of the scheduled experiment sessions to attend. Those who had siblings interested in participating in the experiment (answered question 7 with “Yes") and whose siblings were i) students and ii) less than 30 years old, were invited in pairs to play in the roles of two players with kin relation in our kin treatments. Subjects who participated alone played the stranger role in our kin treatments.

Participants: In total 180 subjects participated in the kin treatments of our experiment (60 participants for each kin treatment; KKS, KSK, SKK). From 120 subjects in the roles of Person A and B, half were Azeri Turk and Half were Kurd. 57 of these subjects were female and 63 of them were male. Recruiting student siblings below 30 years old during our two months conducting experiments in Urmia was achievable for some reasons; (i) in Iran, most of the undergraduate students spend their summers with their families in their hometowns, because most of the universities and colleges in Iran do not offer dormitories to undergraduate students during summer semesters. Those who want to take summer courses can take general courses in local universities and colleges as a “guest student" in their hometown, (ii) youth mostly live in the same place with their families until they get married, (iii) the age of marriage in Iranian metropolises is relatively high (23-4 for women, 27-8 for men), (iv) the show-up fee (7000 Tomans for each, i.e. 14000 for a sibling) covers costs of commuting to the experiment location from all areas of the city by chartered cab (max 5000 Tomans for a one-way trip, i.e. 10000 Tomans for commuting of a sibling pair), (iv) Higher education is a trend in today’s Iran and there are variety of public and private colleges and universities. Many youth between 18-24 in cities are studying for a bachelor or master degree in some college or university.

Conducting experiment sessions: We ran the kin treatment sessions in a high school with the official approval of the West Azerbaijan province’s Ministry of Education in June-July 2015. In the high school, we had access to three classrooms with twelve seats in each class. We labeled the classrooms with Persian letters “Aleph", “Be", “Jim" (same as A, B, C in English) and labeled seats with numbers 1 to 12. Subjects in the roles of Person A, Person B and Person C were directed to classes “Aleph", “Be" and “Jim" respectively. Each subject was matched with the other participants seating in the seats with the same numbers in the other two classrooms. The number of subjects in each session varied between 12 (4 in each class) and 24 (8 in each class). Sessions lasted around an hour. Experiments were run using pen-and-paper. All forms and booklets were anonymous and include only the class and seat number, except for the consent form and receipt, on which participants printed their full name, signed and dated. Subjects had been asked to bring their National and Student ID Cards. A National ID Card includes full name, the date of birth and the name of father which allowed us to check whether pairs were actually siblings. Upon arrival, we handed subjects forms asking their background information (with the same questions 1-7 of pre-research questionnaire). After subjects filled the background information forms and signed consent

212 forms, we handed them the instruction booklets, which were the same as in Appendix B.6.1 but translated to Persian, and with converted currency (see payments section). They were not allowed to talk to each other during experiments. They were asked to raise their hand to ask questions, and these were answered privately by experimenters. Two experimenters managed the “Aleph" classroom (subjects in the roles of Person A) and the “Be" classroom (subjects in the roles of Person B) and an assistant attended to the “Jim" classroom with the subjects in the passive role (C). After reading instructions, subjects received a decision booklet in three pages. The first page read (in Persian):

Decision booklet

Please note that:

I. Participants in roles of A and B make choices at the same time. We will match their choices to determine the final outcome. You will see the outcome and your final payoff at the end of the experiment.

II. You might observe some background information from the pre-experiment question– naire about participants in the other roles.

page 1

The second page of the decision booklet presented the information to the subjects about the other people in the three-person scenario (their kin and a stranger). About non-kin they saw only their age group, written as “18-30 years old" which was true about all subjects. This information was the same for all kin and ethnic treatments, therefore introduced no noise in the experiment. Below is the second page of decision booklet of player A in the KKS treatment who has a sister in the role of player B. The third page displayed the game tree, a question asking which action the player would like to take, and a box asking subjects to explain their choice. Below is the third page as seen by Person B in a KKS treatment.

213 Your Role: Person A

The information provided to you and other participants:

Person B’s background information Person C’s background information

Your sister 18-30 years old

Person B observes the same type of Person C observes the same type of information about you. information about you.

page 2

Your Role: Person B

Suppose Person A decided to Transfer 40 ECU to you. Please choose one of the following options:

◦ Accept, then Right ◦ Accept, then Left ◦ Reject

page 3

After collecting the decision booklets, we distributed post-experiment questionnaires. While subjects were answering questionnaires, we matched the decisions of players in roles A and B and wrote down their payoffs. Subjects remained seated until they got paid with cash in an envelope. The collected data was converted to a

214 computer file after experiment sessions, and all the forms and booklets filled by subjects during experiments are stored.

Payments: In the kin level experiment, the minimum total payment was 13000 Tomans, the maximum total payment was 23000 Tomans and average payment to 180 subjects was 19000 Tomans which is equal to about 7 CAD based on free market rates.

Ethnic treatments in Urmia

We ran our ethnic treatments in the Faculty of Literature and Humanities of Urmia University in August 2015 with an approval from university officials. Urmia University is a public university and the oldest and largest university in the city. In summer 2015, the university offered general courses and also a range of courses in different fields of study (such as engineering and economics), but all classes were concentrated in the Faculty of Literature and Humanities.

Participants: The total number of subjects in the ethnic treatments was 180 (60 participants for each eth- nic treatment; CCS, CSC, SCC). All participants were undergraduate students (except few medical students taking general courses) and were below 30 years old. From 120 subjects in the roles of Person A and B, 80 were Azeri Turk and 40 were Kurd. 46 of these subjects were female and 74 of them were male.

Recruiting, conducting experiment and payments: We had access to three classrooms with 20 seats each in the Faculty of Literature and Humanities. As in the kin treatment sessions, we labeled the rooms (“Aleph", “Be" and “Jim") and seats (from 1 to 20), and subjects played the three-person scenario with two other subjects sitting in the other classrooms in seats with the same numbers. The number of subjects in each session varied between 30 (10 in each class) and 60 (20 in each class). Each session lasted about an hour. To recruit from students in the Faculty of Literature and Humanities, we prepared a form including questions 1-7 of the pre-research questionnaire used in the kin treatment sessions. We printed 60 of these forms and wrote a number (1 to 60) in two corners of each form. One corner of the form with a number on it was cut and could be tear apart easily. At the beginning of the classes, we asked instructors to give us 5 minutes to invite students to our experiment. After a brief introduction, we handed them the forms and told them to tear apart the number in the corner of the form in order to participate in the experiment right after their class and provided them with the room number of class “Jim". We had 1.5 hour to assign roles and seat numbers to the numbers on pre-experiment questionnaire forms. After students were dismissed from their classes, they started to show up and were directed to their seats based on the numbers in their hands. Payments were the same as in the kin treatments, with an average payment of about 19111 Tomans. Exper- iments were conducted exactly the same as the kin treatments except that the second page of the decision booklets presented the information of a co-ethnic and a stranger player. Below is the second page of decision booklet for Person A in CCS treatment.

215 Your Role: Person A

The information provided to you and other participants:

Person B’s background information Person C’s background information

18-30 years old 18-30 years old Azeri Turk

Person B observes the same type of Person C observes the same type of information about you. information about you.

page 2

Additional experiments in Tehran

For the robustness checks in Iran, we ran experiments in Tehran with strangers in both low and high effort cost variants, and with siblings, cousins and friends in the high cost variant. Tehran is the capital city of Iran and is highly ethnically diverse. The results of the experiments with strangers and siblings are not distinguishable from those in Urmia which indicates that our results are good representative of the urban society in Iran. The game tree below was presented to subjects in the high cost variant of the experiment.

The experiments in Tehran took place in August 2016 prior to a meeting of the Tehran Thought Club. The club hosts young adults (mostly students) interested in social sciences for various presentations and discus- sions. We collected the data from 12 triplets per treatment in our experiments in Tehran with almost the same procedure in Urmia except that in pre-experiment questionnaire we added choices to participate in the ex- periment with a close friend or a cousin. We specified that those who participate in the experiment as close friends should not be relatives.

216 Appendix C

Origins of Weird Psychology

See Online Supplementary Materials for for Origins of WEIRD Psychology: https://psyarxiv.com/d6qhu/

217