<<

Page 1 of 31 American Journal of Physical Anthropology

1 2 3 SURNAMES IN 4 5 A study of the population of Chile through isonymy 6 I. Barrai, A. Rodriguez-Larralde 2, J. Dipierri 1, E.Alfaro 1, N. Acevedo 3, 7 8 E. Mamolini, M. Sandri, A.Carrieri and C. Scapoli. 9 10 Dipartimento di Biologia ed Evoluzione, Università di Ferrara, 44121- Ferrara, Italy 11 1Instituto de Biología de la Altura, Universidad Nacional de Jujuy, 4600 – San Salvador De Jujuy, 12 13 Argentina. 14 2 15 Centro de Medicina Experimental, Laboratorio de Genetica Humana, IVIC, 1020A -Caracas, 16 Venezuela. 17 18 3Museo Nacional de Ciencias Naturales, , Chile 19 20 21 Running title: Surnames in Chile 22 23 24 25 26 Correspondence to: 27 Chiara Scapoli 28 Department of Biology and Evolution 29 30 University of Ferrara, 31 Via L. Borsari 46, - I-44121 Ferrara, Italy. 32 Telephone: +39 0532 455744; FAX: : +39 0532 249761 33 Email: [email protected] 34 35 36 Number of text pages: 15 37 Literature pages: 4 38 39 Number of Tables : 2 40 41 Number of Figures: 7 42 43 44 KEYWORDS : Chile, Population Structure, Isonymy, Inbreeding, Isolation by distance 45 46 47 ACKNOWLEDGMENTS: The authors are grateful to the Director of the Servicio Electoral de la 48 49 Republica de Chile Sr. Juan Ignacio Garcia Rodríguez, who made the data available, and to Sr. 50 51 Dr.Ginés Mario Gonzalez Garcia, Embajador de la Republica Argentina en Chile. The work was 52 supported by grants of the Italian Ministry of Universities and Research (MIUR) to Chiara Scapoli. 53 54 The authors take pleasure in dedicating this work to Dr. Juan Pinto Cisternas of Valparaiso, a 55 56 pioneer of isonymy studies. 57 58 59 60 John Wiley & Sons,1 Inc. 10

8

6

4

Log(frequency)

2

0

-2 -2 0 2 4 6 8 10 12 14 Log(occurrence) Figure S1. Frequency of a given occurrence of a surname as a function of its occurrence. 8.1 million paternal surnames, Chile 2006. Bilogarithmic scale.

12

10

8

6

4

Log(frequency)

2

0

-2 -2 0 2 4 6 8 10 12 14 Log(occurrence)

Figure S2. Frequency of a given occurrence of a surname as a function of its occurrence. 8.1 million maternal surnames, Chile 2006. Bilogarithmic scale.

0.028 0.026 0.024 0.022 0.020 0.018 0.016 0.014 0.012 0.010 0.008 0.006 0.004 0.002 0.000 -0.002 0.004 0.008 0.012 0.016 0.020 0.024 0.028 FST 0.006 0.010 0.014 0.018 0.022 0.026 0.030 FIS FIT Figure S3. The components of inbreeding levels in 54 of Chile. Local inbreeding seems to be the largest component, whereas random inbreeding tends to stay constant. Inbreeding from isonymy, 16 million surnames, Chile 2006.

6.6

6.4

6.2

6.0

5.8

5.6

5.4

5.2

LAS 5.0 LASP -500 0 500 1000 1500 2000 2500 3000 3500 4000 LASM CENTER

Figure S4. Variation of Lasker’s distance on kilometers, Chile 2006. The belts are one standard deviation wide.

1.8

1.6

1.4

1.2

1.0

0.8

0.6

0.4

0.2

0.0

NEI -0.2 NEIP -500 0 500 1000 1500 2000 2500 3000 3500 4000 NEIM CENTER

Figure S5. Variation of Nei’s distance on kilometers, Chile 2006. The belts are one standard deviation wide.

4

3

COQUIMBO ATACAMA LOS LAGOS 2 MAGALLANES AISEN 1 Y PARINACOTA TARAPACA

0 LOS RIOS

Fact. 2: 20.35% VALPARAISO -1

LIBERTADOR ARAUCANIA

-2 MAULE METROPOLITANA BIOBIO

-3

-4 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 Fact. 1: 41.25%

Figure S6P. Projection of the Euclidean distance matrix between on the first two Factors of PCA. Regions result geographically ordered in a concave arc from North to South.

Figure S6M. Projection of the Euclidean distance matrix between regions on the first two Dimensions of MDS. Regions result geographically ordered in a convex arc from North to South.

UPGMA Euclidean Distance 0.55

0.50

0.45

0.40

0.35

0.30

0.25

0.20

AISEN

MAULE

BIOBIO

LOS RIOS LOS

ATACAMA

TARAPACA

COQUIMBO

LOS LAGOS LOS

ARAUCANIA

VALPARAISO

LIBERTADOR

MAGALLANES

ANTOFAGASTA

METROPOLITANA

ARICA Y PARINACOTA Y ARICA

Figure S7. Dendrogram based on the matrix of Euclidean surname distances between regions. Note the strict North-South clustering from Tarapaca to Aisen.

Figure S8. The .

12

10

8 Limarí Isla de Pascua

ChoapaHuasco 6 Chañaral Parinacota Elqui Tamarugal TocopillaEl Loa Copiapó 4 Antofagasta

2 San FelipeQuillotaArica de Aconcagua Los AndesCardenal Caro CauquenesAntartica ChilenaCapitanPalena Prat Chiloé Fact. 2: 20.77% Fact. 0 General Carrera ColchaguaMelipilla Ultima Esperanza Aisén AraucoTierra RancodelOsorno Fuego ValparaísoSan AntonioCuricó Llanquihue -2 Linares ChacabucoCachapoalTalca Cautin Malleco Magallanes ÑubleBiobío Coihaique CordilleraSantiagoMaipo -4 Concepcion

-6

-8 -12 -10 -8 -6 -4 -2 0 2 4 6 8 10 12 14 Fact. 1: 38.90%

Figure S9P. Projection of the Euclidean distance matrix between provinces on the first two Factors of PCA. Note the counterclockwise arc ordering of the provinces in the second, third, fourth, and first quadrant. Note the outlier position of Isla de Pascua () in the first quadrant.

Figure S9M. Projection of the Euclidean distance matrix between provinces on the first two Dimensions of MDS. Note the counterclockwise arc ordering of the provinces in the third, second, first and fourth quadrant.

UPGMA Euclidean Distance 0.8

0.7

0.6

0.5

0.4

0.3

0.2

0.1

Elqui

Arica

Talca

Aisén

Ñuble

Maipo

Limarí

Chiloé Biobío Loa El

Cautin

Ranco Curicó

Palena

Arauco

Iquique

Linares

Osorno

Quillota

Huasco

Malleco

Petorca

Valdivia

Choapa

Melipilla

Copiapó

Santiago

Tocopilla

Chañaral

Cordillera

Talagante

Coihaique

Los Andes Los

Parinacota Colchagua Cachapoal Tamarugal

Llanquihue

Chacabuco

Magallanes

Cauquenes

Concepcion

Antofagasta

San Antonio San

Capitan Prat Capitan

Marga Marga Marga

Cardenal Caro Cardenal

Isla de Pascua de Isla

General Carrera General

Tierra del Fuego del Tierra

Antartica Chilena Antartica

Ultima Esperanza Ultima

San Felipe de Aconcagua de Felipe San

Figure S10. Dendrogram of the 54 provinces of Chile. Note the considerable North-South ordering of the provinces from Arica down to and Antartica Cilena. Complete Linkage Euclidean Distance 3.0

2.5

2.0

1.5

Distanza Legame 1.0

0.5

0.0 NNNNNNNNNNNSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSNNNNNNNNNNNNNSSSSSSSSSNNNNNNNNNNNNNNNSSSSSSNSNCHEN NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNSSSSSSSSSSSSSSSSSSSSSSSSSSSS NNNNNNNNNNNSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSNNNNNNNNNNNNNNSSSSSSSSSNNNNNNNNNNNNNNSSSSSSSSSNNNNNNNNNNNNNNNNNNNNNNNNNSNNNNNNNNNNNNNNNNNNSSSSSSSSSSSSSSSSSSSSSSSSSSSSS

Figure S11. The dendrogram from the Euclidean distance matrix between the 346 . (Label CHE means Chepica ; N = North; S = South)

Figure S12P. Projection of the Euclidean distance matrix between the 346 communes of Chile on the first two Factors of PCA. Note that the North (N) and the South (S) clusters are inverted in the Projection. Label CHE means Chepica municipality.

Figure S12M. Projection of the Euclidean distance matrix between the 346 communes of Chile on the first two Dimensions of MDS. Label CHE means Chepica municipality.

Figure S13. Mapping of Euclidean’s matrix of surname distances between departments in Chile on the first three components of Pca. The first component (a) green) represents 38.9% of the total dispersion, the second (b) blue) 20.8%, and the third component (c) red) 8.5%. Table S1. Isonymy in Chile. From: I. Barrai et al., to be submitted to: Am J Phys Anthropol, 2011. Municipality, average coordinates (Lat and Lon). Prefix: NIND = number of individuals; NSUR = number of different surnames; I = isonymy; ALFA = Fisher’α; NI = Karlin’s ν. Suffix: PA = paternal surname; MA = maternal surname; PM = both paternal and maternal surname. FST = random inbreeding FST; IOBS = total observed isonymy; FIS = local inbreeding FIS; FIT = total inbreeding FIT.

Table S2. The first 50 surnames in Chile, paternal and maternal series.

Paternal N Maternal N GONZALEZ 171266 GONZALEZ 172755 MUÑOZ 133998 MUÑOZ 136343 ROJAS 96090 ROJAS 96793 DIAZ 94912 DIAZ 95589 PEREZ 76643 PEREZ 76053 SOTO 68560 SOTO 70602 CONTRERAS 64057 CONTRERAS 64716 SILVA 61416 SILVA 60162 SEPULVEDA 58638 SEPULVEDA 58613 MARTINEZ 58330 MARTINEZ 58146 MORALES 57962 MORALES 57747 RODRIGUEZ 56353 LOPEZ 55210 LOPEZ 55418 RODRIGUEZ 55086 FUENTES 53657 FUENTES 53675 ARAYA 52596 TORRES 52383 TORRES 52393 ARAYA 52313 HERNANDEZ 52350 HERNANDEZ 52070 ESPINOZA 51132 FLORES 51562 VALENZUELA 50590 ESPINOZA 51042 FLORES 50462 VALENZUELA 50610 CASTILLO 49786 CASTILLO 49584 RAMIREZ 49316 RAMIREZ 49174 REYES 48353 REYES 48475 GUTIERREZ 46829 GUTIERREZ 46959 CASTRO 46104 CASTRO 46642 VARGAS 45640 VARGAS 46536 ALVAREZ 44468 VASQUEZ 45231 VASQUEZ 43960 ALVAREZ 44866 FERNANDEZ 42641 TAPIA 41821 TAPIA 42268 CARRASCO 40950 SANCHEZ 40822 SANCHEZ 40582 CORTES 40347 FERNANDEZ 40167 HERRERA 39925 CORTES 40100 CARRASCO 39811 GOMEZ 39858 GOMEZ 39795 HERRERA 39422 NUÑEZ 38325 NUÑEZ 39142 JARA 37672 JARA 38334 VERGARA 36579 VERGARA 37338 RIVERA 34321 FIGUEROA 34522 FIGUEROA 34072 RIVERA 34337 GARCIA 33385 RIQUELME 33967 RIQUELME 33114 VERA 33209 BRAVO 32903 MIRANDA 32747 VERA 32071 BRAVO 31986 MIRANDA 31696 GARCIA 31800 MOLINA 30464 MOLINA 30652 VEGA 30309 VEGA 30273 SANDOVAL 29194 CAMPOS 29762 CAMPOS 29116 SANDOVAL 28667 OLIVARES 28660 ORELLANA 28618