View metadata, citation and similar papers at core.ac.uk brought to you by CORE
provided by Repositori Institucional de la Universitat Jaume I
Título artículo / Títol article: Gender is a major factor explaining discrepancies in eye colour prediction based on HERC2/OCA2 genotype and the IrisPlex model
Autores / Autors Conrado Martínez-Cadenas, María Peña- Chilet, Maider Ibarrola-Villava, Gloria Ribas
Revista: Forensic Science International: Genetics
Versión / Versió: Versió post-print de l’autor
Cita bibliográfica / Cita bibliogràfica (ISO 690): MARTINEZ -CADENAS, Conrado, et al. Gender is a major factor explaining discrepancies in eye colour prediction based on< i> HERC2/OCA2 genotype and the IrisPlex model. Forensic Science International: Genetics, 2013, vol. 7, no 4, p. 453-460.
url Repositori UJI: http://hdl.handle.net/10234/86709
Our reference: FSIGEN 974 P-authorquery-v9
AUTHOR QUERY FORM
Journal: FSIGEN Please e-mail or fax your responses and any corrections to:
E-mail: [email protected]
Article Number: 974 Fax: +353 6170 9272
Dear Author,
Please check your proof carefully and mark all corrections at the appropriate place in the proof (e.g., by using on-screen annotation in the PDF file) or compile them in a separate list. Note: if you opt to annotate the file with software other than Adobe Reader then please also highlight the appropriate place in the PDF file. To ensure fast publication of your paper please return your corrections within 48 hours.
For correction or revision of any artwork, please consult http://www.elsevier.com/artworkinstructions.
Any queries or remarks that have arisen during the processing of your manuscript are listed below and highlighted by flags in the proof. Click on the ‘Q’ link to go to the location in the proof.
Location in Query / Remark: click on the Q link to go article Please insert your reply or correction at the corresponding line in the proof Q1 Please confirm that given names and surnames have been identified correctly. Q2 Please check the telephone number of the corresponding author, and correct if necessary. Q3 Please provide volume and page range for Refs. [11, 15]. Q4 Please provide author name for Ref. [21]. Q5 Please cite footnotes a, c, and d in Table 1. Q6 Please provide significance of bold given in Table 2. Q7 Please cite footnotes a and b in Table 2. Q8 Please cite footnotes aÐd in Table 3. Q9 Please cite footnotes aÐc in Table 4.
Please check this box or indicate your approval if you have no corrections to make to the PDF file
Thank you for your assistance.
G Model
FSIGEN 974 1–8
Forensic Science International: Genetics xxx (2013) xxx–xxx
Contents lists available at SciVerse ScienceDirect
Forensic Science International: Genetics
jou rnal homepage: www.elsevier.com/locate/fsig
1
2 Gender is a major factor explaining discrepancies in eye colour
3 prediction based on HERC2/OCA2 genotype and the IrisPlex model
a,1 b,1 b
4 Q1 Conrado Martinez-Cadenas , Maria Pen˜a-Chilet , Maider Ibarrola-Villava ,
b,
5 Gloria Ribas *
a
6 Department of Medicine, University of Castellon Jaume I, Castellon, Spain
b
7 Medical Oncology and Haematology Unit, Health Research Institute – INCLIVA, Valencia, Spain
A R T I C L E I N F O A B S T R A C T
Article history: In recent years, several studies have greatly increased our understanding of the genetic basis underlying
Received 23 January 2013
human eye colour variation. A large percentage of the eye colour diversity present in humans can already
Received in revised form 13 March 2013
be genetically explained, so much so that different DNA-based eye colour prediction models, such as
Accepted 17 March 2013
IrisPlex, have been recently developed for forensic purposes. Though these models are already highly
accurate, they are by no means perfect, with many genotype–phenotype discrepancies still remaining
Keywords:
unresolved. In this work we have genotyped six SNPs associated with eye colour (IrisPlex) in 535
Genetics
individuals from Spain, a Mediterranean population. Aside from different SNP frequencies in Spain
Pigmentation
compared to Northern Europe, the results for eye colour prediction are quite similar to other studies.
Eye colour
Irisplex However, we have found an association between gender and eye colour prediction. When comparing
Polymorphisms similar eye colour genetic profiles, females tend, as a whole, to have darker eyes than males (and,
Spanish population conversely, males lighter than females). These results are also corroborated by the revision and meta-
analysis of data from previously published eye colour genetic studies in several Caucasian populations,
which significantly support the fact that males are more likely to have blue eyes than females, while
females tend to show higher frequencies of green and brown eyes than males. This significant gender
difference would suggest that there is an as yet unidentified gender-related factor contributing to human
eye colour variation.
ß 2013 Published by Elsevier Ireland Ltd.
8
9 1. Introduction Human eye colour is a fully genetically determined trait with a 21
complex inheritance pattern, although most times it apparently 22
10 Eye colour variability is one of the most visible manifestations behaves as a simple Mendelian character. This is so because one 23
11 of individual physical variation in people of Caucasian descent. The genetic region, HERC2/OCA2, accounts for the majority of the blue 24
12 pigmentation of the iris is determined by the amount of melanin and brown variation (the two most common colours) in human 25
13 present in the anterior layer of the iris stroma [1]. Eye colour is thus eye colour [2,3]. 26
14 a quantitative character dependent on melanin content, with a Although genetic association studies had already identified the 27
15 gradient of different colours ranging from light blue (very low relationship between the OCA2 gene and eye colour [2,4], it was not 28
16 melanin content) to dark brown (high melanin content). However, until 2008 when three independent studies found that HERC2, a 29
17 in almost all cases this quantitative variability can be unequivo- neighbouring gene, and specifically the SNP rs12913832, was the 30
18 cally reduced to three broad qualitative categories naturally key human eye colour regulator [5–7]. In addition, other genes 31
19 perceived by the human eye: blue/grey, green/hazel (also called such as SLC24A4, SLC45A2, TYR, TYRP1, ASIP or IRF4 have also been 32
20 intermediate) and brown/black. recognised to contribute to eye colour variation, although to a 33
much lesser extent [8–10]. 34
More recently, investigations have focused on the development 35
of a forensic genetic test to accurately predict eye colour from an 36
unknown or anonymous DNA sample [11–13]. Although these 37
* Corresponding author at: Medical Oncology and Haematology Unit, Health
studies provide slightly different recommendations for accurate 38
Q2 Research Institute – INCLIVA, Av. Blasco Iba´n˜ez, 17, 46010 Valencia, Spain.
eye colour prediction, all suggest that the genotyping of a small 39
Tel.: +34 963862894.
number of SNPs from a handful of pigmentation-associated genes 40
E-mail address: [email protected] (G. Ribas).
1
These authors contributed equally to this work. can predict blue/brown eye colour with more than 90% accuracy. 41
1872-4973/$ – see front matter ß 2013 Published by Elsevier Ireland Ltd.
http://dx.doi.org/10.1016/j.fsigen.2013.03.007
Please cite this article in press as: C. Martinez-Cadenas, et al., Gender is a major factor explaining discrepancies in eye colour prediction
based on HERC2/OCA2 genotype and the IrisPlex model, Forensic Sci. Int. Genet. (2013), http://dx.doi.org/10.1016/j.fsigen.2013.03.007
G Model
FSIGEN 974 1–8
2 C. Martinez-Cadenas et al. / Forensic Science International: Genetics xxx (2013) xxx–xxx
42 Significantly, intermediate eye colours such as green or hazel still HERC2/OCA2 region), rs1800407 (OCA2 gene), rs12896399 86
43 remain quite difficult to predict [8]. (SLC24A4 gene), rs16891982 (SLC45A2 gene), rs1393350 (TYR 87
44 In this work, as in Walsh et al. [13], we have genotyped six SNPs gene), and rs12203592 (IRF4 gene). These six SNPs have been used 88
45 associated with eye colour (IrisPlex) in 535 people from Spain, a in previous studies, and have already demonstrated their 89
46 Mediterranean population. Aside from different SNP frequencies in suitability for predicting eye colour in Caucasian populations 90
47 Southern European populations when compared to Northern [12–16]. Public databases such as NCBI (http://www.ncbi.nlm. 91
48 European ones, results for eye colour prediction are similar to other nih.gov) and Ensembl (http://www.ensembl.org) were used to 92
49 studies. However, we have found a very strong association collect information about SNPs and genes. A north European 93
50 between gender and eye colour prediction. When comparing population (CEU) and a southern one from Tuscany, Italy (TSI) 94
51 similar eye colour genetic profiles, females tend, as a whole, to available from the HapMap project (phase 3), were used to get all 95
52 have darker eyes than males (and, conversely, males lighter than SNP minor allele frequencies, as well as the raw genotypes. Details 96
53 females). This significant gender difference would suggest that of gene names, gene locations, nucleotide changes, minor allele 97
54 there is an as yet unidentified factor contributing to human eye frequencies, SNP assay number and eye colour associated with the 98
55 colour variation. less frequent allele are provided in Table 1 and in Supplementary 99
Table S2.Supplementary data associated with this article can be 100
56 2. Materials and methods found, in the online version, at http://dx.doi.org/10.1016/j.fsigen. 101
2013.03.007. 102
57 2.1. Study subjects and data collection
2.3. Genotyping assays 103
58 A total of 535 individuals were included in this study. All
59 samples belonged to healthy control subjects of Spanish origin Genotyping assays were carried out using Kaspar technology 104
60 drawn from various skin cancer studies, and were collected at (KBiosciences, Hoddesdon, UK). PCRs were performed in a total 105
61 several Spanish hospitals from the provinces of Madrid, Valencia reaction volume of 4 ml containing about 10 ng of genomic DNA, a 106
62 and Castellon (Supplementary Table S1). All participants provided final concentration of 4 New Kaspar Reaction Mix and 12 mm of 107
63 written informed consent, and the study was approved by the each Kaspar primer. For SLC45A2 SNP rs16891982, TaqMan 108
64 Ethics Committee of INCLIVA Health Research Institute, Valencia, technology was used (Applied Biosystems, Foster City, USA). 109
65 Spain.Supplementary data associated with this article can be PCR conditions varied depending on the requirements of each 110
66 found, in the online version, at http://dx.doi.org/10.1016/j.fsi- probe, always according to the manufacturer’s indications. The 111
67 gen.2013.03.007. genotype of each sample was determined by measuring final 112
68 A standardised questionnaire was used to collect information allele-specific fluorescence in the ABI Prism 7900HT Detection 113
69 on pigmentation characteristics, as well as personal and family System, using SDS v2.3 software for allelic discrimination (Applied 114
70 history of relevant diseases. For the purpose of this study we took Biosystems, Foster City, USA). 115
71 into consideration age, gender and eye colour. Eye colour was As a quality control measure, we included one no-template 116
72 determined by especially trained personnel. All eye colours were sample and one sample duplicate in each 96-well plate (a total of 117
73 grouped in just three functional categories: brown/black, hazel/ four per 384-well plate used). Genotypes were provided automat- 118
74 green (or intermediate), and blue/grey. ically by the software and were confirmed manually in the 119
75 Genomic DNA from individuals was isolated from peripheral laboratory by two different personnel. Additionally, all apparent 120
76 blood lymphocytes and diluted to a final solution of 50 ng/ml. This discrepancies between genotype and phenotype were genotyped 121
77 was done using the traditional saline method or the DNAzol twice in independent experiments in order to prevent potential 122
78 procedure (Invitrogen, Eugene, OR, USA). DNA concentration was genotyping errors. Only individuals without missing genotypes 123
79 quantified in samples prior to genotyping by using Quant-iT (493) were considered for further analyses. 124
80 PicoGreen dsDNA Reagent (Invitrogen, Eugene, OR, USA). Further
81 DNA concentration measurements were obtained using the 2.4. Prediction model 125
82 Nanodrop 2000 spectrophotometer.
Liu and colleagues [8] have previously published the IrisPlex 126
83 2.2. SNP selection algorithm for eye colour prediction used in this and previous 127
studies [13]. It is based on a multinomial logistic regression model. 128
84 We genotyped the six SNPs included in the IrisPlex eye The probabilities of each individual for being brown-eyed (p1), 129
85 colour panel [8,13]. These comprise rs12913832 (located in the blue-eyed (p2), and otherwise (p3) were calculated based on the 130
Table 1
Q5 Allelic distributions in different European populations.
Gene SNP Minor Minor allele’s Ancestral MAF MAF Spanish p-Value CEU p-Value TSI
allele colour prediction allele HapMap_CEU HapMap_TSI population
pHWE MAF
b
HERC2/OCA2 rs12913832 T Brown T 0.21 0.59 0.21 0.63 5.68E-39 0.29
OCA2 rs1800407 T Blue C 0.08 0.10 0.99 0.10 0.25 0.87
b
SLC24A4 rs12896399 G Brown G 0.43 0.63 0.012 0.66 1.02E-12 0.52
SLC45A2 rs16891982 C Brown C 0.02 – 0.94 0.16 5.62E-07 –
TYR rs1393350 T Blue C 0.22 0.25 0.94 0.23 0.66 0.49
IRF4 rs12203592 A Blue G 0.15 0.09 0.05 0.14 0.43 0.09
a
MAF, minor allele frequency (referenced to HapMap_CEU minor allele).
b
Alleles with higher frequencies in the Italian and the Spanish population, becoming the major allele in both.
c
pHWE p-value for Pearson’s goodness of fit test for deviation from Hardy–Weinberg equilibrium. p-value CEU and p-value TSI refer to the p-values of a Pearson’s goodness of
fit test comparing the Spanish minor allele frequency obtained from our samples with HapMap_CEU and HapMap_TSI frequencies respectively.
d
Bold in p-values denotes statistically significant results. Bold in minor alleles denotes alleles that become the major allele in our population.
Please cite this article in press as: C. Martinez-Cadenas, et al., Gender is a major factor explaining discrepancies in eye colour prediction
based on HERC2/OCA2 genotype and the IrisPlex model, Forensic Sci. Int. Genet. (2013), http://dx.doi.org/10.1016/j.fsigen.2013.03.007
G Model
FSIGEN 974 1–8
C. Martinez-Cadenas et al. / Forensic Science International: Genetics xxx (2013) xxx–xxx 3
131 sample genotypes, being allocated different weights depending including them all in a multivariate unconditional logistic 193
132 on whether an individual has two, one or no minor alleles. regression analysis. 194
133 The probabilities were calculated according to the following A meta-analysis was performed to compare our results to 195
134 formulas: previously published data and check the robustness of our results. 196
135 P We searched for Caucasian populations in the literature containing 197
Pexpða1 þ bðp1ÞKxKÞ P data on both eye colour (classified in three categories that could be 198
p1 ¼
1 þ expða1 bðp1ÞKxKÞ þ expða2 þ bðp2ÞKXKÞ grouped in blue/non-blue eyes) and gender [10,17,18]. We 199
conducted the meta-analysis on populations from Iceland, 200
136137
P Holland, Australia and Poland along with the Spanish population 201
138 expða2 þ bðp2Þ xKÞ
P K P
p2 ¼ analysed in this study, by applying a Cochran–Mantel–Haenszel 202
1 þ expða1 bðp1Þ xKÞ þ expða2 þ bðp2Þ xKÞ
K K test to the analyses performed on the total number of samples from 203
all different populations, as previously described [19,20]. We set 204
139140
being male as a suspected risk factor for having blue eyes. 205
141 p3 ¼ 1 p1 p2
To evaluate differences in prediction accuracy between 206
142143 where xk is the number of minor alleles of the kth SNP. The model’s genders, we performed a 2-by-2 contingency table and calculated 207
144 parameters, alpha and beta, were derived based on the genotypes Chi-square values, p-values and associated odds ratios. We first 208
145 of 3804 Dutch individuals in the model-building set of the previous considered the genotype of the HERC2/OCA2 SNP and then we 209
146 study, and can be found in the same publication. These added the remaining IrisPlex SNPs and prediction algorithm. All 210
147 probabilities can be calculated using the macro provided in the samples were assembled in four different groups depending on 211
148 supplementary material of Walsh et al. [13]. This prediction model gender and eye colour prediction: males correctly predicted, males 212
149 classifies each individual as having brown, blue or an intermediate wrongly predicted, females correctly predicted and females 213
150 eye colour. Brown colour is considered when, given a specific wrongly predicted. A sub-classification within the wrongly 214
151 IrisPlex SNP combination, the probability of being brown-eyed is predicted groups was established based on whether the predicted 215
152 higher than 0.5. Blue colour is predicted only when the probability eye colour was darker or lighter than the real colour. The groups 216
153 of being blue-eyed is higher than 0.7. When, according to this six obtained are described in Table 4. 217
154 SNP genotype, the probability of being brown-eyed is less than 0.5
155 and, at the same time, the probability of being blue-eyed is lower 3. Results 218
156 than 0.7, the colour is classified as intermediate. The prediction
157 relevance rank by SNP, in descending order of importance, was the 3.1. SNP allele distributions 219
158 following: rs12912832 first (most important), rs1800407 second,
159 12896399 third, rs16891982 fourth, rs1393350 fifth and finally Minor allele frequencies for each SNP in three different 220
160 rs12203592 sixth. Those individuals in which at least one SNP populations (HapMap_CEU and TSI, and Spanish) and p-values 221
161 failed to be genotyped were excluded from this part of the study, for the comparison between them and the Spanish population are 222
162 since it was not possible to estimate eye colour probabilities, detailed in Table 1, along with p-values of test of departure from 223
163 therefore a total of 493 individuals remained for subsequent Hardy–Weinberg equilibrium among our population. All polymor- 224
164 analyses. phism complied with Hardy–Weinberg equilibrium except SNP 225
165 We assessed the accuracy of the prediction model by comparing rs12896399 located within the SLC24A4 gene. 226
166 the predicted eye colour with the real one, and then all samples We first observed that in both rs12913832 and rs12896399, the 227
167 were classified, for further analyses, as correctly predicted, darker minor allele in the HapMap_CEU population becomes the major 228
168 than predicted, or lighter than predicted. allele in both the Spanish and the HapMap_TSI populations. We 229
were not able to test the difference between the Spanish and the 230
169 2.5. Statistical analysis Italian populations for SNP rs16891892, since this data is not 231
available in HapMap. 232
170 SPSS v19 was used to carry out the analyses. Bonferroni’s Minor allele differences in the Spanish population differ 233
171 correction was implemented as the method of adjustment for significantly from HapMap_CEU frequencies in three SNPs: 234
172 multiple comparisons. All p-values were two-sided and those rs12913832 (HERC2/OCA2), rs12896399 (SLC24A4) and 235
173 lower than 0.008 were considered statistically significant. rs16891982 (SLC45A2). We observed that frequencies in our 236
174 For all polymorphisms studied, Fisher’s exact test was used population were more similar to the ones in the HapMap_TSI 237
175 both to test for deviations from Hardy–Weinberg equilibrium population (Table 1). 238
176 (HWE) among the population sample and to compare differences
177 in the minor allele frequency (MAF) distributions between cases 3.2. Phenotype and genotype distribution 239
178 and controls. HWE was rejected when the p-value was lower than
179 0.008 after Bonferroni’s correction. Our data was compared with We classified all participants into three eye colour groups: 240
180 raw data obtained from HapMap_CEU and HapMap_TSI using brown, blue/grey and intermediate (intermediate phenotypes 241
181 unconditional logistic regression, in order to evaluate minor allele ranging from light green to light brown/hazel with shades of green). 242
182 frequency differences between of our Spanish population and The Spanish population predominantly possesses brown eyes 243
183 those available in HapMap. (75.47%), while blue-eyed people comprise 11.32% of the population, 244
184 Several analyses were performed to assess associations and there is a relatively high percentage (13.21%) of people with 245
185 between genotypes and eye colour, and to evaluate the weight intermediate phenotypes (Fig. 1). When eye colour percentages 246
186 of each SNP, simultaneously allowing for various confounding are separated according to gender, percentages turn out to be 247
187 factors. Genotype-related odds ratios (ORs) using the co-dominant different between males and females (Table 2). The percentage of 248
188 model, their corresponding 95% confidence intervals (CIs) and blue-eyed females (8.5%) was notably lower than the one of blue- 249
189 associated p-values were estimated via unconditional logistic eyed males (14.71%) in the Spanish population, whereas brown- 250
190 regression. This was done for each SNP as well as for gender. We eyed females (78.45%) were considerably more common than males 251
191 also obtained results adjusted by possible confounding effects (71.43%). These differences presented statistically significant results 252
192 between SNPs or with phenotypic factors like gender after (OR 1.34, 95% CI: 1.04–1.73, p-value 0.024) (Table 2). 253
Please cite this article in press as: C. Martinez-Cadenas, et al., Gender is a major factor explaining discrepancies in eye colour prediction
based on HERC2/OCA2 genotype and the IrisPlex model, Forensic Sci. Int. Genet. (2013), http://dx.doi.org/10.1016/j.fsigen.2013.03.007
G Model
FSIGEN 974 1–8
4 C. Martinez-Cadenas et al. / Forensic Science International: Genetics xxx (2013) xxx–xxx
performed an alternative analysis excluding the Australian data 271
from the analysis (including only Iceland, Holland, Poland and 272