1

2 DR THAISE MELO (Orcid ID : 0000-0003-0983-4602)

3 DR MARINA R. S. FORTES (Orcid ID : 0000-0002-7254-1960)

4

5

6 Article type : Original Article

7

8

9 Short Running Title: Across-breed QTL validation for sexual precocity in tropical cattle 10 Title: ACROSS-BREED VALIDATION STUDY CONFIRMS AND IDENTIFIES 11 NEW LOCI ASSOCIATED WITH SEXUAL PRECOCITY IN BRAHMAN AND 12 NELLORE CATTLE1 13 Thaise Pinto de Melo*, Marina Rufino Salinas Fortes†‡, Ben Hayes‡, Lucia Galvão de 14 Albuquerque*§, Roberto Carvalheiro*§ 15 16 *Department of Animal Science, School of Agricultural and Veterinarian Sciences, 17 FCAV/ UNESP - Sao Paulo State University, Jaboticabal, Sao Paulo, 14884-900, 18 Brazil. 19 †The University of Queensland, School of Chemistry and Molecular Biosciences, St 20 Lucia, Queensland 4072, Australia. 21 ‡The University of Queensland, Queensland Alliance for Agriculture and Food 22 Innovation, St Lucia, Queensland 4072, Australia. 23 §National Council for Scientific and Technological Development (CNPq), Brasília, 24 Distrito Federal, Brazil. 25 Corresponding author: Roberto Carvalheiro, School of Agricultural and Veterinarian 26 Sciences, FCAV/ UNESP - Sao Paulo State University, Jaboticabal, Brazil. Email:

27 [email protected] Manuscript 28

This is the author manuscript accepted for publication and has undergone full peer review but has not been through the copyediting, typesetting, pagination and proofreading process, which may lead to differences between this version and the Version of Record. Please cite this article as doi: 10.1111/JBG.12429

This article is protected by copyright. All rights reserved 29 ABSTRACT: The aim of this study was to identify candidate regions associated with 30 sexual precocity in Bos indicus. Nellore and Brahman were set as validation and 31 discovery populations, respectively. SNP selected in Brahman to validate in Nellore 32 were from: regions affecting reproductive traits (G1), and significant SNP (P ≤ 10- 33 3) from a meta-analysis (G2). In the validation population early pregnancy (EP) and 34 scrotal circumference (SC) were evaluated. To perform GWAS in validation population 35 we used regression and Bayes C. SNP with P ≤ 10-3 in regression and Bayes Factor ≥ 3 36 in Bayes C were deemed significant. Significant SNP (for EP or SC) or SNP in their ± 37 250 Kb vicinity region, which were in at least one discovery set (G1 or G2) were 38 considered validated. SNP identified in both G1 and G2 were considered candidate. For 39 EP 145 SNP were validated in G1 and 41 in G2, for SC these numbers were 14 and 2. 40 For EP 21 candidate SNP were detected (G1 and G2). For SC no candidate SNP were 41 identified. Validated SNP and their vicinity region were located close to QTL or 42 related to reproductive traits and were enriched in terms related with 43 reproductive success. These are therefore, strong candidate regions for sexual precocity 44 in Nellore and Brahman. 45 Keywords: Bos indicus, discovery population, reproductive traits, SNP validation, 46 tropical beef cattle 47 INTRODUCTION 48 49 Across-breed validation studies are commonly used to validate quantitative trait 50 loci (QTL) for several traits. In Genome-wide association studies (GWAS) those 51 significant markers under an empirical P-value for the same or correlated traits in 52 different breeds are likely tagging QTL that segregate across-breeds. These QTLs may 53 harbor some important genes affecting both populations. Karlsson et al. (2007) used this 54 approach to validate single nucleotide polymorphisms (SNP) markers in different dog 55 breeds. They notice that this strategy was highly efficient to fine-mapping across 56 breeds. 57 The probability of finding common QTL for correlated traits in breeds that share 58 common ancestry is expected to be higher than in breeds with very distinct genetic Author Manuscript 59 origin. This is because breeds that do not share a recent common ancestor are more 60 genetically distinct from each other. Genetic differences increase with distance to 61 common ancestors. Distinct genetic origin affects the linkage disequilibrium pattern 62 between SNPs at long-ranges, and by consequence it affects QTL mapping (Goddard &

This article is protected by copyright. All rights reserved 63 Hayes, 2009). As Nellore and Brahman are both Bos indicus breeds, and Brahman was 64 originally developed by three base breeds, Gir, Guzerat and Nellore (Briggs & Briggs, 65 1980), the likelihood of both breeds sharing QTL controlling correlated traits is higher 66 than in unrelated breeds. 67 Several strategies have been used to conduct across-breed validation studies. 68 Pryce et al. (2010) used two dairy breeds to validate QTL for milk production and 69 fertility traits. They distributed Holstein bulls in a discovery population and younger 70 Holstein bulls and Jersey bulls in a validation population. SNP that were detected as 71 significant at an empirical threshold P-value in discovery and validation populations 72 were considered validated. Also validating fertility traits in dairy cattle, Höglund et al. 73 (2014) used three breeds to validate genomic associations. They used one breed as 74 discovery population and the other two breeds as validation populations. They argued 75 that using two populations simultaneously to validate significant associations is a 76 powerful strategy to decrease the risk of false positive association. 77 Genic regions are strong candidate regions to present QTL segregating across 78 related breeds because it is expected that the metabolic pathways in which these genes 79 are involved are conserved across breeds. Also, regions with pleiotropic effect across 80 related traits could result in higher number of true positive validated associations across 81 breeds, because genes that are controlling multiple traits in a breed might preserve 82 similar pattern of pleiotropic effect in another related breed (Saatchi et al., 2014). 83 The aim of this study was to validate in a Nellore population genomic regions 84 associated with sexual precocity that were reported for Brahman. We used as discovery 85 data two SNP sets pre-selected in a Brahman population, 1) from gene regions 86 previously reported as significant for reproductive traits, and 2) from significant 87 associations detected in a meta-analysis study of sexual precocity traits. 88 89 MATERIAL AND METHODS 90 Ethics Statement 91 All managements and procedures involving production, maintenance and use of 92 Nellore animals were certified and approved by the National Council of Animal Author Manuscript 93 Experimentation Control (CONCEA, 2008) and Use Committee at University of Sao 94 Paulo, Jaboticabal Campus (18.340/16). Regarding Brahman animals, Animal care and 95 Use committee approval was not required because the data is from existing databases 96 described in the following section.

This article is protected by copyright. All rights reserved 97 98 Discovery population 99 The discovery population was composed by Brahman animals. Phenotypes were 100 provided by Cooperative Research Centre for Beef Genetic Technologies (Beef CRC). 101 Brahman phenotypes included the female traits age when the first corpus luteum (CL) 102 was observed (AGECL), first postpartum anoestrus interval (PPAI), ability to ovulate 103 prior to weaning the calf (PW), and the male traits scrotal circumference (SC) measured

104 at 12, 18 and 24 months of age (SC12, SC18, SC24). 105 The AGECL was defined as the number of days from the heifer birth to the first 106 CL detected. PPAI, measured in days, was calculated as the difference between the 107 calving date and the date of the first observed ovulation postpartum. PW, a binary trait 108 was defined as 0 for females that had success to ovulate before weaning her calf or 1 for 109 those females that failed. For all female traits ovarian ultrasounds were carried out to 110 verify the presence of CL that is an indicator of the ovulation, at every 4 to 6 weeks 111 after heifers achieved 200 Kg of weight. 112 Scrotal circumference was measured in cm, with a standard metal tape. For 113 females and males, contemporary groups (CG) were defined by the concatenation year 114 of birth and management group information (defined as cohorts). The age of young 115 bulls at recording was considered a covariate for SC. Details about animals, cohorts and 116 phenotypes are described in Johnston et al. (2009), Johnston et al. (2010), Burns et al. 117 (2013), Corbet et al. (2013) and Fortes et al. (2018). 118 Animals were genotyped with the Illumina BovineSNP50 V1 and V2. Genotypes 119 were imputed for high-density panel using Beagle software v.3.2 (Browning & 120 Browning, 2009) and a reference population of representative animals of the Beef CRC 121 population genotyped using the high-density Illumina Bovine HD Assay (Illumina, San 122 Diego, CA, USA), as described by Fortes et al. (2013a). Quality control excluded 123 samples with call rate < 98%, SNP in non-autosomal regions, with call rate < 85% and 124 minor allele frequency (MAF) < 0.02. The number of SNP after quality control was 125 625,041 for females and 612,992 for males. Details about genotypes and imputation are 126 described in Fortes et al. (2013a). Author Manuscript 127 128 Validation population 129 Data from Nellore animals were used as validation population. Phenotypic 130 information was obtained from Alliance Nellore dataset. The animals considered in this

This article is protected by copyright. All rights reserved 131 study were born in eight farms distributed over Midwest, Southeast and Northeast of 132 Brazil. In general, two breeding seasons are applied during the year, where the females 133 are either artificially inseminated or naturally mated. The heifers are exposed in the 134 early breeding season at around 16 months of age. After 60 days of the early breeding 135 season, pregnancy is confirmed and those females that failed in conceiving in the first 136 breeding season had a second opportunity at around 2 years old. 137 Nellore phenotypes used here are early pregnancy (EP) and scrotal circumference

138 (SCN). The EP, a binary trait, assumed the value of 2 for heifers that had success in 139 calving before 31 months of age; and 1 for heifers that failed in calving before that age. 140 And SC was measured, in cm, from the widest point of the scrotum, at around 16 141 months of age. Details about Nellore animals and phenotypes are described by Costa et 142 al. (2015) and Irano et al. (2016). Summary statistics of all traits for both populations 143 are presented in Table 1.

144 Fixed effects for EP and SCN were concatenated in CG and included information

145 of herd, year and season of birth, weaning and yearling management groups. For SCN, it 146 was excluded phenotypic information that was not in the interval of ± 3 SD from the 147 mean of each CG. Also, CG with less than 3 animals were excluded. For EP, were 148 excluded CG without variability, in which all females had the same categorical

149 response, and CG with less than four animals. For SCN, the age of the animal at the 150 recording was included as a covariate in the model. It was considered in the analysis 151 only animals with age ranging from 10 to 24 months. A total of 20 animals that 152 presented ages out of this interval were removed. 153 As Nellore dataset presented both genotyped and non-genotyped animals, for this 154 population phenotypes were pre-corrected for the fixed effect CG to avoid biased fixed 155 effect estimates. A regular mixed animal model (Henderson, 1984) was used to pre- 156 correct phenotypes. Pre-corrected EP was expressed in a continuous scale. 157 Markers were genotyped using the high-density Illumina Bovine HD Assay 158 (Illumina, San Diego, CA, USA) and GeneSeek Genomic Profiler Indicus HD - 159 GGP75Ki (Neogen Corporation, Lincoln, NE, USA), which contain 777,962 and 74,677 160 SNP, respectively. Imputation of the smaller panel (GGP75Ki) to the high-density panel Author Manuscript 161 was performed by using the software FImpute v.2.2 (Sargolzaei et al., 2014). It were 162 excluded animals with call rate < 0.90, SNP with MAF < 0.01, call rate < 0.95, Hardy- 163 Weinberg equilibrium test P-value < 10-5 and in non-autosomal regions. After quality

This article is protected by copyright. All rights reserved 164 control, remained 1,796 females genotyped for 412,876 SNP and 4,261 males 165 genotyped for 512,063 SNP. 166 167 Across-breed Validation 168 To select significant SNP from the discovery population two selection approaches 169 were adopted: 1) SNP in gene regions that were previously associated with sexual 170 precocity traits in a Brahman population (G1), or 2) significant SNP from two 171 independent meta-analyses, one for female and other for male traits, measured in the 172 same Brahman population of G1 (G2). Details about these meta-analyses are further 173 described. 174 To select SNP in genic regions (G1), it was firstly verified which genes were 175 associated with sexual precocity traits in Brahman cattle. These genes were pre-selected 176 from previous studies using the same Brahman population used here (Fortes et al., 177 2012a; Fortes et al., 2012b; Porto-Neto et al., 2015; Fortes et al., 2016; Nguyen et al., 178 2017). Those studies were either GWAS or transcriptional gene expression studies. SNP 179 inside genes or that were located in a region of 250 Kb upstream or downstream from 180 genes were selected for the discovery dataset. A total of 2,753 genes located in 181 autosome were described in those papers. These genes were mapped by 182 using the Biomart R package (Durinck et al., 2005, 2009). 183 For selecting SNP for G2, it was verified which SNP presented significant 184 association (P-value ≤ 10-3) in both meta-analyses for female (AGECL, PPAI and PW)

185 and male (SC12, SC18, SC24). 186 A total of 30 SNP were in both SNP sets, i.e., in genic regions and in significant 187 regions of the meta-analysis study for female traits. For male traits, just one SNP was in 188 common between these two SNP sets. 189 A genome mixed model proposed by Kang et al. (2010) was used to estimate SNP 190 effects and their standard errors used in meta-analyses, as follows: 191 = + + + , (1) 192 where y is the vector of Brahman phenotypes, AGECL, PPAI, PW, SC , SC , or SC , � �� �� �� � 12 18 24 193 X is an incidence matrix relating CG fixed effects in β with the phenotypes in y, and the Author Manuscript 194 age of the young bull at the measurement as a covariate for SC, s is a vector with 195 genotypes coded as 0, 1 or 2 according to the number of B allele copies, is a vector 196 containing allelic substitution effects, Z is the incidence matrix of polygenic random � 197 effects of the animals in u and ε is the vector of residuals. Vectors u and ε followed

This article is protected by copyright. All rights reserved 2 2 198 normal distribution with u~N(0, Gσ a) and ε~N(0, Iσ e), respectively, and G is the 199 genomic relationship matrix, that relates all individuals, calculated as in the first method

2 200 described in VanRaden (2008), σ a is the additive genetic variance, I is an identity 2 201 matrix and σ e is the residual variance. SNP effects were calculated by using the SNP & 202 Variation Suite (SVS) software (Release 8.3.0, Golden Helix, Inc., 2014). 203 A multi-trait approach described by Bolormaa et al. (2014) was used to perform 204 the meta-analyses. This method is a statistic test that follows a χ2 distribution with n 205 degrees of freedom, and n is the number of traits considered in meta-analysis. The χ2 206 statistic is calculated as follows:

2 -1 207 Multi-trait χ = t’i V ti, (2) -1 208 where t’i is the transpose vector of ti, and V is a 3 x 3 inverse matrix of the correlation 209 between the t-values of the three traits for female and male across all SNP markers in 210 common among these traits. The vector containing the t-values of the ith SNP effects

211 estimated by each independent GWAS (ti) was computed as:

212 ti=ai /SE(ai), (3)

213 where SE (ai) is the standard error of the SNP effect vector ai. 214 SNP with an empirical P-value ≤ 10-3 were considered significant in the meta- 215 analysis of the discovery population. 216 217 Statistical Methods 218 To perform GWAS in Nellore validation population (700 K panel) two statistical 219 methods were used, regression (Zhang et al., 2010) and Bayes C (Habier et al., 2011), 220 which model was implemented as:

221 = + = 1 + , (4) � 222 Where y is the vector � containing1� ∑� pre-corrected������ � phenotypes (corrected as

223 previously described) for Nellore traits, EP and SCN, 1 is a vector of ones, μ is the

224 overall mean, gi is the vector with the genotypes of the animals (coded as 0, 1 or 2, th 225 according to the number of B allele copies) for the i SNP effect in bi, and δi is an 226 indicator variable, which takes value 0 or 1, and is sampled from a binomial distribution

227 with parametersAuthor Manuscript n and π, where n is the total number of SNP and π is the proportion of 228 SNP with null effect in the model. It was assumed that π followed a prior beta 229 distribution with parameters α = 108 and β = 1010, which in practice is equivalent to fix

This article is protected by copyright. All rights reserved 2 230 π = 0.99 (Legarra et al., 2014). For the variance of SNP effects (σ g) and the residual 2 231 variance (σ e) it was assumed a scaled inverse chi-squared prior distribution. 232 Bayes C analyses were performed by using GS3 software (Legarra et al., 2014). It 233 was ran a single Markov chain Monte Carlo with 500,000 iterations, a burn-in period of 234 50,000, and a thinning interval of 50 iterations. 235 Model used in regression analyses was similar to that presented in Equation 1, 236 except that the vector containing the fixed effects and its incidence matrix ( were 237 replaced by a vector of ones and the mean ( ), with the same assumption for the ��) 238 distribution of random effects described in Equation 1. This model was implemented in 1� 239 the GAPIT software (Zhang et al., 2010), and consists in a regular mixed linear model 240 with a genomic relationship matrix. 241 242 SNP validation criteria 243 The validation criteria for SNPs followed 5 steps: 244 1) Verify which SNPs were significant from regression analysis (P ≤ 10-3); 245 2) Verify which SNPs were significant from Bayes Cpi analysis (Bayes Factor ≥ 3); 246 3) Select corroborate SNPs in common between regression and Bayes Cpi methods; 247 4) Identify which SNPs selected in step 3 were located in genic regions (± 250 Kb) 248 associated with sexual precocity in the Brahman population studied here; 249 5) Identify which SNPs selected in step 3 were significant in the meta-analysis with 250 Brahman traits (or were in ± 250 Kb away). 251 Steps 1 and 2 were performed in the validation population, i.e., in Nellore, while 252 steps 4 and 5 were performed in Brahman, that was the discovery population. SNPs 253 selected in steps 4 and 5 were validated for G1 and G2, respectively. SNPs in common 254 between G1 and G2 were considered strong candidates. These steps are summarized in 255 Figure 1. 256 Bayes Factor was calculated as in Varona et al. (2001):

(1 ) 257 � = ― � (5) (1 ) � �� ― � 258 where p is a posteriori probability of an SNP present a non-zero effect and π is a priori Author Manuscript 259 probability of an SNP to be included in analysis. 260 261 Gene Ontology Analysis

This article is protected by copyright. All rights reserved 262 The region of ± 250 Kb surrounding the validated SNP was scanned to find 263 neighbour genes, which were used to perform the gene ontology analysis using David 264 software (v.6.7) (Huang et al., 2009a; 2009b). G1 and G2 genes were analysed together

265 for EP and SCN. There were a total of 579 genes for EP and 110 genes for SC (G1 and 266 G2), from which 467 and 104 genes were recognized by David software for EP and

267 SCN, respectively. 268 269 RESULTS 270 The total number of significant SNP detected in meta-analyses was 747 SNP for 271 female traits and 10 SNP for male traits (P-value ≤ 0.001). This large difference may be 272 explained because the most significant SNP for SC in Brahman were located on 273 X, which was removed in the current analysis. 274 Regression and Bayesian methods identified some of the same SNP as significant:

275 a total of 302 SNP for EP and 40 SNP for SCN were in common for both methods. For

276 EP a total of 145 SNP were validated in G1 and 41 SNP were validated in G2. For SCN 277 a total of 14 and 2 SNP were validated in G1 and G2 respectively, under a P-value ≤ 10- 278 3. 279 The low number of validated SNP for SC in G2 may be explained due to most of 280 the significant SNP for this trait in Brahman being located on X chromosome for all 281 evaluated ages, which was not observed for Nellore (data not shown). 282 283 Validated and candidate SNP for EP 284 A total of 145 SNP were validated for EP in G1. They were distributed over 285 chromosomes 2, 3, 4, 5, 6, 7, 8, 11, 13, 14, 15, 16, 17, 18, 19, 21, 22, 23, 24 and 27. 286 Most of them were located on BTA3 (14.5%). For G2, validated SNP were located on 287 chromosomes 6, 8, 14, 15, 17, 21 and 24, and most of them (46.3%) were on BTA21. 288 Table 2 presents the most significant validated SNP for each chromosome for EP in G1 289 and G2. The complete list of validated and candidate SNP for EP are presented in 290 Supplementary Table 1. 291 A total of 21 candidate SNP were detected for female sexual precocity, that is Author Manuscript 292 SNP validated for both, G1 and G2. These SNP were located on chromosomes 6, 8, 14, 293 15, 21 and 24, and most of them were on BTA6 (38%). Table 3 presents the most 294 significant candidate SNP by chromosome for EP.

This article is protected by copyright. All rights reserved 295 These results showed that even though the SNP that were validated for Nellore 296 were not exactly the same that were important for Brahman, they were located in the 297 vicinity (± 250 Kb) of the important SNP for Brahman, which could be explained by the 298 difference of linkage disequilibrium among breeds. 299 Comparing GWAS results individually, the majority of candidate SNP for EP (in 300 Nellore) was generally significant for AGECL in Brahman, but was not significant for 301 PW or PPAI, which suggests that EP and AGECL should be more correlated than EP 302 and PPAI or PW. This is reasonable, as both traits are associated with the age of sexual 303 maturity. 304 The 21 candidate SNP for EP identified here are plausible candidates to be 305 affecting the sexual precocity in Nellore and Brahman populations, and are likely in LD 306 with a QTL segregating in both breeds. 307

308 Validated and candidate SNP for SCN 309 The most significant SNP (P-value ≤ 10-3) for each chromosome validated in G1

310 and G2 for SCN were distributed over chromosomes 11, 12, 15, 16, 17, 19, 23 and 29,

311 (Table 4). The complete list of validated SNP for SCN are presented in Supplementary 312 Table 2. 313 No candidate SNP, i.e., SNP in common between G1 and G2, were found for

314 SCN, mainly due to the low number of validated SNP that were found in G2 for SCN. 315 316 Gene ontology analysis 317 A total of 2 significant gene ontology terms (P ≤ 10-3) were enriched for EP 318 genes: IPR:018363 ~ Conserved Site: CD59 Antigen (CD177, LYPD3, LYPD5, PLAUR, 319 TEX101) and IPR:003006 ~ Conserved Site: Immunoglobulin/major histocompatibility 320 complex, conserved site (ENSBTAG00000005146, AGER, BCAN¸ 321 ENSBTAG00000038619, HLA-DMA, BOLA-DMB, ENSBTAG00000002581.

-3 322 SCN genes were enriched in 5 gene ontology terms (P ≤ 10 ): GO:0042157 ~ 323 Biological Process: Lipoprotein metabolic process (NMT1, APOA1, APOA4, APOA5, 324 APOC3), GO:0008289 ~ Molecular Function: Lipid binding (ACBD4, APOA1, APOA4, Author Manuscript 325 APOA5, APOC3, HIP1R, VPS36), GO:0034369 ~ Biological Process: Plasma 326 lipoprotein particle remodelling (APOA4, APOA5, APOC3), GO:0034367 ~ Biological 327 process: Macromolecular complex remodelling (APOA4, APOA5, APOC3),

This article is protected by copyright. All rights reserved 328 GO:0034368 ~ Biological Process: -lipid complex remodelling (APOA4, 329 APOA5, APOC3). 330 331 DISCUSSION 332 333 Genes and QTLs in the validated regions 334 According to the reference Brahman studies, in which the genes were collected for 335 selection in G1, some of the candidate genes close to the validated SNP for EP were 336 either differentially expressed in pituitary gland of Brahman heifers, comparing pre- 337 pubertal and pos-pubertal heifers (Nguyen et al., 2017), differentially expressed in 338 pituitary gland and grouped in pathways related to reproduction (Nguyen et al., 2017), 339 differentially expressed in the hypothalamus of pos versus pre pubertal Brahman heifers 340 (Fortes et al., 2016) or differentially expressed in ovary of Brahman females (Nguyen et

341 al., 2017). Candidate genes for SCN close to the validated SNP were either differentially 342 expressed in pituitary gland of Brahman heifers, comparing pre-pubertal and pos- 343 pubertal heifers (Nguyen et al., 2017), differentially expressed in pituitary gland and 344 grouped in pathways related to reproduction (Nguyen et al., 2017) or were differentially 345 expressed in the hypothalamus of pos versus pre pubertal Brahman heifers (Fortes et al., 346 2016). The axis hypothalamus-pituitary gland is responsible to control the production of 347 estrogens hormones in ovaries. These hormones have an important role in the 348 preparation of uterus for the reproduction. 349 Some of the validated SNP for EP were previously associated with other 350 reproductive traits for the same populations studied here. For example, Costa et al. 351 (2015) found SNP associated with age at first calving and heifer rebreeding on 352 chromosome 3 at 6 Mb in Nellore. In chromosome 5 (Table 2) the top SNP 353 “BovineHD0500031072” (107.89 Mb) was located close to a significant signal (108 354 Mb) for the trait inhibin in Brahman bulls (Fortes et al., 2013b). Dias et al. (2015) found 355 three genes related with lipid metabolism that were also associated with sexual 356 precocity in Nellore and were mapped close to significant regions found here: genes 357 UCP-1 (BTA17 at 17 Mb), GH-1 (BTA19 at 48 Mb) and TNF (BTA23 at 27 Mb). On Author Manuscript 358 chromosome 21 the top SNP “BovineHD2100000944” (Table 2) was located in a 359 distance of 395,444 bp away from a SNP that was significant for pregnancy outcome 360 after fixed-time AI of Brahman heifers (Porto-Neto et al., 2015). These authors verified 361 that this SNP was in an intronic region of the gene LRRK1.

This article is protected by copyright. All rights reserved 362 Other populations presented QTL related to sexual precocity in regions close to 363 those reported here (less than 0.5 Mb of distance). Some regions were close to the 364 validated SNP for EP, as Cochran et al. (2013), that found a SNP on BTA3 at 6 Mb 365 associated with daughter pregnancy rate, heifer conception rate and cow conception rate 366 in Holstein. This SNP was mapped into the gene HSD17B7, which was also detected in 367 the genes selected as important for Brahman (Table 2). Also, these authors found on 368 BTA23 at 27 Mb another SNP associated with those traits in the gene NFKBIL1, which 369 is 294.03 bp away from the top SNP found in this region. Buzanskas et al. (2017) found 370 a SNP on BTA5 at 107 Mb associated with scrotal circumference at 420 days of age in 371 Canchim cattle. Oliveira Júnior et al. (2017) studying early pregnancy in Nellore found 372 some genomic 1 Mb windows on BTA18 at 54 - 56 Mb explaining more than 1% of the 373 additive genetic variance for this trait. In Brangus cows Peters et al. (2013) found a SNP 374 associated with first service conception (FSC) on chromosome 19 at 49 Mb, which is 375 close to a significant SNP detected here at 48.7 Mb. Fontanesi et al. (2014) also found 376 on BTA19 at 48.7 Mb markers associated with fertility index in Holstein. 377 Other regions were close to the candidate SNP for EP, as on BTA14 at 16 Mb, 378 where Mota et al. (2017) found a SNP window associated with age at first calving in 379 Nellore. Parker Gaddis et al. (2016) found on at 31-38 Mb an 380 association between some markers in this region and the traits daughter pregnancy rate, 381 cow conception rate and heifer conception rate in Holstein cattle. Also, some regions

382 validated for SCN were reported in other studies, as in McClure et al. (2010) that found 383 QTL associated with scrotal circumference in Angus cattle on chromosome 19 at 45.1 384 Mb, which was close to the SNP BovineHD1900012747 at 45.3 Mb (Table 4). Also, 385 Sahana et al. (2010) found SNP associated with fertility index and days from first to last 386 insemination in Holstein in this genomic region. 387 In Table 2 (EP validated SNP) some genes were previously related to reproductive 388 processes. On BTA2, STAT1 gene is involved in apoptotic process in female granulosa 389 cells (Benifla et al., 2002). Also this gene was reported to be activated during the 390 spermatozoa capacitation and after the fertilization, acting in the male pronucleus 391 (Bastián et al., 2007). The mRNA of MYO1B gene was transcript in man reproductive Author Manuscript 392 tissues which sample sperms failed to promote pregnancy (García-Herrero et al., 2010). 393 This gene expression was abundant in fish eggs after photo stimulation treatment to 394 induce ovulation (Bonnet et al., 2007). Ortega et al. (2016) in a GWAS identify a 395 significant SNP associated with heifer conception rate inside the gene HSD17B7 in

This article is protected by copyright. All rights reserved 396 Holstein cows. This gene, located on BTA3 in bovine, is associated with production of 397 stradiol, an important female hormone, in corpus luteum and with embryo development 398 in mice (Gibori et al., 2009) and was involved in both sex steroid hormones synthesis, 399 in human (Osuch et al., 2012). 400 On BTA4 the gene CREB3L2-201 was expressed in human placenta and in fetal 401 tissues (Storlazzi et al., 2004). On BTA5, the SLC6A13 gene was differentially 402 expressed in mouse blastocysts (Giritharan et al., 2010). This gene was down-regulated 403 in the presence of estrogen in the posterior part of adult female rats hypothalamus (Xu 404 et al., 2008). KDM5A gene is involved in the regulation of spermiogenesis in mice 405 (Lambrot et al., 2008). This gene was identified as a regulator of bovine milk fatty acid 406 content (Pegolo et al., 2017). Boschiero et al. (2013) proposed this gene as a candidate 407 gene associated with fat deposition in chickens. 408 On BTA6 the MARCH1 gene was associated with semen production traits in 409 Chinese Holstein bulls (Liu et al., 2017). Also this gene is associated with the Major 410 Histocompatibility Complex I (MHC I), acting in the process of recognition of antigen 411 presentation (Wilson et al., 2018) and was found in a candidate region affecting female 412 sexual precocity (Table 3). On BTA7, PBX4 gene is located in mouse testis and 413 participates of spermatogenesis (Wagner et al., 2001). NDUFA13 gene was associated 414 with asthenozoospermia, a male pathology that reduces spermatozoa motility in mouse, 415 causing infertility (Yang et al., 2017). This gene was overexpressed in blastocysts of 416 super ovulated heifers (Gad et al., 2011). 417 Transcription of gene NFIB, on BTA8, is essential for lung and brain development 418 in mice embryos (Steele-Perkins et al., 2005). This gene was located in a candidate 419 region for female sexual precocity (Table 3). On BTA13, SNAI1 is essential for the 420 normal cardiovascular system development (Lomelí et al., 2009), and TMEM189 was 421 upregulated in the porcine endometria in days 12 and 16 of gestation versus the other 422 days of estrous cycle (Kiewisz et al., 2014). On BTA14, the ST3GAL1 was expressed in 423 bovine cervical tissue (Pluta et al., 2012). TG gene was identified in several GWAS 424 associated with fat deposition in cattle (Gan et al., 2008; Anton et al., 2011). On 425 BTA15, TAGLN was associated with endometriosis (dos Santos Hidalgo et al., 2011). Author Manuscript 426 This gene was also detected in reproductive tract of pregnant mice and in hens 427 (Kaftanovskaya et al., 2015; Riou et al., 2015). On BTA16, the gene LMOD1 was 428 regulated by androgen and progesterone receptors in the human endometrium affecting 429 the decidualizing process (Cloke et al., 2008).

This article is protected by copyright. All rights reserved 430 On BTA17, CLGN is associated with the capacity of the sperm binding in the 431 mice zona pellucida during fertilization (Yamaguchi et al., 2006). CLGN was also 432 expressed in porcine spermatozoa (Kempisty et al., 2008). This gene was enriched in the 433 blastocysts of unstimulated heifers compared with those that were superovulated (Gad 434 et al., 2011). On BTA18, mutant NAPA gene caused embryo lethality and infertility in 435 mouse, due to a defect in acrosome reaction (Chae et al., 2004; Bátiz et al., 2009). 436 STRADA gene, on BTA19, participates of neuronal embryo formation (Orlova et al., 437 2010; Crino, 2015). The gene CHSY1 (BTA21) was overexpressed in bovine cumulus 438 cells 6 hour after the LH surge (Assidi et al., 2010) and was located in a candidate 439 region of this study (Table 3). PGAM2 (BTA22), was differentially expressed in 440 asthenozoospermic sperm from human sperm (Jodar et al., 2017). 441 ATF6B gene, on BTA23, was up-regulated in porcine fertilized blastocysts (Xu et 442 al., 2015). This gene was associated with growth traits in bovine (Mao et al., 2016) and 443 with reproductive traits in porcine (Liu et al., 2018). ENSBTAG00000006864 gene was 444 located in a region in association with serum IGF-1 concentration in dairy cows 445 (Gobikrushanth et al., 2018). C4A gene was downregulated in low infertility risk group 446 patients compared to intermittent infertility risk group of boys with cryptorchidism 447 (Hadziselimovic et al., 2009). This gene is associated with recurrent spontaneous 448 abortions in humans (Laitinen et al., 1991). Furthermore, C4A and C2 genes are 449 associated with the Major histocompatibility complex (MHC), which is involved in 450 inflammatory and immune responses (Perlmutter et al., 1986; Lokki & Laitinen, 2001). 451 On BTA27, the ASAH1 gene regulates the function of a steroidogenic fator in human 452 adrenochotical cells (Lucki et al., 2012). This gene was related with embryo lethality (Li 453 et al., 2002; Eliyahu et al., 2007). 454 Some of the genes presented in Table 4 (SNP validated for SC) were related to 455 reproductive events. On BTA11, the gene C11H2orf49, homologue of C2orf49, was 456 down-regulated in uterus of repeat breeder cows compared with non-repeat breeder 457 cows (Hayashi et al., 2017). On BTA12, the gene NEK3 is associated with prolactin 458 receptor in humans (Miller et al., 2005). Devi and Halperin (2014) discussed the role of 459 prolactin hormone in some reproduction process in rodents and human. This gene was Author Manuscript 460 down-regulated in F3 of mice males exposed to an organochlorine insecticide with 461 estrogenic properties, promoting a reduction in spermatogonia numbers (Gely-Pernot et 462 al., 2018). Gene FOXO1 showed an increasing of expression in the endometrium of 463 women with polycystic ovary syndrome (Kohan et al., 2010). Lappas et al. (2009)

This article is protected by copyright. All rights reserved 464 suggested that FOXO1 is related to gestation maintenance in humans. The expression of 465 this gene could be observed in spermatogonia and granulosa cells of several mammal 466 species, as mouse, rat, dog, cow, rhesus, chimpanzee and human (Tarnawa et al., 2013). 467 On BTA15, the ZPR1 gene is associated with embryo development; Gangwani et 468 al. (2005) observed that mice embryos with disruption in this gene died in early 469 embryonic development stage. APOA4 gene was associated with follicular development 470 in pigs oocyte (Ji et al., 2010). It was observed an increasing of this gene protein 471 expression in the later rat embryonic development stages (Dihazi et al., 2015), and those 472 authors suggest an association of this protein with steroid metabolic process and 473 cholesterol synthesis. Also APOC3 gene is related to cholesterol metabolism and 474 Tabatabaie et al. (2011) raised the hypothesis of this gene can be affect gonadal 475 maturity in rats. On BTA17, HCAR1 gene is commonly expressed in adipocytes and 476 was involved in the regulation of lipolysis in Holstein cows during the transition period 477 from late pregnancy to lactation (Weber et al., 2016). 478 On BTA19, GFAP gene is associated with the regulation of estrous cycle in cows 479 and mouse (Garcia-Segura et al., 1994; Stone et al., 1998). HEXIM1 and HEXIM2 genes 480 are associated with cell proliferation, and participate of mouse embryo development 481 (Yik et al., 2005). Gene TFEB was grouped in clusters functionally associated with 482 placentation on the branch ancestral to Hominidae, i.e., human, chimp, gorilla and 483 orangutan (Crosley et al., 2013). This gene was also reported as playing an important 484 role in normal placenta vascularization in mice (Steingrímsson et al., 1998). CCND3 485 gene was down-regulated in endometrium of patients with endometriosis (Ohlsson 486 Teague et al., 2009). This gene was listed by Choudhury and Kanapp (2001) as 487 associated with fertility, reproduction and development in human and rodents. This gene 488 presented significant expression in bovine granulosa cells (Shimizu et al., 2013). TAF8 489 gene was associated with embryo development in bovine (Al Naib et al., 2012). 490 MRPS10 was associated with luteolysis in ovine corpus luteum (Xu et al., 2017). On 491 BTA29 HIKESHI gene is associated with cellular stress response, as promoted by heat 492 shock (Kose et al., 2012). The gene EED is associated with female puberty in rats 493 (Lomniczi et al., 2013). Author Manuscript 494 495 Gene Ontology Analysis 496 Significant gene ontology terms enriched for EP were associated with immune 497 processes. Some Major Histocompatibility Complex (MHC) genes Classes I and II were

This article is protected by copyright. All rights reserved 498 enriched in these terms/pathways (ENSBTAG00000005146, ENSBTAG00000038619, 499 HLA-DMA, BOLA-DMB, ENSBTAG00000002581, BOLA-DOA). The relationship 500 between MHC and reproductive processes is well discussed in literature and briefly, has 501 association with maternal-fetal immune tolerance (Riley, 2008), conception success 502 (Fernandez et al., 1999) and sexual partner choice (Ziegler et al., 2010). 503 Melo et al. (2017) in a GWAS of number of calving at 53 months of age (NC53), 504 using the same Nellore population, found MHC Class II as the most significant term 505 enriched for this trait. However, the genomic region found here (for G1) was the BTA23 506 at 6.9-7.4 Mb, while in Melo et al. (2017) the MHC genes were detected in another 507 region, on BTA23 at 25-26 Mb. NC53 is a trait that reflects the ability of female to stay 508 in herd productively, while EP represents the female precocity. Besides these traits are 509 different and are measured in different female ages they both express the reproductive 510 potential of females to get pregnant, which could explain the similarity in the gene 511 pathways that they share. This found is an extra evidence of the importance of this gene 512 complex for female pregnancy success, apparently independent of the female age.

513 Most of the SCN genes were enriched in terms related to lipid metabolism. The 514 lipid metabolism is associated with a range of metabolic processes as: energy supply, 515 carbohydrate metabolism, endocrine system, hormones biosynthesis, as progesterone 516 and testosterone, which are dependent of the cholesterol molecule for their synthesis. 517 The importance of lipid metabolism for the reproductive performance in cows has 518 been well described in literature (Rodney et al., 2015; Wathes et al., 2013; Snijders et 519 al., 2000; Leroy et al., 2008). Females that present some nutritional lack affecting their 520 energy reserves will present problems to ovulate or to maintain a gestation. Likewise, 521 the lipid metabolism is also fundamental for male fertility (Kim et al., 2017). Some fatty 522 acids present crucial role in the maintenance of sperm integrity, sperm motility, viability 523 and fertility in several livestock species (Van Tran et al., 2016). Rezende et al. (2018) 524 found some genes associated with sire conception rate that were enriched in a functional 525 term named “Fatty acid degradation”, and reported previous studies that explained the 526 relationship between fatty acids and the ATP synthesis, promoting the sperm motility 527 (Amaral et al., 2013). Author Manuscript 528 In summary, we could validate genomic regions in a Brazilian Nellore cattle 529 population that were previous known as affecting sexual precocity traits in an 530 Australian Brahman cattle population. Also, we found new loci associated with sexual 531 precocity in common for both breeds. A total of 21 SNP were found as strong

This article is protected by copyright. All rights reserved 532 candidates to be affecting sexual precocity in Brahman and Nellore females. SNP that 533 were significant for Brahman generally were not the same for Nellore, however they 534 were close to significant SNP for Nellore. The reason for this could be the difference in 535 the linkage disequilibrium pattern between these breeds, i.e., different mutations 536 segregating across breeds that can be indicating the same gene or genomic region. The 537 different definition of female phenotypes used for both breeds may also be another 538 explanation for the found results. 539 For male, fewer regions could be validated than for female, especially for G2 and 540 no candidate SNP were found. This could be explained because for SC in Brahman the 541 X chromosome has an important effect, which could not be observed in the Nellore 542 population (results not shown), which led us to do not use this chromosome in our 543 analysis. 544 Some genes that were found close to validated SNP seem to be affecting sexual 545 precocity traits in females and males, suggesting the action of pleiotropic effects. 546 Results also showed that some of the candidate SNP from both validation sets (G1 and 547 G2), were located near genes playing roles in reproductive events in several mammal 548 species, including bovine. Therefore, those genes are strong candidates to be affecting 549 sexual precocity in tropical beef cattle. The genomic regions described here seem to be 550 related to the expression of the sexual precocity in Nellore and Brahman. 551 552 ACKNOWLEDGEMENTS 553 The authors acknowledge the researcher mobility International Cooperation 554 Program CAPES/COFECUB (Grant nº: 88881.133149/2016-01) that facilitated the 555 collaborations between UNESP and the University of Queensland. Financial support for 556 this research was granted by Fundação de Amparo à Pesquisa (FAPESP - Grant nº 557 2009/16118-5), Conselho Nacional de Desenvolvimento Científico e Tecnológico 558 (CNPq - Grant no 559631/2009-0) and Coordenação de Aperfeiçoamento de pessoal de 559 nível superior (CAPES). This work used the legacy dataset of the Cooperative Research 560 Centre for Beef Genetic Technologies (www.beefcrc.com) and the Alliance Nellore 561 dataset (www.gensys.com.br). Author Manuscript 562 CONFLICT OF INTEREST 563 The authors have no conflict of interest. 564 565 DATA AVAILABLE STATEMENT

This article is protected by copyright. All rights reserved 566 The Brahman data that support the findings of this study are available from the 567 Cooperative Research Centre for Beef Genetic Technologies. Restrictions apply to the 568 availability of these data, which were used under a research only license for this study. 569 Data are available from www.beefcrc.com with the permission of Meat and Livestock 570 Australia and the University of Queensland (if interested please contact Dr Marina 571 Fortes). 572 The Nellore data that support the findings of this study are available from the 573 Alliance Nellore dataset. Restrictions apply to the availability of these data, which were 574 used under license for this study. Data are available from https://gensys.com.br/ with the 575 permission of Alliance Nellore and Dr. Vanerlei Roso. 576 577 REFERENCES 578 Al Naib, A., Mamo, S., & Lonergan, P. (2012). Investigation of a Preferentially 579 Upregulated Gene Cluster in Day 7 Bovine Embryos Derived from RNA Sequencing 580 Data. Reproduction, Fertility and Development, 25, 256-256. 581 https://doi.org/10.1071/RDv25n1Ab215 582 583 Amaral, A., Castillo, J., Estanyol, J. M., Ballescà, J. L., Ramalho-Santos, J., & Oliva, R. 584 (2013). Human sperm tail proteome suggests new endogenous metabolic 585 pathways. Molecular & Cellular Proteomics, 12(2), 330-342. 586 https://doi.org/10.1074/mcp.M112.020552 587 588 Anton, I., Kovacs, K., Holló, G., Farkas, V., Lehel, L., Hajda, Z., & Zsolnai, A. (2011). 589 Effect of leptin, DGAT1 and TG gene polymorphisms on the intramuscular fat of 590 Angus cattle in Hungary. Livestock Science, 135, 300-303. 591 https://doi.org/10.1016/j.livsci.2010.07.012 592 593 Assidi, M., Dieleman. S. J., & Sirard, M. A. (2010). Cumulus cell gene expression 594 following the LH surge in bovine preovulatory follicles: potential early markers of 595 oocyte competence. Reproduction, 140, 835-852. https://doi.org/10.1530/REP-10-0248 Author Manuscript 596 597 Aquila, S., Bonofiglio, D., Gentile, M., Middea, E., Gabriele, S., Belmonte, M., ... & 598 Andò, S. (2006). Peroxisome proliferator activated receptor (PPAR) γ is expressed by

This article is protected by copyright. All rights reserved 599 human spermatozoa: Its potential role on the sperm physiology. Journal of cellular 600 physiology, 209(3), 977-986. https://doi.org/10.1002/jcp.20807 601 602 Bastián, Y., Zepeda-Bastida, A., Uribe, S., & Mujica, A. (2007). In spermatozoa, Stat1 603 is activated during capacitation and the acrosomal reaction. Reproduction, 134, 425- 604 433. https://doi.org/10.1530/REP-06-0264 605 606 Bátiz, L. F., De Blas, G. A., Michaut, M. A., Ramírez, A. R., Rodríguez, F., Ratto, M. 607 H., ... Mayorga, L. S. (2009). Sperm from hyh mice carrying a point mutation in αSNAP 608 have a defect in acrosome reaction. PLoS One, 4, e4963. 609 https://doi.org/10.1371/journal.pone.0004963 610 611 Benifla, J. L., Sifer, C., Bringuier, A. F., Blanc-Layrac, G., Camus, E., Madelenat, P., & 612 Feldmann, G. (2002). Induced apoptosis and expression of related in granulosa 613 cells from women undergoing IVF: a preliminary study. Human Reproduction, 17, 916- 614 920. https://doi.org/10.1093/humrep/17.4.916 615 616 Bolormaa, S., Pryce, J. E., Reverter, A., Zhang, Y., Barendse, W., Kemper, K., … 617 Goddard, M. E. (2014). A Multi-Trait, Meta-analysis for Detecting Pleiotropic 618 Polymorphisms for stature, Fatness and Reproduction in Beef Cattle. PLoS Genetics, 619 10(3), e1004198. https://doi.org/10.1371/journal.pgen.1004198 620 621 Bonnet, E., Fostier, A., & Bobe, J. (2007). Microarray-based analysis of fish egg quality 622 after natural or controlled ovulation. BMC Genomics, 8, 55. 623 https://doi.org/10.1186/1471-2164-8-55 624 625 Boschiero, C., Jorge, E. C., Ninov, K., Nones, K., do Rosário, M. F., Coutinho, L. L., & 626 Moura, A. S. A. (2013). Association of IGF1 and KDM5A polymorphisms with 627 performance, fatness and carcass traits in chickens. Journal of applied genetics, 54, 103- 628 112. https://doi.org/10.1007/s13353-012-0129-6 Author Manuscript 629 630 Briggs, H. M., & Briggs, D. M. (1980). Modern breeds of livestock (4th ed.). New 631 York, The Macmillian Publishing Company. 632

This article is protected by copyright. All rights reserved 633 Browning, B. L., & Browning, S. R. (2009). A unified approach to genotype imputation 634 and haplotype-phase inference for large data sets of trios and unrelated individuals. 635 American Journal of Human Genetics, 84, 210-223. 636 https://doi.org/10.1016/j.ajhg.2009.01.005 637 638 Burns, B. M., Corbet, N. J., Corbet, D. H., Crisp, J. M., Venus, B. K., Johnston, D. J., 639 … Holroyd, R. G. (2013). Male traits and herd reproductive capability in tropical beef 640 cattle. 1. Experimental design and animal measures. Animal Production Science, 53, 87- 641 100. http://dx.doi.org/10.1071/an12163 642 643 Buzanskas, M. E., do Amaral Grossi, D., Ventura, R. V., Schenkel, F. S., Chud, T. C. 644 S., Stafuzza, N. B., … Munari, D. P. (2017). Candidate genes for male and female 645 reproductive traits in Canchim beef cattle. Journal of animal science and biotechnology, 646 8, 67. https://doi.org/10.1186/s40104-017-0199-8 647 648 Chae, T. H., Kim, S., Marz, K. E., Hanson, P. I., & Walsh, C. A. (2004). The hyh 649 mutation uncovers roles for αSnap in apical protein localization and control of neural 650 cell fate. Nature genetics, 36, 264-270. https://doi.org/10.1038/ng1302 651 652 Choudhury, S. R., & Knapp, L. A. (2001). Human reproductive failure II: 653 immunogenetic and interacting factors. Human reproduction update, 7, 135-160. 654 https://doi.org/10.1093/humupd/7.2.135 655 656 Cloke, B., Huhtinen, K., Fusi, L., Kajihara, T., Yliheikkilä, M., Ho, K. K., ... & Kim, J. 657 J. (2008). The androgen and progesterone receptors regulate distinct gene networks and 658 cellular functions in decidualizing endometrium. Endocrinology, 149(9), 4462-4474. 659 https://doi.org/10.1210/en.2008-0356 660 661 Cochran, S. D., Cole, J. B., Null, D. J., & Hansen, P. J. (2013). Discovery of single 662 nucleotide polymorphisms in candidate genes associated with fertility and production Author Manuscript 663 traits in Holstein cattle. BMC genetics, 14, 49. https:// doi.org/10.1186/1471-2156-14-49 664 665 CONCEA. (2008, October 8). Conselho nacional de controle de experimentação animal 666 [pdf file]. Retrieved from

This article is protected by copyright. All rights reserved 667 http://www.unitau.br/files/arquivos/category_114/Normativa_CONCEA_1439476063.p 668 df 669 670 Corbet, N. J., Burns, B. M., Johnston, D. J., Wolcott, M. L., Corbet, D. H., Venus, B. 671 K., … Holroyd, R. G. (2013). Male traits and herd reproductive capability in tropical 672 beef cattle. 2. Genetic parameters of bull traits. Animal Production Science, 53, 101- 673 113. https://10.1186/10.1071/AN12163 674 675 Costa, R. B., Camargo, G. M., Diaz, I. D. P. S., Irano, N., Dias, M. M., Carvalheiro, R., 676 ... Albuquerque, L. G. (2015). Genome-wide association study of reproductive traits in 677 Nellore heifers using bayesian inference. Genetics Selection Evolution, 47, 67. 678 https://10.1186/s12711-015-0146-0 679 680 Crino, P. B. (2015). mTOR signaling in epilepsy: insights from malformations of 681 cortical development. Cold Spring Harbor perspectives in medicine, 5, a022442. 682 https://10.1101/cshperspect.a022442 683 684 Crosley, E. J., Elliot, M. G., Christians, J. K., & Crespi, B. J. (2013). Placental invasion, 685 preeclampsia risk and adaptive molecular evolution at the origin of the great apes: 686 evidence from genome-wide analyses. Placenta, 34, 127-132. 687 https://doi.org/10.1016/j.placenta.2012.12.001 688 689 Devi, Y. S., & Halperin, J. (2014). Reproductive actions of prolactin mediated through 690 short and long receptor isoforms. Molecular and cellular endocrinology, 382, 400-410. 691 https://doi.org/10.1016/j.mce.2013.09.016 692 693 Dias, M. M., Souza, F. R. P., Takada, L., Feitosa, F. L., Costa R. B., Diaz, I. D., ... 694 oliveira, H. N. (2015). Study of lipid metabolism-related genes as candidate genes of 695 sexual precocity in Nellore cattle. Genetics and Molecular Research, 14, 234-243. 696 https://doi.org/10.4238/2015.January.16.7 Author Manuscript 697 698 Dihazi, G. H., Mueller, G. A., Asif, A. R., Eltoweissy, M., Wessels, J. T., & Dihazi, H. 699 (2015). Proteomic characterization of adrenal gland embryonic development reveals

This article is protected by copyright. All rights reserved 700 early initiation of steroid metabolism and reduction of the retinoic acid pathway. 701 Proteome science, 13, 6. https://doi.org/10.1186/s12953-015-0063-8 702 703 Durinck, S., Moreau, Y., Kasprzyk, A., Davis, S., De Moor, B., Brazma, A., & Huber, 704 W. (2005). BioMart and Bioconductor: a powerful link between biological databases 705 and microarray data analysis. Bioinformatics, 21, 3439-3440. 706 https://doi.org/10.1093/bioinformatics/bti525 707 708 Durinck, S., Spellman, P., Birney, E., & Huber, W. (2009). Mapping identifiers for the 709 integration of genomic datasets with the R/Bioconductor package biomaRt. Nature 710 Protocols, 4, 1184-1191. https://doi.org/10.1038/nprot.2009.97 711 712 Eliyahu, E., Park, J. H., Shtraizent, N., He, X., & Schuchman, E. H. (2007). Acid 713 ceramidase is a novel factor required for early embryo survival. The FASEB journal, 21, 714 1403-1409. https://doi.org/10.1096/fj.06-7016com 715 716 Fernandez, N., Cooper, J., Sprinks, M., AbdElrahman, M., Fiszer, D., Kurpisz, M., & 717 Dealtry, G. (1999). A critical review of the role of the major histocompatibility complex 718 in fertilization, preimplantation development and feto-maternal interactions. Human 719 Reproduction Update, 5(3), 234-248. https://doi.org/10.1093/humupd/5.3.234 720 721 Fontanesi, L., Calò, D. G., Galimberti, G., Negrini, R., Marino, R., Nardone, A., ... 722 Russo, V. (2014). A candidate gene association study for nine economically important 723 traits in Italian Holstein cattle. Animal genetics, 45, 576-580. 724 https://doi.org/10.1111/age.12164 725 726 Fortes, M. R. S., Lehnert, S. A., Bolormaa, S., Reich, C., Fordyce, G., Corbet, N. J., … 727 Reverter, A. (2012a). Finding genes for economically important traits: Brahman cattle 728 puberty. Animal Production Science, 52, 143-150. https://doi.org/10.1071/AN11165 729 Author Manuscript 730 Fortes, M. R., Reverter, A., Hawken, R. J., Bolormaa, S., & Lehnert, S. A. (2012b). 731 Candidate genes associated with testicular development, sperm quality, and hormone 732 levels of inhibin, luteinizing hormone, and insulin-like growth factor 1 in Brahman 733 bulls. Biology of Reproduction, 87, 58. https://doi.org/10.1095/biolreprod.112.101089

This article is protected by copyright. All rights reserved 734 735 Fortes, M. R. S., Kemper, K., Sasazaki, S., Reverter, A., Pryce, J. E., Barendse, W., … 736 Lehnert, S. A. (2013a). Evidence for pleiotropism and recent selection in the PLAG1 737 region in Australian beef cattle. Animal Genetics, 44, 636-647. 738 https://doi.org/10.1111/age.12075 739 740 Fortes, M. R. S., Reverter, A., Neto, L. P., Kelly, M., Moore, S. S., & Lehnert, S. A. 741 (2013b). Genetic markers associated with male reproductive traits across 2 beef cattle 742 breeds: Brahman and tropical composite. Proceddings of the Association for 743 Advancement of Animal Breeding and Genetics, 20, 389-392 744 745 Fortes, M. R. S., Nguyen, L. T., Weller, M. M., Cánovas, A., Islas-Trejo, A., Porto- 746 Neto, L. R., ... Moore, S. S. (2016). Transcriptome analyses identify five transcription 747 factors differentially expressed in the hypothalamus of post-versus prepubertal Brahman 748 heifers. Journal of Animal Science, 94, 3693-3702. https://doi.org/10.2527/jas.2016- 749 0471 750 751 Fortes, M. R. S., Enculescu, C., Porto Neto, L. R., Lehnert, S. A., McCulloch, R., & 752 Hayes, B. (2018). Candidate mutations used to aid the prediction of genetic merit for 753 female reproductive traits in tropical beef cattle. Revista Brasileira de Zootecnia, 47, 754 e20170226. https://dx.doi.org/10.1590/rbz4720170226 755 756 Gad, A., Besenfelder, U., Rings, F., Ghanem, N., Salilew-Wondim, D., Hossain, M. M., 757 … Hölker, M. (2011). Effect of reproductive tract environment following controlled 758 ovarian hyperstimulation treatment on embryo development and global transcriptome 759 profile of blastocysts: implications for animal breeding and human assisted 760 reproduction. Human Reproduction, 26, 1693-1707. 761 https://doi.org/10.1093/humrep/der110 762 763 Gan, Q. F., Zhang, L. P., Li, J. Y., Hou, G. Y., Gao, X., Ren, H. Y., … Xu, S. Z. (2008). Author Manuscript 764 Association analysis of thyroglobulin gene variants with carcass and meat quality traits 765 in beef cattle. Journal of Applied Genetics, 49, 251-255. 766 https://doi.org/10.1007/BF03195621 767

This article is protected by copyright. All rights reserved 768 Gangwani, L., Flavell, R. A., & Davis, R. J. (2005). ZPR1 is essential for survival and is 769 required for localization of the survival motor neurons (SMN) protein to Cajal bodies. 770 Molecular and cellular biology, 25, 2744-2756. 771 https://doi.org/10.1128/MCB.25.7.2744-2756.2005 772 773 García-Herrero, S., Meseguer, M., Martínez-Conejero, J. A., Remohí, J., Pellicer, A., & 774 Garrido, N. (2010). The transcriptome of spermatozoa used in homologous intrauterine 775 insemination varies considerably between samples that achieve pregnancy and those 776 that do not. Fertility and sterility, 94, 1360-1373. 777 https://doi.org/10.1016/j.fertnstert.2009.07.1671 778 779 Garcia-Segura, L. M., Luqín, S., Párducz, A., & Naftolin, F. (1994). Gonadal hormone 780 regulation of glial fibrillary acidic protein immunoreactivity and glial ultrastructure in 781 the rat neuroendocrine hypothalamus. Glia, 10, 59-69. 782 https://doi.org/10.1002/glia.440100108 783 784 Gely-Pernot, A., Hao, C., Legoff, L., Multigner, L., D’Cruz, S. C., Kervarrec, C., … 785 Smagulova, F. (2018). Gestational exposure to chlordecone promotes transgenerational 786 changes in the murine reproductive system of males. Scientific reports, 8, 10274. 787 https://doi.org/10.1038/s41598-018-28670-w 788 789 Gibori, G., Shehu, A., Mao, J., Gibori, G. B., Risk, M.C., Duan, R., … Devi, Y. S. 790 (2009). The Large Luteal Cells-Derived PRAP/HSD17B7: An Enzyme with a Split 791 Personality. Biology of Reproduction, 81. https://doi.org/10.1093/biolreprod/81.s1.11 792 793 Giritharan, G., Li, M. W., De Sebastiano, F., Esteban, F. J., Horcajadas, J. A., Lloyd, K. 794 C. K., ... Rinaudo, P. F. (2010). Effect of ICSI on gene expression and development of 795 mouse preimplantation embryos. Human reproduction, 25, 3012-3024. 796 https://doi.org/10.1093/humrep/deq266 797 Author Manuscript 798 Gobikrushanth, M., Purfield, D. C., Colazo, M. G., Wang, Z., Butler, S. T., & Ambrose, 799 D. J. (2018). The relationship between serum insulin-like growth factor-1 (IGF-1) 800 concentration and reproductive performance, and genome-wide associations for serum

This article is protected by copyright. All rights reserved 801 IGF-1 in Holstein cows. Journal of dairy science, 101, 1-14. 802 https://doi.org/10.3168/jds.2018-14535 803 804 Goddard, M. E., & Hayes, B. J. (2009). Mapping genes for complex traits in domestic 805 animals and their use in breeding programmes. Nature Reviews Genetics, 10, 381-391. 806 https://doi.org/10.1038/nrg2575 807 808 Habier, D., Fernando, R. L., Kizilkaya, K., & Garrick, D. J. (2011). Extension of the 809 Bayesian alphabet for genomic selection. BMC Bioinformatics, 12, 186. 810 https://doi.org/10.1186/1471-2105-12-186 811 812 Hadziselimovic, F., Hadziselimovic, N. O., Demougin, P., Krey, G., Hoecht, B., & 813 Oakeley, E. J. (2009). EGR4 is a master gene responsible for fertility in cryptorchidism. 814 Sexual Development, 3, 253-263. https://doi.org/10.1159/000249147 815 816 Hayashi, K. G., Hosoe, M., Kizaki, K., Fujii, S., Kanahara, H., Takahashi, T., & 817 Sakumoto, R. (2017). Differential gene expression profiling of endometrium during the 818 mid-luteal phase of the estrous cycle between a repeat breeder (RB) and non-RB cows. 819 Reproductive Biology and Endocrinology, 15, 20. https://doi.org/10.1186/s12958-017- 820 0237-6 821 822 Henderson, C. R. (1984). Aplications of linear models in animal breeding. University of 823 Guelph, Guelph. 824 825 Höglund, J. K., Sahana, G., Guldbrandtsen, B., & Lund, M. S. (2014). Validation of 826 associations for female fertility traits in Nordic Holstein, Nordic Red and Jersey dairy 827 cattle. BMC Genetics, 15, 8. https://doi.org/10.1186/1471-2156-15-8 828 829 Huang, D. W., Sherman, B. T., & Lempicki, R. A. (2009a). Systematic and integrative 830 analysis of large gene lists using DAVID bioinformatics resources. Nature Author Manuscript 831 protocols, 4(1), 44. https://doi.org/10.1038/nprot.2008.211 832

This article is protected by copyright. All rights reserved 833 Huang, D. W., Sherman, B. T., & Lempicki, R. A. (2009b). Bioinformatics enrichment 834 tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic 835 acids research, 37(1), 1-13. https://doi.org/10.1093/nar/gkn923 836 837 Irano, N., Camargo, G. M., Costa, R. B., Terakado, A. P. N., Magalhães, A. F. B., Silva, 838 R. M. O., ... Albuquerque, L. G. (2016). Genome-wide association study for indicator 839 traits of sexual precocity in Nellore cattle. Plos One, 11, e0159502. 840 https://doi.org/10.1371/journal.pone.0159502 841 842 Ji, M. R., Jang, D. M., Lee, Y. S., Cheong, H. T., Yang, B. K., & Park, C. K. (2010). 843 Changes of protein profiles during follicle development and in vitro oocyte maturation 844 in the pig. Reproduction, Fertility and Development, 23, 187-187. 845 https://doi.org/10.1071/RDv23n1Ab169 846 847 Jodar, M., Soler-Ventura, A., & Oliva, R. (2017). Semen proteomics and male 848 infertility. Journal of proteomics, 162, 125-134. 849 http://dx.doi.org/10.1016/j.jprot.2016.08.018 850 851 Johnston, D. J., Barwick, S. A., Corbet, N. J., Fordyce, G., Holroyd, R. G., Williams, P. 852 J., & Burrow, H. M. (2009). Genetics of heifer puberty in two tropical beef genotypes in 853 northern australia and associations with heifer- and steer-production traits. Animal 854 Production Science, 49, 399-412. https://doi.org/10.1071/EA08276 855 856 Johnston, D. J., Barwick, S. A., Fordyce, G., & Holroyd, R. G. (2010). Understanding 857 the Genetics of Lactation Anoestrus in Brahman Beef Cattle to Enhance Genetic 858 Evaluation of Female Reproductive Traits. Proceddings of 9th World Congress on 859 Genetics Applied to Livestock Production. Leipzig, Germany. 860 861 Kaftanovskaya, E. M., Huang, Z., Lopez, C., Conrad, K., & Agoulnik, A. I. (2015). 862 Conditional deletion of the relaxin receptor gene in cells of smooth muscle lineage Author Manuscript 863 affects lower reproductive tract in pregnant mice. Biology of reproduction, 92, 1-9. 864 https://doi.org/10.1095/biolreprod.114.127209 865

This article is protected by copyright. All rights reserved 866 Kang, H. M., Sul, J. H., Service, S. K., Zaitlen, N. A., Kong, S. Y., Freimer, N. B., … 867 Eskin, E. (2010). Variance component model to account for sample structure in 868 genome-wide association studies. Nature Genetics, 42, 348-354. 869 https://doi.org/10.1038/ng.548 870 871 Karlsson, E. K., Baranowska, I., Wade, C. M., Hillbertz, N. H. C. S., Zody, M. C., 872 Anderson, N., … Lindblad-Toh, K. (2007) Efficient mapping of mendelian traits in 873 dogs through genome-wide association. Nature genetics, 39, 1321-1328. 874 https://doi.org/10.1038/ng.2007.10 875 876 Kempisty, B., Antosik, P., Bukowska, D., Jackowska, M., Lianeri, M., Jaśkowski, J. M., 877 & Jagodziński, P. P. (2008). Analysis of selected transcript levels in porcine 878 spermatozoa, oocytes, zygotes and two-cell stage embryos. Reproduction, Fertility and 879 Development, 20, 513-518. https://doi.org/10.1071/RD07211 880 881 Kiewisz, J., Krawczynski, K., Lisowski, P., Blitek, A., Zwierzchowski, L., Ziecik, A. J., 882 & Kaczmarek, M. M. (2014). Global gene expression profiling of porcine endometria 883 on Days 12 and 16 of the estrous cycle and pregnancy. Theriogenology, 82, 897-909. 884 http://dx.doi.org/10.1016/j.theriogenology.2014.07.009 885 886 Kim, N., Nakamura, H., Masaki, H., Kumasawa, K., Hirano, K. I., & Kimura, T. (2017). 887 Effect of lipid metabolism on male fertility. Biochemical and biophysical research 888 communications, 485(3), 686-692. https://doi.org/10.1016/j.bbrc.2017.02.103 889 890 Kohan, K., Carvajal, R., Gabler, F., Vantman, D., Romero, C., & Vega, M. (2010). Role 891 of the transcriptional factors FOXO1 and PPARG on gene expression of SLC2A4 in 892 endometrial tissue from women with polycystic ovary syndrome. Reproduction, 140, 893 123-131. https://doi.org/10.1530/REP-10-0056 894 895 Kose, S., Furuta, M., & Imamoto, N. (2012). Hikeshi, a nuclear import carrier for Author Manuscript 896 Hsp70s, protects cells from heat shock-induced nuclear damage. Cell, 149, 578-589. 897 https://doi.org/10.1016/j.cell.2012.02.058 898

This article is protected by copyright. All rights reserved 899 Laitinen, T., Lokki, M. L., Tulppala, M., Ylikorkala, O., & Koskimies, S. (1991). 900 Increased frequency of complement C4 ‘null’alleles in recurrent spontaneous abortions. 901 Human Reproduction, 6, 1384-1387. 902 903 Lambrot, R., Tian, C., Jones, S., Saint-Phar, S., Godmann, M., & Kimmins, S. (2008). 904 Specialized spermiogenic distribution of histone lysine methyltransferase KMT6 and 905 demethylase KDM5A. Proceedings of Biology of reproduction. Madison, USA: Soc 906 Study Reproduction. https://doi.org/10.1093/biolreprod/78.s1.210 907 908 Lappas, M., Riley, C., Rice, G. E., & Permezel, M. (2009). Increased expression of ac- 909 FoxO1 protein in prelabor fetal membranes overlying the cervix: possible role in human 910 fetal membrane rupture. Reproductive Sciences, 16(7), 635-641. 911 https://doi.org/10.1177/1933719109332831 912 913 Legarra, A., Ricard, A., & Filangi, O. (2014). GS3. User Manual. France. 914 915 Lemberger, T., Desvergne, B., & Wahli, W. (1996). Peroxisome proliferator-activated 916 receptors: a nuclear receptor signaling pathway in lipid physiology. Annual review of 917 cell and developmental biology, 12(1), 335-363. 918 https://doi.org/10.1146/annurev.cellbio.12.1.335 919 920 Leroy, J. L. M. R., Opsomer, G., Van Soom, A., Goovaerts, I. G. F., & Bols, P. E. J. 921 (2008). Reduced fertility in High yielding dairy cows: Are the oocyte and embryo in 922 danger? Part I The importance of negative energy balance and altered corpus luteum ‐ 923 function to the reduction of oocyte and embryo quality in High yielding dairy 924 cows. Reproduction in domestic animals, 43(5), 612-622. ‐ 925 https://doi.org/10.1111/j.1439-0531.2007.00960 926 927 Li, C. M., Park, J. H., Simonaro, C. M., He, X., Gordon, R. E., Friedman, A. H., … 928 Schuchman, E. H. (2002). Insertional mutagenesis of the mouse acid ceramidase gene Author Manuscript 929 leads to early embryonic lethality in homozygotes and progressive lipid storage disease 930 in heterozygotes. Genomics, 79, 218-224. https://doi.org/10.1006/geno.2002.6686 931

This article is protected by copyright. All rights reserved 932 Liu, S., Yin, H., Li, C., Qin, C., Cai, W., Cao, M., & Zhang, S. (2017). Genetic effects 933 of PDGFRB and MARCH1 identified in GWAS revealing strong associations with 934 semen production traits in Chinese Holstein bulls. BMC Genetics, 18, 218-224. 935 https://doi.org/10.1186/s12863-017-0527-1 936 937 Liu, C., Ran, X., Yu, C., Xu, Q., Niu, X., Zhao, P., & Wang, J. (2018). Whole-genome 938 analysis of structural variations between Xiang pigs with larger litter sizes and those 939 with smaller litter sizes. Genomics. https://doi.org/10.1016/j.ygeno.2018.02.005 940 941 Lokki, M. L., & Laitinen, T. (2001). Role of major histocompatibility complex class III 942 genes in recurrent spontaneous abortions. Frontiers in Bioscience, 6, 23-29. 943 944 Lomelí, H., Starling, C, & Gridley, T. (2009). Epiblast-specific Snai1 deletion results in 945 embryonic lethality due to multiple vascular defects. BMC research notes, 2, 1-6. 946 https://doi.org/10.1186/1756-0500-2-22 947 948 Lomniczi, A., Loche, A., Castellano, J. M., Ronnekleiv, O. K., Bosch, M., Kaidar, G., 949 … Ojeda, S. R. (2013). Epigenetic control of female puberty. Nature neuroscience, 16, 950 281-292. https://doi.org/10.1038/nn.3319 951 952 Lucki, N. C., Li, D., Bandyopadhyay, S., Wang, E., Merrill, A. H., & Sewer, M. B. 953 (2012). Acid ceramidase (asah1) represses steroidogenic factor-1 (sf-1)-dependent gene 954 transcription in h295r human adrenocortical cells by binding to the receptor. Molecular 955 and cellular biology, 32, 4419-4431. https://doi.org/10.1128/MCB.00378-12 956 957 Mao, X., Sahana, G., De Koning, D. J., & Guldbrandtsen, B. (2016). Genome-wide 958 association studies of growth traits in three dairy cattle breeds using whole-genome 959 sequence data. Journal of animal science, 94, 1426-1437. 960 https://doi.org/10.2527/jas2015-9838 961 Author Manuscript 962 McClure, M. C., Morsci, N. S., Schnabel, R. D., Kim, J. W., Yao, P., Rolf, M. M., 963 …Taylor, J. F. (2010). A genome scan for quantitative trait loci influencing carcass, 964 post-natal growth and reproductive traits in commercial Angus cattle. Animal genetics, 965 41, 597-607. https://doi.org/10.1111/j.1365-2052.2010.02063.x

This article is protected by copyright. All rights reserved 966 967 de Melo, T. P., De Camargo, G. M. F., De Albuquerque, L. G., & Carvalheiro, R. 968 (2017). Genome-wide association study provides strong evidence of genes affecting the 969 reproductive performance of Nellore beef cows. PloS one, 12(5), e0178551. 970 https://doi.org/10.1371/journal.pone.0178551 971 972 Miller, S. L., DeMaria, J. E., Freier, D. O., Riegel, A. M., & Clevenger, C. V. (2005). 973 Novel association of Vav2 and Nek3 modulates signaling through the human prolactin 974 receptor. Molecular endocrinology, 19, 939-949. https://doi.org/10.1210/me.2004-0443 975 976 Mota, R. R., Guimarães, S. E. F., Fortes, M. R. S., Hayes, B., Silva, F. F., Verardo, L. 977 L., … Moore, S. (2017). Genome-wide association study and annotating candidate gene 978 networks affecting age at first calving in Nellore cattle. Journal of Animal Breeding and 979 Genetics, 134, 484-492. https://doi.org/10.1111/jbg.12299 980 981 Nguyen, L. T., Reverter, A., Cánovas, A., Venus, B., Islas-Trejo, A., Porto-Neto, L. R., 982 & Fortes, M. R. S. (2017). Global differential gene expression in the pituitary gland and 983 the ovaries of pre-and postpubertal Brahman heifers. Journal of Animal Science, 95, 984 599-615. https://doi.org/10.2527/jas2016.0921 985 986 Ohlsson Teague, E. M. C., Print, C. G., & Hull, M. L. (2009). The role of microRNAs 987 in endometriosis and associated reproductive conditions. Human reproduction update, 988 16, 142-165. https://doi.org/10.1093/humupd/dmp034 989 990 Oliveira Júnior, G., Perez, B. C., Cole, J. B., Santana, M. H. A., Silveira, J., Mazzoni, 991 G., ... Ferraz, J. B. S. (2017). Genomic study and Medical Subject Headings enrichment 992 analysis of early pregnancy rate and antral follicle numbers in Nelore heifers. Journal of 993 animal science, 95, 4796-4812. https://doi.org/10.2527/jas2017.1752 994 995 Orlova, K. A., Parker, W. E., Heuer. G. G., Tsai, V., Yoon, J., Baybis, M., … Crino, P. Author Manuscript 996 B. (2010). STRADα deficiency results in aberrant mTORC1 signaling during 997 corticogenesis in humans and mice. The Journal of clinical investigation, 120, 1591- 998 1602. https://doi.org/10.1172/JCI41592 999

This article is protected by copyright. All rights reserved 1000 Ortega, M. S., Denicol, A. C., Cole, J. B., Null, D. J., & Hansen, P. J. (2016). Use of 1001 single nucleotide polymorphisms in candidate genes associated with daughter 1002 pregnancy rate for prediction of genetic merit for reproduction in Holstein cows. Animal 1003 Genetics, 47, 288-297. https://doi.org/10.1111/age.12420 1004 1005 Osuch, J. R., Hsu, W. W., Todem, D., Landgraf, J., Mikucki, D., Haan, P. S., & 1006 Karmaus, W. (2012). Female reproductive status and circulating blood leukocyte 1007 expression of selected metabolic or signaling genes involved in sex steroid metabolism. 1008 International journal of molecular epidemiology and genetics, 3, 134-143. 1009 1010 Parker Gaddis, K., Null, D. J., & Cole, J. B. (2016). Explorations in genome-wide 1011 association studies and network analyses with dairy cattle fertility traits. Journal of 1012 dairy science, 99, 6420-6435. http://dx.doi.org/10.3168/jds.2015-10444 1013 1014 Pegolo, S., Dadousis, C., Mach, N., Ramayo-Caldas, Y., Mele, M., Conte, G., ... 1015 Cecchinato, A. (2017). SNP co-association and network analyses identify E2F3, 1016 KDM5A and BACH2 as key regulators of the bovine milk fatty acid profile. Scientific 1017 reports, 7, 1-19, https://doi.org/10.1038/s41598-017-17434-7 1018 1019 Perlmutter, D, H., Goldberger, G., Dinarello, C. A., Mizel, S. B., & Colten, H. R. 1020 (1986). Regulation of class III major histocompatibility complex gene products by 1021 interleukin-1. Science, 232, 850-852. https://doi.org/10.1126/science.3010455 1022 1023 Peters, S. O., Kizilkaya, K., Garrick, D. J., Fernando, R. L., Reecy, J. M., Weaber, R. 1024 L., …Thomas, M. G. (2013). Heritability and Bayesian genome-wide association study 1025 of first service conception and pregnancy in Brangus heifers. Journal of animal science, 1026 91, 605-612. https://doi.org/10.2527/jas2012-5580 1027 1028 Pluta, K., McGettigan, P. A., Reid, O. J., Browne, J. A., Irwin, A. J., Tharmalingam, T., 1029 … Carrington, S. D. (2012). Molecular aspects of mucin biosynthesis and mucus Author Manuscript 1030 formation in the bovine cervix during the periestrous period. Physiological genomics, 1031 44, 1165-1178. https://doi.org/10.1152/physiolgenomics.00088.2012. 1032

This article is protected by copyright. All rights reserved 1033 Porto-Neto, L. R., Edwards, S., Fortes, M. R., Lehnert, S. A., Reverter, A., & Mcgowan, 1034 M. (2015). Genome-wide association for the outcome of fixed-time artificial 1035 insemination of Brahman heifers in northern Australia. Journal of Animal Science, 93, 1036 5119-5127. https://doi.org/10.2527/jas2015-9401 1037 1038 Pryce, J. E., Bolormaa, S., Chamberlain, A. J., Bowman, P. J., Savin, K., Goddard, M. 1039 E., & Hayes, B. J. (2010). A validated genome-wide association study in 2 dairy cattle 1040 breeds for milk production and fertility traits using variable length haplotypes. Journal 1041 of dairy science, 93, 3331-3345. https://doi.org/10.3168/jds.2009-2893 1042 1043 Rezende, F. M., Dietsch, G. O., & Peñagaricano, F. (2018). Genetic dissection of bull 1044 fertility in US Jersey dairy cattle. Animal genetics, 49(5), 393-402. 1045 https://doi.org/10.1111/age.12710 1046 1047 Riley, J. K. (2008). Trophoblast immune receptors in maternal-fetal 1048 tolerance. Immunological investigations, 37(5-6), 395-426. 1049 https://doi.org/10.1080/08820130802206066 1050 1051 Riou, C., Saint-Dizier, M., & Gérard, N. (2015). Sperm storage: expression of 1052 progesterone receptors, structural proteins, and heat shock proteins in the avian oviduct. 1053 Reproduction, Fertility and Development, 28, 203-203. 1054 https://doi.org/10.1071/RDv28n2Ab146 1055 1056 Rodney, R. M., Celi, P., Scott, W., Breinhild, K., & Lean, I. J. (2015). Effects of dietary 1057 fat on fertility of dairy cattle: A meta-analysis and meta-regression. Journal of dairy 1058 science, 98(8), 5601-5620. http://dx.doi.org/10.3168/jds.2015-9528 1059 1060 Saatchi, M., Schnabel, R. D., Taylor, J. F., & Garrick, D. J. (2014). Large-effect 1061 pleiotropic or closely linked QTL segregate within and across ten US cattle breeds. 1062 BMC genomics, 15, 1-16. https://doi.org/10.1186/1471-2164-15-442 Author Manuscript 1063 1064 Sahana, G., Guldbrandtsen, B., Bendixen, C., & Lund, M. S. (2010). Genome-wide 1065 association mapping for female fertility traits in Danish and Swedish Holstein cattle. 1066 Animal Genetics, 41, 579-588. http://dx.doi.org/ 10.3168/jds.2014-8141

This article is protected by copyright. All rights reserved 1067 1068 Santoro, M., Guido, C., De Amicis, F., Sisci, D., Vizza, D., Gervasi, S., ... & Aquila, S. 1069 (2013). Sperm metabolism in pigs: a role for peroxisome proliferator-activated receptor 1070 gamma (PPARγ). Journal of Experimental Biology, 216(6), 1085-1092. 1071 https://doi.org/10.1242/jeb.079327 1072 1073 dos Santos Hidalgo, G., Meola, J., Silva, J. C. R., de Paz, C. C. P., & Ferriani, R. A. 1074 (2011). TAGLN expression is deregulated in endometriosis and may be involved in cell 1075 invasion, migration, and differentiation. Fertility and sterility, 96, 700-703. 1076 https://doi.org/10.1016/j.fertnstert.2011.06.052 1077 1078 Sargolzaei, M., Chesnais, J. P., & Schenkel, F. S. (2014). A new approach for efficient 1079 genotype imputation using information from relatives. BMC genomics, 15, 1-12. 1080 https://doi.org/10.1186/1471-2164-15-478. 1081 1082 Shimizu, T., Hirai, Y., & Miyamoto, A. (2013). Expression of Cyclins and Cyclin- 1083 Dependent Kinase Inhibitors in Granulosa Cells from Bovine Ovary. Reproduction in 1084 Domestic Animals, 48, e65-e69. https://doi.org/10.1111/rda.12177 1085 1086 Snijders, S. E. M., Dillon, P., O'Callaghan, D., & Boland, M. P. (2000). Effect of 1087 genetic merit, milk yield, body condition and lactation number on in vitro oocyte 1088 development in dairy cows. Theriogenology, 53(4), 981-989. 1089 https://doi.org/10.1016/S0093-691X(00)00244-2 1090 1091 Steele-Perkins, G., Plachez, C., Butz, K. G., Yang, G., Bachurski, C. J., Kinsman, S. L., 1092 … Gronostajski, R. M. (2005). The transcription factor gene Nfib is essential for both 1093 lung maturation and brain development. Molecular and cellular biology, 25, 685-698. 1094 https://doi.org/10.1128/MCB.25.2.685–698.2005 1095 1096 Steingrímsson, E., Tessarollo, L., Reid, S. W., Jenkins, N. A., & Copeland N. G. (1998). Author Manuscript 1097 The bHLH-Zip transcription factor Tfeb is essential for placental vascularization. 1098 Development, 125, 4607-4616. 1099

This article is protected by copyright. All rights reserved 1100 Stone, D. J., Song, Y., Anderson, C. P., Krohn, K. K., Finch, C. E., & Rozovsky, I. 1101 (1998). Bidirectional transcription regulation of glial fibrillary acidic protein by 1102 estradiol in vivo and in vitro. Endocrinology, 139, 3202-3209. 1103 https://doi.org/10.1210/endo.139.7.6084 1104 1105 Storlazzi, C. T., Mertens, F., & Panagopoulos, I. (2004). CREB3L2 (cAMP responsive 1106 element binding protein 3-like 2). Atlas of Genetics and Cytogenetics in Oncology and 1107 Haematology, 8, 316-317. 1108 1109 Tabatabaie, V., Atzmon, G., Rajpathak, S. N., Freeman, R., Barzilai, N., Crandall, J. 1110 (2011). Exceptional longevity is associated with decreased reproduction. Aging (Albany 1111 NY), 3, 1202-1205. https://doi.org/10.18632/aging.100415 1112 1113 Tarnawa, E. D., Baker, M. D., Aloisio, G. M., Carr, B. R., Castrillon, D. H. (2013). 1114 Gonadal expression of Foxo1, but not Foxo3, is conserved in diverse Mammalian 1115 species. Biology of reproduction, 88, 1-11. 1116 https://doi.org/10.1095/biolreprod.112.105791 1117 1118 VanRaden, P. M. (2008). Efficient methods to compute genomic predictions. Journal of 1119 Dairy Science, 91, 4414-4423. https://doi.org/10.3168/jds.2007-0980 1120 1121 Van Tran, L., Malla, B. A., Kumar, S., & Tyagi, A. K. (2017). Polyunsaturated fatty 1122 acids in male ruminant reproduction—a review. Asian-Australasian journal of animal 1123 sciences, 30(5), 622. https://doi.org/10.5713/ajas.15.1034 1124 1125 Varona, L., García-Cortés, L. A., & Pérez-Enciso, M. (2001). Bayes factors for 1126 detection of quantitative trait loci. Genetics Selection Evolution, 33, 133-152. 1127 https://doi.org/10.1051/gse:2001113 1128 1129 Wagner, K., Mincheva, A., Korn, B., Lichter, P., & Pöpperl, H. (2001). Pbx4, a new Author Manuscript 1130 Pbx family member on mouse chromosome 8, is expressed during spermatogenesis. 1131 Mechanisms of development, 103, 127-131. https://doi.org/10.1016/S0925- 1132 4773(01)00349-5 1133

This article is protected by copyright. All rights reserved 1134 Wathes, D. C., Clempson, A. M., & Pollott, G. E. (2013). Associations between lipid 1135 metabolism and fertility in the dairy cow. Reproduction, Fertility and 1136 Development, 25(1), 48-61. https://doi.org/10.1071/RD12272 1137 1138 Weber, M., Locher, L., Huber, K., Kenéz, Á., Rehage, J., Tienken, R., … Mielenz, M. 1139 (2016). Longitudinal changes in adipose tissue of dairy cows from late pregnancy to 1140 lactation. Part 1: The adipokines apelin and resistin and their relationship to receptors 1141 linked with lipolysis. Journal of dairy science, 99, 1549-1559. 1142 https://doi.org/10.3168/jds.2015-10131 1143 1144 Wilson, K. R., Liu, H., Healey, G., Vuong, V., Ishido, S., Herold, M. J., … Mintern, J. 1145 D. (2018). MARCH1-mediated ubiquitination of MHC II impacts the MHC I antigen 1146 presentation pathway. PloS one, 13, e0200540. 1147 https://doi.org/10.1371/journal.pone.0200540 1148 1149 Xu, Q., Hamada, T., Kiyama, R., Sakuma, Y., & Wada-Kiyama, Y. (2008). Site-specific 1150 regulation of gene expression by estrogen in the hypothalamus of adult female rats. 1151 Neuroscience letters, 436, 35-39. https://doi.org/10.1016/j.neulet.2008.02.054 1152 1153 Xu, W. H., Li, Z. C., Ouyang, Z. P., Yu, B., Shi, J. S., Liu, D. W., & Wu, Z. F. (2015). 1154 RNA-Seq transcriptome analysis of porcine cloned and in vitro fertilized blastocysts. 1155 Journal of Integrative Agriculture, 14, 926-938. https://doi.org/10.1016/S2095- 1156 3119(14)60866-2 1157 1158 Xu, Y., Hutchison, S. M., Hernández-Ledezma, J. J., & Bogan, R. L. (2017). Increased 1159 27-hydroxycholesterol production during luteolysis may mediate the progressive 1160 decline in progesterone secretion. MHR: Basic science of reproductive medicine, 24, 2- 1161 13. https://doi.org/10.1093/molehr/gax061 1162 1163 Yamaguchi, R., Yamagata, K., Ikawa, M., Moss, S. B., & Okabe, M. (2006). Aberrant Author Manuscript 1164 distribution of ADAM3 in sperm from both angiotensin-converting enzyme (Ace)-and 1165 calmegin (Clgn)-deficient mice. Biology of reproduction, 75, 760-766. 1166 https://doi.org/10.1095/biolreprod.106.052977 1167

This article is protected by copyright. All rights reserved 1168 Yang, Y., Cheng, L., Wang, Y., Han, Y., Liu, J., Deng, X., & Chao, L. (2017). 1169 Expression of NDUFA13 in asthenozoospermia and possible pathogenesis. 1170 Reproductive biomedicine online, 34, 66-74. https://doi.org/10.1016/j.rbmo.2016.10.001 1171 1172 Yik, J. H., Chen, R., Pezda, A. C., & Zhou, Q. (2005). Compensatory contributions of 1173 HEXIM1 and HEXIM2 in maintaining the balance of active and inactive P-TEFb 1174 complexes for control of transcription. Journal of Biological Chemistry, 280, 16368- 1175 16376. https://doi.org/10.1074/jbc.M500912200 1176 1177 Zhang, Z., Ersoz, E., Lai, C. Q., Todhunter, R. J., Tiwari, H. K., Gore, M. A., … 1178 Buckler, E. S. (2010). Mixed linear model approach adapted for genome-wide 1179 association studies. Nature genetics, 42, 355-360. https://doi.org/10.1038/ng.546 1180 1181 Ziegler, A., Santos, P. S. C., Kellermann, T., & Uchanska-Ziegler, B. (2010). 1182 Self/nonself perception, reproduction, and the extended MHC. Self/nonself, 1(3), 176- 1183 191. https://doi.org/10.4161/self.1.3.12736 1184 1185 TABLES 1186 Table 1. Summary statistics of the traits early pregnancy (EP) and scrotal circumference 1187 (SC) in Nellore cattle, and age at first corpus luteum (AGECL), first postpartum 1188 anoestrus interval (PPAI), ability to ovulate prior to weaning the calf (PW), and SC

1189 measured at 12, 18 and 24 months of age (SC12, SC18, SC24) in Brahman cattle. Number of Trait Mean ± SE observations EP (%) 1,849 28.3 ± 1.10 SC (cm) 4,248 26.57 ± 2.56 AGECL (days) 1,007 750.6 ± 142.14 PPAI (days) 629 180.11 ± 108.71 PW (%) 629 52.78 ± 2.00

SC12 (cm) 1,098 21.40 ± 2.41 Author Manuscript

SC18 (cm) 1,203 26.51 ± 2.76

SC24 (cm) 1,098 29.89 ± 2.86 1190 1191 Table 2. Validated top SNP for EP in G1 and G2 (P-value ≤ 10-3).

This article is protected by copyright. All rights reserved SNP BTA† Position† (Mb) P-value Gene‡ (symbol§)/ SNP¶ G1 BovineHD0200022985 2 79.97 8.7x10-4 ENSBTAG00000007867 (STAT1) ENSBTAG0000001125 (MYO1B) BovineHD0300002093 3 6.66 1.1x10-4 ENSBTAG00000005976 (HSD17B7) BovineHD0400028801 4 102.42 3.9x10-4 ENSBTAG00000015802 (CREB3L2-201) BovineHD0500031072 5 107.89 3.6x10-4 ENSBTAG00000038415 (SLC6A12) ENSBTAG00000014525 (SLC6A13) ENSBTAG00000020472 (KDM5A) BovineHD0600034574 6 1.74 5.1x10-5 ENSBTAG00000044082 (MARCH1)/ BovineHD0600000418 BovineHD0700001070 7 3.77 1.6x10-4 ENSBTAG00000009975 (PBX4) ENSBTAG00000007812 (NDUFA13) BovineHD0800009146 8 30.10 4.3x10-4 ENSBTAG00000027442 (NFIB) BovineHD1100024980 11 86.98 5.6x10-4 ENSBTAG00000004259 (HPCAL1) BovineHD1300022820 13 78.85 2.0x10-4 ENSBTAG00000014554 (SNAI1) ENSBTAG00000010135 Author Manuscript (TMEM189) BovineHD1400002554 14 9.18 9.6x10-5 ENSBTAG00000001156 (ST3GAL1)

This article is protected by copyright. All rights reserved ENSBTAG00000007823 (TG) BovineHD1500007541 15 28.10 1.8x10-4 ENSBTAG00000002650 (ZPR1) ENSBTAG00000019770 (APOA4) ENSBTAG00000012398 (APOC3) ENSBTAG00000007196 (TAGLN) ENSBTAG00000019764 (APOA5) BovineHD1600013840 16 49.84 6.5x10-5 ENSBTAG00000021576 (LMOD1) BovineHD1700005155 17 17.82 7.9x10-5 ENSBTAG00000001580 (CLGN) BovineHD1800016044 18 54.75 1.2x10-4 ENSBTAG00000013084 (NAPA) BovineHD1900013552 19 48.75 7.0x10-4 ENSBTAG00000000056 (STRADA) BovineHD2100000944 21 5.30 1.3x10-5 ENSBTAG00000007357 (CHSY1)/ BovineHD2100001004 BovineHD2200000079 22 0.37 7.8x10-5 ENSBTAG00000014547 (PGAM2) ENSBTAG00000012073 (VOPP1) BovineHD2300007469 23 27.26 4.9x10-5 ENSBTAG00000008794 (ATF6B) ENSBTAG00000006864 Author Manuscript ENSBTAG00000037533 (C4A) ENSBTAG00000007450

This article is protected by copyright. All rights reserved (C2)

BovineHD2400002342 24 8.37 3.8x10-4 ENSBTAG00000006726 (CCDC102B)/ BovineHD2400002368 BovineHD2700005243 27 18.18 8.8x10-4 ENSBTAG00000011257 (ASAH1) G2 BovineHD0800012292 8 4.12 4.0x10-4 BovineHD0800012229 BovineHD1400004742 14 16.71 2.6x10-4 BTA-122375-no-rs BovineHD1500008343 15 31.06 1.1x10-4 BovineHD1500008365 BovineHD1700009553 17 34.48 2.7x10-4 BovineHD1700009509 1192 †SNP were mapped to the UMD3.1 Bovine Assembly. 1193 ‡Gene = Important genes for Brahman reproductive traits (interval of ± 250 Kb). 1194 §Gene symbols were researched at Ensembl website (Cow UMD3.1). 1195 ¶SNP = Closest significant SNP of the Brahman meta-analysis (± 250 Kb). 1196 SNP in bold were in common between G1 and G2. 1197 1198 Table 3. Candidate top SNP per chromosome for EP (P-value ≤ 10-3). SNP BTA† Position (Mb) † P-value Gene‡ (symbol) §/ SNP¶ BovineHD0600034574 6 1.74 5.1x10-5 ENSBTAG00000044082 (MARCH1)/ BovineHD0600000418 BovineHD0800009146 8 30.10 4.3x10-4 ENSBTAG00000027442 (NFIB)/ BovineHD0800009169 BovineHD1400004742 14 16.71 2.6x10-4 ENSBTAG00000009394 (NSMCE2) BTA-122375-no-rs BovineHD1500008850 15 32.60 2.8x10-4 ENSBTAG00000019246(

Author Manuscript SC5D)/ BovineHD1500008844

This article is protected by copyright. All rights reserved BovineHD2100000944 21 5.30 1.3x10-5 ENSBTAG00000007357 (CHSY1)/ BovineHD2100001004 BovineHD2400002342 24 8.37 3.8x10-4 ENSBTAG00000006726 (CCDC102B)/ BovineHD2400002368 1199 †SNP were mapped to the UMD3.1 Bovine Assembly. 1200 ‡Gene = Closest gene, important for Brahman reproductive traits (± 250 Kb). 1201 §Gene symbols were researched at Ensembl website (Cow UMD3.1). 1202 ¶SNP = Closest significant SNP of the Brahman meta-analysis (± 250 Kb). 1203 1204 Table 4. Validated top SNP for SC in G1 and G2 (P-value ≤ 10-3). SNP BTA† Position† (Mb) P-value Gene‡ (symbol§)/ SNP¶ G1 BovineHD1100003302 11 9.35 2.4x10-4 ENSBTAG00000012332 (C2orf49) BovineHD1200006565 12 21.81 2.3x10-4 ENSBTAG00000020612 (NEK3) ENSBTAG00000011647 (SLC25A15) ENSBTAG00000044105 (FOXO1) BovineHD1500007531 15 28.06 1.2x10-4 ENSBTAG00000002650 (ZPR1) ENSBTAG00000019770 (APOA4) ENSBTAG00000012398 (APOC3) ENSBTAG00000019764 (APOA5) Author Manuscript BovineHD1700015592 17 54.94 4.3x10-5 ENSBTAG00000000908 (HCAR1) BovineHD1900012747 19 45.37 3.4x10-6 ENSBTAG00000013534

This article is protected by copyright. All rights reserved (GFAP) ENSBTAG00000006056 (HEXIM1) ENSBTAG00000024974 (HEXIM2) BovineHD2300015274 23 15.69 7.3x10-5 ENSBTAG00000012384 (TFEB) ENSBTAG00000008977 (TOMM6) ENSBTAG00000010100 (MED20) ENSBTAG00000010106 (CCND3) ENSBTAG00000011339 (TAF8) ENSBTAG00000015301 (MRPS10) BovineHD2900002704 29 9.16 8.8x10-5 ENSBTAG00000019995 (HIKESHI) ENSBTAG00000007847 (EED) G2 BovineHD1600000479 16 1.86 1.8x10-4 BovineHD1600000503 1205 †SNP were mapped to the UMD3.1 Bovine Assembly. 1206 ‡Gene = Important genes for Brahman reproductive traits (± 250 Kb). 1207 §Gene symbols were researched at Ensembl website (Cow UMD3.1). 1208 ¶SNP = Closest significant SNP of the Brahman meta-analysis (± 250 Kb). 1209 1210 FIGURE LEGENDS 1211 Figure 1. Scheme of across breed SNP validation for Brahman and Nellore populations. Author Manuscript 1212 EP = Early pregnancy and SC = Scrotal circumference, both measured in Nellore, G1 = 1213 Significant SNP in Nellore in genic regions important for Brahman, G2 = Significant 1214 SNP in Nellore close to significant SNP from Brahman meta-analyses.

This article is protected by copyright. All rights reserved jbg_12429_f1.tiff Author Manuscript

This article is protected by copyright. All rights reserved