<<

bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 1

1

2

3

4 The giant pouched (Cricetomys ansorgei) olfactory

5 receptor repertoire

6

7 Angela R. Freeman1*, Alexander G. Ophir 1, Michael J. Sheehan2

8

9 1 Department of , Cornell University, Ithaca, NY, USA

10 2 Department of Neurobiology and Behavior, Cornell University, Ithaca, NY, USA

11

12 *Corresponding author

13 E-mail: [email protected] (AF)

14

15

16

17 bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 2

18 Abstract

19 For , olfaction is essential for locating food, recognizing mates and competitors,

20 avoiding predators, and navigating their environment. It is thought that rodents may have

21 expanded olfactory receptor repertoires in to specialize in olfactory behavior. Despite

22 being the largest of and depending on olfaction relatively little work has

23 documented olfactory repertoires outside of the conventional laboratory mice and . Here we

24 report the olfactory receptor repertoire of the African giant (Cricetomys ansorgei), a

25 Muroid distantly related to mice and rats. The African is notable for its

26 large cortex and olfactory bulbs relative to its body size compared to other sympatric rodents,

27 which suggests anatomical elaboration of olfactory capabilities. We hypothesized that in addition

28 to anatomical elaboration for olfaction, these pouched rats might also have an expanded olfactory

29 receptor repertoire to enable their olfactory behavior. We examined the composition of the

30 olfactory receptor repertoire to better understand how their sensory capabilities have evolved.

31 We identified 1145 functional olfactory genes, and 260 additional pseudogenes within 301

32 from the African giant pouched rat . This repertoire is similar to mice and

33 rats in terms of size, pseudogene percentage and number of subfamilies. Analyses of olfactory

34 receptor gene trees revealed that the pouched rat has 6 expansions in different subfamilies

35 compared to mice, rats and . We identified 99 orthologous genes conserved among 4

36 rodent and an additional 167 conserved genes within the Muroid rodents. The

37 orthologous genes shared within suggests that there may be a conserved Muroid-

38 specific olfactory receptor repertoire. We also note that the description of this repertoire can

39 serve as a complement to other studies of rodent olfaction, as the pouched rat is an outgroup bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 3

40 within Muroidea. Thus, our data suggest that African giant pouched rats are capable of both

41 natural and trained olfactory behaviors with a typical Muriod olfactory receptor repertoire.

42 Keywords

43 Olfactory receptor – olfaction – rodent orthology – functional expansion

44 Introduction

45 In rodents, olfaction is essential for a number of behaviors including social recognition

46 (1), sexual behavior (2), predator detection (3), and finding food (4). The mechanism for

47 olfactory perception in rodents uses two systems, the main olfactory system and the vomeronasal

48 system, the former of which contains the olfactory receptors (ORs) which transduce odorants

49 into a perceived ‘smell’ (5). The repertoire of these ORs has been of particular interest, as the

50 OR gene is one of the largest in the genome, and the overall number of functional

51 receptor genes is thought to correlate with olfactory perceptual ability (6). In particular,

52 expansion and contraction of the OR repertoire (i.e. functional OR genes) among species is

53 thought to correlate with changing environmental niche – e.g., that use vision primarily

54 for navigation and recognition tend to have more pseudogenes compared to functional ORs (7–

55 9). Furthermore, certain families of ORs are associated with diet and niche, with larger

56 repertoires generally associated with asocial, terrestrial, and diurnal species [(10,11) but see

57 (12)].

58 ORs are G-protein coupled receptors that are located in olfactory sensory neurons (OSN)

59 of the nasal epithelium, where they bind odorants to transmit information to the (13).

60 Which ORs bind which odorants is still largely unknown, as there are a number of technical

61 limitations for high-throughput screening methods for ligand identification (14,15). Despite these bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 4

62 limitations, research shows that related ORs within groups (and orthologous OR genes

63 among taxa) tend to bind structurally similar ligands (16), though the selectiveness and level of

64 response by individual OSNs may vary (9,17–21). When an odorant binds to the OR in the OSN,

65 a signal travels down that neuron’s . The axon projects onto glomeruli in the olfactory bulb,

66 which allows for the signal to be transmitted and interpreted. Although OSNs only contain one

67 type of OR, the OR can bind a range of odorants causing this olfactory information to be coded

68 using a combinatorial method (21). For example, a molecule might activate OSNs only in

69 specific changing concentrations, or in mixtures (or alone) (5,22). This elegant system of many

70 concurrently activated or inactivated ORs in combination is what allows animals to perceive

71 their olfactory-world and respond accordingly.

72 To understand how may have influenced both the number and distribution of

73 ORs, several studies have been conducted on the OR repertoire across species. Since the first

74 description of ORs in mice by Axel and Buck (1991) (13), OR genes in mammals have been

75 more extensively described in mice ( musculus) (23), but also in swine (Sus scrofa) (24),

76 (Homo sapiens) (17), (Canis familiarisi) (25,26), cattle (Bos taurus) (27), non-

77 (8,12), guinea pig (Cavia porcellus), rabbit (Oryctolagus cuniculus), horse

78 (Equus caballus), and elephant (Loxodonta africana) (28), among other mammals (11).

79 By investigating the overall diversity of OR gene repertoires, we can further elucidate

80 how this large receptor family has evolved and whether these repertoires have adapted over time

81 for certain environments and behavior specializations. Certainly, rodents are expected to have

82 large OR repertoires, because much of their behavior relies on olfaction. However, OR

83 repertoires of rodents still vary; guinea pigs are reported to have fewer than 1000 OR genes

84 (functional and pseudogenes), while other mammalian species have olfactory repertoires twice bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 5

85 that size (11). In order to expand our understanding of olfaction and evolution within mammals,

86 we have examined the African giant pouched rat (Cricetomys ansorgei) olfactory genome.

87 The African giant pouched rat (colloquially, ‘Pouchie’) is a large, mostly nocturnal

88 rodent native to the savannah of (29). Cricetomys are within which shared a

89 common ancestor with rodents (i.e. Mus and ) approximately 18-20 MYA (Fig 1)

90 (30,31). The name ‘pouched rat’ is a misnomer given that it is only a distant relative of the

91 conventional Rattus. Furthermore, the term ‘African giant pouched rat’ can refer to any of the 6

92 species of Cricetomys (C. ansorgei, C. emini, C. gambianus, C. kivuensis, and two unnamed

93 species) (32). The Cricetomys is widely distributed across sub-Saharan Africa, and

94 different species tend to associate in different environmental niches (32). A uniting feature of

95 this genus and other members of the subfamily Cricetomyinae (i.e. and )

96 (33) are the gerbil-like cheek pouches where Cricetomys store food during foraging. However,

97 their physical resemblance to Rattus is most likely a result of convergent evolution from distinct

98 ancestors. Cricetomys species have notably large olfactory bulbs and neocortex for their size -

99 even when compared to other rodents (29,34). The olfactory bulbs of the pouched rat comprise

100 19% of the total brain length, while the greater (Thryonomys swinderianus) have

101 olfactory bulbs of only 9% of the brain, despite occupying similar regions in Africa (34,35).

102 Furthermore, the neocortex for the pouched rat accounts for 75% of the

103 (compared to 50% in mice) (35), and the neocortex ratio places the pouched rat in the range of

104 primates (29). The anatomical features of the olfactory system can predict the size of the OR

105 repertoire (36,37), thus, we predicted that the pouched rat may also have an expanded OR

106 repertoire.

107 bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 6

108 Fig 1. Phylogeny of species compared in this study. The figure shows a simplified phylogeny

109 of mice (Mus sp.), rats (Rattus sp.), pouched rats (Cricetomys sp.), and squirrels (Ictidomys sp.).

110 Pouched rats are Nesomyid rodents that are distantly related to Murids. Silhouettes show

111 approximate size differences among species. Time periods are listed along the x-axis. MYA:

112 million years ago, Plioc: Pliocene, IV: Quaternary period.

113

114 In addition to these anatomical features, recent behavioral research emphasizes the

115 pouched rats’ olfactory ability. Pouched rat males can use olfactory cues to discriminate between

116 reproductively available and unavailable females, and similarly, females exhibit preferences for

117 ‘competitive’ males based on olfactory cues (38). Furthermore, males can perceive olfactory

118 cues from other males’ scent marks which they countermark in unfamiliar contexts (39).

119 Pouched rats are even used as ‘biodetectors’ and can be trained to detect and TNT

120 via olfaction (40,41). Given their anatomical for olfaction, their use of olfactory cues

121 in potential competitor and mate assessment, and their use as olfactory biodetectors, we assessed

122 the olfactory repertoire of C. ansorgei to determine how the pouched rat OR repertoire compares

123 to other rodents. We hypothesized that in addition to anatomical elaboration of the olfactory

124 system, pouched rats would have an expanded OR repertoire which enables their olfactory

125 behavior.

126 Most rodents that have been previously studied for their OR repertoire, or used for

127 comparative work, are limited to lab models (i.e. Guinea pig, rat (Rattus norvegicus), )

128 (26,28), though recently some others have been described (11). Thus, our aim was to describe the

129 olfactory receptor repertoire of C. ansorgei as a representative of Cricetomys, which would bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 7

130 expand the diversity within described Rodentia olfactory receptor gene repertoires. Here we

131 describe ORs in this species and compare their distribution and number to other rodents (Fig 1).

132 Methods

133 We downloaded the publicly available draft assembly of the pouched rat genome (NCBI

134 ID 75238 (CriGam_v1_BIUU). This genome contains 1,102,952 scaffolds, and 1,110,425

135 contigs. The scaffold N50 and L50 values are 110,049 and 5,340, while the contig N50 and L50

136 are 80,761 and 7,273, respectively. The genome is reported to have 36.7x coverage and a GC%

137 of 42.1%.

138 Detection of OR genes in the pouched rat genome

139 We created a database of ORs from human (H. sapiens), mouse (M. musculus), rat (R.

140 norvegicus), and (Ictidomys tridecemlineatus) from Ensembl BioMart (42), using the

141 search term ‘olfactory receptor’ and the most recent genome versions. We then performed a

142 translated basic local alignment search (TBLASTN) to identify potential ORs within the pouched

143 rat genome. We included all matches, which had e-values of 0, and/or the match started at 1,

144 protein alignment lengths of > 133 amino acids, bit scores of > 224, and gaps of 40 or fewer

145 amino acids. These parameters set a threshold; below these parameters matches declined quickly

146 in similarity. For mouse matches, this reduced the number of total results (1,148,779) to 1213,

147 and for rat matches reduced the number of total results (1,219,637) to 1219. For human and

148 squirrel matches, all the matches were above the threshold and included 903 and 35 sequences,

149 respectively. We used the Galaxy web platform to conduct a multiple alignment using fast

150 fourier transform (MAFFT) to locate and remove duplicate sequences and combine overlapping

151 sequences (43). This resulted in 2,399 sequences which coded for potential pouched rat ORs. We bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 8

152 then extended the ends of sequences by 100 bp to ensure that we would obtain the full coding

153 region, and excluded any genes of less than 750 bp, because most functional OR genes are >250

154 amino acids (44), which resulted in 1409 sequences. We obtained all the open reading frames

155 (via EMBOSS) (45) within all the sequences (~23k; 84% were shorter than 50 amino acids in

156 length) and filtered out those shorter than 250 amino acids (44). This left 1,149 functional coding

157 genes. Using these putative functional ORs, we generated a multiple sequence alignment (via

158 MAFFT) and produced a using FastTree to examine any outliers. FastTree was

159 chosen at this step instead of RAxML for processing efficiency (46). We used BLAST on any

160 outliers (long branches separate from other groups) in the tree, to verify they encoded olfactory

161 receptors, which led to the removal of 4 sequences which were most similar in terms of sequence

162 similarity to other known G-coupled receptors. Thus, out of the 1405 potentially functional OR

163 receptors, 18.5% were labeled ‘pseudogenes’ that likely did not produce a functional receptor

164 product (but see (47)), either due to truncation, deletions or frameshift . Using these

165 ‘manual similarity-pipeline’ search and selection methods, we found 1145 putative functional

166 OR receptor genes.

167 In addition to the above pipeline, we also used the Olfactory Receptor Assigner (ORA)

168 module to detect potential OR genes and pseudogenes in the entire pouched rat genome (11).

169 This method identified a similar number of functional OR genes, detailed methods are available

170 in the supplement (S1 File). The ORA module detected 1060 unique putative functional OR

171 receptor genes from the pouched rat genome.

172 Classification based on phylogeny bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 9

173 We produced a phylogenetic tree using the 1145 putative functional pouched rat OR

174 sequences located during the manual pipeline search using MAFFT (L-INS-I method) for

175 sequence alignment and RAxML to produce an unrooted tree with 1000 bootstraps using the

176 PROTCATJTTF option. We then used the method described by Zhang and Firestein (2002), to

177 determine family categorization, by > 50% bootstrap support and 40% protein identity.

178 Subfamilies were defined as >50% bootstrap support and 60% protein identity (48). Protein

179 identity was determined through a BLASTP pairwise-comparison. To determine , high

180 bootstrap support is typically used (44), the pouched rat olfactory repertoire separated into two

181 clades with 76 and 71% bootstrap support.

182

183 Comparative rodent OR phylogenetic tree

184 We used a MAFFT alignment (L-INS-I method) and RaxML to produce an unrooted tree

185 with 500 bootstraps using the PROTCATJTTF option to compare the 1145 OR protein

186 sequences of mouse, rat, squirrel, and pouched rat. This comparative rodent OR phylogenetic

187 tree was used for examination of expansions and orthology.

188 Expansions within gene families

189 We used the rodent OR phylogenetic tree to identify pouched rat-specific expansions. We

190 compared pouched rat functional gene distributions (number of genes in the subfamily) and

191 compared this to the most similar mouse, rat, and squirrel genes. For each noted expansion, we

192 located the most closely related mouse OR gene and identified its subfamily group to determine

193 if there were potential identified ligands within the mouse OR subfamily. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 10

194 Orthologous ORs

195 Using the rodent tree created from protein sequences, we examined the branching to

196 identify orthologous ORs among pouched rats and squirrel, mouse, and rat. We enumerated

197 instances where two paralogs (from any of the rodent species) were indicated in the tree. For

198 larger numbers of paralogs (i.e. larger expansions), we did not include these in paralog or

199 ortholog enumeration. The structure of branching for each ortholog group was recorded and

200 compared to shared sequence identity. We then calculated the shared sequence identity for each

201 ortholog compared to the pouched rat for each group.

202

203 Clustering within the genome

204 We analyzed the relative positions of the putative functional pouched rat ORs obtained

205 from the manual similarity-pipeline, by grouping gene clusters according to positional proximity.

206 The pouched rat genome was parsed into a custom BLAST database (using makeblastdb (49)),

207 which allowed for a conservative estimate of physical clustering – as sequences within the same

208 parsed cluster of the genome would be physically clustered in a chromosome. OR genes were

209 considered ‘clustered’ when they were located within the same sequence identifier.

210

211 Results

212 We identified 1145 functional pouched rat ORs from the pouched rat genome using our

213 manual pipeline method, which is similarly-sized compared to repertoires of mouse and rat (S2-

214 S3 files). The manual pipeline method and ORA method described similar numbers of ORs, and bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 11

215 details of the ORA method and results are available in the supplement (S1 File). We identified

216 260 pseudogenes with the manual pipeline method.

217 The pouched rat has similar numbers of clades, pseudogenes, or truncated genes to mice

218 and rats, and has a relatively similar number of subfamilies to most other species using the

219 manual pipeline search method (Table 1, Fig 2, S4 file).

220

221 Fig 2. Family-group representation in the pouched rat olfactory receptor gene tree.

222 Different colors at nodes indicate different families. All families share > 50% bootstrap support

223 and 40% protein identity.

224

225 Table 1. Distribution of pouched rat olfactory receptor genes among subfamilies using a

226 manual pipeline search method. (48)

Number of sequences in Number of instances

subfamily

1 140

2 48

3 25

4 24

5 10

6 11

7 4 bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 12

8 6

9 5

10 5

11 3

13 1

14 2

15 5

16 2

18 2

19 1

22 1

24 1

26 1

31 2

32 1

49 1

227

228 The pouched rat has similar numbers of functional and pseudogenes (Fig 3) compared to

229 other rodents, and similar numbers of subfamilies to mice (Table 2). Ranges within table 3

230 indicate variation in reported values due to variation in search strategies.

231

232 Fig 3. Graph comparing functional and pseudogene numbers among select species. Black

233 bars indicate functional genes and grey bars indicate pseudogenes. Pouched rats, rats, mice, bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 13

234 squirrels, dogs, cattle, and pigs have similar numbers of reported olfactory receptor genes.

235 Squirrels have large numbers of pseudogenes.

236

237 Table 2. Families, subfamilies and functional olfactory receptor genes within selected

238 species.

Species Number Number of Number of Number of

of families subfamilies functional genes pseudogenes

(%)

African giant 87 301 1145 (82) 260

pouched rat

(C. ansorgei), this

paper

Squirrel (11) 16 nd 830 (32) 1763

Rat (6,26,27) 21 282 1201-1217 (71-80) 292

Mouse (23,27,48) 228 241 913-1037 (75-80) 354

Human (25,27,50,51) 17-61 285-300 388-390 (48) 414-465

Dog (6,25–27) 23 300 811-875 (75-82) 222-278

Pig (27) 17 349 1113 (86) 188

Cattle (27) 18 272 881 (82) 190

Zebrafish (27,52) 8 40 102-143 (74-86) 17-35

Chicken (53,54) nd nd 82-368 (15-66) 111-476

Frog (27) nd nd 410 (46) 478

239 nd: no data available bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 14

240

241 To compare pouched rat subfamilies with other species, we produced a gene tree using

242 the sequences identified during manual pipeline search (Fig 4; S5 File), which illustrates that for

243 most mouse, rat, and squirrel ORs subfamilies, there is at least one pouched rat OR (A tree using

244 sequences from the ORA was visually similar in structure, and is provided in the supplement: S6

245 File). Of particular interest is the number and distribution of lineage specific gene family

246 expansions. We can identify potential expansions where there are relatively more pouched rat

247 ORs and fewer ORs from the other rodents within a family or subfamily. We have indicated

248 these expansions, and their associated family groups in Fig 4 for reference. The first expansion

249 includes ORs from pouched rat subfamily 329 (S7 File); the largest pouched rat subfamily. It is

250 associated with mouse OR subfamily 150 (Human equivalent Family 1, Subfamily E) (23).

251 There are 6 ORs in the associated mouse subfamily, but 35 pouched rat ORs (23) (Table 4). The

252 second expansion includes ORs from the pouched rat subfamily 312. This expansion was

253 associated with mouse subfamily 137. The third and fourth expansions include ORs from

254 pouched rat subfamily 321 and 317, respectively. Pouched rat subfamily 317 is the second

255 largest pouched rat subfamily. The third expansion is associated with mouse subfamily 135

256 (Human equivalent Family 7 Subfamily E) and the fourth expansion is associated with mouse

257 subfamily 136 (Human equivalent Family 7 Subfamily E). A fifth expansion includes pouched

258 rat subfamily 12, which is related closely to mouse olfr183 and mouse subfamily 89, and also rat

259 olr1555. The last noted expansion is associated with mouse subfamily 71. The distribution of

260 ORs by subfamily size, as defined by (48) is shown in Table 2 and the composition of

261 expansions are shown in Table 4 (comparative results from the ORA are shown in the Table in

262 S1 Table). bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 15

263 Table 4. Expanded subfamilies of pouched rat olfactory receptors (ORs) and associated

264 mouse subfamilies

Expansion Pouched Pouched rat Associated Number of Number of

rat OR OR mouse Pouched rat ORs mouse ORs

family subfamily subfamily

1 101 329 150 35 6

2 101 312 137 12 4

3 101 321 135 15 1

4 101 317 136 33 5

5 5 12 89 10 1

6 72 219 71 7 2

265

266

267 Fig 4. Comparative unrooted gene tree of olfactory receptor sequences for pouched rat,

268 mouse, rat, and squirrel. Expansions are indicated with group identifiers and arrows.

269

270 Orthologous genes

271 We found that many pouched rat genes were orthologous to mouse, rat, and squirrel OR

272 genes, using the tree in Fig 2. 99 of the 1145 putative olfactory receptor genes shared a 1-to-1

273 orthology with rat, mouse, and squirrel. Using the percent similarity-method for defining

274 families, 10 of the 99 orthologs came from family 38 (ORA equivalent family: OR 51) and 11 bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 16

275 from family 101 (ORA equivalent families: OR 1 and OR 7). This further depicts an unequal

276 distribution of orthologs across all of the 87 families defined using this method.

277

278

279

280 The average similarities of the associated pouched rat gene to these orthologous genes

281 were 92.4% for mouse, 91.9% for rat, and 86.1% for squirrel. However, in other locations we

282 observed potentially paralogous ORs, and there were multiple pouched rat ORs for one rat,

283 mouse, or squirrel OR. For example, Pouchie948 associated with rat olr1356, mouse olfr15, and

284 OR2C1 (squirrel) (Fig 5a). We found a total of 33 locations on the gene tree where either the

285 pouched rat, mouse, rat, or squirrel had two paralogous genes. For 1-to-1 orthology of mouse,

286 rat, squirrel, and pouched rat, the most typical pattern (75/99) observed included the mouse and

287 rat being most closely aligned, followed by the pouched rat gene, then the squirrel OR gene (Fig

288 5b).

289 In 33 instances, one or more of the species had a paralogous gene. These paralogous

290 groupings were also very similar, with the pouched rat genes sharing an average of 91.9%

291 sequence similarity to mouse, 91.0% to rat, and 86.2% to squirrel. In 8 cases, there were two

292 paralogous pouched rat OR genes; which had an average pairwise sequence similarity of 97.1%,

293 suggesting a recent duplication of these genes within the pouched rat lineage

294 166 of 1145 pouched rat genes had 1-to-1 orthology with mouse and rat; all three species

295 are in the Muroidea superfamily. 146 of these orthologies followed the predicted pattern of

296 mouse and rat being closely aligned, followed by the pouched rat gene. Comparatively, there bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 17

297 were an additional 44 mouse-pouched rat orthologous genes, 19 rat-pouched rat orthologous

298 genes, and 4 squirrel-pouched rat orthologous genes (Fig 6). There were 14 cases of squirrel-

299 mouse-rat orthologous genes. These latter patterns are consistent with differential loss of genes

300 across rodent lineages.

301

302 Fig 5. Examples of orthologous olfactory receptor (OR) gene sequence among rodents.

303 Numbers at branch points indicate bootstrap values. A) 1-to-1 orthology between pouched rat,

304 mouse, rat, and squirrel (Ictidomys tridecemilineatus) B) Example of pouched rat paralog within

305 orthologous gene grouping

306 Fig 6. Relative proportions of one-to-one orthology of pouched rat olfactory receptor genes

307 and squirrel, mouse, and rat. Pouched rats have the largest number of orthologous genes

308 shared with mice and rats. The number of orthologous groupings with squirrels are consistently

309 smaller than orthologies with mice or rats, due to phylogenetic placement.

310

311

312 Clustering of ORs

313 We examined physical clustering of the ORs based on the locations of the sequences

314 within the gene scaffolds. We found that there were 173 clusters of ORs comprising 644 of the

315 1145 ORs. All but 50 of these clusters contained ORs that were clustered completely within a

316 family group. The largest clusters contained 12 and 13 sequences (Table 5).

317 Table 5. Distribution of pouched rat olfactory receptors in location clusters bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 18

Number of Number of

sequences in instances

cluster

2 70

3 40

4 22

5 14

6 8

7 5

8 4

9 2

10 3

12 2

13 3

318

319 We examined where these large clusters were located to determine if the expansions were

320 due to potential duplication events in the genome. 20 of these clusters were located either

321 entirely or partially within these large expansions (Table 6). All of the expansions contained at

322 least one cluster of ORs, supporting the hypothesis that expansions may be partially due to

323 tandem duplications of OR genes.

324 Table 6. Pouched rat olfactory receptor (OR) family expansions and numbers of clustered

325 OR sequences within each expansion bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 19

Expansion Family Clusters present (number of sequences represented)

1 101 116 (3 of 4), 154 (4 of 5), 87 (3), 108 (3), 85 (4), 41 (2)

2 101 72 (2), 168 (3 of 7), 139 (3 of 5)

3 101 166 (2 of 7), 111 (1 of 2), 147 (3 of 4), 141 (3 of 12), 16 (4 of 9)

4 101 141 (9 of 12), 151 (1 of 3), 3 (6), 166 (5 of 7), 147 (1 of 4), 16 (5 of

9), 111 (1 of 2), 112 (2)

5 5 52 (2)

6 72 92 (1 of 2)

326

327

328 Discussion

329 The olfactory receptor repertoire of the pouched rat is similar to rat and mouse olfactory

330 receptor repertoires but has several lineage-specific expansions. We have shown that these

331 expansions are partially attributable to physically clustered OR genes in the genome, suggesting

332 a role of tandemly arrayed gene duplications in generating additional OR diversity, as has been

333 seen other species (52,55,56). Pouched rats were similar in olfactory repertoire size and

334 pseudogene composition to mice, and rats (Table 2) – which all engage in numerous olfaction-

335 mediated behaviors. Though pouched rats have increased olfactory bulb volume, compared to

336 similar rodents, pouched rats’ olfactory capabilities are mediated by a typical rodent OR

337 repertoire with lineage-specific expansions in several subfamilies.

338 We had hypothesized that the pouched rats’ olfactory capabilities were supported by

339 anatomical elaboration and an associated genetic expansion of the OR repertoire. In other bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 20

340 species, anatomical features including the olfactory bulb size, have been used to effectively

341 predict OR repertoire size (37,60). We observed some subfamily and family-specific expansion,

342 however, the overall number of functional ORs did not differ substantially from mice and rats.

343 The potential advantage of their large olfactory bulbs is unclear – whether these bulbs enable

344 enhanced scent detection for foraging (61), or are an to being nocturnal (62,63) or

345 territorial (64), or enable better sensitivity to odorants in some way is unknown. Pouched rats

346 may be more ‘specialized’ by having family-specific expansions while maintaining a similar

347 number of functional OR genes compared to mice and rats. Additional investigation of the

348 relationship between olfactory bulb size and OR repertoire in rodents is necessary to determine if

349 other rodents might have enlarged bulbs with typical OR repertoire sizes.

350

351 Expansions in the pouched rat OR gene family

352 We did not find any evidence that the size of the olfactory subgenome or the number of

353 subfamilies in pouched rat differed substantially from other rodents. The ratio of functional

354 genes to pseudogenes using the manual pipeline search was also similar. It is unclear whether

355 pouched rat pseudogenes may have a retained function (47), or have a loss of function.

356 As has been reported for other species, lineage-specific expansions tend to occur in

357 tandemly-arrayed clusters (51,52,57). Tandemly-arrayed gene expansions are indicative of a

358 series of gene duplications. We combined overlapping sequences and eliminated duplicates

359 during analysis which could influence the detection of these tandem duplications. Identical

360 sequences within different contigs were eliminated, as we interpreted these as artifacts from

361 assembly. Thus, the number of expansions was conservatively calculated, and future genome bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 21

362 assemblies will potentially reveal additional clustering in the genome. Those cases that are

363 shared among mice, rats and pouched rats indicate earlier expansions compared to the tandem

364 arrays specific to pouched rats.

365 Unfortunately we found no ligands in the literature that were associated with mouse or rat

366 ORs within the same subfamilies of pouched rat OR expansions (14,23,58). Future work on

367 discerning ligands for these ORs might reveal whether pouched rats are primed to specialize in

368 detecting specific types of odorants that are particularly salient and relevant for this species.

369

370 Receptor Conservation among Rodents and within Muroidea

371 Approximately 9% of the pouched rat olfactory genome shows 1-to-1 orthology with the

372 other rodents in this study – mice, rats and squirrels (Fig 6). Pouched rat gene sequences were

373 more similar to mouse and rat sequences than to squirrel sequences, as expected. This agrees

374 with the current phylogenetic hypothesis for rodents: rat and mouse are placed within the

375 Muridae family together, with pouched rat in a different family (Nesomyidae) but within the

376 Muroidea superfamily (Fig 1). Squirrels are in a different family (Sciuridae), whereas all four

377 species compared in this study are within Rodentia. Given that pouched rats are more closely

378 related to rats and mice than squirrels are to rats and mice, we should expect the OR phylogeny

379 to reflect this phylogenetic relationship, and it does for these orthologous genes. Further

380 supporting this assertion, an additional 14.5% of the pouched rat olfactory receptor genes have

381 orthologous mouse and rat olfactory receptor genes (Fig 6). Pouched rat orthology with only one

382 species was also observed; though this was much more common with either mice or rats

383 compared to the more distant rodent relative – the squirrel. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 22

384

385

386 Interestingly, there were 14 instances where orthology is shared between mouse, rat, and

387 squirrel – potentially pouched rats have lost functional olfactory genes here during their

388 evolutionary history. Four of these potential losses were from family OR 4 (as defined by (11)),

389 although the loss would not substantially affect the number and proportion of functional to non-

390 functional OR genes given the numbers of pseudogenes in the associated pouched rat OR

391 families. Future work investigating pseudogenes may yield additional insight into the

392 orthologous OR genes among rodents. Unfortunately, no ligands have been characterized for

393 these receptors, so we cannot speculate on how this might impact the olfactory capabilities or

394 behavior of the pouched rat.

395

396 Additions to our knowledge about olfaction and OR repertoire

397 Most rodent species with a described OR repertoire are lab models, and include mice (23)

398 and rats (26), while guinea pigs were included in a comparative study which did not focus on

399 guinea pigs specifically (11). Two other rodent species include some OR information (i.e. an

400 estimate of ORs and pseudogenes based on published genomic information), but it is unclear

401 which search terms were used, and whether the algorithm used would obtain OR families that

402 were not shared by the exemplars (i.e. subfamilies). These species included the kangaroo

403 rat (Dipodomys ordii) and the thirteen-lined ground squirrel (also included in this study) (11);

404 both of these species are native to North America. Here we have shown that the functional

405 pouched rat OR repertoire is very similar in size to those of other Muroids. The pouched rat OR bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 23

406 repertoire is also more conserved within Muroids than within Rodentia, as expected. Our

407 characterization of the orthologous genes within Muroidea suggests that there may be a set of

408 conserved Muroid-specific ORs, although additional species will need to be sequenced and

409 compared to support this hypothesis. The addition of the African giant pouched rat to described

410 rodent OR repertoires also can serve as an outgroup for future analyses given its position in the

411 Nesomyidae family, which has not been previously described.

412 We noted a pattern in our gene tree of rodent ORs where squirrel and pouched rat ORs

413 tend to branch earlier than the mouse and rat ORs. Given that mice and rats are more closely

414 related to pouched rats than to squirrels, we expected that orthologous genes might follow this

415 pattern. We assume these genes should be orthologous and bind similar odorants with similar

416 structures due to their similar protein sequences, however, it could be that small changes in these

417 genes create ORs that bind different odorants, or change perception in other ways (20,21,59).

418 Identification of these potentially conserved ORs is important for understanding how the OR

419 repertoire might respond to evolutionary pressures over time.

420 Conclusions

421 The description of the African giant pouched rat olfactory receptor repertoire adds to our

422 understanding of how these repertoires vary within Muroidea and among rodents, and how these

423 genes may have been conserved or diverged to serve different behavioral purposes. Our results

424 suggested that there could be a conserved set of ‘Muroidea’ olfactory receptor genes. Whether

425 pouched rats have enlarged olfactory bulbs to enhance their olfactory capabilities in spite of a

426 typical OR repertoire size remains an open, yet interesting question. Further investigation into

427 family-specific expansions within the pouched rat OR repertoire compared to mice and rats bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 24

428 might reveal whether these changes have evolved as a way for pouched rats to olfactorily

429 specialize within their .

430

431 Acknowledgments

432 We wish to thank Andrew Legan for his assistance on this project.

433

434

435 References

436 1. Hurst JL, Payne CE, Nevison CM, Marie AD, Humphries RE, Robertson DHL, et al.

437 Individual recognition in mice mediated by major urinary proteins. Nature. 2001 Dec

438 6;414(6864):631–4.

439 2. Keverne EB. Importance of olfactory and vomeronasal systems for male sexual function.

440 Physiol Behav. 2004 Nov 15;83(2):177–87.

441 3. Nielsen BL, Rampin O, Meunier N, Bombail V. Behavioral responses to odors from other

442 species: Introducing a complementary model of allelochemics involving vertebrates. Front

443 Neurosci. 2015;9(JUN):1–11.

444 4. Lin DY, Shea SD, Katz LC. Representation of Natural Stimuli in the Rodent Main

445 Olfactory Bulb. Neuron. 2006 Jun 15;50(6):937–49.

446 5. Firestein S. How the olfactory system makes of scents. Nature. 2001 Sep

447 13;413(6852):211–8. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 25

448 6. Niimura Y, Nei M. Extensive gains and losses of olfactory receptor genes in mammalian

449 evolution. PLoS One. 2007 Aug 8;2(8):e708.

450 7. Gilad Y, Wiebe V, Przeworski M, Lancet D, Pääbo S. Loss of Olfactory Receptor Genes

451 Coincides with the Acquisition of Full Trichromatic Vision in Primates. PLoS Biol. 2004

452 Jan 20;2(1):e5.

453 8. Rouquier S, Blancher A, Giorgi D. The olfactory receptor gene repertoire in primates and

454 mouse: Evidence for reduction of the functional fraction in primates. Proc Natl Acad Sci.

455 2000;97(6):2870–4.

456 9. Adipietro KA, Mainland JD, Matsunami H. Functional Evolution of Mammalian Odorant

457 Receptors. I, editor. PLoS Genet. 2012 Jul 12;8(7):e1002821.

458 10. Hughes GM, ESM, Finarelli JA, Murphy WJ, Higgins DG, Teeling EC. The birth

459 and death of olfactory receptor gene families in Mammalian niche adaptation. Mol Biol

460 Evol. 2018;35(6):1390–406.

461 11. Hayden S, Bekaert M, Crider TA, Mariani S, Murphy WJ, Teeling EC. Ecological

462 adaptation determines functional mammalian olfactory subgenomes. Genome Res. 2010

463 Jan 1;20(1):1–9.

464 12. Niimura Y, Matsui A, Touhara K. Acceleration of Olfactory Receptor Gene Loss in

465 Evolution: Possible Link to Anatomical Change in Sensory Systems and Dietary

466 Transition. Satta Y, editor. Mol Biol Evol. 2018 Jun 1;35(6):1437–50.

467 13. Buck L, Axel R. A novel multigene family may encode odorant receptors: a molecular

468 basis for odor recognition. Cell. 1991 Apr 5;65(1):175–87. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 26

469 14. Saito H, Chi Q, Zhuang H, Matsunami H, Mainland JD. Odor coding by a Mammalian

470 receptor repertoire. Sci Signal. 2009 Mar 3;2(60):ra9.

471 15. Peterlin Z, Firestein S, Rogers ME. The state of the art of odorant receptor

472 deorphanization: a report from the orphanage. J Gen Physiol. 2014 May 1;143(5):527–42.

473 16. Abaffy T, Matsunami H, Luetje CW. Functional analysis of a mammalian odorant

474 receptor subfamily. J Neurochem. 2006 Jun 1;97(5):1506–18.

475 17. Malnic B, Godfrey PA, Buck LB. The human olfactory receptor gene family. Proc Natl

476 Acad Sci U S A. 2004 Feb 24;101(8):2584–9.

477 18. Malnic B, Hirono J, Sato T, Buck LB. Combinatorial Receptor Codes for Odors. Cell.

478 1999 Mar 5;96(5):713–23.

479 19. Kajiya K, Inaki K, Tanaka M, Haga T, Kataoka H, Touhara K. Molecular bases of odor

480 discrimination: Reconstitution of olfactory receptors that recognize overlapping sets of

481 odorants. J Neurosci. 2001 Aug 15;21(16):6018–25.

482 20. Krautwurst D, Yau K-W, Reed RR. Identification of Ligands for Olfactory Receptors by

483 Functional Expression of a Receptor Library. Cell. 1998 Dec 23;95(7):917–26.

484 21. Baud O, Yuan S, Veya L, Filipek S, Vogel H, Pick H. Exchanging ligand-binding

485 specificity between a pair of mouse olfactory receptor paralogs reveals odorant

486 recognition principles. Sci Rep. 2015 Dec 9;5(1):14948.

487 22. Snitz K, Yablonka A, Weiss T, Frumin I, Khan RM, Sobel N. Predicting Odor Perceptual

488 Similarity from Odor Structure. PLoS Comput Biol. 2013 Sep 12;9(9):e1003184.

489 23. Godfrey PA, Malnic B, Buck LB. The mouse olfactory receptor gene family. Proc Natl bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 27

490 Acad Sci U S A. 2004 Feb 17;101(7):2156–61.

491 24. Nguyen D, Lee K, Choi H, Choi M, Le M, Song N, et al. The complete swine olfactory

492 subgenome: expansion of the olfactory gene repertoire in the pig genome. BMC

493 Genomics. 2012;13(1):584.

494 25. Quignon P, Kirkness E, Cadieu E, Touleimat N, Guyon R, Renier C, et al. Comparison of

495 the canine and human olfactory receptor gene repertoires. Genome Biol. 2003;4(12):R80.

496 26. Quignon P, Giraud M, Rimbault M, Lavigne P, Tacher S, Morin E, et al. The and rat

497 olfactory receptor repertoires. Genome Biol. 2005 Sep 28;6(10):R83.

498 27. Lee K, Nguyen DT, Choi M, Cha SY, Kim JH, Dadi H, et al. Analysis of cattle olfactory

499 subgenome: The first detail study on the characteristics of the complete olfactory receptor

500 repertoire of a ruminant. BMC Genomics. 2013;14(1).

501 28. Niimura Y, Matsui A, Touhara K. Extreme expansion of the olfactory receptor gene

502 repertoire in African elephants and evolutionary dynamics of orthologous gene groups in

503 13 placental mammals. Genome Res. 2014 Sep;24(9):1485–96.

504 29. Ibe CS, Onyeanusi BI, Hambolu JO. Functional morphology of the brain of the African

505 giant pouched rat (Cricetomys gambianus Waterhouse, 1840). Onderstepoort J Vet Res.

506 2014;81(1):01–7.

507 30. Steppan SJ, Schenk JJ. Muroid rodent phylogenetics: 900-species tree reveals increasing

508 diversification rates. PLoS One. 2017 Aug 16;12(8):e0183070.

509 31. Huchon D, Chevret P, Jordan U, Kilpatrick CW, Ranwez V, Jenkins PD, et al. Multiple

510 molecular evidences for a living mammalian . Proc Natl Acad Sci U S A. 2007 May bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 28

511 1;104(18):7495–9.

512 32. Olayemi A, Nicolas V, Hulselmans J, Missoup AD, Fichet-Calvet E, Amundala D, et al.

513 of the African giant pouched rats (Nesomyidae: Cricetomys): Molecular and

514 craniometric evidence support an unexpected high species diversity. Zool J Linn Soc.

515 2012 Jul 1;165(3):700–19.

516 33. Jansa SA, Weksler M. Phylogeny of muroid rodents: relationships within and among

517 major lineages as determined by IRBP gene sequences. Mol Phylogenet Evol. 2004

518 Apr;31(1):256–76.

519 34. Nzalak JO, Byanet O, Salami SO, Umosen AD. Comparative morphometric studies of the

520 cerebellum and forebrain of the African Giant rat (Cricetomys gambianus, Waterhouse)

521 and that of grasscutter (Thryonomys swinderianus),. J Anim Vet Adv. 2008;

522 35. Nzalak JO, Ayo JO, Neils JS, Okpara JO, Onyeanusi BI, Ghaji A, et al. Morphometric

523 studies of the cerebellum and forebrain of the African giant rat (Agr)(Cricetomys

524 gambianus, Waterhouse). Trop Vet. 2005;23(3):87–92.

525 36. Garrett EC, Steiper ME. Strong links between genomic and anatomical diversity in both

526 mammalian olfactory chemosensory systems. Proc R Soc B Biol Sci. 2014 Apr

527 9;281(1783):20132828–20132828.

528 37. Bird DJ, Murphy WJ, Fox-Rosales L, Hamid I, Eagle RA, Van Valkenburgh B. Olfaction

529 written in bone: cribriform plate size parallels olfactory receptor gene repertoires in

530 Mammalia. Proc R Soc B Biol Sci. 2018 Mar 14;285(1874):20180100.

531 38. Freeman AR, Sheehan MJ, Ophir AG. Anogenital distance predicts sexual odour bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 29

532 preference in African giant pouched rats. Anim Behav. 2019;148:123–32.

533 39. Freeman AR, Ophir AG. Scent-marking behavior of the southern giant pouched rat

534 (Cricetomys ansorgei). J . 2018 Oct 24;99(6):1430–5.

535 40. Weetjens BJC, Mgode GF, Machang’u RS, Kazwala R, Mfinanga G, Lwilla F, et al.

536 African pouched rats for the detection of pulmonary tuberculosis in sputum samples. Int J

537 Tuberc Lung Dis. 2009;13(6):737–43.

538 41. Poling A, Weetjens BJ, Cox C, Beyene NW, Sully A. Using giant African pouched rats

539 (Cricetomys gambianus) to detect landmines. Psychol Rec. 2010;60(4):715–28.

540 42. Zerbino DR, Achuthan P, Akanni W, Amode MR, Barrell D, Bhai J, et al. Ensembl 2018.

541 Nucleic Acids Res. 2018 Jan 4;46(D1):D754–61.

542 43. Afgan E, Baker D, van den Beek M, Blankenberg D, Bouvier D, Čech M, et al. The

543 Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2016

544 update. Nucleic Acids Res. 2016 Jul 8;44(W1):W3–10.

545 44. Niimura Y. Identification of olfactory receptor genes from mammalian genome sequences.

546 Methods Mol Biol. 2013;1003:39–49.

547 45. P, Longden I, Bleasby A. EMBOSS: the European Molecular Biology Open

548 Software Suite. Trends Genet. 2000 Jun;16(6):276–7.

549 46. Liu K, Linder CR, Warnow T. RAxML and FastTree: Comparing Two Methods for

550 Large-Scale Maximum Likelihood Phylogeny Estimation. Wu R, editor. PLoS One. 2011

551 Nov 21;6(11):e27731.

552 47. Prieto-Godino LL, Rytz R, Bargeton B, Abuin L, Arguello JR, Peraro MD, et al. Olfactory bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 30

553 receptor pseudo-pseudogenes. Nature. 2016 Nov 24;539(7627):93–7.

554 48. Zhang X, Firestein S. The olfactory receptor gene superfamily of the mouse. Nat

555 Neurosci. 2002 Feb 22;5(2):124–33.

556 49. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+:

557 architecture and applications. BMC Bioinformatics. 2009 Dec 15;10(1):421.

558 50. Olender T, Lancet D, Nebert DW. Update on the olfactory receptor (OR) gene

559 superfamily. Hum Genomics. 2008 Sep 1;3(1):87.

560 51. Niimura Y, Nei AM. Evolutionary dynamics of olfactory and other chemosensory

561 receptor genes in vertebrates. J Hum Genet. 2006 Jun 11;51(6):505–17.

562 52. Alioto TS, Ngai J. The odorant receptor repertoire of teleost fish. BMC Genomics. 2005

563 Dec 6;6(1):173.

564 53. Steiger SS, Kuryshev VY, Stensmyr MC, Kempenaers B, Mueller JC. A comparison of

565 reptilian and avian olfactory receptor gene repertoires: Species-specific expansion of

566 group γ genes in birds. BMC Genomics. 2009 Sep 21;10(1):446.

567 54. Steiger SS, Fidler AE, Valcu M, Kempenaers B. Avian olfactory receptor gene

568 repertoires: evidence for a well-developed in birds? Proceedings Biol Sci.

569 2008 Oct 22;275(1649):2309–17.

570 55. Niimura Y, Nei M. Comparative evolutionary analysis of olfactory receptor gene clusters

571 between humans and mice. Gene. 2005 Feb 14;346:13–21.

572 56. Niimura Y, Nei M. Evolution of olfactory receptor genes in the human genome. Proc Natl

573 Acad Sci U S A. 2003 Oct 14;100(21):12235–40. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 31

574 57. Dugas JC, Ngai J. Analysis and Characterization of an Odorant Receptor Gene Cluster in

575 the Zebrafish Genome. Genomics. 2001 Jan;71(1):53–65.

576 58. Mombaerts P. Genes and ligands for odorant, vomeronasal and taste receptors. Nat Rev

577 Neurosci. 2004 Apr 1;5(4):263–78.

578 59. Mainland JD, Keller A, Li YR, Zhou T, Trimmer C, Snyder LL, et al. The missense of

579 smell: functional variability in the human odorant receptor repertoire. Nat Neurosci. 2014

580 Jan 8;17(1):114–20.

581 60. Hughes GM, Finarelli JA. Olfactory receptor repertoire size in dinosaurs. Proc R Soc B

582 Biol Sci. 2019 Jun 12;286(1904):20190909.

583 61. Bhatnagar KP, Kallen FC. Cribriform plate of ethmoid, olfactory bulb and olfactory acuity

584 in forty species of bats. J Morphol. 1974 Jan 1;142(1):71–89.

585 62. Baron G, Frahm HD, Bhatnagar KP, Stephan H. Comparison of brain structure volumes in

586 Insectivora and Primates. III. Main olfactory bulb (MOB). J Hirnforsch. 1983;24(5):551–

587 68.

588 63. Healy S, Guilford T. Olfactory-Bulb Size and Nocturnality in Birds. Evolution (N Y).

589 2006 Mar 1;44(2):339.

590 64. Onyono PN, Kavoi BM, Kiama SG, Makanya AN. Functional Morphology of the

591 Olfactory Mucosa and Olfactory Bulb in Fossorial Rodents: The East African Root Rat

592 ( splendens) and the Naked Mole Rat (Heterocephalus glaber). Tissue Cell.

593 2017 Oct 1;49(5):612–21.

594 bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. 32

595 Supporting Information

596 S1 File. Fasta file of all 1145 ORs from the manual pipeline search.

597 S2 File. Fasta file of all ORs from the ORA automated search method.

598 S3 File. Tree file from Pouched rat ORs from manual pipeline search

599 S4 File. Tree file from Rodent ORs, Pouched rat ORs from manual pipeline search

600 S5 File. Tree file from Rodent ORs, Pouched rat ORs from ORA automated search method

601 S6 File. File categorizing family groupings for Pouched rat ORs

602 S1 Table. Distribution of pouched rat olfactory receptor (OR) genes among families using

603 the automated olfactory receptor assigner.

604 bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license. bioRxiv preprint doi: https://doi.org/10.1101/742668; this version posted August 21, 2019. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY 4.0 International license.