<<

Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021 diversity complexity. trait in predator number in overall patterns their evolutionary than determining eco- important an more in is of community diversity logical results the that Our concept . general of the targets support cascade coagulation sequence the by supported in claim diversity a venom, of relevant targets functionally the in measures divergence prey of diversity phylogenetic the that concerted suggest families We a families. gene is gene independent evolution many largest among complexity response four that showing the venom, of viper in three of tran- predicted complexity evolution further scriptomic the diversity in phylogenetic alone, Prey lineages complexity. of of number diver- the species’ than overall prey rather was gence, implicating than specifically diversity, complexity species venom prey to correlated was strongly diversity more for phylogenetic prey Strikingly, We these analyses. variation. for comparative phylogenies dietary genome-scale quantify constructed to then studies collated history and complexity natural venom several of measures transcrip- proteomic ven- independent copperheads, and front-fanged generated tomic , We of the cottonmouths. clade America: and largest cantils, North the in in snakes diet omous from sources ecological selection and alterna- of complexity tested venom We unclear. linking are hypotheses variation tive this complexity, of in causes the decreases yet and and increases apparent with evolutionary timescales, over fluctuates complexity Venom to prey. function ven- collectively incapacitate that Predatory proteins between multiple complex- species. of links composed across trait are functions hypothesized oms 2020) of their testing 28, and evolution July by review forms the for characterized complex (received in 2021 be 9, selection can February natural approved ity and of MO, Louis, role St. Louis, The St. in University Washington Losos, B. Jonathan by Edited and ; 05503-900, neovd(1 12). remains complexity (11, rel- trait unresolved most of is evolution species the understanding interacting for of evant nature (8–10). or number strategies the infection phylogenetically Whether divergent with for guilds reserved parasite alter- units distant or, defensive (4–7), to might modular diversity) response parasites species natively, (a or species many richness parasite community against to each overall for trajectory. defending responds defenses evolutionary trait host unique trait’s evolve a a the pressures example, on specific impacts For critical func- The in have 3). of complexity can forms (2, favor varied to traits interac- expected to species tional are rise and that give selection evolution early interactions natural trait an species between placing Diverse link (1), tions. the nature in on complexity focus of maintenance and D Fundac¸ 02138; MA Cambridge, University, S Butantan, h Instituto Imunopatologia, de Ju Universidad 32306; a Parkinson L. Christopher Moura-da-Silva M. Ana Viala Louis Vincent Hogan P. Michael aailCasta Gamaliel Holding L. pitvipers Matthew American North in complex more favor diets diverse Phylogenetically PNAS eateto ilgclSine,CesnUiest,Cesn C29634; SC Clemson, University, Clemson Sciences, Biological of Department etro ois mueRsos n elSgaig S Signaling, Cell and Immune-Response Toxins, of Center n o aua eeto ol rmt h evolution the promote could hypothesiz- selection when natural riverbank entangled how an ing evoked arwin 01Vl 1 o 7e2015579118 17 No. 118 Vol. 2021 od eiiaToia otrHio iiaDuao aas600 Brazil; 69040, Manaus Dourado, Vieira Heitor Doutor Tropical Medicina de ao c ˜ eateto vlto,EooyadOgnsa ilg,TeOi tt nvriy oubs H43210; OH Columbus, University, State Ohio The Biology, Organismal and Ecology Evolution, of Department | predator rzdlEtd eDrno ..300G 35010 C.P. , de Estado del arez ´ neda-Gayt ˜ | toxin b m eateto oetyadEvrnetlCnevto,CesnUiest,Cesn C29634 SC Clemson, University, Clemson Conservation, Environmental and Forestry of Department g,h unrS Nystrom S. Gunnar , a,b,1 | transcriptomics f,k akJ Margres J. Mark , a,m,1 an ao .Strickland L. Jason , ´ j eieG Grazziotin G. Felipe , ue Biol Museu d hitp .Gr I. Christoph , oPuo05390 Brazil; 05503-900, Paulo ao ˜ | gc,IsiuoBtna,S Butantan, Instituto ogico, ´ itbreadth diet b a,i,5 cye .Ellsworth A. Schyler , mzPlco g. ; Dgo., Palacio, omez ´ oPuo05390 Brazil; 05503-900, Paulo ao ˜ a,2 rk Hingst-Zaher Erika , unwald ¨ l ht .Rautsaw M. Rhett , .LseGibbs Lisle H. , g Laborat e ao .Jones M. Jason , oPuo05390 Brazil; 05503-900, Paulo ao ˜ b eateto ilgclSine lrd tt nvriy alhse,FL Tallahassee, University, State Florida Science, Biological of Department rod oiooi piaa nttt uatn S Butantan, Instituto Aplicada, Toxinologia de orio ´ doi:10.1073/pnas.2015579118/-/DCSupplemental at online information supporting contains article This 5 4 3 2 ulse pi 9 2021. 19, April Published 1 the under Published Submission.y Direct PNAS a is article This H.L.G., interest.y F.G.G., competing A.M.M.-d.-S., no declare M.J.M., authors The A.J.M., F.G.G. paper. R.M.R., y the and wrote J.L.S., C.L.P. I.L.M.J.-d.-A., and M.L.H., D.R.R., M.J.M., L.A.F.-d.-S., and T.J.C., data; S.A.E., J.L.S., analyzed G.S.N., M.L.H., tools; A.J.M., and reagents/analytic E.P.H., D.R.R., new R.M.R., F.G.G., contributed I.L.M.J.-d.-A., D.R.R. E.H.-Z., research; M.B., T.J.C., performed M.J.M., V.L.V., C.L.P. S.A.E., L.A.F.-d.-S., G.S.N., M.P.H., J.M.J., C.L.P. A.J.M., C.I.G., and E.P.H., G.C.-G., G.C.-G., D.R.R., R.M.R., M.B., H.L.G., J.L.S., F.G.G., T.J.C., M.L.H., A.M.M.-d.-S., M.P.H., research; I.L.M.J.-d.-A., designed A.J.M., E.H.-Z., E.P.H., M.J.M., R.M.R., J.M.J., J.L.S., C.I.G., M.L.H., contributions: Author xml,Shannon’s example, rcsl unie ytenme n bnac funique of such metrics abundance diversity Shannon’s of as and use the diver- number allows be This the the can (16–19). components complexity to by their complexity quantified because trait precisely communities linking ecological are of for interactions sity models antagonistic the in as by involved them emerging often produce traits to and required Molecular 14) genome (15). (13, the in phenotype information functional of amount final the to ing rsn drs:Dprmn fItgaieBooy nvriyo ot lrd,Tampa, Florida, South of University Biology, Integrative 32611. y of FL Department Gainesville, address: Florida, Present of University Biology, of Department address: NC Present Wilmington, College, Community AL Fear Mobile, Cape Department, Alabama, Science South address: Present of University Biology, of Department address: Present owo orsodnemyb drse.Eal [email protected] rmatthe- or [email protected] Email: addressed. be may correspondence whom To L33620.y FL 28401. 36688. [email protected]. b pce omnt,rte hnterrcns ln,i an is traits. complex alone, of of richness evolution nature the their The in than feature diver- venom. important rather reflecting of community, targets likely species physiological a species, the of in phylogenetic community gence to a response in in indicate diversity evolves results complexity The venom diets. that high- of history diversity Using natural phylogenetic to complexity unclear. via in changes link remain we Amer- pitvipers, favor- North traits ican in pressures complexity in complex venom selection of question measurements or resolution specific major the simple a but ing is sciences, life evolves the complexity biological Why Significance ope risaedfie ymlil opnnscontribut- components multiple by defined are traits Complex c j e ioh .Colston J. Timothy , l i EPM .. il del Villa A.C., HERP.MX Laborat eateto raimcadEouinr ilg,Harvard Biology, Evolutionary and Organismic of Department y y ai .Rokyta R. Darin , In , a coL .Junqueira-de-Azevedo M. L. acio ´ rc .Hofmann P. Erich , e rod Colec¸ de orio ´ H uin .Freitas-de-Sousa A. Luciana , NSlicense.y PNAS ne 1,2)t umrz ri opeiy For complexity. trait summarize to 20) (16, Index k y nttt ePsus Cl Pesquisa de Instituto H https://doi.org/10.1073/pnas.2015579118 e Zool oes ˜ a sdt ikmjrhistocompatibility major link to used was b lae,Clm 87,Mexico; 28973, Colima Alvarez, ´ b,4 and , a,3 gcs nttt uatn S Butantan, Instituto ogicas, iulBorja Miguel , ´ d autdd inisBiol Ciencias de Facultad nrwJ Mason J. Andrew , . y https://www.pnas.org/lookup/suppl/ nc alsBorborema, Carlos ´ ınica oPuo05390 Brazil; 05503-900, Paulo ao ˜ f d g,h , , , f Laborat ogicas, ´ oPaulo ao ˜ a,c | , f10 of 1 orio ´

EVOLUTION complex diversity with the richness of local parasite species (21, tive framework. We collected venom and venom-gland samples 22), as well as phytochemical defenses in plant leaves to the from Agkistrodon, , and Sistrurus across North America α-diversity of herbivorous insects (16, 17). to generate the largest dataset of proteomes and venom-gland venoms function by disrupting homeostatic physiolog- transcriptomes for this group to date (68 lineages). We gen- ical processes to rapidly incapacitate prey (23) and provide the erated a phylogeny by harvesting 1,525 nonvenom genetic loci opportunity to understand the forces mediating trait complexity from transcriptomes and collated diet studies on these snakes. in predator–prey interactions. As protein mixtures, venom phe- Specifically, we tested three primary hypotheses for the evo- notype complexity can be quantified through chromatographic lution of venom complexity levels (shown in SI Appendix, Fig. separation. Additionally, the venom-gland transcriptome directly S1). First, if intense pairwise coevolution favored trait com- links the venom proteome to the genotype (24–26). Transcrip- plexity, we expected to observe a negative relationship between tomes, therefore, provide a second independent measure of venom complexity and prey species diversity, as the intensity venom complexity via expressed genomic sequence complex- of coevolutionary selection is expected to be highest when ity (14). Recent studies across several venomous lineages have only a few key prey species are consumed (45, 46). In con- shown that the transcript (24), protein (25), and enzymatic activ- trast, if diffuse coevolution with multiple species exerted the ities (26) of venom are more complex when more prey classes are strongest selection on trait complexity, we predicted venom com- consumed. However, these studies do not directly test alternative plexity to be positively associated with prey species diversity, hypotheses for how prey diversity may select for either reduc- as diffuse coevolution is expected to be strongest when sev- tions or increases in venom complexity. Disentangling the roles eral prey species are important (17, 47). If prey divergence, of species richness and phylogenetic diversity of diets in the evo- rather than the simple number of prey, exerted the strongest lution of venom complexity requires taxon-wide diet and venom selection on trait complexity, we expected venom complexity characterization in a system known to exhibit adaptive evolution to be correlated with phylogenetic diversity of prey consumed of prey-specific venoms. and for prey phylogenetic diversity to explain more varia- Functional studies of have revealed several tion in complexity than prey species diversity alone (48, 49). examples of variable complexity and evolved prey specificity. Finally, a lack of any relationship between dietary ecology and For example, feeding solely on fish has relaxed selection venom complexity accompanied by strong phylogenetic signal in in the eydouxii, resulting in a simplified venom complexity would support neutral evolution of complexity venom arsenal and frequent null and deleterious mutations in levels. venom genes (27, 28). Additionally, adaptive specialization of snake venom function occurs toward congeneric prey species Results (29) and populations (30–32). Finally, highly expressed venom Phylogenetic Relationships. We inferred phylogenies using both gene paralogs from the same snake can display taxonomic speci- species-tree and concatenation approaches (Fig. 1 and SI ficity. Paralogous three-finger toxins in the Amazon puffing Appendix, Figs. S2–S4) using sequences from 169 individuals snake (Spilotes sulphureus) are alternatively lethal to representing 46 species of Agkistrodon, Crotalus, and Sistrurus or (33), while snake venom metalloproteinase (SVMP) (71.8% of species-level diversity) (38). To capture maximum paralogs in the Brazilian lancehead (Bothrops neuwiedi) differen- diversity within our focal clade, we expanded our sampling to tially enact procoagulant function in blood of either mammals or include phylogenetically discrete subspecies and lineages with (34). Despite deep conservation of the individual metabolic described phylogenetic diversity, resulting in a final dataset of and homeostatic targets of venom (35, 36), the sequence diver- 9 Agkistrodon lineages, 5 Sistrurus lineages, and 54 Crotalus lin- gence that has accumulated in these systems appears to influ- eages. See SI Appendix, Results for an abbreviated discussion of ence venom protein function and may exert selection on venom relationships based on our topologies. We used these complexity. trees for phylogenetic comparative analyses of the relationship Unlike other venomous groups (24, 25), dietary ecology in between diet diversity and venom complexity. viperid snakes has been researched extensively (Dataset S1 A and B), allowing detailed quantification of dietary variation. Most Complex Genotypes Underlie Complex Phenotypes. Venom tran- vipers feed mostly on small vertebrates, with the ancestral diet scriptomic and proteomic complexities were positively related 2 of the clade likely being mammals, lizards, and frogs (37). Rat- (R = 0.45, β = 0.0004, T2,62 = 7.1, P < 0.001; Fig. 1, Inset tlesnakes (Crotalus and Sistrurus), copperheads, cottonmouths, [scatterplot]), affirming a link between the expressed genotype and cantils (Agkistrodon) comprise the largest clade of the front- and the phenotype complexity in snake venoms. The lack of a fanged species in North America, with between stronger relationship between these metrics likely underscores 45 and 64 recognized species (38). Species range widely in dietary the nature of our sequence data as a richer source of infor- ecology. For example, the Timber Rattlesnake (Crotalus hor- mation. In high-performance liquid chromatography (HPLC), ridus) is a specialist (39), whereas the exceptionally similar proteins may elute together as a single peak in chro- diverse diet of the Florida Cottonmouth (Agkistrodon piscivorus matograms, potentially underrepresenting phenotype complexity conanti) (40) includes fish, frogs, mammals, snakes, lizards, birds, at the level of individual components. Our transcriptomic com- and even turtles. The venoms of these three genera consist of plexity measure will instead quantify such differences, while between 10 and 70 proteins from 15 to 25 distinct gene families the equal weighting of all unique sequences regardless of tran- (41, 42). Their venom cocktails underlie diverse functionality, script length provides a complimentary focus on total sequence with venoms enacting neurotoxic, coagulopathic, hemorrhagic, diversity, rather than organization into transcripts. There was and myotoxic effects in accordance with their composition (41). phylogenetic signal in both transcriptomic complexity (Pagel’s λ Functional variation also exists within gene families, where neo- = 0.79 ± 0.014, P = 0.002) and proteomic complexity (λ = 0.96 functionalization has generated phospholipase A2 (PLA2) par- ± 0.009, P < 0.0001; Fig. 1), as well as marked variation among alogs that produce both cytotoxic and neurotoxic functions (43, lineages. The transcriptomic complexity underlying venoms 44) and SVMP paralogs with hemorrhagic and coagulopathic varied by a factor of 10 among lineages, with the most com- functions. plex, the Massasauga Rattlesnake (Sistrurus tergeminus To test alternative hypotheses for the link between ecologi- edwardsii), expressing 45,191 effective k-mers and the simplest, cal communities and complexity, we have combined analysis of the South American Rattlesnake ( terrificus), snake diets and independent, standardized venom proteomic and expressing only 4,723 effective k-mers. Venom proteomic com- venom-gland transcriptomic complexity measures in a compara- plexity also varied considerably, with the most complex being the

2 of 10 | PNAS Holding et al. https://doi.org/10.1073/pnas.2015579118 Phylogenetically diverse diets favor more complex venoms in North American pitvipers Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021 Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021 vlto oadbt ipe n oecmlxvnm has venoms S5 complex Fig. Appendix, more (SI time and over simpler occurred both toward evolution of ancestor mon effective and (28,751 peaks) (22.3 complexity protein on transcriptomic intermediate indicate peaks (54) in Tiger 5 effective function version ace the ape 8 the using being reconstructions only simplest state Ancestral having the average. ), tigris and (Crotalus peaks protein Rattlesnake effective mean a 32 with of lepidus ) lepidus (Crotalus Rattlesnake Rock Mottled hlgntclydvredesfvrmr ope eosi ot mrcnpitvipers American North in venoms complex more favor diets diverse Phylogenetically al. et Holding species. across (Inset gene venom scatterplot venom the The to of available. aligning not complexity phylogeny were reads protein of measures of and tips complexity complexity the venom complexity to transcriptome which transcriptomic next for mean nuntius, between taxa Bars and viridus indicate relationship 70%. Crotalus gray) bars than positive of (dark less significant place species values the in support each Dashes shows posterior from gray). indicate (light samples nodes regions venom on coding of circles transcript open complexity and protein estimates, mean date indicate divergence for bars) (blue CIs 1. Fig. Protein Complexity (peaks) 10 15 20 25 30 18 ae hlgn of phylogeny Dated R p 00 00 00 40000 30000 20000 10000 Transcript Complexity(kmers) <0.001 2 =0.45 Agkistrodon and cerastes, cerastes Crotalus Neogene

Miocene Miocene Agkistrodon, 10 n atenks ugsigthat suggesting rattlesnakes, and and Crotalus, ).

mr)i h com- the in k-mers) Pliocene rtlssuuau scutulatus scutulatus Crotalus Sistrurus

Pleistocene Quat. rdcduigAta-I n ae ihTeP n oslclbain oe nld 95% include Nodes calibration. fossil and TreePL with dated and Astral-III using produced 0 Crotalus durissuscollilineatus Crotalus lannomi Crotalus ericsmithi Crotalus cerastes laterorepens Crotalus cerastes cerastes Crotalus cerastes cercobombus Crotalus enyo Crotalus horridusatricaudatus Crotalus horridus Crotalus intermediusgloydi Crotalus transversus Crotalus priceimiquihuanus Crotalus pricei Agkistrodon contortrixpictigaster Agkistrodon contortrixlaticinctus Agkistrodon contortrixmokasen Crotalus oreganus oreganus−South Crotalus oreganus helleri Crotalus oreganus oreganus−North Crotalus oreganus lutosus Crotalus oreganus abyssus Crotalus cerberus Crotalus viridisnuntius Crotalus scutulatussalvini Crotalus scutulatusscutulatus−Cen.Mx Crotalus scutulatusscutulatus−Son. Crotalus scutulatusscutulatus−Chih. Crotalus tigris Crotalus adamanteus Crotalus pyrrhus Crotalus mitchellii Crotalus stephensi Crotalus ruberlucasensis Crotalus atrox West Crotalus atrox East Crotalus molossusoaxacus Crotalus basiliscus Crotalus molossusnigrescens Crotalus molossus Crotalus ornatus Crotalus durissusterrificus Crotalus willardi willardi Crotalus willardi silus Crotalus lepidusklauberi Crotalus morulus Crotalus aquilus −East Crotalus triseriatus−West Crotalus campbelli Crotalus armstrongi ravus Crotalus polystictus Sistrurus tergeminus tergeminus Sistrurus tergeminus edwardsii Sistrurus catenatus Sistrurus miliariusbarbouri Sistrurus miliariusstreckeri Agkistrodon piscivorusleucostoma Agkistrodon piscivorus Agkistrodon piscivorusconanti Agkistrodon russeolus Agkistrodon bilineatus Agkistrodon contortrix rncitmccmlxt a oee etb ryMPD this represent prey to used by trees best phylogenetic alternative modeled all was across complexity = transcriptomic (mean 731 to and 2 118 (Fig. from divergence effec- of 6.7) ranged years = million MPD 365) (mean prey 15 while to species, 2.9 tive from ranged diversity consumed. (mea- prey species of diversity MPD Prey Complexity. and species species) of simple number Venom the effective as both sured in Predicts widely ranged Diversity species Diet Phylogenetic h bevdvraini ohvnmpoemcand proteomic venom both in variation observed The mgscei:Tai ihr(oadClee a neo TX). Angelo, San College, (Howard Fisher Travis credit: images (Venom Phenotype) (Venom Protein Complexity 30 20 10 0 (ExpressedGenotype) Venom 10000 Transcript Complexity 20000 https://doi.org/10.1073/pnas.2015579118 30000 40000 i.S6). Fig. Appendix, SI rtlspie pricei, pricei Crotalus PNAS | Snake f10 of 3 )

EVOLUTION Tree scale: 100 million years

Bird & Turtle Squamate Fish Mammal

i Callis

P

s

P

Crotaph s

Cophos

h

e

h o

Ctenosau H r

d

r n

y

i

y Tham n o

n a Gambel h Ophisau e Dipsosa n

a

l Helod r r

o Thamn urus d Anol b o

n

y

a s cus r r t

l s ami i

o omus

o ytus bicinctor e a callii

o o r a Thamn Saur l

o

Store Thamn m n formis rri Bar ciosus h s T m i Gambel Tham urus texan

ana ophis me i k gister rop e s caroli a aciosus

a cutti Anolis sagrei aris i a r a r Uma notata a raconoi rma sus a Tham ophis marcian ia wislizen a hemilopha rus attenuatus u i anthalis

o

sia imb m m omal c Elga neus ria occipidocloni rus dor m r nor uatus

o o o i rovii entalis n ophis scaliger b q kii osus

ophis eq a s s Nerodia fasciata r

Carp i ophis sau rus gr n n

c

c o o orus un osoma p orus me nophis si n i u

u u orus or r a sila Diadophis punctatus u u Heterodon plati n p n gatus ia king n Fara lanog ensis porus g orus gramm r d t s ater l p V Store l y orus palaciosi y porus ma

p a u porus zoste a dana ricata r semi es u r hophis a irg on lineatum ectum salis p

r e m porus scal olina t

h orus m s i h porus bic

e ens Chion tomacul a porus ae r minckii T s ii orus tor ntina ratus

ncia iabacu P P Phry porus jar antilla nia striatu Phrynosoma m Uta stansburi Urosa Urosaurus ornatus p orus spi e o Lampr ues Scelo a Scelop r Scelo porus clar orus occid Hypsigl Scelo Coluber constrictor ia dekayi i Scelo p ster Scelo orus vi r r us i Scelop p Lamp itus Scelo Opheod talis Scelo us actis occipitalis Scelo porus undulatus l Scelo ne ferox mys flori m Scelop melanocephala Scelo e anus opeltis tr a Scelo o i oenus Scelo ra serp fornica r ta Scelo apene ca i iceps opeltis e Scelo r oura annulatarhinos Scelop d na jani l Scelo ochelystherus temrnon od flavesc lus Arizo r r a Apal er umbel l Pituop ys vernal a T Pseudrachemys scripta a ostris rius T s virgin olinusons i Chelyd r r Crotal na elegansm angulum Macr eplaa cal macr vicianus CrotalCrotal Sterno ngir o Agkistr Crotal his deppeiexicana Kinoste ifornicaus Aspid Bonas lymbus po l Crotalus aquilus Colinu dis falcine Crotal us triserius horrus is Callip apicus va inim u Zenaid r erpes ca Aspid oscelis mao s atroxviri Podi n orus lud don pisciv Rallus lo idonax fulvif Charinaus bottae cerastes d Plega AspidoscelisAspido Aspidoscelis sexli tigri is Sphy locoma etaca thalassinatoriusis o idus Mela iparus m acus scelis hype a Emp tus Thryoth us guttatus rina m rufum scelis ino r Aphe a mo o Psaltrachycinus migra PlestiodPlestio rus T lineata mus Plestio Ame ratus Cathar r la passe tus Plestiod ythrus Turd estes grama don ifasciatusrnatus Sturnus vulgar ophthal on tetragrammusd va ameivaneatus s Toxostom ispiza bi Plestio on obsoletus Spizel h Plestiodonon inexp laticeps Chond Amp o erythr odia is don skiltonianus Pipilo macul Plestiodon gilbeectatus Pipil etes gramineus is Scincel Junco hyemalisla alpestr Coleonyx varieg Kieneria crissalis dinal Xantusi Pooec na ea la later Melospizamophir domesticus mel a l Ere eric la rti inalis car Pseuda Hyla squia vigilisalis Passe aga ceru Osteopilus septeHyla cine Card iceusphalus atus Spiza amesiaus motacil ater is Acriscris crepitans ocul rella Setoph rea Park ius phoenrsupial Rana clamitansaris Molothr gus cyanoceiniana Rana catesbntriona Agela his ma GastrophrRana sph Eupha lis Didelp n lotorn ius yne caroenocephae Didelphiso virg r Spea intermontanaiana Procy us Spea bombifrolinensisla Mustelaogale visous puto pallid Spea h Spil lis Spea m Antrozo reweri Scaphi ammo ns Eptesicus fuscus Scaphopus holultiplicandii Myotisascalops soda b rdi iopus couchb ta Par is Euryce rookii Scalopusrex aquaticus olinenscrawfo Euryce Notioso a longicaa lucifugaii ina car Eurycea cirr Blarrina brevicaudaodwini PseudoeurycePseudotriton ruuda Bla rva igera Cryptotis go idgii Plethoda papenfussiber Cryptotisx trowbr pa e on cinereus Sore DesmognathusPlethodon br glutinosus Sorex s Sorex cinereu imleyorum Sorex fumeusrostris Aneid ex longi Amph es lugubris Sor Notophthal iuma means Sorex haydeni mus viridescens Sorex vagransinceps Siren inter Ochotona pr Micro media ifornicus pterus salmoides Lepus cal mani Lepomis macrochirus Sylvilagus bach Fundu Sylvilagus audubonii lus heteroclitus Sylvilagus nuttallii Ameiurus brunneus Sylvilagus aquaticus Scolopendromorpha Sylvilagus palustris

MPD = -4.7, Protein Complexity = 17.1 0

8

0 2 Sylvila

4 gus cunicularius 6 6 Timber Rattlesnake ( 4

2 Sylvilagus floridanus

6

6

5

6

3 7 8 Aplodontia rufa

Sid 9 Glaucomys vo

3

4

5 6

MPD = -1.4, Protein Complexity = 7 2 1 Tamiasciurus hulans ewinder ( Banded Rock Rattles Sciurus carol dsonicus Sciurus nigerinensis Tamias striatus Tamias amoen MPD = 4.4, P Tamias mini us Ammosper mus Am mophil C. cerastes cerastes Marmotamosperm flavivent us leucur ophil us Spermophilus variegatusus harr Spermoph ris isii Sper rotein naComplex Spe mophiilus moha k Spe rmoph lus tereticaudusvensis e ( Spermophilusrmophiilus richardsonii tridec C. h. horridus C. l Spermophiluslus moll columemlin Sper eatus epidu Sper moph bianus Geomysmophilus burilus beech spilos Pappogeomys b is Thom ity = 27.s kl sarius oma Thom omys talpoi e ) auberi Thomoomys bottae yi 24.5 Dipodo ulleri Dipodo mys umbr d 6 Dipod mys dese es ) Dipod mys spectabilis Dipodomysomys agil ordii inus ) Dipod omys me rti Dipod Per omys pana Perognathusognathusomys fasciatus heea rria Perognathus flavescei mi Perognathus flavus s Per m Per rmanniintinus Chaetodipusognathus form lo mplus Chaetodiognathus pa Chaetodi Chaetodipus penic Chaetodi Chaetod n ns Chaetodipus spinatuspus baile gimembris Chaetod pus hispid rvus 0.8 Napae Zapus hu pus inter osus Zapus p ipus califo yi Mus musc Rattus rattus ozapusi insignis us Clethr pus fallax i Lem mediusllatus

0.4 Synaptomys cooperir d Ondatra zibe incepssonius rnicus Microtus chrotorrm i Micro onomys ug Micr iscus curtatuslus Micro Micr Micr Micr o tus mexicanu Micr tus ochrog 0.0 Sigmodon his o tus pinetor Bolom o a Calomys callosus tus californ Oligoryz o tus montanus thicus pperi Oryzomys pal o tus pennsylvanicus Akodo tus townsendii Akodon urichi Holochilus bras Absorbance (220nm) 0 20 40 60 80 100 120 0 20 40 60 80 100 120 0 20 40 60 80 100 120 Pseudoryzomys si Neotom ys lactens hinus Neotom Neotom Neotom aster Neotom Baio u s Ochrotomys n n Reithr omys eliu m Retention Time (mins) Retention Time (mins) Reithr i Retention Time (mins) Reithr sanctipaul cus Onychomys le Onychomys tor Neotomo Per pidus Peromyscus cri Per Pero Peromyscus leuc Per Per Per P P m a a fuscipes

e e ys taylori a albigu cinerea o a florida ustris o o r r odontomys h a omyscus maniculatus o o myscus califo odontomys megalotis o o myscus ere dontomys fulve myscus gossyp lepida

myscus mel myscus mexi

m m r iliensis us

y y don alstoni e

s s nsis

c c la m uttalli n

u u a plex s s u

r cogaster

t p idus

r n

e

u

micus itus umulis

c

e

anotis o r

t i nicus o canus pus scens

r i a nus

l

i

s Density 0.0 0.02 0.04 0.06 0.08

10 15 20 25 30 35 Venom Protein Complexity

Fig. 2. Phylogenetic diversity of all prey species used to calculate diet diversity. Topology was produced by joining published supertrees (50–53) using divergence dates from TimeTree.org (52). The pie chart (Top Left) shows the overall percentage of recorded snake stomach contents belonging to each prey taxa, and the pie chart colors correspond to colors on the prey phylogeny. Tracks outside the tree represent the diet composition of species with the lowest (blue), nearest to average (orange), and highest (red) phylogenetic diet diversity, as measured by standardized MPD. Gray dashed lines on track mark 50% of the diet. Insets show three representative HPLC chromatograms of a venom sample from these snake lineages with absorbance values standardized to the tallest peak in each sample and a density plot of all species’ mean HPLC complexity values for which diet data were also tabulated, with values for species featured in chromatograms marked with vertical lines and color-coded as for diet tracks.

relationship (change in sample-size corrected Akaike Informa- complexity and MPD (Fig. 3), model selection supports the phy- tion Criterion [∆AICc] > 2 for all MPD models compared to logenetic diversity hypothesis for the evolution of more complex complementary species diversity models; Table 1). Furthermore, venom traits across these predators. For brevity, we discuss pgls the MPD models were significantly better fits to the data com- analyses using our Astral-III species tree here, with replicate pared with intercept-only null phylogenetic generalized least analyses using other phylogenetic hypotheses reported in Table squares (pgls) models (∆AICc > 2), suggesting neutral evo- 1. Venom proteomic complexity and MPD were positively cor- 2 lution alone does not explain variation in venom complexity. related (β = 0.73, R = 0.17, T2,26 = 2.3, P = 0.028; Figs. 2 and Together with positive slopes for the relationship between venom 3), but we did not detect a significant association between species

4 of 10 | PNAS Holding et al. https://doi.org/10.1073/pnas.2015579118 Phylogenetically diverse diets favor more complex venoms in North American pitvipers Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021 Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021 itDvriyPeit xrse opeiyi he eo Gene Venom Three in Complexity Families. Expressed Predicts Diversity Diet using data. result trait our and diet of both robustness tests of demonstrating summations model these alternative 1), in best (Table complexity the well transcriptomic remained as and proteomic again our both lin- recalculate MPD their of we Prey using (56) that diversity assignments. al. diet required and eage et This complexity Blair venom (55). of of al. estimates phylogenies et the Alencar used and we hypotheses, netic (β complexity transcriptomic R species venom raw between and relationship significant diversity a detect not did also (R complexity tomic hlgntclydvredesfvrmr ope eosi ot mrcnpitvipers American North in venoms complex more favor diets diverse Phylogenetically al. et transcripts. Holding gene venom of regions coding to mapping reads sequencing all of (Bottom) complexity 3. Fig. 0.007 0.003 0.009 569.7 382.4 0.004 569.7 0.68 0.28 382.4 (β complexity T protein 0.68 venom 0.019 and 577.6 390.7 0.029 diversity 0.29 models. MPD and Div.) The (Sp. (56). 98.7 577.6 al. diversity 0.031 162.0 et species 0.020 575.2 389.5 Blair in 390.8 from terms one diet and (55), al. et 161.9 0.18 0.21 Alencar 575.5 98.7 389.6 0.18 (56) 164.7 102.4 al. 0.28 et Blair 4. (55) al. et 164.4 Alencar 3. 164.3 101.7 102.7 RAxML 2. 164.0 Astral-III 1. 101.5 complexity Transcript (56) al. et Blair 4. (55) AIC al. et Alencar 3. RAxML 2. Astral-III 1. complexity Protein Analysis/tree complexity transcript venom of predictors AIC as via MPD comparison prey Model 1. Table 2 2,26 oass h outeso u eut oohrphyloge- other to results our of robustness the assess To ahmdli etdars ortes w rmti td,oefrom one study, this from two trees: four across tested is model Each 0.006, = = eainhpbtenseisdvriy(fetv ubro pce)(Left species) of number (effective diversity species between Relationship −1.3, enx sesdterltosi ewe eo tran- venom between relationship the assessed next We T 2,26 P .8.MDas rdce eo transcrip- venom predicted also MPD 0.18). = = 2 −0.42, 0.25, = ulS.Div. Sp. Null Venom Transcriptome Complexity Venom Proteome Complexity c 20000 30000 40000 1520 25 30 P T 0.68). = 2,26 AIC 101214 8 6 4 468101214 c 2.9, = Effective NumberofSpecies Effective NumberofSpecies fpe pce iest and diversity species prey of c P = .0;Fg ) We 3). Fig. 0.007; = P P −0.43, ausaesonfor shown are values MPD AIC R p = R =0.68 c R p 2 2 =0.18 =0.0006 2 −216.2, 0.07, = =0.06 P opeiyadMDwr on o h VPgn family gene SVMP the for found were (R MPD gene and transcriptomic single between complexity a transcript relationships from average positive transcripts Significant of for family. lengths similar controlling on while based length MPD families, in and gene heterogeneity complexity of among transcriptomic levels between These of relationships bp). assessment the 463 = an length permitted sequence sequence analyses mean mean [CTLs]: lectins [SVSPs]: C- bp; in proteases 783 families = serine gene length venom pairs venom base snake 1,641 largest = length [bp]; four sequence mean the the (SVMPs: venoms at for viper prey level of family diversity gene phylogenetic and complexity scriptomic l (R ily pce iest a infiatybtcnieal esstrongly less considerably but MCCD significantly with was diversity correlated species strongly was MPD (R species, Appendix , in snake (SI (MCCD) the site distance for per cascade substitutions distances coagulation acid phylogenetic mean amino a mean as recalculate snake to each protein these each for used and trees gene and generated plasminogen, We S8). different (Dataset fibrinogen, C protein kininogen, eight FVIII, prothrombin, for FV, We (57): FXIII, SVSPs sequences by prey. targeted acid proteins different cascade coagulation of amino targets sequence available acid venom compiled amino homologous using diversity in species variation in raw diversity does functional than reflects prey better diversity Key phylogenetic in prey Divergence Acid Targets. Amino Venom Predicts Diversity Phylogenetic Prey families. way more gene several this from across occurs selection in prey to diverse family phylogenetically response gene evolutionary an by demonstrates complexity transcriptomic between association (T Parsing complexity significant transcriptomic a and show MPD not did however, family, (R family Venom Transcriptome Complexity Venom Proteome Complexity 2 2 0.32, = 70%, = rMD(Right MPD or ) 20000 30000 40000 15 20 25 30 2 0.32, = 2 4− 4 2 0 −2 −4 4− 4 2 0 −2 −4 0.20, = T P 2,26 < aty epromdadrc eto whether of test direct a performed we Lastly, Standardized MPD T Standardized MPD 3.5, = 2,26 PLA 0.001; rtranscriptomic or (Top) complexity proteomic and ) T 2,26 3.5, = 2 ensqec egh=48b;and bp; 418 = length sequence mean : P ,weesprey whereas S7A), Fig. Appendix, SI .0;Fg ) h VPgn fam- gene SVSP the 4), Fig. 0.002; = 2.5, = P R p R p =0.03 =0.007 https://doi.org/10.1073/pnas.2015579118 .0) n the and 0.002), = 2 2 =0.25 =0.17 P .1) h T gene CTL The 0.018). = 2,26 = −0.71, .Across Methods). PNAS PLA P 0.48). = | 2 f10 of 5 gene

EVOLUTION Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021 soitdwt CD(R snakes. MCCD viperid with of associated venoms the in families gene prevalent most four the for MPD and complexity transcriptomic between relationships show 4. Fig. loasgicn rdco fSS rncitm complexity transcriptome SVSP of predictor (R significant a also f10 of rela- 6 linear continuous, a ecology revealing multi- dietary By of between complexity. testing link venom allowed the important and system for several study hypotheses in our alternative ple First, complexity 26). trait (25, venom taxonomic ways both and builds defined work of breadth categorically Our linking diet evolution time. findings the over previous phy- venoms with upon complex the diets, more with and snake association simpler of in diversity evolves logenetic complexity venom Snake Discussion predict to acid power amino has complexity. targets and venom venom diversity, specific species in diversity prey important sequence does functionally than in targets diversity venom sequence more protein relates MPD to prey strongly Thus, selection SVSPs. of in source the complexity is maintaining MCCD that hypothesis the under dicted 2 26%, = | o Left (Top estimates complexity transcriptomic of Boxplots https://doi.org/10.1073/pnas.2015579118 PNAS P 0.006; = ,a ol epre- be would as S7B), Fig. Appendix, SI Venom Transcriptome Complexity Venom Transcriptome Complexity Venom Transcriptome Complexity 2 30%, = 2500 3500 4500 5500 10000 15000 20000 0 10000 20000 30000 40000 VPSS CTL SVSP SVMP 4− 4 2 0 −2 −4 4− 4 2 0 −2 −4 Prey PhylogeneticDiversityvs.ComplexitybyGeneFamily P .0) ryMC was MCCD Prey 0.002). = Standardized MPD Standardized MPD Gene Family SVMPs PLA PLA 2 sCTLs 2 R p R p o Right (Top family gene each for mapped reads venom of proportion and ) =0.018 =0.002 2 2 =0.20 =0.32 Total hlgntclydvredesfvrmr ope eosi ot mrcnpitvipers American North in venoms complex more favor diets diverse Phylogenetically

ini osre oi agt hthsaie uigprey during arisen has varia- fundamental that mediate subtle venom targets of the targets toxin The of divergence. conserved species’ because in complexity tion venom of evolution com- venom families. within show gene levels and venom complexity we independent diversity of multiple Second, evolution diet parallel 26). the between involves (25, plexity snails) relationship Snakes) (Conid or the families by (Spiders that ancient prevented orders more or were of (24) insights sampling These sporadic complexity. the vari- several venom marked with exhibiting in years, species ation related million snakes closely past (or of the of doubled examples stark has within clade venom largely of young halved) complexity 26). been relatively effective (24, the this how relatedness showed sampling phylogenetic densely differ- their matters Also, only of than taxonomic species regardless major rather or prey groups, species venoms, prey among among targeted differences among of divergence ences evolution of the amount for argue results the our MPD, that and complexity venom between tionship Proportion of Mapped Reads eageta rypyoeei iest ieypeit the predicts likely diversity phylogenetic prey that argue We 0.0 0.2 0.4 0.6 2000 3000 4000 5000 6000 4000 8000 12000 16000 VPSS T Other CTL SVSP SVMP 4− 4 2 0 −2 −4 4− 4 2 0 −2 −4 Standardized MPD Standardized MPD Gene Family SVSPs PLA 2 R p R p =,0.002 2 = 0.48 =0.32 2 =0.02 .Scatterplots ). odn tal. et Holding Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021 hlgntclydvredesfvrmr ope eosi ot mrcnpitvipers American North in venoms complex more favor diets diverse Phylogenetically al. et Holding nacsrlsto ee ee natne ra 4) Dowell (43). array tandem a in genes seven of set ancestral an role venom a rattlesnake hypothesized Similarly, and expansion. PLA tandem (77), in 30 a selection and positive in 5 for paralogs between SVMP range in distinct can contraction showed array of and recently expansion counts these (77) family where gene all Crotalus, al. SVMP where et of rapidity ancestor Giorgianni the common present. a were from complex- proteins expressed divergence of (76). since evolution snakes the ity other evaluates from study vipers our of Hence, divergence the before other evolved by modulated are that patterns (72–75). complex- processes form diversity venom evolutionary to on phylogenetic broadly selection levels the exerts ity snake’s that communities a prey suggest against snake We defense of and 71). (45), (58, traits accel- predators of can evolution which 31), the processes (30, erate prey key resistant diversity Other with prey complexity. coevolution venom in include evolved trends in stable reflected long-term are if avail- address prey Future could in variation 31). ability temporal (30, and geographic adaptation estimating local work venom even complexity, intraspecific an venom and measure assuming MPD would prey sampling between local- relationship venom stronger matched and that diet possible between is ities It variation undersampled. population-level species , undoubtedly variation adult is whole ontogenetic sampling to for by venoms controlled locations in we single while from Similarly, scale ranges. geographic var- here in used studies ied Diet vary (68–70). can composition snakes venom of and populations diet among both caveat that One is complexity. approach venom our of in variation context. the of that portion on stantial based of of system, evolution this the all, in in complexity role not trait a play of but likely communities processes some, unmeasured where While represent (67), diversity factors contextual varying several on dent present the in shown results functional the paper. such of that much hypothesize explains We specialization (34). phyloge- prey in distinct targets physiological netically disrupt to facil- function complexity venom venom itates more in evolving where paralogs examples toxin concrete three-finger procoagulant of of ity specificity phylogenetic in from taxonomic venoms in paralogs The SVMP simple venoms 66). from complex (65, continuum generalists and a specialists is taxonomic phylogenetic result with proteins The venom for specificity. generalism selects phylogenetic preference toxic- suggest dietary in complexity in here trends venom shown These between (64). MPD/MCCD relationships prey and the of with set combined diverse highly ity is more that a venom to predicts and diet lethal 63) snakes’ relevant (62, a in prey functionally with breadth natural decreases taxonomic of from potency distance venom accrual phylogenetic snake increasing the targets, venom with in changes associated is gence venoms. some nico- of and for specificity relevance (60) taxonomic functional variation channels carries (61), ion sequence receptors as acetylcholine vivo, such tinic proteins, in conserved unexplored other largely in snake fac- is in coagulation homologs SVSPs divergent tor for to specificity extend SVSP may While with predators. that interaction (59) their proteins for heterologous variation prey coagulation of sequence species’ consequences of factor functional homologs coagulation snake well-documented on are a based There their that MCCD factors. Although showed prey (35). predicts we occurred MPD conserved, has one factor is and coagulation presence duplication a one only of tetrapods, loss extant Since 58). of (23, evolution lethality confer the to cas- receptors, or coagulation channels, the ion of cade, members as such processes, physiological h VP SVSP, SVMP, The depen- are interactions species of outcomes evolutionary The diver- phylogenetic prey that hypothesis the with Consistent 2 aebe rvosyhpteie ohv vle from evolved have to hypothesized previously been have s .neuwiedi B. PLA 2 n T eo eefamilies gene venom CTL and , 3)adtedfeeta lethal- differential the and (34) .sulphureus S. R 2 P xlisasub- a explains MPD , 3)provide (33) a ln lds u eut edu opeitaeso higher of areas predict to simi- us chemically lead but results speciose Our of can clades. (88) existence plant families the lar plant by of richness explained species be the and herbiv- of insects diversity the orous species example, explain dietary For between help diversity. relationship can biological negative in taxa, trends between global distance important with varia- physiological accumulates and/or tion for functional context where phylogenetic phylogenetic eco- generalism, The elevating dietary pressure. by selection specifically that maintained a as 17), suggest is generalism (16, quantified and complex- generalism have evolves logical trait complexity we molecular trait trends predicts molecular evolutionary that of The form species the ity. prey (in diversity of nature (species number the alone) the is than rather it diversity) interactions, phylogenetic species of diverse context to responses modular forming in species. expression roles 87) prey and relative 86, variation, the allelic (31, variation, framework, variation number comparative copy cru- a gene be in of will testing, paralogs for venom cial of characterization functional combined with data sequence The whole-genome generated. of scores availability increasing complexity paralogs, parse the of to to counts variation inability length, expression its sequence refer- and by coding a of limited of contributions is the free approach with complexity this abundance, sequence However, relative ence. quantifying their and of bases benefit 60 the selection of balancing strings unique as or as genes underexpression of the sets The to wanes. entire lead of may deletion mam- diets or genomic simplified as and such and scenario, mals, guilds, SVSP, simplest prey clustered SVMP, the proteins phylogenetically modular of Under of a module (83–85). composed in a families units varies multiple (82). functional venom chromosomes from separate that different with evidence on fashion, mounting occur is regions, families genomic There gene unlinked these multiple as across ecologically selection to response mediated variable a highlights tested families gene multigene of dynamics. example evolutionary an class-specific coevolv- such provide with plants We families parsnip protein (20). webworms in with classes ing toxin occurring unprecedented, phytochemical evolu- not evolutionary are Different among rapid (71). classes vWF toxin by the among on CTLs dynamics sites binding to the resistance of these tion coevolved and rattlesnakes, have including pitvipers, mammals on feed (Didelphidae) to in Opossums known function are their predators. by sim- coevolving influenced between be against given also complexity defense issue may sequence CTLs an The in not CTLs. variation and likely of are 81), levels variation (80, ilar extant evolution protein of of levels rates qua- but constrain as Forming can multiple 79). such of chains composed (78, receptors protein sites GP1b ligand-binding platelet and or to structures (vWF) ternary bind factor instead Willebrand but von families enzymes act gene and as heterodimers venom not multimeric other as only the function major They and Two standout: families. CTLs gene the between across in diet differences heterogeneity from selection suggests to MPD response gene and venom complexity large scriptomic of toxin maintenance functional the same for key the likely of families. a more is produce (77), increas- to merely Thus, than expression diet. rather diverse ing venoms, complex evolutionarily more an for SVMPs of selection of selection sets balancing complex the of expression and this that support rat- suggesting results in Our notion, differentiation. loss remain that niche genes gene during of screening functional lineage-specific selective to that due be hypothesized could tlesnakes (43) al. et h rmr otiuino u td ssoigta nthe in that showing is study our of contribution primary The four the of three to results our of extension the Broadly, tran- expressed CTL between relationship a of lack The PLA 2 ,a ela VP,aeprasbs anandby maintained best perhaps are SVSPs, as well as s, k mrapoc esrdepesdgn sequence gene expressed measured approach -mer https://doi.org/10.1073/pnas.2015579118 PLA 2 ee ih xs for exist might genes PNAS | PLA f10 of 7 2 s

EVOLUTION community phylogenetic diversity will produce more complex and longer k-mer sizes (SI Appendix, Fig. S8), and simulated transcriptomes defense traits on average within a community. Indeed, taxa rang- showed they are sensitive to single mutations and small changes in expres- ing from invertebrates to plants tend to be better chemically sion and gene copy number (SI Appendix, Methods and Fig. S9). Variance in defended against multiple enemies in the tropics (89), where these transcriptome complexity scores was not clearly associated with sam- phylogenetic diversity also happens to be highest in many taxa ple size and showed significant variation among lineages (SI Appendix, Fig. S10). We therefore used the mean transcriptomic complexity value for each (90, 91). Overall, our results suggest the following general princi- lineage in downstream comparative analyses, with sample size ranging from ple governing trait evolution in the planet’s entangled bank: In a 1 to 17 individuals (mean = 2.8 individuals). given ecological community, species’ phylogenetic diversity plays a large role in shaping the evolutionary trajectory of complex Measuring Venom Phenotypic Complexity. To measure phenotypic complex- traits of community members. ity, we subjected 155 whole venom samples to HPLC on a Shimadzu Prominence HPLC System (Shimadzu Scientific Instruments). We measured Materials and Methods absorbance at 220 nm for 140 min (see SI Appendix, Methods for full analyt- Sample Collection and Transcriptome Analyses. Snakes were collected from ical run parameters) and used the Chromatopac feature in the Lab Solutions the field in numerous localities across the , Mexico, and Brazil version 5.92 software (Shimadzu) to call HPLC peaks between 10 and 125 (Dataset S2 A and B). Venom was collected from each snake for proteomic min, where peaks generally represent separate venom protein classes or iso- analysis and venom glands were dissected 4 d post-venom collection for RNA forms. We retained all called peaks contributing at least 0.2% of total peak sequencing. Transcriptomes were assembled following Holding et al. (92). area and then recalculated the proportional area of each peak on the final See SI Appendix, Methods for full details. peak set. Venom protein complexity was then calculated by using the area of each HPLC peak as input into the equation for Shannon’s H Index, where Phylotranscriptomic Analysis of Nontoxin Genes. We extracted nontoxin cod- pi was the proportional area of peak i, and converting H to effective num- ing regions from the Trinity assembly of a subsample of up to five individuals ber of protein peaks as exp(H). We used the mean protein complexity value of each known lineage within our sampled transcriptomes (169 individu- for each lineage in downstream analyses, with sample size ranging from 1 als from the 68 focal taxa and 6 individuals comprising 6 outgroup taxa) to 5 individuals (mean = 2.1). using BUSCO version 3.0.1 (93, 94) searches against the “tetrapoda odb9” set of 3,950 genes from OrthoDB (95). BUSCO identified 3,633 of these Measuring Snake Diet Diversity. We measured diet diversity by collating pub- genes as “complete and single copy” sequences in at least one individ- lished gut content data from Agkistrodon, Crotalus, and Sistrurus (Dataset ual. We produced alignments of each BUSCO locus using MAFFT version S1 A and B). We tabulated counts of prey items identified to the species 7.313 (96, 97) with the GINSI algorithm to prevent overalignment: “mafft level. If prey were identified only to the level or higher, the counts –adjustdirectionaccurately –allowshift –unalignlevel 0.8 –leavegappyregion of these prey items were proportionally distributed among the prey identi- –maxiterate 10.” Alignments were trimmed using trimAl version 1.4.15 fied to species. For example, if a diet study included 3 entries—3 Peromyscus (98) using the “automated1” setting, manually inspected alignments in sp., 10 Peromyscus leucopus, and 5 Peromyscus maniculatus—we would pro- Geneious version 10.2.4 (https://www.geneious.com), kept loci present in at portionally assign the 3 genus-level observations among the species-level least 50% of samples for phylogenetic analysis. This resulted in 1,525 loci observations, resulting in 12 P. leucopus and 6 P. maniculatus as input to totaling 1,997,137 bp. later analysis. Some sampled diet species were not present in available ver- We used RAxML version 8.2.10 (99) to infer a best tree estimate for each tebrate phylogenies, and counts of these prey species were assigned to the gene based on 100 independent searches and a GTRGAMMA model. Nodal nearest congener present in the phylogeny. To control for different prey support values for the best tree estimate were attained through 1,000 boot- sample size in diet studies, we generated 1,000 random subsamples for 15 strap replicates. The best tree for each gene was then used as input for prey items. These species and count data were used to calculate Shannon’s species-tree inference in Astral-III version 5.6.1 (100). Gene tree nodes with H Index of snake diet diversity, where pi was the proportion of all diet items less than 10% bootstrap support were collapsed to polytomies (101), and we composed of species i, and H was converted to effective number of prey incorporated multiple individuals per lineage in species-tree inference (102). species as exp(H). The mean effective number of species from the 1,000 To estimate divergence dates with credibility intervals, we dated our trees subsamples was used as the prey species diversity measure for each snake with the autocorrelated relaxed clock based on the penalized-likelihood lineage. We required at least 10 diet items to be reported in the literature approach as implemented in TreePL (103) (see SI Appendix, Methods for to consider a snake lineage for comparative analyses, resulting in N = 29 more details). lineages for hypotheses testing. We incorporated evolutionary distance as a second measure of diet diver- Measuring Toxin Transcriptomic Complexity. We quantified expressed venom sity by calculating abundance-weighted MPD of snake diet diversity, using gene transcriptomic complexity by developing a k-mer–based measure sim- the ses.mpd function from picante (110). To incorporate phylogeny we gen- ilar to those used in assessments of microbial community diversity (104), erated a combined phylogeny of all listed prey species using published, counting appearances of all unique k = 60 bp sequences in the subset of dated supertrees for mammals (50), squamate reptiles (53), and venom-gland RNA-seq reads mapping to venom gene transcripts. Venom (51), as well as median node-depth consensus phylogenies for the archosaur k-mer complexity is unaffected by known sources of bias in venom studies and invertebrate taxa produced using www.timetree.org (52). We united attempting to call paralogs versus alleles and filter chimeric assemblies in these separate trees into one at the median node depth for their most large, rapidly diverging multigene families (92). We first identified assem- recent common ancestors, again provided by www.timetree.org. bled contigs containing toxin sequences using Basic Local Alignment Search Tool (BLAST) (105) searches (blastn) (e = 10−8) against a curated set of 804 Statistical Analyses. All analyses were conducted in R version 3.6.1 (111). To viperid toxin coding sequences (CDSs) spanning the known viperid venom calculate phylogenetic signal for our venom complexity metrics, we used gene families. We also used this curated set of coding sequences to calculate our Astral-III species tree (Fig. 1 and the pmc function in the pmc R package) the mean sequence length of the SVMP, SVSP, PLA2, and CTL gene fami- (112) with 1,000 bootstraps for confidence intervals. To assess relationships lies. Portions of contigs with a BLAST hit to a toxin CDS were concatenated between each diet complexity measure and each venom complexity mea- into a master database of all 191 adult snakes sampled, creating a compre- sure, we used pgls analyses as implemented in the pgls function of the caper hensive database of sampled toxin sequence diversity across Agkistrodon, R package (113), with lambda set to “ML,” various phylogenetic trees named Crotalus, and Sistrurus. We then used the mem algorithm within bwa (106) in Table 1 to accommodate alternative phylogenetic hypotheses, and venom to align the cleaned reads from each sample to this master database, pro- transcriptomic or proteomic complexity as the dependent variable. The inde- viding us with the set of reads that align to toxins for each snake. We pendent variables were prey species diversity or MPD. We conducted our randomly sampled 4 million toxin reads per snake using seqtk (107) to pgls analyses both with and without the South American rattlesnake C. standardize sampled reads per individual and then used the count utility durissus terrificus in the dataset, as it presented a large standardized resid- in Jellyfish version 2.2.1 (108) to count all unique 60-mers in the sampled ual (−2.8) and high leverage in having both the simplest transcriptome (SD = toxin reads. Finally, we used the 60-mer counts to calculate expressed toxin −3.0), simplest proteome (SD = −2.4), and the third lowest MPD of any lin- Ps eage in our pgls dataset. We thus sought to ensure that our pgls results were sequence complexity as the Shannon Diversity Index [H = − i=1 piln(pi)] of the k-mer count table from Jellyfish, where i is the ith k-mer and pi is not driven by this datapoint. Regression results were robust to its removal the proportion of all k-mers counted comprising k-mer i. Shannon’s H was (SI Appendix, Fig. S11). We present the relationships between venom com- translated to effective number of k-mer as exp(H) for downstream analyses plexity and diet diversity in the Results section with this lineage removed. (109). These transcriptomic complexity measurements were robust to shorter These results are thus unbiased by potential altered relationships between

8 of 10 | PNAS Holding et al. https://doi.org/10.1073/pnas.2015579118 Phylogenetically diverse diets favor more complex venoms in North American pitvipers Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021 Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021 0 .C .Wih,V .Fia,M .M mt,M .Bokus,Cosrssac is Cross-resistance Brockhurst, A. M. Smith, M. C. M. Friman, P. V. Wright, T. C. R. 10. 7 .Salazar D. 17. Richards Opsins, complexity. A. biological L. of eye: Evolution 16. Collier, vertebrate C. T. the Ofria, C. of Adami, complexity. C. biological Evolution of 15. foundations Physical Koonin, Pugh, V. E. Katsnelson, N. I. E. M. Wolf, I. Y. Collin, 14. P. S. bank, Lamb, tangled a in D. T. evolution parasite and 13. Host King, C. K. Rafaluk, C. Betts, A. 12. Antonovics J. 11. 9 .Znol,N .Cswl,Vnmssesa oesfrsuyn h rgnand origin the studying for for models as system systems model Venom a Casewell, as R. N. Venom Zancolli, macroevolution: G. to 19. molecules From Arbuckle, K. 18. 4 .A hog .N aadk,M .Afr,Deaybedhi oiieycorrelated positively is breadth Dietary Alfaro, E. M. Mahardika, N. G. Phuong, A. M. 24. W R., N. Casewell 23. and plant a between matching phenotype Chemical Zangerl, R. A. Berenbaum, R. M. 20. 5 .Pek S. 25. Manczinger M. varia- 22. complex histocompatibility Major Pemberton, M. J. Wilson, K. Paterson, S. 21. .Wa,E eiCsr,K arne .Kfns .Su,A lvr n S. and Oliver, A. Saul, R. Bernard, Koffinas, M. L. Aceves, Flavio Lawrence, Quillen, J. K. A. Cochran, Neri-Castro, May, K. S. E. C. May, Trumbower, Wray, Mead, C. K. Mallery, C. Speckin, J. C. Lock, J. D. Townsend, J. Garc Schmidt, Slone, Linsalata, B. J. M. J. Sabido, Deem, C. Bianca Territo, D. Salmon, Millan, Stark, Marcos G. G. J. Ortiz, Javier Price, W. Weber, Swanson, D. M. Villalobos, E. I. Petty, VanSooy, T. Swanson, McMartin, Ortiz, C. R. McCormick, D. R. E. Eaton, McNally, Mayerhofer, Jeffrey B. R. Rodriguez, Dimler, J. T. Fisher, C. T. Burkhardt, T. Feldner, Forest, Govreau, M. La R. Engeldorf, B. Dahn, Badillo, H. Adams, L. J. Solis, Williams, R. Chaparro, Ramirez- R. Franz-Ch H. in field: provided are ACKNOWLEDGMENTS. data processed (https://doi.org/10.5061/ phylogenet- and repository for Dryad S1–S8. raw loci Datasets the Other nontoxin in (115). of deposited dryad.mpg4f4qxt) been Alignments accession have S2A). Archive ics Read Dataset Sequence in and numbers Biosample and PRJNA88989 Project Availability. Data of arrival recent (114). the with associated evolution the venom and diversity dietary hlgntclydvredesfvrmr ope eosi ot mrcnpitvipers American North in venoms complex more favor diets diverse Phylogenetically al. et Holding .A et,D .Gfod .C aLa,K .Kn,Prst iest rvsrpdhost rapid herbivores: drives diversity multiple Parasite King, to C. K. resistance MacLean, C. plant R. Gifford, of R. D. Evolution Betts, A. Rausher, of 9. effect D. M. The Westra, Iwao, R. E. K. Buckling, 8. A. Paterson, S. Meaden, S. Broniewski, M. coevolution- J. host-parasite real-time 7. on predation of Effects Buckling, A. Friman, P. V. 6. .E .P ivn,J era,P ge,Y ihlks .Lnrad eopsn para- Decomposing Lenormand, T. Michalakis, Y. Agnew, P. Perreau, J. Lievens, P. J. in E. resistance evolving 5. of costs The Thompson, N. J. Buckling, A. Lin, M. D. Koskella, the B. using description 4. community Ecological Carpenter, R. S. Jonsson, T. traits Cohen, functional E. plant J. on selection 3. modify Herbivores Gianoli, E. Salgado-Luarte, C. 2. Darwin, C. 1. rc al cd c.U.S.A. Sci. Acad. Natl. Proc. interactions. bacteria–phage in modular system. bacteria-phage a in (2016). resistance 969–978 of evolution and dynamics coevolution. diffuse Quantifying evolution. resistance bacterial on (2020). diversity genetic phage dynamics. ary rpclte lineage. tree tropical U.S.A. Sci. U.S.A. Sci. Acad. Natl. Proc. cup. eye and retina photoreceptors, Parasitol. interactions. host-pathogen in resistance nonhost and system. two-parasite two-host, a in specialization Lett. of basis the reveals fitness site (2012). 1903 environments. parasite heterogeneous size. body and (2003). 1781–1786 abundance, species web, food understory. rainforest temperate a in Life for Struggle the in Races Favoured euaino vltoaynovelties. evolutionary of regulation life. of levels across biology evolutionary ihvnmcmlxt ncn snails. cone in complexity venom with venoms. of novelty evolutionary The populations. human in alleles herbivore. insect its in fvnmu rdtr:Acs td sn spiders. using study case A (2018). predators: venomous of tions unmanaged large a L.). in (1998). aries resistance (Ovis parasite and population survival ungulate juvenile with associated tion .durissus C. a suFoe,CitblMrn,Aa aa,Ja Casta Juan Salas, Alan Moreno, Cristobal Flores, Esau ´ ıa, 2 ar ´ 4,3045(2018). 390–405 (4), tal. et 6–7 (2016). 863–873 32, 97 nteOii fSeisb en fNtrlSlcin rPeevto of Preservation or Selection, Natural of Means by Species of Origin the On rgnadmitnneo hmcldvriyi species-rich a in diversity chemical of maintenance and Origin al., et 9,46–48(2000). 4463–4468 (9), eo ln ieadvnmcmlxt seta rpi adapta- trophic Essential - complexity venom and size gland Venom , h rgno pcfiiyb en fntrlslcin Evolved selection: natural of means by specificity of origin The al., et cl Lett. Ecol. htceia iest rvspatisc omnt diversity. community plant–insect drives diversity Phytochemical al., et ope steln ot mrcnlnaeo rattlesnakes of lineage American South lone the as complex ahgndvriydie h vlto fgnrls MHC-II generalist of evolution the drives diversity Pathogen al., et se . okF . arsnR . r .G 21)Cmlxcocktails: Complex (2012) G. B. Fry A., R. Harrison J., F. Vonk W., uster ¨ vz .T hmd-arlo .A el or Loranca, Torre la de A. M. Ahumada-Carrillo, T. I. avez, hr eddt aebe eoie nGnak(Bio- GenBank in deposited been have data read Short ´ rc al cd c.U.S.A. Sci. Acad. Natl. Proc. a.Eo.Evol. Ecol. Nat. 16 etaktefloigpol o epi the in help for people following the thank We 07–07 (2015). 10973–10978 112, 87–88 (2018). E8678–E8687 115, 1,3–6(2013). 39–46 (1), LSBiol. PLoS m Nat. Am. a.Rv Neurosci. Rev. Nat. 2 rnsEo.Evol. Ecol. Trends o.Bo.Evol. Biol. Mol. rc al cd c.U.S.A. Sci. Acad. Natl. Proc. m Nat. Am. M Genom. BMC 6,9390(2018). 983–990 (6), Jh ury 89 odn K 1859). UK, London, 1859, Murray, (John rc al cd c.U.S.A. Sci. Acad. Natl. Proc. 17 LSBiol. PLoS oio X Toxicon 149 1,e003 (2019). e3000131 (1), 2,3635(1997). 316–335 (2), 180 95 rc al cd c.U.S.A. Sci. Acad. Natl. Proc. 2) 34–34 (1998). 13743–13748 (23), 16 2,E2E3(2012). E42–E53 (2), 004(2020). 100034 6, 17 7729 (2020). 2777–2790 37, 1) 2007(2018). e2006057 (10), Evolution 28:219–229. 1,41(2016). 401 (1), 8 1) 6–7 (2007). 960–976 (12), o.Ecol. Mol. SEJ. ISME 67 95 279 1,19(2013). 1–9 (1), Evolution rc al Acad. Natl. Proc. 14 neda-Gayt ˜ 7,3714–3719 (7), 4257–4269 27, 13) 1896– (1735), 3,828–836 (3), 100 Trends 70 Evol. an, ´ (5), (4), 6 .R eebu,A .Znel asi ewrsadhs lnsa oeand home at Go diversity plants J. host parasite 47. and High webworms Parsnip King, Zangerl, C. R. K. A. Berenbaum, MacLean, R. C. M. R. 46. Zelek, M. Gray, C. cascading unlocks Betts, mutation A. single A 45. Rokyta, R. D. Mason, J. A. Whittington, C. A. 44. Dowell transcrip- L. venom-gland N. The 43. Aronow, K. Margres, J. M. Lemmon, R. A. Rokyta, R. signifi- D. biological 42. and Trends rattlesnakes: in composition “Venom Mackessy, P. S. 41. Conant, P. R. Gloyd, K. H. horridus. 40. Crotalus rattlesnake, timber the of Diet Clark, W. Ho R. Je. 39. Freed, P. Uetz, P. 38. Greene, W. H. Martins, M. Alencar, R. L. 37. Davidson J. coagulation. C. blood 36. vertebrate of evolution Step-by-step Doolittle, F. R. 35. Bernardoni L. J. 34. prey- distinct of evolution Adaptive Mackessy, P. S. Frietze, S. Mrinalini, Modahl, com- M. a C. of 33. adaptation local Evaluating Gibbs, L. H. Farrell, M. T. Smiley-Walters, A. S. 32. resistance Margres venom J. and M. function venom 31. of Coevolution Gibbs, H. Biardi, J. Holding, rat- M. Pygmy species: 30. of importance evolu- The venom Gibbs, L. snake H. Farrell, on M. T. brakes Smiley-Walters, A. the S. Putting 29. Kini, Manjunatha R. Fry, G. B. Li, M. changes profile 28. toxin the for implications Its diet: Eggs-only Kini, M. R. Fry, Evi- G. B. diet: Li, and M. activities toxic 27. venom snake of Coevolution Arbuckle, K. Davies, L. E. 26. rc,L lna,L ihrs n he nnmu eiwr mrvdthe improved reviewers anonymous three and General S. manuscript. Richards, by of L. suggestions Alencar, Helpful Institute L. P20GM109094). an (Grant National Price, NIH by Clemson the the of supported the Sciences from is Medical Award from which Computing Development Facility, resources High-Performance Genomics Institutional computing and Palmetto with Bioinformatics Clemson University supplemented the and resources computing by Cluster American Parallel the and J.L.S.). provided through and A.J.M., (A.J.M. were Fund History J.L.S., Memorial Natural (to Roosevelt of Museum Theodore Grant Southwestern the Research the and McCarley (J.L.S.), M.B.), Naturalists Grant Grants-in-Aid- of Research Xi Sigma SnakeDays Association (J.L.S.), a Inc. (J.L.S.), Research Biotic of-Research Grant Prairie Tecnologia M.B.), y Ciencia (to de 247437 Nacional through Consejo University (C.L.P.), Clemson funds Professor), startup Visiting faculty provided Manaus a was as A.M.M.-d.-S. support (to Additional I.L.M.J.-d.-A.). Fundac¸by A.M.M.-d.-S., DEB and F.G.G., and M.L.H.); and (to (to 1145978 J.L.S. 5 1711141 DEB PRFB funded C.L.P.), H.L.G.), which (to (to Fundac¸ 1638872 and 1822417 DEB C.L.P. D.R.R.), DEB (to (to and 1638902 pro- 1161228 1638879 was DUE DEB support Grants A.J.M.), Financial NSF Coleccion Biologicas. Sciences, by Ciencias of vided Univer- de Academy State Facultad Arizona Collections, Herpetologica Collections, in History used History specimens Natural Natural cataloging sity State for Angelo museums study: following this the thank We Gable. emnhdvriyadMCcasI oyopimi . in polymorphism II class (2008). 1144–1150 MHC and diversity helminth mosaic. geographic a in (2006). complexity Trophic abroad: diversification. and (2018). adaptation host accelerates neurotoxin. pitviper potent (2018). a 898 of origin the in exaptations rattlesnakes. adamanteus). (Crotalus rattlesnake diamondback 13 eastern 495–510. the pp. of 2008), tome CA, Linda, Loma Press, University Linda (Loma Eds. Bush, P. S. in cance” Review (2002). 499 2020. November 22 Accessed database.org. 2018). Ltd, Sons, network. human for implications Biol. Quant. Symp. and Harbor prey different targeting in envenomation. advantages Adaptive venom. snake (2018). prey. rear-fanged in treefrog genes toxin on specific venoms rattlesnake pigmy of tests Oecologia Reciprocal phenotype: plex (2017). differentiation. expression through exclusively ceeded prey. squirrel its and predator rattlesnake a in species. non-native related and prey native Toxicon between differs toxicity venom tlesnake sea (marbled toxins. eydouxii A2 Aipysurus phospholipase of patterns snake) evolutionary molecular unique The tion: eydouxii). (Aipysurus snake (2005). sea marbled the of ecology and diversity. toxicological favours (2019). generalism ecological that dence 1,32(2012). 312 (1), yd elc,N hronl .Mrn,Ceouinr eainhpbetween relationship Coevolutionary Morand, S. Charmonnel, N. Bellocq, de uy od Amparo de ao ˜ ¨ SceyfrteSuyo mhbasadRpie,Ofr,O,19) .614. p. 1990), OH, Oxford, Reptiles, and Amphibians of Study the for (Society 24 (2018). 42–47 144, od Amparo de ao ˜ hob Haemostasis Thromb. .K ae,K .Baa,M .Cardwell, D. M. Beaman, R. K. Hayes, K. W. Rattlesnakes, the of Biology –0(2017). 1–10 184, ur Biol. Curr. h epoii n eetls fvnmtxngnsin genes toxin venom of loss recent and origin deep The al., et uniy o ult:Rpdaatto naplgnctatpro- trait polygenic a in adaptation Rapid quality: not Quantity, al., et lSOne PloS oeua vlto ftevrert lo coagulation blood vertebrate the of evolution Molecular al., et ucinlvraiiyo nk eo metalloproteinases: venom snake of variability Functional al., et eqiad saod S de Estado do Pesquisa a ` 26 9 eqiad saod mzns(PRO-ESTADO) Amazonas do Estado do Pesquisa a h nkso h gitoo ope:AMonographic A Complex: Agkistrodon the of Snakes The ` e,“h etl aaae 22) http://www.reptile- (2020). database” “The sek, ˇ 54 (2009). 35–40 74, 1) 4424 (2016). 2434–2445 (18), 1) 195 (2014). e109651 (10), 89 o.Bo.Evol. Biol. Mol. 3,4048(2003). 420–428 (3), Jh ie & Wiley (John Vipers, of History Evolutionary https://doi.org/10.1073/pnas.2015579118 rc il Sci. Biol. Proc. 22 rc il Sci. Biol. Proc. 4,9491(2005). 934–941 (4), Science oPuoGat2015/50127- Grant Paulo ao ˜ o.Bo.Evol. Biol. Mol. Ecology o.Bo.Evol. Biol. Mol. .Ml Evol. Mol. J. 360 283 .Herpetol. J. 285 .Eo.Biol. Evol. J. 69) 0 LP–911 907 (6391), 87 12) (2016). (1829), Toxins PNAS 18) 20181003 (1884), 1) 3070–3081 (12), 3099–3110 34, M Genom. BMC 60 11 odSpring Cold 35 36 | 1,81–89 (1), 1) 711 (12), 4,887– (4), 3,494– (3), f10 of 9 21 (4),

EVOLUTION 48. L. N. Barrow et al., Deeply conserved susceptibility in a multi-host, multi-parasite 82. D. R. Schield et al., The origins and evolution of chromosomes, dosage compensation, system. Ecol. Lett. 22 (6), 987–998 (2019). and mechanisms underlying venom regulation in snakes. Genome Res. 29, 590–601 49. S. Pilosof et al., Host-parasite network structure is associated with community-level (2019). immunogenetic diversity. Nat. Commun. 5 (1), 5172 (2014). 83. H. L. Gibbs, J. E. Chiucchi, Deconstructing a complex molecular phenotype: 50. O. R. Bininda-Emonds et al., The delayed rise of present-day mammals. Nature 446 Population-level variation in individual venom proteins in eastern massasauga (7135), 507–512 (2007). rattlesnakes (Sistrurus c. catenatus). J. Mol. Evol. 72 (4), 383–397 (2011). 51. N. J. B. Isaac, D. W. Redding, H. M. Meredith, K. Safi, Phylogenetically-informed 84. A. J. Mason et al., Trait differentiation and modular toxin expression in palm- priorities for Amphibian conservation. PloS One 7 (8), e43912 (2012). pitvipers. BMC Genom. 21 (1), 147 (2020). 52. S. Kumar, G. Stecher, M. Suleski, S. B. Hedges, TimeTree: A resource for timelines, 85. B. M. Olivera, “E.E. Just lecture, 1996. Conus venom peptides, receptor and ion chan- timetrees, and divergence times. Mol. Biol. Evol. 34 (7), 1812–1819 (2017). nel targets, and drug design: 50 million years of neuropharmacology” in Molecular 53. R. Pyron, F. T. Burbrink, J. J. Wiens, A phylogeny and revised classification of Squa- Biology of the Cell (American Society for Cell Biology, 1997), Vol. 8, pp. 2101–2109. mata, including 4161 species of lizards and snakes. BMC Evol. Biol. 13 (1), 93 86. N. R. Casewell, G. A. Huttley, W. Wuster,¨ Dynamic evolution of venom proteins in (2013). squamate reptiles. Nat. Commun. 3, 1066 (2012). 54. E. Paradis, J. Claude, K. Strimmer, APE: Analyses of phylogenetics and evolution in R 87. N. R. Casewell et al., Medically important differences in snake venom composition language. Bioinformatics 20 (2), 289–290 (2004). are dictated by distinct postgenomic mechanisms. Proc. Natl. Acad. Sci. U.S.A. 111, 55. L. R. Alencar et al., Diversification in vipers: Phylogenetic relationships, time of 9205–9210 (2014). divergence and shifts in speciation rates. Mol. Phylogenet. Evol. 105, 50–62 (2016). 88. M. L. Forister et al., The global distribution of diet breadth in insect herbivores. Proc. 56. C. Blair, S. Sanchez-Ram´ ´ırez, Diversity-dependent cladogenesis throughout western Natl. Acad. Sci. U.S.A. 112 (2), 442–447 (2015). Mexico: Evolutionary biogeography of rattlesnakes (: Crotalinae: Crotalus 89. D. W. Schemske, G. G. Mittelbach, H. V. Cornell, J. M. Sobel, K. Roy, Is there a latitudi- and Sistrurus). Mol. Phylogenet. Evol. 97, 145–154 (2016). nal gradient in the importance of biotic interactions? Annu. Rev. Ecol. Evol. Syst. 40 57. S. M. T. Serrano, The long road of research on snake venom serine proteinases. (1), 245–269 (2009). Toxicon 62, 19–26 (2013). 90. A. J. Kerkhoff, P. E. Moriarty, M. D. Weiser, The latitudinal species richness gradi- 58. M. Holding, D. Drabeck, S. Jansa, H. Gibbs, Venom resistance as a model for under- ent in New World woody angiosperms is consistent with the tropical conservatism standing the molecular basis of complex coevolutionary adaptations. Integr. Comp. hypothesis. Proc. Natl. Acad. Sci. U.S.A. 111 (22), 8125–8130 (2014). Biol. 56 (5), (2016). 91. K. Safi et al., Understanding global patterns of mammalian functional and 59. P. Papareddy et al., An ecoimmunological approach to study evolutionary and ancient phylogenetic diversity. Phil. Trans. Biol. Sci. 366 (1577), 2536–2544 (2011). links between coagulation, complement and Innate immunity. Virulence 9 (1), 724– 92. M. Holding, M. Margres, A. Mason, C. Parkinson, D. Rokyta, Evaluating the perfor- 737 (2018). mance of de novo assembly methods for venom-gland transcriptomics. Toxins 10 (6), 60. A. H. Rowe, Y. Xiao, M. P. Rowe, T. R. Cummins, H. H. Zakon, Voltage-gated sodium 249 (2018). channel in grasshopper mice defends against bark scorpion toxin. Science 342 (6157), 93. F. A. Simao,˜ R. M. Waterhouse, P. Ioannidis, E. V. Kriventseva, E. M. Zdobnov, 441–446 (2013). BUSCO: Assessing genome assembly and annotation completeness with single-copy 61. C. N. Zdenek et al., A taxon-specific and high-throughput method for measuring orthologs. Bioinformatics 31 (19), 3210–3212 (2015). ligand binding to nicotinic acetylcholine receptors. Toxins 11 (10), 600 (2019). 94. R. M. Waterhouse et al., BUSCO applications from quality assessments to gene 62. K. Healy, C. Carbone, A. L. Jackson, Snake venom potency and yield are associated prediction and phylogenomics. Mol. Biol. Evol. 35 (3), 543–548 (2018). with prey-evolution, predator metabolism and habitat structure. Ecol. Lett. 22 (3), 95. E. V. Kriventseva et al., OrthoDB v10: Sampling the diversity of animal, plant, fungal, 527–537 (2019). protist, bacterial and viral genomes for evolutionary and functional annotations of 63. A. Pomento, B. Perry, R. Denton, H. Gibbs, M. Holding, No safety in the trees: orthologs. Nucleic Acids Res. 47 (D1), D807–D811 (2018). Local and species-level adaptation of an arboreal squirrel to the venom of sympatric 96. K. Katoh, D. M. Standley, A simple method to control over-alignment in the MAFFT rattlesnakes. Toxicon 118, 149–155 (2016). multiple sequence alignment program. Bioinformatics 32 (13), 1933–1942 (2016). 64. K. Lyons, M. M. Dugon, K. Healy, Diet breadth mediates the prey specificity of 97. K. Katoh, K. Misawa, K. Kuma, T. Miyata, MAFFT: A novel method for rapid multiple venom potency in snakes. Toxins 12 (2), 74 (2020). sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30 (14), 3059– 65. V. Devictor et al., Spatial mismatch and congruence between taxonomic, phyloge- 3066 (2002). netic and functional diversity: The need for integrative conservation strategies in a 98. S. Capella-Gutierrez,´ J. M. Silla-Mart´ınez, T. Gabaldon,´ trimAl: A tool for automated changing world. Ecol. Lett. 13 (8), 1030–1040 (2010). alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25 (15), 66. C. M. Tucker et al., A guide to phylogenetic metrics for conservation, community 1972–1973 (2009). ecology and macroecology. Biol. Rev. 92 (2), 698–715 (2017). 99. A. Stamatakis, RAxML version 8: A tool for phylogenetic analysis and post-analysis of 67. J. N. Thompson, C. Schwind, P. R. Guimaraes,˜ M. Friberg, Diversification through mul- large phylogenies. Bioinformatics 30 (9), 1312–1313 (2014). titrait evolution in a coevolving interaction. Proc. Natl. Acad. Sci. U.S.A. 110 (28), 100. C. Zhang, M. Rabiee, E. Sayyari, S. Mirarab, ASTRAL-III: Polynomial time species 11487–11492 (2013). tree reconstruction from partially resolved gene trees. BMC Bioinf. 19 (6), 153 68. J. C. Daltry, W. Wuster,¨ R. S. Thorpe, Diet and snake venom evolution. Nature 379, (2018). 537–540 (1996). 101. C. Zhang, E. Sayyari, S. Mirarab, “ASTRAL-III: Increased scalability and impacts of con- 69. M. L. Holding, M. J. Margres, D. R. Rokyta, H. L. Gibbs, Local prey community com- tracting low support branches” in Comparative Genomics, J. Meidanis, L. Nakhleh, position and genetic distance predict venom divergence among populations of the Eds. (Springer International Publishing, Cham, Switzerland, 2017), pp. 53–75. northern Pacific rattlesnake ( Crotalus oreganus ). J. Evol. Biol. 31, 1513–1528 (2018). 102. M. Rabiee, E. Sayyari, S. Mirarab, Multi-allele species reconstruction using ASTRAL. 70. M. M. Webber, T. Jezkova, J. A. Rodr´ıguez-Robles, Feeding ecology of sidewinder Mol. Phylogenet. Evol. 130, 286–296 (2019). rattlesnakes, Crotalus cerastes (viperidae). Herpetologica 72 (4), 324–330 (2016). 103. S. A. Smith, B. C. O’Meara, treePL: Divergence time estimation using penalized 71. S. A. Jansa, R. S. Voss, Adaptive evolution of the venom-targeted vWF protein in likelihood for large phylogenies. Bioinformatics 28 (20), 2689–2690 (2012). opossums that eat pitvipers. PloS One 6 (6), e20997 (2011). 104. V. B. Dubinkina, D. S. Ischenko, V. I. Ulyantsev, A. V. Tyakht, D. G. Alexeev, Assessment 72. V. Williams, J. White, T. D. Schwaner, A. Sparrow, Variation in venom proteins from of k-mer spectrum applicability for metagenomic dissimilarity analysis. BMC Bioinf. 17 isolated populations of tiger snakes (Notechis ater Niger, N. scutatus) in South (17), (2016). Australia. Toxicon 26 (11), 1067–1075 (1988). 105. C. Camacho et al., BLAST+: Architecture and applications. BMC Bioinf. 10 (1), 421 73. S. D. Aird et al., Population genomic analysis of a pitviper reveals microevolutionary (2009). forces underlying venom chemistry. Genom. Biol. Evol. 9 (10), 2640–2649 (2017). 106. H. Li, R. Durbin, Fast and accurate short read alignment with Burrows–Wheeler 74. D. R. Rokyta, K. P. Wray, M. J. Margres, The genesis of an exceptionally lethal venom transform. Bioinformatics 25 (14), 1754–1760 (2009). in the timber rattlesnake (Crotalus horridus) revealed through comparative venom- 107. H. Li, seqtk Toolkit for processing sequences in FASTA/Q formats (2012). gland transcriptomics. BMC Genom. 14, 394–415 (2013). https://github.com/lh3/seqtk. Accessed 16 March 2020. 75. G. Zancolli et al., Is hybridization a source of adaptive venom variation in rat- 108. G. Marc¸ais, C. Kingsford, A fast, lock-free approach for efficient parallel counting of tlesnakes? A test, using a × viridis hybrid zone in southwestern occurrences of k-mers. Bioinformatics 27 (6), 764–770 (2011). . Toxins 8 (6), 188 (2016). 109. L. Jost, Entropy and diversity. Oikos 113 (2), 363–375 (2006). 76. B. G. Fry et al., Evolution of an arsenal: Structural and functional diversification of 110. S. W. Kembel et al., Picante: R tools for integrating phylogenies and ecology. the venom system in the advanced snakes (caenophidia). Mol. Cell. Proteomics 7 (2), Bioinformatics 26 (11), 1463–1464 (2010). 215–246 (2008). 111. R Core Team, R: A Language and Environment for Statistical Computing (R 77. M. W. Giorgianni et al., The origin and diversification of a novel protein family in Foundation for Statistical Computing, Vienna, 2019). venomous snakes. Proc. Natl. Acad. Sci. U.S.A. 117 (20), 201920011 (2020). 112. C. Boettiger, G. Coop, P. Ralph, Is your phylogeny informative? Measuring the power 78. J. A. Eble, Structurally robust and functionally highly versatile—C-Type Lectin of comparative methods. Evolution 66 (7), 2240–2251 (2012). (-related) proteins in snake venoms. Toxins 11, 136. 113. D. Orme et al., Caper: Comparative Analyses of Phylogenetics and Evolution in R 79. K. J. Clemetson, A. Navdaev, D. Dormann,¨ X. Y. Du, J. M. Clemetson, Multifunctional (2018). R package version 5.2: 1–36. snake C-type lectins affecting platelets. Pathophysiol. Haemostasis Thrombosis 31 (3- 114. W. Wuster¨ et al., Tracing an invasion: Landbridges, refugia, and the phylogeography 6), 148–154 (2001). of the neotropical rattlesnake (serpentes: Viperidae: Crotalus durissus). Mol. Ecol. 14 80. D. Odokonyero et al., Loss of quaternary structure is associated with rapid sequence (4), 1095–1108 (2005). divergence in the OSBS family. Proc. Natl. Acad. Sci. U.S.A. 111 (23), 8535–8540 (2014). 115. M. L. Holding, C. L. Parkinson. Data for “Phylogenetically diverse diets favor more 81. T. Sikosek, H. S. Chan, Biophysics of protein evolution and evolutionary protein complex venoms in North American pitvipers”. Dryad. https://doi.org/10.5061/dryad. biophysics. J. R. Soc. Interf. 12, 20150915 (2014). mpg4f4qxt. Deposited 19 December 2020.

10 of 10 | PNAS Holding et al. https://doi.org/10.1073/pnas.2015579118 Phylogenetically diverse diets favor more complex venoms in North American pitvipers Downloaded at CLEMSON UNIV ACQ/RECEIVING on April 19, 2021