COMPARISON OF RUMEN MICROBE TAXA AMONG WILD AND DOMESTIC RUMINANTS

A thesis presented to the Faculty of the Graduate School University of Missouri

In partial fulfillment of the requirements for the Degree

PhD

By TASIA MARIE TAXIS

Dr. William R. Lamberson Thesis Advisor

DECEMBER 2015

The undersigned, appointed by the Dean of the Graduate School, have examined the thesis entitled

COMPARISON OF RUMEN MICROBE TAXA AMONG WILD AND DOMESTIC RUMINANTS

Presented by Tasia Marie Taxis

A candidate for the degree of PhD

And hereby certify that in their opinion it is worthy of acceptance.

Dr. William R. Lamberson, advisor University of Missouri, Columbia, MO

Dr. Robert L. Kallenbach, external committee member University of Missouri, Columbia, MO

Dr. Gavin C. Conant University of Missouri, Columbia, MO

Dr. Kristi M. Cammack University of Wyoming, Laramie, WY

DEDICATION

My professional and personal growth would have been stunted if it weren’t for the guidance and support of so many people, too many to mention. I would like to acknowledge my biological, academic, and adopted-Mizzou family. My biological family provided my foundation on which I still stand. While physically 1,000 miles away, they were always close. My mother, Mary Ann Taxis, has always grounded me back to the ‘go get it’ girl she raised. My father, Craig Taxis, has always sent extra hugs and love, which in a note or voice, could always be felt. My brother, Travis Taxis, was honest and provided the kick when I needed it and support when I didn’t. He will forever be my best friend and favorite hunting partner.

My academic family contains too many to name here, but Mike Carson was one of the first voices that encouraged me to reach for the stars and remain driven towards my goals. Having Drs. Chris Bidwell, Jolena Wadell-Fleming, Eduardo Casas, Bob Weaber, and Bill Lamberson as mentors has made me a better person in life and research.

My adopted-Mizzou families, The Mallory Family and the Kendrick Family have always provided support and encouragement. The Mallory family is responsible for introducing me to Grant Kendrick, of which kind words will never be enough. Grant has been there to celebrate the small accomplishments and great victories, and he has been support through the stressful and hard times.

I am fortunate to have paved this road, but I wouldn’t have gotten here without the support, love, and encouragement from so many. For that, I am forever grateful.

ACKNOWLEDGEMENTS

There are many individuals that have played an important role over the past several years in assisting me to reach this point. While it is impossible to thank everyone,

I would like to express my appreciation to several individuals for their assistance and guidance towards my Doctoral experience. Foremost, I would like to thank my advisor,

Bill Lamberson for his guidance, patience, and enthusiasm for me to succeed. He is one of the best! As a mentor, not only does Dr. Lamberson provide the right amount of control and freedom, but he listens. He has provided encouragement and advice when needed, and not only has improved my statistical knowledge but has been a great resource for discussion. I have learned everything from research to generalities of life. I hope he is proud of me and I strive to become a professor/mentor like him. I’d also like to thank, Dr. Gavin Conant. He has had patience in guiding me and has helped build my bioinformatics toolbox needed for research. I would also like to thank Dr. Kristi

Cammack and Dr. Robert Kallenbach for being committee members and providing help when needed. I was fortunate to travel and work in various labs, and would like to thank the scientists that hosted me. I extracted my rumen samples in Dr. Andre Wright’s laboratory at the University of Vermont, Burlington, VT. I utilized my bioinformatics toolbox and analyzed data with Dr. Ben Hayes, Keith Savin, and Min Wang at AgriBio in

Melbourne, Australia. Last, but not least, I would like to thank Sara Wolff. Without her I would not have been able to complete this research. It was with the help of all the listed individuals, and many that are not, that I have learned how to be a true professional and example of excellence in the scientific community. I not only received an education, but gained friendships I hope to never lose. Thank you.

ii

TABLE OF CONTENTS

Acknowledgements ...... ii

List of Tables ...... v

List of Figures ...... vii

Abbreviations ...... viii

Abstract ...... ix

Chapter One

Literature Review...... 1

Introduction ...... 1

Metagenomic Analysis Techniques ...... 5

Microbiome of Ruminants ...... 13

The Rumen ...... 25

Research Objectives ...... 26

Chapter Two

Distribution of Microbial Taxa and Identification of A Core Microbiome in Ruminant Host ...... 28

Summary ...... 28

Introduction ...... 30

Materials and Methods...... 33

Results and Discussion ...... 40

Chapter Three

Effects of Host Diet, Species, and Domestication Status on Microbiome...... 59

Summary ...... 59

iii

Introduction ...... 61

Materials and Methods...... 66

Results and Discussion ...... 71

Chapter Four

Conclusions and Future Directions ...... 95

Overall Conclusions ...... 95

Future Directions ...... 99

Appendix

SAS Program: PROC DISCRIM (Step-Wise Discriminate Function) ...... 102

SAS Program: PROC DISCRIM...... 103

SAS Program: PROC NLIN (Exponential Decay Model) ...... 104

SAS Program: PROC GLM ...... 105

PERL Script: Total OTUs and Common OTUs ...... 106

SAS Program: PROC VARCOMP (Type 1 Method) ...... 112

Literature Cited ...... 113

Vita ...... 123

iv

LIST OF TABLES

Table 2.1: Metadata for Domesticated Host Animals ...... 46

Table 2.2: Metadata for Wild Host Animals ...... 47

Table 2.3: Average Number of Reads for Each Host Species at Each Analysis Step ...... 48

Table 2.4: Estimated Coefficients and Standard Errors of the Exponential Decay Curve for Each Host Species ...... 49

Table 2.5: Percent Increase in Total Observed OTUs and Decrease in Common ‘Core’ OTUs as Additional Host Species are Included in Analysis ...... 50

Table 2.6: Identified Core Microbial Community ...... 51

Table 3.1: Metadata for Domesticated Host Animals ...... 81

Table 3.2: Metadata for Wild Host Animals ...... 82

Table 3.3: Ingredients of Host Diets ...... 83

Table 3.4: Average Number of Reads for Each Host Species at Each Analysis Step ...... 84

Table 3.5: Informative OTUs used for Diet Classification of White-tailed Deer via Discriminate Function Analysis ...... 85

Table 3.6: Number of Represented OTUs, Sequence Count, and Variance Estimates for each Phylum for Host Diet, Species, and Domestication Status ...... 86

Table 3.7: OTU Identification and Variance Estimates of for Host Diet ...... 87

Table 3.8: OTU Identification and Variance Estimates of /Chlorobi Group for Host Diet ...... 88

Table 3.9: OTU Identification and Variance Estimates of Actinobacteria for Host Species ...... 89

Table 3.10: OTU Identification and Variance Estimates of Bacteroidetes/Chlorobi Group for Host Species ...... 90

Table 3.11a: OTU Identification and Variance Estimates of Firmicutes for Host Species ...... 91

v

Table 3.11b: OTU Identification and Variance Estimates of Firmicutes for Host Species (Continued) ...... 92

Table 3.12: OTU Identification and Variance Estimates of Firmicutes for Host Domestication Status...... 93

Table 3.13: OTUs only Present in one Type of Host Domestication Status ...... 94

vi

LIST OF FIGURES

Figure 2.1: Examples of the Exponential Decay Curves fit to the Rank by OTU Abundance Observed Data ...... 52

Figure 2.2: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in Cattle ...... 53

Figure 2.3: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in Sheep ...... 54

Figure 2.4: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in Bison ...... 55

Figure 2.5: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in Red Deer ...... 56

Figure 2.6: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in White-tailed Deer ...... 57

Figure 2.7: Total OTUs Identified and Number of Common ‘Core’ OTUs as Additional Host Species are Added into Analysis ...... 58

vii

ABBREVIATIONS

CO2 Carbon Dioxide

CH4 ddNTP Dideoxynucleotide

DGGE Denaturing Gradient Gel Electrophoresis

DNA Deoxyribonucleic Acid

H2 Hydrogen Gas kg Kilogram

OTU Operational Taxonomic Unit

PCR Polymerase Chain Reaction rpoB RNA Polymerase β Subunit Gene

TGGE Temperature Gradient Gel Electrophoresis

VFA Volatile Fatty Acid

viii

COMPARISON OF RUMEN MICROBE TAXA ACROSS WILD AND DOMESTIC RUMINANTS

Tasia Marie Taxis

Dr. William R. Lamberson

ABSTRACT

Research of the ruminal microbiome continues to expand our knowledge and add to the improvements in livestock production and efficiency. The objectives of this study were to analyze the distribution of microbial taxa among host species and identify microbial taxa affected by host diet, species, and domestication status. Rumen samples were collected from 48 animals, representing 5 different species, wild and domestic, and consuming a forage- or concentrate-based diet. Extracted DNA was shotgun sequenced using the

Illumina GAII platform, and 16S rDNA genes were used to classify sequences into microbial taxa, or operational taxonomic units (OTUs) for analysis. The distribution between each host species was similar, suggesting that each species had only a few highly abundant microbial taxa, although particular taxa were differently ranked among host species. With each additional species added into an analysis, more OTUs were identified and fewer of those OTUs were common among all species. Nonetheless, a

‘core microbiome’ of 9 OTUs was identified. Last, various microbial phyla and taxa were associated with host diet, species, and domestication status. Interestingly, groups of

OTUs were identified as being present or absent in one species, domestication status, or a particular group so host species.

ix

CHAPTER ONE

LITERATURE REVIEW

Introduction

The discovery and knowledge of ‘animalcules,’ or microorganisms has greatly improved since 1676 when Antonie van Leeuwenhoek looked into his microscope and saw a single cell microorganism for the first time. Since that time microorganisms have been found to be ubiquitous, living in any environment capable of sustaining other life as well as being sole inhabitants of extreme environments. Additionally, microorganisms are decomposers, recycling dead matter into available organic matter and are the primary source for nutrients (Wooley et al., 2010; Ley et al., 2008). Science has expanded from visualizing microorganisms, to culturing them, and today utilizing culture-independent molecular techniques to identify and compare microorganisms of different environments

(Morgavi et al., 2013). Metagenomics is the culture-independent genomic analysis of a microbial community sampled directly from an environment (Shah et al., 2011; Wooley et al., 2010). As a result, metagenomic studies seek to identify microbial diversity, function, and/or evolution in various environments, such as the digestive system of animals (Huson et al., 2009).

1

Gruby and Delafond (1843) were the first to report that microorganisms inhabit different parts of the gastrointestinal tract of animals, and by 1884 studies showed that microorganisms in the rumen fermented cellulose (Morgavi et al., 2013; Bergman, 1990).

In 1891, Mallevre postulated that volatile fatty acids (VFAs) were utilized by ruminants as nutrients. Through the 1940’s and 1950’s, scientists showed that feed was fermented and VFA concentrations were highest in the rumen; for the first time proving VFAs harbor nutritional importance to ruminant animals (Stewart et al., 1997; Bergman, 1990;

Van Tappeiner, 1884). Researchers then began to optimize techniques for ruminal growth in order to culture the bacteria and analyze the relationship between rumen fluid, bacteria, and fiber degradation (Krause et al., 2013; Stewart et al., 1997). As individual microorganisms continued to be cultured, isolated, and studied, scientists determined that multiple microorganisms were needed to degrade the complex substrates in the ruminant’s diet, such as cellulose, starch, and protein. In 1966, Robert Hungate developed the most accurate simulation of the ruminal environment and anaerobic techniques to isolate and grow cellulose-digesting bacteria from the rumen (Krause et al.,

2013; Morgavi et al., 2013). This earned Hungate the title, father of rumen microbiology, and kick started studies of rumen microbiology in relation to digestibility, rumen fermentation characteristics, the relationship between bacteria and VFA concentrations, and the physiology of the host animal (Morgavi et al., 2013; Towne et al., 1988;

Richmond et al., 1977). As the understanding of bacterial functions and nutritional requirements improved, significant interactions between microorganisms within the rumen were apparent. Commensalism, mutualism, competition, and predation all occur

2

between rumen microbes. Each microbe plays a role in the fermentation of digesta to produce bacterial proteins and VFAs for the host animal’s utilization; thus creating a symbiotic relationship between rumen microbes and host animal (Russell et al., 2014;

Krause et al., 2013; Jami and Mizrahi, 2012; Flint, 1997; Bergman, 1990).

Approximately 70% of the host animal’s metabolic energy for maintenance, growth, and reproduction is obtained from this fermentation process (Chaucheyras-Durand and Ossa,

2014; Morgavi et al., 2013; Hernandez-Sanabria et al., 2011; Stewart et al., 1997). In the past decade, improvements in high-throughput sequencing technologies have made the idea of analyzing microbial diversity in the rumen a reality. Until this point, culturing techniques isolated and identified approximately 200 bacterial and 100 protozoa and fungi species from the rumen, accounting for <20% of all microbial inhabitants and approximately 11% of the total bacteria present in the rumen (Deusch et al., 2015;

Chaucheyras-Durand and Ossa, 2014; Krause et al., 2013; Guan et al., 2008; Nelson et al., 2003). High-throughput sequencing has allowed studies to include or solely focus on rumen microbial diversity which has improved understanding of the importance of the rumen microbiome (Russell et al., 2014; Morgavi et al., 2013). Research continues to support associations between rumen microbial taxa and biological characteristics of the host animal; however, there is still an incomplete understanding of the function and ecology of the microbiome. Each study adds to the accumulating catalog of microbial diversity, abundance, and taxonomic composition among different hosts, providing insights into the stability and symbiosis of the host and their microbiome.

3

Host phylogeny, gut morphology, and diet are interrelated and associated with changes in microbial community composition (Ley et al., 2008). Domestication of ruminants allowed for Homo sapiens to survive in new environments by raising ruminant animals to provide a stable food supply. Consequently, ruminant animals further adapted in order to survive in different climates and digesting feed sources unsuitable for human consumption. The gastrointestinal tract increased in size and passage rate of the rumen decreased, allowing for longer fermentation of feed (Morgavi et al., 2013). This domestication characteristic affected the rumen microbiome, altering the microorganisms present, or most abundant, in order to ferment and degrade various feed sources into digestible compounds for the host animal’s use (Russell et al., 2014; Krause et al., 2013;

Morgavi et al., 2013; Jami and Mizrahi, 2012; Flint, 1997). Currently, research focuses on identifying ways to improve digestive ability and growth of production animals on various feed types. Worldwide there are approximately 1.38 billion cattle, 1.96 billion sheep and goats, and 221 million buffaloes and camelids being raised for production purposes and to sustain the livelihood of hundreds of millions of people. Human population and food consumption is increasing, and the demand for meat and milk is expected to double in the next 40 years (Henderson et al., 2015; McCann et al., 2014;

Morgavi et al., 2013). Humans need to devise way to increase the number of ruminants or amount of meat and milk produced on Earth without depleting resources. The ability for one animal to produce more meat and milk products than another animal eating a similar amount or less feed is ideal. Changes in the rumen microbiome may help provide this improvement; however, the relationship between rumen microbiology and ruminant nutrition and production is poorly understood. Furthermore, there are approximately 75

4

million wild ruminants worldwide and very few are utilized in metagenomic studies

(Henderson et al., 2015). Wild ruminants, as compared to domestic relatives, appear healthier and suffer less during periods of drought, and demonstrate a greater ability to digest diets high in fiber, plants with toxins or low protein content (Nelson et al., 2003;

Orprin et al., 1985; Richmond et al., 1977). Most metagenomic studies that include wild animals in their analysis have used animals surviving in extreme environments. For example, Orpin et al. (1985) identified microorganisms in the rumen which excel at fiber digestion that enables high-artic Svalbard reindeer (Rangifer tarandus playrhynchus) to survive on low-quality feed during the winter months. While these studies improve rumen microbiology and ruminant nutrition knowledge, studies which analyze microbial data from wild animals living in similar, or the same environment as production animals have rarely been performed.

Metagenomic Analysis Techniques

The relationship between host ruminant and rumen microorganisms has been studied since the1800’s; however, the development of the Hungate Method, by Dr. Robert

Hungate in 1966, revolutionized the exploration of the ruminal habitat (Krause et al.,

2013; Morgavi et al., 2013; Stewart et al., 1997; Bergman, 1990; Gruby and Delafond,

1843). Prior to the Hungate Method, petri dishes were used to culture and study microorganisms from the rumen. The Hungate Method used roll tubes, capped with butyl rubber stoppers, containing rumen fluid, a salivary buffering system, resazurin as a redox indicator, and a gas to displace air to simulate the rumen’s natural environment. For the first time, anaerobic microorganisms from the rumen were able to repeatedly be cultured

5

and identified (Krause et al., 2013). Because of the simplicity and utility of this method, scientists had adapted the technique to match the requirements of any anaerobic ecosystem. Today, roll tubes are still utilized to culture rumen bacteria. In brief, heat- stable ingredients are mixed (pH 6.5), boiled, and CO2 is passed into the flask before it’s sealed with a butyl rubber stopper. The contents are then autoclaved and cooled (47ºC) before addition of heat-sensitive ingredients and diluted rumen contents. Scientists have identified various culture media for enumeration, isolation, and maintenance of rumen bacteria (Handelsman, 1998; Stewart et al., 1997). For example, Hobson (1969) described media for isolating proteolytic, cellulolytic, amylolytic, and lipolytic bacteria.

Alternatively, anaerobic glove boxes can be used to culture rumen bacteria. The glove boxes are chambers of which the atmospheric gas (optimal 95% CO2 and 5% H2) can be controlled and bacteria can be grown on petri dishes placed in a glove box. This method allows for replicate techniques and has been used for successful isolation

(Krause et al., 2013; Handelsman, 1998; Stewart et al., 1997). Cultured and isolated microorganisms have provided the foundation of depth and detail of modern rumen microbiology. The excitement of improved culturing techniques and bacterial isolation continued into the mid-1980s, with few scientists addressing the un-culturable microorganisms. Eventually, scientists realized that culturing did not capture the full range of microbial diversity, and if it could, the methods would not be able to identify changes in the rumen microbial community structure (Morgavi et al., 2013; Wooley et al., 2010; Handelsman 2004).

6

Carl Woese and colleagues identified a variable region of the small subunit rRNA (16S) that they hypothesized could be used to trace genetic linkages to determine phylogenetic relationships and identify bacterial populations (Krause et al., 2013). It was determined that the 16S rRNA gene sequences contain 9 hypervariable regions (V1-V9), consisting of 50 to 100 basepairs, that differed among different microorganisms. Regions V2, V3, and V6 are the most heterogeneous among the 9 regions; therefore, they are ideal regions for differentiating among bacteria and (Langille et al., 2013; Morgavi et al.,

2013; Shah et al., 2011; Petrosino et al., 2009). In 1985, Norman Pace and colleagues utilized the 16S rRNA gene sequences to describe the diversity of microorganisms from an environment without prior culturing, and in 1988, David Stahl utilized the 16S rRNA gene sequences to analyze microbial changes in the rumen caused by feed type and antimicrobial usage (Krause et al., 2013; Handelsman, 2004; Hugenholtz, 2002). These validation studies, kick started the identification and analysis of the culturable and un- culturable microbial diversity in various environments. Currently, using the 16S rRNA gene to describe microbial diversity is the primary step in most metagenomic studies

(Shah et al., 2011).

Originally, the 16S rRNA gene sequences were analyzed by extracting DNA, cloning the rDNA gene and sequencing the clones from the transformed plasmid (Shendure and Ji,

2008; Tringe and Rubin, 2005; Muyzer et al., 1993). However, as technology advanced, first generation molecular tools provided targeted sequencing and improved techniques

7

for analyzing diversity and distribution of the rumen microbiome. Technologies such as polymerase chain reaction (PCR), fingerprinting, i.e. Denaturing Gradient Gel

Electrophoresis (DGGE), and Sanger sequencing, are examples of first generation molecular tools (Logares et al., 2012). Polymerase chain reaction amplifies the 16S rDNA gene using universal primers that directly flank the hypervariable region(s) of the

16S rDNA gene. Gene products are amplified through multiple temperature regulated cycles that allow the DNA to denature, primers to anneal, and with each primer extension, via Taq DNA polymerase the amount of gene product exponentially increases.

This technology eliminates any culturing bias and has provided further evidence that the un-culturable microorganisms are highly diverse (Handelsman, 2004). While PCR is informative in microbial identification, it hosts bias. First, the universal primers were designed from known or previously sequenced microorganisms; therefore, they may not amplify all the microorganisms present in a sample with the same efficiency, potentially not amplifying novel microorganisms (Morgavi et al., 2013; Ross et al., 2012;

Winzingerode et al., 1997). There are also sequencing artifacts caused by PCR, including formation of chimeras, formation of deletion mutations, and Taq DNA polymerase errors.

Like the use of universal primers, Taq DNA polymerase efficiency may not be the same for all DNA templates in PCR (Shah et al., 2011; Acinas et al., 2005; Winzingerode et al.,

1997). It is important to note that while these biases do exist, in most cases PCR is necessary for microbial identification and diversity studies. Using first generation molecular tools, PCR-amplified products can undergo fingerprinting for visualization, via

DGGE or sequencing for molecular analysis, via Sanger sequencing.

8

Muyzer et al. (1993) first described the fingerprinting method, DGGE. This method provides a visualization of microbial diversity within an environment using electrophoresis. In brief, PCR-amplified 16S rDNA gene products are separated based on decreased electrophoretic mobility in a polyacrylamide gel containing a linear gradient and a denaturant. Visualization of amplified and separated bands is done using a stain such as ethidium bromide or SYBR Green I. Each unique sequence, or microorganism will produce a distinctive separation pattern, or fingerprint. A temperature gradient gel electrophoresis (TGGE) can be used similarly, and sequences create a fingerprint based on their melting behavior (Deng et al., 2008; Wintzingerode et al., 1997). Validation studies have shown that this technology can detect microorganisms that consist of at least

1% of the total microorganisms in the sample (Muyzer et al., 1993). Dahllof et al. (2000) found a single species that produced banding patterns as complex as what had been reported for whole microbial communities, therefore, an alternative of the 16S rDNA gene was proposed. The RNA polymerase β subunit gene, rpoB provides an increased resolution than that of the 16S rDNA gene. Studies interested in level or lower classifications of microorganisms have used rpoB as an additional or alternative marker

(Case et al., 2007). Fingerprinting provides a snapshot of the microbiota in a sample or between environments, but it is not as helpful in cataloguing microorganisms as Sanger sequencing (Morgavi et al., 2013). Sanger sequencing uses a dye-termination method, capillary based polymer gel, and a laser to record the nucleotides of a DNA sequence. A

PCR is run including universal 16S rDNA primers and the addition of fluorescently labeled dideoxynucleotides (ddNTPs). Extension of the gene product is terminated when

Taq DNA polymerase incorporates a ddNTP. The resulting PCR product is a mixture of

9

all the possible size fragments; each fragment ending with a ddNTP. PCR products are run through a capillary based polymer gel and same size fragments exit the capillary together. As the fragments exit the capillary, a laser excites the fluorescently labeled ddNTP to record the nucleotide sequence (Shendure and Ji, 2008).

Alternatively, Sanger shotgun sequencing has also been utilized. Starting material can be extracted DNA from one organism or from a community of microorganisms. First, DNA is sheared into random fragments, then cloned into plasmid vectors and grown in monoclonal libraries to produce enough genetic material for sequencing. Primers in the

PCR flank the restriction cut site from the plasmid, and DNA is sequenced using the dye- termination method explained previously. Starting material can be genomic clones from one organism or from a community of microorganisms (Wooley et al., 2010).

First generation molecular tools have increased the number of environmental samples analyzed and also allowed for exotic or extreme environmental microbial inhabitants to be identified that were unable via culturing. The most comprehensive studies have utilized a combination of techniques and added to the understanding of microbial structure and diversity. In 1987, 11 bacteria phyla were recognized, and by 1998 an additional 25 bacterial phyla were added to the list (Hugenholtz, 2002). Since 1995, when the first bacterial genome was sequenced, 916 bacterial, 1987 viral, and 67 archaeal species have been deposited in GenBank Release 2.2.6 (Wooley et al., 2010). While progress had been made, first generation molecular tools generally identified

10

microorganisms that were abundant. Second generation molecular tools improved analysis techniques to identify rare members of the microbiome (Morgavi et al., 2013).

Advancements in technology and computing power improved sequencing technology, and have been termed ‘next generation DNA sequencing.’ For purposes of this dissertation, second generation, next generation sequencing, and high-throughput sequencing technologies are synonymous. While there are multiple companies and products which utilize next generation sequencing technology, there are similarities among available platforms. DNA to be sequenced is prepared in one of two ways: PCR amplified DNA, usually amplifying the 16S rDNA gene, or randomly fragmented extracted DNA. In brief, platform-specific adaptor, or primer sequences are ligated to the sampled DNA. These molecules, called libraries, act as template sequences in the sequencing reaction. In Illumina and 454 sequencing technologies, libraries are immobilized in a specific location the platform, either a microscopic bead or a macroscopic flow cell before the template is amplified via PCR. Once an array of polymerase colonies is produced, the sequencing chemistry is applied directly to the array of colonies. Sequencing is done by synthesis. A single base is washed across the platform. If the added base complements the unpaired template base, the incorporation causes a luciferase-based light reaction, each basepair fluorescing a different color. A computer reads the light intensity that is produced by this reaction, and records the basepair. This process is repeated over several cycles, each cycle adding a different basepair until complete (Logares et al., 2012; Wooley et al., 2010; Hudson, 2008;

Shendure and Ji, 2008). Each platform contains several hundreds of thousands of specific

11

immobilizing locations which are able to be sequenced in one reaction (Wooley et al.,

2010). Next generation sequencing produces millions of sequences that are shorter in length than Sanger sequencing products (Lu et al., 2012; Scholz et al., 2012). Since 2008, there have been three major platforms used for metagenomic studies, Roche 454,

Illumina/Solexia, and SOLiD, all of which have undergone advancements and updated platform versions. Roche 454 sequencing was the first ‘next generation sequencing’ technology to become publically available (Logares et al., 2012; Shendure and Ji, 2008).

Reads from 454 sequencing are longer than other technologies (300-600 basepairs); however, sequencing errors are high (1%) due to insertions or deletions in the sequence procedure. If the template contains a homopolymer region (three or more consecutive identical DNA bases), multiple consecutive basepair incorporations in a given cycle can happen, causing the recorded signal intensity to be incorrect (Lu et al., 2010; Wooley et al., 2010; Quince et al., 2009; Shendure and Ji, 2008). The first short read sequencer,

Illumina produces shorter reads (25-300 basepairs) than 454 sequencing and has errors

(>0.1) in substitution rather than an indel (Glenn, 2011; Wooley et al., 2010; Shendure and Ji, 2008). In general, both 454 and Illumina provide the same fraction of the total diversity in a sample; however Illumnia provides a bigger data set and has fewer errors lending it to be the preferred technology in metagenomic studies (Deusch et al., 2015;

Luo et al., 2012). SOLiD was the second short read sequencer, but has rarely been utilized in metagenomic studies (Logares et al., 2012; Glenn, 2011).

After obtaining sequencing data, delineating microbial species in order to measure community diversity is challenging. Most researchers have adopted using operational

12

taxonomic units (OTUs) for estimating microbial diversity and abundance (Jeraldo et al.,

2012; Logares et al., 2012; Case et al., 2007). In general, microorganisms displaying 97-

98% sequence identity across the 16S rDNA gene are placed into the same OTU (Jeraldo et al., 2012; Case et al., 2007). Currently, there are three ways which software platforms or pipelines do this; 1) agglomerative method, in which each sequence is an independent cluster before grouping them on sequence similarity, 2) divisive method, in which all sequences are in one cluster before diving them based on sequence similarity, and 3) partitioning method, in which clusters are formed based on having the smallest intra- cluster variance (Logares et al., 2012). While this technique has been widely accepted, it too includes bias. Horizontal gene transfer readily happens in microorganisms, and a gene can exist multiple times in one genome. The 16S rDNA gene is not exempt; therefore, possibly skewing reported estimated individual bacterial counts and total OTU counts

(Wooley et al., 2010). Whole genome sequencing avoids this bias; however constructing multiple genomes, known and unknown, from one sample is challenging (Petrosino et al.,

2009; Hudson, 2008; Tringe and Rubin, 2005). For any metagenomic study, considerations of DNA extraction techniques, PCR amplification, and choosing a sequencing technology are all to be considered.

Microbiome of Ruminants

Ruminant nutrition has traditionally focused on voluntary intake, rate of passage, digestibility, fermentation, and feed efficiency as measures of performance. Variations in diet type, feed state, and the use of additives and supplements has been used to optimize

13

animal performance and efficiency; however, research has only begun to understand the role of the microbiome to meaningful responses of the host animal. With the current advancements in sequencing and computing technologies, understanding the symbiotic relationship between the rumen microbiome and host animal will continue to improve.

Plant material entering the rumen can be classified as either storage or structural polysaccharides. The major storage polysaccharide is starch, consisting of two glucose polymers, amylose and amylopectin. Starch is easily degraded by rumen microbes; however approximately 3-50% passes through the rumen, avoiding degradation. Being a storage polysaccharide, they have a skeleton matrix surrounding starch molecules which is resistant to microbial degradation. In general, processing and mastication are enough to disrupt the matrix and allow for degradation. The matrix in cereals such as wheat and barley are degraded by proteolytic bacteria, and ruminal fungi is responsible for the main degradation of the matrix in corn and sorghum (Stewart et al., 1997). Once starch is exposed it is fermented by amylases produced by amylolytic bacteria such as:

Ruminobacter amylophilus, ruminicola, Streptococcus bovis, Succinimonas amylolytica, Selenomonas ruminantium, Butyrivibrio fibrisolvens, Eubacterium ruminantium, and spp. (Morgavi et al., 2013; Stewart et al., 1997). Structural polysaccharides, or fructosans, are polymers of fructose commonly found in grasses and as soluble components of cereal grains. They are rapidly and completely fermented in the rumen by both bacteria and protozoa. Fiber is referred to as the insoluble components of plant material, or the plant cell wall. Plant cell walls can be formed by cellulose, hemicellulose such as xyloglucan and xylan, and/or the structural component lignin

14

(Stewart et al., 1997). These components are degraded by cellulase, xylose, and other pentoses produced by cellulolytic bacteria such as: Fibrobacter succinogenes,

Ruminococcus albus, R. flavefaciens, and Butyrivibrio fibrisolvens (Morgavi et al., 2013;

Stewart et al., 1997; Bergman, 1990). The final product of microbial fermentation is acetic, propionic, and butyric acids as well as hydrogen (H2), and carbon dioxide (CO2)

(Stewart et al., 1997; Bergman, 1990; Kamstra, 1975).

Acetic, propionic, and butyric acids are the primary volatile fatty acids (VFAs) in the rumen, usually existing as anions: acetate, propionate, and butyrate. Although acetate is usually in the highest concentration, rumen concentrations of acetate: propionate: butyrate can range from 75:15:10 to 40:40:20 (Kamstra, 1975). Volatile fatty acids provide approximately 70% of the animal’s metabolic energy for maintenance, growth, and reproduction (Chaucheyras-Durand and Ossa, 2014; Morgavi et al., 2012;

Hernandez-Sanabria et al., 2011; Stewart et al., 1997). Some of the remaining 30% is used by ruminal microorganisms for growth or lost as hydrogen and methane (Bergman,

1990). All three VFAs are readily utilized by the rumen epithelium, which metabolizes

VFAs through the process of absorption and transport into the bloodstream.

Approximately 88% of the rumen VFAs are directly absorbed and 12% flow into the omasum (Bergman, 1990). Acetate accounts for approximately 90% of the total VFAs in the bloodstream, and is mainly utilized by skeletal muscle and lipogenesis in adipose tissue, with only a small percent utilized by the liver (Bergman, 1990). A small percentage of propionate is converted to lactate, i.e. 3-15% in cattle. In general, producers limit overabundance of lactate in the rumen to avoid acidosis. Most of the propionate that

15

is not converted to lactate or metabolized by the rumen wall is transported to the liver. In the liver, propionate is converted to glucose or entered into the tricarboxylic acid cycle

(TCA) as oxaloacetate (Bergman, 1990). Butyrate is mostly converted to CO2, ketone bodies, or colonic mucosa in the rumen. The remaining butyrate is transported to the liver where it converted to acetyl-CoA, longer fatty acids, or ketone bodies (Bergman, 1990).

Different diets and feed states lead to variations in VFA production. For example, animals on a hay diet demonstrate slower rates of fermentation, and animals on a high starch diet produce high propionate concentrations. The link between diet and VFA production is the rumen microbiome. Microorganisms in the rumen work together to ferment digesta and produce VFAs, which are ultimately used by the host animal.

Understanding the complex ruminal microbiome could prove to be of great importance to livestock efficiency and productivity.

Without rumen microbes, plant material could not be fermented into VFAs for use by the host animal. The rumen is not functional while the young animal is still suckling; however, rapid development occurs as microorganisms successively colonize the rumen.

Not only does it grow in size but the rumen wall villi begin to develop, important for absorption in the adult animal (Jami et al., 2013; Benson et al., 2010). It’s hypothesized that the first exposure and beginning of colonization of the rumen is obtained from contact with the mother, via birth canal, teat surface, skin, and saliva (Chaucheyras-

Durand and Ossa, 2014). Within the first day of life, anaerobic bacteria has been identified in the rumen, and by day 2 they dominate the rumen (Malmuthuge et al., 2015;

Chaucheyras-Durand and Ossa, 2014; Albecia et al., 2013; Jami et al., 2013). A high

16

abundance of the Streptococcus from the Firmicutes phylum and as well as a low abundance of Prevotella from the Bacteroidetes phylum of been identified in calves 1 to 3 days of age (Rey et al., 2014; Jami et al., 2013). By week 1 the density of cellulolytic bacteria in the rumen microbiome is stabilized; however, age-dependent variations have been found in animals up to 6 weeks of age (Malmuthuge et al., 2015).

For example, Bacteroidetes increased from 46% at 2 weeks to 75% by 6 weeks of age.

Specifically, Prevotella is in lower abundance of 2 week old animals and in high abundance by 6 weeks of age (Malmuthuge et al., 2015; Ray et al., 2014; Jami et al.,

2013). As adults, diet composition alters the microbial community, with forage-based diets increasing the microbial diversity (Morgavi et al., 2013).

Of the approximate 1012 microorganisms per gram in the rumen, a total of 19 bacterial phyla have been identified (Chaucheyras-Durand and Ossa, 2014; Henderson et al., 2013;

Krause et al., 2013; Jami and Mizrahi, 2012; Nelson et al., 2003; Flint, 1997).

Chaucheyras-Durand and Ossa (2014) reported that the majority of the phyla included

Firmicutes (53%), Bacteroidetes (31%), and Proteobacteria (4%). Additionally, microorganisms from the phyla Fibrobacteres, Actinobacteria, and Spirochaetes have been commonly identified in the rumen (Stewart et al., 1997). Descriptions of major groups of rumen bacteria are provided below:

17

Firmicutes: Butyrivibrio fibrisolvens

Butyrivibrio fibrisolvens are curved shaped rods with bluntly tapered ends and are motile by a single polar flagellum. They exist as singular, paired, or form chains. B. fibrisolvens contain a wide range of genetic variation, demonstrating a range of functions (Stewart et al., 1997). They have been shown to have amylolytic and proteolytic activity. B. fibrisolvens can utilize acetate, lactate, xylan, and a wide range of sugars to produce butyrate, acetate, propionate, and on concentrate-based diets will produce lactate (Tajima et al., 2000; Stewart et al., 1997). While B. fibrisolvens is represented in ruminants fed a variety of diets, they are highly abundant in animals fed a forage-based diet (Singh et al.,

2014; Fernando et al., 2010; Stewart et al., 1997). Reclassification has been suggested due to the variety of genetic variation and functions (Stewart et al., 1997).

Firmicutes: Eubacterium spp.

First described by Bryant and Burkey in 1953, Eubacterium spp. are short rods arranged as singular, paired, or short chains and include E. ruminantium, E. limosum, E. uifromis, and E. xylanophilum (Chaucheyras-Durand and Ossa 2014; Stewart et al., 1997). E. ruminantium demonstrates hemicellulolytic activity to produce butyrate, formate, and lactate (Chaucheyras-Durand and Ossa, 2014; Stewart et al., 1997). Formate can be converted to H2 and CO2 or directly serve as a substrate for ruminal methanogenesis

(Stewart et al., 1997; Hungate et al., 1970). E. limosum utilizes H2 and CO2 to produce acetate, and E. uifromis and E. xylanophilum degrade xylan to produce butyrate and

18

formate (Stewart et al. 1997). In general, Eubacterium spp. are forage digesters and therefore associated with host animals fed a forage-based diet.

Firmictues: Lactobacillus spp.

Lactobacillus spp. are motile and non-motile rod shaped bacteria commonly found in young host animals and adult host animals fed a concentrate-based diet. They ferment a variety of sugars to produce lactate, and L. ruminis produces lactose. Two species of L. ruminis have been identified, one species is found only in cattle 3-6 weeks of age and the other is found in both young and mature cattle. Strains isolated from the rumen include L. acidophilus, L. casei, L. fermentum, L. plantarum, L. buchneir, L. brevis, L. cellobiosus,

L. helveticus, and L. salivarius (Stewart et al., 1997).

Firmicutes: Ruminococcus spp.

Ruminococcus spp. are non-motile cocci, forming long chains, requiring vitamins, branched-chain fatty acids, and ammonia for growth. R. albus and R. flavefacines are cellulolytic and are the most active bacteria in fiber degradation (Chaucheyras-Durand and Ossa, 2014; Jami et al., 2013; Kittelmann et al., 2013; Morgavi et al., 2013; Carberry et al., 2012; Singh et al., 2008; Stewart et al. 1997). R. albus shows a wider range of substrate fermentation and is more abundant than R. flavefacines. R. albus has been identified in the rumen of cattle 1 day of age, while R. flavefacines was identified as early as 3 days of age (Jami et al., 2013). Both species utilize cellulose and xylan to produce

19

acetate and H2; however, R. flavefacines is also known to produce succinate (Stewart et al., 1997). Succinate does not accumulate in the rumen as it is rapidly converted into propionate. The large amount of H2 produced by Ruminococcus spp. has shown to be positively correlated with Methanobrevibacter leading to methane production

(Kittelmann et al., 2013). Interestingly, changes of Ruminococcus spp. under prolonged selection have been documented. For example, R. flavefacines initially lost the ability to degrade dewaxed cotton, but the function was regained through selection (Stewart et al.,

1997).

Firmicutes: Selenomonas ruminantium

Selenomonas ruminantium is a motile banana-shaped curved rod having up to 16 flagella attached in a linear array to the middle of the concave side of each cell. B-vitamins and

CO2 is required for growth, and some studies show utilization of H2. S. ruminantium ferment a variety of sugars to produce acetate, propionate, and lactate. They lack the ability to ferment pectins and xylans; therefore lending to degradation of products that other cellulolytic bacteria produce (Stewart et al., 1997). S. ruminantium is associated with animals fed a concentrate-based diet, with increased amount of propionate produced as additional grain is supplied in the diet (Fernando et al., 2010; Tajima et al., 2000;

Stewart et al., 1997). In high glucose environments, S.ruminantium will produce lactate while in low glucose environments production of acetate and propionate are increased

(Stewart et al., 1997).

20

Firmicutes: Streptococcus bovis

Streptococcus bovis is a non-motile ovoid to coccal shaped amylolytic bacteria, requiring

CO2 and biotin for growth. S. bovis has been identified in cattle at 1 day of age, and increases in number as animals are fed a concentrate-based diet (Jami et al., 2013;

Fernando et al., 2010). In particular, S. bovis has been of interest because of its rapid growth and utilization of starch to produce lactate causing acidosis in animals fed excess starch (McCann et al., 2014; Jami et al., 2013; Xie et al., 2013; Stewart et al., 1997).

Bacteroidetes: Anaerovibrio lipolytica

Anaerovibrio lipolytica is a curved rod gaining motility by a single polar flagellum requiring amino acids, vitamins, and folic acid for growth (Stewart et al., 1997). A. lipolytica utilizes glycerol to produce propionate and succinate, and sugars such as ribose and fructose to produce acetate, propionate, and CO2. Interestingly, A. lipolytica demonstrates lipolytic activity and can utilize lactate for growth (Stewart et al., 1997;

Prins et al., 1975).

Bacteroidetes: Prevotella spp.

One of the most abundant groups of rumen bacteria, Prevotella spp. are found in animals on a variety of diets (McCann et al., 2014; Stewart et al., 1997). Prevotella spp. can utilize both starch and plant cell wall polysaccharides such as xylan and pectin; however, they lack the ability to degrade cellulose. Additionally, all Prevotella spp. have

21

demonstrated the ability to degrade proteins as well as degrade and absorb peptides (Jami et al., 2013; Stewart et al., 1997). Three species, P. ruminicola, P. bryantii, and P. brevis have been commonly identified, and all produce acetate, succinate, and propionate

(McCann et al., 2014; Morgavi et al., 2013; Carberry et al., 2012; Stewart et al., 1997). P. ruminicola accounts for the largest number of Prevotella spp. and is present in animals 1 day of age while P. bryantii and P. brevis appear in the rumen as early as animals 2 months of age (Jami et al., 2013; Stewart et al., 1997). P. ruminicola and P. bryantii ferment both hemicellulose and starch, therefore being present in both concentrate- and forage-fed host animals (Chaucheyras-Durand and Ossa, 2014; Fernando et al., 2010;

Singh et al., 2008; Stewart et al., 1997). Also, both species lack the ability to degrade cellulose, relying on cellulolytic bacteria to survive. P. brevis lacks the ability to ferment xyloase and xylan, but ferments gum arabic, a plant polysaccharide. The level of physiological differences between P. brevis and other Prevotella spp. suggests a distinct niche for the species in the rumen environment (Stewart et al., 1997).

Proteobacteria: Ruminobacter amylophilus

Ruminobacter amylophilus, previously known as Bacteroides amylophilus is an oval-to- long rod shaped amylolytic bacteria (Stackebrandt and Hippe, 1986; Hamlin and

Hungate, 1956). R. amylophilus requires CO2 and ammonia for growth and utilizes starch and glycogen to produce acetate, succinate, and formate. Although, R. amylophilus predominately digests starch, proteolytic activity has been recorded. Instead of providing

22

extracellular amylases, the starch molecules bind directly to R. amylophilus’ cell surface and are transported into the cell prior to hydrolysis.

Fibrobacteres: Fibrobacter succinogenes

After approximately 2 months of age, Fibrobacter succinogenes can be identified in the rumen of cattle (Jami et al., 2013). Using roll tubes, population estimates of F. succinogenes are underestimated; nonetheless, it is the most widespread and effective cellulolytic bacteria of the rumen. F. succinogenes is rod shaped, requires valerate and isobutyrate for growth, and utilizes cellulose to produce acetate and succinate

(Chaucheyras-Durand and Ossa, 2014; Jami et al., 2013; Kittelmann et al., 2013;

Morgavi et al., 2013; Carberry et al., 2012; Fernando et al., 2010; Singh et al., 2008;

Stewart et al., 1997). Research has found a positive correlation between F. succinogenes and Methanobrevibacter, and F. succinogenes the predominant cellulolytic bacteria in sheep receiving avoparcin, an antibiotic, suggesting resistance to antibiotics (Kittelmann et al., 2013; Stewart et al., 1997).

Actinobacteria: Bifidobacterium spp.

Members of the Actinobacteria are identified regularly (70% of host animals) in the rumen, representing approximately 3% of the total rumen bacteria (Sulak et al., 2012;

Stewart et al., 1997). Originally named Lactobacillus bifidus, Bifidobacterium is commonly identified in the rumen (Stewart et al., 1997; Scardovi, 1986; Hungate, 1966).

23

Bifidobacterium spp. utilizes glucose to produce acetate, lactate, and sometimes succinate and CO2. The complete function of Bifidobacterium spp. are unknown; however, an increase in species has been found in starch diets, suggesting amylolytic activity. Strains identified in the rumen include B. globosum, B. longum, B. adolescentis, B. thermophilum, B. boum,B. ruminale, B. merycicum,and B. ruminantium.

Spirochaetes: Treponema spp.

Spirochaetes are spiral organisms that account for 1-6% of the bacteria in the liquid portion of digesta, and are isolated with cellulolytic bacteria. Spirochaetes produce succinate, ethanol, and some species hydrogenate unsaturated fatty acids; however, do not use or incorporate saturated acids for growth. Two main strains isolated and studied in culture include Treponema bryantii and T. saccharophilum. T. bryantii is a motile helical rod with 1 flagellum on each end of the cell. It requires isobutyrate, 2- methlybutyrate, and B-vitamins for growth, and utilizes glucose and pectin to produce formate, acetate, succinate, and CO2. T. saccharophilum is helical shaped with a large bundle of flagella. Isobutyrate is required for growth, and T. saccharophilum utilizes a wide range of substrates such as pectin, starch, and multiple sugars to produce acetate, formate, and ethanol (Stewart et al., 1997).

Above, I have discussed a few of the microorganisms inhabiting the rumen. With advances in technology, research continues to learn about microbial species as well as identify additional species, and associate microbial community and/or diversity to host

24

phenotypic characteristics. McCann et al. (2014) found that microorganisms of the

Fibrobacter phylum increased in host animals fed hay, whereas microorganisms of the

Firmicutes and Bacteroidetes phyla decreased. Alternately, S. bovis, S. ruminantium, and

P. bryantii were in higher abundance in concentrate-based diets vs. hay fed animals.

Microbial diversity identified among different diet classifications, leading to differences in VFAs has led to the hypothesis that certain microbial community members are common among efficient host animals. In general, Succinivibrio dextrinosolvens,

Eubacterium rectale, Prevotella spp., and Ruminococcus spp. have been associated with feed efficiency in ruminants across various diet classifications (McCann et al., 2014;

Carberry et al., 2012; Hernandez-Sanabria et al., 2011; Guan et al., 2008). Other production traits such as fleece weight have been associated with differences in bacterial microorganisms in the rumen (De Barbieri et al., 2015). Studies continually suggest an interplay between host physiology and the rumen microbiome, offering opportunities to further improve production and efficiency in livestock.

The Rumen Methanogens

The majority of methane (87%) is produced in the rumen by methanogenic archaea

(Muro-Reyes et al., 2011). Currently methanogenic archaea, or methanogens identified in the rumen include Methanobrevibacter, Methanobacterium, , and

Methanosarcina (Whitford et al., 2001; Stewart et al., 1997). Methanogens immediately dispose of the hydrogen (H2) and carbon dioxide (CO2) that is created from the bacterial fermentation process to produce methane (CH4). This interaction between bacteria by- products and methanogens, called ‘interspecies H2 transfer’ allows for only traces of H2

25

to exist at any one moment in the rumen (Moss et al., 2000; Stewart et al., 1997). Acetate and butyrate promote methane production, while propionate serves as a competitive pathway for H2 use in the rumen (Moss et al., 2000). It is estimated that a 500 kg (1,000 pounds) cow produces approximately 800 liters of H2 which methanogens convert into

200 liters of CH4 (Moss et al., 2000; Stewart et al., 1997). Approximately 2-15% of total dietary energy of a host animal is lost to methane production, and a more feed efficient host animal has been shown to produce less methane by 20-30% and 12-19% in cattle and sheep, respectively (Muro-Reyes et al., 2011; Leahy et al., 2010; Whitford et al.,

2001; Zhou et al., 2009). Therefore, research strives to reduce methane produced by livestock in hopes to increase profitability by improving animal efficiency as well as reduce greenhouse gases emitted by the host animal (Muro-Reyes et al., 2011).

Research Objectives

The large dataset created for this dissertation includes a total of 48 host animals of 5 different species, wild and domestic, that were fed either a concentrate- or forage-based diet. Additionally, the two domestic host species, cattle and sheep, have feed efficiency records; however, analysis is not discussed in this dissertation. Chapter 2: Distribution of

Microbial Taxa and Identification of a Core Microbiome in Ruminant Host Species, has three main objectives; 1) analyze the distribution of microbial taxa identified among host species, 2) explore the distribution of microbial taxa identified from whole genome sequencing and 16S rDNA gene identification as additional host species are used in analyses, and 3) identify a core microbial community among all host species. Chapter 3:

Effects of Host Diet, Species, and Domestication Status on Microbiome, determines the

26

effects of host diet, species, and domestication status on the rumen microbiome. As presented earlier, a plethora of research has demonstrated that the host diet has a major influence on the rumen microbiome. Few studies have identified rumen microbial organisms that are affected by differences in host species or domestication status. Most wild animals studied are those surviving in extreme environments, whereas this study analyzed wild ruminants that inhabit similar environments as the domesticated species.

Few studies are fortunate to have a large enough dataset that includes multiple host animals of different species for this type of analyses.

27

CHAPTER TWO

DISTRIBUTION OF MICROBIAL TAXA AND

IDENTIFICATION OF A CORE MICROBIOME IN

RUMINANT HOST SPECIES

Summary

Research of the ruminal microbiome continues to expand our knowledge and add to the improvements in livestock production and efficiency. Metagenomic studies have benefitted from the improvement of sequencing technologies by increasing data size allowing billons of metagenomic shotgun sequences or targeted 16S rDNA genes to be identified directly from an environmental sample (Ellison et al., 2013; Hao and Chen,

2012; Lu et al., 2012). Whole genome sequencing provides the most comprehensive set of data, and identification of 16S rDNA genes for categorization of sequence data into microbial taxa (Petrosino et al., 2009). The aim of this study included three separate objectives to analyze the microbial distribution and diversity among various host species from whole genome sequencing and 16S rDNA gene classification. Rumen content was sampled from cattle, sheep, bison, red deer, and white-tailed deer (n=48) consuming either a concentrate- or forage-based diet. DNA was extracted and shotgun sequenced

28

using the Illumina GAII for identification of microbial taxa, or operational taxonomic units (OTUs). First, an exponential decay model was used to analyze the distribution of

OTUs identified among each host species. An exponential decay curve was used to fit each host species OTU rank to abundance data. The exponential coefficients between each host species were not different (p<0.05) from one another, suggesting the distribution of OTUs is similar among all host species. Methanobrevibacter smithii,

Prevotella brevis, Prevotella ruminicola, Fibrobacter succinogenes Ruminococcus flavefaciens were ranked in the top 10 most frequent OTUs among all host species. The second objective of this study was to explore the distribution of microbial taxa identified from whole genome sequencing and 16S rDNA gene identification as additional host species were used in analyses. As additional host species were included, the total number of OTUs identified increased, but the number of common OTUs among all host species decreased. Interestingly, there was less of an increase in the total number of OTUs and less of a decrease in the number of common OTUs identified as more host species were added to the analysis. Any OTU with at least 1 sequence count was included in the analysis, resulting in 143 OTUs that were common among all host species. The third objective of this study was to identify a core microbial community among all host species. More stringent than other studies, inclusion of an OTU into the ‘core’ required it to be present in >90% of the sampled host animals and have a minimum of 30 sequence counts within a host species (Henderson et al., 2015; Weimer, 2015; Jeraldo et al., 2012).

Nine OTUs fit these criteria, including Butyrivibrio, Prevotella, Oscillibacter, and

Ruminococcus, all of which have been described as predominant genera in the rumen

(Henderson et al., 2015; Chaucheyras-Durand and Ossa, 2014; Morgavi et al., 2013;

29

Stewart et al., 1997). Also supporting other studies, the methanogen Methanobrevibacter was identified as a core microbial community member (Henderson et al., 2015;

Chaucheyras-Durand and Ossa, 2014; Morgavi et al., 2013; Stewart et al., 1997).

Introduction

The relationship between rumen fluid, bacteria, and fiber fermentation has been studied since the 19th century and consequently has led to improvements in ruminant livestock production (Krause et al., 2013; Stewart et al., 1997). The ruminal microbiome contains approximately 1012 microorganisms per gram that work together to ferment and degrade consumed plant fibers into digestible compounds such as volatile fatty acids and bacterial proteins (Chaucheyras-Durand and Ossa, 2014; Henderson et al., 2013; Krause et al.,

2013; Jami and Mizrahi, 2012; Nelson et al., 2003; Flint, 1997). Research of the ruminal microbiome continues to expand knowledge and add to improvements in livestock production and efficiency.

Metagenomic studies have benefitted from the improvement of sequencing technologies, such as the Illumina platform. Sequencing run size has increased allowing for billons of metagenomic shotgun sequences or targeted 16S rDNA genes to be identified directly from an environmental sample (Ellison et al., 2013; Hao and Chen, 2012; Lu et al.,

2012). Whole genome sequencing provides the most comprehensive set of data, and identification of 16S rDNA genes allows categorization of sequence data into microbial taxa (Petrosino et al., 2009). These large datasets are used to identify microbial organisms

30

that are affected by various host animal phenotypes and/or environments. To date,

Henderson et al. (2015) has obtained one of the largest rumen microbial datasets produced from PCR amplification and 454 sequencing chemistry. The present study will be one of the first to utilize whole genome sequencing, Illumina GAII platform data, to analyze the diversity of microbial taxa, or operational taxonomic units (OTUs) and common OTUs identified among multiple ruminant host species.

The concept of a core microbial community among vertebrate hosts has been proposed in human and murine populations (Benson et al., 2010; Qin et al., 2010; Tap et al., 2009). In ruminant animals, certain microorganisms are consistently identified among all host species; however, few studies have had the power to identify a core. Studies which have included multiple host animals of different species have supported a core microbiome that is shared among all ruminant animals (Henderson et al., 2015; Weimer, 2015; Jeraldo et al., 2012). It is estimated that 7,000 bacterial and 1,500 archaeal species are present in the rumen, and account for approximately 70% of the host animal’s metabolic energy for maintenance, growth, and reproduction (Chaucheyras-Durand and Ossa, 2014; Morgavi et al., 2012; Hernandez-Sanabria et al., 2011; Stewart et al., 1997). Firmicutes and

Bacteroidetes/Chlorobi group are the most predominant phyla identified in the rumen

(Chaucheyras-Durand and Ossa, 2014; Morgavi et al., 2012). Some microbial species are consistently identified across ruminant host species, including Prevotella spp.,

Butyrivibrio spp., Ruminococcus spp., Fibrobacter spp., and Ruminococcaceae spp.

(Henderson et al., 2015; Chaucheyras-Durand and Ossa, 2014; Morgavi et al., 2012;

Stewart et al., 1997). Some common rumen microbes produce enzymes or other

31

substrates that assist other rumen microbes; for example, Prevotella spp. are unable to degrade plant cell walls but produce xylanase and CM-cellulase, suggesting a distinct niche for them in the rumen (Stewart et al., 1997). Other commonly identified rumen microbes are efficient at producing volatile fatty acids: as an example, Butyrivibrio fibrisolvens is a major producer of butyrate, which makes up 16% of the total volatile fatty acids in the rumen (Stewart et al., 1997). Lastly, some rumen microbes harbor the ability to ferment a wide range of substrates or are resistant to certain feed additives or antibiotics (Stewart et al., 1997). Methanogens are members of the Archaea domain that utilize bacteria microbial by-products, H2 and CO2, to produce methane.

Methanobrevibacter and Methanobacterium, are also commonly identified in the rumen of host animals, and have been suggested to be part of the ruminant host ‘core’ microbiome (Chaucheyras-Durand and Ossa, 2014; Stewart et al., 1997).

Currently, this study contains one of the largest ruminant datasets and includes 5 different wild and domesticated host species. Unlike other studies that include domestic and wild animals, the wild host species sampled were those that live within the same or similar environments as the domestic host species (Nelson et al., 2003; Orpin et al., 1985). The objectives of this study was to: 1) analyze the distribution of microbial taxa identified among host species; 2) explore the distribution of microbial taxa identified from whole genome sequencing and 16S rDNA gene identification as additional host species were used in analyses; and 3) identify a core microbial community among all host species.

32

Materials and Methods

All live animal procedures were conducted in compliance with the Guide for the Care and

Use of Laboratory Animals and The Guide for the Care and the Use of Agricultural

Animals in Agricultural Research and Teaching. Protocols for sheep (assurance A-3216-

01) and cattle (protocol 8048) sample collection were approved by the University of

Wyoming Institutional Animal Care and Use Committee (Laramie, WY) and the

University of Missouri Animal Care and Use Committee (Columbia, MO), respectively.

Host Animals and Diet

Full rumen contents were collected from five different species of ruminants (n=48) consuming either a concentrate- or forage-based diet. Rumen content was collected via oral lavage from sixteen Suffolk, Hampshire, and Rambouillet purebred sheep (Ovis aries) housed at the University of Wyoming Agriculture Research and Extension Center near Laramie, Wyoming and 16 British x Continental crossbred cattle (Bos tarus) housed at the University of Missouri South Farm near Columbia, Missouri. Equipment outlined by Geishauser (1993) was used to collect sheep and cattle rumen content. Briefly, an oro- ruminal probe, suction pump, and funnel were assembled. When the probe was passed into the animals’ rumen, approximately 80 mL of rumen content was pumped into a sterilized collection container. Additional domestic host animal metadata is presented in

Table 2.1. Bison were raised at Kentucky Bison Ranch (n=2) in Bagdad, Kentucky,

Northstar Bison (n=4) in Rice Lake, Wisconsin, and Crown Valley Winery (n=2) in Ste.

33

Genevieve, Missouri and slaughtered at Memphis Meats Processing in Memphis, Indiana,

Northstar Bison Processing, and Wright City Meats in Wright City, Missouri, respectively. Red deer were raised at Rolling Hills Red Deer Farm in Catawissa,

Pennsylvania and slaughtered at Bixlers Country Meats in Hegins, Pennsylvania. Rumen contents from white-tailed deer (Odocoileus virginianus) were collected, with permission, from animals harvested by licensed hunters during Pennsylvania deer hunting season in Herndon, Pennsylvania. Additional wild host animal metadata is presented in

Table 2.2.

All host animals had access to feed ad libitum and were healthy at time of sample collection. One-half of the cattle (n=8), sheep (n=8), and bison (n=4) were fed a concentrate-based diet consisting of primarily corn, and the other half of the cattle (n=8), sheep (n=8), and bison (n=4) as well as red deer (n=4), were fed a forage-based diet containing alfalfa pellets, grasses, and/or had free-range pasture access (diet information not shown). White-tailed deer species were confirmed to be fed a forage-based diet using

SAS via discriminant function analysis (SAS Institute, Inc., Cary, NC). Discriminant

Function analysis is used to classify observations into two or more known groups on the basis of one or more quantitative variables.

DNA Extraction from Rumen Fluid

Immediately after rumen contents were sampled they were put on dry ice for transport back to the laboratory. Once at the laboratory, samples were stored in a -80ºC freezer.

34

Prior to DNA extraction, cattle, bison, red deer, and white-tailed deer samples were split into two aliquots; one aliquot was fixed in 100% ethanol solution and the second aliquot was freeze-dried for preservation (Virtis Genesis 2XL, Innovative Laboratory Systems).

Sheep samples were thawed before DNA extraction. DNA was extracted from thawed, ethanol fixed, and freeze-dried samples using the QIAamp DNA Stool Mini Kit (Qiagen,

Santa Clarita, CA) and following the protocol described by Yu and Morrison (2004).

Particulate matter was removed from the sample via low speed centrifugation. Samples were then mixed with sterilized zirconia (0.3g of 0.1mm) and silicon (0.1g of 0.5mm) beads and 1 mL lysis buffer (1.5M Sodium Chloride, 1M Tris-HCl, 0.5M EDTA, and

10% SDS mixture). Cells in the sample were lysed by homogenization using a Mini-

Beadbeater-8 at maximum speed for 3 minutes. For sufficient lysing of bacterial cells, samples were incubated at 70°C for 15 minutes with gentle mixing every 5 minutes. Each sample was centrifuged (16,000g) at 4°C for 5 minutes before collecting the supernatant.

Homogenization, incubation, and centrifugation were repeated 3 times for each sample, and the supernatants were pooled. After DNA was precipitated, RNA and proteins were removed, and purification was completed. DNA was suspended in 10 mM Tris-Cl buffer

(pH 8.5, EB buffer from Qiagen kit) and quantified using a NanoDrop (1000 3.8.1,

Thermo Scientific). Additionally, 100 ng of DNA was run via agarose gel electrophoresis and stained with ethidium bromide to demonstrate DNA integrity.

35

Microbial Sequencing and Identification

A total of 3µg of extracted DNA (50ng/µL in 60µL) for each host animal was prepared and submitted to the University of Missouri DNA Core for microbial sequencing.

Libraries were created four samples per lane and sequenced on the Illumina

GenomeAnalyzer II, resulting in 150 base-pair, paired end reads. FASTQ sequence files were first quality filtered by truncating each read after the first run of 3 bases with a

Phred quality score less than 15 (Ewing and Green, 1998). Reads and read pairs that were less than 85 bases long or had an average quality score of less than 25 were omitted from analysis.

Quality filtered reads were compared to a database of 16S rDNA gene sequences to identify microbial taxa. The database was comprised of all 16S rDNA genes from the

Ribosomal Database (Cole et al., 2009) and the sequenced prokaryotic genomes available from GenBank (Benson et al., 2013). From that database, identical sequences or sequences less than 1,450 basepairs in length were discarded. The remaining 27,290 sequences were clustered into operational taxonomic units (OTUs) by computing all possible pairwise Needleman-Wunsch sequence alignments (Needleman and Wunsch,

1970) using a GPU-based pairwise alignment tool (Truong et al., 2014). Using Bowtie

(Langmead et al., 2009) quality filtered reads were aligned to the database. Reads where both the forward and reverse reads uniquely matched to exactly one OTU with ≥97% identity were classified as an individual from that OTU and extracted from the NCBI

36

database (Ellison et al., 2013). Additionally, a phylum classification of each

OTU was recorded. This resulted in a host animal by OTU count matrix.

Statistical Analyses: Diet Classification

As mentioned previously, white-tailed deer samples were collected from the wild; therefore, diet information was unknown. It was presumed based on visual observation of the digesta that these deer were consuming primarily forage, and this was confirmed using discriminant function analysis via SAS/PROC DISCRIM (SAS Institute, Inc., Cary,

NC). First, a step-wise discriminate function analysis performed on the normalized count matrix of all host animals, excluding the white-tailed deer host animals. This procedure provided the most informative OTUs in predicting diet composition of a host animal. The

10 most informative OTUs were selected and used to predict the diet composition of each white-tailed deer host animal as well as verify correct classification of the other host animals. Each white-tailed deer sample was classified as a forage-based diet classification with greater than 99% probability.

Statistical Analyses: Exponential Decay Model fit to OTU Distribution

All sampled host species were fed a forage-based diet, and additional samples were collected from host animals fed a concentrate-based diet. In order to avoid diet as a confounding variable, only host animals fed a forage-based diet were included in the analysis to determine the distribution of microbial taxa (OTUs) between each host

37

species. These included 8 cattle, 8 sheep, 4 bison, 4 red deer, and 4 white-tailed deer. To account for sequencing variations between each host animal, sequence counts were normalized. Normalization was done by calculating the proportion of each OTU from the total number of sequences for each host animal, and this value represented the abundance of each OTU found in each host animal (Morgan et al., 2010). Total abundance for each

OTU within a host species was calculated, and rank was determined as the highest abundance (rank=1) to the lowest abundance. A normalized host animal by OTU matrix was created for each host species and OTU rank by abundance plots were created. Plots resembled an exponential decay model:

Equation 2.1: Exponential Decay Model

푦 = 훽푒−훾푟 where 푦 is the abundance of an OTU, 푟 is the rank for that particular OTU, and 훽 is the expected abundance with rank=1.

Nonlinear regression was fit to the data in order to obtain meaningful quantities that the coefficients for each host species could be directly compared with ANOVA (Freud and

Littell, 2000). Using the NLIN procedure in SAS (SAS Institute, Inc., Cary, NC), rank and abundance data was used to estimate coefficients 훽 and 훾 for each host animal. The observed OTU abundance and estimated exponential decay curves were plotted and visualized to verify the fit of the expected curve to the observed abundance (Motulsky and Ransnas, 1987). Additionally, total abundance was calculated as the sum of each host animal’s OTU abundance within a host species. Similarly to analysis of each host animal, total observed abundance and rank for each host species were used to estimate

38

coefficients for an exponential decay model. Observed and estimated rank by OTU abundance was plotted and visually compared. A t-test was performed to determine if the model coefficients (훽 and 훾) were similar between host species. A generalized linear model in SAS (SAS Institute, Inc., Cary, NC) was used to determine whether the estimated curvature of the OTU distribution for each host species was similar. In this model, OTU rank was the independent variable and the dependent variable was each of the coefficient values predicted in the exponential decay model.

Statistical Analyses: Diversity and Common Core OTUs across Host Species

A binomial host species by OTU matrix was used to analyze the change in total OTUs and common OTUs identified by adding additional host species into an analysis. A “1” in the matrix represented an OTU that was present in the host species and a “0” representing an OTU that was absent. To analyze the distribution of microbial taxa identified as host species were added in the analysis, a PERL script was used to record the total OTUs and common OTUs identified for each iteration of adding a host species. For example, one iteration would calculate OTU numbers in the order: Host species 1, 2, 3, 4, 5 and a second iteration could be 2, 3, 4, 5, 1. This was repeated until all possible iterations

(n=120) were analyzed.

The core microbial community was determined from the original host animal by OTU matrix. The minimum number of sequence counts identified within a host species to be considered not absent by chance was calculated at a significance α <0.05. This was done

39

by including all host animals from each host species and calculating the chance, at random, that the number of sequence counts would be identified in at least one host species. Operational taxonomic units with sequence counts present in >90% of the host animals and greater than 30 sequence counts per host species were identified as members of the core microbial community.

Results and Discussion

Full rumen contents were sampled from 48 animals and microbial DNA was extracted.

Extracted DNA was shotgun sequenced on an Illumina GenomeAnalyzer II, which resulted in an average of 26 million paired-end sequences (Table 2.3). An average of

29,746 read pairs for each host animal mapped to the database, which resulted in 583 distinct OTUs.

Diet Classification

First, diet classification of the white-tailed deer samples was performed using the discriminate function analysis in SAS (SAS Institute, Inc., Cary, NC). Ten OTUs were identified and used to predict (R2=0.96) the diet classification of each sample. Each white-tailed deer sample was classified as being from a forage-based diet classification with greater than 99% probability.

40

Exponential Decay Model fit to OTU Distribution

The OTU count matrix utilized for this analysis included 28 host animals that were fed a forage-based diet, 8 cattle, 8 sheep, 4 bison, 4 red deer, and 4 white-tailed deer, and included 463 total OTUs.

First, rank and OTU abundance data was used to estimate coefficients 훽 and 훾 from

Equation 2.1 for each host animal (Table 2.4). Observed and estimated OTU abundance was plotted to visualize the fit of the expected curve to the observed data. As shown in

Figure 2.1, an irregular fit was seen when the estimated curve for OTU abundance dropped faster to a ‘near-zero’ value than what was observed. All estimated exponential decay curves appeared to fit the observed data; therefore, total abundance data from each host animal was used to analyze the distribution of OTU abundance among each host species.

Observed and estimated OTU abundance was compared as well as visualization of the plot. As discussed previously, the most irregular fit of the curve was the point, or OTU rank, at which the estimated abundance went to near-zero; however, all estimated curves fit the observed data. No significant differences (α<0.05) were found for either of the coefficients (훽 or 훾) or exponential decay curves between host species, suggesting that the distribution of OTUs among host species was not significantly different from each other. Estimated coefficients for cattle fit the observed data best (Figure 2.2). A maximum of a 2% difference between the estimated abundance and the observed

41

abundance was seen, all of which were within the first 3 ranked OTUs. Prevotella ruminicola, Mitsuokella jalaudinii, and Ruminococcus albus were estimated to be less frequent than what was observed. The OTU abundance plot for sheep is shown in Figure

2.3. There was a maximum of a 2% difference between the estimated abundance and the observed abundance; however, the difference was in OTU ranks 4 through 8. This caused the exponential decay curve to be a near-zero estimate sooner than the observed OTU abundance. There was a maximum of a 3% difference between estimate and observed abundance in OTU ranks 1, 3, and 4 of the bison samples, and this is resembled in the plot (Figure 2.4). The exponential decay curve underestimated the observed abundance of

Fibrobacter succinogenes, Prevotella ruminicola, and Methanobrevidbater smithi in bison. The plots for red deer and white-tailed deer host species are shown in Figure 2.5 and Figure 2.6, respectively. The expected curves demonstrated a similar irregularity from the observed OTU abundance as that of the sheep host species. The maximum difference was 3% between the estimated and observed OTU abundance and included 3

OTUs ranked as 3, 6, and 7. The white-tailed deer samples also had a 3% maximum difference between the estimated and observed OTU abundance. The first ranked OTU,

Prevotella ruminicola, had a higher abundance than estimated and the third ranked OTU,

P. brevis, was observed 3% lower than expected. Fitting an exponential decay model to

OTU rank by abundance appears an appropriate fit to the data. Interestingly, some of the most frequent OTUs observed in each host species were of the same microbial taxa.

Three OTUs were ranked as the top 10 most frequent OTUs; Methanobrevibacter smithii,

Prevotella brevis, and Prevotella ruminicola. Fibrobacter succinogenes was represented as one of the top 10 most abundant OTUs in all host species except white-tailed deer, and

42

Ruminococcus flavefaciens was ranked in the top 10 OTUs in all host species except bison. These are commonly identified microorganisms in the rumen, more specifically ruminants eating a forage-based diet. Methanobrevibacter smithii is a common methanogen of the rumen that uses hydrogen (H2) to reduce carbon dioxide (CO2) into methane (CH4). Some bacterial microorganisms release H2/CO2 as a by- product, such as

F. succinogenes (Zhou et al., 2009; Moss et al., 2000; Stewart et al., 1997). Also,

Prevotella spp. have a commensal relationship with cellulolytic bacteria including F. succinogenes and R. flavefaciens, supporting the high frequencies seen among these groups (Stewart et al., 1997).

Diversity and Common Core OTUs across Host Species

Whole genome sequencing fragments extracted DNA prior to sequencing and avoids the amplification biases of PCR; therefore providing the most comprehensive set of data.

Currently, identification of 16S rDNA genes allows for categorization of sequences into microbial taxa, or OTUs (Petrosino et al., 2009; Hudson, 2008; Tringe and Rubin, 2005).

This study provides one of the first analyses of OTU classification diversity when adding additional host species from whole genome sequences. Sequence data from 48 host animals resulted in identifying a total of 583 OTUs with 143 OTUs shared among all host species. As hypothesized, the total number of OTUs identified increased as additional host species were included in the analysis and common microbial taxa decreased (Figure

2.7). In this analysis, the increase in the total OTUs identified lessened as additional host species were included in the analysis. Additionally, the decrease in common OTUs

43

identified lessened as additional host species were included in the analysis. For example, the addition of 1 host species, for a total of 2 host species in an analysis, showed a 34% increase in the total number of OTUs identified and a 34% decrease in the total number of common OTUs identified among both host species. However, the addition of 1 host species, for a total of 4 host species in the analysis, had an increase of 12% in the total number of OTUs identified and a 9% decrease in the number of common OTUs identified among host species (Table 2.5). This study provides an analysis of the diversity of OTUs identified from whole genomic sequencing of rumen samples among different host species. These results may provide guidance in determining how many host species are needed in a study. Additionally, this analysis supports the idea that a common microbial population exists among all ruminant host species (Henderson et al., 2015; Jeraldo et al.,

2012). One hundred forty-three OTUs were common among all host species; however, individual host animal sequence counts representing those OTUs were not considered in this part of the analysis. Some OTUs that were common among all host species may not be present in all host animals and therefore not representative of a core microbial community.

Metagenomic studies which include multiple host species have indicated a core microbial community which is present among all ruminant host animals (Henderson et al., 2015;

Jeraldo et al., 2012). Henderson et al. (2015) analyzed 742 rumen content samples from

32 different host species from around the world. In their study, the 30 most abundant bacterial groups were identified in over 90% of the samples and were similar in other metagenomic studies of rumen microbial communities (Kim et al., 2011). The present

44

included multiple animals of five different ruminant host species, therefore providing the opportunity to add to the body of research identifying a core rumen microbial community. To identify a core microbiome, the full host animal by OTU count matrix was analyzed, including all sampled ruminant host species unbiased of domestication status (wild or domestic) or diet type (forage- and concentrate-based). For an OTU to be identified part of a core microbial community in this study the OTU needed to be identified in >90% of the host animals sampled (n=44), and a minimum of 30 sequence counts within a host species. Among all 48 host animals, a total of 583 OTUs were identified. Thirty-three OTUs were identified in >90% of the host animals, and only 9

OTUs had greater than 30 sequence counts within all host species (Table 2.6). Bacterial

OTUs identified were Butyrivibrio, Prevotella, Oscillibacter, and Ruminococcus, all of which have been described as predominant genera in the rumen (Henderson et al., 2015;

Chaucheyras-Durand and Ossa, 2014; Morgavi et al., 2013; Stewart et al., 1997). The

Henderson et al. (2015) study included camelids and did not identify Oscillibacter as dominant microbial taxa, possibly suggesting Oscilibacter is dominant among ruminants in this analysis but not among camelids. Also supporting other studies, the methanogen

Methanobrevibacter was identified as a core microbial community member (Henderson et al., 2015; Chaucheyras-Durand and Ossa, 2014; Morgavi et al., 2013; Stewart et al.,

1997). The present study supports the idea of a ‘core microbial community’; however, based on the criteria, only 9 OTUs were identified in core.

45

Table 2.1: Metadata for Domesticated Host Animals Live Animal Diet1 Breed Sex2 Age3 Collection Date Weight4 Cow 4750 C British x Continental M 1 March 2014 948 Cow 4801 C British x Continental M 1 March 2014 988 Cow 4716 C British x Continental M 1 March 2014 865 Cow 4625 C British x Continental M 1 March 2014 1160 Cow 4759 C British x Continental M 1 March 2014 806 Cow 4707 C British x Continental M 1 March 2014 845 Cow 4476 C British x Continental M 1 March 2014 932 Cow 4737 C British x Continental M 1 March 2014 757 Cow 4691 F British x Continental M 1 March 2014 1077 Cow 4374 F British x Continental M 1 March 2014 1231 Cow 4636 F British x Continental M 1 March 2014 1048 Cow 4710 F British x Continental M 1 March 2014 998 Cow 4712 F British x Continental M 1 March 2014 985 Cow 4753 F British x Continental M 1 March 2014 932 Cow 4768 F British x Continental M 1 March 2014 904 Cow 4785 F British x Continental M 1 March 2014 890 Sheep 1009 F Rambouillet M 0.7 Nov. 2011 71 Sheep 1026 C Rambouillet M 0.7 Nov. 2011 62 Sheep 1101 C Rambouillet 0.7 0.7 Nov. 2011 88 Sheep 1111 C Rambouillet M 0.7 Nov. 2011 82 Sheep 1127 F Rambouillet M 0.7 Nov. 2011 84 Sheep 1208 F Suffolk M 0.7 Nov. 2011 79 Sheep 1220 C Hampshire M 0.7 Nov. 2011 79 Sheep 1239 C Suffolk M 0.7 Nov. 2011 87 Sheep 1248 F Hampshire M 0.7 Nov. 2011 95 Sheep 1366 F Hampshire M 0.7 Nov. 2011 53 Sheep 1396 C Hampshire M 0.7 Nov. 2011 71 Sheep 1397 F Hampshire M 0.7 Nov. 2011 81 Sheep 7429 C Suffolk M 0.7 Nov. 2011 93 Sheep 7505 F Suffolk M 0.7 Nov. 2011 96 Sheep 1003 F Rambouillet M 0.7 Nov. 2011 64 Sheep 1348 C Hampshire M 0.7 Nov. 2011 83 1C-concentrate-based diet; F-forage-based diet 2M-male; F-female 3Age is presented in years 4Live Weight is presented in pounds

46

Table 2.2: Metadata for Wild Host Animals Collection Collection Live Dressed Animal Diet1 Sex2 Age3 Date Time Weight4 Weight5 8:37 AM Bison M2 C M 2.5 Sept. 2014 975 594 9:35 AM Bison M5 C M 2.5 Sept.2014 1200 695 Bison NS1 F F 2.4 Oct. 2012 10:00 AM 520 - 9:00 AM Bison NS3 F M 2 Nov. 2012 569 - 9:00 AM Bison NS4 F M 2 Nov. 2012 608 - 9:00 AM Bison NS9 F M 2 Nov. 2012 560 - Bison WC23 C M 2.5 Oct. 2013 2:30 PM - 767 Bison WC24 C M 2.5 Oct.2013 4:00 PM - 697 Red Deer 1 F F 3 July 2014 10:15 AM - 122 Red Deer 2 F F 3 July 2014 10:35 AM - 135 Red Deer 3 F F 3 July 2014 10:40 AM - 140 Red Deer 4 F F 3 July 2014 11:00 AM - 134 White-tail Deer 1 F F 1.5 Jan. 2013 2:00 PM - - White-tail Deer 2 F M 0.5 Jan. 2013 2:30 PM - - White-tail Deer 3 F F 1.5 Jan. 2013 5:30 PM - - White-tail Deer 4 F M 1.5 Jan. 2013 2:00 PM - - 1C-concentrate-based diet; F-forage-based diet 2M-male; F-female 3Age is presented in years. White-tailed deer were aged using the teeth of the lower jaw (Cain and Wallace, 2003). 4Live Weight is presented in pounds 5Dressed Weight was taken immediately after skinning and evisceration of carcass and is presented in pounds

47

Table 2.3: Average Number of Reads for Each Host Species at Each Analysis Step Raw Quality Filtered Aligned FASTA Host Species FASTA Reads FASTA Reads Reads Cattle 31,050 Sheep 31,840 Bison 20,698 Red Deer 39,619 White-tailed Deer 25,523 Average Total 29,746

48

Table 2.4: Estimated Coefficients and Standard Errors of the Exponential Decay Curve for Each Host Species1

Host Species 휷 Std. Error (휷) 휸 Std. Error (휸) Cattle 0.220 10.243 0.262 0.397 Sheep 1.315 10.243 1.250 0.397 Bison 0.178 10.243 0.230 0.397 Red Deer 0.411 14.485 0.675 0.562 White-tailed Deer 0.223 14.485 0.301 0.562 1 Exponential Decay Model: 푦 = 훽푒−훾푟 where 푦 is the abundance of an OTU, 푟 is the rank for that particular OTU, and 훽 is the expected abundance with rank=1

49

Table 2.5: Percent Increase in Total Observed OTUs and Decrease in Common ‘Core’ OTUs as Additional Host Species are Included in Analysis

Total Number of Host Species in Analysis1 2 3 4 5 Increase in Total OTUs Identified (%) 34.18 18.05 12.22 9.26 Decrease in Total common ‘core’ OTUs Identified (%) 34.18 15.13 9.12 6.17 1 The presented number of host species is reflective of the total number of species used for the analysis. For example, in column ‘Host Species 2’ there was a 34.18% increase in total OTUs identified and 34.18% decrease in common OTUs observed from when there was 1 host species in the analysis.

50

Table 2.6: Identified Core Microbial Community1

Butyrivibrio fibrisolvens Butyrivibrio hungaeti Methanobrevibacter smithii Oscillibacter valericigenes Ruminococcus albus Prevotella albensis Prevotella brevis Prevotella genomosp Prevotella ruminicola 1Individuals were classified as core microbial community members if they were found in >90% of the host animals sampled and had a minimum of 30 sequence counts within the host species.

51

Figure 2.1: Examples of the Exponential Decay Curves fit to the rank by OTU Abundance Observed Data1 A: Example of one of the best estimated curves:

B: Example of one the worst estimated curves:

1 Coefficients for exponential decay function were estimated using SAS (SAS Institute, Inc., Cary, NC)

52

Figure 2.2: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in Cattle1

1 Coefficients for exponential decay function were estimated using SAS (SAS Institute, Inc., Cary, NC)

53

Figure 2.3: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in Sheep 1

1 Coefficients for exponential decay function were estimated using SAS (SAS Institute, Inc., Cary, NC)

54

Figure 2.4: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in Bison 1

1 Coefficients for exponential decay function were estimated using SAS (SAS Institute, Inc., Cary, NC)

55

Figure 2.5: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in Red Deer 1

1 Coefficients for exponential decay function were estimated using SAS (SAS Institute, Inc., Cary, NC)

56

Figure 2.6: Estimated and Observed OTU Rank by Abundance using an Exponential Decay Model in White-tailed Deer1

1 Coefficients for exponential decay function were estimated using SAS (SAS Institute, Inc., Cary, NC)

57

Figure 2.7: Total OTUs Identified and Number of Common ‘Core’ OTUs as Additional Host Species are Added into Analysis

Each line in the graph represents an iteration of the order of host species. For example, one iteration would calculate OTU numbers in the order: Host species 1, 2, 3, 4, 5 and the second iteration could be 2, 3, 4, 5, 1. All iterations (n=120) are presented. Blue lines represent the total number of identified OTUs. Peach lines represent the total number of common ‘core’ OTUs identified Black lines represent the mean number of OTUs over all iterations.

58

CHAPTER THREE

EFFECTS OF HOST DIET, SPECIES, AND

DOMESTICATION STATUS ON MICROBIOME

Summary

The microbial fermentation process in the rumen accounts for approximately 70% of the ruminant’s metabolic energy for maintenance or improvement of health, growth, and reproductive factors. Identification of microbial taxa in the rumen has greatly improved with developments in sequencing and analysis techniques. Identifying diverse microorganisms in the host’s gut may provide insight into ruminant production and possibly improved livestock efficiency and productivity. The aim of this study was to identify ruminal microbial phyla and taxa that varied between host diet types, species, and domestication status. Rumen contents were sampled from cattle, sheep, bison, red deer, and white-tailed deer (n=48) consuming either a concentrate- or forage-based diet.

DNA was extracted and shotgun sequenced using the Illumina GAII for phyla and microbial taxa identification, or operational taxonomic units (OTUs). The VARCOMP procedure in SAS was used to estimate the contribution of host diet, species, and

59

domestication status to the differences among read counts for each phylum and microbial taxa. In order to avoid a Type I error, the minimum number of sequence counts between compared groups was calculated at a significance α <0.05. Within each group, sequence counts for each OTU that were below this threshold were removed from the analysis. A total of 12 phyla consisting of 583 OTUs were identified among all host animals. Host diet explained >10% of the variance in read counts in Firmicutes, Actinobactera, and

Bacteroidetes/Chlorobi group, and host diet explained ≥20% of the differences in read counts in 66 total OTUs within those phyla. Host species accounted for >10% of the variance in read counts in Firmicutes, Actinobacteria, Bacteroidetes/Chlorobi group,

Fibrobacteres/Acidobacteria group, Spriochaetes, and Proteobacteria, and accounted for

≥20% of the variance in read counts in 73 OTUs within those phyla. Host domestication status explained >10% and ≥20%, respectively, of the differences among read counts in

Firmicutes and Bacteroidetes/Chlorobi group consisting of 23 OTUs. Some OTUs were unique to particular groups of host species (p<0.05). Three OTUs were found only in cattle, bison, and sheep, 1 OTU (Pectinatus haikarae) was found only in the deer host species; 8 OTUs were unique to domesticated species, and Streptococcus spp. was unique to wild species. Interestingly, some OTUs were in higher/lower abundance in cattle and bison and lower/higher in other species. Collinsella aerofaciens was present only in cattle and bison. The identified phyla and OTUs in this study provide a stepping stone to identifying differences in the rumen’s microbial community specific to different host diets, species, and domestication status.

60

Introduction

Metagenomics was first termed by soil scientists in 1998, and since then the research area has only gained momentum and expanded into the human and animal sciences

(Handelsman et al., 1998). Metagenomics refers to the study of a collective set of genomes, representing all the microbial communities, sampled directly from an environment. Originally, microbial research was performed via culturing methods; however, it’s estimated that culture-dependent microbial identification accounts for <20% of the total microbes in an environment, and only approximately 11% of the total bacterial microbes present in the rumen (Deusch et al., 2015; Chaucheyras-Durand and

Ossa, 2014; Krause et al., 2013; Guan et al., 2008; Nelson et al., 2003). The improvement of sequencing and analysis techniques has allowed for improved identification of microbes in various environments, such as the rumen.

The community of microorganisms that inhabit the rumen is complex and not fully understood. Bacteria, archaea, fungi, ciliate protozoa, and viruses make up the 1012 microorganisms per gram that are present in the rumen microbial community, or rumen microbiome (Chaucheyras-Durand and Ossa, 2014; Henderson et al., 2013; Nelson et al.,

2003). A symbiotic relationship exists between the ruminal microbiome and host animal in which microorganisms work together to ferment and degrade consumed plant fibers into digestible compounds such as volatile fatty acids and bacterial proteins (Russell et al., 2014; Krause et al., 2013; Jami and Mizrahi, 2012; Flint, 1997). Approximately 70% of the host animal’s metabolic energy for maintenance or improvement of health, growth, and reproductive factors is obtained from this fermentation process (Hernandez-Sanabria

61

et al., 2011). Understanding the complex ruminal microbiome could prove to be of great importance to livestock efficiency and productivity.

Ninety-five percent of the microorganisms found in the rumen are bacteria, of which approximately 200 of the 7,000 estimated species have been cultivated and described physiologically (Deusch et al., 2015; Chaucheyras-Durand and Ossa, 2014; Kim et al.,

2011). Diversity among the rumen microorganisms has been shown to be influenced by modifications in the diet and host species (Carberry et al., 2012; Hernandez-Sanabria et al., 2011). It has also been proposed that selective pressure on host and mood or behavior of the host animal affects the host’s microbiota (Forsythe et al., 2010; Ley et al., 2008;

Lyte et al., 2006). Most of the microbial functions are unknown and identifying diverse microorganisms across different diet types, species, and domestication status may provide additional insight into ruminant production. The aim of this study is to identify ruminal microbial phyla and taxa that vary between host diet types, species, and domestication status.

Firmicutes, Proteobacteria, and Bacteroides/Chlorobi group are the most abundant phyla found in the rumen (Chaucheyras-Durand and Ossa, 2014; Kim et al., 2011; Fernando et al., 2010; Nelson et al., 2003). Studies have shown abundance and diversity of genera are affected by different types of host diets, i.e. a high starch (concentrate-based) and a high fiber (forage-based) diet. For example, Firmicutes spp. fluctuate between diet types, mainly because the phylum contains both cellulolytic and non-cellulolytic bacteria. For

62

example, Mitsuokella, Selenomonas ruminantium, and Anaerovibrio lipolytica are found in greater abundance in starch-based diets; while Ruminococcus albus and Ruminococcus flavefaciens are more abundant in fiber rich diets (Singh et al., 2014; Jami et al., 2013;

Fernando et al., 2010; Flint and Bayer 2008; Stewart et al., 1997). Hernandez-Sanabria et al. (2011) demonstrated that Proteobacteria (Succinivibrio dextrinosolvens) was more abundant in starch diets among cattle host species. Bacteroides/Chlorobi group is associated with high fiber diets; specifically species of Prevotella (Prevotella bryantii and Prevotella ruminicola) classification (Singh et al., 2014; Jami et al., 2013).

Additionally, Fibrobacteres had higher representation among fiber diets, all of which were Fibrobacter (Singh et al., 2014; Fernando et al., 2010). While relationships between rumen microorganisms and the host have been supported, understanding the function and ecology of the microbiome is incomplete. Therefore, studies that address the diversity, abundance, and taxonomic composition of the microbiome among different host animals are beneficial (Russell et al., 2014; Morgavi et al., 2013). This study includes more host animals and a broader range of host species than have typically been used in metagenomic studies which should confirm diversity between host diet types as well as identify new influential microbial taxa.

Few studies have focused on microbial variations across multiple host species or domestication status. Nelson et al. (2003) reported Firmicutes and the

Bacteroides/Chlorobi group differed between Thompson’s gazelle (Gazella rufifrons),

Grant’s gazelle (Gazella granti), and zebu cattle (Bos indicus) host species. A cellulolytic bacterium, Ruminococcus albus, was found in higher abundance in the rumen of bison as

63

compared to sheep (Singh et al., 2014). Interestingly, unlike the diet effects, none of the

Bacteroides/Chlorobi group that were influenced by different host species were

Prevotella (Nelson et al., 2003). Provided a fiber rich diet, red deer (Cervus elaphus) digest cellulose and lignin better than sheep (Ovis aries), and goats (Capra aegagrus) digest fiber better than sheep (Domingue et al., 1991). Microbial taxa differing among these species may provide insight into more efficient cellulose and lignin microbial fermenters. Alternately, it has been shown that bison digested a poor quality forage diet more efficiently and had higher average daily gains than cattle (Hawley et al., 1981a;

Hawley et al., 1981b; Richmond et al., 1977; Peden et al., 1974). Henderson et al. (2015) identified diversity of some bacterial microorganisms in ruminant host species from different regions of the world, and postulated it was due to diet, climate, and farming practices. Identifying microbial taxa that are associated with various ruminant host species will provide a stepping-stone to determining what those differences imply for the host species production characteristics.

Wild ruminants, as compared to domestic relatives, demonstrate a superior ability to process a high fiber diet (Nelson et al., 2003; Richmond et al., 1977). High-artic Svalbard reindeer (Rangifer tarandus platyrhynchus) have a rumen bacterial population which excels at fiber digestion and enables them to survive on low-quality feed during the winter months (Orpin et al., 1985). Butyrivibrio fibrisolvens, Streptococcus bovis, and

Methanogenic bacteria appear to be the beneficial microorganism in these animals. Bison

(Bison bison) consistently select lower quality forage than cattle (Bos tarus); however, both species have similar digestion factors, such as passage rate and volatile fatty acid

64

composition (Towne et al., 1988). Differences found in ruminal microbial populations may provide insight as to why bison are able to digest fiber more efficiently than cattle

(Hawley et al., 1981a; Hawley et al., 1981b; Richmond et al., 1977; Peden et al., 1974).

Ley et al. (2008) and others suggested that co-diversification has occurred between host and their microbiome (Morgavi et al., 2013). Conceivably, differences in microbial taxa between wild and domesticated species would support co-diversification due to domestication events. Furthermore, a synonym for domestication is tamability, meaning how well a wild animal adapts to life among humans (Trut, 1999). Domesticated species are more docile than wild animals. Lyte et al. (2006) reported that certain gut bacteria in mice affect the host animal’s mood and behavior (Forsythe et al., 2010; Lyte et al., 2006).

While not studied in ruminants, variation in microbial taxa associated with differences in domestication status may be due to permanent temperament effects. Most wild animals studied and compared to domesticated animals are those living in extreme environments.

The current study includes wild ruminants that inhabit similar environments as the domesticated species, potentially identifying microbial taxa associated with host temperament.

The objective of this study was to determine the relative proportion of variation explained by microbial phyla and taxa among host diet, species, and domestication status. Results from this study seek to identify microbial taxa associated with host diet, species, and domestication status.

65

Materials and Methods

All live animal procedures were conducted in compliance with the Guide for the Care and

Use of Laboratory Animals and The Guide for the Care and the Use of Agricultural

Animals in Agricultural Research and Teaching. Protocols for sheep (assurance A-3216-

01) and cattle (protocol 8048) sample collection were approved by the University of

Wyoming Institutional Animal Care and Use Committee (Laramie, WY) and the

University of Missouri Animal Care and Use Committee (Columbia, MO), respectively.

Host Species and Diet

Full rumen contents were collected from five different species of ruminants (n=48) being fed either a concentrate- or forage-based diet. Rumen content was collected via oral lavage from sixteen Suffolk, Hampshire, and Rambouillet purebred sheep (Ovis aries) housed at the University of Wyoming Agriculture Research and Extension Center near

Laramie, Wyoming and 16 British x Continental crossbred cattle (Bos tarus) housed at the University of Missouri South Farm near Columbia, Missouri. Equipment outlined by

Geishauser (1993) was used to collect sheep and cattle rumen content. Briefly, an oro- ruminal probe, suction pump, and funnel were assembled. When the probe was passed into the animals’ rumen, approximately 80 mL of rumen content was pumped into a sterilized collection container. Additional domestic host animal metadata is presented in

Table 3.1. Bison were raised at Kentucky Bison Ranch (n=2) in Bagdad, Kentucky,

Northstar Bison (n=4) in Rice Lake, Wisconsin, and Crown Valley Winery (n=2) in Ste.

Genevieve, Missouri and slaughtered at Memphis Meats Processing in Memphis, Indiana,

66

Northstar Bison Processing, and Wright City Meats in Wright City, Missouri, respectively. Red deer were raised at Rolling Hills Red Deer Farm in Catawissa,

Pennsylvania and slaughtered at Bixlers Country Meats in Hegins, Pennsylvania. Rumen contents from white-tailed deer (Odocoileus virginianus) were collected, with permission, from animals harvested by licensed hunters during Pennsylvania deer hunting season in Herndon, Pennsylvania. Additional wild host animal metadata is presented in

Table 3.2.

All host animals had access to feed ad libitum and were healthy at time of sample collection. One-half of the cattle (n=8), sheep (n=8), and bison (n=4) were fed a concentrate-based diet consisting of primarily corn, and the other half of the cattle (n=8), sheep (n=8), and bison (n=4) as well as red deer (n=4), were fed a forage-based diet containing alfalfa pellets, grasses, and/or had free-range pasture access (Table 3.3).

White-tailed deer species were predicted to be fed a forage-based diet using SAS/STAT® software via discriminant function analysis. Discriminant Function analysis is used to classify observations into two or more known groups on the basis of one or more quantitative variables.

DNA Extraction from Rumen Fluid

Immediately after rumen contents were sampled they were put on dry ice for transport back to the laboratory. Once at the laboratory, samples were stored in a -80ºC freezer.

Prior to DNA extraction, cattle, bison, red deer, and white-tailed deer samples were split

67

into two aliquots; one aliquot was fixed in 100% ethanol solution and the second aliquot was freeze-dried for preservation (Virtis Genesis 2XL, Innovative Laboratory Systems).

Sheep samples were thawed before DNA extraction. DNA was extracted from thawed, ethanol fixed, and freeze-dried samples using the QIAamp DNA Stool Mini Kit (Qiagen,

Santa Clarita, CA) and following the protocol described by Yu and Morrison (2004).

Particulate matter was removed from the sample via low speed centrifugation. Samples were then mixed with sterilized zirconia (0.3g of 0.1mm) and silicon (0.1g of 0.5mm) beads and 1 mL lysis buffer (1.5M Sodium Chloride, 1M Tris-HCl, 0.5M EDTA, and

10% SDS mixture). Cells in the sample were lysed by homogenization using a Mini-

Beadbeater-8 at maximum speed for 3 minutes. For sufficient lysing of bacterial cells, samples were incubated at 70°C for 15 minutes with gentle mixing every 5 minutes. Each sample was centrifuged (16,000g) at 4°C for 5 minutes before collecting the supernatant.

Homogenization, incubation, and centrifugation were repeated 3 times for each sample, and the supernatants were pooled. After DNA was precipitated, RNA and proteins were removed, and purification was completed. DNA was suspended in 10 mM Tris-Cl buffer

(pH 8.5, EB buffer from Qiagen kit) and quantified using a NanoDrop (1000 3.8.1,

Thermo Scientific). Additionally, 100 ng of DNA was run via agarose gel electrophoresis and stained with ethidium bromide to demonstrate DNA integrity.

Microbial Sequencing and Identification

A total of 3µg of extracted DNA (50ng/µL in 60µL) for each host animal was prepared and submitted to the University of Missouri DNA Core for microbial sequencing.

68

Libraries were created four samples per lane and sequenced on the Illumina

GenomeAnalyzer II, resulting in 150 base-pair, paired end reads. FASTQ sequence files were first quality filtered by truncating each read after the first run of 3 bases with a

Phred quality score less than 15 (Ewing and Green, 1998). Reads and read pairs that were less than 85 bases long or had an average quality score of less than 25 were omitted from analysis.

Quality filtered reads were compared to a database of 16S rDNA gene sequences to identify microbial taxa. The database was comprised of all 16S rDNA genes from the

Ribosomal Database (Cole et al., 2009) and the sequenced prokaryotic genomes available from GenBank (Benson et al., 2013). Identical sequences or sequences less than 1,450 basepairs in length were discarded. The remaining 27,290 sequences were clustered into operational taxonomic units (OTUs) by computing all possible pairwise Needleman-

Wunsch sequence alignments (Needleman and Wunsch, 1970) using a GPU-based pairwise alignment tool (Truong et al., 2014). Using Bowtie (Langmead et al., 2009) quality filtered reads were aligned to the database. Reads that both forward and reverse read pairs uniquely matched one OTU with ≥97% identity were classified as an individual from that OTU and extracted from the NCBI taxonomy database (Ellison et al., 2013). Additionally, a phylum classification of each OTU was recorded. This resulted in a host animal by OTU count matrix. To account for sequence count variations between each host animal, the count matrix was normalized for later statistical analyses.

Normalization was done by calculating the proportion of each OTU from the total number of sequences for each host animal (Morgan et al., 2010).

69

Statistical Analyses

As mentioned previously, white-tailed deer samples were collected from the wild, and therefore had unknown diet information. Diet classification was estimated by using discriminant function analysis via SAS/PROC DISCRIM (SAS Institute, Inc., Cary, NC).

First, a step-wise discriminate function analysis performed on the normalized count matrix of all host animals, excluding the white-tailed deer host animals. This procedure provided the most informative OTUs in predicting diet composition of a host animal. The

10 most informative OTUs were selected and used to predict the diet composition of each white-tailed deer host animal as well as verify correct classification of the other host animals. Each white-tailed deer sample was classified as a forage-based diet classification with greater than 99% probability, and all other host animals were identified as correctly classified to their assigned diet classification.

Unfortunately, not all sampled host species represented both diet classifications. In order to negate confounded variables, the 48 host animals were subdivided into two datasets for analyses. The first dataset was used to determine the variation in microbial phyla and taxa by host animal’s diet; therefore it contained only host animals of which both concentrate- and forage-based diets were equally represented in the host animals. This dataset consisted of a normalized count matrix of 16 cattle, 16 sheep, and 8 bison. The second dataset was used to determine the variation in microbial phyla and taxa by host species and domestication status. This dataset contained only host animals which were fed a forage-based diet, which included a normalized count matrix of 8 cattle, 8 sheep, 4 bison,

70

4 red deer, and 4 white-tailed deer. Operational taxonomic units that contained at least 1 read count were included in the normalized count matrix.

The VARCOMP (Type 1 method) procedure in SAS (SAS Institute, Inc., Cary, NC) was used to estimate the contribution of host diet, species, or domestication status to the variation of read counts for each phylum. Operational taxonomic units comprising each phylum with variance estimates greater than 10% were also analyzed via VARCOMP procedure. This resulted in identifying phyla and OTUs within that phyla that explained

10% or more of the variation in normalized read counts due to host diet, species, and domestication status (Gaylor et al., 1970). In order to avoid a Type I error, the minimum number of sequence counts among compared groups was calculated at a significance α

<0.05. Within each compared group(s), sequence counts for each OTU that were below this threshold were removed from the analysis.

Results and Discussion

Full rumen contents were sampled from 48 animals and microbial DNA was extracted.

Extracted DNA was shotgun sequenced on an Illumina GenomeAnalyzer II, which resulted in an average of 26 million paired-end sequences (Table 3.4).

71

Microbial Taxa Identification

An average of 29,746 read pairs for each host animal mapped to the database, which resulted in 583 distinct OTUs; 143 were common among all 48 host animals. The OTUs identified represented 12 different phylum. Five phyla, Elusimicrobia, Planctomycetes,

Synergistetes, Tenericutes, and Fusobacteria had <70 total sequence counts represented in the phylum, and therefore not retained for analysis. The other 7 phyla, Firmicutes,

Actinobacteria, Bacteroidetes/Chlorobi Group, Proteobacteria, Methanobacteria,

Fibrobacteres/Acidobacteria Group, Spirochaetes, and had >600 sequence counts represented in each phylum and were retained for analysis.

Diet

First, diet classification of the white-tailed deer samples was performed using the discriminate function analysis in SAS (SAS Institute, Inc., Cary, NC). Ten OTUs were identified and used to predict (R2=0.96) the diet classification of each sample (Table 3.5).

Each white-tailed deer sample was classified as a forage-based diet classification with greater than 99% probability.

As previously mentioned, the normalized count matrix used for the diet analysis consisted of 7 phyla and 40 host animals (16 cattle, 16 sheep, and 8 bison). Half of each host species in the analysis was fed either a concentrate- or foraged-based diet. Along with diet type, host species effects were accounted for in the model; analysis of species

72

will be discussed below. A total of 533 OTUs were identified; 284 were shared between all host animals, regardless of diet classification. Host animals fed a concentrate-based diet contained an additional 142 OTUs and host animals fed a forage-based diet had an additional 107 OTUs. Diet classification explained greater than 10% of the variance in read count for 3 phyla: Firmicutes, Actinobacteria, and Bacteroidetes/Chlorobi Group

(Table 3.6).

The phylum Firmicutes was more abundant in host animals fed a concentrate-based diet.

This phylum consisted of 203 OTUs, with 51 OTUs present only in concentrate-based diet, 32 only in forage-based diet, and 120 shared OTUs across diet classifications. Diet explained greater than 20% of the variation in read counts among 37 OTUs (Table 3.7).

Acidaminococcus, Allisonella, Catonella, Dialister, Enterococcus, Eubacterium,

Howardella, Lactobacillus, Lactococcus, Pectinatus, Pseudoramibacter, , and

Seleomanas were in greater abundance in the concentrate-based diet. Blautia,

Clostridium, Gracilibacer, Megamonas, Moryella, Oscilibacter, Ruminococcus,

Schwartzia, and Sunrphococcus were more abundant in the forage-based diet (p<0.05).

The Butyrivibrio class, Butyrivibrio fibrisolvens and Butyrivibrio hungatei, had a higher abundance in host animals fed a forage-based diet (p<0.05). Butyrivibrio fibrisolvens hydrolyses a range of substrates including starch, cellulose, and xylan to produce butyrate, accounting for 16% of the total volatile fatty acids present in the rumen (Stewart et al., 1997). While most research has shown an association with starch-based diets, this study has identified OTUs that may be degrading cellulose. Dialister succinatiphilus and

73

Dialister invisus were more abundant in concentrate-based diets and accounted for large portions of the variance between diet classification types.

Diet classification explained 31% of the variance in read counts among 61 OTUs in

Actinobacteria, of which 24 OTUs were represented in only host animals fed a concentrate-diet, 10 OTUs in forage-based diet, and 27 OTUs were shared between both diet classifications. Variation in Atopobium parvulum and Atopobium fossor accounted for 28% and 26% of the differences between diet classifications, respectively. Both were more abundant in host animals fed a forage-based diet (p<0.05). Although statistically significant, representation within these OTU averaged 20 and 28 sequences in concentrate- and forage-based diets, respectively.

One hundred and ten OTUs from the Bacteroidetes/Chlorobi Group were found. Seventy- four OTUs were shared between diet classifications; 16 OTUs were represented only in concentrate-based diet host animals and 20 OTUs were only in foraged-based diets. In this phylum, diet explained ≥20% of the variance between read counts among 27 OTUs, consisting of 24 Prevotella, 1 Paraprevotella, and 2 Bacteroides microorganisms of which individual OTUs within the same class varied with association of diet type (Table

3.8). Prevotella bryantii was higher in concentrate-based diets, and Prevotella ruminicola and Prevotella brevis were more abundant in host animals fed a forage-based diet

(p<0.05). P. ruminicola and P. bryantii ferment both hemicellulose and starch; however the present study supports findings of Fernando et al. (2010) and Singh et al. (2008).

74

Since Prevotella depend on cellulolytic species to digest xylan, host animals on forage- based diets in this study contained a high proportion of cellulolytic bacterium; enough for

Prevotella to be highly represented in these host animals.

As expected, the predominant cellulolytic ruminal bacterium Ruminococcus flavefaciens and Ruminococcus albus were more abundant in forage-based diets and accounted for

39% and 40% of the variation between diet types, respectively (p<0.05). Additionally, this study identified Ruminococcus bromii attributing to 20% of the variation in read counts among diet types. These three microorganisms degrade cellulose and xylan

(hemicellulose) but not starch; supporting their increased presence in forage-based diets

(Stewart et al., 1997). Other microorganisms maintain the ability to degrade starch and are therefore are found in host animals fed concentrate-based diets, i.e. Ruminobacter amylophilus. Diet classification explained only 2% of the variation among read counts in

Proteobacteria; therefore OTUs of Fibrobacter and Succinivibrio were not analyzed.

Ecology and biology research of Dialister succinatiphilus, Dialister invisus, and the phylum Actinobacteria are limited; however, in this study have been found associated with the differences between diet classifications.

Species and Domestication Status

Research analyzing differences in gut microbiomes of different species which includes both wild and domesticated animals is lacking. This study provides some of the first insights of variation in microbial taxa between species and domestication status. In order

75

to remove diet effects from the model, only host animals fed a forage-based diet were included in the analysis. The normalized count matrix included 7 phyla and 28 total host animals (8 cattle, 8 sheep, 4 bison, 4 red deer, and 4 white-tailed deer). There was a total of 462 OTUs represented and 129 OTUs were shared among all host species. Host species included cattle, sheep, bison, red deer, and white-tailed deer and consisted of 265,

279, 204, 267, and 212 total OTUs per species, respectively. As shown in Table 3.4, host species accounts for greater than 10% of the variance among read counts in 6 phyla, and

2 phyla with domestication status, wild or domestic.

The Fibrobacteres/Acidobacteria phylum consisted of 2 OTUs, and host species explained 55% of the variance among read counts. Fibrobacter succinogenes was most abundant in cattle and bison (p<0.05). This taxa is an important cellulose digester, possibly demonstrating that these host species more efficient at cellulose and lignin fermentation (Stewart et al., 1997). Nelson et al. (2003) was unable to identify any

Fibrobacter succinogenes from rumen contents of African Zebu cattle (Bos indicus), eland (Taurotragus oryx), zebra (Equus hippotigris), Thompson’s gazelle (Eudorcas thomsoni) or Grant’s gazelle (Nanger granti).

The phylum Actinobacteria was comprised of a total of 56 OTUs. Host species accounted for greater than 20% of the variance among read counts in 3 OTUs including: Olsenella

(2) and Bifidobacterium (Table 3.9). Olsenella OTUs were both more abundant in cattle and bison (p<0.05). Olsenella was first isolated from sheep rumen, and is a lactic acid

76

bacterium, increasing in abundance during acidosis, suggesting cattle and bison in the current study had higher lactate concentrations than other host species (Petri et al., 2013;

Kraatz et al., 2011).

A total of 104 OTUs were in the Bacteroidetes/Chlorobi group phylum; 74 OTU were represented in cattle, 75 OTU sheep, 54 OTU bison, 70 OTU red deer, and 62 OTU white-tailed deer with 42 OTU shared among all host species. Host species explained greater than 20% of the variation in read counts among 21 OTUs; all of which were

Prevotella (Table 3.10). Prevotella oris and Prevotella maculosa were highly represented in cattle and sheep, and almost absent in the other host species. Results support Nelson et al. (2003) in which Prevotella spp. was not identified in wild host species. Host domestication status explained greater than 20% of the variance among read counts in 7

OTUs; 4 Prevotella, 2 Paraprevotella, and a Bacteroides/Flavonifractor. Most notably, host species and domestication status explained 53% and 22% of the variance in read counts of Prevotella micans, respectively. P. micans was more abundant in domesticated animals, with sheep hosting more of this species than cattle (P<0.05). Prevotella spp. are sensitive to monensin of which cattle in this study were fed, possibly explaining the higher abundance of Prevotella in sheep than cattle (Stewart et al., 1997; Chen and

Wolin, 1979).

Host species accounted for 47% of the variance among read counts among 4 OTUs.

Among the OTUs, host species accounted for 53% of the variation in read counts in

77

Treponema zioleckii which was more abundant in cattle and red deer (p<0.05). This suggests the presence or need for a selective microbial group, like Treponema to ferment substrates like pectin, insulin, sucrose, and L-arabinose (Stewart et al., 1997).

Firmicutes consisted of a total of 169 OTUs, 123 OTUs were represented in cattle, 112

OTUs sheep, 97 OTUs bison, 121 OTUs red deer, and 95 OTUs white-tailed deer with 72

OTUs shared among all host species. Host species (Table 3.11a/b) and domestication status (Table 3.12) explained greater than 20% of the variance among read counts in 44 and 16 OTUs, respectively. Gracilibacter thermotolerans was more abundant among cattle and bison, and Anaerovibrio lipolyticus in cattle and red deer (p<0.05).

Interestingly, Mitsuokella jalaludinii which hydrolyses phytate was found only in cattle

(Lan et al., 2002). Cattle typically have a higher rumen temperature (38-42°C) than red deer (38-39°C); alternatively, the optimal temperature for fermentation by both G. thermotolerans and M. jalaludinii is higher than that of A. lipolyticus, possibly explaining their differential representation among host species (Privé et al., 2013; Turbill et al.,

2011; Lee et al., 2006; Lan et al., 2002). Acidaminococcus, Ruminococcus albus,

Ruminococcus flavefaciens, and Moryella were associated with domesticated animals, and Butyrivibrio, Oscillibacter, Coprococcus, Selenomnas, Clostridium, and Mahella with wild animals. Domesticated animals’ diet is restricted by the producer while wild animals have ample selection of feed sources. Because of this, domesticated animals are susceptible to sickness and/or diseases based on diet, i.e. grass tetany. Grass tetany is expressed when a ruminant animal eats grasses that are high in water content and potassium and low in other nutrients (magnesium and calcium). In the rumen, microbial

78

bacteria convert trans-aconitate into tricarballylate; however, Acidaminococcus converts trans-aconitate to acetate, carbon dioxide, and hydrogen negating the effects of grass tetany (Cook et al., 1994). Possibly, the higher abundance of Acidaminococcus in domesticated animals suggests a needed avoidance of grass tetany, whereas the need is not present for wild animals. Contrary to this study, Fouts et al. (2012) identified

Oscillibacter and Coprococcus as two of the most abundant bacterial OTUs in cattle (Bos taurus). When comparing elk (Cervus canadensis) and white-tailed deer (Odocoileus virginianus), Oscillibacter was found in low abundance only in white-tailed deer

(Gruninger et al., 2014).

Of the total 94 OTUs comprising the Proteobacteria phylum, host species accounted for greater than 20% of the variance among read counts in Candidatus treponema and

Ruminobacter amylophilus. Sheep and deer had higher representations of Ruminobacter amylophilus (p<0.05). Although detected rarely, R. amylophilus predominantly has serine protease activity, and thrives in an environment of high sodium concentrations, possibly suggesting the sampled sheep and deer might have higher sodium concentrations in their rumen (Stewart et al., 1997).

This study has identified OTUs that cluster within host species or be present in only one domestication status. Five groups have been noted: 1) OTUs unique to cattle, bison, and sheep; 2) OTUs unique to cattle and bison; 3) OTUs unique to deer (red deer and white- tailed deer); 4) OTUs unique to domesticated host species; and 5) OTUs unique to wild

79

host species (Table 3.13). In this study, Collinsella aerofaciens was found solely in cattle and bison species; However, Fibrobacter succinogenes, Olsenella, and Gracilibacter thermotolerans were more abundant among cattle and bison, also. Pectinatus haikarae was unique to deer species. Like Treponema addressed previously, P. haikarae ferments

L-arabinose, further supporting the need for a selective microbial group to ferment this substrate (Caldwell et al., 2013; Stewart et al., 1997). The optimal temperature for P. haikarae is lower than that of average rumen temperatures in cattle and sheep, possibly explaining its absence among those host animals (Caldwell et al., 2013). Orpin et al. (1985) found Methylobacterium spp. in High-artic Svalbard reindeer (Rangifer tarandus playrhynchus), and postulated its presence for the survival of the host animals during winter due to eating low-quality feed. In this study, Methylobacterium spp. were identified only in the deer host species; however, sequence counts were low and not significant (data not shown). Streptococcus bovis is versatile in that it can alter its intercellular pH to mimic ruminal pH changes and can degrade both starch and fiber feedstuffs (Stewart et al., 1997; Russell, 1991a; Russell, 1991b). Wild species feed on a greater variety of feedstuffs and alter the ruminal environment more than domesticated species, suggesting Streptococcus bovis as a possibly important ruminal microbe in wild ruminant species. Streptococcus also leads to lactic acidosis in sheep and cattle, further supporting its extinction in domesticated rumen environments (Stewart et al., 1997; Russell and Hino, 1985; Hungate, 1966; Hungate et al., 1952).

80

Table 3.1: Metadata for Domesticated Host Animals Live Animal Diet1 Breed Sex2 Age3 Collection Date Weight4 Cow 4750 C British x Continental M 1 March 2014 948 Cow 4801 C British x Continental M 1 March 2014 988 Cow 4716 C British x Continental M 1 March 2014 865 Cow 4625 C British x Continental M 1 March 2014 1160 Cow 4759 C British x Continental M 1 March 2014 806 Cow 4707 C British x Continental M 1 March 2014 845 Cow 4476 C British x Continental M 1 March 2014 932 Cow 4737 C British x Continental M 1 March 2014 757 Cow 4691 F British x Continental M 1 March 2014 1077 Cow 4374 F British x Continental M 1 March 2014 1231 Cow 4636 F British x Continental M 1 March 2014 1048 Cow 4710 F British x Continental M 1 March 2014 998 Cow 4712 F British x Continental M 1 March 2014 985 Cow 4753 F British x Continental M 1 March 2014 932 Cow 4768 F British x Continental M 1 March 2014 904 Cow 4785 F British x Continental M 1 March 2014 890 Sheep 1009 F Rambouillet M 0.7 Nov. 2011 71 Sheep 1026 C Rambouillet M 0.7 Nov. 2011 62 Sheep 1101 C Rambouillet 0.7 0.7 Nov. 2011 88 Sheep 1111 C Rambouillet M 0.7 Nov. 2011 82 Sheep 1127 F Rambouillet M 0.7 Nov. 2011 84 Sheep 1208 F Suffolk M 0.7 Nov. 2011 79 Sheep 1220 C Hampshire M 0.7 Nov. 2011 79 Sheep 1239 C Suffolk M 0.7 Nov. 2011 87 Sheep 1248 F Hampshire M 0.7 Nov. 2011 95 Sheep 1366 F Hampshire M 0.7 Nov. 2011 53 Sheep 1396 C Hampshire M 0.7 Nov. 2011 71 Sheep 1397 F Hampshire M 0.7 Nov. 2011 81 Sheep 7429 C Suffolk M 0.7 Nov. 2011 93 Sheep 7505 F Suffolk M 0.7 Nov. 2011 96 Sheep 1003 F Rambouillet M 0.7 Nov. 2011 64 Sheep 1348 C Hampshire M 0.7 Nov. 2011 83 1C-concentrate-based diet; F-forage-based diet 2M-male; F-female 3Age is presented in years 4Live Weight is presented in pounds

81

Table 3.2: Metadata for Wild Host Animals Collection Collection Live Dressed Animal Diet1 Sex2 Age3 Date Time Weight4 Weight5 8:37 AM Bison M2 C M 2.5 Sept. 2014 975 594 9:35 AM Bison M5 C M 2.5 Sept.2014 1200 695 Bison NS1 F F 2.4 Oct. 2012 10:00 AM 520 - 9:00 AM Bison NS3 F M 2 Nov. 2012 569 - 9:00 AM Bison NS4 F M 2 Nov. 2012 608 - 9:00 AM Bison NS9 F M 2 Nov. 2012 560 - Bison WC23 C M 2.5 Oct. 2013 2:30 PM - 767 Bison WC24 C M 2.5 Oct.2013 4:00 PM - 697 Red Deer 1 F F 3 July 2014 10:15 AM - 122 Red Deer 2 F F 3 July 2014 10:35 AM - 135 Red Deer 3 F F 3 July 2014 10:40 AM - 140 Red Deer 4 F F 3 July 2014 11:00 AM - 134 White-tail Deer 1 F F 1.5 Jan. 2013 2:00 PM - - White-tail Deer 2 F M 0.5 Jan. 2013 2:30 PM - - White-tail Deer 3 F F 1.5 Jan. 2013 5:30 PM - - White-tail Deer 4 F M 1.5 Jan. 2013 2:00 PM - - 1C-concentrate-based diet; F-forage-based diet 2M-male; F-female 3Age is presented in years. White-tailed deer were aged using the teeth of the lower jaw (Cain and Wallace, 2003). 4Live Weight is presented in pounds 5Dressed Weight was taken immediately after skinning and evisceration of carcass and is presented in pounds

82

Table 3.3: Ingredients of Host Diets Concentrate Host Animal Species Ingredient1,2,3 Diet Forage Diet Corn 43.6 - HMC 22.3 - Soybean hulls - 37.4 Cattle Rye Haylage - 33.8 DDG 18.5 19.5 Supplement 15.5 9.3 Corn 50.2 - Corn Gluten 10.0 - DDG 1.0 - Sheep Alfalfa Pellets - 67.7 Wheat Middlings 31.0 27.5 Supplement 7.9 4.8 (a) 80.0 Corn/SBM (b) 12.8 - Barley (b) 12.5 - Bison4 Hay + Soybean hulls (ground) (a/b) moderate - Soybean hulls (b) 54.8 - Supplement (b) 19.9 - Pasture Grass (a) moderate (c) 100.0 1Values recorded as percent of total diet 2HMC – High Moisture Corn, DDG – Dried Distillers Grain, SBM – Soybean Meal 3Supplements included small amounts of ingredients in order to meet the animal’s requirement of amino acids, vitamins, and minerals. Cattle were also supplemented with monesin. 4Bison sampled from all three farms are represented: (a) Kentucky Bison Ranch, Bagdad, KY; (b) Northstar Bison Processing, Rice Lake, WI; and (c) Crown Valley Winery, Ste. Genevieve, MO

83

Table 3.4: Average Number of Reads for Each Host Species at Each Analysis Step Raw Quality Filtered Aligned FASTA Host Species FASTA Reads FASTA Reads Reads Cattle 47,549,626 33,227,457 31,050 Sheep 30,157,100 31,840 Bison 45,389,087 31,875,019 20,698 Red Deer 45,179,729 30,375,686 39,619 White-tailed Deer 52,439,347 37,755,619 25,523 Average Total 32,678,043

84

Table 3.5: Informative OTUs used for Diet Classification of White- tailed Deer via Discriminate Function Analysis Average Squared OTU F-Value1 Canonical Correlation1,2 Butyrivibrio fibrisolvens 44.37 0.514 Saccharomonospora viridis 12.79 0.629 Prevotella melaninogenica 23.78 0.768 Ruminococcus flavefaciens 12.03 0.822 Brachybacterium faecium 12.17 0.865 Atopobium fossor 8.91 0.892 Methylobacterium radiotolerans 8.64 0.913 Blautia glucerasea 8.54 0.930 Ruminococcus albus 8.98 0.944 Selenomonas ruminantium 8.49 0.956 1F-value and Average Squared Canonical Correlation were provided in the SAS/STAT® software output. 2The average squared canonical correlation values can be interpreted similarly to R2 values (Introduction to SAS, n.d.).

85

Table 3.6: Number of Represented OTUs, Sequence Count, and Variance Estimates for each Phylum for Host Diet, Species, and Domestication Status Variance Number of OTUs Total Estimate Phylum1 Represented Sequences (%) Variable: DIET Firmicutes 204 39,920 38.77 Actinobacteria 61 5,214 30.51 Bacteroidetes/Chlorobi Group 110 41,237 16.80 Methanobacteria 4 3,313 7.57 Fibrobacteres/Acidobacteria Group 2 5,483 7.10 Proteobacteria 118 2,311 1.66 Variable: SPECIES Fibrobacteres/Acidobacteria Group 2 3,844 80.00 Actinobacteria 56 551 78.66 Bacteroidetes/Chlorobi Group 104 23,851 54.84 Spirochaetes 9 415 46.85 Firmicutes 169 15,620 43.44 Proteobacteria 94 2,173 37.78 Variable: DOMESTICATION STATUS Bacteroidetes/Chlorobi Group 104 23,851 22.25 Firmicutes 169 15,620 17.87 1Phyla are listed highest to lowest variance estimates. Phyla resulting in a 0% variance estimate are not included.

86

Table 3.7: OTU Identification and Variance Estimates of Firmicutes for Host Diet

Variance Estimate OTU1,2 (%) Dialister succinatiphilus 63.34 Dialister invisus 52.62 Allisonella histaminiformans 50.14 Clostridium bolteae 40.07 Acidaminococcus fermentans 39.86 Ruminococcus albus 39.62 Butyrivibrio fibrisolvens 39.38 Ruminococcus flavefaciens 38.93 Blautia glucerasea 36.96 Megamonas hypermegale 36.71 Ruminococcus flavefaciens 36.67 Moryella indoligenes 35.65 Roseburia inulinivorans 35.56 Oscillibacter valericigenes 34.71 Syntrophococcus sucromutans 34.71 Butyrivibrio fibrisolvens 34.60 Pseudoramibacter alactolyticus 32.75 Schwartzia succinivorans 30.45 Butyrivibrio hungatei 30.06 Pectinatus haikarae 29.49 Ruminococcus obeum 28.31 Lactococcus lactis; Lactobacillus brevis 27.75 Gracilibacter thermotolerans 27.65 Selenomonas bovis 27.65 Selenomonas genomosp. 26.83 Selenomonas ruminantium 26.47 Lactobacillus salivarius 26.17 Lactobacillus acidophilus; L. helveticus; L. crispatus 25.85 Howardella ureilytica 25.72 Lactococcus lactis; Lactobacillus rhamnosus; Lactobacillus casei 24.50 Enterococcus inusitatus 23.47 Lactococcus lactis 22.99 Lactobacillus sakei 22.94 Catonella genomosp.; C. morbi 22.23 Eubacterium rectale 21.78 Selenomonas ruminantium 20.62 Ruminococcus bromii 20.57 1Operational Taxonomic Units having greater than 20% variance estimates are presented (p<0.05). 2 Repeated OTUs contain sequences that differ by <97%; therefore being classified as two unique OTUs. OTUs containing multiple taxa represent sequences that matched by >97%; therefore being classified as one unique OTU.

87

Table 3.8: OTU Identification and Variance Estimates of Bacteroidetes/Chlorobi Group for Host Diet

Variance Estimate OTU1,2 (%) Prevotella histicola; P. veroralis; P. melaninogenica 57.73 Prevotella genomosp. 57.57 Prevotella multisaccharivorax 57.21 Prevotella melaninogenica 56.67 Prevotella ruminicola 52.22 Prevotella aurantiaca 50.68 Prevotella genomosp. 50.20 Prevotella dentalis 47.30 Prevotella multiformis 46.84 Prevotella oris 43.04 Prevotella corporis 42.44 Prevotella bryantii 35.35 Prevotella marshii 35.18 Prevotella intermedia 31.99 Prevotella genomosp. 31.93 Prevotella micans 28.00 Prevotella genomosp. 27.02 Prevotella nigrescens 25.97 Prevotella nanceiensis 25.77 Prevotella shahii 25.60 Prevotella genomosp. 25.24 Prevotella oulorum 24.75 Paraprevotella clara 23.30 Prevotella brevis 20.79 Bacteroides salanitronis 20.78 Bacteroides acidifaciens 20.47 Prevotella pleuritidis 20.09 1Operational Taxonomic Units having greater than 20% variance estimates are presented (p<0.05). 2 Repeated OTUs contain sequences that differ by <97%; therefore being classified as two unique OTUs. OTUs containing multiple taxa represent sequences that matched by >97%; therefore being classified as one unique OTU.

88

Table 3.9: OTU Identification and Variance Estimates of Actinobacteria for Host Species

Variance Estimate OTU1 (%) Olsenella uli 40.28 Bifidobacterium longum 31.86 Olsenella genomosp. 24.35 1Operational Taxonomic Units having greater than 20% variance estimates are presented (p<0.05).

89

Table 3.10: OTU Identification and Variance Estimates of Bacteroidetes/Chlorobi Group for Host Species

Variance Estimate OTU1,2 (%) Prevotella oris 58.99 Prevotella maculosa 52.50 Prevotella multisaccharivorax 52.12 Prevotella intermedia 49.06 Prevotella marshii 47.90 Prevotella baroniae 44.47 Prevotella timonensis 41.84 Prevotella shahii 37.58 Prevotella buccalis 37.49 Bacteroides salanitronis 35.63 Bacteroides capillosus; Flavonifractor plautii 32.95 Prevotella genomosp. 32.22 Prevotella ruminicola 30.62 Prevotella genomosp; P. loescheii; Candidatus Prevotella 30.16 Prevotella pleuritidis 29.45 Prevotella dentasini 28.39 Prevotella genomosp. 24.72 Prevotella histicola; P. verorali; P. melaninogenica 24.21 Prevotella genomosp. 22.71 Prevotella melaninogenica 21.11 Prevotella stercorea 20.54 1Operational Taxonomic Units having greater than 20% variance estimates are presented (p<0.05). 2 Repeated OTUs contain sequences that differ by <97%; therefore being classified as two unique OTUs. OTUs containing multiple taxa represent sequences that matched by >97%; therefore being classified as one unique OTU.

90

Table 3.11a: OTU Identification and Variance Estimates of Firmicutes for Host Species*

Variance Estimate OTU1,2 (%) Gracilibacter thermotolerans 79.11 Clostridiales genomosp. 75.50 Anaerovibrio lipolyticus 71.44 Butyrivibrio fibrisolvens 58.14 Selenomonas ruminantium 57.54 Robinsoniella peoriensis 56.40 extructa 55.38 Selenomonas genomosp. 54.74 Mitsuokella jalaludinii 54.04 Clostridium fimetarium 51.87 Ruminococcus flavefaciens 51.07 Pseudobutyrivibrio ruminis; Butyrivibrio fibrisolvens 58.48 Ruminococcus torques 47.44 Selenomonas ruminantium 46.09 Butyrivibrio proteoclasticus 45.30 Clostridium islandicum 41.62 Ruminococcus bromii 40.12 Clostridium indolis 39.12 Ruminococcus flavefaciens 38.57 Blautia glucerasea 38.49 Bulleidia moorei 38.15 *Total list of OUT identification and variance estimates for Firmicutes for host species were split into two tables (Table 3.9a/b). 1Operational Taxonomic Units having greater than 20% variance estimates are presented (p<0.05). 2 Repeated OTUs contain sequences that differ by <97%; therefore being classified as two unique OTUs. OTUs containing multiple taxa represent sequences that matched by >97%; therefore being classified as one unique OTU.

91

Table 3.11b: OTU Identification and Variance Estimates of Firmicutes for Host Species* (Continued)

Variance Estimate OTU1,2 (%) Clostridium bolteae 35.86 Ruminococcus bromii 34.97 Clostridium hathewayi 32.46 Butyrivibrio fibrisolvens 31.57 Ruminococcus flavefaciens 31.20 Clostridium aminovalericum 30.94 Clostridium aminophilum 30.68 Streptococcus spp. 28.65 Butyrivibrio fibrisolvens 28.55 Schwartzia succinivorans 25.66 Clostridium phytofermentans 24.93 Butyrivibrio hungatei 24.04 Selenomonas ruminantium 23.85 Ruminococcus obeum 23.33 Marvinbryantia formatexigens 23.32 Clostridium leptum 22.69 Acidaminococcus fermentans 22.02 Butyrivibrio fibrisolvens; B. proteoclasticus 21.80 Blautia coccoides 21.57 Oscillibacter valericigenes 21.10 Butyrivibrio fibrisolvens 20.86 Selenomonas ruminantium 20.73 Selenomonas ruminantium 20.40 Selenomonas bovis 20.09 *Total list of OUT identification and variance estimates for Firmicutes for host species were split into two tables (Table 3.9a/b). 1Operational Taxonomic Units having greater than 20% variance estimates are presented (p<0.05). 2 Repeated OTUs contain sequences that differ by <97%; therefore being classified as two unique OTUs. OTUs containing multiple taxa represent sequences that matched by >97%; therefore being classified as one unique OTU.

92

Table 3.12: OTU Identification and Variance Estimates of Firmicutes for Host Domestication Status

Variance Estimate OTU1,2 (%) Pseudobutyrivibrio ruminis; Butyrivibrio fibrisolvens 80.18 Acidaminococcus fermentans 50.42 Butyrivibrio fibrisolvens; B. proteoclasticus 48.28 Coprococcus catus 47.04 Butyrivibrio fibrisolvens 44.24 Selenomonas ruminantium 39.00 Oscillibacter valericigenes 38.04 Butyrivibrio hungatei 37.21 Clostridium saccharolyticum; C. methoxybenzovorans; C. indolis; C. amygdalinum; 33.94 Desulfotomaculum guttoideum; C. celerecrescens Ruminococcus albus 32.96 Pseudobutyrivibrio ruminis; Butyrivibrio fibrisolvens 31.69 Ruminococcus flavefaciens 31.21 Mahella australiensis 30.18 Clostridium alkalicellulosi 30.09 Moryella indoligenes 28.02 Butyrivibrio hungatei 23.63 1Operational Taxonomic Units having greater than 20% variance estimates are presented (p<0.05). 2Operational Taxonomic Units containing multiple taxa represent sequences that matched by >97%; therefore being classified as one unique OTU.

93

Table 3.13: OTUs only Present in one Type of Host Domestication Status1 OTU Found only in Cattle, Sheep, and Bison Host Species - Elusimicrobium minutum - Neisseria weaver; N. cinerea; N. meningitides; N. gonorrhoeae OTU Found only in Cattle and Bison Host Species - Collinsella aerofaciens OTU Found only in Red Deer and White-tailed Deer Host Species - Pectinatus haikarae OTU Found only in Domesticated Host Species (Cattle and Sheep) - Alysiella filiformis; A. crassa - Bibersteinia trehalosi - Dialister invisus - Dialister succinatiphilus - Haemophilus influenzae - Mannheimia haemolytica; Bibersteinia trehalosi - Mannheimia haemolytica; M. varigena - caprae- Sharpea azabuensis - Treponema isoptericolens OTU Found only in Wild Host Species (Bison, Red Deer, and White-tailed Deer) - Streptococcus pasteuri; S. gallolyticus; S. equinus; S. lutetiensis; S. bovis; S. infantarius 1 There were sufficient sequence counts representing each listed OTU to be uniquely identified, not by random chance (p<0.05).

94

CHAPTER FOUR

CONCLUSIONS AND FUTURE DIRECTIONS

Overall Conclusions

Associations between the rumen microbiome and biological characteristics of the host animal continue to be discovered; however, currently there still exists an incomplete understanding of the function and ecology of the microbiome. The studies included in this dissertation add to the body of research in cataloging the diversity, distribution, and taxonomic composition among different ruminant host animals and species. To date, the dataset created for this dissertation is one of the largest of its kind. It includes domestic and wild host species being fed two different diet classifications, and DNA from full rumen contents were sequenced using a whole genome sequencing platform, Illumina

GAII. Chapter 2 established that an exponential decay model fit the OTU rank by abundancy plots for each host animal, and the exponential decay curvature was similar across all host species. This suggests that each host species has a similar number of highly frequent and rare OTUs. Two other objectives for Chapter 2 were to determine changes in the total and common OTUs identified among host species as additional host species were added to the analysis. Of the common OTUs, a core group of microbial taxa were identified. As expected, as additional host species were included in the analysis,

95

total OTUs identified increased and common OTUs among all host species decreased.

However, the percentages did not increase or decrease in equal proportions as host species were added. An addition of 1 host species, to total 2 host species in the analysis, resulted in a 34% increase in total OTUs identified and a 34% decrease in common

OTUs. However, the difference between analyzing 3 and 4 host species was less dramatic with a 12% increase in total OTUs identified and a 9% decrease in common OTUs among all host species. One hundred and forty-three OTUs were common among all host species; however, individual sequence counts among host animals were not included.

Henderson et al. (2015) hosts the most complex dataset of ruminant animals, and supported the notion of a ‘core’ microbial community among ruminants (Weimer, 2015;

Jeraldo et al., 2012). For this dissertation, inclusion of an OTU into the ‘core,’ must have been present in >90% of the sampled host animals and have a minimum of 30 sequence counts within a host species. Nine OTUs were identified as ‘core microbial members,’ and included Butyrivibrio, Prevotella, Oscillibacter, and Ruminococcus, all of which have been described as predominant genera in the rumen (Henderson et al., 2015;

Chaucheyras-Durand and Ossa, 2014; Morgavi et al., 2013; Stewart et al., 1997). Also supporting other studies, the methanogen, Methanobrevibacter was identified as a core microbial member (Henderson et al., 2015; Chaucheyras-Durand and Ossa, 2014;

Morgavi et al., 2013; Stewart et al., 1997).

Chapter 3 added to the body of research in identifying microbial diversity among different ruminant host animals. The total 48 ruminant animals sampled represented 2 different diet classifications (concentrate- and forage-based), 5 different host species

96

(cattle, sheep, bison, red deer, and white-tailed deer) and 2 different domestication statuses (domestic and wild). Effects of host diet, species, and domestication status on microbial taxa were identified.

Diet classification explained greater than 10% of the variation in read counts among 3 phyla Firmicutes, Actinobacteria, and Bacteroidetes/Chlorobi group. Within those phyla, diet classifiviation exaplined greater than 20% of the variance among read counts in 66

OTUs. Most identified OTUs were in higher abundance in host animals fed a diet that coordinated with the microbial known function, i.e. the predominant cellulolytic ruminal bacterium Ruminococcus flavefaciens and Ruminococcus albus were more abundant in forage-based diets and accounted for 39% and 40% of the variation among read counts due to diet types, respectively (Singh et al., 2014; Stewart et al., 1997). Interestingly,

Butyrivibrio fibrisolvens, usually associated with starch digestion, was found in higher abundance in forage-based diet. As expected, Prevotella bryantii was found in higher abundance in concentrate-based diets, and contrary to expectation, P. ruminicola and P. brevis were higher in forage-diets. Prevotella spp. depend on cellulolytic species to digest xylan, host animals on forage-based diets in this study contained a high proportion of cellulolytic bacterium, possibly enough for Prevotella to be highly represented in these host animals. Ecology and biology research of Dialister succinatiphilus, Dialister invisus, and the phylum Actinobacteria has been limited; however, in the present study an association with the differences between diet classifications was identified.

97

Host species accounted for greater than 10% of the variation among read counts in 6 phyla Firmicutes, Actinobacteria, Spirochaetes, Proteobacteria,

Fibrobacteres/Acidobacteria group, and Bacteroidetes/Chlorobi group. Within those phyla, host species explained greater than 20% of the variance in read counts in 73

OTUs. Some examples of the findings include: Fibrobacter succinogenes was found most abundant in cattle and bison. This taxa is an important cellulose digester; possibly demonstrating that these host species more efficient at cellulose and lignin fermentation

(Stewart et al., 1997). Olsenella OTUs were more abundant in cattle and bison. Olsenella was first isolated from sheep rumen, and is a lactic acid bacterium; increasing in abundance during acidosis (Petri et al., 2013; Kraatz et al., 2011). Gracilibacter thermotolerans was more abundant among cattle and bison, and Anaerovibrio lipolyticus in cattle and red deer. Sheep and deer had higher representations of Ruminobacter amylophilus. Although detected rarely, R. amylophilus has predominantly serine protease activity, and thrives in an environment of high sodium concentrations, possibly suggesting the sampled sheep and deer might have higher sodium concentrations in their rumen (Stewart et al., 1997).

Domestication status explained greater than 10% of the variation in read counts in 2 phyla, Firmicutes and Bacteroidetes/Chlorobi group. Within those phyla, domestication status accounted for greater than 20% of the difference among read counts in 23 OTUs, supporting Nelson et al.’s (2003) results in which Prevotella spp. was not identified in wild host species. The presence of monensin in the rumen may decrease the abundance of

Prevotella spp. Cattle in this study were supplemented with monensin, possibly

98

explaining the higher abundance of Prevotella in sheep than cattle (Stewart et al., 1997;

Chen and Wolin, 1979). Acidaminococcus, Ruminococcus albus, Ruminococcus flavefaciens, and Moryella were associated with domesticated animals, and Butyrivibrio,

Oscillibacter, Coprococcus, Selenomnas, Clostridium, and Mahella with wild animals.

Unexpectedly, this study also identified OTUs that cluster within host species or are present in only one domestication status. Five groups were noted: 1) OTUs unique to cattle, bison, and sheep; 2) OTUs unique to cattle and bison; 3) OTUs unique to deer (red deer and white-tailed deer); 4) OTUs unique to domesticated host species; and 5) OTUs unique to wild host species. Elusimicrobium and Neisseria spp. were only found in cattle, sheep, and bison, Collinsella aerofaciens was found solely in cattle and bison species, and Pectinatus haikarae was unique to deer species. Nine OTUs were found only in domesticated animals, and Streptococcus spp. was found only in wild animals.

Future Directions

There are approximately 200 species of ruminant animals consisting of approximately 3.5 billion domesticated and 75 million wild anifmals worldwide (Henderson et al., 2015;

McCann et al., 2014; Morgavi et al., 2013). Wild ruminants, as compared to domestic relatives, demonstrate a superior ability to process a high fiber diet; however, most studies compare domestic species to wild species that survive in extreme environments

(Nelson et al., 2003; Orprin et al., 1985; Richmond et al., 1977). This dataset provides the opportunity to compare domestic and wild species that inhabit similar, or the same

99

environment. Upon further funding, additional samples can be sequenced and compared to the other species. Domingue et al. (1991) describes digestion factors and compares sheep to goats and sheep to red deer, but does not consider all three simultaneously.

Additional wild species on forage-based and concentrate-based diets will be collected and analyzed.

Feed accounts for up to 80% of the feed costs for a producer; therefore, in hopes to lower production costs, producers seek to improve their animal’s feed efficiency (Hernandez-

Sanabria et al., 2011). An efficient animal consumes less feed than expected, and maintains growth. Feed efficiency, measured as residual feed intake, is moderately heritable and genetically independent of the aniaml’s growth and body size, making it a desirable trait for selection practices and herd improvement (Carberry et al., 2012). Guan et al. (2008) found higher concentrations of butyrate and valerate in efficient steers, and some studies have identified an association between microbial populations and feed efficiency in cattle (Carberry et al., 2012; Hernandez-Sanabria et al., 2011). Feed efficiency was obtained for the domesticated host species from this dissertation. Highly and lowly efficient animals in two both sheep and cattle can be compared for variations in their microbiome.

Finally, if databases are compiled and expand then considerable interest is in comparing other microbial communities, i.e., protozoa and fungi. Also, distributions of ‘unused’ sequences are of interest. With expanded databases to include plant and host sequence

100

data, alignments of the unused sequences in this project could be used to better understand the amount of sequencing data that truly is unaccounted for. For example, sequence data could be partitioned and identify the percentage of plant sequences, host animal sequences, possible contamination sequences, and ‘unidentifiable’ sequences.

Future analysis may try to form a prediction model to estimate the use of microbial data to identify host diet, species and/or domestication status.

101

APPENDIX

SAS Program: PROC DISCRIM (Step-Wise Discriminate Function)

data one; infile 'C:\Users\tmtb79.TIGERS\Desktop\Step_Discrim_Diet.csv' dlm=',' firstobs=2 lrecl=2000; input DietNum OTU_1 OTU_2 OTU_3 OTU_4 ... n ;

Diet = 'UNK'; if DietNum<21 then Diet ='Con'; if DietNum>20 then Diet ='For'; proc print; run;

proc stepdisc; class Diet; var OTU_1 OTU_2 OTU_3 OTU_4 ... n ; run;

102

SAS Program: PROC DISCRIM

data one; infile 'C:\Users\tmtb79.TIGERS\Desktop\Step_Discrim_Diet.csv' dlm=',' firstobs=2 lrecl=2000; input DietNum OTU_1 OTU_2 OTU_3 OTU_4 ... n ;

Diet = 'UNK'; if DietNum<21 then Diet ='Con'; if DietNum>20 then Diet ='For'; proc print; run; proc discrim data=new testdata=test testlist; class Diet; var OTU_66 OTU_477 OTU_63 OTU_215 OTU_317 OTU_97 OTU_276 OTU_169 OTU_228 OTU_312 ; run;

103

SAS Program: PROC NLIN (Exponential Decay Model)

data ExpDecay; input rank freq; datalines; 1 0.243002545 2 0.075063613 ... n # ; proc print; run; proc nlin; parms b=0.198 c=0.319; model freq = b*exp(-c*rank); run;

104

SAS Program: PROC GLM

data Coef; input Animal$ Species$ b c; datalines; WT1 WT 0.1384 0.1751 WT2 WT 0.0822 0.1036 ... N N # # ; proc print; run; proc glm; class Animal Species; model b = Species; run;

105

PERL Script: Total OTUs and Common OTUs

#command line example: perl program_name.pl input.csv output.csv use strict; use warnings; use Math::Combinatorics; my $infile = $ARGV[0]; #input a CSV file - should have header on row 1 and Column A should have OTU names. Cell A1 should be blank. The rest should contain 0 or 1 (1 = OTU present, 0 = OTU not present) my $outfile = $ARGV[1]; #output a CSV of results - one line per arrangement of columns my $lines; #direct read-in of infile lines my (@temp, @ColB, @ColC, @ColD, @ColE, @ColF, @Ary1, @Ary2, @Ary3, @Ary4, @Ary5); #arrays for each column (change this later so that I can use as many arrays as I need - since number of columns will vary) my $sum = 0; my $i=0; #counter starts at 0 my $j=0; my $k=0; my $l=0; my $m=0; my $n=0; my $p=0; if (@ARGV ne 2) { die "\nYou did not enter the correct number of parameters. Command line format: \n\n"; }

#trim the newline character chomp $infile; chomp $outfile;

#open files and assign them to references open(READDATA, "<", $infile) or die "\nCould not open $infile. Check spelling and location of file name. Terminating...\n\n"; open(WRITEDATA, ">", $outfile) or die "\nCould not create $outfile. Terminating...\n\n";

$lines = ; #discard header

#read in entire infile while ($lines = ){ chomp $lines; @temp = split ( ',', $lines); #read entire line into temp array, splitting at comma (may need tabs - check each file)

#load each part of the line into an array, ignoring the first column of the CSV $ColB[$i] = $temp[1]; $ColC[$i] = $temp[2]; $ColD[$i] = $temp[3]; $ColE[$i] = $temp[4]; $ColF[$i] = $temp[5]; $i++; }

#recycle counter $i=0;

while ($k < 5){ $k++;

while ($l < 5) { $l++;

106

while ($m < 5){ $m++;

while ($n < 5){ $n++;

while ($p < 5){ $p++;

if ($p ne $n and $p ne $m and $p ne $l and $p ne $k and $n ne $m and $n ne $l and $n ne $k and $m ne $l and $m ne $k and $l ne $k){

#assign arrays to "Ary1" etc names based on where counters are in each loop

if ($k eq 1){ @Ary1 = @ColB; } if ($k eq 2){ @Ary1 = @ColC; } if ($k eq 3){ @Ary1 = @ColD; } if ($k eq 4){ @Ary1 = @ColE; } if ($k eq 5){ @Ary1 = @ColF; }

if ($l eq 1){ @Ary2 = @ColB; } if ($l eq 2){ @Ary2 = @ColC; } if ($l eq 3){ @Ary2 = @ColD; } if ($l eq 4){ @Ary2 = @ColE; } if ($l eq 5){ @Ary2 = @ColF; }

if ($m eq 1){ @Ary3 = @ColB; } if ($m eq 2){ @Ary3 = @ColC; } if ($m eq 3){ @Ary3 = @ColD; } if ($m eq 4){ @Ary3 = @ColE; } if ($m eq 5){ @Ary3 = @ColF; }

107

if ($n eq 1){ @Ary4 = @ColB; } if ($n eq 2){ @Ary4 = @ColC; } if ($n eq 3){ @Ary4 = @ColD; } if ($n eq 4){ @Ary4 = @ColE; } if ($n eq 5){ @Ary4 = @ColF; }

if ($p eq 1){ @Ary5 = @ColB; } if ($p eq 2){ @Ary5 = @ColC; } if ($p eq 3){ @Ary5 = @ColD; } if ($p eq 4){ @Ary5 = @ColE; } if ($p eq 5){ @Ary5 = @ColF; }

#calculation subroutines &Total_Taxa(@Ary1, @Ary2, @Ary3, @Ary4, @Ary5); &OTU_Common(@Ary1, @Ary2, @Ary3, @Ary4, @Ary5);

} #ifp

}#p if ($p eq 5){ $p = 0; }

} #n if ($n eq 5){ $n = 0; }

} #m if ($m eq 5){ $m = 0; } } #l if ($l eq 5){ $l = 0; }

} #k

108

#subroutine for calculations (Total Taxa) sub Total_Taxa {

#print first character - a sum of the first column foreach (@Ary1) { $sum += $_; } print WRITEDATA "$sum,"; $sum = 0;

$j = 0;

#print second character - a sum of all species between first and second column foreach (@Ary1) { if ($Ary1[$i] + $Ary2[$i] ne 0){ $j++; } $i++; } print WRITEDATA "$j,";

#recycle counters $i=0; $j=0;

#print third character - a sum of all species represented in the first three columns foreach (@Ary1) { if (($Ary1[$i] + $Ary2[$i] + $Ary3[$i]) ne 0){ $j++; } $i++; } print WRITEDATA "$j,";

#recycle counters $i=0; $j=0;

#print fourth character - a sum of all species represented in the first four columns foreach (@Ary1) { if (($Ary1[$i] + $Ary2[$i] + $Ary3[$i] +$Ary4[$i]) ne 0){ $j++; } $i++; } print WRITEDATA "$j,";

#recycle counters $i=0; $j=0;

#print fifth character - a sum of all species represented in all five columns foreach (@Ary1) { if (($Ary1[$i] + $Ary2[$i] + $Ary3[$i] +$Ary4[$i] + $Ary5[$i]) ne 0){ $j++; } $i++; } print WRITEDATA "$j \n";

$i = 0; $j = 0;

109

}

#subroutine for calculations (OTU_Common) sub OTU_Common {

#print first character - a sum of the first column foreach (@Ary1) { $sum += $_; } print WRITEDATA "$sum,"; $sum = 0; $j = 0;

#print second character - a sum of all species between first and second column foreach (@Ary1) { if (($Ary1[$i] + $Ary2[$i]) eq 2){ $j++; } $i++; }

print WRITEDATA "$j,"; #recycle counters $i=0; $j=0;

#print third character - a sum of all species represented in the first three columns foreach (@Ary1) { if (($Ary1[$i] + $Ary2[$i] + $Ary3[$i]) eq 3){ $j++; } $i++; }

print WRITEDATA "$j,"; #recycle counters $i=0; $j=0;

#print fourth character - a sum of all species represented in the first four columns foreach (@Ary1) { if (($Ary1[$i] + $Ary2[$i] + $Ary3[$i] + $Ary4[$i]) eq 4){ $j++; } $i++; }

print WRITEDATA "$j,"; #recycle counters $i=0; $j=0;

#print fifth character - a sum of all species represented in all five columns foreach (@Ary1) { if (($Ary1[$i] + $Ary2[$i] + $Ary3[$i] + $Ary4[$i] + $Ary5[$i]) eq 5){

110

$j++; } $i++; }

print WRITEDATA "$j \n"; $i = 0; $j = 0;

}

#close files = good practice close (READDATA); close (WRITEDATA);

111

SAS Program: PROC VARCOMP (Type 1 Method)

data one; infile 'C:\Users\tmtb79.TIGERS\Desktop\Diet_Firm_OTU.csv' dlm=',' firstobs=2; input AnimalID$ Diet$ Species$ Firm_Di1 Firm_Di2 ... n ;

proc print; run;

proc varcomp method=type1; class Diet Species; model Firm_Di1 Firm_Di2 Firm_DiN = Diet Species/fixed=0; run;

112

LITERATURE CITED

Albecia, L., A.I. Martin-Garcia, G. Martinez, C.J. Newbold, D.R. Yáñez-Ruiz. 2013. Nutritional Intervention in Early Life to Manipulate Rumen Microbial Colonization and Methane Output by Kid Goats Postweaning. Journal of Animal Science 91: 4832-4840. DOI: 10.2527/jas.2012-6142.

Acinas, S.G., R. Sarma-Rupavtarm, V. Klepac-Ceraj, M.F. Polz. 2005. PCR-Induced Sequence Artifacts and Bias: Insights from Comparison of Two 16S rRNA Clone Libraries Constructed from the Same Sample. Applied and Environmental Microbiology 71: 8966-8969. DOI: 10.1128/AEM.71.12.8966-8969.2005.

Benson, A.K., S.A. Kelly, R. Legge, F. Ma, S.J. Low, J. Kim, M. Zhang, P.L. Oh, D. Nehrenberg, K. Hua, S.D. Kachman, E.N. Moriyama, J. Walter, D.A. Peterson, D. Pomp. 2010. Individuality in Gut Microbiota Composition is a Complex Polygenic Trait Shaped by Multiple Environmental and Host Genetic Factors. PNAS Early Edition: 1-6. DOI: 10.1073/pnas.1007028107.

Benson, D.A., M. Cavanaugh, K. Clark, I. Karsch-Mizrachi, D.J. Lipman, J. Ostell, E.W. Sayers. 2013. GenBank Nucleic Acids Research 41(Database Issue): D36-42.

Bergman, E.N. 1990. Energy Contributions of Volatile Fatty Acids From the Gastrointestinal Tract in Various Species. American Physiological Society 70: 567-590.

Caldwell, J.M., R. Juvonen, J. Brown, F. Breidt. 2013. Pretinatus sottacetonis sp. Nov., isolated from a commercial pickle spoilage tank. International Journal of Systematic and Evolutionary Microbiology 63: 3609-3616. DOI: 10.1099/ijs.0047886-0.

Carberry, C.A., D.A. Kenny, S. Han, M.S. McCabe, S.M. Waters. 2012. Effect of Phenotypic Residual Feed Intake and Dietary Forage Content on the Rumen Microbial Community of Beef Cattle. Applied and Environmental Microbiology 78: 4949-4958.

Case, R.J., Y. Boucher, I. Dahllöf, C. Holmström, W.F. Doolittle, S. Kjelleberg. 2007. Use of 16S rRNA and rpoB Genes as Molecular Markers for Microbial Ecology Studies. Applied and Environmental Microbiology 73: 278-288.

Chaucheyras-Durand, F. and F. Ossa. 2014. Review: The Rumen Microbiome: Composition, Abundance, Diversity, and New Investigative Tools. The Professional Animal Scientist 30: 1-12.

Chen, M. and M.J. Wolin. 1979. Effect of Monensin and Lasalocid-sodium on the Growth of Methanogenic and Rumen Saccharolytic Bacteria. Applied Environmental Microbiology 38: 72-77.

113

Cole, J.R., Q. Wang, E. Cardenas, J. Fish, B. Chai, R.J. Farris, A.S. Kulam-Syed- Mohideen, D.M. McGarrell, T. Marsh, G.M Garrity et al. 2009. The Ribosomal Database Project: Improved Alignments and New Tools for rRNA Analysis. Nucleic Acids Research 37(Database Issue): D141-145.

Cook, G.M., J.E. Wells, J.B. Russell. 1994. Ability of Acidoaminococcus fermentans to Oxidize trans-Aconitate and Decrease the Accumulation of Tricarbalylate, a Toxic End Product of Ruminal Fermentation. Applied and Environmental Microbiology 60: 2533-2537.

Dahllof, I., H. Baillie, S. Kjelleberg. 2000. rpoB-based Microbial Community Analysis Avoids Limitations Inherent in 16S rRNA gene Intraspecies Heterogeneity. Applied and Environmental Microbiology 66: 3376-3380.

De Barbieri, I., L. Gulino, R.S. Hegarty, V.H. Oddy, A. Maguire, L. Li, A.V. Klieve, D. Ouwerkerk. 2015. Production Attributes of Merino Sheep Genetically Divergent for Wool Growth are Reflected in Differing Rumen Microbiotas. Livestock Science 15: 252-258. DOI: 10.1016/j.livsci.2015.05.023.

Deng, W., D. Xi, H. Mao, M. Wanapat. 2008. The Use of Molecular Techniques Based on Ribosomal RNA and DNA for Rumen Microbial Ecosystem Studies: A Review. Molecular Biology Reports 35: 265-274. DOI: 10.1007/s11033-007- 9079-1.

Deusch, S., B. Tilocca, A. Camarinha-Silva, J. Seifert. 2015. News in Livestock Research – use of Omics-technologies to Study the Microbiota in the Gastrointestinal Tract of Farm Animals. Computational and Structural Biotechnology Jounral 13: 55-63.

Domingue, B.M.F., D.W. Dellow, P.R. Wilson, T.N. Barry. 1991. Comparative Digestion in Deer, Goats, and Sheep. New Zealand Journal of Agricultural Research 34: 45- 53.

Ellison, M.J., G.C. Conant, R.R. Cockrum, K.J. Austin, H. Truong, M. Becchi, W.R. Lamberson, K.M. Cammack. 2013. Diet Alters Both the Structure and Taxonomy of the Ovine Gut Microbial Ecosystem. DNA Research 1-11. DOI: 10.1093/dnares/dst044.

Ewing, B. and Green, P. 1998. Base-calling of Automated Sequencer Traces Using Phred II. Error Probabilities. Genome Research 8: 186-194.

Fernando, S.C., H.T. Purvis II, F.Z. Najar, L.O. Sukharnikov, C.R. Krehbiel, T.G. Nagaraja, B.A. Roe, U. DeSilva. 2010. Rumen Microbial Population Dynamics during Adaptation to a High-Grain Diet. Applied and Environmental Microbiology 76: 7482-7490. DOI: 10.1128/AEM.00388-10.

Flint, H.J. 1997. The Rumen Microbial Ecosystem-Some Recent Developments. Trends Microbiology 5: 483-488.

114

Flint, H.J. and E.A. Bayer. 2008. Plant Cell Wall Breakdown by Anaerobic Microorganisms from the Mammalian Digestive Tract. Ann NY Academy Sciences 1125: 280-288.

Forsythe, P., S. Nobuyuki, T. Dinan, V.H. Taylor, J. Bienenstock. 2010. Mood and Gut Feelings. Brain, Behavior, and Immunity 24: 9-16. DOI: 10.1016/j.bbi.2009.05.058.

Fouts, D.E., S. Szpakowski, J. Purushe, M. Torralba, R.C. Waterman, M.D. MacNeil, L.J. Alexander, K.E. Nelson. 2012. Next Generation Sequencing to Define Prokaryotic ad Fungal Diversity in the Bovine Rumen. PLoS ONE 7: e48289. DOI: 10.1371/journal.pone.0048289.

Freud, R.J. and R.C. Littell. 2000. SAS® System for Regression (3 ed.) SAS Institute, Inc., Cary, NC.

Gaylor, D.W., H.L. Lucas, R.L. Anderson. 1970. Calculation of Expected Mean Squares by the Abbreviated Doolittle and Square Root Methods. Biometrics. 26: 641-655.

Geishauser, T. 1993. An Instrument for Collection and Transfer of Ruminal Fluid and for Administration of Water Soluble Drugs in Adult Cattle. The Bovine Practitioner 27: 38-42.

Glenn, T.C. 2011. Field Guide to Next-Generation DNA Sequencers. Molecular Ecology Resources 11: 759-769. DOI: 10.1111/j.1755-0998-2011.03024.x.

Gruby, D. and H.M.O. Delafond. 1843. Recherches sur des animalcules se developpant en grand nombre dans I’estomac et dans les intestins, pedant la digestion des animaux herbivores et carnivores. Comptes Rendus de I’Academie des Sciences 17: 1304-1308.

Gruninger R.J., C.W. Sensen, T.A. McAllister, R.J. Forster. 2014. Diversity of Rumen Bacteria in Canadian Cervids. PLoS ONE 9: e89682. DOI: 10.1371/journal.pone.0089682.

Guan, L.L., J.D. Nkrumah, J.A. Basarab, S.S. Moore. 2008. Linkage of Microbial Ecology to Phenotype: Correlation of Rumen Microbial Ecology to Cattle’s Feed Efficiency. FEMS Microbiology Letter 288: 85-91. DOI: 10.1111/j.1574- 6968.2008.01343.x.

Hamlin, L.J. and Hungate, R.E. 1956. Culture and Physiology of a Starch-Digesting Bacterium. (Bacteroides amylophilus no. spp.) from the Bovine Rumen. Journal of Bacteriology 72: 548-554.

Handelsman, J. 2004. Metagenomics: Application of Genomics to Uncultured Microorganisms. Microbiology and Molecular Biology Reviews 68: 669-685. DOI: 10.1128/MBR.68.4.669-685.2004.

115

Handelsman, J., M.R. Rondon, S.F. Brady, J. Clardy, R.M. Goodman. 1998. Molecular Biological Access to the Chemistry of Unknown Soil Microbes: A New Frontier for Natural Products. Chemistry and Biology: 5: R245-R249.

Hao. X. and T. Chen. 2012. OTU Analysis Using Metagenomic Shotgun Sequencing Data. PLoS ONE 7: e49785. DOI: 10.1371/journal.pone.0049785.

Hawley, A.W.L., D.G. Peden, W.R. Stricklin. 1981(a). Bison and Hereford Steer Digestion of Sedge Hay. Canadian Journal of Animal Science 61: 165-174.

Hawley, A.W.L., D.G. Peden, H.W. Reynolds, W.R. Stricklin. 1981(b). Bison and Cattle Digestion of Forages from the Slave River Lowlands, Northwest Territories, Canada. Journal of Range Management 34: 126-130.

Henderson, G., F. Cox, S. Kittelmann, V.H. Miri, M. Zethof, S.J. Noel, G.C. Waghorn, P.H. Janssen. 2013. Effect of DNA Extraction Methods and Sampling Techniques on the Apparent Structure of Cow and Sheep Rumen Microbial Communities. PLoS ONE 8: e74787.

Henderson G, F. Cox, S. Ganesh, A. Jonker, W. Young, Global Rumen Census Collaborators, P.H. Janssen. 2015. Rumen Microbial Community Composition Varies with Diet and Host, but a Core Microbiome is Found Across a Wide Geographical Range. Scientific Reports. 5: 14567 DOI:10.1038/srep14567.

Hernandez-Sanabria, E., L.A. Goonewardene, Z. Wang, O.N. Durunna, S.S. Moore, L.L. Guan. 2011. Impact of Feed Efficiency and Diet on the Adaptive Variations in the Bacterial Community in the Rumen Fluid of Cattle Identified by PCR-DGGE and Quantitative Real Time PCR Analysis. Applied Environmental Microbiology. DOI: 10.1128/AEM.05114-11.

Hobson P.N. 1969. Rumen Bacteria In Methods in Microbiology Vol. 3B, ed. J.R. Norris and D.W. Ribbons. Academic Press, London and New York.

Hudson, M.E. 2008. Sequencing Breakthroughs for Genomic Ecology and Evolutionary Biology. Molecular Ecology Resources 8: 3-17. DOI: 10.1111/j.1471- 8286.2007.02019.x.

Hugenholtz, P. 2002. Exploring Prokaryotic Diversity in the Genomic Era. Genome Biology 3: reviews0003.1-0003.8.

Hungate, R.E., W. Smith, T. Bauchop 1970. Formate as an Intermediate in the Rumen Fermentation. Journal of Bacteriology 102: 389-397.

Hungate, R.E. 1966. The Rumen and its Microbes. Academic Press, New York and London.

116

Hungate, R.E., R.W. Dougherty, M.P. Bryant, R.M. Cello. 1952. Microbiological and Physiological Changes Associated with Acute Indigestion in Sheep. Cornell Veterinarian 42: 423-449.

Huson, D.H., D.C. Richter, S. Mitra, A.F. Auch, S.C. Schuster. 2009. BMC Bioinformatics 10(Suppl I):S12. DOI: 10.1186/1471.2105/I0/SI/S12.

Introduction to SAS. UCLA: Statistical Consulting Group. From http://www.ats.ucla.edu/stats/sas/output/sas_discrim (accessed November 6, 2015).

Jami, E. and I. Mizrahi. 2012. Composition and Similarity of Bovine Rumen Microbiota across Individual Animals. PLoS ONE 7: e33306.

Jami, E. A. Israel, A. Kotser, I. Mizrahi. 2013. Exploring the Bovine Rumen Bacterial Community form Birth to Adulthood. The International Society for Microbial Ecology 1-11. DOI: 10.1038/ismej.2013.2.

Jeraldo, P., M. Sipos, N. Chia, J.M. Brulc, A.S. Hillon, M.E. Konkel, C.L. Larson, K.E. Nelson, A. Qu, L.B. Schook, F. Yang, B.A. White. 2012. Quantification of the Relative Roles of Niche and Neutral Processes in Structuring Gastrointestinal Microbiomes. PNAS 109: 9692-9698. DOI: 10.1073/pnas.1206721109.

Kamstra, L.D. 1975. Nutrition of Farm Animals. Kendall/Hunt Publishing Company, Dubuque, IA.

Kim, M., M. Morrison, Z.Yu. 2011. Status of the Phylogenetic Diversity Census of Ruminal Microbiomes. FEMS Microbiology Ecology 76: 49-63.

Kittelmann, S., H. Seedorf, W.A. Walters, J.C. Clemente, R. Knight, J.I. Gordon, P.H. Janssen. 2013. Simultaneous Amplicon Sequencing to Explore Co-Occurrence Patterns of Bacterial, Archaeal, and Eukaryotic Microorganisms in Rumen Microbial Communities. PLoS ONE 8: e47879. DOI: 10.1317/journal.pone.0047879.

Kraatz, M., R.J. Wallace, L. Svensson. 2011. Olsenella umbonata sp. nov., a Microaerotolerant Anaerobic Lactic Acid Bacterium from the Sheep Rumen and Pig Jejunum, and Emended Descriptions of Olsenella, Olsenella uli, and Olesenella profusa. International Journal of Systematic and Evolutionary Microbiology 61: 795-803. DOI 10.1099/ijs.0.022954-0.

Krause, D.O., T.G. Nagaraja, A.D.G. Wright, T.R. Callaway. 2013. Broad-invited Review: Rumen Microbiology: Leading the way in Microbial Ecology. Journal of Animal Science 91: 331-341. DOI: 10.227/jas.2012-5567.

Lan, G.Q., N. Abdullah, S. Jalaludin, Y.W. Ho. 2002. Culture Conditions Influencing Phytase Production of Mitsuokella jalaludinii, a New Bacterial Species from the Rumen of Cattle. Journal of Applied Microbiology 93: 668-674.

117

Langille, M.G.I., J. Zaneveld, J.G. Caporaso, D. McDonald, D. Knights, J.A. Reyes, J.C. Clemente, D.E. Burkepile, R.L.V. Thurber, R. Knight, R.G. Beiko, C. Huttenhower. 2013. Predictive Functional Profiling of Microbial Communities Using 16S rRNA Marker Gene Sequences. Nature Biotechnology 31: 814-823. DOI: 10.1038/nbt.2676.

Langmead, B., C. Trapnell, M. Pop, S.L Salzberg. 2009. Ultrafast and Memory-Efficient Alignment of Shor DNA Sequences to the Human Genome. Genome Biology 10: R25.

Leahy, S.C., W.J. Kelly, E. Altermann, R.S. Ronimus, C.J. Yeoman, D.M. Pacheco, D. Li, Z. Kong, S. McTavish, C. Sang, S.C. Lambie, P.H. Janssen, D. Dey, G.T. Attwood. 2010. The Genome Sequence of the Rumen Methanogen Methanobrevibacter ruminantium Reveals New Possibilities for Controlling Ruminant Methane Emissions. PLoS ONE 5: e8826.

Lee, Y., C.S. Romanek, G.L. Mills, R.C. Davis, W.B. Whitman, J. Wiegel. 2006. Gracilibacter thermotolerans gen. nov., sp. nov., an Anaerobic, Thermotolerant Bacterium from a Constructed Wetland Receiving Acid Sulfate Water. International Journal of Systematic and Evolutionary Microbiology 56: 2089- 2093. DOI: 10.1099/ijs.0.64040-0.

Ley, R.E., C.A. Lozupone, M. Hamady, R. Knight, J.I. Gordon. 2008. Worlds within Worlds: Evolution of the Vertebrate Gut Microbiota. Nature 6: 776-788.

Logares, R., T.H.A. Haverkamp, S. Kumar, A. Lanzén, A.J. Nederbragt, C. Quince, H. Kauserud. 2012. Environmental Microbiology Through the Lens of High- Throughput DNA Sequencing: Synopsis of Current Platforms and Bioinformatics Approaches. Jounral of Microbiological Methods 91: 106-113.

Lu, C., D. Tsementzi, N. Kyripides, T. Read, K.T. Konstantinidis. 2012. Direct Comparisons of Illumina vs. Roche 454 Sequencing Technologies on the Same Microbial Community DNA Sample. PLoS ONE 7: e30087.

Lyte, M. W. Li, N. Opitz, R.P. Gaykema, L.E. Goehler. 2006. Induction of Anxiety-like Behavior in Mice During the Initial Stages of Infection with the Agent of Murine Colonic Hyperplasia Citrobacter rodentium. Physiological Behavior 89: 350-357.

Mallevre, A. 1891. Der Einfliuss der als Gahrungsprodukt der Cellulose Gebildeten essigsaure auf den Gaswechsel. P fluegers Arch. 49: 460-465.

Malmuthuge, N., P.J. Griebel, L.L. Guan. 2015. The Gut Microbiome and its Potential Role in the Development and Function of Newborn Calf Gastrointestinal Tract. Frontiers in Veterinary Science 2. DOI: 10.3389/fvets.2015.00036.

McCann, J.C., T.A. Wickersham, J.J. Loor. 2014. High-Throughput Methods Redefine the Rumen Microbiome and Its Relationship with Nutrition and Metabolism. Bioinformatics and Biology Insights 8: 109-125 DOI 10.4137/BBI.S15389.

118

Morgan, J.L., A.E. Darling, J.A. Eisen. Metagenomic Sequencing of an In Vitro- Simulated Microbial Community. 2010. PLoS ONE 5: e10209.

Morgavi, D.P., W.J. Kelly, P.H. Janssen, G.T. Attwood. 2013. Rumen Microbial (meta)Genomics and its Application to Ruminant Production. Animal 7: 184-201. DOI: 10.1017/S1751731112000419.

Moss, A.R., J. Jouany, J. Newbold. 2000. Methane Production by Ruminants: Its Contribution to Global Warming. Annales De Zootechnie 49:231-253.

Motulsky, H.J. and L.A. Ransnas. 1987. Fitting Curves to Data Using Nonlinear Regression: A Practical and Nonmathematical Review. FASEB Journal 1: 365- 374.

Muro-Reyes, A., H. Gutierrez-Banuelos, L.H. Diaz-Garcia, F.J. Gutierrez-Pina, L.M. Escareno-Sanchez, R. Banuelos-Valenzuela, C.A. Medina-Flores, A.C. Luna. 2011. Potential Environmental Benefits of Residual Feed Intake as Strategy to Mitigate Methane Emissions in Sheep. Journal of Animal and Veterinary Advances 10: 1551-1556.

Muyzer, G., E.C. De Wall, A.G. Uitterlinden. 1993. Profiling of Complex Microbial Populations by Denaturing Gradient Gel Electrophoresis Analysis of Polymerase Chain Reaction-Amplified Genes Coding for 16S rRNA. Applied and Environmental Microbiology. 59: 695-700.

Needleman, S.B. and C.D. Wunsch. 1970. A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins. Journal of Molecular Biology 48: 443-453.

Nelson, K.E., S.H. Zinder, I. Hance, P. Burr, D. Odongo, D. Wasawo, A. Odenyo, R. Bishop. 2003. Phylogenetic Analysis of the Microbial Populations in the Wild Herbivore Gastrointestinal Tract: Insights into an Unexplored Niche. Environmental Microbiology 5: 1212-1220. DOI: 10.1046/j.1462- 2920.2003.00526.x.

Orpin, C.G., S.D. Mathiesen, Y. Greenwood, A.S. Blix. 1985. Seasonal Changes in the Ruminal Microflora of the High-Artic Svalbard Reindeer (Rangifer tarandus platyrhynchus). Applied and Environmental Microbiology 501: 144-151.

Peden, D.G., G.M. Van Dyne, R.W. Rice, R.M. Hansen. 1974. The Trophic Ecology of Bison bison L. on Shortgrass Plains. British Ecological Society 11: 489-497.

Petri, R.M., T. Schwaiger, G.B. Penner, K.A. Beauchemin, R.J. Forster, J.J. McKinnon, T.A. McAllister. 2013. Changes in the Rumen Epimural Bacterial Diversity of Beef Cattle as Affected by Diet and Induced Ruminal Acidosis. Applied and Environmental Microbiology 79. DOI 10.1128/AEM.03983-12.

119

Petrosino, J.F., S. Highlander, R.A. Luna, R.A. Gibbs, J. Versalovic. 2009. Metagenomic Pyrosequencing and Microbial Identification. Clinical Chemistry 55: 856-866.

Prins, R.A., A. Lankhorst, P. Van der Meer, C.J. van Nevel. 1975. Some Characteristics of Anaerovibrio lipolytica a Rumen Lipolytic Organism. Antonie Van Leeuwenhoek 41: 1- 11.

Privé, F., N.N. Kaderbhai, S. Girdwood, H.J. Worgan, E. Pinloche, N.D. Schollan, S.A. Huws, C.J. Newbold. 2013. Identification and Characterization of Three Novel Lipases Belonging to Families II and V from Anaerovibrio lipolyticus 5ST. PLoS ONE 8: e69076.

Qin. J. et al. MeatHIT Consortium. 2010. A Human Gut Microbial Gene Catalogue Established by Metagenomic Sequencing. Nature 464: 59-65.

Quince, C., A. Lanzén, T.P. Curtis, R.J. Davenport, N. Hall, I.M. Head, L.F. Read, W.T. Sloan. 2009. Nature Methods: Brief Communications 6:639-644. DOI: 10.1038/nmeth.1361.

Rey, M., F. Enjalbert, S. Combes, L. Cauquil, O. Bouchez, V. Monteils. 2014. Establishment of Ruminal Bacterial Community in Dairy Calves from Birth to Weaning is Sequential. Journal of Applied Microbiology 116: 245-257. DOI: 10.1111/jam.12405.

Richmond, R.J., R.J. Hudson, R.J. Christropherson. 1977. Comparison of Forage Intake and Digestibility by American Bison, Yak, and Cattle. Acta Theriologica 22: 225- 230.

Ross, E.M., P.J. Moate, C.R. Bath, S.E. Davidson, T.I. Sawbridge, K.M. Guthridge, B.G. Cocks, B.J. Hayes. 2012. High Throughput Whole Rumen Metagenome Profiling Using Untargeted Massively Parallel Sequencing. BMC Genetics 13: 53.

Russell, J.A., N. Dubilier, J.A. Rudgers. 2014. Nature’s Microbiome:Introduction. Molecular Ecology 23: 1225-1237.

Russell, J.B. 1991(a). Resistance of Streptococcus bovis to Acetic Acid at low pH: Relationship Between Intracellular pH and Anion Physiology. Applied Environmental Microbiology 36: 319-329.

Russell, J.B. 1991(b). Intracellular pH of Acid-tolerant Rumen Bacteria. Applied Environmental Microbiology 57: 3383-3384.

Russell, J.B. and T. Hino. 1985. Regulation of Lactate Production in Streptococcus bovis: a Spiraling Effect that Contributes to Rumen Acidosis. Journal of Dairy Science 68: 1712-1721.

Scardovi, V. 1986. Genus Bifidobacterium. In Bergey’s Manual of Systematic Bacteriology, Vol. 2, ed. P.H.A. Sneath, Williams, and Wilkins, Baltimore, MD.

120

Scholz, M.B., C. Lo, P.S.G. Chain. 2012. Next Generation Sequencing and Bioinformatic Bottlenecks: The Current State of Metagenomic Data Analysis. Current Opinion in Biotechnology 23:9-15. DOI: 10.1016/j.copbio.2011.11.013.

Shah, N., H. Tang, T.G. Doak, Y. Ye. 2011. Comparing bacterial communities inferred from 16S rRNA gene sequencing and shotgun metagenomics. Pacific Symposium on Biocomputing 2011: 165-176.

Shendure, J. and H. Ji. 2008. Next-Generation DNA Sequencing. Nature Biotechnology 26: 1135-1145. DOI: 10.1038/nbt1486.

Singh, K.M., P.R. Pandya, A.K. Tripathi, G.R. Patel, S. Parnerkar, R.K. Kothari, C.G. Joshi. 2014. Study of Rumen Metagenome Community Using qPCR under Different Diets. Meta Gene 2: 191-199.

Singh, B., S.K. Gautam, V. Verma, M. Kumar, B. Singh. 2008. Metagenomics in Animal Gastrointestinal Ecosystem: Potential Biotechnological Prospects. Anaerobe 14: 138-144. DOI: 10.1016/j.anaerobe.2008.03.002.

Stackebrandt, E. and Hippe, H. 1986. Transfer of Bacteroides amylophilus to a new Genus Ruminobacter gen. nov. Systems and Applied Microbiology 8:204-207.

Stewart, C.S., H.J. Flynt, M.P. Bryant. 1997. The Rumen Bacteria, P.N. Hobson and C.S. Stewart (ed.), The Rumen Microbial Ecosystem. Blackie Academic and Professional, New York, NY.

Sulak, M., L. Sikorová, J. Jankuvová, P. Javorský, P. Pristaš. 2012. Variability of Actinobacteria, a Minor Component of Rumen Microflora. Folia Microbiology 57: 351-353.

Tajima, K., S. Arai, K. Ogata, T. Nagamine, H. Matsui, M. Nakamura, R.I. Aminov, Y. Benno. 2000. Rumen Bacterial Community Transition During Adaptation to High-Grain Diet. Anaerobe 6: 273-284.

Tap, J., S. Mondot, F. Levenez, E. Pelletire, C. Carson, J.P. Furet, E. Ugarte, R. Muñoz- Tamayo, D.L. Paslier, R. Nalin, J. Dore, M. Leclerc. 2009. Towards the Human Intestinal Microbiota Phylogenetic Core. Environmental Microbiology 11: 2574- 2585.

Towne, G., T.G. Nagaraja, R.C. Cochran, D.L. Harmon, C.E. Owensby, D.W. Kaufam. 1988. Comparisons of Ruminal Fermentation Characteristics and Microbial Populations in Bison and Cattle. Applied and Environmental Microbiology 54: 2510-2514.

Tringe S.G. and Rubin E.M. 2005. Metagenomics: DNA Sequencing of Environmental Samples. Nature Reviews 6:805-814. DOI: 10.1038/nrg1709.

121

Truong, H., D. Li, K. Sajjapongse, G. Conant, M. Becchi. 2014. Large-Scale Pairwise Alignments on GPU Clusters: Exploring the Implementation Space. Journal of Signal Processing Systems: 1-19.

Trut, L.N. 1999. Early Canid Domestication: The Farm-Fox Experiment: Foxes bred for tamability in a 40-year experiment exhibit remarkable transformations that suggest an interplay between behavioral genetics and development. American Scientist 87: 160-169.

Turbill, C., T. Ruf, T. Mang, W. Arnold. 2011. Regulation of Heart Rate and Rumen Temperature in Red Deer: Effects of Season and Food Intake. The Journal of Experimental Biology 214: 963-970. DOI: 10.1242/jeb.052282.

Van Tappeiner, H. 1884. Untersuchungen uber die garung der cellulose insbesondere uber deren losung in darmkanale. Z. Biologly 20: 52-134.

Weimer, PJ. 2015. Redundancy, Resilience, and Host Specificity of the Ruminal Microbiota: Implictions for Engineering IMrpoved Ruminal Fermentations. Frontiers in Microbiology 6:1-16. DOI: 10.3389/fmicb.2015.00296.

Whitford, M.F., R.M. Teather, R.J. Forster. 2001. Phylogenetic Analysis of Methanogens From the Bovine Rumen. BMC Microbiology1:5.

Wintzingerode, F.V., U.B. Göbel, E. Stackebrandt. 1997. Determination of Microbial Diversity in Environmental Samples: Pitfalls of PCR-based rRNA Analysis. FEMS Microbiology Reviews 21: 213-229.

Wooley, J.C., A. Godzik, I. Friedberg. 2010. A Primer on Metagenomics. PLoS ONE 6: e1000667.

Xie, G., G.C. Duff, L.W. Hall, J.D. Allen, C.D. Burrows, J.C. Bernal-Rigoli, S.E. Dowd, V. Guerriero, C.J. Yeoman. 2013. Alteration of Digestive Tract Microbiome in Neonatal Holstein Bull Calves by Bacitracin Methylene Disalicylate Treatment and Scours. Journal of Animal Science 91: 4984-4990. DOI: 10.2527/jas.2013- 6304.

Yu, Z. and M. Morrison. 2004. Improved Extraction of PCR-quality Community DNA from Digesta and Fecal Samples. BioTechniques 36: 808-812.

Zhou M., E. Hernandez-Sanabria, L.L. Guan. 2009. Assessment of the Microbial Ecology of Ruminal Methanogens in Cattle with Different Feed Efficiencies. Applied and Environmental Microbiology. 75: 6524-6533. DOI: 10.1128/AEM.02815-08.

122

VITA

Tasia M. Taxis was born and raised in a small rural town, Dornsife, Pennsylvania, where she grew up hunting, fishing, and rising free-range chickens for meat and egg production with her family. With an interest in learning the complete process of production animals, she also raised two barrows from birth to slaughter during high school. Tasia completed her undergraduate degree at Purdue University in 2008, earning a bachelor’s degree in

Animal Science. During that time she also worked with Dr. Jolena N. (Fleming) Waddell in Dr. Christopher A. Bidwell’s molecular genetics laboratory, and completed an undergraduate research project entitled, “Gene Expression in the Liver and Muscles of

Callipyge Lambs.” After graduation, she completed a summer project at the US Meat

Animal Research Center assisting Dr. Eduardo Casas establish an association of single nucleotide polymorphisms with the incidence of Mycobacterium paratubercilosis (MAP) in cattle. Under the advisement of Drs. Jeremy F. Taylor and Robert L. Weaber, Tasia completed her master’s degree at the University of Missouri in 2011. Her master’s project included a Genome-Wide Association Study in which she identified phenotypic and genetic effects of disposition on beef tenderness and quality attributes. During her master’s program, Tasia also guest lectured at elementary schools, taught biology at

Moberly Area Community College, taught a genomics course at the University of

Missouri, and completed an artificial insemination certificate with Select Sires, Inc. Tasia continued to teach, stay engaged in outreach, and enhance her leadership skills into her doctoral program at the University of Missouri. She has completed the President’s

Academic Leadership Institute of the University of Missouri, become a National

Academies Education Fellow in Life Sciences, and was selected among the top twenty-

123

five graduate students in the nation for the K. Patricia Cross Future Leaders Award.

Tasia’s dissertation research focused on comparing the microbiome between ruminant host animals. In 2015, she will complete her doctorate in Animal Sciences, with a minor in college teaching, under the guidance of Dr. William R. Lamberson. Upon graduation,

Tasia will begin a postdoctoral research position with the USDA, Agricultural Research

Service, National Animal Disease Center, Ruminant Diseases and Immunology Research

Unit in Ames, Iowa. Her work will focus on small non-coding RNA expression between healthy and cattle challenged with viruses and/or bacteria that produce Bovine

Respiratory Disease.

124