<<

Article

Cite This: Environ. Sci. Technol. XXXX, XXX, XXX−XXX pubs.acs.org/est

Diversity of DNA and RNA in Indoor Air As Assessed via Metagenomic Sequencing Karyna Rosario,*,† Noah Fierer,‡,§ Shelly Miller,∥ Julia Luongo,∥ and Mya Breitbart† †College of Marine Science, University of South , Saint Petersburg, Florida 33701, United States ‡Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, Colorado 80309, United States §Cooperative Institute for Research in Environmental Sciences, University of Colorado, Boulder, Colorado 80309, United States ∥Department of Mechanical Engineering, University of Colorado, Boulder, Colorado 80309, United States *S Supporting Information

ABSTRACT: Diverse bacterial and fungal communities inhabit human-occupied buildings and circulate in indoor air; however, viral diversity in these man-made environments remains largely unknown. Here we investigated DNA and RNA viruses circulating in the air of 12 university dormitory rooms by analyzing dust accumulated over a one-year period on heating, ventilation, and air conditioning (HVAC) filters. A metagenomic sequencing approach was used to determine the identity and diversity of viral particles extracted from the HVAC filters. We detected a broad diversity of viruses associated with a range of hosts, including animals, arthropods, bacteria, fungi, humans, plants, and protists, suggesting that disparate organisms can contribute to indoor airborne viral communities. Viral community composition and the distribu- tion of human-infecting papillomaviruses and polyomaviruses were distinct in the different dormitory rooms, indicating that airborne viral communities are variable in human-occupied spaces and appear to reflect differential rates of viral shedding from room occupants. This work significantly expands the known airborne viral diversity found indoors, enabling the design of sensitive and quantitative assays to further investigate specific viruses of interest and providing new insight into the likely sources of viruses found in indoor air.

■ INTRODUCTION viral metagenomic techniques, where viral particles are purified from a given sample prior to extraction and shotgun sequencing Microbes are abundant in indoor air. Human-occupied 15,16 buildings, including residential, work, and public spaces, can of nucleic acids, can be used to expand the known diversity harbor more than 105 bacterial and fungal cells per cubic meter of viruses found indoors. An advantage of viral is of indoor air.1−3 Hundreds to thousands of bacterial and fungal that this technique can be used to survey viral diversity without taxa are found indoors, highlighting the diverse of these a priori knowledge of the viruses present in a given sample, microbial communities.4,5 This realization, combined with the resulting in the detection of novel viruses as well as the fact that we spend >90% of our time indoors,6 has fueled characterization of viral communities (i.e., viromes) in a wide ff range of environments,17 including outdoor air18 and indoor coordinated e orts to investigate the microbiome of the built 19,20 environment (e.g., microBEnet7). Extensive research on air. However, the fairly low viral biomass in indoor air 5 21 bacterial and fungal communities has shown high variability (approximately 10 -like particles per cubic meter ) 22 in community composition across buildings, or even locations presents a challenge for viral metagenomic studies. Though within a given building, due to differences in occupants, most of these viral particles are unlikely to be pathogens, or environmental conditions, geographic location, and many other even human-associated viruses, a more complete assessment of factors.8−10 However, despite decades of research on the the viral diversity circulating in indoor air is broadly relevant to bacteria and fungi that humans are exposed to indoors, there is understanding buildings as microbial ecosystems and the little information regarding the diversity of viruses found in factors that shape indoor exposure to viruses. built environments. Most of the research investigating viruses indoors has Received: August 15, 2017 focused on selected human viral pathogens found on surfaces Revised: December 5, 2017 and in air.11−14 However, known human pathogens are unlikely Accepted: January 3, 2018 to be the only viruses we encounter indoors. The application of Published: January 3, 2018

© XXXX American Chemical Society A DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article

Heating, ventilation, and air conditioning (HVAC) filters, a 8891 Ultrasonic Bath Sonicator, 42 kHz frequency (Cole- which serve as passive, long-term, high volume air samplers in Parmer) for a total of 5 min in 1 to 2 min intervals followed by indoor spaces, have been used successfully to describe the vigorous shaking to detach viral particles from the filter and diversity of airborne bacteria and fungi circulating inside disrupt aggregates. To separate desorbed viral particles from buildings.23−26 Because HVAC filters can be used for integrated cells and debris after sonication, samples were filtered through a sampling of indoor airborne bacterial and fungal commun- 0.22 μm Sterivex filter (Millipore). ities,23 here we evaluated if metagenomic sequencing of viruses To concentrate virus particles, filtrates were subjected to two could also be performed from these filters. For this purpose, rounds of ultracentrifugation, which pelleted most virus-like viral purification strategies were combined with high- particles as assessed through SYBR Gold staining and throughput sequencing on a subset of HVAC filters previously epifluorescence microscopy.31,32 For this purpose, five 35 mL used to investigate bacterial and fungal communities inside a aliquots of filtrate per sample were placed into 25 × 89 mm University of Colorado dormitory building.24 By using a ultracentrifuge tubes (Beckman Coulter, Inc.) and ultra- replicated room design, this study aimed to determine what centrifuged at 28,000 rpm in a SW28Ti swinging bucket types of viruses circulate in indoor air, how viral communities rotor (Beckman Coulter, Inc.) for 4 h at 4 °C. Supernatants vary across rooms within a single building, and how these were directly decanted into a second set of ultracentrifuge tubes differences relate to the room’s occupants. To our knowledge, and centrifuged once more at 28,000 rpm for 3 h at 4 °C. this study represents the most comprehensive investigation of Resulting viral pellets in each of the tubes after the first round the DNA and RNA viral diversity found inside buildings. of ultracentrifugation were resuspended with 100 μL of SM Although many studies have focused on identifying human viral buffer and soaked overnight at 4 °C. The SM buffer containing pathogens in indoor air, this study reveals that indoor air the resuspended viral pellets from the first round was harbors a diversity of viruses associated with a wide range of subsequently used to resuspend the pellets from the second potential hosts. ultracentrifuge round by soaking for at least 6 h at 4 °C. Approximately 500 μL of total resuspended viral pellets per ■ MATERIALS AND METHODS sample after two rounds of ultracentrifugation were pooled. Sample Collection and Processing. This study utilized a Free nucleic acids were removed from the viral concentrates by subset of HVAC fan coil unit filters collected for a previous incubating resuspended pellets at 37 °C for 2 h with a nuclease project investigating bacterial and fungal diversity in dormitory cocktail consisting of 1X Turbo DNase Buffer (Ambion), 21U rooms.24 The filters were typical residential filters with a MERV of Turbo DNase (Ambion), 4.5U of Baseline-ZERO DNase 8 rating, which are on average 70−80% efficient at collecting (Epicenter), 112.5U Benzonase (EMD Millipore), and 33,34 3−10 μm particles.27 The HVAC filters were installed 12 10 μg/mL RNase A (Sigma-Aldrich). Nucleases were months prior to collection within each dormitory room housing inactivated with 20 mM EDTA prior to nucleic acid extraction undergraduate students on the University of Colorado campus from viral particles. in Boulder, Colorado, USA. Each room was equipped with a fan Because a diversity of contaminants, including viruses, can be coil unit that heated or cooled recirculated room air (no outside found in laboratory reagents and commercially available nucleic 35−38 air) at a constant volumetric flow rate. The filters from each fan acid extraction and amplification kits, a negative control coil unit only filtered the air from an individual room, allowing sample containing SM buffer alone was included. This negative for a time-integrated aerosol sample from each room. Because control was processed alongside the filter samples from the each wing of the dormitory building had a separate HVAC sonication step onward to control for unforeseen contami- system, filters for viral metagenomic analysis were collected nation in laboratory reagents that can potentially impact from the East wing to minimize variability due to HVAC metagenomic analyses of low biomass samples. Unfortunately, a system differences. Selected HVAC filters originated from six blank HVAC filter from the original batch installed in the male and six female double-occupancy rooms. In addition to dormitory rooms was not available to control for contaminants these 12 filters from individual rooms, a pooled sample of eight present in the filter material. However, any viral contamination filters from the North and West wings of the building was initially present in the filters should be obscured after collecting processed to detect potential viruses present in other wings a passive, long-term (1 year), integrated air sample. with separate HVAC systems. HVAC filters were stored at −20 Nucleic Acid Extraction and Metagenomic Library °C within 5 days of collection until further processing. Preparation. Viral DNA and RNA were extracted from 200 Virus particles were partially purified from HVAC filters and μL of purified viral particles using the QIAmp MinElute Virus concentrated to increase the probability of detecting viral Spin (Qiagen) and RNeasy (Qiagen) kits, respectively, sequences through high-throughput sequencing by adapting following manufacturer’s protocols. For viral RNA extraction, methods that have been used to investigate viromes from the on-column DNase digestion was performed according to various environmental matrices including soil, water, and air as manufacturer’s recommendations. RNA was reverse-transcribed well as methods from the biomedical field.14,28−30 To do this, using the SuperScript III First Strand Synthesis System eight pieces (8 × 8 cm) were cut from each individual filter (Invitrogen) with random hexamers. Second-strand synthesis using a sterile scalpel, to yield a total area of 64 × 64 cm. For was performed on the resulting cDNA using the Klenow the pooled sample, a single 8 × 8 cm piece from each of the Fragment DNA polymerase (New England Biolabs) following eight filters was processed. Filter sections were shredded into manufacturer’s instructions and the resulting products were smaller pieces using scissors, which were cleaned between filters cleaned using the AMPure XP Purification system (Beckman using 10% bleach followed by ethanol flame sterilization. The Coulter). DNA and cDNA samples were fragmented to 300 bp filter pieces were suspended in 225 mL of sterilized, cold using a Covaris M220 instrument. Next-generation sequencing suspension medium (SM) buffer [100 mM NaCl, 8 mM (NGS) library construction was performed with the Accel-NGS MgSO4·7H2O, 50 mM Tris-Cl (pH = 7.5), 0.01% (w/v) 1S Plus DNA Library Kit for Illumina Platforms (Swift gelatin] and placed on ice for at least 5 min before sonication in Biosciences), which allows low DNA/cDNA inputs to

B DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article

Figure 1. Heatmap showing the distribution of viral OTUs identified in rooms occupied by male (M1−M6) and female (F1−F6) students as well as those identified in the pooled sample (P) according to taxonomic groups. Bar graph on the right shows the percentage of total viral OTUs identified in rooms that were classified within a given group. Colors on the left panel highlight DNA (blue) and RNA (orange) viral groups. DNA viruses are further distinguished by (dark blue) and eukaryotic (light blue) viral groups. The color scale on the heatmap represents relative low (dark purple) to high (yellow) abundances based on the total number of viral OTUs identified in a given room. Gray color on the heatmap indicates taxonomic groups that were not detected in a given room. “CRESS DNA” refers to circular replication-associated protein encoding single-stranded DNA viruses that form a diverse and evolutionarily related group. “Uncl” stands for unclassified. circumvent the introduction of biases while trying to obtain the iVirus resource for analysis of viromic data.43 Briefly, raw sufficient template for NGS.39 The Accel-NGS 1S Plus kit can sequences were trimmed for quality and to remove indexing also recover single-stranded DNA (ssDNA) templates,40 adapters using the Trimmomatic App version 0.35.0 with eliminating the need for multiple displacement amplification, default parameters,44 except for a read head crop of 10 bp which is strongly biased toward small, circular templates.41 instead of zero. The quality of the trimmed sequences was Library construction and multiplexing was performed using the verified with the FastQC-plus App version 0.10.1.45 Quality- Accel-NGS 1S Plus kit following manufacturer’s instructions for filtered sequences were then assembled using the SPAdes DNA inputs <1 ng/μL and 20 cycles of indexing PCR. A total assembler App version 3.6.0 with default parameters, k-mer of 28 libraries, including separate DNA and cDNA from air length of 55, and the metagenomics assembly option.46 Contigs filters from each of the 12 individual rooms, a pooled sample, larger than 100 bp were selected for further analyses. and a negative control, were paired-end sequenced (2 × 250 Contaminant contig sequences were removed from each data bp) on a MiSeq System (Illumina) at the University of set by comparing (BLASTn, e-value < 0.00001) against a Colorado BioFrontiers Next-Gen Sequencing core facility. database containing assembled sequences from the negative Sequences are available at the NCBI Sequence Read Archive control library. database (study number SRP113244; accession numbers After removing sequences with significant matches in the SRR5853122−SRR5853149). negative control database, contig sequences from each library Sequence Analysis. Sequences were processed using were compared (BLASTx, e-value < 0.001) against a viral bioinformatic applications (Apps) available through the protein database containingsequencesfromtheNCBI CyVerse cyberinfrastructure42 and the pipeline suggested by Reference Sequence (RefSeq) database (RefSeq Release

C DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article

Table 1. Specific Viral OTUs Identified in at Least One Third of the Dormitory Rooms

BLAST match/OTU group Taxonomic group Host group Source type BLASTx similarity (%) Positive rooms (%) Pseudomonas group 1 phages Gammaproteobacteria Soil 60−93 83 Propionibacterium acnes phages Actinobacteria Human 100 67 Staphylococcus Sep9-like phages Siphoviridae Firmicutes Human 57−100 67 Lactococcus 936 sensu lato phages Siphoviridae Firmicutes Dairy 95−100 58 Bacillus phage G Myoviridae Firmicutes Soil 50−88 58 Staphylococcus Twortlikevirus phages Myoviridae Firmicutes Sewage 54−100 58 Actinobacteriophage cluster A Siphoviridae Actinobacteria Soil 56−80 50 Streptococcus group 1 phages Firmicutes Human 50−99 42 Lactococcus C2-like phages Siphoviridae Firmicutes Dairy 94−99 42 Sinorhizobium phages Myoviridae Alphaproteobacteria Soil 56−70 42 Actinobacteriophage cluster BD Siphoviridae Actinobacteria Soil 54−75 42 Lactococcus phages (unclassified) Siphoviridae Firmicutes Dairy 57−96 33 Actinobacteriophage cluster F Siphoviridae Actinobacteria Soil 59−91 33 Actinobacteriophage cluster L Siphoviridae Actinobacteria Soil 54−73 33 Clavibacter phages Siphoviridae Actinobacteria Soil 58−75 33 Actinobacteriophage cluster C Myoviridae Actinobacteria Soil 49−76 33 Enterobacteriaceae T4-like phages Myoviridae Gammaproteobacteria Sewage 59−92 33 Bacillus SPO1virus Myoviridae Firmicutes Soil 44−71 33 Actinobacteriophage cluster BE Siphoviridae Actinobacteria Soil 55−83 33 crAssphage Unclassified dsDNA Bacteroidetes Human 60−100 33 Invertebrate iridescent viruses Irridoviridae Invertebrate Arthropod 47−99 67 Human papillomavirus Human Human 77−100 67 Animal feces associated gemycircularvirus Unknown Animal 52−100 33 Sewage associated gemycircularvirus Genomoviridae Unknown Sewage 79−86 33 Insect densovirus Insect Arthropod 46−66 33 Plant tymovirus Plant Plant 64−93 83 Human Retroviridae Human Human 55−99 83 Ryegrass mottle virus Sobemovirus Ryegrass Plant 86−100 67 mild green Tobacco Plant 98−100 67 Virgaviridae Tobacco Plant 99−100 50 Insect picorna-like virus Insect Arthropod 47−99 50 Crustacean picorna-like virus Picornavirales Crustacean Arthropod 50−78 50 Turnip vein-clearing virus Virgaviridae Turnip Plant 98−100 42 Aphis glycines virus 1 Picornavirales Insect Arthropod 47−92 42 Pepper mild mottle virus Virgaviridae Pepper Plant 98−100 42 Clover Potexvirus Clover Plant 95−100 33 number 79, https://www.ncbi.nlm.nih.gov/refseq/) as well as represented a cohesive group, such as human papillomaviruses, sequences recently reported from RNA viruses in invertebrates to look for general trends at higher taxonomic levels. Sequences and environmental samples.47,48 Contig sequences with similar to infecting Actinobacteria from soil significant matches in the viral database were then compared were grouped according to clusters classified within the against the GenBank nonredundant (nr) database (BLASTx, e- Actinobacteriophage Database50 (http://phagesdb.org/). An value 0.001; downloaded December 2016) to remove contig OTU table (summarized in Supplemental File S1) was sequences that had higher identities with nonviral sequences constructed by recording the presence or absence of each (i.e., false positives). Final BLAST results were inspected using viral OTU in each dormitory room. MEGAN6 Community Edition.49 To err on the side of caution, Sequences similar to human papillomaviruses (HPVs) and contig sequences from the negative control library were also polyomaviruses (HPyVs) were further investigated given their compared against the viral and nr databases (BLASTx, e-value widespread nature in the human population. To do so, 0.001) to identify potential viral contaminants. If one of the top approximately 1450 and 650 HPV and HPyV genomes, three BLAST matches for a given contig sequence in the respectively, were downloaded from GenBank in April 2017. sample libraries included a virus identified in the negative Genome sequences were then dereplicated based on a 95% control or phylogenetically related viruses, the sequence was identity cutoff using CD-HIT.51 Quality-filtered forward reads removed from further analysis (Supplemental File S1). from each of the DNA libraries were compared against curated After false positives and potential viral contaminants were databases containing 247 HPV and 28 HPyV genomes removed, sequences that could be confidently identified as viral (BLASTn, e-value < 0.00001). Sequences with significant based on best BLAST matches to known viral sequences were matches to these databases were then compared against the organized into operational taxonomic units (OTUs). Here, viral GenBank nucleotide (nt) database (BLASTn, e-value < 0.001) OTUs represent either a single species or a group of related to eliminate false positives. The number of HPV- and HPyV- viruses in cases where BLAST matches could not be confidently related sequences as well as the HPV types and HPyV species assigned to a given viral species and/or when sequences were recorded (Supplemental File S1). Finally, the classification

D DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article of all HPV types detected in dormitory rooms were verified ments has been linked to human occupancy.9,66 Propionibacte- using the Papillomavirus Episteme database.52 rium and Staphylococcus bacteriophages are among the most abundant viruses found on healthy human skin,62,63,67 where ■ RESULTS AND DISCUSSION they have been linked to the modulation of P. acnes population 68 Overview. Here we investigated viruses circulating in the air structure. Likewise, the detection of a Propionibacterium bacteriophage through metagenomic approaches in clean of university dormitory rooms by performing metagenomic 19 sequencing on partially purified VLPs from dust accumulated rooms has been linked to human presence in the rooms. fi fi CrAssphage, which is suspected to infect a Bacteroides species, is on HVAC lters over a one-year period. Speci cally, we 69 analyzed individual HVAC filters from 12 double occupancy widespread in human fecal metagenomes. Together these fi rooms housing either female (n = 6) or male (n = 6) students results highlight that the identi ed bacteriophages in dormitory fi in the East wing of the dormitory, as well as a pooled sample room HVAC lters are concordant with bacteria that are from eight rooms located in the North and West wings of the commonly found indoors, including environmental and human- building that used different HVAC systems. A total of 215 viral associated taxa. OTUs, the majority of which represented DNA viruses (72%), Diversity of Airborne Eukaryotic Viruses Circulating were detected across all air filters after examining top BLASTx in Dormitory Rooms. Although bacteriophages dominated matches against the GenBank nr database (Supplemental File the viral diversity captured in dormitory rooms, eukaryotic S1). Due to low amino acid sequence identities to known viral viruses were also readily detected (Figure 1). Eukaryotic viral species (as low as 44% in some cases), most OTUs were OTUs that were present in more than 50% of the rooms classified to only the family or order level. Up to 79 viral OTUs included putative arthropod-infecting viruses from the family were detected on individual HVAC filters, with both female- and order Picornavirales, plant pathogens from the and male-occupied rooms having approximately 40 OTUs per genus Sobemovirus as well as those from the Tymoviridae and room. Identified viral OTUs indicate that bacteriophages and Virgaviridae families, and human viruses from the Retroviridae eukaryotic viruses spanning several viral families and genome and Papillomaviridae families (Table 1). Viral OTUs represent- types circulate in indoor air (Figure 1). ing members of the family Genomoviridae were also detected in Bacteriophages Dominate Airborne Viral Community more than half of the rooms. Because the only cultured Diversity in Dormitory Rooms. The majority (62%) of representative of the family Genomoviridae was isolated from a 70 identified viral OTUs in air filters represented bacteriophages. fungus and sequences similar to proteins encoded by these 71,72 This was not surprising considering that bacteria are abundant viruses have been identified within fungal genomes, in indoor air.2,21 Viral OTUs were dominated by tailed members of this family are suspected to infect fungi. However, bacteriophages (order ) from the Siphoviridae and a definitive host has not been confirmed for most species of 73 Myoviridae families, which were the only viral families detected Genomoviridae. in every room (Figure 1). Although the dominance of tailed Most of the eukaryotic viral OTUs represented putative bacteriophages in indoor air has not been previously noted, arthropod-infecting viruses from the order Picornavirales and tailed bacteriophages tend to dominate environmental viral plant viruses from the family Virgaviridae (Figure 1). The communities described from soil53 and water54,55 as well majority of the Virgaviridae-related sequences had high amino human-associated viromes. In particular, members of the acid level identities (≥98%) with proteins encoded by members Siphoviridae and Myoviridae families are abundant in human- of the genus . In contrast, sequences similar to associated viromes including the gut,56−59 respiratory tract,60 putative arthropod-infecting viruses from the order Picornavir- saliva,61 and skin.62,63 However, the abundance of tailed ales had low amino acid sequence identities to recently reported bacteriophages might be skewed by current knowledge of viral genomes from invertebrates48 (Supplemental File S1). bacteriophage diversity because >90% of reference bacterio- Therefore, viral sequences identified here, including a novel, ∼ phage genome sequences in GenBank (https://www.ncbi.nlm. 10 kb genome (will be described in a separate paper) that is nih.gov/genome/viruses/) belong to the order Caudovirales. most similar to viruses identified in Drosophila and ants, Nevertheless, our results indicate that a broad diversity of tailed indicate that RNA viral diversity has yet to be fully explored. bacteriophages are ubiquitous in dormitory room air. Human endogenous were the most widespread The putative host phyla identified for the observed group of human-associated viral sequences found in dormitory bacteriophage OTUs included Proteobacteria (45%), Actino- rooms (Table 1). It is possible that these endogenous retroviral bacteria (30%), Firmicutes (23%), Bacteroidetes (0.75%), and sequences represent remnant viral sequences integrated into Aquificae (0.75%) (Supplemental File S1). These putative host cellular genomes rather than exogenous viruses because groups, with the exception of Aquificae, frequently dominate retroviral sequences are among the transposable elements indoor microbiomes and were widely detected in dormitory that constitute 50% of the human genome.74,75 However, the rooms.24 The most widespread viral OTUs potentially infecting detected retroviral sequences were most similar to human Proteobacteria, including Pseudomonas and Sinorhizobium endogenous retrovirus K (HERV-K) and multiple sclerosis- bacteriophages, were most similar to bacteriophages isolated associated retrovirus (MSRV) and were mainly detected in the from environmental sources (Table 1). In addition, some of the RNA libraries. Expression of structural genes and detection of most widespread viral OTUs were similar to bacteriophages virus-like particles has suggested that the recently acquired infecting Actinobacteria, such as Propionibacterium acnes HERV-K can become active under certain physiological bacteriophages, as well as those infecting Firmicutes (Staph- circumstances.76−79 In addition, there has been debate about ylococcus and Streptococcus bacteriophages) and potentially whether or not MSRV can be exogenous and persist as an Bacteroidetes species (crAssphage) (Table 1). Bacterial species extracellular particle.80 Due to the possibility of expression of from most of these genera, including Propionibacterium, human endogenous retroviral sequences combined with low Staphylococcus, Streptococcus and Bacteroides, are part of the amino acid identities (as low as 55%) to known retroviral human microbiome64,65 and their presence in indoor environ- sequences in some cases, this OTU was retained in the

E DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article

Figure 2. Distribution of human-infecting viruses from the Papillomaviridae and families identified in rooms occupied by male (M1− M6) and female (F1−F6) students as well as those identified in the pooled sample (P). The distribution of human papillomaviruses (HPVs) in each room is shown as a heatmap. Specific HPV types are listed on the left panel and colors highlight different genera, including Alphapapillomavirus (green), Betapapillomavirus (tan), and Gammapapillomavirus (light brown). ** indicates HPV types that cannot be confidently assigned because detected sequences share low nucleotide identity (<80%) with known types. The color scale on the heatmap represents relative low (dark purple) to high (yellow) abundances of HPV types based on the total number of HPV-related sequences identified in a given room. Gray color on the heatmap indicates HPV types that were not detected in a given room. Because only a single human polyomavirus (HPyV) species was identified in positive rooms, identified species in each room are listed at the bottom of the heatmap, including polyomavirus 6 (PyV6), MW polyomavirus (MWPyV), and Merkel cell polyomavirus (MCPyV). ‘X’ indicates rooms that did not have any HPyV-related sequences. analyses. Nevertheless, care should be taken when interpreting in indoor spaces, including hundreds of species found in the presence of endogenous retroviruses in viral metagenomic individual homes.81 In addition, dust samples collected from studies as sequences might represent cellular DNA. surfaces inside homes contain pollen representing a diversity of The detection of diverse eukaryotic viruses in dormitory plant species82 and outdoor vegetation may affect the rooms suggests that disparate organisms contribute to indoor composition of indoor bacterial communities.83 Moreover, airborne viral communities. A high diversity of arthropods live bacterial species potentially associated with plants and insects

F DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article were identified in the dormitory rooms.24 Therefore, plants and alphapapillomavirus that has been classified as high-risk due arthropods are likely to be significant sources of airborne to its oncogenic potential,113,114 was detected in a male- indoor bacteria and viruses. This study suggests that fungi also occupied room. This high-risk HPV type has been found at low serve as a source of airborne viruses, as several OTUs most prevalence compared to other HPV types in healthy women similar to fungal viruses were identified, including RNA viruses and men.103,114,115 These results indicate that a high diversity of (members of the , Hypoviridae, and HPVs circulates in indoor air, including HPV types with low families as well as unclassified ssRNA and dsRNA viruses) and prevalence in the general population that might be of clinical potentially ssDNA viruses (family Genomoviridae, see above). relevance. Unexpectedly, one of the rooms contained sequences similar to In contrast to HPVs, HPyVs were mainly restricted to male- feline papillomavirus type 2 (FePV2), which infects domestic occupied rooms and, although coinfection with multiple HPyVs cats.84 The presence of nonhuman occupants, such as pets, can is common,109,116 only a single HPyV species was detected in affect indoor bacterial communities9 and the detection of each positive room (Figure 2). Three HPyV species, namely FePV2 suggests that pets may also influence indoor viral HPyV6, MW polyomavirus (MWPyV), and Merkel cell diversity. However, because university policies prohibited pets polyomavirus (MCPyV), were detected. HPyV6 and MWPyC in these dormitory rooms, it is unknown if the detection of were discovered in skin and fecal samples, respectively, from FePV2 reflects the presence of an illicit cat in the dormitory healthy individuals,109,117 whereas MCPyV was originally room or resulted from external contact between human identified in Merkel cell carcinoma118 and later established as occupants and cats. a widespread cutaneous virus in the human population.94 We Diversity of Airborne Human Papillomaviruses and assembled a complete MCPyV genome that shared 99% Polyomaviruses in Dormitory Rooms. Nonenveloped genome-wide pairwise identity to a genome recovered from the dsDNA viruses from the Papillomaviridae and Polyomaviridae skin of a healthy individual109 (Accession number HM011544) families, including human papillomaviruses (HPVs) and (Supplemental File S2). The prevalence of MWPyV in the polyomaviruses (HPyVs), were widespread in the dormitory general population and environmental samples is low compared rooms. These two groups of viruses constitute part of the to other HPyVs like MCPyV;97,109,119 however, MWPyV human-associated virome in both healthy and disease states. seemed to be more widespread in dormitory room air than HPVs, which have evolved replication strategies to thrive in HPyV6 and MCPyV. This suggests that differences in shedding their host’s cutaneous and mucosal epidermal tissue,85 have rates, namely the number of virions released by infected cells at been strongly linked to the development of cervical cancer86−88 any given time, between HPyV species may contribute to as well various other cancers and skin diseases.89−93 HPyVs can differences between airborne HPyV distribution and that of cause serious diseases in immunocompromised patients human subjects. including Merkel cell carcinoma, progressive multifocal The higher diversity of airborne HPVs detected in dormitory leukoencephalopathy, nephropathy, trichodysplasia spinulosa rooms compared to HPyVs supports previous reports of low and hemorrhagic cystitis.94−96 However, healthy individuals are HPyV genetic diversity on human skin.98 Indeed, to date, more also known to be asymptomatically infected with HPVs and than 30 HPV species comprising >200 HPV types52,120,121 have HPyVs, both of which may be acquired early in infancy.97−106 been described compared to only 12 HPyV species.96,106 HPV Furthermore, HPVs and HPyVs are the most abundant groups diversity might be driven by a high selective pressure from the of human-infecting viruses associated with the healthy human human immune system.67 Although HPVs and HPyVs are skin viral flora62,98 and novel species have been discovered from widespread in the human population and have similar healthy individuals.107−109 characteristics, there could be differences in tissue tropism or The diversity of HPV- and HPyV-related sequences was virion shedding rates that might affect their relative further investigated because the presence of these viruses in representation in indoor air. Moreover, HPyVs were mainly dormitory rooms is directly linked to human occupancy. By detected in male rooms, suggesting that shedding rates for this comparing unassembled sequences to custom databases, the group of viruses differ between male and females. This is detection of HPV- and HPyV-related sequences increased. plausible considering that female children are more likely to HPVs, the second most widespread group of human-infecting shed MWPyV in feces than male children122 and seropositivity viruses detected in dormitory rooms (Table 1), were more for certain HPyV species is more common among males.116 diverse than HPyVs (Figure 2). A total of 54 HPV types, Notably, MWPyV was the only HPyV species detected in including 7 unclassified HPVs, were detected, with up to 24 female rooms (Figure 2). In addition, because Merkel cell HPV types detected in a single room (Figure 2). All of the carcinoma is more prevalent in males,123 some studies have also detected HPV-related sequences were most similar to species shown that MCPyV is highly prevalent in the skin of healthy classified within the Alphapapillomavirus, Betapapillomavirus, males.124,125 Interpersonal variation in HPV and HPyV species and Gammapapillomavirus genera, which contain most known composition62,63,98 and potential differences in shedding rates HPV species.110,111 The majority of the HPV-related sequences between HPyV species might affect indoor airborne viral loads were similar to cutaneous viruses from the Beta-and of these human-infecting viruses. Gammapapillomavirus genera. Alphapapillomaviruses, including Airborne Viral Community Variation in Dormitory both genital/mucosal (type 51) and cutaneous (types 3 and Rooms. Each dormitory room exhibited a different distribution 125) HPVs, were only detected in three of the dormitory of viral OTUs as well as HPV types, revealing a unique rooms. The most widespread HPVs included types 23 and 120, “fingerprint” of airborne viral species (Figures 1 and 2). which were detected in 33% of the rooms, followed by HPV Although Siphoviridae and Myoviridae members were detected type 24, which was detected in 25% of the rooms. Although all in all of the samples (Figure 1), the distribution of specific of these cutaneous HPV types were originally isolated from skin OTUs within these groups varied. It is possible that variation in lesions,110,112 these types have also been detected in the skin of sequencing depth among the samples might affect the detection healthy individuals.103 In addition, HPV type 51, an of low abundance viral OTUs; however, even the dominant

G DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article viral OTUs varied across individual rooms. This observation suggests that airborne viral communities indoors are variable in human-occupied spaces as has been shown for bacteria.126 The disparate airborne viral communities of individual rooms may reflect variability in occupant-associated microbiomes because individuals can release distinct “microbial clouds”.126,127 Notably, there are marked interpersonal differ- ences in viral communities associated with healthy human skin and there can be viral “blooms” in certain body sites.62,63 Furthermore, individuals with active viral infections may shed more viral particles compared to healthy individuals, further contributing to airborne viral community dynamics. For example, the detection of rhinoviruses, which are the most common causative agents of the common cold and other upper respiratory tract infections,128,129 in one of the rooms might reflect a symptomatic infection of a room occupant. Although asymptomatic rhinovirus infection can be common in young children,130 viral detection and loads are significantly higher in symptomatic adults.131,132 Challenges Associated with Airborne Virus Source Tracking. The potential sources of observed viral OTUs in dormitory rooms were examined by classifying the top BLAST match of each OTU based on its original source of isolation or, if that information was not available, its host’s niche. The analyses suggest that soil is likely the largest contributing source to airborne viral communities in dormitory rooms (Figure 3). The bacterial community in dust from floors and other indoor surfaces has been shown to harbor taxa from human occupants as well as environmental sources including soil.66,126,133 Dust resuspension, which has been suggested as an important source of indoor airborne bacteria,133 might also be a significant source of airborne viruses. However, our inability to accurately track sources of airborne viruses might alter the true contributions from potential sources described here. Due to their small size, virus particles may stay airborne for fi Figure 3. Potential sources of viruses detected on dormitory room signi cantly longer periods and, in turn, be transported for HVAC filters. Graphs show (A) the percentage of rooms (n = 12) longer distances than bacteria and fungi, making it difficult to 22 where viruses from a given source were detected and (B) the pinpoint their original source. This is further complicated by percentage of viral OTUs in each source category. virion stability, which can allow certain viruses to withstand harsh conditions, including the passage of viruses of dietary origin through the gut. For example, bacteriophages infecting most similar to members of the genus Tobamovirus, which have Lactococcus lactis,abacteriumextensivelyusedinthe stable virions that can remain infectious after disinfection production of dairy products, were widespread in dormitory treatments and persist for long periods outside their plant rooms (Table 1) and their bacterial host has been identified as host.138,139 Although identified included viruses one of the most abundant bacterial species in floor dust.66 infecting edible crops (, cucumber, pepper), tobacco Moreover, a ∼2 kb L. lactis bacteriophage contig sequence from plants, and grasses (Supplemental File S1), all of these viruses the dormitory rooms had significant matches to metagenomic are dominant in human feces140 and raw sewage.141 Due to sequences from human feces134 available through the IMG/VR their stability and presence in various sources, it is not possible database.135 Due to the remarkable stability of L. lactis to determine if plant viruses detected in dormitory room air bacteriophages,136 we cannot determine if the detected originated from plant material, such as houseplants and food bacteriophages originated directly from dairy products,137 products, or if they indicate fecal material from room from floor dust, or from the feces of room occupants. occupants. Notably, the detection of viruses that have been Estimated virus to bacteria ratios ranging from 0.7 to 1 in proposed as indicators of fecal pollution, including pepper mild indoor environments suggest that bacteriophages in indoor air mottle virus142 and crAssphage143 (Table 1), suggest that and surfaces might not be able to replicate due to host viruses originating from human feces can become airborne. dormancy or low host densities.20,21 These low ratios support Therefore, plants might contribute to indoor airborne viral the idea that bacteriophages might originate from the same diversity indirectly through dietary consumption of plant- source as their hosts (e.g., dairy products, dust, feces) rather related products and subsequent aerosolization of fecal matter. than propagating indoors. Another virus detected in this study that might originate Virion stability also obscures source tracking of eukaryotic from plant-related products is cannabis cryptic virus (CCV), an viruses found indoors. Plants were proportionally the second RNA virus that causes persistent infections in hemp (Cannabis largest source of viral OTUs detected in dormitory rooms sativa).144 Because CCV is widespread among hemp (Figure 3). However, the majority of these viral OTUs were varieties,144 the detection of this virus in two of the rooms

H DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article and the pooled sample suggest that hemp-related products sequences most similar to arthropod-infecting viruses indicates might also contribute to indoor viral diversity. Because that species-rich arthropod communities found in indoor industrial hemp can be used for various purposes (e.g., health spaces81 might be a source of unexplored viral diversity. foods, personal care products, and clothing), it remains to be Airborne viruses associated with consumer products can determined how CCV becomes airborne and if it can be found provide information regarding the spread and diversity of in biological samples, such as feces, from consumers of hemp- viruses relevant to the food (e.g., cheese production) and related products. Furthermore, it is unknown if CCV can also natural product (e.g., Cannabis sativa) industries. cause persistent infections in psychotropic varieties of C. sativa, Because it is not currently practical to process a large number which are genetically distinct from industrial hemp.145 of samples using viral metagenomic approaches, quantitative Overlooking the contribution of human feces to the assays for specific viral taxa in larger sample sets might reveal abundance of nonhuman-infecting viruses, including bacter- statistical differences between different demographic groups. iophages and plant viruses, might lead to a gross under- For example, although HPVs and HPyVs are widespread in the estimation of the contribution of room occupants to airborne human population, the data presented here suggest that there viral diversity. Moreover, several of the viral OTUs most closely might be differences in shedding rates of these viruses by a related to sewage-associated viruses might also represent viruses generally healthy cohort of young adults occupying university from human feces because sewage reflects the fecal microbiome dormitory rooms. Quantitative PCR assays targeting these of human populations146 and harbors a diversity of human- viruses in more rooms and different wings of the building are infecting viruses.34,141 Nevertheless, by investigating viral OTUs needed to confirm if airborne HPVs are more abundant than based on source of isolation or host niche, it is clear that indoor HPyVs and if the concentrations of HPyVs are higher in male- airborne viral diversity includes viruses originating from a wide occupied rooms compared to female-occupied rooms. range of organisms and sources. Furthermore, assays could be extended to air filters found in Methodological Approaches and Future Directions. daycare centers and senior living facilities to better understand To our knowledge, this is the first metagenomic analysis of the epidemiology of these human-infecting viruses in other airborne viral communities captured on HVAC filters. Here we population sectors using a noninvasive, passive sampling show that MERV 8-rated filters that are in place for extended approach. Finally, in addition to epidemiological studies, the periods of time can capture enough aerosolized viral biomass to differential detection of certain human-infecting viruses in allow the detection of a diversity of airborne DNA and RNA female- and male-occupied rooms might contribute to micro- 24,150 viruses. The detection of viruses from MERV 8-rated filters is bial forensic efforts. advantageous because these filters are commonly installed in buildings and have been used to investigate airborne bacterial ■ ASSOCIATED CONTENT 24,126 and fungal communities. It has been suggested that HVAC *S Supporting Information filters with MERV ratings ≥12 should be used for viral The Supporting Information is available free of charge on the metagenomic analyses because they are more efficient at ACS Publications website at DOI: 10.1021/acs.est.7b04203. fi collecting particles 1−3 μm in size compared to those lters Supplemental File S1: Excel workbook with details 22 fi with lower MERV ratings. Although MERV 8 lters are most regarding viral OTUs, human papillomaviruses and efficient at capturing particles >3 μm, these filters can also 27 polyomaviruses, and contaminant sequences (XLSX) capture smaller sized particles. In addition, even though Supplemental File S2: Merkel cell polyomavirus genome μ virions generally have diameters <0.2 m, viruses have been assembled from a male-occupied room in fasta format μ detected in air particles with diameters 0.4−10 m indicating (TXT) that they are associated with other particles when air- borne.147−149 Our results demonstrate that MERV 8 filters AUTHOR INFORMATION can capture a diversity of airborne viruses containing various ■ virion morphologies (e.g., icosahedral, filamentous) and Corresponding Author genome types (e.g., ssDNA, dsDNA, ssRNA, dsRNA). *K. Rosario. Email: [email protected]. The viral diversity reported here should be viewed as a ORCID conservative estimate of airborne viruses found indoors because Karyna Rosario: 0000-0001-9847-4113 viruses associated with particles <3 μm might not have been Notes captured in the analysis. In addition, it is likely that the strategy The authors declare no competing financial interest. used for classifying viral OTUs led to an underestimation of viral diversity as several viral OTUs represent groups of viruses ■ ACKNOWLEDGMENTS rather than a single viral species (Supplemental File S1) and OTUs only reflect sequences that could be confidently Authors would like to acknowledge Unilever Industries and the fi National Science Foundation (grants DEB-1239976 and IOS- identi ed as viral. For example, prophage sequences in bacterial fi genomes that have not been annotated as viral in the database 1456301) for nancial support. fi would not have been identi ed in the present analysis. REFERENCES Although metagenomic analyses of airborne viral communities ■ collected on HVAC filters might not capture all of the viral (1) Toivola, M.; Nevalainen, A.; Alm, S. Personal exposures to diversity found in indoor air, it provides an important baseline particles and microbes in relation to microenvironmental concen- trations. Indoor Air 2004, 14 (5), 351−359. for abundant and widespread viruses found in a given indoor (2) Emerson, J. B.; Keady, P. B.; Brewer, T. E.; Clements, N.; space. The data presented here can be used to design targeted Morgan, E. E.; Awerbuch, J.; Miller, S. L.; Fierer, N. Impacts of flood assays for groups of viruses that are not typically considered, damage on airborne bacteria and fungi in homes after the 2013 such as viruses associated with nonhuman occupants and Colorado front range flood. Environ. Sci. Technol. 2015, 49 (5), 2675− consumer products. Notably, the detection of novel viral 2684.

I DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article

(3) Hospodsky, D.; Yamamoto, N.; Nazaroff, W. W.; Miller, D.; (22) Prussin, A. J.; Marr, L. C.; Bibby, K. J. Challenges of studying Gorthala, S.; Peccia, J. Characterizing airborne fungal and bacterial viral aerosol metagenomics and communities in comparison with concentrations and emission rates in six occupied children’s class- bacterial and fungal aerosols. FEMS Microbiol. Lett. 2014, 357 (1), 1− rooms. Indoor Air 2015, 25 (6), 641−652. 9. (4) Adams, R. I.; Miletto, M.; Taylor, J. W.; Bruns, T. D. The (23) Noris, F.; Siegel, J. A.; Kinney, K. A. Evaluation of HVAC filters diversity and distribution of fungi on residential surfaces. PLoS One as a sampling mechanism for indoor microbial communities. Atmos. 2013, 8 (11), e78866. Environ. 2011, 45 (2), 338−346. (5) Dunn, R. R.; Fierer, N.; Henley, J. B.; Leff, J. W.; Menninger, H. (24) Luongo, J. C.; Barberan, A.; Hacker-Cary, R.; Morgan, E. E.; L. Home Life: Factors structuring the bacterial diversity found within Miller, S. L.; Fierer, N. Microbial analyses of airborne dust collected and between homes. PLoS One 2013, 8 (5), e64133. from dormitory rooms predict the sex of occupants. Indoor Air 2017, (6) Klepeis, N. E.; Nelson, W. C.; Ott, W. R.; Robinson, J. P.; Tsang, 27 (2), 338−344. A. M.; Switzer, P.; Behar, J. V.; Hern, S. C.; Engelmann, W. H. The (25) Hoisington, A. J.; Maestre, J. P.; King, M. D.; Siegel, J. A.; National Human Activity Pattern Survey (NHAPS): a resource for Kinney, K. A. Impact of sampler selection on the characterization of assessing exposure to environmental pollutants. J. Exposure Sci. the indoor microbiome via high-throughput sequencing. Build Environ Environ. Epidemiol. 2001, 11 (3), 231−252. 2014, 80, 274−282. (7) Bik, H. M.; Coil, D. A.; Eisen, J. A. microBEnet: lessons learned (26) Tringe, S. G.; Zhang, T.; Liu, X. G.; Yu, Y. T.; Lee, W. H.; Yap, from building an interdisciplinary scientific community in the online J.; Yao, F.; Suan, S. T.; Ing, S. K.; Haynes, M.; Rohwer, F.; Wei, C. L.; sphere. PLoS Biol. 2014, 12 (6), e1001884. Tan, P.; Bristow, J.; Rubin, E. M.; Ruan, Y. J. The airborne (8) Adams, R. I.; Bhangar, S.; Dannemiller, K. C.; Eisen, J. A.; Fierer, metagenome in an indoor urban environment. PLoS One 2008, 3 N.; Gilbert, J. A.; Green, J. L.; Marr, L. C.; Miller, S. L.; Siegel, J. A.; (4), e1862. Stephens, B.; Waring, M. S.; Bibby, K. Ten questions concerning the (27) ASHRAE. Standard 52.2-2017: Method of testing general microbiomes of buildings. Build Environ 2016, 109, 224−234. ventilation air-cleaning devices for removal efficiency by particle size (9) Barberan, A.; Dunn, R. R.; Reich, B. J.; Pacifici, K.; Laber, E. B.; (ANSI approved); American Society for Heating, Refrigerating and Air- Menninger, H. L.; Morton, J. M.; Henley, J. B.; Leff, J. W.; Miller, S. L.; Conditioning Engineering: Atlanta, GA, 2017. Fierer, N. The ecology of microscopic life in household dust. Proc. R. (28) Colombet, J.; Robin, A.; Lavie, L.; Bettarel, Y.; Cauchie, H. M.; Soc. London, Ser. B 2015, 282 (1814), 212−220. Sime-Ngando, T. Virioplankton ‘pegylation’: Use of PEG (poly- (10) Meadow, J. F.; Altrichter, A. E.; Kembel, S. W.; Kline, J.; ethylene glycol) to concentrate and purify viruses in pelagic Mhuireach, G.; Moriyama, M.; Northcutt, D.; O’Connor, T. K.; ecosystems. J. Microbiol. Methods 2007, 71 (3), 212−219. Womack, A. M.; Brown, G. Z.; Green, J. L.; Bohannan, B. J. M. Indoor (29) Ichim, C. V.; Wells, R. A. Generation of high-titer viral airborne bacterial communities are influenced by ventilation, preparations by concentration using successive rounds of ultra- occupancy, and outdoor air source. Indoor Air 2014, 24 (1), 41−48. centrifugation. J. Transl. Med. 2011, 9, 137. (11) Yang, W.; Elankumaran, S.; Marr, L. C. Concentrations and size (30) Williamson, K. E.; Wommack, K. E.; Radosevich, M. Sampling distributions of airborne influenza A viruses measured indoors at a natural viral communities from soil for culture-independent analyses. health centre, a day-care centre and on aeroplanes. J. R. Soc., Interface Appl. Environ. Microbiol. 2003, 69 (11), 6628−6633. 2011, 8 (61), 1176−1184. (31) Chen, F.; Lu, J. R.; Binder, B. J.; Liu, Y. C.; Hodson, R. E. (12) Moon, K. W.; Huh, E. H.; Jeong, H. C. Seasonal evaluation of Application of digital image analysis and flow cytometry to enumerate bioaerosols from indoor air of residential apartments within the stained with SYBR Gold. Appl. Environ. Microbiol. 2001, metropolitan area in South Korea. Environ. Monit. Assess. 2014, 186 67 (2), 539−545. (4), 2111−2120. (32) Patel, A.; Noble, R. T.; Steele, J. A.; Schwalbach, M. S.; Hewson, (13) Ijaz, M. K.; Zargar, B.; Wright, K. E.; Rubino, J. R.; Sattar, S. A. I.; Fuhrman, J. A. Virus and prokaryote enumeration from planktonic Generic aspects of the airborne spread of human pathogens indoors aquatic environments by epifluorescence microscopy with SYBR and emerging air decontamination technologies. Am. J. Infect. Control Green I. Nat. Protoc. 2007, 2 (2), 269−276. 2016, 44 (9), S109−S120. (33) Victoria, J. G.; Kapoor, A.; Li, L.; Blinkova, O.; Slikas, B.; Wang, (14) Goyal, S. M.; Anantharaman, S.; Ramakrishnan, M. A.; Sajja, S.; C.; Naeem, A.; Zaidi, S.; Delwart, E. Metagenomic analyses of viruses Kim, S. W.; Stanley, N. J.; Farnsworth, J. E.; Kuehn, T. H.; Raynor, P. in the stool of children with acute flaccid paralysis. J. Virol 2009, 83 C. Detection of viruses in used ventilation filters from two large public (9), 4642−4651. buildings. Am. J. Infect. Control 2011, 39 (7), E30−E38. (34) Ng, T. F. F.; Marine, R.; Wang, C. L.; Simmonds, P.; (15) Thurber, R. V.; Haynes, M.; Breitbart, M.; Wegley, L.; Rohwer, Kapusinszky, B.; Bodhidatta, L.; Oderinde, B. S.; Wommack, K. E.; F. Laboratory procedures to generate viral metagenomes. Nat. Protoc. Delwart, E. High variety of known and new RNA and DNA viruses of 2009, 4 (4), 470−483. diverse origins in untreated sewage. J. Virol 2012, 86 (22), 12161− (16) Edwards, R. A.; Rohwer, F. . Nat. Rev. 12175. Microbiol. 2005, 3 (6), 504−510. (35) Salter, S. J.; Cox, M. J.; Turek, E. M.; Calus, S. T.; Cookson, W. (17) Rosario, K.; Breitbart, M. Exploring the viral world through O.; Moffatt, M. F.; Turner, P.; Parkhill, J.; Loman, N. J.; Walker, A. W. metagenomics. Curr. Opin. Virol. 2011, 1 (4), 289−297. Reagent and laboratory contamination can critically impact sequence- (18) Whon, T. W.; Kim, M. S.; Roh, S. W.; Shin, N. R.; Lee, H. W.; based microbiome analyses. BMC Biol. 2014, 12 (1), 87. Bae, J. W. Metagenomic characterization of airborne viral DNA (36) Thoendel, M.; Jeraldo, P.; Greenwood-Quaintance, K. E.; Yao, diversity in the near-surface atmosphere. J. Virol 2012, 86 (15), 8221− J.; Chia, N.; Hanssen, A. D.; Abdel, M. P.; Patel, R. Impact of 8231. contaminating DNA in whole genome amplification kits used for (19) Weinmaier, T.; Probst, A. J.; La Duc, M. T.; Ciobanu, D.; metagenomic shotgun sequencing for infection diagnosis. J. Clin. Cheng, J. F.; Ivanova, N.; Rattei, T.; Vaishampayan, P. A viability- Microbiol. 2017, 55, 1789−1801. linked metagenomic analysis of cleanroom environments: eukarya, (37) Naccache, S. N.; Greninger, A. L.; Lee, D.; Coffey, L. L.; Phan, prokaryotes, and viruses. Microbiome 2015, 3 (1), 62. T.; Rein-Weston, A.; Aronsohn, A.; Hackett, J.; Delwart, E. L.; Chiu, (20) Gibbons, S. M.; Schwartz, T.; Fouquier, J.; Mitchell, M.; C. Y. The perils of pathogen discovery: origin of a novel parvovirus- Sangwan, N.; Gilbert, J. A.; Kelley, S. T. Ecological succession and like hybrid genome traced to nucleic acid extraction spin columns. J. viability of human-associated microbiota on restroom surfaces. Appl. Virol 2013, 87 (22), 11966−11977. Environ. Microbiol. 2015, 81 (2), 765−773. (38) Smuts, H.; Kew, M.; Khan, A.; Korsman, S. Novel hybrid (21) Prussin, A. J.; Garcia, E. B.; Marr, L. C. Total concentrations of parvovirus-like Vvrus, NIH-CQV/PHV, contaminants in silica virus and bacteria in indoor and outdoor air. Environ. Sci. Technol. Lett. column-based nucleic acid extraction kits. J. Virol 2014, 88 (2), 2015, 2 (4), 84−88. 1398−1398.

J DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article

(39) Head, S. R.; Komori, H. K.; LaMere, S. A.; Whisenant, T.; Van S. M.; Speich, S.; Dimier, C.; Kandels-Lewis, S.; Picheral, M.; Searson, Nieuwerburgh, F.; Salomon, D. R.; Ordoukhanian, P. Library S.; Bork, P.; Bowler, C.; Sunagawa, S.; Wincker, P.; Karsenti, E.; construction for next-generation sequencing: Overviews and chal- Sullivan, M. B.; Coordinators, T. O. Patterns and ecological drivers of lenges. BioTechniques 2014, 56 (2), 61. ocean viral communities. Science 2015, 348 (6237), 1261498. (40) Roux, S.; Solonenko, N. E.; Dang, V. T.; Poulos, B. T.; (55) Rodriguez-Brito, B.; Li, L. L.; Wegley, L.; Furlan, M.; Angly, F.; Schwenck, S. M.; Goldsmith, D. B.; Coleman, M. L.; Breitbart, M.; Breitbart, M.; Buchanan, J.; Desnues, C.; Dinsdale, E.; Edwards, R.; Sullivan, M. B. Towards quantitative viromics for both double- Felts, B.; Haynes, M.; Liu, H.; Lipson, D.; Mahaffy, J.; Martin- stranded and single-stranded DNA viruses. PeerJ 2016, 4, e2777. Cuadrado, A. B.; Mira, A.; Nulton, J.; Pasic, L.; Rayhawk, S.; (41) Kim, K.-H.; Bae, J.-W. Amplification methods bias metagenomic Rodriguez-Mueller, J.; Rodriguez-Valera, F.; Salamon, P.; Srinagesh, S.; libraries of uncultured single-stranded and double-stranded DNA Thingstad, T. F.; Tran, T.; Thurber, R. V.; Willner, D.; Youle, M.; viruses. Appl. Environ. Microbiol. 2011, 77 (21), 7663−7668. Rohwer, F. Viral and microbial community dynamics in four aquatic (42) Goff, S.; Vaughn, M.; McKay, S.; Lyons, E.; Stapleton, A.; environments. ISME J. 2010, 4 (6), 739−751. Gessler, D.; Matasci, N.; Wang, L.; Hanlon, M.; Lenards, A.; Muir, A.; (56) Minot, S.; Sinha, R.; Chen, J.; Li, H. Z.; Keilbaugh, S. A.; Wu, G. Merchant, N.; Lowry, S.; Mock, S.; Helmke, M.; Kubach, A.; Narro, D.; Lewis, J. D.; Bushman, F. D. The human gut virome: Inter- M.; Hopkins, N.; Micklos, D.; Hilgert, U.; Gonzales, M.; Jordan, C.; individual variation and dynamic response to diet. Genome Res. 2011, Skidmore, E.; Dooley, R.; Cazes, J.; McLay, R.; Lu, Z.; Pasternak, S.; 21 (10), 1616−1625. Koesterke, L.; Piel, W.; Grene, R.; Noutsos, C.; Gendler, K.; Feng, X.; (57) Breitbart, M.; Hewson, I.; Felts, B.; Mahaffy, J. M.; Nulton, J.; Tang, C.; Lent, M.; Kim, S.-j.; Kvilekval, K.; Manjunath, B. S.; Tannen, Salamon, P.; Rohwer, F. Metagenomic analyses of an uncultured viral V.; Stamatakis, A.; Sanderson, M.; Welch, S.; Cranston, K.; Soltis, P.; community from human feces. J. Bacteriol. 2003, 185 (20), 6220− Soltis, D.; O’Meara, B.; Ane, C.; Brutnell, T.; Kleibenstein, D.; White, 6223. J.; Leebens-Mack, J.; Donoghue, M.; Spalding, E.; Vision, T.; Myers, (58) Breitbart, M.; Haynes, M.; Kelley, S.; Angly, F.; Edwards, R. A.; C.; Lowenthal, D.; Enquist, B.; Boyle, B.; Akoglu, A.; Andrews, G.; Felts, B.; Mahaffy, J. M.; Mueller, J.; Nulton, J.; Rayhawk, S.; Ram, S.; Ware, D.; Stein, L.; Stanzione, D. The iPlant collaborative: Rodriguez-Brito, B.; Salamon, P.; Rohwer, F. Viral diversity and cyberinfrastructure for plant biology. Front. Plant Sci. 2011, 2, 34. dynamics in an infant gut. Res. Microbiol. 2008, 159 (5), 367−373. (43) Bolduc, B.; Youens-Clark, K.; Roux, S.; Hurwitz, B. L.; Sullivan, (59) Reyes, A.; Haynes, M.; Hanson, N.; Angly, F. E.; Heath, A. C.; M. B. iVirus: facilitating new insights in viral ecology with software and Rohwer, F.; Gordon, J. I. Viruses in the faecal microbiota of community data sets imbedded in a cyberinfrastructure. ISME J. 2017, monozygotic twins and their mothers. Nature 2010, 466 (7304), 11 (1), 7−14. 334−U81. (44) Bolger, A. M.; Lohse, M.; Usadel, B. Trimmomatic: a flexible (60) Willner, D.; Furlan, M.; Haynes, M.; Schmieder, R.; Angly, F. E.; trimmer for Illumina sequence data. Bioinformatics 2014, 30 (15), Silva, J.; Tammadoni, S.; Nosrat, B.; Conrad, D.; Rohwer, F. 2114−2120. Metagenomic analysis of respiratory tract DNA viral communities in (45) Andrews, S. FastQC: a quality control tool for high throughput cystic fibrosis and non-cystic fibrosis individuals. PLoS One 2009, 4 sequence data. 2010, http://www.bioinformatics.babraham.ac.uk/ (10), e7370. projects/fastqc. (61) Pride, D. T.; Salzman, J.; Haynes, M.; Rohwer, F.; Davis-Long, (46) Bankevich, A.; Nurk, S.; Antipov, D.; Gurevich, A. A.; Dvorkin, C.; White, R. A.; Loomer, P.; Armitage, G. C.; Relman, D. A. Evidence M.; Kulikov, A. S.; Lesin, V. M.; Nikolenko, S. I.; Pham, S.; Prjibelski, A. D.; Pyshkin, A. V.; Sirotkin, A. V.; Vyahhi, N.; Tesler, G.; Alekseyev, of a robust resident bacteriophage population revealed through M. A.; Pevzner, P. A. SPAdes: A new genome assembly algorithm and analysis of the human salivary virome. ISME J. 2012, 6 (5), 915−926. its applications to single-cell sequencing. J. Comput. Biol. 2012, 19 (5), (62) Oh, J.; Byrd, A. L.; Deming, C.; Conlan, S.; Kong, H. H.; Segre, 455−477. J. A. Biogeography and individuality shape function in the human skin (47) Krishnamurthy, S. R.; Janowski, A. B.; Zhao, G. Y.; Barouch, D.; metagenome. Nature 2014, 514 (7520), 59−64. Wang, D. Hyperexpansion of RNA bacteriophage diversity. PLoS Biol. (63) Hannigan, G. D.; Meisel, J. S.; Tyldsley, A. S.; Zheng, Q.; 2016, 14 (3), e1002409. Hodkinson, B. P.; SanMiguel, A. J.; Minot, S.; Bushman, F. D.; Grice, (48) Shi, M.; Lin, X.-D.; Tian, J.-H.; Chen, L.-J.; Chen, X.; Li, C.-X.; E. A. The human skin double-stranded DNA virome: topographical Qin, X.-C.; Li, J.; Cao, J.-P.; Eden, J.-S.; Buchmann, J.; Wang, W.; Xu, and temporal diversity, genetic enrichment, and dynamic associations J.; Holmes, E. C.; Zhang, Y.-Z. Redefining the invertebrate RNA with the host microbiome. mBio 2015, 6 (5), e01578-15. virosphere. Nature 2016, 540 (7634), 539−543. (64) Cho, I.; Blaser, M. J. The human microbiome: at the interface of (49) Huson, D. H.; Beier, S.; Flade, I.; Gorska, A.; El-Hadidi, M.; health and disease. Nat. Rev. Genet. 2012, 13 (4), 260−270. Mitra, S.; Ruscheweyh, H. J.; Tappu, R. MEGAN Community Edition (65) Grice, E. A.; Segre, J. A. The skin microbiome. Nat. Rev. - interactive exploration and analysis of large-scale microbiome Microbiol. 2011, 9 (4), 244−253. sequencing data. PLoS Comput. Biol. 2016, 12 (6), e1004957. (66) Taubel, M.; Rintala, H.; Pitkaranta, M.; Paulin, L.; Laitinen, S.; (50) Russell, D. A.; Hatfull, G. F. PhagesDB: the actinobacteriophage Pekkanen, J.; Hyvarinen, A.; Nevalainen, A. The occupant as a source database. Bioinformatics 2017, 33 (5), 784−786. of house dust bacteria. J. Allergy Clin. Immunol. 2009, 124 (4), 834− (51) Fu, L.; Niu, B.; Zhu, Z.; Wu, S.; Li, W. CD-HIT: accelerated for 840. clustering the next-generation sequencing data. Bioinformatics 2012, 28 (67) Hannigan, G. D.; Zheng, Q.; Meisel, J. S.; Minot, S. S.; (23), 3150−3152. Bushman, F. D.; Grice, E. A. Evolutionary and functional implications (52) Van Doorslaer, K.; Li, Z.; Xirasagar, S.; Maes, P.; Kaminsky, D.; of hypervariable loci within the skin virome. PeerJ 2017, 5, e2959. Liou, D.; Sun, Q.; Kaur, R.; Huyen, Y.; McBride, A. A. The (68) Liu, J.; Yan, R.; Zhong, Q.; Ngo, S.; Bangayan, N. J.; Nguyen, L.; Papillomavirus Episteme: a major update to the papillomavirus Lui, T.; Liu, M. S.; Erfe, M. C.; Craft, N.; Tomida, S.; Li, H. Y. The sequence database. Nucleic Acids Res. 2017, 45 (D1), D499−D506. diversity and host interactions of Propionibacterium acnes bacterioph- (53) Fierer, N.; Breitbart, M.; Nulton, J.; Salamon, P.; Lozupone, C.; ages on human skin. ISME J. 2015, 9 (9), 2078−2093. Jones, R.; Robeson, M.; Edwards, R. A.; Felts, B.; Rayhawk, S.; Knight, (69) Dutilh, B. E.; Cassman, N.; McNair, K.; Sanchez, S. E.; Silva, G. R.; Rohwer, F.; Jackson, R. B. Metagenomic and small-subunit rRNA G. Z.; Boling, L.; Barr, J. J.; Speth, D. R.; Seguritan, V.; Aziz, R. K.; analyses reveal the genetic diversity of bacteria, , fungi, and Felts, B.; Dinsdale, E. A.; Mokili, J. L.; Edwards, R. A. A highly viruses in soil. Appl. Environ. Microbiol. 2007, 73 (21), 7059−7066. abundant bacteriophage discovered in the unknown sequences of (54) Brum, J. R.; Ignacio-Espinoza, J. C.; Roux, S.; Doulcier, G.; human faecal metagenomes. Nat. Commun. 2014, 5, 4498. Acinas, S. G.; Alberti, A.; Chaffron, S.; Cruaud, C.; de Vargas, C.; (70) Yu, X.; Li, B.; Fu, Y. P.; Jiang, D. H.; Ghabrial, S. A.; Li, G. Q.; Gasol, J. M.; Gorsky, G.; Gregory, A. C.; Guidi, L.; Hingamp, P.; Peng, Y. L.; Xie, J. T.; Cheng, J. S.; Huang, J. B.; Yi, X. H. A Iudicone, D.; Not, F.; Ogata, H.; Pesant, S.; Poulos, B. T.; Schwenck, geminivirus-related DNA mycovirus that confers hypovirulence to a

K DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article plant pathogenic fungus. Proc. Natl. Acad. Sci. U. S. A. 2010, 107 (18), L.; Rothstein, J.; Nath, A. Human endogenous retrovirus-K contributes 8387−8392. to motor neuron disease. Sci. Transl. Med. 2015, 7 (307), 307ra153. (71) Rosario, K.; Dayaram, A.; Marinov, M.; Ware, J.; Kraberger, S.; (77) Douville, R.; Liu, J. K.; Rothstein, J.; Nath, A. Identification of Stainton, D.; Breitbart, M.; Varsani, A. Diverse circular ssDNA viruses active loci of a human endogenous retrovirus in neurons of patients discovered in dragonflies (Odonata: Epiprocta). J. Gen. Virol. 2012, 93, with amyotrophic lateral sclerosis. Ann. Neurol. 2011, 69 (1), 141− 2668−2681. 151. (72) Liu, H. Q.; Fu, Y. P.; Li, B.; Yu, X.; Xie, J. T.; Cheng, J. S.; (78) Serafino, A.; Balestrieri, E.; Pierimarchi, P.; Matteucci, C.; Ghabrial, S. A.; Li, G. Q.; Yi, X. H.; Jiang, D. H. Widespread horizontal Moroni, G.; Oricchio, E.; Rasi, G.; Mastino, A.; Spadafora, C.; Garaci, gene transfer from circular single-stranded DNA viruses to eukaryotic E.; Vallebona, P. S. The activation of human endogenous retrovirus K genomes. BMC Evol. Biol. 2011, 11, 276. (HERV-K) is implicated in melanoma cell malignant transformation. (73) Krupovic, M.; Ghabrial, S. A.; Jiang, D.; Varsani, A. Exp. Cell Res. 2009, 315 (5), 849−862. Genomoviridae: a new family of widespread single-stranded DNA (79) Contreras-Galindo, R.; Kaplan, M. H.; Dube, D.; Gonzalez- viruses. Arch. Virol. 2016, 161 (9), 2633−2643. Hernandez, M. J.; Chan, S.; Meng, F.; Dai, M.; Omenn, G. S.; Gitlin, S. (74) Lander, E. S.; Consortium, I. H. G. S.; Linton, L. M.; Birren, B.; D.; Markovitz, D. M. Human endogenous retrovirus type K (HERV- Nusbaum, C.; Zody, M. C.; Baldwin, J.; Devon, K.; Dewar, K.; Doyle, K) particles package and transmit HERV-K−related sequences. J. Virol. M.; FitzHugh, W.; Funke, R.; Gage, D.; Harris, K.; Heaford, A.; 2015, 89 (14), 7187−7201. Howland, J.; Kann, L.; Lehoczky, J.; LeVine, R.; McEwan, P.; (80) Hon, G. M.; Erasmus, R. T.; Matsha, T. Multiple sclerosis- McKernan, K.; Meldrim, J.; Mesirov, J. P.; Miranda, C.; Morris, W.; associated retrovirus and related human endogenous retrovirus-W in Naylor, J.; Raymond, C.; Rosetti, M.; Santos, R.; Sheridan, A.; patients with multiple sclerosis: A literature review. J. Neuroimmunol. Sougnez, C.; Stange-Thomann, N.; Stojanovic, N.; Subramanian, A.; 2013, 263 (1−2), 8−12. Wyman, D.; Rogers, J.; Sulston, J.; Ainscough, R.; Beck, S.; Bentley, D.; (81) Bertone, M. A.; Leong, M.; Bayless, K. M.; Malow, T. L. F.; Burton, J.; Clee, C.; Carter, N.; Coulson, A.; Deadman, R.; Deloukas, Dunn, R. R.; Trautwein, M. D. Arthropods of the great indoors: P.; Dunham, A.; Dunham, I.; Durbin, R.; French, L.; Grafham, D.; characterizing diversity inside urban and suburban homes. PeerJ 2016, Gregory, S.; Hubbard, T.; Humphray, S.; Hunt, A.; Jones, M.; Lloyd, 4, e1582. C.; McMurray, A.; Matthews, L.; Mercer, S.; Milne, S.; Mullikin, J. C.; (82) Craine, J. M.; Barberan, A.; Lynch, R. C.; Menninger, H. L.; Mungall, A.; Plumb, R.; Ross, M.; Shownkeen, R.; Sims, S.; Waterston, Dunn, R. R.; Fierer, N. Molecular analysis of environmental plant R. H.; Wilson, R. K.; Hillier, L. W.; McPherson, J. D.; Marra, M. A.; DNA in house dust across the United States. Aerobiologia 2017, 33 Mardis, E. R.; Fulton, L. A.; Chinwalla, A. T.; Pepin, K. H.; Gish, W. (1), 71−86. R.; Chissoe, S. L.; Wendl, M. C.; Delehaunty, K. D.; Miner, T. L.; (83) Weikl, F.; Tischer, C.; Probst, A. J.; Heinrich, J.; Markevych, I.; Delehaunty, A.; Kramer, J. B.; Cook, L. L.; Fulton, R. S.; Johnson, D. Jochner, S.; Pritsch, K. Fungal and Bacterial Communities in Indoor L.; Minx, P. J.; Clifton, S. W.; Hawkins, T.; Branscomb, E.; Predki, P.; Dust Follow Different Environmental Determinants. PLoS One 2016, Richardson, P.; Wenning, S.; Slezak, T.; Doggett, N.; Cheng, J. F.; 11 (4), e0154131. Olsen, A.; Lucas, S.; Elkin, C.; Uberbacher, E.; Frazier, M.; Gibbs, R. (84) Sundberg, J. P.; Van Ranst, M.; Montali, R.; Homer, B. L.; A.; Muzny, D. M.; Scherer, S. E.; Bouck, J. B.; Sodergren, E. J.; Worley, Miller, W. H.; Rowland, P. H.; Scott, D. W.; England, J. J.; Dunstan, R. K. C.; Rives, C. M.; Gorrell, J. H.; Metzker, M. L.; Naylor, S. L.; W.; Mikaelian, I.; Jenson, A. B. Feline papillomas and papillomaviruses. Kucherlapati, R. S.; Nelson, D. L.; Weinstock, G. M.; Sakaki, Y.; Vet. Pathol. 2000, 37 (1), 1−10. Fujiyama, A.; Hattori, M.; Yada, T.; Toyoda, A.; Itoh, T.; Kawagoe, C.; (85) Sakakibara, N.; Chen, D.; McBride, A. A. Papillomaviruses use Watanabe, H.; Totoki, Y.; Taylor, T.; Weissenbach, J.; Heilig, R.; recombination-dependent replication to vegetatively amplify their Saurin, W.; Artiguenave, F.; Brottier, P.; Bruls, T.; Pelletier, E.; Robert, genomes in differentiated cells. PLoS Pathog. 2013, 9 (7), e1003321. C.; Wincker, P.; Rosenthal, A.; Platzer, M.; Nyakatura, G.; Taudien, S.; (86) Arbyn, M.; Tommasino, M.; Depuydt, C.; Dillner, J. Are 20 Rump, A.; Yang, H. M.; Yu, J.; Wang, J.; Huang, G. Y.; Gu, J.; Hood, human papillomavirus types causing cervical cancer? J. Pathol 2014, L.; Rowen, L.; Madan, A.; Qin, S. Z.; Davis, R. W.; Federspiel, N. A.; 234 (4), 431−435. Abola, A. P.; Proctor, M. J.; Myers, R. M.; Schmutz, J.; Dickson, M.; (87) Bosch, F. X.; Lorincz, A.; Munoz, N.; Meijer, C. J. L. M.; Shah, Grimwood, J.; Cox, D. R.; Olson, M. V.; Kaul, R.; Raymond, C.; K. V. The causal relation between human papillomavirus and cervical Shimizu, N.; Kawasaki, K.; Minoshima, S.; Evans, G. A.; Athanasiou, cancer. J. Clin. Pathol. 2002, 55 (4), 244−265. M.; Schultz, R.; Roe, B. A.; Chen, F.; Pan, H. Q.; Ramser, J.; Lehrach, (88) Walboomers, J. M. M.; Jacobs, M. V.; Manos, M. M.; Bosch, F. H.; Reinhardt, R.; McCombie, W. R.; de la Bastide, M.; Dedhia, N.; X.; Kummer, J. A.; Shah, K. V.; Snijders, P. J. F.; Peto, J.; Meijer, C. J. Blocker, H.; Hornischer, K.; Nordsiek, G.; Agarwala, R.; Aravind, L.; L. M.; Munoz, N. Human papillomavirus is a necessary cause of Bailey, J. A.; Bateman, A.; Batzoglou, S.; Birney, E.; Bork, P.; Brown, D. invasive cervical cancer worldwide. J. Pathol. 1999, 189 (1), 12−19. G.; Burge, C. B.; Cerutti, L.; Chen, H. C.; Church, D.; Clamp, M.; (89) Cubie, H. A. Diseases associated with human papillomavirus Copley, R. R.; Doerks, T.; Eddy, S. R.; Eichler, E. E.; Furey, T. S.; infection. Virology 2013, 445 (1−2), 21−34. Galagan, J.; Gilbert, J. G. R.; Harmon, C.; Hayashizaki, Y.; Haussler, (90) Köhler, A.; Gottschling, M.; Manning, K.; Lehmann, M. D.; D.; Hermjakob, H.; Hokamp, K.; Jang, W. H.; Johnson, L. S.; Jones, T. Schulz, E.; Krüger-Corcoran, D.; Stockfleth, E.; Nindl, I. Genomic A.; Kasif, S.; Kaspryzk, A.; Kennedy, S.; Kent, W. J.; Kitts, P.; Koonin, characterization of ten novel cutaneous human papillomaviruses from E. V.; Korf, I.; Kulp, D.; Lancet, D.; Lowe, T. M.; McLysaght, A.; keratotic lesions of immunosuppressed patients. J. Gen. Virol. 2011, 92 Mikkelsen, T.; Moran, J. V.; Mulder, N.; Pollara, V. J.; Ponting, C. P.; (7), 1585−1594. Schuler, G.; Schultz, J. R.; Slater, G.; Smit, A. F. A.; Stupka, E.; (91) Vasiljevic, N.; Hazard, K.; Dillner, J.; Forslund, O. Four novel Szustakowki, J.; Thierry-Mieg, D.; Thierry-Mieg, J.; Wagner, L.; Wallis, human betapapillomaviruses of species 2 preferentially found in actinic J.; Wheeler, R.; Williams, A.; Wolf, Y. I.; Wolfe, K. H.; Yang, S. P.; Yeh, keratosis. J. Gen. Virol. 2008, 89, 2467−2474. R. F.; Collins, F.; Guyer, M. S.; Peterson, J.; Felsenfeld, A.; (92) D’Souza, G.; Dempsey, A. The role of HPV in head and neck Wetterstrand, K. A.; Patrinos, A.; Morgan, M. J.; Conso, I. H. G. S. cancer and review of the HPV vaccine. Prev. Med. 2011, 53, S5−S11. Initial sequencing and analysis of the human genome. Nature 2001, (93) Bzhalava, D.; Guan, P.; Franceschi, S.; Dillner, J.; Clifford, G. A 409 (6822), 860−921. systematic review of the prevalence of mucosal and cutaneous human (75) Hughes, J. F.; Coffin, J. M. Evidence for genomic rearrange- papillomavirus types. Virology 2013, 445 (1−2), 224−231. ments mediated by human endogenous retroviruses during primate (94) Spurgeon, M. E.; Lambert, P. F. Merkel cell polyomavirus: A evolution. Nat. Genet. 2001, 29 (4), 487−489. newly discovered human virus with oncogenic potential. Virology 2013, (76) Li, W.; Lee, M. H.; Henderson, L.; Tyagi, R.; Bachani, M.; 435 (1), 118−130. Steiner, J.; Campanac, E.; Hoffman, D. A.; von Geldern, G.; Johnson, (95) Dalianis, T.; Hirsch, H. H. Human polyomaviruses in disease K.; Maric, D.; Morris, H. D.; Lentz, M.; Pak, K.; Mammen, A.; Ostrow, and cancer. Virology 2013, 437 (2), 63−72.

L DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article

(96) DeCaprio, J. A.; Garcea, R. L. A cornucopia of human (114) IARC. Human papillomaviruses. IARC Monogr. Eval. Carcinog. polyomaviruses. Nat. Rev. Microbiol. 2013, 11 (4), 264−276. Risks Hum. 2012 100B, 255−313. (97) Nicol, J. T. J.; Leblond, V.; Arnold, F.; Guerra, G.; Mazzoni, E.; (115) Han, J. J.; Beltran, T. H.; Song, J. W.; Klaric, J.; Choi, Y. Tognon, M.; Coursaget, P.; Touze, A. Seroprevalence of human Prevalence of genital human papillomavirus infection and human Malawi polyomavirus. J. Clin Microbiol 2014, 52 (1), 321−323. papillomavirus vaccination rates among us adult men: National health (98) Foulongne, V.; Sauvage, V.; Hebert, C.; Dereure, O.; Cheval, J.; and nutrition examination survey (nhanes) 2013−2014. JAMA Oncol Gouilh, M. A.; Pariente, K.; Segondy, M.; Burguiere, A.; Manuguerra, J. 2017, 3 (6), 810−816. C.; Caro, V.; Eloit, M. Human skin microbiota: High diversity of DNA (116) Gossai, A.; Waterboer, T.; Nelson, H. H.; Michel, A.; viruses identified on the human akin by high throughput sequencing. Willhauck-Fleckenstein, M.; Farzan, S. F.; Hoen, A. G.; Christensen, PLoS One 2012, 7 (6), e38499. B. C.; Kelsey, K. T.; Marsit, C. J.; Pawlita, M.; Karagas, M. R. (99) Li, J. J.; Cai, H.; Xu, Z. Y.; Wang, Q. Y.; Hang, D.; Shen, N.; Liu, Seroepidemiology of human polyomaviruses in a US population. Am. J. M. F.; Zhang, C. Y.; Abliz, A.; Ke, Y. Nine complete genome Epidemiol. 2016, 183 (1), 61−69. sequences of cutaneous human papillomavirus genotypes isolated from (117) Siebrasse, E. A.; Reyes, A.; Lim, E. S.; Zhao, G.; Mkakosya, R. healthy skin of individuals living in rural He Nan Province, . J. S.; Manary, M. J.; Gordon, J. I.; Wang, D. Identification of MW Virol 2012, 86 (21), 11936−11936. polyomavirus, a novel polyomavirus in human stool. J. Virol 2012, 86 (100) Antonsson, A.; Erfurt, C.; Hazard, K.; Holmgren, V.; Simon, (19), 10321−10326. M.; Kataoka, A.; Hossain, S.; Hakangard, C.; Hansson, B. G. (118) Feng, H.; Shuda, M.; Chang, Y.; Moore, P. S. Clonal Prevalence and type spectrum of human papillomaviruses in healthy integration of a polyomavirus in human Merkel cell carcinoma. Science skin samples collected in three continents. J. Gen. Virol. 2003, 84, 2008, 319 (5866), 1096. 1881−1886. (119) Torres, C.; Barrios, M. E.; Cammarata, R. V.; Cisterna, D. M.; (101) Antonsson, A.; Forslund, O.; Ekberg, H.; Sterner, G.; Hansson, Estrada, T.; Martini Novas, S.; Cahn, P.; Blanco Fernandez,́ M. D.; B. G. The ubiquity and impressive genomic diversity of human skin Mbayed, V. A. High diversity of human polyomaviruses in environ- papillomaviruses suggest a commensalic nature of these viruses. J. Virol mental and clinical samples in Argentina: Detection of JC, BK, Merkel- 2000, 74 (24), 11636−11641. cell, Malawi, and human 6 and 7 polyomaviruses. Sci. Total Environ. (102) Antonsson, A.; Karanfilovska, S.; Lindqvist, P. G.; Hansson, B. 2016, 542 (Part A), 192−202. G. General acquisition of human papillomavirus infections of skin (120) Bernard, H.-U. Taxonomy and phylogeny of papillomaviruses: occurs in early infancy. J. Clin Microbiol 2003, 41 (6), 2509−2514. An overview and recent developments. Infect., Genet. Evol. 2013, 18, (103) Ma, Y. F.; Madupu, R.; Karaoz, U.; Nossa, C. W.; Yang, L. Y.; 357−361. Yooseph, S.; Yachimski, P. S.; Brodie, E. L.; Nelson, K. E.; Pei, Z. H. (121) Bzhalava, D.; Eklund, C.; Dillner, J. International stand- Human papillomavirus community in healthy persons, defined by ardization and classification of human papillomavirus types. Virology 2015, 476, 341 344. metagenomics analysis of human microbiome project shotgun − (122) Yu, G.; Greninger, A. L.; Isa, P.; Phan, T. G.; Martínez, M. A.; sequencing data sets. J. Virol 2014, 88 (9), 4786−4797. de la Luz Sanchez, M.; Contreras, J. F.; Santos-Preciado, J. I.; (104) Kean, J. M.; Rao, S.; Wang, M.; Garcea, R. L. Seroepidemi- Parsonnet, J.; Miller, S.; DeRisi, J. L.; Delwart, E.; Arias, C. F.; Chiu, C. ology of human polyomaviruses. PLoS Pathog. 2009, 5 (3), e1000363. Y. Discovery of a novel polyomavirus in acute diarrheal samples from (105) Stolt, A.; Sasnauskas, K.; Koskela, P.; Lehtinen, M.; Dillner, J. children. PLoS One 2012, 7 (11), e49449. Seroepidemiology of the human polyomaviruses. J. Gen. Virol. 2003, (123) Agelli, M.; Clegg, L. X. Epidemiology of primary Merkel cell 84, 1499−1504. carcinoma in the United States. J. Am. Acad. Dermatol. 2003, 49 (5), (106) Korup, S.; Rietscher, J.; Calvignac-Spencer, S.; Trusch, F.; 832−841. Hofmann, J.; Moens, U.; Sauer, I.; Voigt, S.; Schmuck, R.; Ehlers, B. (124) Wieland, U.; Mauch, C.; Kreuter, A.; Krieg, T.; Pfister, H. Identification of a novel human polyomavirus in organs of the Merkel cell polyomavirus DNA in persons without Merkel cell gastrointestinal tract. PLoS One 2013, 8 (3), e58021. carcinoma. Emerging Infect. Dis. 2009, 15 (9), 1496−1498. (107) Martin, E.; Dang, J.; Bzhalava, D.; Stern, J.; Edelstein, Z. R.; (125) Wieland, U.; Silling, S.; Scola, N.; et al. Merkel cell Koutsky, L. A.; Kiviat, N. B.; Feng, Q. H. Characterization of three polyomavirus infection in HIV-positive men. Arch. Dermatol. 2011, novel human papillomavirus types isolated from oral rinse samples of 147 (4), 401−406. healthy individuals. J. Clin. Virol. 2014, 59 (1), 30−37. (126) Hospodsky, D.; Qian, J.; Nazaroff, W. W.; Yamamoto, N.; (108) Li, J. J.; Pan, Y. Q.; Deng, Q. J.; Cai, H.; Ke, Y. Identification Bibby, K.; Rismani-Yazdi, H.; Peccia, J. Human occupancy as a source and characterization of eleven novel human gamma-papillomavirus of indoor airborne bacteria. PLoS One 2012, 7 (4), e34867. isolates from healthy skin, found at low frequency in a normal (127) Meadow, J. F.; Altrichter, A. E.; Bateman, A. C.; Stenson, J.; population. PLoS One 2013, 8 (10), e77116. Brown, G. Z.; Green, J. L.; Bohannan, B. J. M. Humans differ in their (109) Schowalter, R. M.; Pastrana, D. V.; Pumphrey, K. A.; Moyer, A. personal microbial cloud. PeerJ 2015, 3, e1258. L.; Buck, C. B. Merkel cell polyomavirus and two previously unknown (128) Arden, K. E.; Mackay, I. M. Human rhinoviruses: coming in polyomaviruses are chronically shed from human skin. Cell Host from the cold. Genome Med. 2009, 1 (4), 44−44. Microbe 2010, 7 (6), 509−515. (129) Turner, R. B. Rhinovirus: More than just a common cold virus. (110) Bernard, H. U.; Burk, R. D.; Chen, Z. G.; van Doorslaer, K.; J. Infect. Dis. 2007, 195 (6), 765−766. zur Hausen, H.; de Villiers, E. M. Classification of papillomaviruses (130) Principi, N.; Zampiero, A.; Gambino, M.; Scala, A.; Senatore, (PVs) based on 189 PV types and proposal of taxonomic amendments. L.; Lelii, M.; Ascolese, B.; Pelucchi, C.; Esposito, S. Prospective Virology 2010, 401 (1), 70−79. evaluation of rhinovirus infection in healthy young children. J. Clin. (111) Bernard, H. U.; Burk, R. D.; deVilliers, E. M.; zur Hausen, H. Virol. 2015, 66, 83−89. Family Papillomaviridae; Academic Press: London, 2012. (131) Granados, A.; Goodall, E. C.; Luinstra, K.; Smieja, M.; (112) Kremsdorf, D.; Favre, M.; Jablonska, S.; Obalek, S.; Rueda, L. Mahony, J. Comparison of asymptomatic and symptomatic rhinovirus A.; Lutzner, M. A.; Blanchet-Bardon, C.; Van Voorst Vader, P. C.; infections in university students: incidence, species diversity, and viral Orth, G. Molecular cloning and characterization of the genomes of load. Diagn. Microbiol. Infect. Dis. 2015, 82 (4), 292−296. nine newly recognized human papillomavirus types associated with (132) Mackay, I. M. Human rhinoviruses: The cold wars resume. J. epidermodysplasia verruciformis. J. Virol. 1984, 52 (3), 1013−1018. Clin. Virol. 2008, 42 (4), 297−320. (113) Muñoz, N.; Bosch, F. X.; de Sanjose,́ S.; Herrero, R.; (133) Gauzere, C.; Godon, J. J.; Blanquart, H.; Ferreira, S.; Moularat, Castellsague,́ X.; Shah, K. V.; Snijders, P. J. F.; Meijer, C. J. L. M. S.; Robine, E.; Moletta-Denat, M. ’Core species’ in three sources of Epidemiologic classification of human papillomavirus types associated indoor air belonging to the human micro-environment to the exclusion with cervical cancer. N. Engl. J. Med. 2003, 348 (6), 518−527. of outdoor air. Sci. Total Environ. 2014, 485, 508−517.

M DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX Environmental Science & Technology Article

(134) Hollister, E. B.; Riehle, K.; Luna, R. A.; Weidler, E. M.; Rubio- Gonzales, M.; Mistretta, T.-A.; Raza, S.; Doddapaneni, H. V.; Metcalf, G. A.; Muzny, D. M.; Gibbs, R. A.; Petrosino, J. F.; Shulman, R. J.; Versalovic, J. Structure and function of the healthy pre-adolescent pediatric gut microbiome. Microbiome 2015, 3, 36. (135) Paez-Espino, D.; Chen, I. M. A.; Palaniappan, K.; Ratner, A.; Chu, K.; Szeto, E.; Pillay, M.; Huang, J.; Markowitz, V. M.; Nielsen, T.; Huntemann, M.; Reddy, T. B. K.; Pavlopoulos, G. A.; Sullivan, M. B.; Campbell, B. J.; Chen, F.; McMahon, K.; Hallam, S. J.; Denef, V.; Cavicchioli, R.; Caffrey, S. M.; Streit, W. R.; Webster, J.; Handley, K. M.; Salekdeh, G. H.; Tsesmetzis, N.; Setubal, J. C.; Pope, P. B.; Liu, W.-T.; Rivers, A. R.; Ivanova, N. N.; Kyrpides, N. C. IMG/VR: a database of cultured and uncultured DNA Viruses and retroviruses. Nucleic Acids Res. 2017, 45 (D1), D457−D465. (136) Verreault, D.; Gendron, L.; Rousseau, G. M.; Veillette, M.; Masse,́ D.; Lindsley, W. G.; Moineau, S.; Duchaine, C. Detection of airborne lactococcal bacteriophages in cheese manufacturing plants. Appl. Environ. Microbiol. 2011, 77 (2), 491−497. (137) Marco,́ M. B.; Moineau, S.; Quiberoni, A. Bacteriophages and dairy fermentations. Bacteriophage 2012, 2 (3), 149−158. (138) Lanter, J. M.; McGuire, J. M.; Goode, M. J. Persistence of in tomato debris and soil under field conditions. Plant Dis. 1982, 66, 552−555. (139) Reingold, V.; Lachman, O.; Blaosov, E.; Dombrovsky, A. Seed disinfection treatments do not sufficiently eliminate the infectivity of Cucumber green mottle mosaic virus (CGMMV) on cucurbit seeds. Plant Pathol. 2015, 64 (2), 245−255. (140) Zhang, T.; Breitbart, M.; Lee, W. H.; Run, J.-Q.; Wei, C. L.; Soh, S. W. L.; Hibberd, M. L.; Liu, E. T.; Rohwer, F.; Ruan, Y. RNA viral community in human feces: prevalence of plant pathogenic viruses. PLoS Biol. 2005, 4 (1), e3. (141) Cantalupo, P. G.; Calgua, B.; Zhao, G.; Hundesa, A.; Wier, A. D.; Katz, J. P.; Grabe, M.; Hendrix, R. W.; Girones, R.; Wang, D.; Pipas, J. M. Raw sewage harbors diverse viral populations. mBio 2011, 2 (5), e00180−11. (142) Rosario, K.; Symonds, E. M.; Sinigalliano, C.; Stewart, J.; Breitbart, M. Pepper mild mottle virus as an indicator of fecal pollution. Appl. Environ. Microbiol. 2009, 75 (22), 7261−7267. (143) Stachler, E.; Bibby, K. Metagenomic Evaluation of the Highly Abundant Human Gut Bacteriophage CrAssphage for Source Tracking of Human Fecal Pollution. Environ. Sci. Technol. Lett. 2014, 1 (10), 405−409. (144) Ziegler, A.; Matousek,̌ J.; Steger, G.; Schubert, J. Complete sequence of a cryptic virus from hemp (Cannabis sativa). Arch. Virol. 2012, 157 (2), 383−385. (145) Sawler, J.; Stout, J. M.; Gardner, K. M.; Hudson, D.; Vidmar, J.; Butler, L.; Page, J. E.; Myles, S. The genetic structure of marijuana and hemp. PLoS One 2015, 10 (8), e0133292. (146) Newton, R. J.; McLellan, S. L.; Dila, D. K.; Vineis, J. H.; Morrison, H. G.; Eren, A. M.; Sogin, M. L. Sewage reflects the microbiomes of human populations. mBio 2015, 6 (2), e02574-14. (147) Alonso, C.; Raynor, P. C.; Davies, P. R.; Torremorell, M. Concentration, size distribution, and infectivity of airborne particles carrying swine viruses. PLoS One 2015, 10 (8), e0135675. (148) Lindsley, W. G.; Blachere, F. M.; Davis, K. A.; Pearce, T. A.; Fisher, M. A.; Khakoo, R.; Davis, S. M.; Rogers, M. E.; Thewlis, R. E.; Posada, J. A.; Redrow, J. B.; Celik, I. B.; Chen, B. T.; Beezhold, D. H. Distribution of airborne influenza virus and respiratory ayncytial virus in an urgent care medical clinic. Clin. Infect. Dis. 2010, 50 (5), 693− 698. (149) Milton, D. K.; Fabian, M. P.; Cowling, B. J.; Grantham, M. L.; McDevitt, J. J. Influenza virus aerosols in human exhaled breath: Particle size, culturability, and effect of surgical masks. PLoS Pathog. 2013, 9 (3), e1003205. (150) Fierer, N.; Lauber, C. L.; Zhou, N.; McDonald, D.; Costello, E. K.; Knight, R. Forensic identification using skin bacterial communities. Proc. Natl. Acad. Sci. U. S. A. 2010, 107 (14), 6477−6481.

N DOI: 10.1021/acs.est.7b04203 Environ. Sci. Technol. XXXX, XXX, XXX−XXX