Comparing the anterior nare bacterial community of two discrete human populations using Illumina amplicon sequencing.
Item Type Article
Authors Camarinha-Silva, Amélia; Jáuregui, Ruy; Chaves-Moreno, Diego; Oxley, Andrew P A; Schaumburg, Frieder; Becker, Karsten; Wos-Oxley, Melissa L; Pieper, Dietmar H
Citation Comparing the anterior nare bacterial community of two discrete human populations using Illumina amplicon sequencing. 2014, 16 (9):2939-52 Environ. Microbiol.
DOI 10.1111/1462-2920.12362
Journal Environmental microbiology
Download date 09/10/2021 21:30:35
Link to Item http://hdl.handle.net/10033/334759
Supplementary figures and tables
Comparing the anterior nare bacterial community of two discrete human populations using Illumina amplicon sequencing
Amélia Camarinha-Silva1, Ruy Jáuregui1, Diego Chaves-Moreno1, Andrew P.A. Oxley1,2, Frieder Schaumburg3, Karsten Becker3, Melissa L. Wos-Oxley1, Dietmar H. Pieper1*
1Microbial Interactions and Processes Research Group, Helmholtz Centre for Infection Research, Braunschweig, Germany; 2Infection Immunology Research Group, Helmholtz Centre for Infection Research, Braunschweig, Germany; 3Institute of Medical Microbiology, University Hospital Münster, Münster, Germany *For correspondence: E-mail [email protected]; Tel. (+49) 531 6181 4200; Fax (+49) 531 6181 4499
Summary The anterior nares are an important reservoir for opportunistic pathogens and commensal microorganisms. A barcoded Illumina paired-end sequencing method targeting the 16S rRNA V1-2 hypervariable region was developed to compare the bacterial diversity of the anterior nares across distinct human populations (volunteers from Germany vs a Babongo Pygmy tribe, Africa). Of the 251 phylotypes detected, 231 could be classified to the genus level and 109 to the species level, including the unambiguous identification of the ubiquitous Staphylococcus aureus and Moraxella catarrhalis. The global bacterial community of both adult populations revealed that they shared 85% of the phylotypes, suggesting that our global bacterial communities have likely been with us for thousands of years. Of the 34 phylotypes unique to the non-westernized population, most were related to members within the suborder Micrococcineae. There was an even more overwhelming distinction between children and adults of the same population, suggesting a progression of a childhood community of high diversity comprising species of Moraxellaceae and Streptococcaceae to an adult community of lower diversity comprising species of Propionibacteriaceae, Clostridiales Incertae Sedis XI, Corynebacteriaceae and Staphylococcaceae. Thus, age was a stronger factor in accounting for differing bacterial assemblages than the origin of the human population sampled.
Fig. S1. Correlation coefficients after comparing aggregated data matrices. The correlation coefficient (rho) was derived by comparing each of the genus-, family-, order-, class- and phylum-level data matrices with the original species-level matrix, using the RELATE routine in PRIMER.
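The comparison behind Fig. S1 can be illustrated with a rank correlation between the corresponding off-diagonal elements of two resemblance matrices. The sketch below is only an approximation of that idea under stated assumptions: it uses Spearman's rho with average ranks for ties, the function names are hypothetical, the two small matrices are invented for illustration, and it is not the PRIMER implementation.

```python
from itertools import combinations


def average_ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def matrix_rho(m1, m2):
    """Spearman's rho between the off-diagonal elements of two square
    resemblance matrices defined over the same samples."""
    pairs = list(combinations(range(len(m1)), 2))
    v1 = [m1[i][j] for i, j in pairs]
    v2 = [m2[i][j] for i, j in pairs]
    return pearson(average_ranks(v1), average_ranks(v2))


# Invented similarity matrices for four samples (illustration only).
species_level = [
    [100, 60, 30, 10],
    [60, 100, 40, 20],
    [30, 40, 100, 50],
    [10, 20, 50, 100],
]
genus_level = [
    [100, 70, 45, 25],
    [70, 100, 55, 35],
    [45, 55, 100, 65],
    [25, 35, 65, 100],
]
print(round(matrix_rho(species_level, genus_level), 6))
```

A rho close to 1 means that aggregating to the coarser taxonomic level preserves the rank order of between-sample similarities.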
[Fig. S2 plots, panels A and B: number of phylotypes (y-axis, logarithmic scale from 1 to 100) against number of reads (x-axis, 0 to 10000).]
Fig. S2. Sampling effort across each sample, individual-based rarefaction curves. Rarefaction curves portraying the number of resolved phylotypes against sampling depth of each sample within the (A) non-westernized population and (B) westernized population.
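One way to compute a point on an individual-based rarefaction curve is the analytical expectation of the number of phylotypes in a random subsample of reads. The sketch below assumes that approach (the source does not state whether analytical rarefaction or repeated random resampling was used); the function name and the read counts are hypothetical.

```python
from math import comb


def rarefied_richness(counts, depth):
    """Expected number of phylotypes in a random subsample of `depth` reads
    drawn without replacement from a sample with the given read counts."""
    total = sum(counts)
    if depth > total:
        raise ValueError("depth exceeds the number of reads in the sample")
    denom = comb(total, depth)
    # comb(total - c, depth) counts the subsamples that miss a phylotype
    # represented by c reads, so 1 - ratio is its detection probability.
    return sum(1 - comb(total - c, depth) / denom for c in counts if c > 0)


# Hypothetical read counts for five phylotypes in one sample.
example_counts = [50, 30, 15, 4, 1]
for depth in (1, 10, 50, 100):
    print(depth, round(rarefied_richness(example_counts, depth), 3))
```

Plotting the returned value against depth for each sample gives a curve that flattens once additional reads rarely reveal new phylotypes.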
Fig. S3. Sampling effort across each population, species accumulation curves. Estimators of species richness are the total number of all species (Sobs) and the Chao 2 estimator of true richness. Plotted values are the mean ± standard deviation of 999 permutations. (A) Curves representing both westernized and non-westernized populations, (B) curves representing both non-westernized adults and children.
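The Chao 2 estimator extrapolates true richness from how many phylotypes occur in exactly one sample (Q1) and in exactly two samples (Q2). The following is a minimal sketch of the classic incidence-based form, Sobs + Q1^2 / (2 Q2); the function name, the fallback used when Q2 is zero, and the presence/absence data are assumptions for illustration, and the permutation averaging mentioned in the caption is not reproduced.

```python
from collections import Counter


def chao2(samples):
    """Classic Chao 2 richness estimate from presence/absence data.

    `samples` is a list of sets, each holding the phylotype labels
    detected in one sample.
    """
    incidence = Counter()
    for sample in samples:
        incidence.update(set(sample))
    s_obs = len(incidence)
    q1 = sum(1 for n in incidence.values() if n == 1)  # found in one sample
    q2 = sum(1 for n in incidence.values() if n == 2)  # found in two samples
    if q2 > 0:
        return s_obs + q1 * q1 / (2 * q2)
    # Assumed convention when no phylotype occurs in exactly two samples,
    # where the classic formula would divide by zero.
    return s_obs + q1 * (q1 - 1) / 2


# Hypothetical presence/absence data for three samples.
example_samples = [
    {"PT1-1", "PT3-1", "PT26-1"},
    {"PT1-1", "PT3-1"},
    {"PT1-1", "PT38-1", "PT26-1"},
]
print(chao2(example_samples))  # 4 observed, Q1 = 1, Q2 = 2 -> 4.25
```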
Fig. S4. Maximum-likelihood tree depicting the taxonomic breadth of the 251 phylotypes detected from the anterior nares. The taxonomic breadth of westernized adults, non-westernized adults and non-westernized children is represented by complete or near-complete 16S rRNA gene sequences from their closest taxonomic relatives available from the SILVA database. GenBank accession numbers are given after each strain name. Branch support values were calculated from 1000 bootstrap re-samplings (values > 85% are given at the nodes). Scale bar represents 5% nucleotide sequence divergence.
Fig. S5. Comparing the relative abundance and prevalence of selected phylotypes in the anterior nare bacterial communities. (A) Kocuria sp., (B) Kocuria marina and (C) Kocuria koreensis observed in westernized and non-westernized adults.
Fig. S6. Non-metric multidimensional scaling (nMDS) plot comparing the global bacterial community structure of 190 human anterior nares. (A) Community structures across 147 adult volunteers (non-westernized, n=55; westernized, n=92). (B) Community structures across 98 non-westernized volunteers (non-westernized adults, n=55; non-westernized children, n=43). For each phylotype, the number of sequence reads was standardized (%) but left untransformed before applying the Bray-Curtis similarity algorithm. While 2D stress values of 0.18 and 0.19 indicate some stress on the plots, this is deemed acceptable given the large number of samples being ordinated.
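The resemblance measure underlying these ordinations can be sketched as follows: read counts are first converted to percentages per sample, then the Bray-Curtis similarity is computed without any further transformation. The function names and the read counts are hypothetical, and the nMDS step itself is not shown.

```python
def standardize(counts):
    """Convert raw read counts of one sample to percentages."""
    total = sum(counts)
    return [100.0 * c / total for c in counts]


def bray_curtis_similarity(x, y):
    """Bray-Curtis similarity (0 to 100) between two abundance profiles
    listed over the same phylotypes in the same order."""
    diff = sum(abs(a - b) for a, b in zip(x, y))
    total = sum(a + b for a, b in zip(x, y))
    return 100.0 * (1 - diff / total)


# Hypothetical read counts for three phylotypes in two samples.
sample_a = standardize([160, 40, 0])     # 200 reads in total
sample_b = standardize([50, 150, 300])   # 500 reads in total
print(round(bray_curtis_similarity(sample_a, sample_b), 6))  # 30.0
```

Because both profiles sum to 100 after standardization, the similarity equals the summed percentage that the two samples have in common across phylotypes.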
Fig. S7. Non-metric multidimensional scaling (nMDS) plot with superimposed bubbles. (A,C,E,G) Bubbles superimposed onto the ordination plot of Fig. S6A ordinating both adult populations (where westernized adults are denoted by an asterisk and non-westernized adults by a hash). (B,D,F,H) Bubbles superimposed onto the ordination plot of Fig. S6B ordinating non-westernized children and adults (where non-westernized adults are denoted by a hash and non-westernized children by a cross). Bubbles represent the relative abundance of (A-B) C. accolens (PT1-1), (C-D) P. acnes (PT3-1), (E-F) S. epidermidis/S. capitis/S. caprae (PT38-1) and (G-H) Peptoniphilus sp. (PT44-1).
Fig. S8. Comparing the relative abundance and prevalence of selected phylotypes in the anterior nare bacterial communities. (A) S. aureus, (B) M. lacunata/M. nonliquefaciens and (C) M. catarrhalis observed in females and males in the non-westernized adults, (D) M. lacunata/M. nonliquefaciens and (E) M. catarrhalis observed in females and males in the westernized adults.
Fig. S9. Non-metric multidimensional scaling (nMDS) plot with superimposed bubbles. (A,C,E) Bubbles superimposed onto the ordination plot of Fig. S6A ordinating both adult populations (where westernized adults are denoted by an asterisk and non-westernized adults by a hash). (B,D,F) Bubbles superimposed onto the ordination plot of Fig. S6B ordinating non-westernized children and adults (where non-westernized adults are denoted by a hash and non-westernized children by a cross). Bubbles represent the relative abundance of (A-B) Streptococcus pneumoniae/S. mitis (PT59-1), (C-D) Dolosigranulum pigrum (PT66-1), (E-F) Dolosigranulum sp. (PT72-1).
Fig. S10. Non-metric multidimensional scaling (nMDS) plot with superimposed bubbles. (A,C,E) Bubbles superimposed onto the ordination plot of Fig. S6A ordinating both adult populations (where westernized adults are denoted by an asterisk and non-westernized adults by a hash). (B,D,F) Bubbles superimposed onto the ordination plot of Fig. S6B ordinating non-westernized children and adults (where non-westernized adults are denoted by a hash and non-westernized children by a cross). Bubbles represent the relative abundance of (A-B) Staphylococcus aureus (PT26-1), (C-D) Moraxella lincolnii (PT98-1), (E-F) Moraxella catarrhalis (PT123-1).
Fig. S11. Ecological biodiversity indices of the anterior nare bacterial communities of non-westernized adults and children and westernized adults. The indices presented here are: total phylotypes (S), Shannon diversity (H’), Pielou’s evenness (J’), Simpson index (1-lambda), average Taxonomic Distinctness (delta+) and variation in Taxonomic Distinctness (lambda+).
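Four of these indices follow directly from the read counts of a single sample, as the sketch below illustrates. It assumes natural logarithms for H’ and the form 1 - sum(p_i^2) for the Simpson index; the function name and the counts are hypothetical. The two Taxonomic Distinctness measures additionally require a taxonomic hierarchy and are not covered here.

```python
from math import log


def diversity_indices(counts):
    """Return S, H' (natural log), J' and 1-lambda for one sample."""
    present = [c for c in counts if c > 0]
    total = sum(present)
    props = [c / total for c in present]
    richness = len(present)
    shannon = -sum(p * log(p) for p in props)
    # Evenness is undefined for a single phylotype; 0.0 is returned here
    # as an assumed convention.
    evenness = shannon / log(richness) if richness > 1 else 0.0
    simpson = 1 - sum(p * p for p in props)
    return {"S": richness, "H'": shannon, "J'": evenness, "1-lambda": simpson}


# Hypothetical read counts: an even community and a dominated one.
print(diversity_indices([25, 25, 25, 25]))
print(diversity_indices([97, 1, 1, 1]))
```

The even community reaches the maximum evenness of 1, whereas the community dominated by one phylotype scores lower on H’, J’ and 1-lambda despite having the same S.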
Table S1. Nucleotide sequences of all 251 phylotypes determined using Illumina-based amplicon deep-sequencing.
Phylotype Sequence
PT1-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-2 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT1-3 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAAGGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-5 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAAGGGCCGACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-6 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGTGAATGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-7 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAAGGGCCTACCAAGGGGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-9 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCGACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-10 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGGAAGGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-11 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAAGTGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-12 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGACACGGCCCAA
PT1-13 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGGGGCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-14 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAAGGGCCTCCCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-15 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGGAATGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-8 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGTCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGACACGGCCCAG
PT1-16 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTCCCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-18 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGGAATGGCCGACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-19 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAAGGGCCTACCAAGGCGGCGGCGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-20 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCGACCAAGGGGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-21 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT1-22 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAGTGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-23 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCGCCCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-24 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTCACAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT1-25 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTCCCAAGGGGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT2-1 GATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCCAGCTTGCTGGGGTGCTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTGCGGCCACATTGGGACTGAGATACGGCCCAG
PT3-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTTTGTGGGGTGCTCGAGTGGCGAACG-GGGTAGTGGCTTACCAAGGCTTTGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT3-3 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTTTGTGGGGTGCTCGAGTGGCGAACG-GGGTAGTGGCTTACCAAGGCTTTGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT3-4 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTTTGTGGGGTGCTCGAGTGGCGAACG-GGGTAGGTGCTTACCAAGGCTTTGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT3-5 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTTTGTGGGGTGCTCGAGTGGCGAACG-GGGTAATGGCTTACCAAGGCTTTGACGGGTAGTCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT3-6 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTTTGTGGGGTGCTCGAGTGGCGAACG-GGTGAGTGGCTTACCAAGGCTTTGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT3-7 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTTTGTGGGGTGCTCGAGTGGCGAACG-GGGTATGGGCTTACCAAGGCTTTGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT4-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGAAGAGCGATGGAAGCTTGCTTCTATCAATCTTAGTGGCG-GGGTAACGGCCCACCAAGGCGATGATCAGTAGCCGGTCTGAGAGGATGAACGGCCACATTGGGACTGAGACACGGCCCAG
PT5-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGGATTTAGTAAACAGAAGCCTCGGTGGAAGATTACTAAT-AGATAACAGCCCACCAAGGCGACGATCAGTAGCCGGTCTGAGAGGATGAACGGCCACATTGGAACTGAGACACGGTCCAA
PT6-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAGATTATTAGACTTCTTCGGAATGAAAATAATTGG-AGAAAAAAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT8-1 GACGAACGCTGGCGGCGTGCCTAACACATGCAAGTCGAACGGAGAGATACGAGAGCTTGCTCTTGTAAATCTTAGTGGCA-GGGTAACGGCCTACCAAGACGATGATCAATAGCCGGTCTGAGAGGATGAACGGCCACACTGGAACTGAGACACGGTCCAG
PT9-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAACTTAGTAGACTTCTTCGGAAAGACACTAAGCGG-AGATAAAAGCTCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT10-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGCTTGGTGCTTGCACCGAGCGGATGAGTTGCGAA-AGGTAACGGCTCACCAAGGCAACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT12-1 GATGAACGCTGACAGAATGCTTAACACATGCAAGTCTTTGGCGAAGCTGTGCTTGCACAGCCTAGCCAAGGCGGACGGGT-GGGTAACGGCCCACCAAGGCGATGATAGGTAGCCGGCCTGAGAGGGTGGACGGCCACAAGGGGACTGAGATACGGCCCTT
PT13-1 GATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGCAGGGTAACATGAGGAAAGCTTGCTTTCCTTGATGACGACTGG-GGGTAACGGCCCACCAAGGCTACGATCAGTAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAA
PT15-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCCCTTGTGGGGTACTCGAGTGGCGAACGGGT-GGGTAGTGGCTCACCAAGGCTTTGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT16-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGGATTTAGTAGACAGAAGCTTCGGTGGAAGATTACTAAT-AGATAACAGCCCACCAAGGCGACGATCAGTAGCCGGTCTGAGAGGATGAACGGCCACATTGGAACTGAGACACGGTCCAA
PT17-1 GATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGCAGGGTAACGTGAGGGAAGCTTGCTTCCCTTGACGACGACTGG-GGGTAACGGCCCACCAAGGCGACGATCAGTAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAA
PT19-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACAGATAAGGAGCTTGCTCCTTTGACGTTAGCGGCGGA-AGGTAACGGCTTACCAAGGCAACGATACGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAG
PT20-1 ATTGAACGCTGGCGGCATGCTTTACACATGCAAGTCGGACGGCAGCACAGAGAAGCTTGCTTCTTGGGTGGCGAGTGGCG-GGGTAAAGGCCTACCAAGGCGACGATCAGTAGCGGGTCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT22-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGAGGAGCTTGCTCTTCTGGATGAGTTGCGAACG-GGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT22-2 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGAGGAGCTTGCTCTTCTGGATGAGTTGCGAACG-AGGTAACGGCTCACCAAGGCAACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT24-1 GATGAACGCTAGCGATAGGCTTAACACATGCAAGTCGAGGGGTAACGTGTTGGAAGCTTGCTTCCGATGACGACGACCGG-AGGTAACGGCTCACCAAGCCAACGATAGGTAGGGGTGCTGAGAGGCAGATCCCCCACATTGGGACTGAGACACGGCCCAA
PT25-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGAGAGCGACCGGTGCTTGCACTGGTCAATCTAGTGGCGAA-AGGTAACGGCTCACCAAGGCCGTGATGCATAGCCGACCTGAGAGGGTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT26-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACGGACGAGAAGCTTGCTTCTCTGATGTTAGCGGCGGA-AGGTAACGGCTTACCAAGGCAACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAG
PT27-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCACCAGCTTGCTGGTGTGGATGAGTGGCGAACG-AGGTAATGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT28-1 GACGAACGCTGGCGGCGCGCTTAACACATGCAAGTCGAGCGATGAAATCTTGACAGATCCCTTCGGGGTGAAGATAAGAT-AGGTAACGGCCCACCAAGGCGACGATGGGTAGCCGGCCTGAGAGGGTGAACGGCCACATTGGAACTGAGAAACGGTCCAA
PT31-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGGATTTAGTAGACAGAAACCTCGGTGGAAGATTACTAAT-AGATAACAGCCCACCAAGGCGACGATCAGTAGCCGGTCTGAGAGGATGAACGGCCACATTGGAACTGAGACACGGTCCAA
PT32-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTAAGGCTCCAGCTTGCTGGGGTACACGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGACACGGCCCAG
PT35-1 GATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCCCAGCTTGCTGGGTGGATTAGTGGCGAACGGG-AGGTAATGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT36-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGAGAGGAGCTTGCTCTTCTTGGATGAGTTGCGAA-GGGTAACGGCTCACCAAGGCAACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT36-2 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGAGAGGAGCTTGCTCTTCTTGGATGAGTTGCGAA-AGGTAATGGCTCACCTAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT36-3 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGAGAGGAGCTTGCTCTTCTTGGATGAGTTGCGAA-AGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT37-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTCTGCTTGCAGGGGTGCTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGTCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT38-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACAGACGAGGAGCTTGCTCCTCTGACGTTAGCGGCGGA-AGGTAACGGCTTACCAAGGCAACGATGCGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAG
PT38-2 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACAGACGAGGAGCTTGCTCCTCTGACGTTAGCGGCGGA-GGGTAACGGCTTACCAAGGCGACGATGCGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAG
PT39-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAGCGGAGATGAGGTGCTTGCACCTTATCTTAGCGGCGGACGGG-GGGTAAAGGCCTACCAAGGCGACGATCTGTAGCGGGTCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT40-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCTTGGTGCTTGCACTGGGTGGATGAGTGGCGAA-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT40-2 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCTTGGTGCTTGCACTGGGTGGATGAGTGGCGAA-GGGTAATGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT41-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAACTTAAATGATTTCTTCGGAATGAACTTAAGTGG-AGAAAAAAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT43-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCTGTGCTTGCACAGTGGATTAGTGGCGAACGGG-AGGTAGTGGCTCACCAAGGCGATGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT44-1 GACGAACGCTGGCGGCGCGCTTAACACATGCAAGTCGAGCGATGAAACTTTAACAGAACCCTTCGGGGCGAAGATAAAGT-GAGTAAAAGCCTACCAAGGCGACGATGGGTAGCCGGCCTGAGAGGGTGAACGGCCACATTGGAACTGAGAAACGGTCCAA
PT45-1 ATTGAACGCTGGCGGCATGCTTTACACATGCAAGTCGAACGGCAGCGGGGAGGAGCTTGCTTCTCTGCCGGCGAGTGGCG-AGGTAAAGGCTCACCAAGGCGAAGATCTGTAGCGGGTCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT46-1 GATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCAAGCTTGCTTGGGTACTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGTCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT48-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGAGGAGCTTGCTCTTTCCGGATGAGTTGCGAAC-AGGTAACGGCTCACCAAGGCAACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT49-1 ATTGAACGCTGGCGGCATGCCTTACACATGCAAGTCGAACGGCAGCACGGGAGCAATCCTGGTGGCGAGTGGCGAACGGG-GGGTAAAGGCCTACCAAGGCGACGATCAGTAGCTGGTCTGAGAGGACGATCAGCCACACTGGGACTGAGACACGGCCCAG
PT50-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCCTAGCTTGCTAGGTGGATTAGTGGCGAACGGG-AGGTAACGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT50-2 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCCTAGCTTGCTAGGTGGATTAGTGGCGAACGGG-AGGTAATGGCTTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT51-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGGTAGCAGGAGAAAGCTTGCTTTCTTGCTGACGAGTGGCG-GGGTAAAGGCCTACCAAGCCTGCGATCTCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAG
PT51-2 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGGTAGCAGGAGAAAGCTTGCTTTCTTGCTGACGAGTGGCG-GGGTAAGGGCCTACCAAGCCTGCGATCTCTAGCTGGTCTGAGAGGATGGCCAGCCACACTGGAACTGAGACACGGTCCAG
PT52-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCAGTGCTTGCACTGGTACTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGTCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT53-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAACTTTACTGACTTCTTCGGAAAGATTTAAAGCGG-AGAAAAAAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT54-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAACTTAAAAGATTTCTTCGGAATGAACTTAAGTGA-AGATAACAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT55-1 GATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCTCCTGCTTGCAGGGGTACTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGTCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT59-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGAGGAGCTTGCTTCTCTGGATGAGTTGCGAACG-GGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT59-2 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGAGGAGCTTGCTTCTCTGGATGAGTTGCGAACG-AGGTAACGGCTCACCAAGGCAACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT60-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCTCCAGCTTGCTGGAGTACTCGAGTGGCGAACG-GGGTGATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGATCGGCCACATTGGGACTGAGATACGGCCCAG
PT61-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGTACGGTAAGGCCCTTTCGGGGGTACACGAGTGGCGAACGGGTG-GGGTAGTGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT62-1 GACGAACGCTGGCTGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCCTGGTGCTTGCACCGGGTGGATGAGTGGCGAA-AGGTAATGGCTCACCAAGACGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT63-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAACTTTAATGAACCCTTCGGGGAGAATTAAAGCGG-AGATAAAAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT64-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCCTTTGGGGGTACTCGAGTGGCGAACGGGTG-GGGTAGTGGCTTACCAAGGCTTTGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT65-1 ATTGAACGCTGGCGGCATGCTTTACACATGCAAGTCGAACGGCAGCGAGGAGAAGCTTGCTTCTCTGTCGGCGAGTGGCG-GGGTAATGGCTCACCAAGGCAATGATCCGTAGCGGGTCTGAGAGGACGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT66-1 GACGAACGCTGGCGGCATGCCTAATACATGCAAGTCGAACGATGATATCACTGCTTGCAGTGATTGATTAGTGGCGAACG-AGGTAATGGCTTACCAAGGCAGTGATACGTAGCCGACTTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT68-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTTGAGCGCTGAAGGTTGGTACTTGTACCAACTGGATGAGCAGCGAA-AGGTAAAGGCTCACCAAGGCGATGATACATAGCCGACCTGAGAGGGTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT69-1 AGTGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGATGAAGTCCTAGCTTGCTAGGATGGATTAGTGGCGCACG-AGGTAATGGCTTACCAAGGCTATGACGCGTAACTGGTCTGAGAGGATGATCAGTCACATTGGAACTGAGACACGGTCCAA
PT70-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTGAAGGGATCAGCTTGCTGGTTCTGGATGAGTGGCGAA-GGGTGATGGCCTACCAAGGCGGTGACGGGTAGCCGGCCTGAGAGGGTGGACGGTCACACTGGGACTGAGACACGGCCCAG
PT71-1 ATTGAACGCTAGCGGGATGCTTTACACATGCAAGTCGAACGGCAGCGCGAGAGAGCTTGCTCTCTTGGCGGCGAGTGGCG-GGGTAAAGGCTCACCAAGGCAACGATCCGTAGCTGGTTTGAGAGGACGACCAGCCACACTGGGACTGAGACACGGCCCAG
PT72-1 GACGAACGCTGGCGGCATGCCTAATACATGCAAGTCGAACGATGACGTCACTGCTTGCAGTGACTGATTAGTGGCGAACG-AGGTAATGGCTTACCAAGGCAGTGATACGTAGCCGACTTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT72-2 GACGAACGCTGGCGGCATGCCTAATACATGCAAGTCGAACGATGACGTCACTGCTTGCAGTGACTGATTAGTGGCGAACG-AGGTAATGGCTTACCAAGGCAGTGATACGTAGCCGACTTGAGAGGGTGATCGGCCACATTGGGACTGAGATACGGCCCAG
PT73-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCACAGTTTATACCGTAGCTTGCTACACCATAGACTGTGA-AGGTAACGGCTCACCTAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT74-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAACTTTATAGAATTCTTCGGAAGGAAATAAAGAGG-AGATAAAAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT76-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGATGAAGTCTAGCTTGCTAGACGGATTAGTGGCGAACGGG-GGGTAAAGGCCTACCAAGGCGACGATCTGTAGCTGGTCTGAGAGGATGATCAGCCACACTGGGACTGAGACACGGCCCAG
PT77-1 ATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAGCGGATGACGGGAGCTTGCTCCTTGATTCAGCGGCGGACGGG-GGGTAATGGCTCACCAAGGCGACGATCCGTAACTGGTCTGAGAGGATGATCAGTCACACTGGAACTGAGACACGGTCCAG
PT79-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGGTAACATAAAGAAGCTTGCTTCTTTGATGACGAGTGGCG-AGGTAAAGGCTCACCAAGCCGACGATCTCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGGACTGAGACACGGCCCAG
PT80-1 GATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGAGGGGAAACGGCATTGAGTGCTTGCACTCTTTGGACGTCGACC-GGGTAACGGCCCACCAAGGCAACGATCAGTAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAA
PT81-1 ATTGAACGCTGGCGGCATGCTTTACACATGCAAGTCGGACGGCAGCGGGGTAGTGCTTGCACTACTGCCGGCGAGTGGCG-GGGTAAAGGCCTACCAAGGCGACGATCAGTAGCGGGTCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT82-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTTTTCGGAGGTACTCGAGTGGCGAACGGGTG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT83-1 GATGAACGCTGACAGAATGCTTAACACATGCAAGTCAACTTGAACTTCGGTTTGGGTGGCGGACGGGTGAGTAACGCGTA-AGGTAACGGCTCACCAAGGCGATGATGGGTAGCCGGCCTGAGAGGGTGATCGGCCACAAGGGGACTGAGACACGGCCCTT
PT84-1 GATGAACGCTAGCGATAGGCTTAACACATGCAAGTCGAGGGGCAGCATGGTCTTAGCTTGCTAAGACTGATGGCGACCGG-AGGTAACGGCTCACCAAGGCGACGATGGGTAGGGGAACTGAGAGGTTGAACCCCCACACTGGTACTGAGACACGGACCAG
PT86-1 GATGAACGCTAGCTATAGGCTTAACACATGCAAGTCGAGGGGCAGCGAATAGATAGCTTGCTATTTATGTCGGCGACCGG-GGGTAACGGCCCACCAAGGCAACGATCAGTAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAA
PT88-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGGAGAGCTTGCTCTTCTGGATGAGTTGCGAACG-GGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT89-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAACTTAAATGAATTCTTCGGAATGATTTTAATTGG-AGATAACAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT90-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGACGGTGGTGCTTGCACCGCCTGATTAGTGGCGAACG-AGGTGATGGCTCACCAAGACGATGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT90-2 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGACGGTGGTGCTTGCACCGCCTGATTAGTGGCGAACG-AGGTAGTGGCTCACCAAGGCGATGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT91-1 GACGAACGCTGGCGGCGCGCTTAACACATGCAAGTCGAGCGATGAAACTTTAACAGAACTCTTCGGAGTGAAGATAAAGT-GAGTAAAAGCCTACCAAGGCGACGATGGGTAGCCGGCCTGAGAGGGTGAACGGCCACATTGGAACTGAGAAACGGTCCAA
PT92-1 GATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGAGGGGCAGCATGGAAGAAGCTTGCTTCTTCTGATGGCGACCGG-GGGTAACGGCCCACCTAGCCATCGATCGGTAGGGGTTCTGAGAGGAAGGTCCCCCACACTGGAACTGAGACACGGTCCAG
PT93-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGGATTTAGTAAACGGAAGCTTCGGTGGAAGATTACTAAT-AGATAACAGCCCACCAAGGCGACGATCAGTAGCCGGTCTGAGAGGATGAACGGCCACATTGGAACTGAGACACGGTCCAA
PT94-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCACCAGCTTGCTGGTGTGGATTAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT95-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAGTCTGCCTTGAAGATCGGAGTGCTTGCACTCTGTGAAA-GGGTAACGGCCTACCAAGGCGATGATGCATAGCCGAGTTGAGAGACTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT96-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTTGAGCGATGAAGATTGGTGCTTGCACCAATTTGAAGAGCAGCGAA-AGGTAAAGGCTCACCAAGGCGATGATACATAGCCGACCTGAGAGGGTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT97-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGAACCGCGACTGAGTGCTTGCACTTGGTCAAGGTGAGTGG-GGGTAACGGCCTACCAAGGCGATGATGCATAGCCGACCTGAGAGGGTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT98-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGAAGAGGTCTAGCTTGCTAGACTGATTAGTGGCGAACGGG-GGGTAAAGGCCTACCAAGGCGACGATCTGTAGCTGGTCTGAGAGGATGATCAGCCACACTGGGACTGAGACACGGCCCAG
PT100-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGATGACTCTCTAGCTTGCTAGAGATGATTAGTGGCGGACG-GGGTAAAGGCCCACCAAGGCGACGATCTGTAACTGGTCTGAGAGGATGATCAGTCACACCGGAACTGAGACACGGTCCGG
PT101-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAGCGGAAAGGCCACTTCGGTGGTACTCGAGCGGCGAACGGGTG-GGGTAATGGCCTACCATGGCTTCGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT102-1 ATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAACGGTAGCACAGAGAGCTTGCTCTCGGGTGACGAGTGGCGGA-GGGTAACGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAG
PT107-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTCTTCGGAGGTGCTCGAGTGGCGAACGGGTG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT109-1 ATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAGCGGTAACAGGAGAAAGCTTGCTTTCTTGCTGACGAGCGGCG-GGGTAAAGGCTCACCTAGGCGACGATCTCTAGCTGGTCTGAGAGGATGATCAGCCACACTGGGACTGAGACACGGCCCAG
PT114-1 ATTGAACGCTGGCGGCATGCCTTACACATGCAAGTCGAACGGCAGCACGGACTTCGGTCTGGTGGCGAGTGGCGAACGGG-GGATAAAAGCTTACCAAGCCAACGATCTGTAGCTGGTCTGAGAGGACGATCAGCCACACTGGGACTGAGACACGGCCCAG
PT118-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAGCGGAACTAACAGATTTACTTCGGTAATGACGTTAGGA-AGGTAAAGGCTTACCAAGGCGATGATGCATAGCCGAGTTGAGAGACTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT120-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTGAAGCTCCAGCTTGCTGGAGTGGATCAGTGGCGAACG-AGGTAATGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT121-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACTGATTAGAAGCTTGCTTCTATGACGTTAGCGGCGGA-AGGTAACGGCTCACCAAGGCAACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT123-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGAAGTTAGGAAGCTTGCTTCTGATACTTAGTGGCGGACGG-GGGTAAAGGCCTACCAAGGCGACGATCTGTAGCTGGTCTGAGAGGATGATCAGCCACACTGGGACTGAGACACGGCCCAG
PT125-1 GACGAACGCTGGCGGCGCGCTTAACACATGCAAGTCGAGCGATGAAACCTTGACAGAACCCTTCGGGGCGAAGACAAAGT-GAGTAAAAGCCTACCAAGGCGACGATGGGTAGCCGGCCTGAGAGGGTGAACGGCCACATTGGAACTGAGAAACGGTCCAA
PT127-1 GATGAACGCTGACAGAATGCTTAACACATGCAAGTCTACTTGAATTTGGGTTTTTAACTTAGGTTTGGGTGGCGGACGGG-AGGTAACGGCTCACCAAGGCGATGATGGGTAGCCGGCCTGAGAGGGTGATCGGCCACAAGGGGACTGAGACACGGCCCTT
PT130-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTCTTCGGAGGTACTCGAGTGGCGAACGGGGG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT132-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAGCGGAAAGGCCCTTCGGGGTACTCGAGCGGCGAACGGGTGAG-AGGTAATGGCTCACCAAGGCTTCGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT136-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTGAAGCCCAGCTTGCTGGGTGGATCAGTGGCGAACGGG-AGGTAATGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT137-1 GATGAACGCTAGCGATAGGCTTAACACATGCAAGTCGAGGGGCAGCACAAGGTAGCTTGCTATCTGGGTGGCGACCGGCG-AGGTAACGGCTCACCAAGGCAACGATGGGTAGGGGAACTGAGAGGTTTATCCCCCACACTGGTACTGAGACACGGACCAG
PT140-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTAAGGCCCTTCGGGGTACACGAGTGGCGAACGGGTGAG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGATCGGCCACATTGGGACTGAGACACGGCCCAG
PT141-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTGAAGGGACTGGTTTTTGCTGGTTCTGGATGAGTGGCG-GGGTGATGGCCTACCAAGGCGGTGACGGGTAGCCGGCCTGAGAGGGTGGACGGTCACACTGGGACTGAGACACGGCCCAG
PT143-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCGAAGCTTGCTTCGGTACTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGACACGGCCCAA
PT143-2 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCGAAGCTTGCTTCGGTACTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT144-1 ATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAACGGTAACAGGAAGCAGCTTGCTGCTTTGCTGACGAGTGGCG-GGGTAACGGCTCACCAAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAG
PT145-1 ATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAACGGTAACAGGAAACAGCTTGCTGTTTCGCTGACGAGTGGCG-GGGTAACGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAG
PT149-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGACTTTAGCTTGCTAAAGTTGGAAGAGTTGCGAA-AGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT150-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACAGATGAGAAGCTTGCTTCTCTGATGTTAGCGGCGGA-AGGTAACGGCTTACCAAGGCGACGATACGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAG
PT152-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCTGGGTGCTTGCACCTGGTGGATGAGTGGCGAA-AGGTAATGGCTTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT153-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTTGAACGAGAAATTTATGATTGATTCTTCGGATGAAAGATTAAATG-GGGTAATGGCCTACCATGGCGACGATCAGTAGCCGGATTGAGAGGTTGAACGGCCACACTGGAACTGAGACACGGTCCAG
PT154-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAGCGGGGGAAGGTAGCTTGCTACCGGACCTAGCGGCGGACGGG-GGGTAAAGGCCTACCAAGGCGACGATCTGTAGCGGGTCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT155-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGGTAACAGGGATTAGCTTGCTAATCTGCTGACGAGTGGCG-AGGTAAAGGCTGACCAAGCCGACGATCTCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAG
PT156-1 ATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAGCGGTAGCACAGAGAGCTTGCTCTCGGGTGACGAGCGGCGGA-GGGTAACGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAG
PT157-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAACTTAAAAGATTTCTTCGGAATGACCTTAAGTGG-AGATAACAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT160-1 GATGAACGCTAGCTATAGGCTTAACACATGCAAGTCGAGGGGAAACGGCATTAAGTGCTTGCACTTTTTGGACGTCGACC-GGGTAACGGCCCACCAAGGCATCGATCGGTAGGGGTTCTGAGAGGAAGGTCCCCCACATAGGAACTGAGACACGGTCCTA
PT161-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTGCTTTCCCTTGTGGATTGCGGGTGCTCGAG-GGGTAATGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGATACGGCCCAG
PT162-1 GATAAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAACTTAGTTACTTTCTTCGGAAAGAAATTAAGTGA-AGATAAAAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAA
PT163-1 GACGAACGCTGGCTGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCCCGAAGCTTGCTTCGGGTGGATGAGTGGCGAA-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT164-1 GATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGCGGGGCAGCATGGGGGTTGCTTGCAACCCCTGATGGCGACCGG-GGGTAACGGCCCACCGTGGCGACGATCGGTAGGGGTTCTGAGAGGAAGGTCCCCCACACTGGAACTGAGACACGGTCCAG
PT166-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTGAAGCTCAAGCTTGCTTGGGTGGATCAGTGGCGAACG-AGGTAATGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT168-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGCGCAGCGAAAGGTGCTTGCACCTTTCAAGCGAGTGGCGA-GGGTAAAGGCCTACCAAGACAATGATGCATAGCCGAGTTGAGAGACTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT168-2 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGCGCAGCGAAAGGTGCTTGCACCTTTCAAGCGAGTGGCGA-GGGTAAAGGCTTACCAAGACGATGATGCATAGCCGAGTTGAGAGACTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT170-1 GATGAACGCCGGCGGTGTGCCTAATACATGCAAGTCGAACGCGTTGGCCCAATTGATTGACGGTGCTTGCACCTGATTGA-GGGTAACGGCCTACCAAGGCGATGATGCATAGCCGAGTTGAGAGACTGATCGGCCACAATGGGACTGAGACACGGCCCAT
PT171-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGGGTCGCAAGACCCAGTGGCGGACGGGTGAGTAACGCGT-GGGTAACGGCCTACCAAGGCGACGATCGGTAGCTGGTCTGAGAGGACGATCAGCCACACTGGAACTGAGACACGGTCCAG
PT172-1 ATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAGCGGATGACAGGAGCTTGCTCCTGAATTCAGCGGCGGACGGG-AGGTAATGGCTCACCAAGGCGACGATCCGTAACTGGTCTGAGAGGATGATCAGTCACACTGGAACTGAGACACGGTCCAG
PT173-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAGCGGGGTGATGGTGCTTGCACTATCACTTAGCGGCGGACGGG-GGGTAAAGGCCTACCAAGGCGACGATCTGTAGCGGGTCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT174-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCCGCAGCTTGCTGTGGTGGATTAGTGGCGAACG-GGGTAGTGGCCTACCAAGGCTTTGACGGTTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT175-1 ATTGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGAGGAGCTTGCTTCTCTGGATGAGTTGCGAACG-GGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT176-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAGCTTGCCTAGATGATTTTAGTGCTTGCACTAAATGAAA-AGGTAACGGCTTACCAAGGCAATGATGCATAGCCGAGTTGAGAGACTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT177-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGTTGAAGGAGAAGCTTGCTTCTCTGGATGAGTTGCGAACG-GGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT178-1 GACGAACGCTGGCTGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCCGACAGCTTGCTGTTGGTGGATGAGTGGCGAA-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT179-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGTACGGAAAGGCCTCTTTTTGGGGTACTCGAGTGGCGAACGGGT-GGGTAGTGGCCTACCAAGGCGGTGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACATTGGGACTGAGATACGGCCCAG
PT180-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGGTAGCAGGGATTAGCTTGCTAATTTGCTGACGAGTGGCG-GGGTAAAGGCCTACCAAGCCTGCGATCTCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAG
PT181-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTGAAGCTCCAAGCTTGCTTGGGGTGGATCAGTGGCGAA-AGGTAATGGCTCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT182-1 ATTGAACGCTGGCGGCATGCCTAATACATGCAAGTCGAACGATGACGTCACTGCTTGCAGTGACTGATTAGTGGCGAACG-AGGTAATGGCTTACCAAGGCAGTGATACGTAGCCGACTTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT183-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGCTTTGTGGTTCAACTGATTTGAAGAGCTTGCTCAGATAT-AGGTAATGGCTTACCAAGGCGATGATGCATAGCCGAGTTGAGAGACTGATCGGCCACAATGGGACTGAGACACGGCCCAT
PT184-1 GACGAACGCTGGCTGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCCAGAAGCTTGCTTCTGGTGGATGAGTGGCGAA-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT186-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGATGGCCGTGCTTGCACGGTCGGATTAGTGGCGAACG-AGGTAGTGGCTTACCAAGGCGATGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT188-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCGGTACTTGTATCGTGGATTAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT189-1 GGTGAACGCTAGCGGTGCGCCTAACACATGCAAGTCGAGCGGTATATTTAATTAAGCTTGCTTAAATAAATAGAGAGCGG-AGGTAAAAGCTTACCAAGGCAATGACGGATAACTGGTCTGAGAGGATGATCAGTCACAATGGAACTGAGATACGGTCCAT
PT190-1 GACGAACGCTGGCTGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCCGGGTGCTTGCACCTGGTGGATGAGTGGCGAA-GGGTAAGGGCCCACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT191-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAATGACGAGAAGCTTGCTTCTCTGATTTAGCGGCGGACG-GGGTAACGGCCTACCAAGGCAACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT192-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTGGAACGCATTGATATCACCGGAGCTTGCTCCATCGATATTAATGA-GGGTAGCGGCCTACCAAGGCACCGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT193-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCCCTTTCGGGGGTGGATGAGTGGCGAACGGGTG-GGGTAGTGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGTCACACTGGGACTGAGACACGGCCCAG
PT194-1 GATGAACGCTGACAGAATGCTTAACACATGCAAATCTATGAAAAGTTAACTAGCTTGCTAGGTAATGGATCATGGTGGAC-GGGTAAAGGCCTACCAAGGCGATGATAGGTAGCCGGCCCGAGAGGGTGAACGGCCACAAGGGGACTGAGATACGGCCCTT
PT197-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGCTTCTTTTCCGCCGAACTTCGGTTCATTGGAAAAGAGGA-AGGTAACGGCTCACCAAGGCAACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT199-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTGAAGCTCCCAGCTTGCTGGGGGTGGATCAGTGGCGAA-AGGTAATGGCTTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT200-1 GATGAACGCTAGCGGGAGGCTTAACACATGCAAGCCGAGGGGTATTTAATTTTCGGATTAGAGAGACCGGCGAACGGGTG-GTGTAGAGGACTACCAAGGCGATGATCTGTAGGGGGCCTGAGAGGGTGAACCCCCACACTGGTACTGAGACACGGACCAG
PT201-1 GATGAACGCTAGCGGGAGGCCTAACACATGCAAGTCGAGCGGGATATGGTAGCTTGCTACCATTGAGAGCGGCGTACGGG-AGGTAACGGCTCACCAAGGCGATGATCTTTAGGGGGCCTGAGAGGGTGATCCCCCACACTGGGACTGAGACACGGCCCAG
PT204-1 GATGAACGCTAGCGGGAGGCTTAACACATGCAAGCCGAGGGGTATAGTTAGCTTGCTAACTTGAGACCGGCGCACGGGTG-AGGTAACGGCTTACCAAGACGATGATCTTTAGGGGGCCTGAGAGGGTGAACCCCCACACTGGTACTGAGACACGGACCAG
PT206-1 GATGAACGCTGACAGAATGCTTAACACATGCAAATCTATGTAAATTAATTATGCTTGCATAAATAAGGAGCATGGTGGAC-GGGTAAAGGCCTACCAAGGCGATGATAGGTAGCCGGCCCGAGAGGGTGAACGGCCACAAGGGGACTGAGATACGGCCCTT
PT209-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAGCTTTATAGACTTCTTCGGAATGAAATAAAGAGG-AGATAATAGCCCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
PT212-1 AGTGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGATAAGATTTCTAGCTTGCTAGAGATTGAATAGTGGCGCA-AGGTAATGGCTTACCAAGGCCATGACGGGTATCCGGCCTGAGAGGGTGAACGGACACACTGGAACTGAGACACGGTCCAG
PT214-1 GATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGAGGGGCAGCATGTTGGTTGCTTGCAACCGATGATGGCGACCGG-AGGTAACGGCTCACCAAGGCATCGATCAGTAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAA
PT222-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGAAGGAGCTTGCTCTTTCCGGATGAGTTGCGAA-AGGTAACGGCTCACCAAGGCAACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT226-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGACGTCACTGCTTGCAGTGACTGATTAGTGGCGAACG-AGGTAATGGCTTACCAAGGCAGTGATACGTAGCCGACTTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT228-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCTGGTGCTTGCACTGGTGGATTAGTGGCGAACG-GGGTAGTGGCCTACCATGGCTTTGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT231-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCCCAGCTTGCTGGGTGGATGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT233-1 GACGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGATGAAGTCTAGCTTGCTAGACGGATTAGTGGCGAACGGG-GGGTAAAGGCCTACCAAGGCGACGATCTGTAGCTGGTCTGAGAGGATGATCAGCCACACTGGGACTGAGACACGGCCCAG
PT235-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAGCGGTAAGGCCCTTCGGGGTACACGAGCGGCGAACGGGTGAG-GGGTGATGGCCTACCAAGGCGACGACGGATAGCCGGCCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT238-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAGCGGTAAGGCCCTTTCGGGGGTACACGAGCGGCGAACGGGTG-GGGTGATGGCCTACCAAGGCGGTGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT239-1 GATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCCTGGTGCTTGCACCGGGTGGATTAGTGGCGAA-AGGTAATGGCTCACCGTGGCGATGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT241-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGGACCAGCTTGCTGGTTTGGATTAGTGGCGAACG-GGGTAGTGGCCTACCATGGCTTTGACGGGTAGCCGGCCTGAGAGGGTGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT246-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTAAGGCCCCTTCGGGGGTACACGAGTGGCGAACGGGTG-AGGTAACGGCTCACCAAGGCTTCGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGATACGGCCCAG
PT250-1 GATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGAGGGGCAGCATGACCTAGCAATAGGTTGATGGCGACCGGCGCA-GGGTAACGGCCCACCAAGACATCGATGGATAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAA
PT251-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAGCGGGGAAAGGTAGCTTGCTACCTGACCTAGCGGCGGACGGG-GGGTAAAGGCCTACCAAGGCGACGATCTGTAGCGGGTCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT253-1 GATGAACGCTGGCGGCATGCTTAACACATGCAAGTCGAACGTGAAGTGGTGTTTCCAGTGGCGAACGGGTGCGTAATGCG-AGGCAATAGCTCACCAAGGCTACGATCAGTAGTTGGTCTGAGAGGATGATCAGCCACACTGGGACTGAGACAAGGCCCAG
PT254-1 GATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTCTGCTTGCAGGGGTACTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGTCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT261-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAGCGGGGGATGTTGCTTCGGTAACTGACCTAGCGGCGGACGGG-GGGTAAAGGCCTACCAAGGCGACGATCTGTAGCGGGTCTGAGAGGATGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT278-1 GATGAACGCTAGCGGGAGGCCTAACACATGCAAGCCGAGCGGTATTGTTTCTTCGGAAATGAGAGAGCGGCGTACGGGTG-AGGTAACGGCTCACCAAGGCGATGATCTTTAGGGGGCCTGAGAGGGTGATCCCCCACACTGGTACTGAGACACGGACCAG
PT280-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGCGAGAGGAGGGAGCTTGCTCCCTCTGATCGAGTGGCGGA-GGGTAACGGCCCACCAAGGCAACGATGCGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT281-1 GATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGAGGGGCAGCATGAACTTAGCTTGCTAAGTTTGATGGCGACCGG-GGGTAACGGCCCACCAAGCCTTCGATGGATAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAA
PT282-1 AACGAACGCTGGCGGCGTGCCTAACACATGCAAGTCGAGCGCGAATGGAGCAATCCTAGTAGAGCGGCGCACGGGTGCGT-GGGTAATGGCCCACCAAGGCGACGACGGGTAGCTGGTCTGAGAGGACGATCAGCCACACTGGAACTGAGACACGGTCCAG
PT317-1 GATGAACGCTGGCGGCATGCTTAACACATGCAAGTCGGACGGGAAGTGGTGTTTCCAGTGGCGGACGGGTGAGTAACGCG-AGGCAATAGCTTACCAAGGCGATGATCAGTAGCTGGTCCGAGAGGATGATCAGCCACACTGGGACTGAGACACGGCCCAG
PT360-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCAGCTTGCTGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT372-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGCTGAAGCTCAGCTTTTGTTGGGTGGATGAGTGGCGAACG-GGGTGATGGCCTACCAAGGCTTTGACGGGTAGCCGGCCTGAGAGGGTGACCGGTCACATTGGGACTGAGATACGGCCCAG
PT403-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTGAAGCTTGCTTTGGGTACTCGAGTGGCGAA-GGGTAATGGCCTACCAAGGCTTCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT438-1 GATGAACGCTGGCGGCGTGCCTAACACATGCAAGTCGAGCGATTTACTTCGGTAAAGAGCGGCGGACGGGTGAGTAACGC-AGGTAACGGCTTACCAAGGCGACGATCAGTAGCCGACCTGAGAGGGTGATCGGCCACATTGGAACTGAGACACGGTCCAA
PT487-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAAGTTTTTCTGGTGCTTGCACCGGAAAAACTTAGCGGCG-GGGTAAAGGCCTACCAAGGCGACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT488-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAAGTTTTTCTGGTGCTTGCACTAGAAAAACTTAGCGGCG-GGGTAAAGGCCTACCAAGGCGACGATGCATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT489-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTGGGACGCAAGAGAACACACTGTGCTTGCACACCGTGTTTTCTTGA-AGGTAAGAGCTTACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT490-1 GATGAACGCTAGCTACAGGCTTAACACATGCAAGTCGAGGGGCAGCATGGTCTTAGCTTGCTAAGGCCGATGGCGACCGG-GGGTAACGGCCCACCTAGTCTTCGATGGATAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGAACTGAGACACGGTCCAA
PT491-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACCACTTCGGTGGGAAGCGGCGAACGGGTGAGTAACAC-GGGTAAAGGCCTACCAAGGCGACGATGCGTAGCCGACCTGAGAGGGTGAACGGCCACACTGGGACTGAGACACGGCCCAG
PT493-1 ATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAGCGGCAGCGGAAAGTAGCTTGCTACTTTGCCGGCGAGCGGCG-GGGTAACGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAG
PT494-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGAACGGATAAAGAGCTTGCTCTTTTGAAGTTAGTGGCGGA-AGGTAATGGCTTACCAAGGCGACGATCCATAGCCGACCTGAGAGGGTGATCGGCCACATTGGGACTGAGACACGGCCCAA
PT496-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGCACTCTCGTTTAGATTGAAGGAGCTTGCTCCTGATTGAT-AGGTAAAGGCTCACCAAGACCGTGATGCATAGCCGACCTGAGAGGGTAATCGGCCACACTGGGACTGAGACACGGCCCAG
PT497-1 GATGAACGCTGGCGGCATGCTTAACACATGCAAGTCGAGCGAAGCACCTTTTTGGAATCTTCGGAGGAAGAGGAGGTGAC-GGGTAACGGCCTACCAAGGCGACGATCAGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGACACGGCCCAG
PT502-1 AGTGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAACGGAGCTTATAGTAAGCTTGCTTATTATAAGCTTAGTGGCG-GGGTAATGGCTTACCAAGGCTATGACGCATAACTGGTCTGAGAGGATGATCAGTCACACTGGAACTGAGACACGGTCCAG
PT503-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGGTAACAGGGATTAGCTTGCTAATTTGCTGACGAGTGGCG-GGGTAAAGGCCTACCAAGCCGACGATCTCTAGCTGGTCTGAGAGGATGATCAGCCACACTGGAACTGAGACACGGTCCAG
PT505-1 ATTGAACGCTGGCGGCATGCTTTACACATGCAAGTCGAACGGCAGCGAAGAGAAGCTTGCTTCTCTGTCGGCGAGTGGCG-AGGTAATGGCTCACCAAGGCGATGATCCGTAGCGGGTCTGAGAGGACGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT506-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACAGACAAGGAGCTTGCTCCTTTGACGTTAGCGGCGGA-AGGTAACGGCTTACCAAGGCGACGATACGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAG
PT507-1 ATTGAACGCTGGCGGCATGCTTTACACATGCAAGTCGAACGGCAGCGAGGAGGAGCTTGCTTCTCTGTCGGCGAGTGGCG-AGGTAATGGCTCACCAAGGCGATGATCCGTAGCGGGTCTGAGAGGACGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT508-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACAGACGAGGAGCTTGCTCCTTTGACGTTAGCGGCGGA-AGGTAACGGCTTACCAAGGCAACGATACGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAG
PT509-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACAGACGAGGAGCTTGCTCCTCTAACGTTAGCGGCGGA-AGGTAACAGCTTACCAAGGCAACGATGCGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAG
PT510-1 ATTGAACGCTGGCGGCAGGCTTAACACATGCAAGTCGAACGAAGAAGTCTAGCTTGCTAGACTGATTAGTGGCGAACGGG-GGGTAAAAGCCTACCAAGGCGACGATCTGTAGCTGGTCTGAGAGGATGATCAGCCACACTGGGACTGAGACACGGCCCAG
PT511-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTGCTCGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGTCGACGGGTAGCCGGCCTGAGAGGGTGGACGGCCACATTGGGACTGAGATACGGCCCAG
PT511-2 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTGCTCGAGTGGCGAACGGG-GGGTAAGGGCCTACCAAGGCGGCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT512-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTCTTCGGAGGTACTCGAGTGGCGAACGGGTG-GGGTAAGGGCCTACCAAGGCGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT512-2 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTCTTCGGAGGTACTCGAGTGGCGAACGGGTG-GGGTAATGGCCGACCAAGGCGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT512-3 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTCTTCGGAGGTACTCGAGTGGCGAACGGGTG-GGGGAATGGCCTACCAAGGCGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT512-4 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTCTTCGGAGGTACTCGAGTGGCGAACGGGTG-GGGTAATGGCCTGCCAAGGCGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT512-5 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTCTTCGGAGGTACTCGAGTGGCGAACGGGTG-GGGTAATGGCCTACCAAGGGGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT513-1 GATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCCTGCTTGCAGGGTACTCGAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGTCGACGGGTAGCCGGCCTGAGAGGGTGTACGGTCACATTGGGACTGAGATACGGCCCAG
PT514-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCTCCAGCTTGCTGGAGTGCTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGATCGGCCACATTGGGACTGAGATACGGCCCAG
PT515-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGAAGCCCAGCTTGCTGGGTGGATTAGTGGCGAACGGG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT516-1 ATTGAACGCTGGCGGCAGGCCTAACACATGCAAGTCGAGCGGTAACACAGAGAGCTTGCTCTCGGGTGACGAGCGGCGGA-GGGTAATGGCTCACCTAGGCGACGATCCCTAGCTGGTCTGAGAGGATGACCAGCCACACTGGAACTGAGACACGGTCCAG
PT517-1 ATTGAACGCTGGCGGCATGCTTTACACATGCAAGTCGAACGGCAACGAGGAGAAGCTTGCTTCTCTGTCGGCGAGTGGCG-GGGTAAGGGCTCACCAAGGCAATGATCCGTAGCGGGTCTGAGAGGACGATCCGCCACACTGGGACTGAGACACGGCCCAG
PT518-1 GATGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCGAAGCTTGCTTCGGTACTCGAGTGGCGAACG-GGGTAATGGCCTACCAAGGCGTCGACGGGTAGCCGGCCTGAGAGGGTGTACGGCCACATTGGGACTGAGATACGGCCCAG
PT519-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGACGGTGGTGCTTGCACCACCTGATTAGTGGCGAACG-AGGTAATGGCTCACCAAGACGATGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT520-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGAAAGGCCTTTTCGGAGGTACTCGAGTGGCGAACGGGGG-GGGTAAGGGCCTACCAAGGCGGCGACGGGTAGCCGGACTGAGAGGTTGGTCGGCCACATTGGGACTGAGATACGGCCCAG
PT521-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTCGAGCGAACAGATAAGGAGCTTGCTCCTTTGATGTTAGCGGCGGA-AGGTAATGGCTTACCAAGGCAACGATACGTAGCCGACCTGAGAGGGTGATCGGCCACACTGGAACTGAGACACGGTCCAG
PT522-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGAGGAGCTTGCTTCTCCGGATGAGTTGCGAACG-AGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT523-1 GATGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGGAAGGAGCTTGCTCTTTCCGGATGAGTTGCGAA-AGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT524-1 GACGAACGCTGGCGGCGTGCCTAATACATGCAAGTAGAACGCTGAAGCTTAGTGCTTGCACCGAGCGGATGAGTTGCGAA-GGGTAACGGCTCACCAAGGCGACGATACATAGCCGACCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT525-1 GATGAACGCTGACAGAATGCTTAACACATGCAAGTCTACTTGAACTTCGGTTTGGGTGGCGGACGGGTGAGTAACGCGTA-AGGTAACGGCTCACCAAGGCGATGATGGGTAGCCGGCCTGAGAGGGTGAACGGCCACAAGGGGACTGAGACACGGCCCTT
PT526-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGACGGTGGTGCTTGCACTGCCTGATTAGTGGCGAACG-AGGTAATGGCTCACCAAGGCGATGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT527-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGATGACGGCCGTGCTTGCACGGTCTGATTAGTGGCGAACG-AGGTGATGGCTCACCAAGACGATGACGGGTAGCCGGCCTGAGAGGGCGACCGGCCACACTGGGACTGAGACACGGCCCAG
PT528-1 GACGAACGCTGGCGGCGTGCTTAACACATGCAAGTCGAACGGTAAGGCCCTTTCGGGGGTACACGAGTGGCGAACGGGTG-GGGTAATGGCCTACCAAGGCGACGACGGGTAGCCGGCCTGAGAGGGTGATCGGCCACACTGGGACTGAGACACGGCCCAG
PT529-1 GATGAACGCTAGCGACAGGCTTAACACATGCAAGTCGAGGGGCAGCATGATTTGTAGCAATACAGATTGATGGCGACCGG-GGGTAACGGCCCACCAAACCGACGATGGATAGGGGTTCTGAGAGGAAGGTCCCCCACATTGGTACTGAGACACGGACCAA
PT531-1 GATTAACGCTGGCGGCGTGCATAACACATGCAAGTCGAACGATGAAGCTTAATTGATCCCTTCGGGGTGATTTAAAGTGA-AGATAAAAGCTCACCAAGGCAACGATCAGTAGCCGGCTTGAGAGAGTGTACGGCCACATTGGGACTGAGACACGGCCCAG
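Many of the phylotype sequences listed above differ at only a few positions. As a minimal sketch of how two entries can be compared, the snippet below counts positional mismatches between the first 20 nucleotides of PT143-1 and PT144-1 as listed in the table. It is an ungapped, position-by-position comparison for illustration only; it is not the classification procedure used in the study, and the function names `mismatch_positions` and `percent_identity` are hypothetical.

```python
from typing import List


def mismatch_positions(seq_a: str, seq_b: str) -> List[int]:
    """Return the 1-based positions at which two equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same length")
    return [i for i, (a, b) in enumerate(zip(seq_a, seq_b), start=1) if a != b]


def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions between two equal-length sequences."""
    n_diff = len(mismatch_positions(seq_a, seq_b))
    return 100.0 * (len(seq_a) - n_diff) / len(seq_a)


# First 20 nucleotides of two phylotypes, copied from the table above.
pt143_1_start = "GACGAACGCTGGCGGCGTGC"
pt144_1_start = "ATTGAACGCTGGCGGCAGGC"

print(mismatch_positions(pt143_1_start, pt144_1_start))  # [1, 2, 3, 17, 18]
print(percent_identity(pt143_1_start, pt144_1_start))    # 75.0
```

The same functions accept full-length entries from the table, provided both strings have the same length; sequences of unequal length would need a proper alignment first.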
Table S2. Phylogenetic assignment. Description of all 251 phylotypes determined using Illumina-based amplicon deep-sequencing and the RDP database.
Phylogenetic affiliation based on 16S rRNA gene sequence
Phylotype Length Closest representative Identity Designation Phylum Class Order Family Genus
PT1-1 160 Corynebacterium accolens AJ439346 100 Corynebacterium accolens Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-2 160 Unc. bacterium GQ003864 100 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-3 160 Corynebacterium tuberculostearicum AJ438050 99 Corynebacterium tuberculostearicum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-5 160 Corynebacterium accolens AJ439346 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-6 160 Corynebacterium accolens AJ439346 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-7 160 Corynebacterium accolens AJ439346 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-9 160 Corynebacterium tuberculostearicum AJ438050 100 Corynebacterium tuberculostearicum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-10 160 Corynebacterium accolens AJ439346 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-11 160 Corynebacterium accolens AJ439346 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-12 160 Corynebacterium auriscanis AJ243820 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-13 160 Corynebacterium tuberculostearicum AJ438050 99 Corynebacterium tuberculostearicum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-14 160 Corynebacterium accolens GQ338419 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-15 160 Unc. bacterium GQ000343 100 Corynebacterium tuberculostearicum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-8 160 Corynebacterium variabile AJ222815 100 Corynebacterium variabile Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-16 160 Corynebacterium tuberculostearicum AJ438050 99 Corynebacterium tuberculostearicum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-18 160 Corynebacterium tuberculostearicum AJ438050 99 Corynebacterium tuberculostearicum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-19 160 Corynebacterium accolens AJ439346 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-20 160 Corynebacterium accolens AJ439346 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-21 160 Unc. bacterium JF184547 100 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-22 160 Unc. bacterium FM875046 100 Corynebacterium tuberculostearicum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-23 160 Corynebacterium accolens GQ338419 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-24 160 Corynebacterium accolens GQ338419 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT1-25 160 Corynebacterium accolens GQ338419 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT2-1 160 Corynebacterium glucuronolyticum X86688 100 Corynebacterium glucuronolyticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT3-1 160 Propionibacterium acnes AB042288 100 Propionibacterium acnes Actinobacteria Actinobacteria Actinomycetales Propionibacteriaceae Propionibacterium
PT3-3 160 Propionibacterium acnes AB042288 99 Propionibacterium acnes Actinobacteria Actinobacteria Actinomycetales Propionibacteriaceae Propionibacterium
PT3-4 160 Propionibacterium acnes AB042288 99 Propionibacterium acnes Actinobacteria Actinobacteria Actinomycetales Propionibacteriaceae Propionibacterium
PT3-5 160 Propionibacterium acnes AB042288 99 Propionibacterium acnes Actinobacteria Actinobacteria Actinomycetales Propionibacteriaceae Propionibacterium
PT3-6 160 Propionibacterium acnes AB042288 99 Propionibacterium acnes Actinobacteria Actinobacteria Actinomycetales Propionibacteriaceae Propionibacterium
PT3-7 160 Propionibacterium acnes AB042288 99 Propionibacterium acnes Actinobacteria Actinobacteria Actinomycetales Propionibacteriaceae Propionibacterium
PT4-1 160 Veillonella parvula CP001820 100 Veillonella sp. Firmicutes Clostridia Clostridiales Veillonellaceae Veillonella
PT5-1 160 Unc. Finegoldia sp. GQ179678 100 Finegoldia magna Firmicutes Clostridia Clostridiales IncertaeSedisXI Finegoldia
PT6-1 160 Unc. bacterium FM875546 100 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT8-1 160 Negativicoccus succinicivorans HQ264056 100 Negativicoccus succinicivorans Firmicutes Clostridia Clostridiales Veillonellaceae unclassifiedVeillonellaceae
PT9-1 160 Unc. bacterium GQ055471 100 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT10-1 160 Streptococcus parasanguinis HM596296 99 Streptococcus parasanguinis Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT12-1 160 Leptotrichia wadei AY029802 100 Leptotrichia wadei Fusobacteria Fusobacteria Fusobacteriales Leptotrichiaceae Leptotrichia
PT13-1 160 Prevotella timonensis DQ518919 100 Prevotella timonensis Bacteroidetes Bacteroidia Bacteroidales Prevotellaceae Prevotella
PT15-1 160 Propionibacterium sp. Y17821 100 Propionibacterium sp. Actinobacteria Actinobacteria Actinomycetales Propionibacteriaceae Propionibacterium
PT16-1 160 Finegoldia magna AB640691 100 Finegoldia magna Firmicutes Clostridia Clostridiales IncertaeSedisXI Finegoldia
PT17-1 160 Prevotella buccalis AB547676 100 Prevotella buccalis Bacteroidetes Bacteroidia Bacteroidales Prevotellaceae Prevotella
PT19-1 160 Staphylococcus warneri L37603 100 Staphylococcus warneri / Staphylococcus lugdunensis / Staphylococcus auricularis Firmicutes Bacilli Bacillales Staphylococcaceae Staphylococcus
PT20-1 160 Neisseria meningitidis alpha14 AM889136 100 Neisseria meningitidis / Neisseria zoodegmatis Proteobacteria Betaproteobacteria Neisseriales Neisseriaceae Neisseria
PT22-1 160 Streptococcus cristatus AY188347 100 Streptococcus cristatus / Streptococcus infantis Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT22-2 160 Streptococcus infantis AY485603 100 Streptococcus cristatus / Streptococcus infantis Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT24-1 160 Porphyromonas bennonis AB588020 99 Porphyromonas bennonis Bacteroidetes Bacteroidia Bacteroidales Porphyromonadaceae Porphyromonas
PT25-1 160 Granulicatella adiacens FR822389 100 Granulicatella sp. Firmicutes Bacilli Lactobacillales Carnobacteriaceae Granulicatella
PT26-1 160 Staphylococcus aureus D83355 100 Staphylococcus aureus Firmicutes Bacilli Bacillales Staphylococcaceae Staphylococcus
PT27-1 160 Kocuria palustris FR691399 100 Kocuria sp. Actinobacteria Actinobacteria Actinomycetales Micrococcaceae Kocuria
PT28-1 160 Unc. bacterium AY958797 100 Peptoniphilus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Peptoniphilus
PT31-1 160 Finegoldia magna AB109772 100 Finegoldia magna Firmicutes Clostridia Clostridiales IncertaeSedisXI Finegoldia
PT32-1 160 Corynebacterium amycolatum X82057 100 Corynebacterium amycolatum / Corynebacterium freneyi Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT35-1 160 Micrococcus luteus type strain AJ536198 100 Micrococcaceae bacterium Actinobacteria Actinobacteria Actinomycetales Micrococcaceae
PT36-1 160 Streptococcus sp. oral taxon HM596298 100 Streptococcus sp. Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT36-2 160 Streptococcus salivarius AY188354 100 Streptococcus salivarius / Streptococcus thermophilus Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT36-3 160 Streptococcus sanguinis AF003928 100 Streptococcus salivarius / Streptococcus sanguinis Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT37-1 160 Unc. bacterium GQ002688 100 Uncultured Actinomycetales bacterium Actinobacteria Actinobacteria Actinomycetales
PT38-1 160 Staphylococcus epidermidis AE015929 100 Staphylococcus epidermidis / Staphylococcus capitis / Staphylococcus caprae Firmicutes Bacilli Bacillales Staphylococcaceae Staphylococcus
PT38-2 160 Unc. bacterium GQ053099 100 Staphylococcus epidermidis / Staphylococcus capitis / Staphylococcus caprae Firmicutes Bacilli Bacillales Staphylococcaceae Staphylococcus
PT39-1 160 Acinetobacter junii Z93438 100 Acinetobacter junii Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Acinetobacter
PT40-1 160 Kocuria marina AY211385 100 Kocuria marina Actinobacteria Actinobacteria Actinomycetales Micrococcaceae Kocuria
PT40-2 160 Kocuria rhizophila AP009152 100 Kocuria rhizophila Actinobacteria Actinobacteria Actinomycetales Micrococcaceae Kocuria
PT41-1 160 Anaerococcus octavius Y07841 100 Anaerococcus octavius Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT43-1 160 Dermabacter hominis FJ200385 100 Dermabacter hominis Actinobacteria Actinobacteria Actinomycetales Dermabacteraceae Dermabacter
PT44-1 160 Peptoniphilus sp. gpac121 AM176529 100 Peptoniphilus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Peptoniphilus
PT45-1 160 Unc. bacterium GQ080908 100 Neisseriaceae bacterium PT77 Proteobacteria Betaproteobacteria Neisseriales Neisseriaceae
PT46-1 160 Corynebacterium riegelii Y14651 100 Corynebacterium riegelii / Corynebacterium tuscaniense Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT48-1 160 Unc. bacterium HM247068 100 Streptococcus sp. Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT49-1 160 Cupriavidus sp. AB438047 99 Cupriavidus sp./Ralstonia sp. Proteobacteria Betaproteobacteria Burkholderiales Burkholderiaceae Cupriavidus/Ralstonia
PT50-1 160 Rothia mucilaginosa AP011540 100 Rothia mucilaginosa Actinobacteria Actinobacteria Actinomycetales Micrococcaceae Rothia
PT50-2 160 Rothia dentocariosa M59055 100 Rothia dentocariosa Actinobacteria Actinobacteria Actinomycetales Micrococcaceae Rothia
PT51-1 160 Haemophilus influenzae AY613454 100 Haemophilus influenzae Proteobacteria Gammaproteobacteria Pasteurellales Pasteurellaceae Haemophilus
PT51-2 160 Haemophilus sp. DQ404358 100 Haemophilus influenzae Proteobacteria Gammaproteobacteria Pasteurellales Pasteurellaceae Haemophilus
PT52-1 160 Corynebacterium aurimucosum AY271720 100 Corynebacterium aurimucosum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT53-1 160 Unc. bacterium GQ087145 100 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT54-1 160 Unc. bacterium JF128314 100 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT55-1 160 Corynebacterium sp. X81904 100 Corynebacterium timonense / Corynebacterium genitalium Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT59-1 160 Streptococcus pneumoniae AF003930 100 Streptococcus pneumoniae / Streptococcus mitis Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT59-2 160 Streptococcus oralis FR720602 100 Streptococcus pneumoniae / Streptococcus mitis/oralis Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT60-1 160 Unc. bacterium GQ049759 100 Uncultured Actinomycetales bacterium Actinobacteria Actinobacteria Actinomycetales UnclassifiedActinomycetales
PT61-1 160 Propionibacterium granulosum AJ003057 100 Propionibacterium granulosum Actinobacteria Actinobacteria Actinomycetales Propionibacteriaceae Propionibacterium
PT62-1 160 Brevibacterium antiquum AY243344 100 Brevibacterium casei/antiquum Actinobacteria Actinobacteria Actinomycetales Brevibacteriaceae Brevibacterium
PT63-1 160 Anaerococcus sp. HM587319 100 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT64-1 160 Propionibacterium propionicum AJ003058 100 Propionibacterium propionicum Actinobacteria Actinobacteria Actinomycetales Nocardiaceae Gordonia
PT65-1 160 Unc. bacterium JF228735 100 uncultured Neisseriaceae bacterium PT75/77 Proteobacteria Betaproteobacteria Neisseriales Neisseriaceae
PT66-1 160 Dolosigranulum pigrum X70907 100 Dolosigranulum pigrum Firmicutes Bacilli Lactobacillales Carnobacteriaceae Dolosigranulum
PT68-1 160 Lactococcus lactis AB100804 100 Lactococcus lactis subsp. lactis Firmicutes Bacilli Lactobacillales Streptococcaceae Lactococcus
PT69-1 160 Campylobacter ureolyticus FN401323 100 Campylobacter ureolyticus Proteobacteria Epsilonproteobacteria Campylobacterales Campylobacteraceae Campylobacter
PT70-1 160 Actinomyces naeslundii AJ234050 100 Actinomyces naeslundii / Actinomyces viscosus Actinobacteria Actinobacteria Actinomycetales Actinomycetaceae Actinomyces
PT71-1 160 Alcaligenes faecalis AY662683 100 Alcaligenes faecalis subsp. faecalis Proteobacteria Betaproteobacteria Burkholderiales Alcaligenaceae Alcaligenes
PT72-1 160 Unc. bacterium GQ080513 100 Dolosigranulum sp. Firmicutes Bacilli Lactobacillales Carnobacteriaceae Dolosigranulum
PT72-2 160 Unc. bacterium JF179825 99 Dolosigranulum sp. Firmicutes Bacilli Lactobacillales Carnobacteriaceae Dolosigranulum
PT73-1 160 Streptococcus gordonii AF003931 100 Streptococcus gordonii / Streptococcus anginosus Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT74-1 160 Unc. bacterium GQ020166 94 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT76-1 160 Moraxella lacunata D64049 100 Moraxella lacunata / Moraxella nonliquefaciens Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Moraxella
PT77-1 160 Pseudomonas putida CP000712 100 Pseudomonas putida / Pseudomonas entomophila Proteobacteria Gammaproteobacteria Pseudomonadales Pseudomonadaceae Pseudomonas
PT79-1 160 Haemophilus parainfluenzae AY362908 100 Haemophilus parainfluenzae Proteobacteria Gammaproteobacteria Pasteurellales Pasteurellaceae Haemophilus
PT80-1 160 Prevotella melaninogenica AY323525 100 Prevotella melaninogenica Bacteroidetes Bacteroidia Bacteroidales Prevotellaceae Prevotella
PT81-1 160 Neisseria sp. oral taxon GQ131417 100 Kingella potus / Neisseria sp. Proteobacteria Betaproteobacteria Neisseriales Neisseriaceae Neisseria/Kingella
PT82-1 160 Corynebacterium propinquum X84438 100 Corynebacterium propinquum / Corynebacterium pseudodiphtheriticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT83-1 160 Fusobacterium nucleatum subsp. nucleatum FJ471640 100 Fusobacterium russii / Fusobacterium nucleatum Fusobacteria Fusobacteria Fusobacteriales Fusobacteriaceae Fusobacterium
PT84-1 160 Porphyromonas sp. oral taxon GU409235 100 Porphyromonas sp. Bacteroidetes Bacteroidia Bacteroidales Porphyromonadaceae Porphyromonas
PT86-1 160 Prevotella bivia AB547673 100 Prevotella bivia Bacteroidetes Bacteroidia Bacteroidales Prevotellaceae Prevotella
PT88-1 160 Streptococcus cristatus AJ853835 100 Streptococcus cristatus / Streptococcus infantis Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT89-1 160 Unc. bacterium GQ011956 100 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT90-1 160 Brachybacterium paraconglomeratum AJ415377 100 Brachybacterium paraconglomeratum Actinobacteria Actinobacteria Actinomycetales Dermabacteraceae Brachybacterium
PT90-2 160 Brachybacterium sp. HQ845004 100 Brachybacterium sp. Actinobacteria Actinobacteria Actinomycetales Dermabacteraceae Brachybacterium
PT91-1 160 Unc. bacterium GQ042101 100 Peptoniphilus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Peptoniphilus
PT92-1 160 Prevotella sp. oral taxon GU413280 100 Prevotella sp. Bacteroidetes Bacteroidia Bacteroidales Prevotellaceae Prevotella
PT93-1 160 Unc. bacterium AM697048 100 Finegoldia magna Firmicutes Clostridia Clostridiales IncertaeSedisXI Finegoldia
PT94-1 160 Dermacoccus sp. JF905611 100 Dermacoccus sp. Actinobacteria Actinobacteria Actinomycetales Dermacoccaceae Dermacoccus
PT95-1 160 Lactobacillus iners Y16329 100 Lactobacillus iners Firmicutes Bacilli Lactobacillales Lactobacillaceae Lactobacillus
PT96-1 160 Lactococcus lactis AB100802 100 Lactococcus lactis subsp. cremoris Firmicutes Bacilli Lactobacillales Streptococcaceae Lactococcus
PT97-1 160 Unc. bacterium GQ113763 100 Abiotrophia defectiva Firmicutes Bacilli Lactobacillales Aerococcaceae Abiotrophia
PT98-1 160 Moraxella lincolni FR822735 100 Moraxella lincolni Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Moraxella
PT100-1 160 Enhydrobacter aerosaccus AJ550856 100 Moraxella osloensis / Enhydrobacter aerosaccus Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Enhydrobacter
PT101-1 160 Nocardioides terrigena EF363712 100 Nocardioides sp. Actinobacteria Actinobacteria Actinomycetales Nocardioidaceae Nocardioides
PT102-1 160 Enterobacter cloacae Z96079 100 Klebsiella oxytoca / Enterobacter sp./Pantoea sp. Proteobacteria Gammaproteobacteria Enterobacteriales Enterobacteriaceae
PT107-1 160 Corynebacterium pseudodiphtheriticum AJ439343 99 Corynebacterium propinquum / Corynebacterium pseudodiphtheriticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT109-1 160 Proteus mirabilis DQ885256 100 Proteus mirabilis Proteobacteria Gammaproteobacteria Enterobacteriales Enterobacteriaceae Proteus
PT114-1 160 Acidovorax sp. AY258065 100 Comamonas sp. Proteobacteria Betaproteobacteria Burkholderiales Burkholderiaceae Comamonas
PT118-1 160 Lactobacillus crispatus AF257097 100 Lactobacillus crispatus Firmicutes Bacilli Lactobacillales Lactobacillaceae Lactobacillus
PT120-1 160 Janibacter limosus Y08539 100 Janibacter sp. Actinobacteria Actinobacteria Actinomycetales Intrasporangiaceae Janibacter
PT121-1 160 Bacillus flexus AB021185 100 Bacillus flexus Firmicutes Bacilli Bacillales Bacillaceae Bacillus
PT123-1 160 Moraxella catarrhalis U10876 100 Moraxella catarrhalis Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Moraxella
PT125-1 160 Unc. bacterium GQ094081 100 Peptoniphilus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Peptoniphilus
PT127-1 160 Fusobacterium nucleatum subsp. nucleatum AJ810277 100 Fusobacterium nucleatum Fusobacteria Fusobacteria Fusobacteriales Fusobacteriaceae Fusobacterium
PT130-1 160 Corynebacterium pseudodiphtheriticum AJ439343 99 Corynebacterium pseudodiphtheriticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT132-1 160 Nocardioides albus AF004988 100 Nocardioides sp. Actinobacteria Actinobacteria Actinomycetales Nocardioidaceae Nocardioides
PT136-1 160 Unc. soil bacterium EF688360 100 Micrococcineae bacterium Actinobacteria Actinobacteria Actinomycetales Cellulomonadaceae Cellulomonas
PT137-1 160 Porphyromonas sp. oral taxon GU429852 100 Porphyromonas sp. Bacteroidetes Bacteroidia Bacteroidales Porphyromonadaceae Porphyromonas
PT140-1 160 Unc. bacterium JF226494 100 Dietzia sp. Actinobacteria Actinobacteria Actinomycetales Dietziaceae Dietzia
PT141-1 160 Unc. bacterium GQ026602 100 Actinomyces oris Actinobacteria Actinobacteria Actinomycetales Actinomycetaceae Actinomyces
PT143-1 160 Corynebacterium imitans Y09044 98 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT143-2 160 Corynebacterium imitans Y09044 99 Corynebacterium imitans Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT144-1 160 Escherichia coli CP002516 100 Escherichia/Salmonella/Citrobacter Proteobacteria Gammaproteobacteria Enterobacteriales Enterobacteriaceae Escherichia/Salmonella/Citrobacter
PT145-1 160 Escherichia coli AJ567540 100 Escherichia/Shigella Proteobacteria Gammaproteobacteria Enterobacteriales Enterobacteriaceae Escherichia/Shigella
PT149-1 160 Streptococcus equinus AB002481 100 Streptococcus bovis / Streptococcus lutetiensis / Streptococcus equinus Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT150-1 160 Staphylococcus sciuri AJ421446 100 Staphylococcus sciuri Firmicutes Bacilli Bacillales Staphylococcaceae Staphylococcus
PT152-1 160 Kocuria koreensis FJ607312 100 Kocuria koreensis Actinobacteria Actinobacteria Actinomycetales Micrococcaceae Kocuria
PT153-1 160 Helcococcus ovis Y16279 91 Helcococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Helcococcus
PT154-1 160 Acinetobacter baumannii X81660 100 Acinetobacter baumannii Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Acinetobacter
PT155-1 160 Haemophilus parahaemolyticus JF506635 99 Haemophilus parahaemolyticus Proteobacteria Gammaproteobacteria Pasteurellales Pasteurellaceae Haemophilus
PT156-1 160 Klebsiella pneumoniae X87276 100 Klebsiella pneumoniae / Enterobacter Proteobacteria Gammaproteobacteria Enterobacteriales Enterobacteriaceae Klebsiella
PT157-1 160 Unc. bacterium GQ050470 100 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT160-1 160 Unc. bacterium JF193426 100 Prevotella sp. Bacteroidetes Bacteroidia Bacteroidales Prevotellaceae Prevotella
PT161-1 160 Unc. bacterium GQ059838 100 Turicella sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Turicella
PT162-1 160 Unc. bacterium HM343633 100 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT163-1 160 Brevibacterium oceani AM158906 100 Brevibacterium oceani Actinobacteria Actinobacteria Actinomycetales Brevibacteriaceae Brevibacterium
PT164-1 160 Candidatus Prevotella conceptionensis DQ139963 100 Prevotella conceptionensis Bacteroidetes Bacteroidia Bacteroidales Prevotellaceae Prevotella
PT166-1 160 Janibacter anophelis AY837752 100 Janibacter anophelis Actinobacteria Actinobacteria Actinomycetales Intrasporangiaceae Janibacter
PT168-1 160 Leuconostoc lactis AB596941 100 Leuconostoc lactis / Leuconostoc garlicum Firmicutes Bacilli Lactobacillales Leuconostocaceae Leuconostoc
PT168-2 160 Leuconostoc citreum DQ489736 99 Leuconostoc citreum/Leuconostoc palmae Firmicutes Bacilli Lactobacillales Leuconostocaceae Leuconostoc
PT170-1 160 Lactobacillus fermentum EF535258 100 Lactobacillus fermentum Firmicutes Bacilli Lactobacillales Lactobacillaceae Lactobacillus
PT171-1 160 Unc. bacterium AJ583205 100 Unclassified bacterium Unk.Phylum Unk.Class
PT172-1 160 Pseudomonas jessenii AF068259 100 Pseudomonas jessenii / Pseudomonas reinekei Proteobacteria Gammaproteobacteria Pseudomonadales Pseudomonadaceae Pseudomonas
PT173-1 160 Acinetobacter calcoaceticus AJ888984 100 Acinetobacter calcoaceticus / Acinetobacter rhizosphaera Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Acinetobacter
PT174-1 160 Unc. bacterium HM186144 96 Micrococcineae bacterium Actinobacteria Actinobacteria Actinomycetales Mobiluncus
PT175-1 160 Unc. bacterium JF236101 99 Streptococcus sp. Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT176-1 160 Lactobacillus johnsonii AY531540 100 Lactobacillus johnsonii / Lactobacillus taiwanensis Firmicutes Bacilli Lactobacillales Lactobacillaceae Lactobacillus
PT177-1 160 Unc. bacterium HM271154 99 Streptococcus sp. Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT178-1 160 Brevibacterium sp. AY577816 100 Brevibacterium sp. Actinobacteria Actinobacteria Actinomycetales Brevibacteriaceae Brevibacterium
PT179-1 160 Propionibacterium granulosum AJ003057 97 Uncultured Actinomycetales bacterium Actinobacteria Actinobacteria Actinomycetales
PT180-1 160 Actinobacillus arthritidis L06084 98 Actinobacillus sp. Proteobacteria Gammaproteobacteria Pasteurellales Pasteurellaceae Actinobacillus
PT181-1 160 Unc. bacterium HM306116 99 Janibacter sp. Actinobacteria Actinobacteria Actinomycetales Intrasporangiaceae Janibacter
PT182-1 160 Unc. bacterium GQ080513 98 Dolosigranulum sp. Firmicutes Bacilli Lactobacillales Carnobacteriaceae Dolosigranulum
PT183-1 160 Weissella confusa M23036 100 Weissella confusa Firmicutes Bacilli Lactobacillales Leuconostocaceae Weissella
PT184-1 160 Brevibacterium linens EU660372 100 Brevibacterium linens / Brevibacterium iodinum Actinobacteria Actinobacteria Actinomycetales Brevibacteriaceae Brevibacterium
PT186-1 160 Brachybacterium sp. AB617574 98 Brachybacterium sp. Actinobacteria Actinobacteria Actinomycetales Dermabacteraceae Brachybacterium
PT188-1 160 Ornithinicoccus sp. AB188219 99 Ornithinicoccus sp. Actinobacteria Actinobacteria Actinomycetales Intrasporangiaceae Ornithinicoccus
PT189-1 160 Unc. bacterium clone JF241112 94 Unclassified bacterium Unk.Phylum Unk.Class
PT190-1 160 Unc. bacterium HM292641 100 Brevibacterium sp. Actinobacteria Actinobacteria Actinomycetales Brevibacteriaceae Brevibacterium
PT191-1 160 Kurthia gibsonii AB271738 100 Kurthia gibsonii Firmicutes Bacilli Bacillales Planococcaceae Kurthia
PT192-1 160 Streptococcus downei AY188350 99 Streptococcus downei Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT193-1 160 Saccharopolyspora halophila DQ923129 98 Pseudonocardiaceae Actinobacteria Actinobacteria Actinomycetales Pseudonocardiaceae Saccharopolyspora
PT194-1 160 Unc. bacterium JF131185 99 Streptobacillus sp. Fusobacteria Fusobacteria Fusobacteriales Leptotrichiaceae Streptobacillus
PT197-1 160 Enterococcus saccharolyticus AF061004 100 Enterococcus saccharolyticus Firmicutes Bacilli Lactobacillales Enterococcaceae Enterococcus
PT199-1 160 Unc. bacterium HM306359 99 Janibacter sp. Actinobacteria Actinobacteria Actinomycetales Intrasporangiaceae Janibacter
PT200-1 160 Unc. bacterium EU779437 90 Uncultured flavobacteriales sp. Bacteroidetes Flavobacteria Flavobacteriales Flavobacteriaceae Wautersiella
PT201-1 160 Unc. bacterium HM333437 96 Uncultured flavobacteriales sp. Bacteroidetes Flavobacteria Flavobacteriales Flavobacteriaceae Cloacibacterium
PT204-1 160 Wautersiella falsenii AM084341 99 Wautersiella falsenii Bacteroidetes Flavobacteria Flavobacteriales Flavobacteriaceae Wautersiella
PT206-1 160 Unc. bacterium HM247613 99 Streptobacillus sp. Fusobacteria Fusobacteria Fusobacteriales Leptotrichiaceae Streptobacillus
PT209-1 160 Unc. bacterium FM875546 96 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
PT212-1 160 Helicobacter pylori CP002336 96 Helicobacter sp. Proteobacteria Epsilonproteobacteria Campylobacterales Helicobacteraceae Helicobacter
PT214-1 160 Prevotella sp. oral taxon GU409530 99 Prevotella sp. Bacteroidetes Bacteroidia Bacteroidales Prevotellaceae Prevotella
PT222-1 160 Streptococcus sp. oral clone AF432134 100 Streptococcus sp. Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT226-1 160 Unc. bacterium GQ080513 98 Dolosigranulum sp. Firmicutes Bacilli Lactobacillales Carnobacteriaceae Dolosigranulum
PT228-1 160 Dermacoccus sp. PB08bact48 JN412434 100 Micrococcineae bacterium Actinobacteria Actinobacteria Actinomycetales
PT231-1 160 Kribbia sp. DQ985060 100 Micrococcineae bacterium Actinobacteria Actinobacteria Actinomycetales
PT233-1 160 Moraxella lacunata D64049 98 Moraxella nonliquefaciens / Moraxella lacunata Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Moraxella
PT235-1 160 Actinomycetospora cinnamomea AB514520 100 Actinomycetospora cinnamomea Actinobacteria Actinobacteria Actinomycetales Pseudonocardiaceae Actinomycetospora
PT238-1 160 Pseudonocardia alni AJ252823 100 Pseudonocardia sp. Actinobacteria Actinobacteria Actinomycetales Pseudonocardiaceae Pseudonocardia
PT239-1 160 Unc. Nesterenkonia sp. HQ674863 100 Uncultured Nesterenkonia sp. Actinobacteria Actinobacteria Actinomycetales Micrococcaceae Nesterenkonia
PT241-1 160 Flexivirga alba AB539735 97 Micrococcineae bacterium Actinobacteria Actinobacteria Actinomycetales
PT246-1 160 Microlunatus soli FJ807672 100 Propionibacteriaceae bacterium Actinobacteria Actinobacteria Actinomycetales Propionibacteriaceae
PT250-1 160 Bacteroides intestinalis AB214328 100 Bacteroides intestinalis / Bacteroides cellulosilyticus Bacteroidetes Bacteroidia Bacteroidales Bacteroidaceae Bacteroides
PT251-1 160 Acinetobacter schindleri FJ860880 100 Acinetobacter schindleri / Acinetobacter johnsonii Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Acinetobacter
PT253-1 160 Agrostis stolonifera EF115543 / Unc. bacterium GQ076170 95/97 Cyanobacterium/Chloroplast Cyanobacteria/Chloroplast Chloroplast
PT254-1 160 Corynebacterium pilbarense FN295567 100 Corynebacterium pilbarense / Corynebacterium coyleae Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT261-1 160 Acinetobacter sp. FN395270 100 Acinetobacter sp. Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Acinetobacter
PT278-1 160 Flavobacteriaceae bacterium JN019026 100 Cloacibacterium sp. Bacteroidetes Flavobacteria Flavobacteriales Flavobacteriaceae Cloacibacterium
PT280-1 160 Salinicoccus carnicancri FJ182049 98 Salinicoccus sp. Firmicutes Bacilli Bacillales Staphylococcaceae Salinicoccus
PT281-1 160 Bacteroides uniformis AB050110 100 Bacteroides uniformis Bacteroidetes Bacteroidia Bacteroidales Bacteroidaceae Bacteroides
PT282-1 160 Cystobacter fuscus M94276 100 Cystobacter sp. Proteobacteria Deltaproteobacteria Myxococcales Cystobacteraceae Cystobacter
PT317-1 160 Nicotiana tabacum Z00044 100 Cyanobacterium/Chloroplast Cyanobacteria/Chloroplast Chloroplast
PT360-1 160 Gordonia terrae X79286 100 Gordonia sp. Actinobacteria Actinobacteria Actinomycetales Nocardiaceae Gordonia
PT372-1 160 Actinomyces sp. oral strain AF287751 100 Actinomyces sp. Actinobacteria Actinobacteria Actinomycetales Actinomycetaceae Actinomyces
PT403-1 160 Corynebacterium sp. AY581888 100 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT438-1 160 Unc. bacterium DQ327445 100 Uncultured Clostridium XI sp. Firmicutes Clostridia Clostridiales Peptostreptococcaceae ClostridiumXI
PT487-1 160 Gemella haemolysans L14326 100 Gemella haemolysans Firmicutes Bacilli Bacillales BacillalesIncertaesedisXI Gemella
PT488-1 160 Gemella sanguinis Y13364 100 Gemella sanguinis/Gemella morbillorum Firmicutes Bacilli Bacillales BacillalesIncertaesedisXI Gemella
PT489-1 160 Streptococcus mutans DQ677759 100 Streptococcus mutans Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT490-1 160 Bacteroides massiliensis AY126616 100 Bacteroides massiliensis / Bacteroides vulgatus Bacteroidetes Bacteroidia Bacteroidales Bacteroidaceae Bacteroides
PT491-1 160 Unc. bacterium DQ015145 100 Turicibacter sp. Firmicutes Erysipelotricha Erysipelotrichales Erysipelotrichaceae Turicibacter
PT493-1 160 Enterobacter asburiae HQ242719 100 Enterobacter/Yersinia Proteobacteria Gammaproteobacteria Enterobacteriales Enterobacteriaceae Enterobacter/Yersinia
PT494-1 160 Brochothrix thermosphacta AY543023 100 Brochothrix thermosphacta Firmicutes Bacilli Bacillales Listeriaceae Brochothrix
PT496-1 160 Lactobacillus sakei AM113784 100 Lactobacillus sakei Firmicutes Bacilli Lactobacillales Lactobacillaceae Lactobacillus
PT497-1 160 Unc. bacterium EU467572 91 Uncultured Clostridiales bacterium Firmicutes Clostridia Clostridiales
PT502-1 160 Campylobacter mucosalis DQ174173 94 Campylobacter sp. Proteobacteria Epsilonproteobacteria Campylobacterales Campylobacteraceae Campylobacter
PT503-1 160 Actinobacillus ureae AY362900 100 Actinobacillus sp. Proteobacteria Gammaproteobacteria Pasteurellales Pasteurellaceae Actinobacillus
PT505-1 160 Unc. bacterium JF145937 100 Neisseriaceae bacterium Proteobacteria Betaproteobacteria Neisseriales Neisseriaceae
PT506-1 160 Staphylococcus haemolyticus AP006716 100 Staphylococcus haemolyticus Firmicutes Bacilli Bacillales Staphylococcaceae Staphylococcus
PT507-1 160 Unc. bacterium HM257861 99 Neisseriaceae bacterium Proteobacteria Betaproteobacteria Neisseriales Neisseriaceae
PT508-1 160 Staphylococcus hominis X66101 100 Staphylococcus hominis Firmicutes Bacilli Bacillales Staphylococcaceae Staphylococcus
PT509-1 160 Unc. bacterium GQ018981 100 Staphylococcus sp. Firmicutes Bacilli Bacillales Staphylococcaceae Staphylococcus
PT510-1 160 Moraxella lincolni FR822735 99 Moraxella lincolni Proteobacteria Gammaproteobacteria Pseudomonadales Moraxellaceae Moraxella
PT511-1 160 Corynebacterium kroppenstedtii Y10077 100 Corynebacterium kroppenstedtii Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT511-2 160 Unc. bacterium GQ061146 99 Corynebacterium sp. Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT512-1 160 Corynebacterium pseudodiphtheriticum AJ439343 99 Corynebacterium propinquum / Corynebacterium pseudodiphtheriticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT512-2 160 Corynebacterium pseudodiphtheriticum AJ439343 99 Corynebacterium propinquum / Corynebacterium pseudodiphtheriticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT512-3 160 Corynebacterium pseudodiphtheriticum AJ439343 99 Corynebacterium propinquum / Corynebacterium pseudodiphtheriticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT512-4 160 Unc. bacterium HM302210 100 Corynebacterium propinquum / Corynebacterium pseudodiphtheriticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT512-5 160 Corynebacterium pseudodiphtheriticum AJ439343 100 Corynebacterium propinquum / Corynebacterium pseudodiphtheriticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT513-1 160 Unc. bacterium JF179978 100 Corynebacterium lubricantis Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT514-1 160 Unc. bacterium HM342925 100 Uncultured Actinomycetales bacterium Actinobacteria Actinobacteria Actinomycetales UnclassifiedActinomycetales
PT515-1 160 Rhodococcus rhodochrous X79288 100 Rhodococcus sp. Actinobacteria Actinobacteria Actinomycetales Nocardiaceae Rhodococcus
PT516-1 160 Enterobacter aerogenes CP002824 100 Enterobacter Proteobacteria Gammaproteobacteria Enterobacteriales Enterobacteriaceae Enterobacter
PT517-1 160 Unc. bacterium GQ038683 99 Neisseriaceae bacterium Proteobacteria Betaproteobacteria Neisseriales Neisseriaceae
PT518-1 160 Corynebacterium imitans Y09044 100 Corynebacterium imitans Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT519-1 160 Unc. bacterium GQ104724 100 Brachybacterium sp. Actinobacteria Actinobacteria Actinomycetales Dermabacteraceae Brachybacterium
PT520-1 160 Corynebacterium propinquum X84438 99 Corynebacterium propinquum / Corynebacterium pseudodiphtheriticum Actinobacteria Actinobacteria Actinomycetales Corynebacteriaceae Corynebacterium
PT521-1 160 Staphylococcus arlettae AB009933 99 Staphylococcus arlettae Firmicutes Bacilli Bacillales Staphylococcaceae Staphylococcus
PT522-1 160 Streptococcus oralis AY281080 100 Streptococcus oralis / Streptococcus mitis / Streptococcus pneumoniae Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT523-1 160 Streptococcus sp. oral strain AF385525 100 Streptococcus sp. Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT524-1 160 Streptococcus sp. oral strain AY005040 99 Streptococcus sp. Firmicutes Bacilli Lactobacillales Streptococcaceae Streptococcus
PT525-1 160 Fusobacterium periodonticum FJ471667 100 Fusobacterium periodonticum / Fusobacterium nucleatum / Fusobacterium russii Fusobacteria Fusobacteria Fusobacteriales Fusobacteriaceae Fusobacterium
PT526-1 160 Brachybacterium faecium X91032 100 Brachybacterium faecium / Brachybacterium fresconis Actinobacteria Actinobacteria Actinomycetales Dermabacteraceae Brachybacterium
PT527-1 160 Brachybacterium sp. AM396915 99 Brachybacterium sp. Actinobacteria Actinobacteria Actinomycetales Dermabacteraceae Brachybacterium
PT528-1 160 Dietzia natronolimnaea FJ468335 100 Dietzia sp. Actinobacteria Actinobacteria Actinomycetales Dietziaceae Dietzia
PT529-1 160 Parabacteroides merdae AB238928 100 Parabacteroides merdae Bacteroidetes Bacteroidia Bacteroidales Porphyromonadaceae Parabacteroides
PT531-1 160 Unc. bacterium GQ081220 97 Anaerococcus sp. Firmicutes Clostridia Clostridiales IncertaeSedisXI Anaerococcus
Table S3. Gender and age of westernized volunteers. Information on each of the 92 westernized volunteers of this study.
Sample Gender Age Sample Gender Age Sample Gender Age
GER1 F 31y GER59 F 43y GER100 M n.d
GER2 F 42y GER60 F 44y GER110 F 52y
GER4 M 47y GER61 M 53y GER111 M n.d
GER5 F 28y GER63 F n.d GER112 F 46y
GER6 F 55y GER65 F 20y GER113 F 29y
GER7 M 34y GER66 F 55y GER114 M 53y
GER8 F 57y GER67 F n.d GER115 M n.d
GER9 M 31y GER69 F 23y GER116 M 19y
GER10 F 46y GER70 M 35y GER117 M 38y
GER11 F 26y GER71 F n.d GER118 F 35y
GER12 F 43y GER72 F 23y GER119 F 30y
GER13 M 27y GER73 F 44y GER120 M 33y
GER14 F 45y GER74 F 49y GER121 F 30y
GER15 F 33y GER75 F 43y GER122 F 31y
GER18 F 29y GER76 M 49y GER123 M 55y
GER19 F 34y GER77 F 39y GER124 F 38y
GER20 F 37y GER79 F 51y GER125 M n.d
GER21 F 45y GER80 M 27y GER126 F 45y
GER25 F 46y GER82 M 56y GER127 F 50y
GER26 M 49y GER84 F n.d GER128 F 43y
GER27 F 24y GER85 F n.d GER129 M n.d
GER28 F 63y GER87 M 40y GER130 F 30y
GER29 F 64y GER88 M 19y GER131 M n.d
GER30 F 53y GER89 M 65y GER132 M 26y
GER33 F 56y GER90 M 59y GER133 M 26y
GER34 M 36y GER91 M 42y GER134 F 34y
GER40 F 49y GER93 M 53y GER135 M 35y
GER45 F 34y GER96 F 46y GER137 M 29y
GER46 F 26y GER97 F n.d GER138 M 31y
GER52 F 45y GER98 F n.d GER140 F 59y
GER58 M 49y GER99 M n.d
y: age in years; n.d: adult, but exact age was not disclosed.
Table S4. Gender and age of non-westernized volunteers. Information on each of the 98 non-westernized volunteers of this study.
Sample Gender Age Sample Gender Age Sample Gender Age
GAB1 M 30y GAB36 M 3y GAB69 F 41y
GAB2 M 41y GAB37 F 40y GAB70 F 7y
GAB3 M 23y GAB38 M 49y GAB71 F 6y
GAB4 M 37y GAB39 M 45y GAB72 F 37y
GAB5 M 44y GAB40 M 84y GAB73 M 35y
GAB6 M 14y GAB41 M 45y GAB76 M 40y
GAB7 M 47y GAB42 M 25y GAB81 F 5y
GAB8 F 3y GAB43 M 23y GAB82 F 7y
GAB9 F 4y GAB44 M 29y GAB83 F 9y
GAB10 F 3y GAB45 M 74y GAB84 M 10y
GAB11 F 3y GAB46 F 49y GAB85 M 3y
GAB12 F 2y GAB47 M 50y GAB86 M 13y
GAB13 M 7y GAB48 M 19y GAB87 M 43y
GAB14 F 30y GAB49 F 74y GAB88 M 10y
GAB15 F 24y GAB50 F 77y GAB90 M 6y
GAB16 F 25y GAB51 F 74y GAB91 F 5y
GAB17 M 43y GAB52 F 24y GAB92 F 4y
GAB18 F 21y GAB53 M 45y GAB93 F 11y
GAB19 M 61y GAB54 M 23y GAB94 M 10y
GAB20 F 41y GAB55 M 24y GAB95 F 9m
GAB21 F 70y GAB56 M 37y GAB96 M 3y
GAB23 M 39y GAB57 M 10m GAB97 F 3m
GAB24 M 1m GAB58 F 4y GAB98 F 22y
GAB25 M 6y GAB59 F 3y GAB99 F 18y
GAB26 F 4y GAB60 M 44y GAB100 F 2m
GAB27 F 5y GAB61 F 47y GAB101 F 19y
GAB28 F 13y GAB62 M 43y GAB102 M 2y
GAB29 F 4y GAB63 F 20y GAB103 F 19y
GAB30 M 2y GAB64 M 28y GAB104 F 19y
GAB31 M 1m GAB65 M 27y GAB105 M 3y
GAB32 M 12y GAB66 M 42y GAB106 F 3m
GAB34 M 42y GAB67 M 49y GAB107 F 4m
GAB35 M 18y GAB68 F 9y
y: age in years; m: age in months
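The age codes in Tables S3 and S4 mix years ("31y"), months ("9m") and undisclosed adult ages ("n.d"). A minimal sketch of how such codes could be put on a common scale in years is given below; the function name `age_in_years` is hypothetical and is not part of the analysis described in this study.

```python
import re
from typing import Optional

# Matches codes such as "31y" (years) or "9m" (months), as used in Tables S3 and S4.
_AGE_CODE = re.compile(r"^(\d+)([ym])$")


def age_in_years(code: str) -> Optional[float]:
    """Convert an age code to years; return None for undisclosed ages ("n.d")."""
    code = code.strip()
    if code in {"n.d", "n.d."}:
        return None  # adult, exact age not disclosed
    match = _AGE_CODE.match(code)
    if match is None:
        raise ValueError(f"unrecognised age code: {code!r}")
    value, unit = int(match.group(1)), match.group(2)
    # Months are converted to a fraction of a year.
    return float(value) if unit == "y" else value / 12.0


print(age_in_years("31y"), age_in_years("9m"), age_in_years("n.d"))
```

Returning `None` for "n.d" keeps undisclosed ages distinguishable from an age of zero, so they can be excluded explicitly from any age-based grouping.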
Table S5. Primers used in this study.
Primer name Primer Sequence (5'-3') Reference
Illumina primers
IlluFBC1 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAAGCCTCAAGAGTTTGATCMTGGCTCAG
IlluFBC2 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAAGTTACAAGAGTTTGATCMTGGCTCAG
IlluFBC3 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAATACGCAAGAGTTTGATCMTGGCTCAG
IlluFBC4 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAATCGACAAGAGTTTGATCMTGGCTCAG
IlluFBC5 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAATGATCAAGAGTTTGATCMTGGCTCAG
IlluFBC6 ACACTCTTTCCCTACACGACGCTCTTCCGATCTACCAATCAAGAGTTTGATCMTGGCTCAG
IlluFBC7 ACACTCTTTCCCTACACGACGCTCTTCCGATCTACCGTCCAAGAGTTTGATCMTGGCTCAG
IlluFBC8 ACACTCTTTCCCTACACGACGCTCTTCCGATCTACCTCGCAAGAGTTTGATCMTGGCTCAG
IlluFBC9 ACACTCTTTCCCTACACGACGCTCTTCCGATCTACGAGGCAAGAGTTTGATCMTGGCTCAG
IlluFBC10 ACACTCTTTCCCTACACGACGCTCTTCCGATCTACTATACAAGAGTTTGATCMTGGCTCAG
IlluFBC11 ACACTCTTTCCCTACACGACGCTCTTCCGATCTACTTACCAAGAGTTTGATCMTGGCTCAG
IlluFBC12 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAGAAGCCAAGAGTTTGATCMTGGCTCAG
IlluFBC13 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAGACCGCAAGAGTTTGATCMTGGCTCAG
IlluFBC14 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAGAGTACAAGAGTTTGATCMTGGCTCAG
IlluFBC15 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAGATATCAAGAGTTTGATCMTGGCTCAG
IlluFBC16 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAGCATGCAAGAGTTTGATCMTGGCTCAG
IlluFBC17 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAGCCGTCAAGAGTTTGATCMTGGCTCAG
IlluFBC18 ACACTCTTTCCCTACACGACGCTCTTCCGATCTAGGCTCCAAGAGTTTGATCMTGGCTCAG This study
IlluFBC19 ACACTCTTTCCCTACACGACGCTCTTCCGATCTATAGACCAAGAGTTTGATCMTGGCTCAG
IlluFBC20 ACACTCTTTCCCTACACGACGCTCTTCCGATCTATATTGCAAGAGTTTGATCMTGGCTCAG
IlluFBC21 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCAGATGCAAGAGTTTGATCMTGGCTCAG
IlluFBC22 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCAGCAACAAGAGTTTGATCMTGGCTCAG
IlluFBC23 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCATAGCCAAGAGTTTGATCMTGGCTCAG
IlluFBC24 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCCAGGTCAAGAGTTTGATCMTGGCTCAG
IlluFBC25 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCCGCTTCAAGAGTTTGATCMTGGCTCAG
IlluFBC26 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCCGTAGCAAGAGTTTGATCMTGGCTCAG
IlluFBC27 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCCTGAACAAGAGTTTGATCMTGGCTCAG
IlluFBC28 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCGAATTCAAGAGTTTGATCMTGGCTCAG
IlluFBC29 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCGATCACAAGAGTTTGATCMTGGCTCAG
IlluFBC30 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCGCAACCAAGAGTTTGATCMTGGCTCAG
IlluFBC31 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCGCGCTCAAGAGTTTGATCMTGGCTCAG
IlluFBC32 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCGTCAGCAAGAGTTTGATCMTGGCTCAG
IlluFBC33 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCTAAGACAAGAGTTTGATCMTGGCTCAG
IlluFBC34 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCTCCATCAAGAGTTTGATCMTGGCTCAG
IlluFBC35 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCTCGGCCAAGAGTTTGATCMTGGCTCAG
IlluFBC36 ACACTCTTTCCCTACACGACGCTCTTCCGATCTCTGCGGCAAGAGTTTGATCMTGGCTCAG
IlluFBC37 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGACCTTCAAGAGTTTGATCMTGGCTCAG
IlluFBC38 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGACGACCAAGAGTTTGATCMTGGCTCAG
IlluFBC39 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGATTAGCAAGAGTTTGATCMTGGCTCAG
IlluFBC40 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGCATCCCAAGAGTTTGATCMTGGCTCAG
IlluFBC41 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGCCAGCCAAGAGTTTGATCMTGGCTCAG
IlluFBC42 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGCGACTCAAGAGTTTGATCMTGGCTCAG
IlluFBC43 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGCTCATCAAGAGTTTGATCMTGGCTCAG
IlluFBC44 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGGCCAACAAGAGTTTGATCMTGGCTCAG
IlluFBC45 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGGCTTCCAAGAGTTTGATCMTGGCTCAG
IlluFBC46 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGGTACCCAAGAGTTTGATCMTGGCTCAG
IlluFBC47 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGTAATCCAAGAGTTTGATCMTGGCTCAG
IlluFBC48 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGTACGTCAAGAGTTTGATCMTGGCTCAG
IlluFBC49 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGTCAAGCAAGAGTTTGATCMTGGCTCAG
IlluFBC50 ACACTCTTTCCCTACACGACGCTCTTCCGATCTGTCTCTCAAGAGTTTGATCMTGGCTCAG
IlluFBC51 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTACGGTCAAGAGTTTGATCMTGGCTCAG
IlluFBC52 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTACTTGCAAGAGTTTGATCMTGGCTCAG
IlluFBC53 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTATCTCCAAGAGTTTGATCMTGGCTCAG
IlluFBC54 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTCATAACAAGAGTTTGATCMTGGCTCAG
IlluFBC55 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTCCGAGCAAGAGTTTGATCMTGGCTCAG
IlluFBC56 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTCGCCGCAAGAGTTTGATCMTGGCTCAG
IlluFBC57 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTCTAGTCAAGAGTTTGATCMTGGCTCAG
IlluFBC58 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTCTGCCCAAGAGTTTGATCMTGGCTCAG
IlluFBC59 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTGGTCCCAAGAGTTTGATCMTGGCTCAG
IlluFBC60 ACACTCTTTCCCTACACGACGCTCTTCCGATCTTTATGCCAAGAGTTTGATCMTGGCTCAG
Multiplexing PCR AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT
IlluRevAdap GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCTTGCTGCCTCCCGTAGGAGT
Index 1 CAAGCAGAAGACGGCATACGAGATCGTGATGTGACTGGAGTTC
Index 2 CAAGCAGAAGACGGCATACGAGATACATCGGTGACTGGAGTTC
Index 4 CAAGCAGAAGACGGCATACGAGATTGGTCAGTGACTGGAGTTC Illumina Inc.
Index 6 CAAGCAGAAGACGGCATACGAGATATTGGCGTGACTGGAGTTC
Index 8 CAAGCAGAAGACGGCATACGAGATTCAAGTGTGACTGGAGTTC
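The IlluFBC forward primers in Table S5 all share the same 33 nt at the 5' end and the same 22 nt at the 3' end, and differ only in the 6 nt in between. The sketch below splits a primer into these parts. The segment labels (adapter, barcode, CA linker, 16S primer) are an interpretation inferred from the sequences themselves and are not stated in the table; the names `ForwardPrimer` and `split_forward_primer` are hypothetical.

```python
from typing import NamedTuple

# Segment boundaries inferred from the IlluFBC sequences in Table S5 (assumption).
ADAPTER = "ACACTCTTTCCCTACACGACGCTCTTCCGATCT"  # shared 5' stretch, 33 nt
BARCODE_LEN = 6                                 # variable stretch, assumed sample barcode
LINKER = "CA"                                   # constant 2 nt, assumed linker
PRIMER_16S = "AGAGTTTGATCMTGGCTCAG"             # shared 3' stretch; M = A or C (IUPAC)


class ForwardPrimer(NamedTuple):
    adapter: str
    barcode: str
    linker: str
    primer_16s: str


def split_forward_primer(seq: str) -> ForwardPrimer:
    """Split an IlluFBC primer into its four assumed segments."""
    seq = seq.strip().upper()
    start = len(ADAPTER)
    barcode = seq[start:start + BARCODE_LEN]
    tail = seq[start + BARCODE_LEN:]
    # Reject sequences that do not follow the assumed layout.
    if not seq.startswith(ADAPTER) or len(barcode) != BARCODE_LEN or tail != LINKER + PRIMER_16S:
        raise ValueError("sequence does not follow the assumed primer layout")
    return ForwardPrimer(ADAPTER, barcode, LINKER, PRIMER_16S)


illufbc1 = "ACACTCTTTCCCTACACGACGCTCTTCCGATCTAAGCCTCAAGAGTTTGATCMTGGCTCAG"
print(split_forward_primer(illufbc1).barcode)
```

Applying the same function to all 60 IlluFBC entries gives a quick consistency check on the table, since any transcription error in the constant regions raises a `ValueError`.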