
RESEARCH ARTICLES by the unmodified strand. We therefore devised a single-stranded library preparation method wherein the ancient DNA is dephosphorylated, A High-Coverage Genome Sequence heat denatured, and ligated to a biotinylated adap- tor oligonucleotide, which allows its immobiliza- tion on streptavidin-coated beads (Fig. 1). A primer from an Archaic Denisovan Individual hybridized to the adaptor is then used to copy the original strand with a DNA polymerase. Fi- Matthias Meyer,1*‡ Martin Kircher,1*† Marie-Theres Gansauge,1 Heng Li,2 Fernando Racimo,1 2,3 4 4 1 1 nally, a second adaptor is joined to the copied Swapan Mallick, Joshua G. Schraiber, Flora Jay, Kay Prüfer, Cesare de Filippo, strand by blunt-end ligation, and the library mol- Peter H. Sudmant,6 Can Alkan,5,6 Qiaomei Fu,1,7 Ron Do,2 Nadin Rohland,2,3 Arti Tandon,2,3 1 8 3 3 1 ecules are released from the beads. The entire pro- Michael Siebauer, Richard E. Green, Katarzyna Bryc, Adrian W. Briggs, Udo Stenzel, tocol is devoid of DNA purification steps, which Jesse Dabney,1 Jay Shendure,6 Jacob Kitzman,6 Michael F. Hammer,9 Michael V. Shunkov,10 10 2 1 6,11 inevitably cause loss of material. Anatoli P. Derevianko, Nick Patterson, Aida M. Andrés, Evan E. Eichler, We applied this method to aliquots of the two 4 2,3‡ 1 1‡ Montgomery Slatkin, David Reich, Janet Kelso, Svante Pääbo DNA extracts (as well as side fractions) that were previously generated from the 40 mg of bone We present a DNA library preparation method that has allowed us to reconstruct a high-coverage that comprised the entire inner part of the pha- (30×) genome sequence of a Denisovan, an extinct relative of Neandertals. The quality of this lanx (2, 8). Comparisons of these newly generated genome allows a direct estimation of Denisovan heterozygosity indicating that genetic diversity libraries with the two libraries generated in the in these archaic hominins was extremely low. It also allows tentative dating of the specimen on previous study (2) show at least a 6-fold and 22- the basis of “missing evolution” in its genome, detailed measurements of Denisovan and fold increase in the recovery of library mole- Neandertal admixture into present-day human populations, and the generation of a near-complete cules (8). catalog of genetic changes that swept to high frequency in modern humans since their In addition to improved sequence yield, the divergence from Denisovans. single-strand library protocol reveals new aspects of DNA fragmentation and modification pat- raft genome sequences have been recov- pendent population history that differs from that terns (8). Because the ends of both DNA strands ered from two archaic human groups, of Neandertals. Also, whereas a genetic con- are left intact, it reveals that strand breakage DNeandertals (1) and Denisovans (2). tribution from Neandertal to the present-day occurs preferentially before and after guanine Whereas Neandertals are defined by distinct human gene pool is present in all populations residues (fig. S6), which suggests that guanine morphological features and occur in the fossil outside Africa, a contribution from Denisovans nucleotides are frequently lost from ancient on March 30, 2016 record of Europe and western and central Asia is found exclusively in island Southeast Asia DNA, possibly as the result of depurination. It from at least 230,000 until about 30,000 years and Oceania (6). ago (3), Denisovans are known only from a dis- Both published archaic genome sequences tal manual phalanx and two molars, all exca- are of low coverage: 1.9-fold genomic coverage vated at Denisova Cave in the Altai Mountains from the Denisovan phalanx and a total of 1.3- in southern Siberia (2, 4, 5). The draft nuclear fold derived from three Croatian Neandertals. As genome sequence retrieved from the Denisovan a consequence, many positions in the genomes phalanx revealed that Denisovans are a sister are affected by sequencing errors or nucleotide Downloaded from group to Neandertals (2), with the Denisovan misincorporations caused by DNA damage. Pre- nuclear genome sequence falling outside Nean- vious attempts to generate a genome sequence dertal genetic diversity, which suggests an inde- of high coverage from an archaic human have been hampered by high levels of environmental contamination. The fraction of hominin endog- 1Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, D-04103 Leipzig, Germany. enous DNA is commonly smaller than 1% and 2Broad Institute of Massachusetts Institute of Technology rarely approaches 5% (1, 7), which makes shot- and Harvard, Cambridge, MA 02142, USA. 3Department of gun sequencing of the entire genome econom- Genetics, Harvard Medical School, Boston, MA 02115, USA. ically and logistically impractical. The only 4Department of Integrative Biology, University of California, Berkeley, Berkeley, CA 94720, USA. 5Department of Com- known exception is the Denisovan phalanx, which puter Engineering, Bilkent University, 06800 Ankara, Turkey. contains ~70% endogenous DNA. However, an 6Department of Genome Sciences, University of Washington extremely small fragment of this specimen is 7 SchoolofMedicine,Seattle,WA98195,USA. CAS-MPS Joint available to us, and the absolute number of en- Laboratory for Human Evolution, Institute of Vertebrate Paleon- tology and Paleoanthropology, Chinese Academy of Sciences, dogenous molecules that could be recovered from 100044 Beijing, China. 8Jack Baskin School of Engineering, the sample was too low to generate high genomic University of California, Santa Cruz, Santa Cruz, CA 95064, coverage. USA. 9Arizona Research Laboratories, Division of Biotechnol- A single-stranded library preparation meth- 10 Fig. 1. For single-stranded library preparation, ogy, University of Arizona, Tucson, AZ 85721, USA. Palaeo- od. DNA libraries for sequencing are normally lithic Department, Institute of Archaeology and Ethnography, ancient DNA molecules are dephosphorylated and Russian Academy of Sciences, Siberian Branch, 630090 prepared from double-stranded DNA. Howev- heat-denatured. Biotinylated adaptor oligonucleo- 11 Novosibirsk, Russia. Howard Hughes Medical Institute, Seat- er, for ancient DNA the use of single-stranded tides are ligated to the 3′ ends of the molecules, tle, WA 98195, USA. DNA may be advantageous, as it will double which are immobilized on streptavidin-coated beads *These authors contributed equally to this work. its representation in the library. Furthermore, in andcopiedbyextensionofaprimerhybridizedto † Present address: Department of Genome Sciences, Univer- a single-stranded DNA library, double-stranded the adaptor. One strand of a double-stranded adaptor sity of Washington, Seattle, WA 98195, USA. ‡To whom correspondence should be addressed. E-mail: molecules that carry modifications on one strand is then ligated to the newly synthesized strand. [email protected] (M.M.); [email protected]. that prevent their incorporation into double- Finally, the beads are destroyed by heat to release edu (D.R.); [email protected] (S.P.) stranded DNA libraries could still be represented the library molecules (not shown). 222 12 OCTOBER 2012 VOL 338 SCIENCE www.sciencemag.org RESEARCH ARTICLES also reveals that deamination of cytosine resi- Second, because the Denisovan phalanx comes the range of plausible estimates for human mu- dues occurs with almost equal frequencies at from a female (2), we infer male human DNA tation rates and thus the human-chimpanzee di- both ends of the ancient DNA molecules. Because contamination to be 0.07% (C.I. 0.05 to 0.09%) vergence date. deamination is hypothesized to be frequent in from alignments to the Y chromosome. Third, a When comparing the number of substitutions single-stranded DNA overhangs (9, 10), this sug- maximum-likelihood quantification of autosom- inferred to have occurred between the human- gests that 5′ and 3′ overhangs occur at similar al contamination gives an estimate of 0.22% chimpanzee ancestor and the Denisovan and lengths and frequencies in ancient DNA. (C.I. 0.22 to 0.23%). We conclude that less than present-day human genomes, the number for the Genome sequencing. We sequenced these 0.5% of the hominin sequences determined are Denisovan genome is 1.16% lower (1.13 to libraries from both ends using Illumina’s Ge- extraneous to the bone (i.e., contamination from 1.27%) (Fig. 2) (8). This presumably reflects the nome Analyzer IIx and included reads for present-day humans). age of the Denisovan bone, which had less time two indexes (11),whichwereaddedinthe Coverage of the genome is fairly uniform with to accumulate changes than present-day humans. clean room to exclude the possibility of down- 99.93% of the “mappable” positions covered by Assuming 6.5 million years of sequence diver- stream contamination with modern DNA li- at least one, 99.43% by at least 10, and 92.93% gence between humans and chimpanzees, the braries (1). Sequences longer than 35 base pairs by at least 20 independent DNA sequences (8). shortening of the Denisovan branch allows the (bp) were aligned to the human reference ge- High-quality genotypes (genotype quality ≥40) bone to be tentatively dated to between 74,000 nome (GRCh37/1000 Genome project release) could be determined for 97.64% of the posi- and 82,000 years before present, in general agree- and the chimpanzee genome (CGSC 2.1/panTro2) tions. Whereas coverage in libraries prepared ment with the archaeological dates (2). However, with the Burrows-Wheeler Aligner (12). After from ancient samples
Details
-
File Typepdf
-
Upload Time-
-
Content LanguagesEnglish
-
Upload UserAnonymous/Not logged-in
-
File Pages6 Page
-
File Size-