Genome-Wide Human SNP Array 5.0. Data Sheet

Total Page:16

File Type:pdf, Size:1020Kb

Genome-Wide Human SNP Array 5.0. Data Sheet Pantone #2685 (Blue) 4/Color Equivalent (C:100.0 M:94.0 Y:0.0 K:0.0) Pantone #3288 (Green) 4/Color Equivalent (C:100.0 M:0.0 Y:56.0 K:18.5) Pantone #032 (Red) ® 4/Color Equivalent (C:0.0 M:91.0 Y:87.0 K:0.0) Affymetrix Genome-Wide Human SNP Array 5.0 Black AFFYMETRIX® PRODUCT FAMILY > DNA ARRAYS AND REAGENTS > Data Sheet II ® II Affymetrix Genome-Wide Human SNP Array 5.0 The new single-chip Affymetrix Introduction The Whole-genome Sampling Genome-Wide Human SNP Array The new single-chip Affymetrix Genome- Assay 5.0 features single nucleotide Wide Human SNP Array 5.0 contains all The Affymetrix Genome-Wide Human polymorphisms (SNPs) from the 500,568 single nucleotide polymorphisms SNP Nsp/Sty Assay Kit 5.0/6.0 (P/N original two-chip Mapping 500K (SNPs) from the two-array Mapping 500K 901152, 901015) was developed and vali- Array Set, as well as additional Array Set as well as an additional 420,000 dated for use in conjunction with the non-polymorphic probes that can non-polymorphic probes that can measure Genome-Wide Human SNP Array 5.0 measure other genetic differences, other genetic differences, such as copy num- (P/N 901167, 901071). Briefly, total such as copy number variation. The ber variation. SNPs on the array are present SNP 5.0 Array gives researchers a genomic DNA (500 ng) is digested with on 200 to 1,100 base pair (bp) Nsp I or Sty significant increase in information Nsp I and Sty I restriction enzymes and lig- I digested fragments in the human genome, above the original 500K Array Set, ated to adaptors that recognize the cohesive while reducing the array and are amplified using the fifth generation 4 bp overhangs. All fragments resulting processing time. of the Whole-genome Sampling Assay from restriction enzyme digestion, regard- (WGSA). This assay now combines the Nsp less of size, are substrates for adaptor liga- and Sty fractions previously assayed on two tion. A generic primer that recognizes the separate arrays. Using the current version of adaptor sequence is used to amplify adap- the Affymetrix® Genotyping Console, a set tor-ligated DNA fragments. PCR condi- of 440,794 SNPs on the array exhibit the tions have been optimized to preferentially performance capabilities detailed in this amplify fragments in the 200 to 1,100 bp data sheet. size range. PCR amplification products for One hundred thousand non-polymorphic each restriction enzyme digest are com- probes were chosen to cover 2,000 germline bined and purified using polystyrene beads. copy number variants (CNV) identified in The amplified DNA is then fragmented, the UCSC Genome Browser database with labeled and hybridized to a Genome-Wide 50 probes each. The other 320,000 were Human SNP Array 5.0. chosen to give even spacing across the The Affymetrix Genome-Wide Human genome, concentrating on areas that were SNP Nsp/Sty Assay Kit 5.0/6.0 contains not already represented by SNPs. Current validated and qualified reagents for the Affymetrix software does not support analy- most critical steps in the assay. This sis of the non-polymorphic probes. includes the PCR primer and adaptors, Please check the Affymetrix web site for reagents to fragment and label the PCR additional updates to the genotyping call- products and several control reagents. Kits ing algorithm and updates to support are available for either 50 or 100 reactions analysis of non-polymorphic probes. (refer to Ordering Information). the Affymetrix® Genotyping Console Manual. Figure 1: The fifth-generation Whole-genome Sampling Assay. Further details can be found in the snp5_probeset_genotypemanual (www.affymetrix.com/products/software/spe- Nsp INsp I Nsp I Sty I Sty I Sty I cific/genotyping_console_software.affx). RE Digestion RE Digestion Performance Data To test the performance of the SNP Array 5.0, Affymetrix and the Broad Institute Nsp Adaptor Ligation Sty Adaptor Ligation jointly ran the 270 samples from the International HapMap Project. In addi- tion, two external sites and one internal validation group ran a plate of 44 PCR: One Primer PCR: One Primer HapMap DNAs, which includes 30 Amplification Amplification unique samples, 10 trios and five samples with multiple replicates. Complexity The arrays that passed the QC call rate Reduction Clean Up threshold were analyzed using the BRLMM-P algorithm at the default set- Fragmentation and ting of 0.05. The average call rate for End-labeling each set was greater than 99 percent, and the concordance with HapMap geno- Hybridization types was observed to be greater than or and Wash equal to 99.5 percent. For the 10 trios, AA BB AB the Mendelian inheritance consistency was found to be greater than 99.9 per- Whole-genome-amplified material pre- sample should be repeated or used for cent. Reproducibility was measured at pared by the Qiagen REPLI-g® kits may downstream analysis. If the sample passes 99.9 percent. also be used as the starting material for the QC call rate, it is expected to have a the Genome-Wide Human SNP Assay minimum BRLMM-P call rate of 97 per- Data analyzed with BRLMM-P (0.05). Kit 5.0/6.0. cent. The user may adjust the default con- 270 fidence score of BRLMM-P to allow geno- HapMap Site 1 Site 2 Internal Genotype Calls Using typing with either greater accuracy or Call Rate 99.71 99.55 99.37 99.63 Affymetrix® Genotyping higher call rates, depending on what is 3 HapMap Console needed for the application of interest . 99.69 99.67 99.56 99.69 Concordance The Genome-Wide Human SNP Array 5.0 Using the current version of the algo- ® Mendelian is used in conjunction with the Affymetrix rithm BRLMM-P, a set of 440,794 SNPs 99.96 99.95 99.94 99.96 Consistency Genotyping Console. Genotyping Console exhibit the performance capabilities detailed implements two algorithms; the Dynamic below. Future versions of the algorithm have Reproducibility NA 99.9 99.9 99.9 Model algorithm (DM) generates a quality the potential to add more SNPs to this control (QC) call rate for each array by test- default set of 440,794 SNPs. In addition, ® ing 3,022 SNPs specifically chosen for eval- advanced users can analyze the full set of Refer to the Affymetrix Genome-Wide uating data quality, and the BRLMM-P 500,568 SNPs using an alternative library Human SNP Nsp/Sty Assay 5.0 Manual (P/N algorithm uses data from multiple arrays to file (CDF) that reveals all of the SNP content 702419) for details on the QC call rate thresholds, as well as procedures on DNA make genotype calls1, 2. BRLMM-P is an on the array. This advanced workflow cannot updated version of the previous BRLMM be conducted using Genotyping Console; it target preparation, target hybridization, flu- (Bayesian Robust Linear Model with can only be performed with the command- idics setup, array scanning and data analysis. Mahalanobis distance classifier) algorithm line “snp5-probeset-genotype” software. FLUIDICS PROTOCOL REQUIRED that only analyzes perfect match probes. The details of the Genotyping Console data GenomeWideSNP5v1_450 The QC call rate for an individual array analysis workflow using BRLMM-P, includ- (at a DM confidence threshold of 0.33) ing step-by-step instructions to install and LIBRARY FILES REQUIRED should be used to determine whether a run the Genotyping Console are described in GenomeWideSNP_5 II II 2 Library files contain information about REFERENCES probe array design layout and other charac- 1. Rabbee N., et al. A genotype calling algorithm for Affymetrix SNP arrays. Bioinformatics 22:7-12 teristics, probe use and content, and scan- (2006). ning and analysis parameters. 2. Affymetrix White Paper, BRLMM: An Improved These files are unique for each probe Genotype Calling Method for the Mapping 500K Array Set. array type. Library files are available 3. Matsuzaki H., et al. Genotyping over 100,000 from the Affymetrix web site at: SNPs on a Pair of Oligonucleotide Arrays. Nature Methods 1:109-111 (2004). www.affymetrix.com/support/technical/lib 4. Papassotiropoulos A., et al. Common Kibra Alleles raryfilesmain.affx. Are Associated with Human Memory Two alternate SNP List (CDF) files are Performance. Science 314(5798):475-8 (2006). available. These files identify which SNPs are available for downstream analysis. GenomeWideSNP_5.cdf is the default set of 440,794 SNPs that are accessible via the Genotyping Console (BRLMM-P algorithm). GenomeWideSNP_5.Full.cdf is for advanced users who wish to look at all SNPs from the previous-generation Mapping 500K Array Set. This CDF file can only be accessed from the command- line tool, snp5-probeset-genotype, which is part of the Affymetrix Power Tools distri- bution. The full CDF file includes SNPs that may have lower per-SNP accuracy or call rates. It is expected that the perform- ance of some of these SNPs will improve with different or future algorithms. Affymetrix products can be pur- chased directly from Affymetrix in the United States and many European countries. For all other territories, refer to our list of distribution partners located at: www.affymetrix.com/site/con- tact/index.affx. II II 3 Product Information Number of SNPs on the array 500,568 Ordering Information Number of SNPs accessible using BAT 2.0 440,794 Number of Arrays 1 DNA Required 500ng Affymetrix® Genome-Wide Human SNP Expected minimum BRLMM-P Array 5.0 Call Rate (0.05) 97 percent ≥ 901167 Contains 50 arrays Average Minor Allele Frequency (MAF) 0.22 901071 Contains 100 arrays Average Heterozygosity 0.31 PCR Primers 1 per sample Affymetrix® Genome-Wide Human SNP Instrumentation GeneChip® Scanner 3000 7G with AutoLoader Nsp/Sty Assay Kit 5.0/6.0 Throughput >21 million
Recommended publications
  • Genome-Wide Human SNP Array 6.0
    Data Sheet Genome-Wide Human SNP Array 6.0 Introduction The Genome-Wide Human SNP Array 6.0 contains more than 906,600 single nucleotide polymorphisms (SNPs) and more than 946,000 probes for the detection of copy number variation. SNPs on the array are present on 200 to 1,100 base pairs (bp) Nsp I or Sty I digested fragments in the human genome, and are amplified using the Genome-Wide Human SNP Nsp/Sty Assay Kit 5.0/6.0. This assay, which is also compatible with the SNP Array 5.0, now combines the Nsp and Sty fractions previously assayed on two separate arrays. SNPs on the SNP Array 6.0 were screened in more than 500 distinct samples, including 270 HapMap samples and separate diversity samples. Approximately 482,000 SNPs are derived from the previous-generation Mapping 500K and SNP 5.0 Arrays. The remaining 424,000 SNPs include tag SNP markers derived from the International HapMap Project. These novel markers have better representation of SNPs on chromosomes X and Y, mitochondrial SNPs, SNPs in recombination hotspots, and new SNPs added to the dbSNP database after completion of the GeneChip® Human Mapping 500K Array Set. This array contains a total of 946,000 non-polymorphic copy number probes. These probes—744,000 originally selected for their spacing and 202,000 selected based on known copy number changes reported in the Toronto Database of Genomic Variants (DGV)—enable you to detect de novo copy number changes and perform association studies by genotyping both SNP and known copy number polymorphism (CNP) loci (as The Genome-Wide Human SNP Array 6.0 reported by McCarroll, et al.).
    [Show full text]
  • An Integrated Map of Genetic Variation from 1,092 Human Genomes
    ARTICLE doi:10.1038/nature11632 An integrated map of genetic variation from 1,092 human genomes The 1000 Genomes Project Consortium* By characterizing the geographic and functional spectrum of human genetic variation, the 1000 Genomes Project aims to build a resource to help to understand the genetic contribution to disease. Here we describe the genomes of 1,092 individuals from 14 populations, constructed using a combination of low-coverage whole-genome and exome sequencing. By developing methods to integrate information across several algorithms and diverse data sources, we provide a validated haplotype map of 38 million single nucleotide polymorphisms, 1.4 million short insertions and deletions, and more than 14,000 larger deletions. We show that individuals from different populations carry different profiles of rare and common variants, and that low-frequency variants show substantial geographic differentiation, which is further increased by the action of purifying selection. We show that evolutionary conservation and coding consequence are key determinants of the strength of purifying selection, that rare-variant load varies substantially across biological pathways, and that each individual contains hundreds of rare non-coding variants at conserved sites, such as motif-disrupting changes in transcription-factor-binding sites. This resource, which captures up to 98% of accessible single nucleotide polymorphisms at a frequency of 1% in related populations, enables analysis of common and low-frequency variants in individuals from diverse, including admixed, populations. Recent efforts to map human genetic variation by sequencing exomes1 individual genome sequences, to help separate shared variants from and whole genomes2–4 have characterized the vast majority of com- those private to families, for example.
    [Show full text]
  • Chromosome SNP Microarray a New High-Density Allele-Specific Diagnostic Platform
    Chromosome SNP Microarray A New High-density Allele-specific Diagnostic Platform Analysis of submicroscopic genomic changes can pair (allele) targets that have two different forms, revealing which form is present at that locus as well as the number of copies of that detect the cause of congenital anomalies and/or DNA segment. CGH-based arrays cannot detect polymorphic allele learning disabilities. targets (only dosage), resulting in a significant advantage for the SNP array. This advantage is based both on added confirmation of Introduction dosage changes through allele comparisons and the identification of Genetic imbalances are often associated with multiple birth defects, syndrome-associated “copy neutral” contiguous stretches of allele developmental delay, growth retardation, and dysmorphic features. homozygosity. The presence of the latter allows for detection of Standard cytogenetic analysis can identify visible chromosomal uniparental disomy for all chromosomes and, when consanguinity alterations, such as an extra chromosome band, but small deletions is present, it will provide the degree as well as the resulting genomic or duplications in the genome cannot be reliably detected. location of regions of recessive allele risk.5,6 Submicroscopic unbalanced rearrangements have been found in approximately 3% of patients with learning disabilities and mental Increase in Genomic Targets retardation of unknown cause using a set of FISH (fluorescence The initial 262,000 SNP microarray has been upgraded to offer a in situ hybridization) probes that can only target the ends of the much more dense array of 1.8 million genomic targets (marker every chromosomes.1 700 bp).7 The ultra dense array is much more sensitive in identifying extremely small genomic variations and more statistically reliable Advances in molecular cytogenetics further improved the sensitivity due to the large increase in markers through which each variation is of testing through the application of microarray-based comparative detected.
    [Show full text]
  • Identity-By-Descent Detection Across 487,409 British Samples Reveals Fine Scale Population Structure and Ultra-Rare Variant Asso
    ARTICLE https://doi.org/10.1038/s41467-020-19588-x OPEN Identity-by-descent detection across 487,409 British samples reveals fine scale population structure and ultra-rare variant associations ✉ Juba Nait Saada 1 , Georgios Kalantzis 1, Derek Shyr 2, Fergus Cooper 3, Martin Robinson 3, ✉ Alexander Gusev 4,5,7 & Pier Francesco Palamara 1,6,7 1234567890():,; Detection of Identical-By-Descent (IBD) segments provides a fundamental measure of genetic relatedness and plays a key role in a wide range of analyses. We develop FastSMC, an IBD detection algorithm that combines a fast heuristic search with accurate coalescent-based likelihood calculations. FastSMC enables biobank-scale detection and dating of IBD segments within several thousands of years in the past. We apply FastSMC to 487,409 UK Biobank samples and detect ~214 billion IBD segments transmitted by shared ancestors within the past 1500 years, obtaining a fine-grained picture of genetic relatedness in the UK. Sharing of common ancestors strongly correlates with geographic distance, enabling the use of genomic data to localize a sample’s birth coordinates with a median error of 45 km. We seek evidence of recent positive selection by identifying loci with unusually strong shared ancestry and detect 12 genome-wide significant signals. We devise an IBD-based test for association between phenotype and ultra-rare loss-of-function variation, identifying 29 association sig- nals in 7 blood-related traits. 1 Department of Statistics, University of Oxford, Oxford, UK. 2 Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA. 3 Department of Computer Science, University of Oxford, Oxford, UK.
    [Show full text]
  • Regions of Homozygosity Identified by SNP Microarray Analysis Aid in The
    ORIGINAL RESEARCH ARTICLE ©American College of Medical Genetics and Genomics Regions of homozygosity identified bySN P microarray analysis aid in the diagnosis of autosomal recessive disease and incidentally detect parental blood relationships Kristen Lipscomb Sund, PhD, MS1, Sarah L. Zimmerman, PhD1, Cameron Thomas, MD2, Anna L. Mitchell, MD, PhD3, Carlos E. Prada, MD1, Lauren Grote, BS1, Liming Bao, MD, PhD1, Lisa J. Martin, PhD1 and Teresa A. Smolarek, PhD1 Purpose: The purpose of this study was to document the ability of was suspected in the parents of at least 11 patients with regions of single-nucleotide polymorphism microarray to identify copy-neutral homozygosity covering >21.3% of their autosome. In four patients regions of homozygosity, demonstrate clinical utility of regions of from two families, homozygosity mapping discovered a candidate homozygosity, and discuss ethical/legal implications when regions of gene that was sequenced to identify a clinically significant mutation. homozygosity are associated with a parental blood relationship. Conclusion: This study demonstrates clinical utility in the identifica- Methods: Study data were compiled from consecutive samples tion of regions of homozygosity, as these regions may aid in diagnosis sent to our clinical laboratory over a 3-year period. A cytogenetics of the patient. This study establishes the need for careful reporting, database identified patients with at least two regions of homozygosity thorough pretest counseling, and careful electronic documentation, >10 Mb on two separate chromosomes. A chart review was conduct- as microarray has the capability of detecting previously unknown/ ed on patients who met the criteria. unreported relationships. Results: Of 3,217 single-nucleotide polymorphism microarrays, Genet Med 2013:15(1):70–78 59 (1.8%) patients met inclusion criteria.
    [Show full text]
  • Genetic Profiles of 103,106 Individuals in the Taiwan Biobank Provide Insights Into the Health and History of Han Chinese
    UCSF UC San Francisco Previously Published Works Title Genetic profiles of 103,106 individuals in the Taiwan Biobank provide insights into the health and history of Han Chinese. Permalink https://escholarship.org/uc/item/9s81s8g7 Journal NPJ genomic medicine, 6(1) ISSN 2056-7944 Authors Wei, Chun-Yu Yang, Jenn-Hwai Yeh, Erh-Chan et al. Publication Date 2021-02-11 DOI 10.1038/s41525-021-00178-9 Peer reviewed eScholarship.org Powered by the California Digital Library University of California www.nature.com/npjgenmed ARTICLE OPEN Genetic profiles of 103,106 individuals in the Taiwan Biobank provide insights into the health and history of Han Chinese Chun-Yu Wei1,4, Jenn-Hwai Yang1,4, Erh-Chan Yeh1, Ming-Fang Tsai1, Hsiao-Jung Kao1, Chen-Zen Lo1, Lung-Pao Chang1, Wan-Jia Lin1, Feng-Jen Hsieh1, Saurabh Belsare 2, Anand Bhaskar 3, Ming-Wei Su1, Te-Chang Lee1, Yi-Ling Lin1, Fu-Tong Liu1, Chen-Yang Shen1, ✉ Ling-Hui Li1, Chien-Hsiun Chen1, Jeffrey D. Wall2, Jer-Yuarn Wu1 and Pui-Yan Kwok 1,2 Personalized medical care focuses on prediction of disease risk and response to medications. To build the risk models, access to both large-scale genomic resources and human genetic studies is required. The Taiwan Biobank (TWB) has generated high- coverage, whole-genome sequencing data from 1492 individuals and genome-wide SNP data from 103,106 individuals of Han Chinese ancestry using custom SNP arrays. Principal components analysis of the genotyping data showed that the full range of Han Chinese genetic variation was found in the cohort.
    [Show full text]
  • SNP Ascertainment Bias in Population Genetic Analyses: Why It Is Important, and How to Correct It Recently in Press
    Prospects & Overviews SNP ascertainment bias in population genetic analyses: Why it is important, and how to correct it Recently in press Joseph Lachanceà and Sarah A. Tishkoff Whole genome sequencing and SNP genotyping arrays African hunter-gatherers and the power can paint strikingly different pictures of demographic of whole genome sequencing history and natural selection. This is because genotyping arrays contain biased sets of pre-ascertained SNPs. In this Due to technological advances and increases in computation- al power, the cost of genotyping has plummeted over the past short review, we use comparisons between high-coverage few years. Because of this, it is now feasible to conduct whole genome sequences of African hunter-gatherers and population genetic analyses of whole genome sequencing data from genotyping arrays to highlight how SNP data. One advantage of whole genome sequencing is that ascertainment bias distorts population genetic inferences. SNP ascertainment bias is reduced compared to alternative Sample sizes and the populations in which SNPs are genotyping technologies. This lack of SNP ascertainment bias discovered affect the characteristics of observed variants. is critical for accurate population genetic analyses where allele frequency distributions are used to infer demographic We find that SNPs on genotyping arrays tend to be older history and scan for past targets of natural selection. Using the and present in multiple populations. In addition, geno- technology of Complete Genomics [1], we recently sequenced typing arrays cause allele frequency distributions to be the whole genomes of 15 African hunter-gatherers at >60 shifted towards intermediate frequency alleles, and coverage [2]. Sequenced individuals included five Pygmies estimates of linkage disequilibrium are modified.
    [Show full text]
  • Mutagenesys: Estimating Individual Disease Susceptibility Based On
    Vol. 24 no. 3 2008, pages 440–442 BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btm587 Databases and ontologies MutaGeneSys: estimating individual disease susceptibility based on genome-wide SNP array data Julia Stoyanovich* and Itsik Pe’er Department of Computer Science, Columbia University, 1214 Amsterdam Avenue, New York, NY 10025, USA Received on May 29, 2007; revised on October 16, 2007; accepted on November 22, 2007 Advance Access publication November 29, 2007 Associate Editor: Jonathan Wren ABSTRACT First and foremost, genetic information remains expensive Summary: We present MutaGeneSys: a system that uses genome- to collect, and it is currently economically prohibitive to wide genotype data to estimate disease susceptibility. Our system make a complete set of an individual’s genotypes of SNPs integrates three data sources: the International HapMap project, available for analysis. Nature Genetics’ ‘Question of the Year’ whole-genome marker correlation data and the Online Mendelian (www.nature.com/ng/qoty) announced the sequencing of the Inheritance in Man (OMIM) database. It accepts SNP data of indivi- entire human genome for $1000 as a goal for the genetics duals as query input and delivers disease susceptibility hypotheses community. Cost-effective methods (e.g. SNP arrays) currently even if the original set of typed SNPs is incomplete. Our system exist for collecting genetic data from 1% to 5% of all 11 million is scalable and flexible: it produces population, technology and human SNPs. This calls for the development of techniques that confidence-specific predictions in interactive time. can effectively utilize partial genetic information for disease Availability: Our system is available as an online resource at http:// prediction.
    [Show full text]
  • Pharmacogenomics and Personalized Medicine: the Plunge Into Next-Generation Sequencing Mia Wadelius1* and Ana AlRevic2
    Wadelius and Alrevic Genome Medicine 2011, 3:78 http://genomemedicine.com/content/3/12/78 MEETING REPORT Pharmacogenomics and personalized medicine: the plunge into next-generation sequencing Mia Wadelius1* and Ana Alrevic2 Abstract technology has led to a drastic drop in the cost (>10,000-fold) and time (from 10 years to 1 week) needed A report on the 9th Annual Cold Spring Harbor/ to sequence a genome. NGS is now being introduced as a Wellcome Trust meeting ‘Pharmacogenomics and method to personalize medicine. Personalized Medicine’, Hinxton, Cambridge, UK, Yingrui Li (Beijing Genomics Institute, China) des- 29 September to 2 October 2011. cribed the sequencing revolution that has made personal Keywords Consortium, genome-wide association genomes affordable, while it remains difficult and costly study, meta-analysis, next-generation sequencing, to interpret the results. Recent whole genome sequencing pharmacogenomics/pharmacogenetics, public health of individuals at the Beijing Genomics Institute has genomics, P4 medicine. shown an excess of rare deleterious SNPs together with extensive structural variations and novel individual specific sequences with potential functional impact. ese In recent years, pharmacogenomics has moved beyond new genetic variants are likely to explain part of the candidate gene and genome-wide association studies missing heritability seen with previous GWASs. Interest- (GWASs) towards truly personalized genomics. e use ingly, Li reported that more ethnic-specific haplotypes of new biotechnological, mathematical and computa- exist than previously thought and that each defined tional tools has enabled an exponential increase in the population will need its own genome-wide tag SNP array. number of biomarkers for drug safety and efficacy; how- is is a general problem that must be considered when ever, their clinical utility in achieving personalized performing pharmacogenetic GWASs in non-Caucasian therapy remains to be determined.
    [Show full text]
  • Human Identification by Genotyping Single Nucleotide Polymorphisms (Snps) Using an APEX Microarray
    Human Identification by Genotyping Single Nucleotide Polymorphisms (SNPs) Using an APEX Microarray Lisa D. White1,2, John M. Shumaker2, Jeffrey J. Tollett2, and Rick W. Staub1 1Identigene, Inc., 7400 Fannin, Suite 1222, Houston, TX 77054 2Dept. Molecular & Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston TX 77030 ABSTRACT Interest in utilizing single nucleotide polymorphisms (SNPs) for genetic research is rapidly increasing. Nuclear SNPs are typically biallelic loci that segregate in a Medelian fashion and are the most common type of genetic variation in humans. They are eminently suited to a broad range of applications that include genome mapping, pharmacogenomics, and genotyping for disease diagnostics and genetic identity. By applying Arrayed Primer Extension (APEX) technology to a suite of 24 SNP loci simultaneously, we have successfully developed a prototype identity chip for use in DNA paternity testing and forensic identification. These 24 SNP loci were chosen based upon reported human population allele frequencies of approximately 0.5 and produce a combined match probability of approximately 16 billion. Sense and antisense oligonucleotide chip primers were designed flanking each SNP site. The use of sense and antisense primers increases the accuracy of the chip by providing an internal quality control check (the sense SNP allele detected must be complementary to the antisense SNP allele). Amplification of the SNP loci is accomplished by modified multiplex PCR technology. A template-dependent DNA polymerase extends each primer in the array with a dye-labeled ddNTP terminator. Detection of the labeled SNP sites is performed by a multicolor (4-color) Total Internal Reflection Fluorescence (TIRF) reader.
    [Show full text]
  • Europe PMC Funders Group Author Manuscript Nat Genet
    Europe PMC Funders Group Author Manuscript Nat Genet. Author manuscript; available in PMC 2016 April 01. Published in final edited form as: Nat Genet. 2015 October ; 47(10): 1121–1130. doi:10.1038/ng.3396. Europe PMC Funders Author Manuscripts A comprehensive 1000 Genomes-based genome-wide association meta-analysis of coronary artery disease A full list of authors and affiliations appears at the end of the article. # These authors contributed equally to this work. Abstract Existing knowledge of genetic variants affecting risk of coronary artery disease (CAD) is largely based on genome-wide association studies (GWAS) analysis of common SNPs. Leveraging phased haplotypes from the 1000 Genomes Project, we report a GWAS meta-analysis of 185 thousand CAD cases and controls, interrogating 6.7 million common (MAF>0.05) as well as 2.7 million low frequency (0.005<MAF<0.05) variants. In addition to confirmation of most known CAD loci, we identified 10 novel loci, eight additive and two recessive, that contain candidate genes that newly implicate biological processes in vessel walls. We observed intra-locus allelic heterogeneity but little evidence of low frequency variants with larger effects and no evidence of synthetic association. Our analysis provides a comprehensive survey of the fine genetic architecture of CAD showing that genetic susceptibility to this common disease is largely determined by common SNPs of small effect size. Europe PMC Funders Author Manuscripts Coronary artery disease (CAD) is the main cause of death and disability worldwide and represents an archetypal common complex disease with both genetic and environmental determinants1,2.
    [Show full text]
  • Impact of SNP Microarray Analysis of Compromised DNA on Kinship Classification
    bioRxiv preprint doi: https://doi.org/10.1101/2021.06.25.449870; this version posted June 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license. Impact of SNP microarray analysis of compromised DNA on kinship classification success in the context of investigative genetic genealogy Short title: SNP microarrays on compromised DNA for genetic genealogy Jard H. de Vries1,§, Daniel Kling2,§, Athina Vidaki3, Pascal Arp1, Vivian Kalamara3, Michael M.P.J. Verbiest1, Danuta Piniewska-Róg4,5, Thomas J. Parsons6, André G. Uitterlinden1,7, Manfred Kayser3,* 1 Erasmus MC, University Medical Center Rotterdam, Department of Internal Medicine, Dr. Molewaterplein 40, 3015 GD, Rotterdam, the Netherlands 2 Department of Forensic Genetics and Toxicology, National Board of Forensic Medicine, Artillerigatan 12, 587 58 Linköping, Sweden 3 Erasmus MC, University Medical Center Rotterdam, Department of Genetic Identification, Dr. Molewaterplein 40, 3015 GD, Rotterdam, the Netherlands 4 Malopolska Centre of Biotechnology, Jagiellonian University, 30-387 Krakow, Poland 5 Department of Forensic Medicine, Jagiellonian University Medical College, 31-531 Krakow, Poland 1 bioRxiv preprint doi: https://doi.org/10.1101/2021.06.25.449870; this version posted June 25, 2021. The copyright holder for this preprint (which was not certified by peer review) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under aCC-BY-NC-ND 4.0 International license.
    [Show full text]