The Mapping of Chromosome 16
Total Page:16
File Type:pdf, Size:1020Kb
The Mapping of "hromosomeu The The Mapping of Chromosome 16 Setting the Stage mapping techniques had been developed of lambda-phage clones, each containing and were being applied to the genomes DNA from one of the twenty-four hu- Both the molecular and the physical of some of the favorite organisms of man chromosomes (see "Libraries from technology for constructing physical molecular biologists. Flow-sorted Chromosomes"). Those maps of complex genomes have devel- Cassandra Smith and Charles Cantor chromosome-specific libraries were oped at a blistering pace over the past had used pulsed-field gel electrophoresis designed as a source of probes to five years, due largely to the initiation to order the very large restriction frag- find polymorphic DNA markers for of the Human Genome Project. These ments produced by cutting the E. coli constructing genetic-linkage maps (see technologies include the cloning of very genome with two rare-cutting restriction "Modem Linkage Mapping") and as large DNA fragments, electrophoretic enzymes. The resulting long-range a source of clones for rapid isolation separation of million-base-sized DNA restriction map of E. coli demonstrated of genes using cDNAs, or coding-region fragments, and sequence-based mapping that pulsed-field gel electrophoresis is probes, to pick out the appropriate clones using the polymerase chain reaction a way to study the long-range order from the libraries. Deaven and his group (PCR)to identify unique sequences of landmarks on the DNA of human were also constructing larger-insert along the genome. The latter provides a chromosomes. Contig maps, or physical chromosome-specific libraries using language for interrelating various types maps of ordered, overlapping cloned cosmid vectors. The large DNA inserts of genome maps. The significance of fragments, were near completion for the were prepared by partially digesting these developments is discussed in Part genomes of E, coli (about 5 million base sorted chromosomes with restriction I1 of "Mapping the Genome." pairs) and the yeast 5'. cerevisiae (about enzymes, thereby creating overlapping In 1988, when our laboratory initiated 13 million base pairs). Those maps were fragments. The cloned fragments would the physical mapping of chromosome constructed using lambda-phage clones, therefore be useful in constructing 16, the cloning of very large DNA which carry an average DNA insert size physical maps of ordered, overlapping fragments in yeast artificial chromo- of 20,000 base pairs. Work had also clones covering extended regions of somes (YACs) was just beginning in begun on mapping the genome of the human chromosomes. Among the first a handful of laboratories and only nematode (100 million base pairs) using chromosome-specific cosmid libraries to one library of YAC clones containing cosmid clones. Cosrnids carry the much be constructed at Los Alamos was one all the DNA in the human genome longer average insert size of 35,000 base for human chromosome 16. had been constructed worldwide. The pairs. Human chromosomes range in size total human-genomic YAC library was The haploid human genome, which from 50 million base pairs for chro- constructed at Washington University, includes one copy of each human chro- mosome 21 to 263 million base pairs where the technique of YAC cloning mosome, has 3 billion base pairs and for chromosome 1. Chromosome 16, had originally been developed. The is therefore about 250 times the size of which is about 100 million base pairs in . polymerase chain reaction had not yet the yeast genome and 30 times the size length, was chosen as our primary target become a standard tool of molecular of the nematode genome. When plans for large-scale physical mapping. We biology, and the use of sequence-tagged for the Human Genome Project were selected chromosome 16 for a number sites (STSs) as unique DNA landmarks being discussed in the late 1980s, it was of technical reasons including: (1) for physical mapping had not yet been natural to consider dividing the human the availability of a hybrid-cell line conceived (see "The Polymerase Chain genome by chromosome and mapping containing a single copy of chromosome Reaction and Sequence-tagged Sites" one chromosome at a time. 16 in a mouse-chromosome background, in "Mapping the Genome"). Thus, Ongoing work at Los Alamos on which permitted accurate sorting of in 1988 the most modem tools for human DNA and on adapting flow- human chromosome 16 from the mouse large-scale physical mapping of human sorting technology to separating in- chromosomes and thus the construction chromosomes were still waiting in the dividual human chromosomes set the of a high-purity chromosome 16-specific wings. On the other hand, a number of stage for the Laboratory to play a key library of cosmid clones for use in role in the Human Genome Project. map construction; (2) identification [Opening pages: large photomicro- In particular, as part of the National of a chromosome 16-specific satellite graph courtesy of Evelyn Campbell; Gene Library Project, a group led by repetitive-sequence probe permitting inset image courtesy of David Ward, Larry Deaven had constructed twenty- accurate purity assessments of sorted Yale University School of Medicine.] four libraries, or unordered collections chromosomes; and (3) the availability, Los Alamos Science Number 20 1992 The Mapping of Chromosome 16 Table 1. Disease Genes Localized to Human Chromosome 16 Location Symbol Cloned Disease HBA Yes Thalassemia PKD1 No Autosomal dominant polycystic kidney disease MEF No Familial Mediterranean fever RTS No Rubinstein-Taybi syndrome CLN3 No Batten's disease (juvenile-onset neuronal ceroid lipofuscionosis) PHKB No Gl ycogen-storage disease, type VIIIb CETP Yes Elevated high-density lipoprotein (HDL), (CETP deficiency) Corneal opacities, anemia, proteinuria with unesterified LCAT Yes hypercholesterolemia (Nomm disease) Richner-Hanhort syndrome, oculocutaneous tyrosinemia I1 (TAT TAT Yes deficiency) ALDOA Yes Hemolytic anemia (ALDOA deficiency) APRT Yes Urolithiasis, 2,5 dihydroxyadenine (APRT) deficiency CYBA No Autosomal chronic granulomatous disease CTM No Marner's cataract CMH2 No Familial hypertrophic cardiomyopathy through collaboration, of a panel of overlapping clones for chromosome 16 clones chosen at random, determine the a large number of hybrid-cell lines would facilitate rapid isolation of those overlaps between pairs of clones from containing portions of chromosome 16. genes not yet cloned. the similarities between fingerprints, and This hybrid-cell panel enables probes It takes about 2500 cosmid clones assemble the clone pairs into contigs, from chromosome 16 to be localized laid end to end to represent all the . or islands of overlapping clones. This into intervals along the chromosome DNA in chromosome 16 once, and so basic clone-to-fingerp~t-to-contigstrat- having an average length of 1.6 million our chromosome 16-specific library of egy, which is described in "Physical base pairs. 25,000 cosmid clones represented a Mapping-A One-Dimensional Jigsaw Chromosome 16 is also interesting to tenfold coverage of the chromosome. In Puzzle" in "Mapping the Genome", had the biomedical community. It contains 1988, with funds from the Department been applied successfully to the mapping gene loci for several human diseases of of Energy, we took on the physical of the E. coli, yeast, and nematode both clinical and economic importance, mapping of chromosome 16. genomes. However, those maps of less including polycystic kidney disease, a complex genomes had taken many years class of hemoglobin disorders, and sev- of work. In addition, the human genome eral types of cancer (including leukemia Developing a Mapping Strategy contains many classes of repetitive and breast cancer). Table 1 lists dis- sequences that tend to complicate the ease genes that have been localized Our initial strategy for constructing an process of building contigs. When faced to chromosome 16 through genetic- ordered-clone, or contig, map for chro- with the mapping of human chromosome linkage analysis. A physical map of mosome. 16 was to fingerprint cosmid 16, which is about ten times larger than Number 20 1992 Los Alums Science The Mapping of Chromosome 16 the yeast genome, we needed to develop a strategy that would increase the speed Various Classes of Human of contig building while retaining the required accuracy. Lander and Waterman's 1988 analysis Repetitive DNA Sequences of random-clone fingerprinting sug- gested the key to increased mapping efficiency. That paper showed that the Described below are the most abundant classes of repetitive DNA on human chro- size of the smallest detectable clone mosomes. The figure shows the locations of these classes on chromosome 16. overlap was an important parameter in Numbers in parentheses indicate the size of continuous stretches of each repetitive determining the rate at which contigs DNA class. would increase in length and therefore the rate at which contig maps would near Telomere Repeat: The tandemly repeating unit TTAGGG located at the very ends completion. In particular, the calculated of the linear DNA molecules in human and vertebrate chromosomes. The telomere rate of progress increases significantly if repeat (TTAGGG)n extends for 5000 to 12,000 base pairs and has a structure the detectable clone overlap is reduced different from that of normal DNA. A special enzyme called telomerase replicates from 50 percent to 25 percent of the the ends of the chromosomes in an unusual fashion that prevents the chromosome clone lengths. from shortening during replication. In the mapping efforts for yeast and E. coli, the overlap between two clones Subtelomeric repeats: Classes of repetitive sequences that are interspersed in the was detected by preparing a restriction- last 500,000 bases of nonrepetitive DNA located adjacent to the telomere. Some fragment fingerprint of each clone sequences are chromosome specific and others seem to be present near the ends and identifying restriction-fragment of all human chromosomes. lengths that were common to the two fingerprints.