Construction of a Cdna Library
Total Page:16
File Type:pdf, Size:1020Kb
Learning Objectives : • Understand the basic differences between genomic and cDNA libraries • Understand how genomic libraries are constructed • Understand the purpose for having overlapping DNA fragments in genomic libraries and how they are generated • Understand how cDNA libraries are constructed and the use of reverse transcriptase for their construction • Understand the rationale for library screening • Understand the method of plaque hybridization • Understand the four methods for library screening and when they are put into use Molecular cloning in bacterial cells…. This strategy can be applied to genomic DNA as well as cDNA Library construction • two types of libraries • a genomic library contains fragments of genomic DNA (genes) • a cDNA library contains DNA copies of cellular mRNAs • both types are usually cloned in bacteriophage vectors Construction of a genomic library vector DNA (bacteriophage lambda) • lambda has a linear double- stranded DNA genome • the left and right arms are essential for the phage replication cycle • the internal fragment is dispensable Bam HI sites “left arm” “right arm” internal fragment (dispensable for phage growth) NNGGATCCNN human genomic DNA (isolated from Bam HI sites: many cells) NNCCTAG GNN cut with Bam HI cut with Sau 3A (4-base cutter) (6-base cutter) which has ends compatible with Bam HI: NNN GATCNNN internal fragment NNNCTAG NNN remove internal fragment isolate ~20 kb fragments “left arm” “right arm” “left arm” “right arm” combine and treat with DNA ligase “left arm” “right arm” package into bacteriophage and infect E. coli 5 6 2 3 1 4 7 • genomic library of human DNA fragments in which each phage contains a different human DNA sequence Partial restriction enzyme digestion allows cloning of overlapping fragments a “contig” • isolation of ~20 kb fragments provides optimally sized DNAs for cloning in bacteriophage • partial digestion with a frequent-cutter (4-base cutter) allows production of overlapping fragments, since not every site is cut • overlapping fragments insures that all sequences in the genome are cloned • overlapping fragments allows larger physical maps to be constructed as contiguous chromosomal regions (contigs) are put together from the sequence data • number of clones needed to fully represent the human genome (3 X 109 bp) assuming ~20 kb fragments • theoretical minimum = ~150,000 • 99% probability that every sequence is represented = ~800,000 All possible sites: Results of a partial digestion: = uncut = cut Genomic Library making… The partial digest is one of the most important steps. Why??? Due to the production of overlapping DNA fragments The production of a cDNA library Construction of a cDNA library • reverse transcriptase makes a DNA copy of an RNA The life cycle of a retrovirus depends on reverse transcriptase retrovirus 2. the capsid is uncoated, releasing genomic RNA and reverse transcriptase 3. reverse transcriptase 1. virus enters cell makes a DNA copy and looses envelope 4. then copies the DNA strand to make it double-stranded DNA, 6. it is translated into viral proteins, removing the RNA with RNase H and assembled into new virus particles 5. the DNA is then integrated into the host cell genome where it is transcribed by new viruses host RNA polymerase II • cDNA library construction 5’ AAAAA 3’ mRNA (all mRNAs in cell) anneal oligo(dT) primers of 12-18 bases in length 5’ AAAAA 3’ 3’ TTTTT 5’ add reverse transcriptase and dNTPs 5’ AAAAA 3’ 3’ TTTTT 5’ cDNA add RNaseH (specific for the RNA strand of an RNA-DNA hybrid) and carry out a partial digestion 5’ 3’ AA TTTTT short RNA fragments serve as primers for second strand synthesis using DNA polymerase I 5’ AAAAA 3’ TTTTT short RNA fragments serve as primers for second strand synthesis using DNA polymerase I 5’ AAA 3’ TTTTT DNA polymerase I removes the remaining RNA with its 5’ to 3’ exonuclease activity and continues synthesis 5’ AAA 3’ TTTTT DNA ligase seals the gaps 5’ AAAAA double-stranded 3’ TTTTT cDNA 5’ AAAAA 3’ TTTTT NNNNNNNNG EcoRI linkers are ligated to both ends NNNNNNNNCTTAA using DNA ligase AAAAANNNNNNNNG TTTTTNNNNNNNNCTTAA 5’ AATTCNNNNNNNN 3’ GNNNNNNNN • double-stranded cDNA copies of mRNA with EcoRI cohesive ends are now ready to ligate into a bacteriophage lambda vector cut with EcoRI EcoRI sites cDNAs “left arm” “right arm” combine cDNAs with lambda arms and treat with DNA ligase “left arm” “right arm” package into bacteriophage and infect E. coli 5 6 2 3 1 4 7 • cDNA library in which each phage contains a different human cDNA DNA libraries • Genomic DNA libraries contains both introns and exons and promoters etc… – Usually made with 4 base cutters that cut frequently ( every 275 bases or so). – The production of overlapping sequences is due to partial digestion. – Libra sqry complexity is important to make sure that the sequence you are looking for is found in the DNA that has been sampled. – N = ln (1-P) / ln (1-f) where N = number of clones, P = probability that the DNA fragment is found in your library and f = the frequency of the DNA in your library. Genomic DNA complexity • To screen for a clone in a library usually want a 99% probability that your clone is found there. • Frequency is the size of the DNA fragment in the library/the size of the haploid genome. For a lambda library 17 kb (1.7 x 104) is the average size of library. The size of the genome is 3 x 109 bp F = 1.7 x 104 / 3 x 109 bp N = ln (1-.99) / ln (1- [1.7 x 104 / 3 x 109]) N = ln .01 / ln (1 - 0.56 x 10-5) N = -4.6061702 / -0.0000056 N = 822,351 clones Genome equivalents • How many genome equivalents are there in this library? How do you calculate this? 822,351 x 1.7 x 104 bps = 1.40 x 1010 bps Divide by the genome size 3.0 x 109 bps = 4.67 times the genome equivalent How many positives will you get if you screen for a single copy gene? Insertional mutagensis • In all of the vectors that are currently used to date there is a system that can either identify or select for vectors containing clones. This is the backbone of recombinant DNA technology. • Initial vectors involved the cloning into a antibiotic resistance gene making a bacteria containing a vector with a DNA fragment sensitive to the antibiotic. This is not the best situation, Why? Insertional mutagenesis II • The use of the beta-galactosidase gene for an insertional mutagenesis target allowed the screening of all clones for those that contained inserts by a simple blue white color assay. This gene cleaves X gal (chromagen) to give rise to a blue dye that colors the bacteria or phage plaque. This allows the screening those plasmids or phage particles that contain DNA disrupting the target gene. Insertional mutagenesis III • In addition suppressor tRNA genes can be used to identify YAC that contain an insert. The suppressor tRNA can suppress the effects of a Ade2 ochre mutation. This gives a white yeast colony. When the tRNA gene is disrupted the colonies are pink due to the accumulation of a precursor of Adenine. Pink colonies are what is desired. See Figure 4.16 Clones are usually characterized first by restriction digestion. This DNA fragment was digest with various enzymes giving rise to specific sizes. These can be used to generate a restriction map Vectors for library construction • Plasmid vectors – Small circles of DNA that contain a selection marker like antibiotic resistance. – Insertional mutagenesis target with a multi cloning site. – A variety DNA replicons. Bacterial, Yeast. • Maximum size of insert is about 10 kb. Lambda and Cosmid vectors • Bacteriophage lambda can be used as a cloning vector. It has a genome of about 50 kb of linear DNA. Its life cycle is condusive to the use as a cloning vector… The lytic cycle can be supported by only a portion of the genes found in the lambda genome. Lambda life cycle. The lytic life cycle produces phage particles immediately The lysogenic life cycle requires genes in the middle of the genome, which can be replaced Lambda insertion and replacement vectors • Only 37 to 52 kb DNA fragments can be packaged into the lambda head. This can be done in vitro .Because the middle portion of the lambda genome can be replace if the lytic life cycle is used up to 23 kb DNA can be inserted in lambda genome. These are used for genomic DNA libraries. • Insertion vectors can hold up to 7 kb of cDNA. Lambda genome In vitro Packaging of ligated lambda DNA. Cosmid vectors • A cosmid is a hybrid between a lambda vector and a plasmid. The COS sites are the only thing that is necessary for lambda DNA packaging. Therefore if one can ligate COS sites about 50 kb apart then the ligation products can be in vitro packaged. Therefore cosmid vectors can contain 33 to 45 kb. Cosmid vector ligation Making a Genomic Library with Cosmids • Partial digest TetR 21.5 kb cos EcoRI • Ligation into site EcoRI Final Steps of a Genomic Library • Package into heads and plug with tails • Transduce E. coli receptor cell • Select white colonies with tetR • Check for plasmid • Screen in your mutant for phenotype restoration Things You Should Remember • Some plasmids are used as vectors or cloning vehicles but they are limited to the amount of DNA that can be cloned. • A cosmid is a plasmid that has at least one COS (cohesive end site). • COS comes from a bacteriophage. • A genomic library contains at least one copy of every gene in an organism. Cloning large DNA fragments • Due to the large size of the human genome and the fact that many genes are very large and some DNA fragments cannot be replicated in lambda other vector systems needed to be developed.