cDNA SEQUENCING WITH OXFORD NANOPORE
Getting Started In medicine, the identification of differentially With nanopore sequencing, there is no Figure 1. Introduction spliced isoforms and fusion transcripts can upper read length limit: size selection is inform disease diagnosis and treatment. not necessary and whole transcripts can be sequenced end-to-end in single reads, RNA sequencing provides a rapid, effective enabling simple, accurate assembly, with the The ability to sequence RNA has method of virus identification, and in ability to distinguish between highly similar provided invaluable insight into developmental biology the ability to track isoforms, identify novel transcripts, and detect the study of living things across transcriptional changes over time helps to fusions. Furthermore, rates of multimapping numerous applications, shedding resolve the developmental mechanisms at are significantly lower . light on the dynamics of the (see figure 1) play. Transcriptomics also has applications in 2 Multimapping reads (%) transcriptome, from single cells environmental and agricultural science, such Accurate isoform quantification is attainable to whole tissues. as in strategies for pest management. with nanopore sequencing, as demonstrated using spike-in RNA standards (ERCC) Despite the numerous advances made in Short read ONT cDNA (see figure 2). transcriptome analysis using short-read cDNA sequencing technologies, these methods Figure 2. have several shortcomings. a) Standard PCR-cDNA b) Strand-specific PCR-cDNA • Transcripts are generally several kilobases long: the typical human gene contains 12 exons each with an average length of
1 236 base pairs and alternative splicing has 2 2
2 observed counts observed counts been observed in 95% of human genes. 10 Spearman r = 0.989; 10 Spearman r = 0.96; p < 0.001 p < 0.001 Log Log • Short reads only partially cover a 1 1 Log expected counts Log expected counts transcript’s length, making accurate 10 10 isoform assembly a difficult process c) Direct cDNA d) Direct RNA reliant on computational reconstruction; short reads also exhibit high rates of multimapping. 2 2 observed counts
Spearman r = 0.989; observed counts Spearman r = 0.97; 10 10 p < 0.001 p < 0.001 Log Log 1 1
Log10 expected counts Log10 expected counts References 1 1 2 RNA n n 1. NCBI: https://www.ncbi.nlm.nih.gov/Web/ Newsltr/Spring03/human.html
2. NCBI: https://www.ncbi.nlm.nih.gov/Web/ Newsltr/Spring03/human.html
2 Figure 3.