Gene Finding: Motivation
CSE 527 Sequence data flooding in Computational Biology What does it mean? protein genes, RNA genes, mitochondria, chloroplast, regulation, replication, structure, repeats, transposons, unknown stuff, … Gene Prediction More generally, how do you learn from complex data in an unknown language
7
Protein Coding Nuclear DNA Biological Basics
Focus of this lecture Central Dogma: Goal: Automated annotation of new seq data transcription translation State of the Art: DNA RNA Protein In Eukaryotes: predictions ~ 60% similar to real proteins