CS 4491/CS 7990 SPECIAL TOPICS IN BIOINFORMATICS
Mingon Kang, PhD Computer Science, Kennesaw State University Self Introduction
Instructor: Mingon Kang
http://ksuweb.kennesaw.edu/~mkang9 Or Google “Mingon Kang” and click the first one.
Research interests: Bioinformatics, Machine Learning, Data Mining, and Big Data Analytics
Projects you may be interested in: Several Genomics projects in Bioinformatics Medical image classification Course Information
Instructor: Mingon Kang, PhD Office: J-339 Email: [email protected] include ‘CS4991’ or ‘CS7790’ in the subject of your message when you email. Office Hours: Tuesday: 2-4pm, Wednesday: 10am-noon By appointment Lecture slides, Homework, and other materials are posted on the course web page at: http://ksuweb.kennesaw.edu/~mkang9/?menu=CS4491_ 7990 lecture recordings will be in D2L Choice of Language
You can use your favorite language, but R, Matlab, Python are highly recommended.
The course will briefly introduce R in case you have no experience of those script languages.
Better for file I/O of textual biological data
Better to do matrix manipulation
Fast Prototyping Expected Background
For graduate students: Coursework in data structures and algorithms, or CS 5040 as per admissions analysis.
For undergraduate students: CS 3304, CS 3410
Statistics: good if you’ve had some background, but not required
Molecular biology: no knowledge assumed, but an interest in learning some basic molecular biology is mandatory What is Bioinformatics
Application of computer and information technology to problems in biology, particular molecular biology. NIH definitions:
“Bioinformatics applies principles of information sciences and technologies to make the vast, diverse, and complex life sciences data more understandable and useful. Computational biology uses mathematical and computational approaches to address theoretical and experimental questions in biology. Although bioinformatics and computational biology are distinct, there is also significant overlap and activity at their interface.”
sometimes used synonymously with computational biology or computational molecular biology Other Terminologies
Biomedical informatics? It is defined by National Library of Medicine (NLM) as “the intersection of basic informational and computing sciences with an application domain in biomedicine.”
Biometrics? It is the science of using biological properties to identify individuals; for example, face recognition, finger prints, a retina scan, and voice recognition. Goal of this Course
Understanding the types and sources of data available for computational biology.
Understanding the important computational problems in molecular biology.
Understanding the most significant & interesting algorithms.
Identifying opportunities in this field, and perhaps formulating projects to explore further. What this course will do
Give you an understanding of main issues in molecular biological computing: sequence, structure, and function.
Give you an opportunity to implement some widely used algorithms.
Give you exposure to classic computational problems, as manifested in biology.
Give you exposure to classic biological problems, as represented computationally.
Practice How to read and write research papers Topics in Bioinformatics
Computational Biology
Gene Expression Analysis
Gene Regulatory Network Inference
Markov Chain Model for Gene Identification
Genome-Wide Association Studies (GWAS) and expression Quantitative Trait Loci (eQTL) analysis
Next Generation Sequencing Data Analysis
Deep Learning in Bioinformatics Reference Books
N. Jones & P. Pevzner, "An Introduction to Bioinformatics Algorithms," 2004, ISBN 0262101068
Supratim Choudhuri, Bioinformatics for Beginners: Genes, Genomes, Molecular Evolution, Databases and Analytical Tools, 2014, ISBN: 0124104711 Evaluation (tentative)
Homework Assignment (4-5 assignments: 40%)
Mainly writing codes for algorithm implementation Late assignments will be accepted up to 24 hours after the due date for 50% credit.
Midterm (20%) + Final (20%)
Allows two pages of cheat sheets
Two research papers (5% and 15%)
Using Latex Academic Integrity
Academic dishonesty Cheating Plagiarism Collusion The submission for credit of any work or materials that are attributable in whole or in part to another person Taking an examination for another person Any act designed to give unfair advantage to a student or the attempt to commit