Introduction to computational biology 1000-2N03BO

1. Biological introduction: basic knowledge of molecular biology, structure of nucleic acids and proteins, transcription and translation.
2. Molecular sequence analysis: sequencing by hybridization, algorithms for global and local alignment of two sequences.
3. Mathematical models of molecular evolution: Jukes-Cantor and Kimura models for DNA sequences, PAM and BLOSUM substitution matrices for proteins, statistical significance of alignment scores.
4. Multiple sequence alignment: dynamic programming, greedy algorithms, efficient heuristics (CLUSTALW, T-Coffee, MUSCLE).
5. Hidden Markov Models and their applications to molecular sequences: Viterbi and Baum-Welch algorithms.
6. Searching sequence databases: BLAST algorithm.
7. Finding motifs in DNA sequences, functional enrichment analysis of gene sets.
8. Introduction to phylogenetics: reconstructing phylogenetic trees of single genes and reconciling them.
9. Introduction to genomic data analysis: mapping reads to a reference genome, genome assembly, metagenomics.
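
As an illustration of the kind of algorithm covered under topic 2, global alignment of two sequences can be sketched with dynamic programming (the Needleman-Wunsch recurrence). This is a minimal sketch rather than course material: the function name and the scoring values (match +1, mismatch -1, linear gap penalty -2) are assumptions chosen for the example.

```python
def global_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Return the optimal global alignment score of strings a and b.

    Uses the Needleman-Wunsch recurrence with a linear gap penalty.
    The scoring parameters are illustrative assumptions.
    """
    m = len(b)
    # Row 0: aligning the empty prefix of a against prefixes of b (gaps only).
    prev = [j * gap for j in range(m + 1)]
    for i in range(1, len(a) + 1):
        # Column 0: aligning a[:i] against the empty prefix of b.
        curr = [i * gap] + [0] * m
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            curr[j] = max(
                prev[j - 1] + sub,  # align a[i-1] with b[j-1]
                prev[j] + gap,      # a[i-1] aligned to a gap
                curr[j - 1] + gap,  # b[j-1] aligned to a gap
            )
        prev = curr
    return prev[m]


if __name__ == "__main__":
    print(global_alignment_score("ACGT", "AGT"))
```

Only two rows of the table are kept, so memory is linear in the length of the second sequence. Recovering the alignment itself, rather than just its score, would require the full table or a divide-and-conquer variant.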

The course will be taught in Polish unless non-Polish-speaking students register for it.

Type of course

elective courses

Prerequisites

Algorithms and data structures
Probability theory

Course coordinators

Aleksander Jankowski

Learning outcomes

Knowledge:
1. Has general knowledge of the problems of contemporary computational biology.
2. Has basic knowledge of mathematical models and computational methods used in the description of molecular sequences.

Skills:
1. Can implement fundamental bioinformatics analyses of molecular sequences.
2. Can use advanced bioinformatics tools to analyze experimental data.

Competences:
1. Knows the limitations of their own knowledge and understands the need for further education (K_K01).
2. Is able to manage their time, make commitments, and meet deadlines (K_K05).
3. Is able to use interdisciplinary literature.

Assessment criteria

Theory test, programming assignments, programming homework. Oral exam.

Doctoral students completing the course must additionally read an original research article close to the current research frontier and discuss it with the lecturer.

Bibliography

1. Richard Durbin, Sean R. Eddy, Anders Krogh, Graeme Mitchison, Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids, Cambridge University Press, 1998.
2. Pavel A. Pevzner, Computational Molecular Biology: An Algorithmic Approach, MIT Press, 2000.
3. Warren J. Ewens, Gregory R. Grant, Statistical Methods in Bioinformatics: An Introduction, Springer, 2001.
4. A. Malcolm Campbell, Laurie J. Heyer, Discovering Genomics, Proteomics, and Bioinformatics, CSHL Press, 2007.

Additional information

Information on the level of this course, the year of study and semester in which it is delivered, and the types and number of class hours can be found in the course structure diagrams of the appropriate study programmes. This course is related to the following study programmes:

Additional information (registration calendar, class instructors, location and schedule of classes) may be available in the USOSweb system:

Description of 1000-2N03BO in USOSweb