Architecture of large projects in bioinformatics 1000-717ADP
Data formats in bioinformatics.
Popular software libraries (BioPerl, BioPython).
Most important bioinformatics databases (UniProt, PDB, RefSeq, GenBank, ENA, InterPro, etc.)
Software licensing for scientific purposes. Free-software licensing. Patents.
Generic model Organism database (GMOD) project - assumptions, history and usage.
Genome browsers, problem description and state of the solutions.
High-performance computing (HPC)
Version control systems (CVS, SVN, git), and online collaboration ad distribution platforms (github, sourceforge).
Software testing, automated testing frameworks.
Scientific workflow systems - taverna and galaxy. MyExperiment platform. Reproducible research.
Literate programming idea and sweave, markdown, software documentation.
Interactive scripting platforms, Rstudio, Jupyter.
Main fields of studies for MISMaP
biology
computer science
Type of course
Mode
Prerequisites (description)
Course coordinators
Assessment criteria
Homework for some laboratories. Team project and a presentation on a chosen subject
Bibliography
Materials on the website:
https://www.mimuw.edu.pl/~lukaskoz/teaching/adp/
Additional information
Information on level of this course, year of study and semester when the course unit is delivered, types and amount of class hours - can be found in course structure diagrams of apropriate study programmes. This course is related to the following study programmes:
Additional information (registration calendar, class conductors, localization and schedules of classes), might be available in the USOSweb system: