Image processing 2400-ZEWW988
The course will sequentially cover the following topics:
1. Image preprocessing:
conversion to greyscale, normalisation, contrast enhancement (e.g. histogram equalisation)
noise reduction using Gaussian and median filtering,
resizing, sharpening, blurring,
data augmentation techniques such as rotation, flipping, cropping, and random colour alterations (colour jittering).
2. Geometric transformations:
basic transformations including translation, scaling, rotation, and affine transformations,
perspective transformations and homography,
interpolation methods: nearest-neighbour, bilinear, and bicubic.
3. Morphological operations:
processing of binary images,
erosion, dilation, opening, and closing,
hit-or-miss, top-hat, and black-hat operations.
4. Edge detection and filtering:
filtering fundamentals: box filter, Gaussian, and median filters,
edge detection algorithms: Sobel, Prewitt, Laplacian,
Canny edge detector: gradient calculation, non-maximum suppression, and hysteresis thresholding,
low-pass vs. high-pass filtering.
5. Feature extraction:
classical descriptors: corner detectors – Harris, Shi-Tomasi; keypoint detectors – SIFT, SURF, ORB,
descriptors and matching: BRIEF, FREAK, feature matching using FLANN.
6. Image segmentation:
semantic and instance segmentation,
thresholding (Otsu’s method), the Watershed algorithm,
region-growing methods,
U-Net architecture and its variants.
7. Frequency-domain transformations:
Fourier transform (DFT, FFT): amplitude and phase analysis
low-, high-, and band-pass filtering,
Discrete Cosine Transform (DCT),
Wavelet Transform: multi-resolution analysis and denoising.
Type of course
Course coordinators
Learning outcomes
Students will learn how to prepare images for further analysis. Additionally, they will be introduced to various image processing techniques, including geometric transformations, morphological operations, edge detection, feature extraction methods, image segmentation, and frequency-domain transformations. By the end of the course, students will be able to analyse images, selecting appropriate methods based on the specific nature of the encountered problem. Furthermore, students will also be aware of the current challenges and issues related to image processing.
Assessment criteria
The final grade will be determined based on: a home-taken project (70% of the grade) and a project presentation (30% of the grade).
The assessment will be both written (project) and oral (project presentation).
Bibliography
Basic:
Gonzalez, R., & Woods, R. (2017). Digital Image Processing (4th ed.). Pearson.
Sonka, M., Hlavac, V., & Boyle, R. (2013). Image processing, analysis and machine vision. Springer.
Szeliski, R. (2022). Computer vision: algorithms and applications. Springer Nature.
Supplementary:
Jähne, B. (2005). Digital image processing. Springer Science & Business Media.
O'Gorman, L., Sammon, M. J., & Seul, M. (2008). Practical algorithms for image analysis with CD-ROM. Cambridge University Press.
Additional information
Additional information (registration calendar, class conductors, localization and schedules of classes), might be available in the USOSweb system: