Outline the operation of the 3x3 mean filter.
Technique used to smooth or reduce noise in an image. The mean filter is a linear filter that replaces each pixel with the mean intensity of its 3x3 neighbourhood (the pixel itself and its 8 neighbours).
This is done by summing the 9 intensities in the neighbourhood and dividing by 9.
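A minimal sketch of the 3x3 mean filter on an image stored as a list of rows. The function name and the choice to copy border pixels unchanged are assumptions made for illustration; other border policies (padding, cropping) are equally valid.

```python
def mean_filter_3x3(image):
    """Replace each interior pixel with the mean of its 3x3 neighbourhood."""
    rows, cols = len(image), len(image[0])
    out = [row[:] for row in image]  # assumption: border pixels are copied unchanged
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            total = sum(image[r + dr][c + dc]
                        for dr in (-1, 0, 1) for dc in (-1, 0, 1))
            out[r][c] = total / 9
    return out

# A single bright noise pixel is spread out and reduced, not removed.
noisy = [[10, 10, 10],
         [10, 100, 10],
         [10, 10, 10]]
smoothed = mean_filter_3x3(noisy)  # centre becomes (8 * 10 + 100) / 9 = 20.0
```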
Outline the operation of the 3x3 median filter.
Technique used to smooth or reduce noise in an image. The median filter is a non-linear filter that replaces each pixel with the median intensity of its 3x3 neighbourhood.
This is done by sorting the 9 intensities in the neighbourhood in ascending order and selecting the middle (5th) value. It is effective against impulse (salt-and-pepper) noise and preserves edges better than the mean filter.
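A minimal sketch of the 3x3 median filter, again on a list-of-rows image. The function name and the unchanged border are assumptions for illustration only.

```python
def median_filter_3x3(image):
    """Replace each interior pixel with the median of its 3x3 neighbourhood."""
    rows, cols = len(image), len(image[0])
    out = [row[:] for row in image]  # assumption: border pixels are copied unchanged
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            window = sorted(image[r + dr][c + dc]
                            for dr in (-1, 0, 1) for dc in (-1, 0, 1))
            out[r][c] = window[4]  # middle (5th) of the 9 sorted values
    return out

# An isolated impulse is removed completely, unlike with the mean filter.
salt = [[10, 10, 10],
        [10, 255, 10],
        [10, 10, 10]]
cleaned = median_filter_3x3(salt)
```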
Outline the key elements of the multi-stage Canny edge detection process.
Key elements:
1. Noise Reduction: This is typically done using a Gaussian filter, which smooths the image by suppressing high-frequency noise before the gradient is computed, at the cost of slightly blurring the edges.
Outline an application of the Euler number to binary image analysis. What is a key advantage of this approach?
Object recognition:
Consider a binary image of a scene that contains several objects, such as a car, a tree, and a building. The Euler number is the number of connected components minus the number of holes. Computed per object, it can help to identify and classify each object, since shapes with different numbers of holes have different Euler numbers.
Key Advantage:
Robustness and reliability: Unlike methods that rely on pixel values or geometric features, the Euler number is a topological invariant. It is unaffected by translation, rotation, scaling and smooth deformation of the objects, although noise that creates or removes components or holes will change it.
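A minimal sketch of computing the Euler number as (connected components) minus (holes). It assumes 8-connectivity for the foreground and 4-connectivity for the background, and pads the image with a background border so that the outer background counts as one region. All names are illustrative.

```python
N4 = [(-1, 0), (1, 0), (0, -1), (0, 1)]
N8 = N4 + [(-1, -1), (-1, 1), (1, -1), (1, 1)]

def count_components(grid, value, neighbours):
    """Count connected regions of `value` using a flood fill."""
    rows, cols = len(grid), len(grid[0])
    seen = [[False] * cols for _ in range(rows)]
    count = 0
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != value or seen[r][c]:
                continue
            count += 1
            seen[r][c] = True
            stack = [(r, c)]
            while stack:
                y, x = stack.pop()
                for dy, dx in neighbours:
                    ny, nx = y + dy, x + dx
                    if (0 <= ny < rows and 0 <= nx < cols
                            and grid[ny][nx] == value and not seen[ny][nx]):
                        seen[ny][nx] = True
                        stack.append((ny, nx))
    return count

def euler_number(binary):
    cols = len(binary[0])
    padded = ([[0] * (cols + 2)]
              + [[0] + list(row) + [0] for row in binary]
              + [[0] * (cols + 2)])
    objects = count_components(padded, 1, N8)
    holes = count_components(padded, 0, N4) - 1  # minus the outer background
    return objects - holes

ring = [[1, 1, 1],
        [1, 0, 1],
        [1, 1, 1]]   # one object, one hole
blob = [[1, 1],
        [1, 1]]      # one object, no holes
```

Here `euler_number(ring)` is 0 and `euler_number(blob)` is 1, so the two shapes can be told apart regardless of their size or orientation.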
Outline the operation of the 3x3 Chamfer Algorithm based Distance Transform.
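Calculates, for each pixel in a binary image, an approximation of the distance to the nearest non-zero (feature) pixel, by propagating local distances with a 3x3 mask in two raster passes.
1. Initialise the distance map: 0 at feature pixels and a very large value everywhere else.
2. Forward pass (top-left to bottom-right): update each pixel with the minimum of its current value and the values of its already-visited neighbours (left and the three above) plus a local weight, e.g. 3 for horizontal/vertical and 4 for diagonal neighbours.
3. Backward pass (bottom-right to top-left): repeat with the other half of the mask (right and the three below). The minimum kept at each pixel is the final distance map.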
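A minimal sketch of the 3x3 chamfer distance transform as it is usually formulated: a forward and a backward raster scan, each using half of a 3x3 mask. The 3-4 weights (3 for horizontal/vertical steps, 4 for diagonal steps) and the function name are assumptions; dividing the result by 3 gives an approximate Euclidean distance in pixels.

```python
INF = 10 ** 9

def chamfer_3x3(binary, a=3, b=4):
    """Distance from every pixel to the nearest non-zero pixel (3-4 chamfer)."""
    rows, cols = len(binary), len(binary[0])
    dist = [[0 if binary[r][c] else INF for c in range(cols)]
            for r in range(rows)]
    # (row offset, column offset, weight) for each half of the 3x3 mask
    forward = [(-1, -1, b), (-1, 0, a), (-1, 1, b), (0, -1, a)]
    backward = [(1, 1, b), (1, 0, a), (1, -1, b), (0, 1, a)]

    def sweep(row_order, col_order, mask):
        for r in row_order:
            for c in col_order:
                for dr, dc, w in mask:
                    nr, nc = r + dr, c + dc
                    if 0 <= nr < rows and 0 <= nc < cols:
                        dist[r][c] = min(dist[r][c], dist[nr][nc] + w)

    sweep(range(rows), range(cols), forward)
    sweep(range(rows - 1, -1, -1), range(cols - 1, -1, -1), backward)
    return dist

point = [[0, 0, 0],
         [0, 1, 0],
         [0, 0, 0]]
distances = chamfer_3x3(point)  # [[4, 3, 4], [3, 0, 3], [4, 3, 4]]
```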
Using the faster of the two approaches to Otsu thresholding (inter or intra class based variance), apply Otsu to the image segment illustrated in Figure Q1.3 for a sample threshold value of 1.
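Faster of the 2 approaches = inter-class (between-class) variance. It needs only the weight and mean of each class, whereas the intra-class approach also needs the variance of each class. Both select the same threshold.
Works by dividing the image into 2 classes: foreground pixels with intensity values above the threshold, and background pixels with intensity values at or below the threshold. The goal of the algorithm is to find the threshold value that maximises the inter-class variance, a measure of the separation between the two class means. This is equivalent to minimising the intra-class variance, which is a measure of the spread of intensity values within each class.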
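A minimal sketch of evaluating Otsu's between-class variance for one sample threshold and then searching all thresholds. The pixel values are made up for illustration and are not those of Figure Q1.3; the function names are also assumptions. The search assumes the data contains at least two distinct grey levels.

```python
def between_class_variance(pixels, t):
    """w0 * w1 * (mu0 - mu1)^2 for the classes <= t and > t."""
    low = [p for p in pixels if p <= t]
    high = [p for p in pixels if p > t]
    if not low or not high:
        return 0.0
    w0, w1 = len(low) / len(pixels), len(high) / len(pixels)
    mu0, mu1 = sum(low) / len(low), sum(high) / len(high)
    return w0 * w1 * (mu0 - mu1) ** 2

def otsu_threshold(pixels):
    """Return the first threshold that maximises the between-class variance."""
    candidates = range(min(pixels), max(pixels))
    return max(candidates, key=lambda t: between_class_variance(pixels, t))

segment = [0, 0, 1, 1, 1, 2, 6, 7, 7, 7, 8, 8]  # hypothetical grey levels
score_at_1 = between_class_variance(segment, 1)
best = otsu_threshold(segment)
```

For a single sample threshold such as 1, only `between_class_variance` is needed. The full method repeats this for every candidate threshold and keeps the one with the largest score.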
Outline a Mathematical Morphology based approach to the removal of an illumination gradient from a poorly acquired grey scale image.
Illumination gradients can cause uneven lighting and make it difficult to accurately analyse an image.
1. Choose a structuring element (SE) that is larger than the objects of interest but smaller than the scale over which the illumination changes.
2. Apply a grey scale opening (erosion followed by dilation) with this SE. This removes the bright objects and leaves an estimate of the background illumination. For dark objects on a bright background, use a closing (dilation followed by erosion) instead.
3. Subtract the background estimate from the original image (the top-hat transform). In the closing case, subtract the original image from the closed image.
4. If a binary result is required, apply a single global threshold, which now works because the background is approximately flat.
The illumination gradient should now be significantly reduced and the objects in the image should be more uniformly lit and easier to analyse.
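A minimal sketch of the grey scale opening and top-hat idea on a single scan line, assuming bright objects on a darker background whose brightness ramps from left to right. The flat 3-pixel structuring element, the clipped borders and all names are assumptions chosen to keep the example short; a real image would use a 2D SE.

```python
def erode(signal, half):
    """Grey scale erosion: minimum over a window of width 2 * half + 1."""
    n = len(signal)
    return [min(signal[max(0, i - half):min(n, i + half + 1)]) for i in range(n)]

def dilate(signal, half):
    """Grey scale dilation: maximum over the same window."""
    n = len(signal)
    return [max(signal[max(0, i - half):min(n, i + half + 1)]) for i in range(n)]

def top_hat(signal, half):
    """Original minus its opening (erosion followed by dilation)."""
    background = dilate(erode(signal, half), half)
    return [s - b for s, b in zip(signal, background)]

# Illumination ramp 0, 2, ..., 18 with two equally bright one-pixel objects.
line = list(range(0, 20, 2))
line[3] += 50
line[7] += 50
flattened = top_hat(line, half=1)
```

In `line` the two objects have different grey levels (56 and 64) because of the ramp. In `flattened` both have the same height and the background is close to zero, so one global threshold separates them.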
Outline an example of the Mathematical Morphology Erosion-Dilation Duality concept.
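A fundamental concept in mathematical morphology: erosion and dilation are duals of each other with respect to complementation. Eroding the foreground with an SE gives the same result as dilating the background with the reflected SE and then complementing the result. Example: shrinking the white objects by one pixel with a 3x3 SE is the same as growing the black background by one pixel. Duality does not mean that the operations are inverses of each other: if an image is eroded and then dilated with the same SE (an opening), the resulting image will generally not be the same as the original image.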
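A minimal sketch that checks the duality relation numerically: the erosion of an image A equals the complement of the dilation of the complement of A. It assumes a symmetric 3x3 square SE (so the reflected SE is the same) and treats pixels outside the image as background of A, hence as foreground of its complement. All names are illustrative.

```python
OFFSETS = [(dr, dc) for dr in (-1, 0, 1) for dc in (-1, 0, 1)]  # 3x3 square SE

def window(img, r, c, outside):
    """Values under the SE centred at (r, c); `outside` is used beyond the image."""
    rows, cols = len(img), len(img[0])
    return [img[r + dr][c + dc]
            if 0 <= r + dr < rows and 0 <= c + dc < cols else outside
            for dr, dc in OFFSETS]

def erode(img, outside=0):
    return [[min(window(img, r, c, outside)) for c in range(len(img[0]))]
            for r in range(len(img))]

def dilate(img, outside=0):
    return [[max(window(img, r, c, outside)) for c in range(len(img[0]))]
            for r in range(len(img))]

def complement(img):
    return [[1 - p for p in row] for row in img]

A = [[0, 0, 0, 0, 0],
     [0, 1, 1, 1, 0],
     [0, 1, 1, 1, 0],
     [0, 1, 1, 1, 1],
     [0, 0, 0, 1, 1]]

eroded = erode(A, outside=0)
dual = complement(dilate(complement(A), outside=1))
opened = dilate(erode(A))
```

`eroded` and `dual` are identical, which is the duality. `opened` differs from `A`, which shows that erosion followed by dilation does not restore the original image.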
Using the concept of convolutional based filtering; develop a sequence of robust image processing and analysis steps to detect and highlight all the instances of the template letters “L” (Figure Q1.2) and “T” (Figure Q1.3) in the source image (Figure Q1.1). Assume both the source and template images are binary. You can assume that the template images are of the same scale as their corresponding entries in the source image.
What happens if you apply the templates to a rotated version of the source image?
Finally, we test the solution for robustness: noise, scale and rotation variations.
Outline two different mathematical morphology Point-Pair structuring element based approaches to removing the non-parallel line from the binary image illustrated in Figure Q2.4. Assume the distance between the periodic lines is 16 pixels.
Detail how discrete convolution can be used as a filter in image processing. Give a detailed example of its usage.
Mathematical operation that can be used to filter images in image processing and analysis. Convolution is a neighbourhood operation: the new value of each pixel is a weighted sum of the pixels in a neighbourhood around it, with the weights given by a kernel (mask) that is moved across the image.
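A minimal sketch of discrete 2D convolution used as a filter. It assumes "valid" output (no padding, so the result is smaller than the input) and uses a Sobel-style horizontal gradient kernel purely as an example; the function and variable names are illustrative.

```python
def convolve2d(image, kernel):
    """'Valid' 2D convolution: flip the kernel, then take weighted sums."""
    kh, kw = len(kernel), len(kernel[0])
    out_rows = len(image) - kh + 1
    out_cols = len(image[0]) - kw + 1
    flipped = [row[::-1] for row in kernel[::-1]]  # flipping distinguishes convolution from correlation
    return [[sum(image[r + i][c + j] * flipped[i][j]
                 for i in range(kh) for j in range(kw))
             for c in range(out_cols)]
            for r in range(out_rows)]

sobel_x = [[-1, 0, 1],
           [-2, 0, 2],
           [-1, 0, 1]]

# A vertical step edge: dark on the left, bright on the right.
step_edge = [[0, 0, 9, 9],
             [0, 0, 9, 9],
             [0, 0, 9, 9]]
response = convolve2d(step_edge, sobel_x)  # [[-36, -36]]
```

The large magnitude of the response marks the edge. The sign is negative because true convolution flips the kernel; a mean filter is the same operation with a kernel of nine equal weights of 1/9.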
Outline the operation of a 5x5 Adaptive Threshold function.
A type of thresholding algorithm that automatically computes a local threshold value for each pixel in the image.
1. Define a neighbourhood size: a 5x5 window is used, i.e. 25 pixels centred on the current pixel.
2. Compute the local mean value: using a simple or a weighted average of the window. It is an estimate of the local background intensity of the image.
3. Compute the local threshold value: the sum of the local mean value and a constant offset (the threshold coefficient, which may be negative). This value determines whether the pixel is assigned to the foreground or the background region.
4. Binarise the image: compare the value of each pixel with its local threshold value. Pixels above it are assigned to the foreground (1), the rest to the background (0).
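A minimal sketch of the four steps, assuming a simple (unweighted) local mean, a window that is clipped at the image border, and a threshold of local mean plus offset. The function name and the `offset` parameter are illustrative.

```python
def adaptive_threshold(image, size=5, offset=0):
    """Binarise each pixel against the mean of its size x size neighbourhood."""
    half = size // 2
    rows, cols = len(image), len(image[0])
    out = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            # Assumption: near the border, only pixels inside the image are used.
            window = [image[y][x]
                      for y in range(max(0, r - half), min(rows, r + half + 1))
                      for x in range(max(0, c - half), min(cols, c + half + 1))]
            local_threshold = sum(window) / len(window) + offset
            out[r][c] = 1 if image[r][c] > local_threshold else 0
    return out

# Flat background of 10 with one bright pixel in the centre.
image = [[10] * 5 for _ in range(5)]
image[2][2] = 50
binary = adaptive_threshold(image)
```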
Detail the operation of Freeman Chain Coding when applied to a binary image.
Method for representing the shape and the contour of an object in a binary image. Commonly used in IPA as it provides a compact and efficient way to encode the shape and the contour of the object, without the need for storing the pixels or the edges in the image.
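1. Identify the starting point: the first object pixel found when scanning the image in a predefined search direction (e.g. raster order).
2. Trace the contour of the object: follow the sequence of pixels that form the boundary of the object, from the starting point until the starting point is reached again.
3. Encode the contour of the object: for each move from one boundary pixel to the next, record the direction code (0-7 for 8-connectivity, 0-3 for 4-connectivity). The object is then stored as the start coordinates plus the sequence of codes.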
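A minimal sketch of the encoding step only, assuming the boundary has already been traced into an ordered, closed list of (row, column) pixels. The numbering used here (0 = east, increasing anticlockwise) is one common Freeman convention; other texts number the directions differently.

```python
# Freeman 8-direction codes keyed by (row step, column step).
# Rows grow downwards, so "north" is a row step of -1.
DIRECTION_CODE = {(0, 1): 0, (-1, 1): 1, (-1, 0): 2, (-1, -1): 3,
                  (0, -1): 4, (1, -1): 5, (1, 0): 6, (1, 1): 7}

def chain_code(boundary):
    """Encode an ordered, closed boundary as a list of direction codes."""
    codes = []
    # Pair each pixel with its successor; the last pixel wraps to the first.
    for (r0, c0), (r1, c1) in zip(boundary, boundary[1:] + boundary[:1]):
        codes.append(DIRECTION_CODE[(r1 - r0, c1 - c0)])
    return codes

# Boundary of a 2x2 square traced clockwise from its top-left pixel.
square = [(0, 0), (0, 1), (1, 1), (1, 0)]
codes = chain_code(square)  # [0, 6, 4, 2]: east, south, west, north
```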
Outline the operation of the Marr-Hildreth edge detector. Clearly indicate the role of the Gaussian in this approach.
Develop a sequence of robust image processing and analysis steps using pseudo code to remove the boundary objects from a binary image
Detail the operation of the Maximum-likelihood Classifier for a two class problem.
Method of classifying objects into 2 classes based on their observed characteristics. Operates by finding the class that is most likely to produce the observed characteristics of the object.
Closely related to Bayesian classification: it is the Bayes classifier when both classes are assumed to be equally likely a priori.
Assume 2 classes of objects, C1 and C2, with a single feature(x) to distinguish these classes
Find the probability density function (PDF) for each class.
Measure of probability that an object from a given class will have a given feature value.
Found by measuring x for a large number of samples of each class.
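A minimal sketch of the two-class decision, assuming each class PDF is modelled as a Gaussian whose mean and standard deviation are estimated from training samples. The Gaussian model, the sample values and the function names are all assumptions for illustration. An object with feature value x is assigned to the class whose PDF is larger at x.

```python
import math
import statistics

def fit_gaussian(samples):
    """Estimate (mean, standard deviation) of a class from its samples."""
    return statistics.mean(samples), statistics.stdev(samples)

def gaussian_pdf(x, mean, std):
    return math.exp(-0.5 * ((x - mean) / std) ** 2) / (std * math.sqrt(2 * math.pi))

def classify(x, params_c1, params_c2):
    """Pick the class under which the observed feature value is more likely."""
    if gaussian_pdf(x, *params_c1) >= gaussian_pdf(x, *params_c2):
        return "C1"
    return "C2"

c1_samples = [1.8, 2.0, 2.1, 2.3, 1.9]  # hypothetical feature measurements
c2_samples = [4.8, 5.1, 5.0, 5.3, 4.9]
c1, c2 = fit_gaussian(c1_samples), fit_gaussian(c2_samples)

label_a = classify(2.2, c1, c2)
label_b = classify(4.7, c1, c2)
```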
Outline one approach to image thresholding that is data driven (i.e. not dependent on a user defined fixed threshold value).
Dynamic thresholding:
The threshold value is determined automatically by the algorithm, and is updated as the algorithm processes the image.
Advantage: suitable for applications where the characteristics of the image may vary, or where it is not possible for the user to set the threshold value accurately.
Develop a sequence of robust image processing and analysis steps using pseudo code to find the number of objects (gears) in the binary image (sequence of steps only)
Sequence of steps:
1. Load the input image.
3. Apply a Hough Transform to detect the inner circles of the gears.
Disadvantage of run-coding
Inefficient for noisy or highly detailed images. Run coding itself is lossless: the original image can be reconstructed exactly from the stored runs. However, it only compresses well when the image contains long runs of identical pixels. When the runs are short, the list of runs can be larger than the original image.
Not very versatile, as it is best suited to binary images. It can be applied to images with more than 2 colours or with many grey levels, but runs of identical values are then usually short, so little or no compression is achieved. Neighbourhood operations are also awkward to perform directly on the run-coded data.
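A minimal sketch of run coding one image row as [value, run length] pairs, to show how the size of the code depends on the run lengths. The pair layout and the function names are assumptions; practical schemes often store only the run lengths or the run start and end positions.

```python
def run_length_encode(row):
    """Encode a row as a list of [value, run length] pairs."""
    runs = []
    for pixel in row:
        if runs and runs[-1][0] == pixel:
            runs[-1][1] += 1
        else:
            runs.append([pixel, 1])
    return runs

def run_length_decode(runs):
    return [value for value, length in runs for _ in range(length)]

smooth = [0, 0, 0, 0, 1, 1, 1, 1, 0, 0, 0, 0]
noisy = [0, 1, 0, 1, 1, 0, 1, 0, 0, 1, 0, 1]

smooth_runs = run_length_encode(smooth)  # 3 runs for 12 pixels
noisy_runs = run_length_encode(noisy)    # 10 runs for 12 pixels
```

Both rows decode back to the original exactly, but the noisy row needs 10 pairs (20 numbers) to describe 12 pixels, so run coding expands it instead of compressing it.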
Outline how Convex Deficiencies can be used as a discriminating feature in binary image analysis.
The convex deficiency of an object in a binary image is the set of points that lie inside its convex hull, which is the smallest convex polygon that encloses the object, but do not belong to the object itself.
1. Compute the convex hull of each object in the binary image, using a convex hull algorithm.
2. Subtract the object from its filled convex hull. The remaining regions (bays and holes) are the convex deficiencies.
3. Use the number, area, shape and position of the convex deficiencies as distinctive features for identifying and classifying the objects in the image.
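A minimal sketch of the three steps for a single object. It assumes the hull is taken through the pixel centres (Andrew's monotone chain algorithm), that pixels on the hull boundary count as inside, and that the object pixels are not all collinear. Names are illustrative.

```python
def cross(o, a, b):
    """Z component of (a - o) x (b - o); positive for a left turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points):
    """Andrew's monotone chain; hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def inside(hull, p):
    """True if p is inside or on the boundary of the convex polygon."""
    return all(cross(hull[i], hull[(i + 1) % len(hull)], p) >= 0
               for i in range(len(hull)))

def convex_deficiency(binary):
    """Background pixels (x, y) that fall inside the object's convex hull."""
    obj = {(x, y) for y, row in enumerate(binary)
           for x, v in enumerate(row) if v}
    hull = convex_hull(list(obj))
    return {(x, y) for y, row in enumerate(binary) for x in range(len(row))
            if (x, y) not in obj and inside(hull, (x, y))}

u_shape = [[1, 0, 1],
           [1, 0, 1],
           [1, 1, 1]]
solid = [[1, 1],
         [1, 1]]
```

The "U" has a two-pixel bay, `{(1, 0), (1, 1)}`, while the solid square has an empty convex deficiency, so the size of the deficiency already separates the two shapes.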
Using appropriate pseudo code, develop a convolution based N-tuple procedure capable of detecting the letter “t” in the image showing “can you locate the letter ‘t’ in this sentence”. State any assumptions clearly. Discuss the robustness of this approach.
Reasonably robust to isolated noise, since an N-tuple tests only N selected pixel positions rather than the whole template, and the convolution applies this test at every position in the image.
Limitations: the need for manually selecting N-tuples, and the assumption that the letter “t” is vertically aligned in the image.
Using appropriate pseudo code, outline the operation of the Otsu approach to automated threshold selection. What are the key advantages and disadvantages of this approach?
Advantage:
Automated - does not require the user to specify the threshold value
Based on well-defined objective function
Disadvantage:
Assumes that the image contains 2 dominant intensity levels, and that the histogram of the image can be modelled as a bimodal distribution.
4 key stages in developing a solution