What is a digital image?
A digital image is literally a matrix of numbers.
In grayscale images, how many values are there per pixel?
One value per pixel
Commonly stored as 1 byte (8 bits), where 0 = black and 255 = white.
A grayscale image can be treated as a function: f(x,y)→______?
intensity
Here, x,y = pixel location and f(x,y) = brightness at that pixel.
How can a colour image be represented?
Each pixel represents colour intensity in all three channels.
In Python, how are images stored?
As arrays (matrices)
For an image im of size N × M × 3, im[y, x, 0] → red value, im[y, x, 1] → green value, im[y, x, 2] → blue value.
What is the purpose of image filtering?
Filtering creates a new image where each output pixel depends on a local neighbourhood of the input image.
What does the sliding window idea in filtering involve?
This is the basis of convolution and cross-correlation.
Define cross-correlation in image processing.
The kernel is applied as-is (not flipped).
What is the difference between convolution and cross-correlation?
Convolution involves flipping the kernel horizontally and vertically before applying
Convolution is commutative and associative.
What is linear filtering?
This includes mean filters, Gaussian filters, and sharpening filters.
What is a mean filter?
It smooths the image, reduces noise, but blurs edges.
What does padding do in image processing?
Adds extra border pixels (often zeros)
Padding controls output image size.
What is stride in the context of image filtering?
Step size of the sliding window
A stride of 1 results in detailed output, while a larger stride leads to smaller output.
Name some linear filter examples.
Sharpening emphasizes differences between pixels.
What is the process of sharpening via detail extraction?
This is also called a high-pass filter.
What are ringing artifacts?
Oscillations near sharp edges caused by using a box (mean) filter
This occurs because the box filter has sharp cut-offs in frequency space.
Describe a Gaussian filter.
It smooths images naturally and preserves edges better than mean filters.
When is Gaussian filtering preferred over mean filtering?
When the image has sharp edges and smooth noise reduction without strong artifacts is desired
Mean filter is simpler but causes ringing and edge blurring.
What is a median filter?
Replaces each pixel with the median of its neighbourhood
It is robust to outliers and preserves edges better than mean or Gaussian filters.
What is the purpose of thresholding in image processing?
Convert image to binary
Rule: If pixel ≥ threshold → white, else → black.
Summarize the key points about images and filtering.
These concepts are fundamental in image processing.
What is the core idea of Computer Vision?
Every image tells a story
Computer Vision aims to understand that story automatically.
What is the main goal of Computer Vision?
Extract meaning from pixels
This includes understanding geometric shape, identifying objects and people, and interpreting scenes.
What do humans see in images?
Computers see a grid of numbers (pixel intensities).