understanding CNN Flashcards

(31 cards)

1
Q

What is a convolution neural network (CNN)

A

deep neural network designed for preprocessing visual data
they detect patterns in images by using convolutions layers to detect spatial hierarchies of patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what is spacial locality in images

A

images contain spacial structure and therefore nearby pixels are highly correlated

features therefore exist in local regions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do CNN exploit spacial locality

A

local receptive fields

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is convolution

A
  • Convolution is a mathematical operation
    applies a small matrix (kernel) to an image
    to extract features.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What mathematical operation is involved in convolution

A

A mathematical operation where:
A small matrix (kernel/filter) slides over the image
Performs:
Element-wise multiplication
Summation
Produces a feature map

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does a kernel detect

A

Filters detect edges, textures, and objects.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the stride controls

A

Stride controls movement of the filter, padding
preserves spatial dimensions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does padding do

A

Controls output size:
valid → no padding
→ Output smaller than input
same → zero padding
→ Output same size as input

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Define convolution layers

A

Extract feature maps from images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

define pooling layers

A

Reduce spatial size (dimensionality) and retain key information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do activation functions do

A

Activation Functions to introduce non-linearity, or generating probabilities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What do fully connected layers do

A

Perform final decision-making.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is step one of CNN back propagation

A

CNNs process input images using filters to
find patterns/features. (convolution+pooling)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is step 2 of CNN back propagation

A

Loss Function to measure difference between
predicted and actual class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is step 3 of CNN backpropagation

A

CNN compute gradients using chain rule
(backpropagation).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the final step of CNN backpropagation

A

CNNs adjust filters weights by minimising
error using Optimisation algorithms (SGD,
Adam).

17
Q

Explain convolution layers in more detail

A

A kernel (filter) slides over the image, performing element-wise multiplication and summation.

18
Q

Define stride

A

stride refers to the step size by which the convolutional filter (or kernel) moves across the input
image during the convolution operation.

19
Q

What does padding=’valid’ mean

A

padding=’valid’ means that the convolution is only computed where the input and the filter fully overlap, and
therefore the output is smaller than the input.

20
Q

what does padding=’same’ mean

A

padding=’same’ means that we have an output that is the same size as the input, for which the area around the input
is padded with zeros.

21
Q

show a CNN architecture example

A
  1. Simple CNN Architecture Example
    Given:
    Input: 32×32 image
    No padding
    Stride = 1
    Layer 1:
    3×3 kernel
    16 filters
    Output: 30×30×16
    Parameters:
    3×3×16 = 144
    Layer 2:
    3×3 kernel
    16 input channels
    32 filters
    Output: 28×28×32
    Parameters:
    3×3×16×32 = 4,608
    Fully Connected Layer:
    28×28×32 = 25,088 inputs
    10 outputs
    Parameters:
    25,088 × 10 = 250,880
    Total Parameters:
    144 + 4,608 + 250,880 = 255,632
22
Q

What is a DCNN

A

A DCNN is simply a CNN with many convolutional layers stacked together.

23
Q

What is the key idea of DCNN

A

Key Idea:
Early layers → detect simple features (edges)
Middle layers → detect shapes/textures
Deep layers → detect high-level objects
👉 Depth = ability to learn hierarchical features.

24
Q

What is the purpose of the pooling layer

A

Purpose:
Reduce spatial dimensions
Reduce computation
Improve generalisation
Control overfitting

25
What are types of pooling
Types: Max Pooling Takes maximum value Keeps strongest features Average Pooling Takes mean value
26
What is batch normalisation
A technique to stabilise deep learning training by working as a slight regulariser, reducing overfitting. * Normalises the output of each layer to ensure steady learning. * Reduces sensitivity to weight initialisation.
27
What is the problem without batch normalisation
Outputs of each layer can become too large or too small vanishing gradients exploding gradients
28
Explain vanishing gradients
tiny updates cause the network to learn too slowly
29
Explain exploding gradients
large updates make training unpredictable
30
What are techniques to improve generalisation
Hyper parameter optimisation: - Learning rate - batch size - Number of filters - kernel size
31