Neural-network Algorithms. Flashcards

(67 cards)

1
Q

What is an artificial neural network inspired by?

A

The brain’s neurons

An artificial neural network carries out pattern matching in a manner similar to how neurons in the brain operate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

A classical artificial neural network consists of which layers?

A
  • Input layer
  • Hidden layers
  • Output layer

Each layer serves a specific function in processing data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a node in an artificial neural network sometimes referred to as?

A

Perceptron

A perceptron is analogous to a neuron in the network.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the process called when data flows through each layer in order from input to output?

A

Feed-forward

This process is essential for the functioning of neural networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

In the example of deciding whether to go skiing, what are the inputs considered?

A
  • Has it snowed?
  • Are the slopes busy?
  • Is the avalanche risk high?

These inputs are evaluated to make a decision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How are decisions weighted in the skiing example?

A
  • Snow: 5
  • Crowds: 2
  • Avalanche risk: 4

The weights reflect the importance of each factor in the decision-making process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What function decides whether to activate a node based on a threshold value?

A

Activation function

This function plays a crucial role in determining the output of the node.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What type of learning uses a labeled dataset to train the model?

A

Supervised learning

This method involves using known outcomes to guide the training process.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the purpose of a bias value in neural networks?

A

To reduce error from the output

The bias helps optimize the weights to improve accuracy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a convolutional neural network used for?

A

Image or time-series-related AI systems

This type of network is a modern update to artificial neural networks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What do convolution layers in a convolutional neural network do?

A

Look for high-level features based on low-level features

This process helps in feature extraction from images.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does deep learning refer to?

A

Many layers of nodes

This approach allows for more complex models but is computationally intensive.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What technology allows for complex neural networks to be processed efficiently?

A

Graphics Processing Units (GPUs)

GPUs enable the execution of tasks requiring significant computational power.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the basic structure of an artificial neuron, or perceptron?

A
  • Receives input
  • Processes input
  • Produces output

The perceptron mimics the behavior of biological neurons using input signals, weights, and an activation function.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

True or false: The single-layer perceptron can solve non-linear problems effectively.

A

FALSE

Minsky and Papert demonstrated that single-layer perceptrons could not solve basic logical tasks like XOR.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the key differences between logistic regression and a 1-neuron perceptron?

A
  • Logistic regression uses the logistic function
  • A perceptron typically uses a step function
  • Logistic regression outputs probabilities
  • A perceptron produces hard binary outputs

These differences highlight the perceptron’s decisiveness but less flexibility in representing probabilities.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the Hebbian learning rule?

A

Cells that fire together wire together

This rule strengthens connections between neurons that are activated simultaneously.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a training epoch in the context of neural networks?

A

A single pass through the entire training dataset

After each epoch, the network’s weights are updated based on the error calculated for each data point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the purpose of a training stop rule?

A
  • Determine when to end training
  • Prevent overfitting or underfitting

Common stop rules include fixed number of epochs and early stopping.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the activation function do in a neuron?

A

Produces the output based on the weighted sum of inputs

It introduces non-linearity into the network, allowing it to model complex patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

List common types of activation functions.

A
  • Sigmoid
  • ReLU (Rectified Linear Unit)
  • Tan h (Hyperbolic Tangent)
  • Softmax

Each activation function has different properties and is chosen based on the model’s needs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the role of a weight matrix in a neural network?

A

Contains the weights applied to the inputs at each layer

The weight matrix reflects the differences between classes and helps in classification.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is backpropagation?

A

A supervised learning algorithm that adjusts weights by propagating error backwards

It allows the training of deep neural networks and is effective in minimising error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the Adam optimisation algorithm known for?

A
  • Handling sparse gradients
  • Adapting the learning rate during training
  • Computational efficiency

Adam is widely used in deep learning due to its effectiveness with large amounts of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What is the limitation of **Hebbian learning**?
* Inability to solve non-linear problems * No error correction mechanism ## Footnote This makes it challenging to train multilayer networks effectively.
26
What does a **classifier** do in the context of neural networks?
Assigns a label to an input data point based on features ## Footnote It categorises input data into one or more classes.
27
What is the significance of the **1-neuron perceptron** in artificial neural networks?
Serves as a basic building block for more complex networks ## Footnote It provides insights into how more complex neural networks operate and learn.
28
What is the foundational role of the **1-neuron perceptron model** in artificial neural networks?
It serves as a basic building block, providing insights into how more complex neural networks operate and learn. ## Footnote The perceptron’s ability to classify data based on a linear decision boundary made it a pioneering model in the field of AI.
29
What is a significant limitation of the **1-neuron perceptron**?
It struggles with non-linear problems or more complex data patterns. ## Footnote These limitations have driven the development of more advanced neural network architectures.
30
What type of neural network is developed due to the limitations of the **1-neuron perceptron**?
Multilayer perceptrons (MLP) ## Footnote MLP consists of multiple layers of neurons, each capable of learning more complex patterns.
31
What do **non-linear activation functions** in MLP allow these networks to do?
Overcome the limitations of simple perceptrons and model complex, non-linear relationships. ## Footnote This advancement enables MLP to perform tasks previously impossible with single-layer perceptrons.
32
List some applications of **multilayer perceptrons (MLP)**.
* Image recognition * Natural language processing ## Footnote MLP can solve a wide range of problems due to their ability to learn complex patterns.
33
True or false: The **1-neuron perceptron** can outperform traditional methods like logistic regression.
TRUE ## Footnote Understanding the foundational concepts of perceptrons equips learners to tackle modern AI challenges.
34
What is the ongoing process related to **learning rules** in artificial intelligence?
Improving learning rules is an ongoing, continuous process. ## Footnote Many specific learning and optimizing rules are effective in different tasks.
35
What is the **Multilayer Perceptron (MLP)**?
A type of artificial neural network organized into layers with feedforward connections ## Footnote MLP consists of an input layer, one or more hidden layers, and an output layer.
36
In an MLP, how is information processed?
Information flows in one direction—from the input layer through hidden layers to the output layer ## Footnote Each neuron in a layer is connected to every neuron in the subsequent layer.
37
What is the **universal approximation theorem**?
An MLP with two hidden layers can approximate any algebraic function ## Footnote This highlights the power of MLP in solving a wide range of problems.
38
What is a common rule of thumb when designing an MLP?
Start with a large number of neurons in the first hidden layer, reducing by half in each subsequent layer ## Footnote This helps refine the understanding of input data.
39
What is an **artificial neuron**?
A computational unit that processes inputs using a weighted sum and an activation function ## Footnote Neurons are inspired by biological neurons in the brain.
40
What do **feedforward connections** refer to?
The flow of data through layers in a single direction—forward ## Footnote There are no cycles or loops in a feedforward neural network.
41
What are **Recurrent Neural Networks (RNN)** designed for?
Handling sequences of data with feedback loops ## Footnote RNNs retain information across time steps, making them ideal for tasks like time series forecasting.
42
What is a key application of RNNs?
Text generation ## Footnote RNNs can learn to generate new sentences that mimic the style and structure of training data.
43
What are **Convolutional Neural Networks (CNNs)** tailored for?
Processing data with a grid-like topology, such as images ## Footnote CNNs detect local patterns like edges and textures in images.
44
What is a **receptive field** in CNNs?
A small region of the input layer to which each neuron is connected ## Footnote This design makes CNNs efficient for tasks like image classification.
45
What is one of the most famous applications of MLP?
Optical character recognition (OCR) for handwritten numbers ## Footnote MLP was crucial for automating the sorting of mail in post offices.
46
How were digit images processed for MLP in OCR?
Converted into a 28x28 pixel grid, flattened into a 784-dimensional vector ## Footnote Each pixel was assigned a value based on its intensity.
47
What do the weight matrices in hidden layers of MLP represent?
Filters that detect specific features of the digits ## Footnote These filters can reveal insights into how the network processes input data.
48
What is the efficiency of MLP in recognizing handwritten digits?
Achieving accuracy rates well above 90% ## Footnote The trained network generalizes well to new images of digits.
49
What do hidden layers in MLP develop for natural image recognition?
Filters that represent basic geometric shapes like lines and curves ## Footnote These filters are common in many natural images.
50
What is the significance of **style transfer** in CNNs?
Applying the style of one image to another while retaining its content ## Footnote This technique can also be applied to text generation and MLP tasks.
51
What does the evolution from a simple 1-neuron perceptron to complex ANN types showcase?
The potential of neural networks in solving diverse problems ## Footnote This includes the introduction of layering and specialized architectures for sequences and spatial data.
52
What is the **purpose** of a **clustering approach** in unsupervised learning?
To group unlabelled data to reveal hidden structures ## Footnote Clustering helps uncover meaningful groups or patterns in data without predefined labels.
53
The origins of **unsupervised learning** can be traced back to which decade?
1980s ## Footnote This period saw the development of Kohonen networks and self-organising maps.
54
What are the **two main capabilities** of **self-organising maps (SOM)**?
* Clustering * Feature reduction ## Footnote SOM allows high-dimensional data to be visualised and analysed in a more manageable form.
55
Name the **four modern clustering techniques** mentioned.
* K-Means * Hierarchical Clustering * DBSCAN * Gaussian mixtures ## Footnote These techniques offer robust and scalable solutions for organising complex datasets.
56
What is the primary goal of the **K-Means algorithm**?
To partition data into 𝑘 clusters by minimising the within-cluster variance ## Footnote K-Means is praised for its simplicity and efficiency, especially with large datasets.
57
What are the **steps** involved in the **K-Means algorithm**?
* Initialisation: Randomly assign 𝑘 initial cluster centroids * Assignment: Assign each data point to the nearest centroid * Update: Recalculate the centroids * Repeat: Iterate until centroids no longer change significantly ## Footnote K-Means is sensitive to the initial placement of centroids and assumes spherical clusters.
58
True or false: **Hierarchical Clustering** requires the number of clusters to be predefined.
FALSE ## Footnote Hierarchical Clustering builds a tree of clusters (dendrogram) without predefining the number of clusters.
59
What are the two approaches of **Hierarchical Clustering**?
* Agglomerative (bottom-up) * Divisive (top-down) ## Footnote Agglomerative starts with each data point as its own cluster, while divisive starts with all data points in one cluster.
60
What does **DBSCAN** stand for?
Density-Based Spatial Clustering of Applications with Noise ## Footnote DBSCAN is effective in identifying clusters of varying shapes and sizes.
61
What are the two parameters required by **DBSCAN**?
* Maximum distance between two points to be considered neighbours * Minimum number of points required to form a dense region ## Footnote DBSCAN is robust to noise and can handle clusters of different shapes.
62
What is the main objective of **Principal Component Analysis (PCA)**?
To reduce the dimensionality of the data while retaining key information ## Footnote PCA identifies directions along which the data varies the most and projects the data onto these directions.
63
How does **PCA** differ from clustering methods?
* Objective: PCA reduces dimensionality; clustering groups data points * Output: PCA transforms data; clustering assigns data points to clusters * Interpretability: PCA simplifies data; clustering identifies natural groupings ## Footnote PCA is often used as a preprocessing step before clustering.
64
What is the **Elbow Method** used for?
To determine the optimal number of clusters ## Footnote It involves plotting the within-cluster sum of squares (WCSS) against the number of clusters.
65
What does the **Silhouette Score** measure?
How similar each data point is to its own cluster compared to other clusters ## Footnote A higher score indicates better clustering.
66
What is a common approach to **summarise clusters**?
Summarise based on the features of the data points within each cluster ## Footnote For example, average age, income, or spending habits in customer segmentation.
67
In what scenarios is **clustering** often used?
* Customer segmentation in marketing * Grouping countries based on economic indicators * Identifying disease subtypes in medical datasets ## Footnote Clustering helps tailor strategies and identify trends.