3A - Algorithms in machine learning Flashcards

(12 cards)

1
Q

Artificial intelligence VS machine learning

A

AI is a field of computer science that aims to imitate how the human brain functions (reasoning, decision-making, problem-solving, etc)

ML is a branch of AI that recognises patterns and extrapolates to give outputs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Supervised learning

A
  • Labelled input so that function(input) = output

Advantages:
- Accurate and reliable
- performance is easy to measure

Disadvantages:
- training data can be expensive to get
- loses effectiveness with noisy, complex or small sets of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Unsupervised

A
  • Unlabelled input which requires the model to find patterns, groupings and structures on its own.

Advantages:
- doesn’t require labelled data
- can simplify complex, high-dimensional data

Disadvantages:
- computationally intensive
- harder to evaluate results with no ‘right’ answer
- liable to false positives

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Semi-supervised

A

Some labelled, mostly unlabelled data.

Advantages:
- suitable for when labelled data is expensive

Disadvantages:
- small amount of labelled data might not be representative of overall trends and cause incorrect results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Reinforcement learning

A

Involves an agent performing actions in an environment, receiving rewards or penalties based on those actions, and adjusting its behaviour accordingly. Over time, the agent’s behaviour is optimised to produce positive outcomes.

Advantages:
- powerful for decision-making tasks (e.g. in games) especially for when outcomes change and are uncertain
- Learns strategies through experience

Disadvantages:
- Complex (esp. troubleshooting and debugging)
- Requires a lot of training time and computational resources
- Reliant on effectiveness of feedback

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Common applications of ML

A
  • Forecasting
  • Image recognition
  • Virtual assistants (e.g. sorting emails)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Linear regression

A

Linear regression is used to predict unknown values based on the features of the training data (must form a linear relationship).

Regression: fitting a line to datapoints

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Gradient descent

A

An optimisation algorithm that iteratively calculates the line by taking the gradient of the current position and adjusting it according to the learning rate (learning rate / step eventually becomes insignificant). This finds the minimum of a curve representative of errors.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Polynomial regression (+multivariate polynomial regression)

A

Same as linear regression but with more complex patterns that fit polynomial curves.

When the value of y is controlled by multiple input values in a polynomial relationship, this is called multivariate polynomial regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Logistic regression

A

CLASSIFICATION algorithm: Predicts the chance of something happening.

Given input, it can predict the output value, which goes through a mathematical process called the sigmoid function, which ‘squashes; this predicted output into a value 0-1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

K-nearest neighbour

A

Classification or regression tasks. It is based on the idea that the observations closest to a given data point are the most ‘similar’, and we can therefore classify new points based on the values of the closest existing points. K = the no. of nearby data points to consider.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Loss functions (e.g. MSE)

A

How ML algorithms measure error (they quantify difference between the model’s predictions and the actual values)

Common one is mean squared error which calculates the average of the squared differences between predicted and actual values (beneficial as ensures errors are positive AND penalises large errors more)

Not used for unsupervised learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly