What is machine learning
a set of methods that can automatically detect patterns in data
What does machine learning do with these patterns
they are used to predict future data or to perform other kinds of decision making under uncertainty
What is a key premise of machine learning
the learning problem
what is the learning problem
learning from data is used in situations where we don’t have any analytic solutions but we do have data that can construct an empirical solution
what does the learning problem use
a machine learning method
What kind of inputs does a model have
feature, attribute, predictor, independent variable
what kind out outputs does a model have
response, dependant variable, label growth
When is regression used in supervised learning
where Y is continuous (quantitative)
Where is classification used in supervised learning
covers situations where Y is categorical
How do you minimise the least square error
the gradient decent method
what are the main two types of supervised learning
regression and classification
Give an example of regression
predicting house prices based on size, location and number of bedrooms
Give an example of classification
spam/non-spam
What is classification used for
assigning instance to discrete categories
How is the “best fit” defined
the line that minimises the sum of squared errors between actual and predicted values
What algorithms are supervised and can solve both regression and classification problems
Decision trees
Random Forest
K-nearest neighbours
What are strengths of linear regression
What are the limitations of linear regression
What is the process of the decision tree
What is entropy
a measure of uncertainty or impurity of a dataset
What does information gain measure?
A: The reduction in entropy after a dataset is split on an attribute.
Why Is information gain important
It determines the best attribute to split on.
What are the strengths of the decision tree
What are the limitations of the decision tree