Who first defined machine learning and when?
Arthur Samuel 1959
Define machine learning.
Field of study that gives computers the ability to learn without being explicitly programmed.
How did Arthur Samuel implement machine learning?
Automated checkers programmed that learnt positions so could re-use them instead of searching manually each time.
How does machine learning operate in a nutshell?
Machine receives answers and data as input and outputs rules.
Name the steps of machine learning. x6
Give some more example applications of machine learning.
Any one of:
Building smart robots
Text understanding
Computer vision,
Medical Informatics
Database Mining
Distinguish between supervised and unsupervised learning with examples.
Supervised: Has training data with known answers and produces model to predict for new data. E.g. Classify tumors
Unsupervised: Data provided but with no answers and algorithm finds structure or interesting patterns in data. E.g. Market Segmentation Research
Distinguish between eager and lazy learning.
Eager: Constructs a general, input-independent target function during training. Model constructed before any queries.
Lazy: Generalization beyond the training data is delayed until a query is made. Stores training data with only minor processing until query.
T or F. Lazy learners take less time in training and less time in predicting than eager learners.
False. Lazy learners take less time in training and more time in predicting.
List relevant algorithms studied for eager / lazy respectively.
Eager:
- Decision Trees
- NaiveBayes
- Neural Networks
Lazy:
- K-nearest neighbour
Timing of Model Building: Difference between Lazy and Eager Learning.
Lazy: During prediction
Eager: Before prediction.
Data Dependency: Difference between Lazy and Eager Learning.
Lazy: Relies heavily on training data during prediction.
Eager: Less dependent on training data during prediction.
Computational Efficiency: Difference between Lazy and Eager Learning.
Lazy: Faster in training, slower in prediction due to model building.
Eager: Slower in training, faster during prediction due to pre-built model.
Memory Usage: Difference between Lazy and Eager Learning.
Lazy: Less memory usage in training, more during prediction.
Eager: More memory in training, but less during prediction.
Other than supervised, unsupervised, what is the third type of machine learning?
What drives it?
Give one use.
Give one example algorithm.
Reinforcement Learning
Learning from mistakes
Gaming
Q-Learning
What drives supervised and unsupervised learning respectively.
Supervised: Task driven
Unsupervised: Data driven
What is regression?
Technique used in supervised learning where historical data is used to predict future.
What is a decision tree?
Each node has a question, each branch possible answers, leaf nodes have decisions. E.g. Series of q’s to determine user BMI
Define the following terms as parts of decision tree.
1. attribute
2. attribute value
3. classification
4. conjunctive term
Outline the DT learning algorithm.
Choose best attribute
Add new node for this attribute and a new branch for each attribute value.
Sort training examples through node to current leaves.
If training examples are unambiguously classified then stop.
Otherwise repeat.
T or F. Finding an optimal decision tree is tractable.
False. It is intractable.
T or F. Finding a near optimal tree is tractable.
True.
What is the ideal goal when making a decision tree?
That it be as small as possible.
T or F. If an attribute leads to an immediate classification then it is a good attribute.
True