What is tree-based learning? What does it do and how?
Tree-based learning is a type of
- supervised machine learning
- performs classification and regression tasks.
- It uses a decision tree as a predictive model to go from observations about an item represented by the branches to conclusions about the items target value represented by the leaves.
Ensemble Learning
which enable you to use multiple decision trees simultaneously in order to produce very powerful models
What’s the benefit of hyperparameter tuning?
Knowing how and when to tune a model can help increase its performance significantly
What is a Decision Tree?
How data professionals use decision tree?
to make predictions about future events based on the information that is currently available.
Decision Tree PROs
Decision Tree CONs
The model might get extremely good at predicting scene data, but as soon as new data is introduced, it may not work nearly as well.
What are made at each node?
Decisions are made at each node.
Edges
The edges connect together the nodes essentially directing from one node to the next along the tree.
What is a Root Node?
What is a Decision Node?
Leaf Node
What are Child Nodes?
What are Parent Nodes?
node that the child splits from
What prediction outcomes types can decision tree be used for?
What is the criteria to split a Decision node?
A decision node is split on the criterion that minimizes the impurity of the classes in their resulting children.
What is Impurity?
Name 4 metrics to determine impurity
What’s the requirement for choosing split points?
Choosing split for categorical predictor variable
consider splitting based on the categorical variable, ie. color.
Choosing split for continuous predictor variable
splits can be made anywhere along the range of numbers that exist in the data
Ie. sorting the fruit based on diameter: 2.25, 2.75, 3.25, 3.75, 5, and 6.5 centimeters.
Describe Gini impurity score
Classification trees PROs
Name 2 disadvantages of classification trees