What are tree-based methods?
How do we write a classification problem of a regression tree in formal notation?
How do we go about finding cj?
Target for regression:
How do we go about finding the partition order and deciding how the tree splits?
How do we apply the top-down greed approach to build a tree model?
What is the general process of splitting the predictor space?
How do we deal with the problem of overfitting when creating a tree through the top-down greedy approach?
How do we go about choosing the optimal lambda (optimal subtree)?
What is the entire algorithm for building a tree regression?
How do tree methods change for classification problems?
l-hat(j) is the majority vote class that lies in the j-th region
What other methods of loss do we use for multi-classification problems?
What are the advantages and disadvantages of the trees?
How do we deal with the overfitting issue of trees?
How do we perform Bootstrap aggregation or Bagging?
Does bagging actually reduce variance?
How can we improve on this for tree models?