Describe the Supervised learning problem
- Vector of P predictor measurements X ( input, regressor, covariates, independent var)
What are X and Y in regression/classification problems
Regression problem
- Y is quantitative ( price, blood pressure)
Classification problem
- Y takes value in a finite ordered set ( classes, true/false)
has training data - instances of the data
List objectives of supervised learning (AUA)
Describe unsupervised learning
Describe Statistical Learning vs ML
ML is a subset of AI
SL is a subfield of stats
ML has a greater emphasis on large-scale applications and prediction accuracy
SL emphasizes models and their interpretability, precision, and uncertainty
Describe the regression function
Describe the nearest neighbor
Describe the linear model
f(x) = B0 + B1X1 + B2X2 + … BPXP
trade-offs of linear model (PGP)
Describe assessing model accuracy
Compute average squared prediction error over TE (fresh test data) rather than TR (training data) to avoid bias towards overfit models.
- MSETe = Avei∈Te[yi − ˆf (xi)]2
Describe Bias Variance Trade-off
Describe Classification Problem (BAU)
Is there an ideal C(X)?
- Let pk(x) = Pr(Y = k|X = x), k = 1, 2, . . . , K. These are conditional class probabiliteies
The Bayes optimal classifier at x is
C(x) = j if pj (x) = max{p1(x), p2(x), . . . , pK (x)}
Classification details (MBS)
Describe Tree based models
Describe Pros and Cons of tree-based methods
Details of tree building process
Describe classification tree
- Predict that each observation belongs to the most commonly occurring class of training observations in its region
Details of classification tree
Describe Gini index
G =K∑^pmk(1 − ˆpmk)
k=1
- takes on a small value if all of pmk are close to 0 or 1
Tree 10 fold / N fold cross validation
Evaluation Measures
Accuracy = TP + TN / ( TP+TN+FP+FN)
True Positive Rate = TP/ (TP+FN)
False positive Rate = FP / (FP + TN)
Issues with decision trees
- Missing values assign most common attribute value or common class value
Describe unsupervised Learning
- Not interested in prediction since no response variable Y