Explain Parametric?
Parametric statistical procedures rely on assumptions about the shape of the distribution (i.e., assume a normal distribution) in the underlying population and about the form or parameters (i.e., means and standard deviations) of the assumed distribution. Advantage: Restrictive models are much more interpretive.
Explain Non-parametric?
Nonparametric statistical procedures rely on no or few assumptions about the shape or parameters of the population distribution from which the sample was drawn. Advantage (compared to parametric methods): They may accurately fit a wider range of possible shapes for f. Disadvantage: A very large number of observations is required in order to obtain an accurate estimate of f
Explain Supervised?
For each observation of the predictor measurement(s)π₯i, π=1,…,π there is an associated response measurement π¦i
Supervised (examples):
β’ Linear regression
β’ Logistic regression
β’ Support vector machines
β’ Neural Networks
β’ Collaborative filtering (Methods that try to fill in the missing values e.g. Netflix ratings)
What is Unsupervised?
We observe a vector of measurements π₯i, π=1,…,π, but no associated response π¦i
Unsupervised (examples):
β’ Clustering
β’ PCA
What is the Bias-variance tradeoff?
What is Quality of fit?
How can machine learning be included in your research?