What is machine learning?
subset of artificial intelligence (AI) that enables computers to learn from data without having to program it, using algorithms to identify patterns and use this knowledge to make predictions or decisions
What’s difference between supervised machine learning and unsupervised machine learning?
What is difference between labeled data and unlabeled data?
In supervised machine learning what’s the difference between regression problems and classification problems?
What is the difference between dimension reduction and clustering in unsupervised machine learning?
-dimension reduction: reducing number of feature variables (independent variables)/frequencies so you don’t overfit and use the most important variables to explain outcome (similar to parisomy)
-clustering observations or groping observations based on common characteristics
What difference between deep learning and reinforcement learning?
What is generalization in machine learning algorithms?
model that doesn’t explain training data very well is considered underfit
model that explains training data too well it’s considered overfit
The dataset for machine learnings models are usually divided into 3 samples what are the 3 samples and uses of 3 samples?
What is difference between bias errors, variance errors, and base errors?
What are two methods for addressing overfitting models? CC
What is penalized regression in supervised machine learning, and what is noise?
-Penalized regression: technique used in machine learning to prevent models from becoming too complex and overfitting the training data. Overfitting happens when a model learns not just the underlying patterns but also the noise in the data, which makes it perform poorly on new, unseen data.
noise refers to random, irrelevant, or erroneous information that doesn’t represent the true underlying patterns you’re trying to learn (eg. Measurement errors when collecting data, outliers, etc)
How does penalized regression solve complex models or assigning excessively large coefficients to some features?
How does the lasso work, and what happens to coefficients as lasso increases?
What is support vector machine in supervised machine learning?
What happens if data points falls within the support vectors and hyperplane, called the margin?
What is k nearest neighbor in supervised machine learning?
What is classification & regression tree in supervised learning (CART)?
eg.
1. Is it sunny?
• Yes → Go to the next question
• No → Stay inside
What is ensemble learning in supervised machine learning?
Ensemble learning: like having a team of experts instead of relying on just one person’s opinion. The idea is to combine multiple models (called weak learners) to create a stronger, more accurate model. working together these models make better predictions than any single model could on its own!
What are 3 types of ensemble learning techniques for supervised machine learning? VBR
What is principal component analysis in unsupervised machine learning, and what can’t principal component analysis be used for?
What are eigenvectors and eigenvalue?
What is projection error in principal component analysis?
What is a scree plot?
What is clustering for unsupervised machine learning?