CRISP-DM Framework:
Supervised Learning
Unsupervised Learning
Common Feature Selection & Dimension Reduction Techniques:
* Correlation Analysis
Common Feature Selection & Dimension Reduction Techniques:
* Multicollinearity Check
Common Feature Selection & Dimension Reduction Techniques:
* Wald Chi-Square
Common Feature Selection & Dimension Reduction Techniques:
* Factor Analysis
Common Feature Selection & Dimension Reduction Techniques:
* Principal Component Analysis
Common Feature Selection & Dimension Reduction Techniques:
* Latent Semantic Analysis
Common Feature Selection & Dimension Reduction Techniques:
* Linear Discriminant Analysis (LDA)
A good clustering method should produce high quality clusters with
There are 2 types of clustering technique:
Partitioning-based clustering
Hierarchical clustering:
How to choose inputs for clustering?
In general, you should seek inputs that have these attributes:
* are meaningful to the analysis objective
* are relatively independent
* are limited in number
* have a measurement level of Interval
* have low kurtosis and skewness (at least in the training data)
The performance of logistic regression can be gauged by the following measures:
Decision Tree Strengths
Decision Tree Weaknesses