The classification trees classification algorithm:
2
The naive Bayes classification algorithm:
3
the knn classification alg:
1
classification algorithms that do not use assumptions abt the structure of teh data are ___ algorithms
data driven
a good use of classification alg would be:
4
in a CART model classification rules are extracted from
the decision tree
the knn techique is what type of technique
a classification technique
in setting up the knn model:
1
Below are the 8 actual values of the target variable in the training position:
(0,0,0,1,1,1,1,1)
What is the entropy of the target variable?
-5/8 log2(5/8)-3/8 log2(3/8)
5/8 log2(5/8)-3/8 log2(3/8)
-3/8 log2(3/8)+5/8 log2(3/8)
-5/8 log2(3/8)+log2(5/8)
1
Classification programs are distinguished from estimation problems in that
2
Which statement is true about the decision tree attribute selection process:
2
What is the ensemble enhancement that is a method of creating psudo-data from the data in an og data set? partitioning overfitting sampling bagging
bagging
What is the ensemble enhancement that is an iterative technique that adjusts the weight of any record based upon the last classification bootstrapping boosting sampling bagging
boosing
What is the most often used ensemble enhancement
bagging
What are the 3 most popular methods for creating ensembles?
2
What is one benefit of using an ensemble model?
3
What is the most common uses of clustering algorithms?
2
in logit P/(1-p) represents:
the odds of sucess
In a naive bayes model it is necessary that:
-all attributes are categorical
-to partition the data into 3 parts (training, validation, scoring)
-to set cutoff values to less than .75
to have a continuous target variable
1 (ie gender, blood type); can never have cont. variables
Generally, an ensemble method works better, if the individual base model have _____
Assume each indiv. base models have accuracy greater than 50%
-less correlation among predictors
-high correlation amond predictors
-correlation does not have any impact on ensemble output
-none of the above
1
a dendogram is used w which analytics algorithsm? text mining clustering ensemble models all of the above
clustering
What is a bootstrap?
4
what is clustering
3
Which of the following are not types of clustering?
4