What are the three different types of data mining tasks that can be used depending on the business need?
Prediction, clustering, or association.
What is classification in data mining prediction?
what is being Predicted is a class label (e.g., weather: sunny, cloudy, rainy).
What is regression in data mining prediction?
Predicting a numeric value (e.g., temperature: 31).
What kind of learning does classification employ?
Supervised learning.
What is the most frequently used data mining method?
Classification.
What is nominal data?
Data labelled/classified into mutually exclusive categories within a variable.
What is ordinal data?
Statistical data where variables have natural, ordered categories.
What are the two steps in classification-type prediction?
Model development/training and model testing/deployment.
What is the simple split method in classification?
Splitting data into 2 mutaally exclusive sets
(eg: training (~70%) and testing (30%) sets.)
What is k-fold cross-validation?
Data split into k subsets; k training/testing experiments are done.
What is a confusion matrix used for?
Accuracy estimation in classification problems.
What is a True Positive (TP)?
A correctly predicted positive case.
What is a False Positive (FP)?
An incorrectly predicted positive case.
What is a False Negative (FN)?
An incorrectly predicted negative case.
What is a True Negative (TN)?
A correctly predicted negative case.
What are classification techniques in data mining?
Classification techniques are methods used to assign data items to predefined classes based on their attributes.
List the classification techniques mentioned in the slides.
Decision tree analysis, statistical analysis, neural networks, support vector machines, case-based reasoning, Bayesian classifiers, genetic algorithms, and rough sets.
True or False: Decision tree analysis is a type of classification technique.
True.
True or False: Neural networks are included under classification techniques.
True.
Which classification technique is based on probabilistic reasoning?
Bayesian classifiers.
Which classification technique mimics biological evolution concepts?
Genetic algorithms.
Which classification technique focuses on similarity to past cases?
Case-based reasoning.
Which classification technique separates classes using optimal boundaries?
Support vector machines.
What is a decision tree?
A decision tree is a classification method that employs a divide-and-conquer approach to classify data.