Data Representation Flashcards

(6 cards)

1
Q

Define discrete data

A

Data that can only take set values in a range of

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define continuous

A

Data that can take any value in a range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Define outlier

A

Extreme value which doesn’t fit the same patterns as the rest of the data set. You can never remove them unless there is a genuine reason to do so.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Define anomalies

A

An outlier where there is a genuine reason why it shouldn’t be there. You can remove anomalies from the data called cleaning the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How can you use mean and standard deviation to identify outlier

A

Any value which is more than 3 standard deviations from the mean is considered an outlier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

How can you use tukey fences to identify outliers

A

The lower tukey fence is Q1-1.5 IQR
The upper tukey fence Q3+1.5 IQR

Any value outside the tukey fence is considered an outlier

How well did you know this?
1
Not at all
2
3
4
5
Perfectly