Module 2 - Lecture 2 Flashcards

(32 cards)

1
Q

Use median when

A

When the data set is skewed

2
Q

Measures of central tendency

A

Mean, median, and mode

3
Q

When to use mean

A

When the data set is not skewed (unless the question asks for the mode)

4
Q

Data set that is not skewed

A

Symmetric

5
Q

Measure of variability

What and why is it useful

A

A measure of variability (aka measure of variation, dispersion, or spread) is a measure of the amount of spread of the data set (sample or population) on a number line

Useful because it gives us an idea of the “give and take” we can expect around a measure of central tendency

6
Q

Range

A

Range is the difference between the maximum and minimum value of a data set

Ex. X = {1, 2, 3, 8}

Range = 8-1=7
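The same calculation as a minimal Python sketch. The helper name `data_range` is an assumption made for illustration, not something from the lecture.

```python
def data_range(values):
    """Return the maximum minus the minimum of a non-empty data set."""
    return max(values) - min(values)

x = [1, 2, 3, 8]      # the example data set from this card
print(data_range(x))  # 7
```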

7
Q

Range in notation

A

Range = max(x) - min(x)

8
Q

Range in Excel

A

=max()-min()

9
Q

Mean absolute deviation

A

The average of the absolute differences between each observation and the mean
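A minimal Python sketch of that definition. The function name `mean_absolute_deviation` and the data set (borrowed from the range card) are assumptions for illustration.

```python
from statistics import mean

def mean_absolute_deviation(values):
    """Average of |observation - mean| over all observations."""
    center = mean(values)
    return sum(abs(v - center) for v in values) / len(values)

x = [1, 2, 3, 8]  # mean is 3.5; absolute deviations are 2.5, 1.5, 0.5, 4.5
print(mean_absolute_deviation(x))  # 2.25
```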

10
Q

Mean absolute deviation
Excel

A

=avedev()

11
Q

Variance

A

Variance of a sample or of a population is (approximately) the average squared deviation of each observation from the mean
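A minimal Python sketch of both versions. The function names are assumptions for illustration. The "approximately" shows up in the sample version, which divides by n - 1 rather than n.

```python
def population_variance(values):
    """Sum of squared deviations from the mean, divided by n."""
    n = len(values)
    m = sum(values) / n
    return sum((v - m) ** 2 for v in values) / n

def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    m = sum(values) / n
    return sum((v - m) ** 2 for v in values) / (n - 1)

x = [1, 2, 3, 8]  # squared deviations from 3.5 sum to 29
print(population_variance(x))  # 7.25
print(sample_variance(x))      # 29 / 3, about 9.67
```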

12
Q

Population variance in Excel

A

=var.p()

13
Q

Sample variance in Excel

A

=var.s()

14
Q

Standard deviation

A

Standard deviation s of a sample is the square root of the sample variance. The population standard deviation is the square root of the population variance
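A short check of that relationship using Python's standard `statistics` module. The data set is an assumption borrowed from the range card.

```python
from math import isclose, sqrt
from statistics import pstdev, pvariance, stdev, variance

x = [1, 2, 3, 8]

s = stdev(x)       # sample standard deviation (n - 1 in the denominator)
sigma = pstdev(x)  # population standard deviation (n in the denominator)

# Each standard deviation is the square root of the matching variance.
print(isclose(s, sqrt(variance(x))))       # True
print(isclose(sigma, sqrt(pvariance(x))))  # True
```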

15
Q

Standard deviation s of a sample

A

Square root of the sample variance

16
Q

Population Standard deviation

A

The square root of the population variance

17
Q

Advantage of standard deviation

A

Variance is in squared units; standard deviation is in the same units as the original data

Ex. If you’re measuring weights in kg and want to measure the spread of the weights, the variance would give you a measure in kg² and the standard deviation would give you a measure in kg

18
Q

Sample standard deviation in Excel

A

=stdev.s()

19
Q

Population standard deviation Excel formula

A

=stdev.p()

20
Q

When should you use the range

A

If you want a quick and dirty understanding of how spread out your data set is. Be careful of extreme values, since the range depends only on the maximum and minimum

21
Q

When should you use mean absolute deviation MAD

A

If you want a measure of spread that’s not as sensitive to extreme values and is relatively straightforward to explain

22
Q

When should you use variance

A

If you need it for mathematical work (proofs, etc.)

23
Q

When should you use standard deviation

A

Use it by default. It’s the most commonly used measure of spread, and it’s used in many subsequent parts of statistical analysis, like what we’re about to do

24
Q

Properties of variation

A

  1. The smallest possible value of the variation is 0 (you cannot have a negative spread)
  2. The measures of variation shown here are all affected by skewness
  3. The only time the variation can be 0 is if all values in the data set are the same

25
Q

Extreme values

A

One way in which a value is considered extreme is if it is greater than the sample mean plus 3 times the sample standard deviation (i.e., x̄ + 3s) or less than the sample mean minus 3 times the sample standard deviation (i.e., x̄ − 3s)
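A minimal Python sketch of this rule. The function name `extreme_values`, the parameter `k`, and the data are assumptions made for illustration. In very small samples no value can land 3 sample standard deviations from the mean, so the made-up example uses 20 observations.

```python
from statistics import mean, stdev

def extreme_values(values, k=3):
    """Return observations outside x_bar - k*s .. x_bar + k*s (s = sample SD)."""
    x_bar = mean(values)
    s = stdev(values)
    lower, upper = x_bar - k * s, x_bar + k * s
    return [v for v in values if v < lower or v > upper]

data = [10] * 19 + [100]     # made-up data: 19 typical values and one far away
print(extreme_values(data))  # [100]
```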
26
Q

Measures of central tendency include

A

Mean, median, and mode
27
Q

If the median is less than the mean

A

The data set is right (positively) skewed
28
Q

If the median is greater than the mean

A

The data set is left (negatively) skewed
29
Q

If the median and the mean are the same

A

The data set is symmetric
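The three mean-versus-median cards can be sketched as one Python helper. The name `skew_direction` and the data sets are assumptions for illustration, and the comparison is a rule of thumb rather than a formal test of skewness.

```python
from statistics import mean, median

def skew_direction(values):
    """Classify a data set by comparing its median with its mean."""
    m, med = mean(values), median(values)
    if med < m:
        return "right (positively) skewed"
    if med > m:
        return "left (negatively) skewed"
    return "symmetric"

print(skew_direction([1, 2, 3, 8]))     # median 2.5 < mean 3.5
print(skew_direction([1, 6, 7, 8]))     # median 6.5 > mean 5.5
print(skew_direction([1, 2, 3, 4, 5]))  # median 3 equals mean 3
```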
30
Q

Measures of variation include

A

Range, MAD (mean absolute deviation), standard deviation, and variance
31
Q

Sometimes a weighted mean makes more sense than an unweighted mean

A

True
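A hypothetical illustration in Python: a course grade where the exam counts three times as much as the coursework. The scores, the weights, and the function name `weighted_mean` are all assumptions, not taken from the lecture.

```python
def weighted_mean(values, weights):
    """Sum of value * weight, divided by the sum of the weights."""
    return sum(v * w for v, w in zip(values, weights)) / sum(weights)

scores = [80, 90]       # hypothetical coursework and exam scores
weights = [0.25, 0.75]  # hypothetical weights: the exam counts 75%

print(weighted_mean(scores, weights))  # 87.5
print(sum(scores) / len(scores))       # 85.0, the unweighted mean
```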
32
Q

Extreme values do what to all measures of variation and the mean

A

Pull them toward the extreme values
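A small Python comparison makes the pull visible. Both data sets are made up for illustration. The median is printed alongside because it is the measure of central tendency that stays put here.

```python
from statistics import mean, median, stdev

base = [1, 2, 3, 4, 5]
with_extreme = [1, 2, 3, 4, 50]  # same data with the largest value replaced

for label, data in (("base", base), ("with extreme", with_extreme)):
    spread = max(data) - min(data)  # range
    print(label, mean(data), median(data), spread, round(stdev(data), 2))
```

The mean, range, and standard deviation all grow when the extreme value is present, while the median stays at 3.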