week 2- data analysis Flashcards

(24 cards)

1
Q

histogram and ogive

A

x y
histogram intervals frequency
ogive (ungrouped
measurement grouped) CF
top of interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

histogram interpretation
- the modal class

A

the tallest column

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

histogram interpretation
- skewness

A
  • graph lean to the right = negative skew
  • graph lean to the left = positive skew
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

histogram interpretation
- outliers

A

data that is unusual

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

draw the ogive

A

x axis - top of interval
y axis - cumulative frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

median formula - ungrouped data

A

(n+1)/2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

median formula - grouped data

A

(n+1) x 50%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

measures of average compared
- the mode or the median can give a better average than the mean when the data is skewed
positive

A

the mode > median > mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

measures of average compared
normal

A

‘perfect’ mean=median=mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

measures of average compared
negative

A

the mode < median < mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

range formula

A

highest value - lowest value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

coefficient of variation
formula

A

this is standard deviation expresses as a % of the mean
standard deviation/ mean x 100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

why calculate the coefficient of variation

A

is useful for comparing the spread of different data sets (e.g. long bolts of approximately. 16 cm with short one approx. 16mm)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

interquartile range formula

A

Q3-Q1

that is the range for the mid 50% of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

quartile deviation formula

A

(Q3-Q1)/2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what is standard deviation

A
  • is a measure of spread from the mean
  • it includes all data and is used in statistical analysis
  • can be affected by skewed data and is affected by open classes
17
Q

what is interquartile range

A
  • measures middle 50% of data and is usually used with median
  • can be useful when data is skewed or where there are outliers
  • disregards bottom and top 25% of data
18
Q

how to work out Q1

19
Q

how to work out Q2

A

median
or
(n+1) x 50%

20
Q

how to work out Q3

21
Q

how to work out variance

A

standard deviation to the power of 2

22
Q

to draw graphs remember that:

A

histogram uses: x axis = intervals and y axis = frequency (BAR CHART)

ogive uses: x axis = top of interval and y axis = cumulative frequency (LINE CHART)

23
Q

what does this symbol mean
~
-

A

approx equal to

24
Q

variance formula

A

standard deviation squared