Statistics 1 Flashcards

(63 cards)

1
Q

What does it mean if a variable has a nominal scale?

A

Consists of numbers/ categories, cannot be put into a logical order, has no fixed unit of meaurement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does it mean if a variable has a ordinal scale?

A

Consists of numbers/ categories, can be put into a logical order, doesn’t have a fixed unit of measurement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does a variable with a interval scale have?

A

consists for numbers, can be put into a logical order, has a fixed unit of measurement, 0 has no real meaning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does it mean if a variable has a ratio scale?

A

Consists of numbers, can be put into a logical order, fixed unit of measurement, meaningful 0 point

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Name 3 example for a nominal scale.

A

Which party do you lean towards? What hair color do you have? What fruits do you like? … etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Name 3 examples for a ordinal scale.

A

How often have you felt angry? Who performed best in the exam? Placements of a race. …etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Name 3 examples for a interval scale.

A

How satisfied are you with…? Rank year of birth… . Temperature Celsius/ Farenheit… . Etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Name 3 ratio scales.

A

Height in cm. Monthly income in Euro. Weight in kgs. …etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What scale is it: categories/ words, no order, no fixed unit of measurement

A

Nominal scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What scale is it: categories/ words, logical order, no fixed unit of measurement

A

Ordinal scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What scale is it: numbers, order possible, fixed unit of measurement, 0 has no real meaning

A

Interval scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What scale is it: numbers, orders possible, fixed unit of measurement, meaningful 0 point

A

Ratio scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the three measures of central tendency?

A

Mean, Median, Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the mean?

A

Sum of all values divided by total number of values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are disadvantages of the Mean?

A

Can be distorted by extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the Median?

A

Central value of a dataset

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

How do you find the median?

A

Put values in order, find the middle. Even amount of values: amount of values plus one, divided by 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Advantages of the mean

A

Not distorted by „extreme values/ outliers“

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

When is the effect of extreme values larger?

A

In small datasets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What’s the mode?

A

Category which occurs most often

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What’s the mode also called?

A

Modal value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Which measure of central tendency is important for nominal data?

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How many values can median and mean have?

A

One

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What are three measures of dispersion?

A

Range, IQR, standard deviation (and variance)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What’s the range?
Distance between lowest and highest value
26
What’s the simplest way of showing the variability?
Range
27
How do you calculate the range?
Maximum minus minimum
28
What’s a disadvantage of the range?
For example: two datasets with completely different distributions that have the same minimum and maximum have the same range
29
How do you find the interquartile range?
Divide the dataset into 4 equal parts (Q1, Q2, Q3)
30
What’s Q2 in the IQR?
The median
31
What’s the standard deviation and variance?
Mean of all squared deviations in the dataset
32
What’s the deviation
Amount by which a value differs from the mean
33
What are the 6 steps to calculate the standard deviation?
1. Find the mean 2. Find the distance of each value from the mean 3. Square the deviations 4. Sum up all squared deviations 5. Divide by N 6. Take the square root of the result
34
How do you interpret the standard deviation?
„On average a value deviates by ____ from the mean.“
35
When is it possible to compare standard deviations from different datasets?
Only possible to compare IF a) both datasets have same number of obervations AND b) if they have same unit of measurement
36
Whats the CV?
Coefficient of variation
37
How do you calculate the CV?
Standard deviation divided by the mean
38
Whats the rule of thumb?
CV > 1 = rather high variability, CV < 1 = rather low variability
39
When should the CV only be used?
Only used for ratio-level data, NOT for interval-level data
40
Name graphical represantation of data
bar charts, line charts, histogram, box-plot
41
What do graphs need to be?
easily understandable and clearly labelled, need to show more than the obvious
42
Why do bar charts have to start at 0?
Graphs not starting at 0 can be misleading
43
What are line charts used for?
time series data (e.g. months, years, days etc.), best used for comparisons over time and between entities
44
What does a Histogram do?
shows pattern of the whole dataset
45
what are the three properties of histograms?
1. height of the collum is proportional to the number of observations 2. all intervals are displayed, even if there are no observations 3. collum are equally wide ("bin width")
46
What does the box-plot do?
shows information on typical value and the spread in one graph
47
what informations does the box-plot combine?
Minimum and Maximum (without outliers), Q1, Median, Q3
48
What are out outliers?
values equal or bigger than Q3+1,5*IQR or values equal or smaller than Q1-1,5*IQR
49
What are far out outliers?
Values equal or bigger than Q3+3*IQR or values equal or smaller than Q1-3*IQR
50
What are frequency distributions used for?
for larger datasets (30 and higher) we can look at overall distribution
51
Why are frequency distributions used for larger datasets?
with larger datasets we have enough repetitions of the same or similar values
52
What are the steps to make a frequency table?
1. Make a list with all values in your dataset starting with the lowest 2. Go through your scores and count the number of values for each of the values in your dataset 3. Calculate the percentage of scores for each value 1-10
53
What do frequency tables or histograms describe?
frequency distribution (show how frequencies are spread out)
54
What does frequency distribution show?
pattern of frequencies over the various values
55
What are the shapes of distribution?
unimodal distribution, bimodal distribution, multimodel distribution and rectangular distribution
56
How to differenciate the shapes of distribution?
unimodal: 1 peak bimodal: 2 peaks multimodal: three or more peaks rectangular: all values have same frequency
57
What frequency distribution more likely in real life data?
unimodal distributions, bimodal and multimodal less often
58
What are the properties of symmetrical distributions?
1. two halfs look the same 2. equal number of causes on each side 3. typical value lies in the middle
59
Whats the opposite of symmetrical distribution?
Skewness (=Schiefe)
60
What types of skewed distribution are there?
- distribution skewed to the left (negatively skewed) - distribution skewed to the right (positively skewed)
61
Whats the "floor effect"?
a lot of values (in lower boundaries) can be found in the lowest interval
62
What are examples of lower boundaries?
-cant buy groceries with less than 0€ - parents cant have less than 0 children
63
What are examples of upper boundaries?
- number of cinema tickets sold (limit is capacity of theatre) - percentage of correct answers in mutliple-choice test (max 100%)