Statistics Flashcards

(47 cards)

1
Q

Give the coding of x to get the standard normal distribustion

A

z = x - μ / δ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Give the equation for friction

A

Fmax = μR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Conditions for binomial distribution

A

Discrete
Fixed no. of trials
Success or failure
Trials independent
Constant probability of success

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Conditions for normal distribution

A

Continuous data
Symmetrical
Reduced frequency outside of mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Give the steps of binomial hypothesis testing

A

H0 - null hypothesis
H1 - alternative hypothesis
Define X
X ~ B(n,p)
Conclusion using significance level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

P(AnB) if independent

A

P(A)P(B)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

P(A|B) if independent

A

P(A)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is PMCC a measure of

A

Type and strength of correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What % of data lies within 1δ of the mean on a normal distribution?

A

68%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What % of data lies within 2δ of the mean on a normal distribution?

A

95%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Give the 5 locations in the large data set in the UK from north to south

A

Leuchars
Leeming
Heathrow
Hurn
Cambourne

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Give the 3 international locations in the large data set and their countries

A

Beijing (China)
Perth (Australia)
Jacksonville (US)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are the two time periods in the large data set?

A

May - Oct 1987
May - Oct 2015

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What should be done with variables listed as ‘n/a’ or ‘-‘?

A

Ignored in calculations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What should be done with variables listed as ‘tr’?

A

Use 0 or 0.025 as values, include in calculations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Which locations in the large data set are close to the coast?

A

Jacksonville (US)
Perth (Australia)
Leuchars
Hurn
Cambourne

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Which location in the large data set is in the southern hemisphere?

A

Perth (Australia)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What affects 1987 results in the UK?

A

Great storm 15-16 Oct
High wind speed in south and east

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Give all of the 9 variables in the large data set

A

Daily mean (air) temperature
Daily total rainfall
Daily total sunshine
Daily maximum relative humidity
Daily mean windspeed and direction
Daily maximum gust and direction
Cloud cover
Daily mean visibility
Daily mean pressure

20
Q

Give the units of daily mean (air) temperature

21
Q

Give the units of daily total rainfall

A

mm
1d.p.
tr < 0.05mm

22
Q

Give the units of daily total sunshine

23
Q

Give the units of daily maximum relative humidity

A

%
nearest integer
>95% mist and fog

24
Q

Give the units of daily mean wind speed and direction

A

kn (speed)
1kn = 1.15mph
degrees (direction)
nearest 10, cardinal direction

25
Give the units of daily maximum gust and direction
kn (speed) 1kn = 1.15mph degrees (direction) nearest 10, cardinal direction
26
Give the units of cloud cover
Oktas 0 1 2 3 4 5 6 7 8
27
Give the units of daily mean visibility
dm 1dm = 10m horizontally
28
Give the units of daily mean pressure
hPa (hectopascals) 1 hPa = 100 Pa = 1 milibar
29
What is collecting data from an entire population called?
Census
30
Give an advantage and disadvantage of a census
Fully accurate Time-consuming and expensive Can destroy all items if consumables e.g. fireworks
31
What is collecting data from a population subset called?
Sampling
32
Give advantages and disadvantages of sampling
Quicker and cheaper than census Less data to analyse than census Bias Incorrect population representation
33
Define simple random sampling
Randomly selects n members of the population so that all members have an equal chance of selection by ordering them and selecting n random numbers (for sample size n)
34
Define systematic sampling
Choose members of population at regular intervals from list e.g. every third house
35
Define proportional stratified sampling
Population divided into groups (strata) e.g. by age then simple random sample from each stratum Proportion of sample from each stratum = proportion of population in that stratum
36
Define quota sampling
Population split into groups, members selected until quota from that group is filled e.g. survey needing 25-30 females
37
Define opportunity (convenience) sampling
Sample uses available population who fit criteria
38
Give the formula for sample standard deviation of size n
Population standard deviation / sqrt(size of sample)
39
Give the equation for the variance of a sample with size n
σ²/n Where σ is the original standard deviation ~N(μ,σ²/n)
40
Define a sampling frame
A list of individuals in a population
41
Give advantages and disadvantages of simple random sampling
No bias Easy and cheap to implement (for small populations) Each sampling unit has equal, known chance of selection Unsuitable if population or sample size is large Sampling frame (list) needed
42
Give advantages and disadvantages of systematic sampling
Simple and quick to use Suitable for large samples and populations Can introduce bias if sampling frame isn't random Sampling frame (list) needed
43
Give advantages and disadvantages of proportional stratified sampling
Accurately reflects population Guaranteed proportional representation Population must be split into strata Within strata - must have sampling frame, unsuitable for large population
44
Define disproportional systematic sampling
Split population into strata Divide sample size by number of strata Choose that number of people from each stratum for surveying
45
Give advantages and disadvantages of disproportional stratified sampling
Accurately represents population structure Split population into strata
46
Give advantages and disadvantages of quota sampling
Small sample still representative No sampling frame needed Quick, cheap, easy Easy comparison between groups Non-random Must divide population into groups Increasing scope increases cost Nonresponses ignored
47
Give advantages and disadvantages of opportunity sampling
Easy, cheap Not random Heavily biased Dependent on individual researcher Unlikely to be representative