Probability Flashcards

Question

What is quota sampling?

Answer 1

The population is divided into categories and each category is given a quota (number of members you want to sample), data is collected until the quotas are met (**without** using random sampling)

Answer 2

Advantages: easy for the sampler as they don’t need access to the whole population or a list of every member Disadvantages: can be biased if the selection process isn’t random (some of the population may be excluded)

Answer 3

A random variable is a number generated by a random experiment (e.g. rolling a die). The variable is discrete if the possible values form a countable set

Answer 4

nth term of the numerator is -4n+30 which rearranges to 30-4n So the whole nth term is (30-4x)/80

Answer 5

3a=5b (a-b) + 3a + 5b + (6a+5b) + (4a+b) = 1 (probabilities in a table always sum to 1! Now you have 2 equations you can use simultaneous equations to solve) a=0.05 and b=0.03

Answer 6

1) 1-P(not B) = 1-(0.65)^4 =0.8215 2) (0.2x0.2) + (0.45x0.45) **x 6** =0.0486 **Multiply by 6 as there are 6 ways of choosing 2 from 4?**

Answer 7

(This is binomial distribution.) P(x=4) = (10 choose 4) x (1/6)^4 x (5/6)^6 210x(1/6)^4x(5/6)^6=0.0543

Answer 8

1) go to home 2) click on ‘inference’ then ‘probability’ 3) select ‘binomial’ 4) type in n,P then choose ‘next’ 5) select the type of graph on the left (little graph icon) to get x=/≥/≤ 6) type in a value for x and press ‘exe’

Answer 9

This is where people choose to be part of the sample. (Advertise/appeal to the whole population, those who respond are included in this sample).

Answer 10

Advantages: requires little time/ effort to find sample members, people who have volunteered are less likely to not respond Disadvantages: there can be trends within the correspondents that lead to bias, people may not want to volunteer for various reasons

Answer 11

X ~ B (10,0.25)

Answer 12

1) P(x=5) = 0.2007 2) Y ~ B (7,0.2007) when P(Y=2) (use calculator) gives you 0.2759

Answer 13

X ~ B (10,0.9) when P(X≥8) gives you 0.9298. Let Y be the number of trays in which at least 8 seeds germinate. Y ~ B (20,0.9298) when P(Y≥19) gives you **0.5855**

Answer 14

W ~ B (n,0.27) P(W≥1)>0.95 is the same as 1-P(W=0)>0.95, thus P(W=0)<0.05 which means we can put it in the formula: (nC0) x (0.27)^0 x (0.73)^n <0.05 0.73^n<0.05 take logs of both sides: nlog(73) < log(0.05), n>log(0.05)/log(73) **(remember to flip < to > as log(0.05) is a negative number!!)** n>9.519 so **n=10 as it must be a positive whole number (natural)**

Answer 15

1) self-select 2) simple random 3) opportunity

Answer 16

X ~ B (n,p) X is a discrete random variable ~ means ‘relating to’ B represents binomial distribution n is the number of trials p is the probability

Answer 17

n is in the naturals so includes zero and positive whole numbers

Answer 18

1) mean: np (trials x probability) 2) variance: np(1-p)

Answer 19

Calculate the **‘expected number’**

Answer 20

1) x≤10 2) x≥2 3) 3≤x≤17

Answer 21

P(x<20) is the same as P(x≤19), typed into calculator gives 0.2130

Answer 22

P(x>42) is the same as P(x≥43), typed into calculator gives 0.5990

Answer 23

The interval must use only ≤ symbol so type this into the calculator: 3≤x≤6. This gives an answer of 0.7579

Answer 24

1) discrete data 2) each selection is independent 3) the probability is fixed/constant 4) there are only 2 possible outcomes e.g. success/failure or heads versus tails **In a question you must relate these factors to the context/ scenario**

Answer 25

1) **H0 (H naught)** which is known as a **null hypothesis** 2) **H1** known as the **alternate hypothesis**

Answer 26

Where findings are **statistically insignificant**, we use the phrase ‘**fail to reject H0**’ if we believe H0 is true

Answer 27

Where findings are **statistically significant and have a direction**, we use the phrase ‘**reject H0**’ if we believe H1 is true (the statement is always with reference to H0, NEVER say ‘accept’)

Answer 28

Positive, negative or you can state that ‘there’s a difference’

Answer 29

Type 1: falsely rejecting the null hypothesis when it’s true Type 2: failing to reject the null hypothesis when it’s incorrect (opposite of type 1)

Answer 30

The cut-off point for either rejecting or failing to reject the null hypothesis. The value can be 10% (0.1), 5% (0.05) or 1% (0.01).

Answer 31

5% as 10% is too liberal and 1% is too strict

Answer 32

The probability due to chance, calculated e.g. by doing X ~ B (12,0.5) when P(x≥11), the answer: 0.003174 is the p-value

Answer 33

Probability due to chance is too high so you must **fail to reject the null hypothesis**

Answer 34

Probability due to chance is low enough that you can **reject the null hypothesis**

Answer 35

You must give **all** of these points, if you miss any you lose the consecutive marks!! 1) let p be the probability of getting heads 2) H0: p=0.5 (equal chance of heads or tails) 3) H1: p<0.5 (heads is less likely to appear) 4) assuming H0 is true, then X ~ B (7,0.5) when P(x≤1) = 0.0625 (this is the p-value) P(x≤**1**) because 1 of the 7 flips is heads 5) 0.0625>0.5 the result is **not significant** so we **fail to reject the null hypothesis** 6) there is **insufficient** evidence to **suggest** that the coin is biased against heads

Answer 36

Range of values which would lead you to reject H0 (opposite of acceptance region)

Answer 37

1) **1/2 the significance level** 2) write the critical region in interval notation

Answer 38

Fail to reject H0

Answer 39

9C2 x 8^7 x (x^2)^2 36 x 2097152 x x^4 = 75497472x^4 so the coefficient is 75497472

Answer 40

(3-(x/2))=2.995, -x/2=-0.005 so **x=0.01** 6561-8748(0.01)+5103(0.01)^2 =**6,474.0303**

Answer 41

7C7 x (1/x)^0 x 1^7 = 1 7C6 x (1/x)^1 x 1^6 = 7x^-1 7C5 x (1/x)^2 x 1^5 = 21x^-2 7C4 x (1/x)^3 x 1^4 = 35x^-3 So 1+7x^-1 + 21x^-2 + 35x^-3 (**the only difference between ascending and descending powers of x is that you start with, in this case for descending powers, 7C7 instead of 7C0!**)

Answer 42

It uses the words: difference/ biased / change rather than increase/ decrease (which would indicate one tailed - regular - tests)

Answer 43

The vague language shows it’s a 2 tailed test: Let p be the probability of rolling a 6 H0: p=1/6 H1: p**≠**1/6 Assume H0 is true: X ~ B (36,1/6) when P(x≤1/6) (**use ≤ because 1/36 being a 6 is a very small probability so it’s likely the bias is against sixes**) = 0.0116 < 0.025 (**significance level halved!!**) The result is statistically significant so reject H0. There’s significant evidence to suggest the die is biased

Answer 44

≠ means it’s a 2 tailed test so your answer will be a union of 2 critical regions. 10% needs to be halved so you use 0.05 (5%) 1) find the mean to give you a starting point (nxp = 32x0.6=19.2) 2) to find the lower critical region, test each number consecutively working down from the mean until you reach one that gives a p-value below 0.05. This is true for numbers below 14 so x∈[0,14] because it’s the lower bound the **zero is automatically known** P(x≤14) = 0.0463<0.05 P(x≤15)=0.0920>0.05 3) to find the upper critical region, test each number consecutively working up from the mean until you reach one that gives a p-value below 0.05. This is true for numbers above 25 so x∈[25,32] because it’s the upper bound the **top number is always ‘n’, which in this case is 32** P(x≥24) = 0.0575>0.05 P(x≥25)=0.0248<0.05 4) answer: x∈[0,4] U [25,32]

Answer 45

1) mean (np) = 70x0.8=56 2) because the **sign is < in the question (for H1) use P(x≤…)** for each value you try 3) test numbers between the mean and zero because of the < symbol 4) the first number with a p-value below the significance level (0.05) is 49. 5) write out both 49 and 50 to show that 50 doesn’t satisfy being below 0.05 and thus that 49 is the first acceptable value: P(x≤49) = 0.0303<0.05 P(x≤50)=0.0545>0.05 6) **zero is always the lower bound for < symbol** so the critical region is: x∈[0,49]

Answer 46

1) mean (np) = 85x0.25=21.25 2) because the **sign is > in the question (for H1) use P(x≥…)** for each value you try 3) test numbers between the mean and ‘n’ (which in this case is 85) because of the > symbol 4) the first number with a p-value below the significance level (0.1) is 27. 5) write out both **26** and 27 to show that 26 doesn’t satisfy being below 0.1 and thus that 27 is the first acceptable value: P(x≥26) = 0.1439>0.1 P(x≥27)=0.09639<0.1 6) **n is always the upper bound for > symbol** so the critical region is: x∈[27,85]

Answer 47

Continuous data that forms a bell-shaped curve

Answer 48

X ~ N (μ, σ^2) Where μ is the mean of the data and σ^2 is the variance

Answer 49

The **area represents (/ is equal to) the probability** and the **total area under the graph sums to 1**

Answer 50

It’s an asymptote

Answer 51

Yes, because values greater than 24 can be infinitely close to 24 (therefore we can say they equal 24)

Answer 52

Z ~ N (0,1^2) where the values in the brackets are always zero and 1!

Answer 53

Z = (x - μ) /σ This **isn’t** in your formula booklet so you must learn it!!

Answer 54

1) work out the standard normal distribution: z=(x-μ)/σ so z=(12-10)/4 =0.5 2) the area of the standardised graph will equal the original graphs area so P(z<0.5)=P(x<12)

Answer 55

1) Select inference, then probability, then normal 2) Ensure the mean and standard deviation are set to 0 and 1 respectively (when using z) 3) use the toggle on the left to select the graph type (/ whether the symbol is greater than or equal to etcetera) 4) type in your z value 5) **write your answer to 4dp**

Answer 56

1) Work out the standard normal distribution: z=(x-μ)/σ so z=(9-8)/3 =1/3 2) P(x>9) = P(x≥9) so on the calculator select the graph icon that gives you the ≥ symbol and type in 1/3. 3) The answer is 0.3694

Answer 57

**Zero** because the data is continuous, so you can have an infinite number of values, thus the probability of choosing any one number is infinitely small

Answer 58

**1** because the probability of choosing all the numbers except 57 (one value out infinitely many) is so high it’s basically 1

Answer 59

1) work out 2 separate z-values: z= 56-56/10=0 and z=65-56/10=0.9 2) select the graph icon showing P(_≤x≤_), then insert your z-values 3) the answer is 0.3159

Answer 60

1) mean=10x0.2=2 so start testing at 2 2) P(x≤0) = 0.1073 > 0.05, **no critical region** because you never reach a value below 0.05

Answer 61

1) ≠ represents a 2 tailed test so you must half the significance level! (Now 0.005) 2) mean=30x0.4=12 so start testing above and below 12 3) x∈[0,4] U x∈[20,30]

Answer 62

The range of values which would lead you to fail to reject H0 (opposite of the critical region)

Answer 63

The **probability** of rejecting H0 when it’s actually true

Answer 64

P(X≤0) =0.0778 means **P(X=0)**=0.0778 (n,o) x 0.4^0 x 0.6^n = 0.0778 n=log0.6(0.0778) =4.999 so **n=5**

Answer 65

Used for approximating the area under a graph

Answer 66

A rectangle

Answer 67

Ordinates: y-values/ outputs/ heights Strips: trapeziums

Answer 68

h (the width of the trapeziums)

Answer 69

**Increasing the number of strips** improves the accuracy of the estimated area

Answer 70

Always n+1 ordinates

Answer 71

Area ≈ 1/2 h (first + last + 2(rest))

Answer 72

Concave: **underestimate** because the tops of the trapeziums will always be **below** the curve Convex: **overestimate** because the tops of the trapeziums will always be **above** the curve Mixture of the two: we can’t say

Answer 73

1) P(A∩B) = P(A) x P(B) 2) P(A|B) = P(A) 3) P(B|A) = P(B) All derived from same equation as equation 1 rearranges to P(A∩B)/P(B) = P(A) which rearranges to the following 2 equations

Probability Flashcards

(99 cards)