Signal Detection theory:
hit: correct answer»_space; when signal is present and decision is yes
miss: wrong answer»_space; signal is present and decision is no
false alarm: wrong answer»_space; signal is absent and decision is yes
correct reject: correct answer»_space; signal is absent and decision is no
internal response
variable/value that forms basis of observer’s decision (x axis)
criterion on a signal present/absent graph
- false alarm? correct reject? hit? miss?
accuracy equation
(#present%hits + #absent%CR)/total**
**total = present + absent
how to increase accuracy (2)
- how good is your accuracy? comparison
Why could peak accuracy be greater in a 20% present, 80% absent case compared to a 50/50 case?
why would you change the criterion? (3)
calculating total cost (money wasted by incorrect responses)
present%misscost + #absent%FAcost
Discriminability + reducing errors (2)
Cohen’s d (d’)
d’ = separation/spread = (u2-u1)/sigma
worst case scenario: d’ = 0»_space; no separation = no information
parameter vs statistic
parameter: true value of quantity in popn
statistic: value of the same quantity based on a sample (statistic used to estimate parameter)
u vs M
- accuracy or precision?
u = population mean
M = sample mean»_space; unbiased estimator of u
- unbiased = accuracy, not precision
sigma^2 vs s^2
sigma^2 = popn variance s^2 = sample variance >> unbiased estimator of sigma^2
Gaussian Distribution
Characteristics:
Gaussian Distribution:
within:
1 SD: 68%
2 SD: 95%
3 SD: 99%
Uniform Distribution
Poisson Distribution
- few events vs many events
z-score
“the standard normal”
distribution of z scores
- M = 0 and SD = 1 (same for t- score!)
percentile rank
percent measurements of score value in the distribution below that value (eg. a score in the 99th percentile = 99% of all scores are below)
if it’s gaussian we can calculate based on z-score (eg. z=1»_space; one SD from the mean (M = 50)»_space; add 34 to 50 = 84%)
how to calculate percentile from standard normal distribution table
Sir Francis Galton: CLT
- rule of thumb
central limit theorem: if x is the sum of identically distributed (uniform) variables, with a non-zero SD, then the distribution of x will approach gaussian
effect size
- small, med, large
describes relationship among variables in terms of size/amt/strength»_space; descriptive»_space; shows extent to which results are meaningful
effect size based on Cohen’s d:
small: 0.2
med: 0.5
large: 0.8
purposes of inferential stats (2)
parameter estimation: estimate value of population parameter based on random sample
hypothesis testing: whether effect occurred by chance or not (probability)