Categorical data analysis Flashcards

Question 1

Q

What is a proportion in categorical data?

Answer

A

The number with a characteristic divided by the total number of individuals (p = x/n).

Question 2

Q

Which distribution describes probability for proportions?

Answer

A

The binomial distribution, which can be approximated by a normal distribution when n is large.

Question 3

Q

What is the formula for the standard error of a proportion?

Answer

A

SE = √(p(1−p)/n)
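As a quick check of the formula, here is a minimal Python sketch. The function name `standard_error` and the counts (30 of 100) are made up for illustration.

```python
import math

def standard_error(x, n):
    """Standard error of a sample proportion p = x / n."""
    p = x / n
    return math.sqrt(p * (1 - p) / n)

# Hypothetical sample: 30 of 100 individuals have the characteristic.
se = standard_error(30, 100)
print(round(se, 4))
```

Note that the whole fraction p(1−p)/n sits under the square root, so SE shrinks as n grows.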

Question 4

Q

What is a 95% confidence interval for a proportion?

Answer

A

p ± 1.96 × SE
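A minimal sketch of this interval in Python, assuming the normal approximation holds (large n). The name `proportion_ci95` and the example counts are hypothetical.

```python
import math

def proportion_ci95(x, n):
    """95% confidence interval for a proportion, using p ± 1.96 × SE."""
    p = x / n
    se = math.sqrt(p * (1 - p) / n)  # standard error of the proportion
    margin = 1.96 * se
    return p - margin, p + margin

# Hypothetical sample: 30 of 100 individuals have the characteristic.
low, high = proportion_ci95(30, 100)
print(round(low, 3), round(high, 3))
```

For small n, or p close to 0 or 1, this approximation can be poor and the interval may even extend outside 0 to 1.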

Question 5

Q

What does a contingency table do?

Answer

A

Summarizes data for two or more categorical variables and forms the basis for Chi-squared testing.

Question 6

Q

How is the expected frequency (E) in a contingency table calculated?

Answer

A

E = (row total × column total) / grand total
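The calculation can be sketched in Python for a table stored as a list of rows. The function name and the observed counts below are illustrative, not from the deck.

```python
def expected_frequencies(table):
    """Expected count for each cell: row total × column total / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]

# Hypothetical 2×2 table of observed counts.
observed = [[10, 20],
            [30, 40]]
expected = expected_frequencies(observed)
print(expected)
```

The expected table has the same row and column totals as the observed one.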

Question 7

Q

When should you use a Chi-squared test vs Fisher’s exact test?

Answer

A

Chi-squared: large samples (all expected frequencies ≥5). Fisher’s exact: small samples (any expected frequency <5).
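One way to apply this rule of thumb is to compute the expected frequencies first and check the smallest one. The sketch below assumes a table stored as a list of rows. The helper names and example tables are hypothetical, and it only computes the Chi-squared statistic, not a p-value.

```python
def expected_frequencies(table):
    """Expected count per cell: row total × column total / grand total."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]

def choose_test(table):
    """Rule of thumb: Chi-squared if every expected count is at least 5."""
    smallest = min(min(row) for row in expected_frequencies(table))
    return "chi-squared" if smallest >= 5 else "fisher"

def chi_squared_statistic(table):
    """Sum of (observed - expected)^2 / expected over all cells."""
    expected = expected_frequencies(table)
    return sum((o - e) ** 2 / e
               for obs_row, exp_row in zip(table, expected)
               for o, e in zip(obs_row, exp_row))

large = [[10, 20], [30, 40]]  # hypothetical counts, all expected >= 5
small = [[1, 9], [8, 2]]      # hypothetical counts, smallest expected is 4.5
print(choose_test(large), choose_test(small))
print(round(chi_squared_statistic(large), 4))
```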

Question 8

Q

What is the difference between odds and risk?

Answer

A

Odds: probability of the outcome divided by the probability of the alternative, p/(1−p). Risk: number with the outcome divided by all outcomes, p.
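A small Python sketch shows how the two differ for the same counts. The function names and the counts (20 events among 100 people) are made up for illustration.

```python
def risk(events, total):
    """Risk: those with the outcome divided by everyone."""
    return events / total

def odds(events, total):
    """Odds: those with the outcome divided by those without it."""
    return events / (total - events)

# Hypothetical group: 20 of 100 people have the outcome.
print(risk(20, 100))  # 20 / 100
print(odds(20, 100))  # 20 / 80
```

Odds and risk are close when the outcome is rare and drift apart as it becomes common.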

Question 9

Q

What does an odds ratio represent?

Answer

A

The odds of an outcome in one group divided by the odds in another group. OR > 1 means higher odds, OR < 1 means lower odds, and OR = 1 means no difference.
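For a 2×2 table this can be sketched in Python as below. The cell labels a, b, c, d and the counts are assumptions for illustration. The relative risk is computed alongside to show that the two measures are not the same number.

```python
def odds_ratio(a, b, c, d):
    """Odds ratio for a 2×2 table.

    a, b: group 1 with / without the outcome
    c, d: group 2 with / without the outcome
    """
    return (a / b) / (c / d)

def relative_risk(a, b, c, d):
    """Ratio of risks, for comparison with the odds ratio."""
    return (a / (a + b)) / (c / (c + d))

# Hypothetical counts: 20 of 100 in group 1 and 10 of 100 in group 2 have the outcome.
print(round(odds_ratio(20, 80, 10, 90), 2))
print(round(relative_risk(20, 80, 10, 90), 2))
```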

Question 10

Q

What does correlation measure?

Answer

A

The degree of association between two paired variables.

Question 11

Q

What is the Pearson correlation coefficient (r)?

Answer

A

Measures how close data points lie to a straight line (strength and direction of a linear relationship), on a scale from −1 to +1.

Question 12

Q

When should you use Spearman’s rank correlation instead of Pearson’s?

Answer

A

When data are not normally distributed, are ordinal, or the relationship is monotonic but not linear.
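The difference can be seen in a minimal Python sketch, where Spearman's coefficient is computed as Pearson's r on the ranks (tied values get their average rank). The function names and the data (y = x³) are hypothetical.

```python
import math

def pearson(xs, ys):
    """Pearson correlation coefficient r."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def ranks(values):
    """Ranks starting at 1; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result

def spearman(xs, ys):
    """Spearman's rank correlation: Pearson's r applied to the ranks."""
    return pearson(ranks(xs), ranks(ys))

xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]  # monotonic but not linear
print(round(pearson(xs, ys), 3), round(spearman(xs, ys), 3))
```

Because y always increases with x, the rank correlation is perfect even though the points do not lie on a straight line.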

(12 cards)