SRM
What does “Sample Ratio Mismatch” mean? || Actual bucket sizes differ significantly from the intended experiment split (e.g., expected 50/50, observed 60/40).
Why is SRM dangerous in A/B tests?
It invalidates the entire experiment because assignment or logging is broken; results cannot be trusted.
How do you detect SRM?
Run a chi-square goodness-of-fit test comparing observed vs expected bucket counts.
If SRM exists, should you continue analyzing results?
No — stop immediately. Results are invalid.
Top cause of SRM #1
Randomization bugs (bad hashing, split logic errors).
Top cause of SRM #2
Missing or incomplete logs (assignment logs fail, dropped events, ETL skips).
Top cause of SRM #3
Eligibility filters applied after assignment (geo/device/user type filtering).
Top cause of SRM #4
Platform issues (caching, sticky sessions, version mismatches).
What is a realistic tiny imbalance that is not SRM?
Small natural variation (e.g. 50.3% vs 49.7%) — chi-square will confirm it’s random.
In a 70/30 test with observed 80/20, is this SRM?
Yes — extreme imbalance, fails chi-square, experiment invalid.