why do bugs still escape in-house testing and analysis tools?
monitor deployed code:
online: collect info from user runs
offline: company analyzes it
two key questions
how do we get it?
what do we do with it?
practical challenges
approach to statistical debugging
how do we decide to examine or ignore each instrumented site?
randomly
independently
dynamically
for fairness and accurate picture or rare events
finding causes of bugs
tracking context
a useful measure: increase()