detects control failures and
routine misstatements/ weak
for detecting rare fraud
sampling
____ = using data, usually
through the mining of those
data, to identify patterns,
annomolies to find possible fraud symptoms
Data driven fraud detection
- proactive
- hypothesis driven
- full population focused
test the entire dataset
full population analysis
6 steps to the
Data Analysis Process
Fraud Symptom categories
(part of step 3)
the primary advantage of
the data drive approach is
the investigator takes charge
of the fraud investigation process
the primary drawback to
the data driven approach is that
it can be more expensive and time
intensive that the traditional approach
accurately predicts for many kinds
of financial data that the first digits
of each group of numbers in a set
of random numbers will conform
to the predicted distribution pattern
Benfords law
- invoce amounts/
expense reports/transaction totals
outliers identify cases
that do not match the norm
- how far outside the norm is the case
Z-score
* -1-1 = 68% of population
* -2-2 = 95%
* -3-3 = 97.5%
the splitting of complex
data sets into groupings
stratification
(fraud hides in categories:
by vendor/by buyer, etc.
a technique that produces
a single number that summarizes
each group
time trend analysis
- fraud escalates
detects near identical names
fuzzy matching
- uncocer shell companies/
related party vendors
(real-time)
modern systems can