What is Data Mining?

The fraud examiner is often faced with the task
of sifting through thousands or even millions of
transactions in order to identify symptoms,
indicators and evidence of fraud or malpractice.
Data mining (and we will also include data matching methods in this category)
assists the fraud examiner by using computerised techniques to analyse electronically
held data to "filter" these voluminous databases of information
and to graphically represent fraudulent transactions. As well as being an
effective tool for the detection and investigation of corporate fraud, data
mining programmes can also be used to identify potential loopholes or weaknesses
in controls and show where operational procedures should be improved and more
robust fraud prevention strategies implemented. The diagram below graphically
represents the use of data mining as a means of sifting out the patterns in
data that are indicative of fraudulent behaviour.
Data mining is not, however, a universal panacea, but is one of the important weapons in the fraud examiner's armoury. Arguably, with increasing constraints on the legal investigative methods available to the fraud examiner, the use of data mining will increase. Data mining does, of course, possess its own unique problems due to the requirements of data protection legislation and it is not recommend that data mining is performed without first obtaining legal opinion or other expert advice. See the legal section of the manual, which provides expert coverage of this area.
The fraud examiner is likely to encounter data mining techniques in a number of areas: