Sampling and search strategies for zap Cash
The AI learns from all of your judgments (red, yellow, green). There are many ways to review the results of the zap Cash data analysis. Here are the most effective ones:
We recommend judging at least 100 pairs before starting the first AI-run. In the beginning, it is advisable to evaluate a very heterogeneous sample. To do so, you should stick to the following strategy.
Always go through the top pairs first, both at the start and after each AI-run. Switch to another strategy once you no longer find any duplicate payments.
Rank 1 is the pair most likely to be a duplicate payment.
All document pairs are assigned to clusters (buckets) numbered 0 to n. The number of clusters depends on the size and complexity of the data set. Usually, there are 5 to 20 different clusters. Filter on each cluster and review at least a couple of pairs from each one before the first AI-run. This ensures that the AI sees a wide range of data.
Note that a cluster may be empty, in which case filtering on it gives you an empty list!
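The cluster strategy amounts to taking a small sample from every non-empty cluster. The following is a minimal sketch of that idea for an exported list of pairs, not zap Cash functionality: the record layout (`pair_id`, `cluster`) and the function name `sample_per_cluster` are assumptions made for illustration only.

```python
import random
from collections import defaultdict


def sample_per_cluster(pairs, per_cluster=2, seed=0):
    """Pick up to `per_cluster` pairs from every non-empty cluster."""
    rng = random.Random(seed)  # fixed seed so the sample is reproducible

    # Group the pairs by their cluster label.
    by_cluster = defaultdict(list)
    for pair in pairs:
        by_cluster[pair["cluster"]].append(pair)

    sample = []
    for cluster_id in sorted(by_cluster):
        members = by_cluster[cluster_id]
        # A small cluster may hold fewer pairs than requested.
        k = min(per_cluster, len(members))
        sample.extend(rng.sample(members, k))
    return sample


# Hypothetical data: cluster 1 is empty and simply contributes nothing.
pairs = [
    {"pair_id": 1, "cluster": 0},
    {"pair_id": 2, "cluster": 0},
    {"pair_id": 3, "cluster": 0},
    {"pair_id": 4, "cluster": 2},
    {"pair_id": 5, "cluster": 3},
    {"pair_id": 6, "cluster": 3},
]
sample = sample_per_cluster(pairs)
```

Here the sample holds five pairs: two each from clusters 0 and 3, and the single pair in cluster 2.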
zap Cash calculates, for each document pair, how much it differs from the other pairs. Consider sorting by the Outlier rank to find the most unusual cases. This technique is particularly useful because it helps the AI assess exceptions better.
Outlier rank 1 is the most unusual pair.
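How zap Cash actually computes the Outlier rank is not described here. Purely to illustrate how such a rank can work, the sketch below scores each pair by its standardized distance from the average pair and ranks the pairs so that rank 1 is the most unusual. The feature vectors, the scoring formula, and the name `outlier_ranks` are all assumptions, not the product's method.

```python
from statistics import mean, pstdev


def outlier_ranks(features):
    """Return {pair_id: rank}, where rank 1 is the most unusual pair.

    `features` maps a pair id to a list of numeric features
    (hypothetical, e.g. amount difference or days between postings).
    """
    ids = list(features)
    n_dims = len(features[ids[0]])

    # Mean and spread of every feature across all pairs.
    columns = [[features[i][d] for i in ids] for d in range(n_dims)]
    mus = [mean(col) for col in columns]
    sigmas = [pstdev(col) or 1.0 for col in columns]  # avoid dividing by 0

    # Score = sum of absolute standardized deviations from the mean.
    scores = {
        i: sum(abs(features[i][d] - mus[d]) / sigmas[d] for d in range(n_dims))
        for i in ids
    }

    ordered = sorted(ids, key=lambda i: scores[i], reverse=True)
    return {pair_id: rank for rank, pair_id in enumerate(ordered, start=1)}


# Hypothetical pairs: "D" is far away from the other three.
features = {
    "A": [0.0, 1.0],
    "B": [0.1, 1.1],
    "C": [0.0, 0.9],
    "D": [5.0, 9.0],
}
ranks = outlier_ranks(features)
```

In this example, pair "D" receives Outlier rank 1 because it deviates most from the rest.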
After using the starter strategies, you can dig deeper with the following techniques.
You may already have some hypotheses about your organization and know how duplicate payments have occurred in the past. For example, the same invoice may have been sent to both the parent company and a subsidiary. You can find such cases by filtering the pairs for unequal company codes.
Besides company code, you may check for inequality of fiscal year, assignment or reference number, reference transaction and cost center.
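Outside the application, the same hypothesis check can be expressed as a filter on field inequality. The sketch below assumes each pair has been exported as two document records; the field names (`company_code`, `fiscal_year`, `cost_center`) and the helper names are hypothetical and only illustrate the filter logic.

```python
def filter_unequal(pairs, field):
    """Keep only the pairs whose two documents differ in `field`."""
    return [(a, b) for a, b in pairs if a.get(field) != b.get(field)]


def differing_fields(pair, fields):
    """List the fields in which the two documents of a pair differ."""
    a, b = pair
    return [f for f in fields if a.get(f) != b.get(f)]


# Hypothetical pairs: the first one was booked in two company codes.
pairs = [
    (
        {"doc": "1001", "company_code": "1000", "fiscal_year": 2023, "cost_center": "K10"},
        {"doc": "2001", "company_code": "2000", "fiscal_year": 2023, "cost_center": "K10"},
    ),
    (
        {"doc": "1002", "company_code": "1000", "fiscal_year": 2022, "cost_center": "K10"},
        {"doc": "1003", "company_code": "1000", "fiscal_year": 2023, "cost_center": "K20"},
    ),
]

cross_company = filter_unequal(pairs, "company_code")
diffs = differing_fields(pairs[1], ["company_code", "fiscal_year", "cost_center"])
```

`cross_company` contains only the first pair, and `diffs` shows that the second pair differs in fiscal year and cost center.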
If you have already found a few double-paid invoices and marked them with a red exclamation mark, click the 'Show most similar pairs' button. The list is then filtered to the 20 most similar candidates. Check most of them, especially if other criteria also hold.
This technique may not improve the AI sample more than the other strategies do, but it is the fastest way to find more hits.