Skip to main content

Build Fraud Detection Model

Premium

Say you are at Stripe and working on a fraud detection model for online transactions to the websites of small businesses. Fraud means that the credit card usage was either erroneous (and the user will refute the transaction) or invalid (someone stole a credit card and used it, and there will be a dispute). How would you build such a model, and how would you evaluate the model? What trade-offs are involved in the evaluation?

First you need to define the features and the target of interest to decide what kind of model to run.

Then you want to think about the relevant evaluation metrics for this type of model based on real-life outcomes to the business.

Finally, based on those real-life outcomes, weigh in the trade-offs in trying to maximize or minimize certain metrics.

How does the model work? Is it providing a continuous or categorical output?

Do your evaluation metrics tie into real-life business outcomes?

What trade-offs are being evaluated based on the evaluation metrics?