Why it matters
Banks process millions of transactions every day. Only about 1% are fraudulent, but the cost of missing them — chargebacks, regulatory fines, lost customer trust — adds up fast. Every fraud caught in time is money that stays inside the bank.
Why it’s hard
When fraud is this rare, traditional metrics are misleading. A model that called every transaction "not fraud" would score 99% accuracy and be completely useless. The real challenge isn’t black-and-white classification, it’s ranking transactions by risk so the manual review team knows what to look at first.
How this model tackles it
Each transaction gets a fraud probability between 0 and 1, and the review team inspects the highest ones. At the default decision threshold, the model catches around 80% of fraud by flagging only the transactions above it — a balance you can tune live in the next section.
Next: how we measure whether the model keeps that promise, how it behaves as you move the decision threshold, which features carry the most weight, and three real cases scored by the deployed model.