Olivier Caelen scite author profile

In the last decade, the ease of online payment has opened up many new opportunities for e-commerce, lowering the geographical boundaries for retail. While e-commerce is still gaining popularity, it is also the playground of fraudsters who try to misuse the transparency of online purchases and the transfer of credit card records. This paper proposes APATE, a novel approach to detect fraudulent credit card transactions conducted in online stores. Our approach combines (1) intrinsic features derived from the characteristics of incoming transactions and the customer spending history using the fundamentals of RFM (Recency-Frequency-Monetary); and (2) network-based features by exploiting the network of credit card holders and merchants and deriving a time-dependent suspiciousness score for each network object. Our results show that intrinsic and network-based features are two strongly intertwined sides of the same picture. The combination of these two types of features leads to the best performing models, which reach AUC scores higher than 0.98.
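As a rough illustration of the intrinsic RFM features mentioned in the abstract above, the sketch below computes recency, frequency and monetary values from one card holder's past transactions. The function name `rfm_features`, the `(timestamp, amount)` layout and the 30-day window are hypothetical choices made for this example; they are not taken from the APATE paper.

```python
from datetime import datetime, timedelta

def rfm_features(history, now, window_days=30):
    """Compute simple RFM features for one card holder.

    history: list of (timestamp, amount) tuples (assumed layout).
    now: timestamp of the incoming transaction.
    window_days: look-back window in days (assumed value).
    """
    cutoff = now - timedelta(days=window_days)
    # Keep only past transactions that fall inside the look-back window.
    recent = [(t, a) for t, a in history if cutoff <= t < now]
    if not recent:
        return {"recency_days": None, "frequency": 0, "monetary": 0.0}
    last = max(t for t, _ in recent)
    return {
        # Recency: days elapsed since the most recent transaction.
        "recency_days": (now - last).total_seconds() / 86400.0,
        # Frequency: number of transactions in the window.
        "frequency": len(recent),
        # Monetary: average amount spent in the window.
        "monetary": sum(a for _, a in recent) / len(recent),
    }
```

Features of this kind would typically be computed per incoming transaction and fed to a classifier together with any network-based scores.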

Sequence classification for credit-card fraud detection

Jurgovsky

Granitzer

Ziegler

et al. 2018

Expert Systems with Applications

266

135

Combining unsupervised and supervised learning in credit card fraud detection

Carcillo

Borgne

Caelen

et al. 2021

Information Sciences

217

Credit card fraud detection and concept-drift adaptation with delayed supervised information

Pozzolo

Boracchi

Caelen

et al. 2015

Most fraud-detection systems (FDSs) monitor streams of credit card transactions by means of classifiers returning alerts for the riskiest payments. Fraud detection is notably a challenging problem because of concept drift (i.e. customers' habits evolve) and class unbalance (i.e. genuine transactions far outnumber frauds). Also, FDSs differ from conventional classification because, in a first phase, only a small set of supervised samples is provided by human investigators who have time to assess only a reduced number of alerts. Labels of the vast majority of transactions are made available only several days later, when customers have possibly reported unauthorized transactions. The delay in obtaining accurate labels and the interaction between alerts and supervised information have to be carefully taken into consideration when learning in a concept-drifting environment. In this paper we address a realistic fraud-detection setting and we show that investigators' feedback and delayed labels have to be handled separately. We design two FDSs on the basis of an ensemble and a sliding-window approach and we show that the winning strategy consists in training two separate classifiers (on feedback and delayed labels, respectively), and then aggregating the outcomes. Experiments on a large dataset of real-world transactions show that the alert precision, which is the primary concern of investigators, can be substantially improved by the proposed approach.
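The aggregation step described in the abstract above can be sketched as follows. The abstract only states that the outcomes of the two classifiers are aggregated; the weighted average, the default `alpha=0.5`, and the names `aggregate_scores` and `top_k_alerts` are assumptions made for this example, not the authors' implementation.

```python
def aggregate_scores(p_feedback, p_delayed, alpha=0.5):
    """Combine two fraud probabilities with a weighted average (assumed rule).

    p_feedback: score from the classifier trained on investigator feedback.
    p_delayed: score from the classifier trained on delayed labels.
    alpha: weight given to the feedback classifier (assumed value).
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    return alpha * p_feedback + (1.0 - alpha) * p_delayed

def top_k_alerts(transactions, score_feedback, score_delayed, k, alpha=0.5):
    """Return the k transactions with the highest aggregated score.

    score_feedback and score_delayed are hypothetical callables mapping a
    transaction to a fraud probability.
    """
    scored = [
        (aggregate_scores(score_feedback(tx), score_delayed(tx), alpha), tx)
        for tx in transactions
    ]
    # Highest aggregated risk first; only the top k become alerts.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [tx for _, tx in scored[:k]]
```

Limiting the output to `k` alerts mirrors the constraint that investigators can only review a small number of transactions, which is why alert precision is the metric of interest.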

A Bayesian interpretation of the confusion matrix

Caelen

2017

Ann Math Artif Intell

114

SCARFF: A scalable framework for streaming credit card fraud detection with Spark

et al. 2018

The expansion of electronic commerce, together with the increasing confidence of customers in electronic payments, makes fraud detection a critical factor. Detecting frauds in a (nearly) real-time setting demands the design and implementation of scalable learning techniques able to ingest and analyse massive amounts of streaming data. Recent advances in analytics and the availability of open source solutions for Big Data storage and processing open new perspectives for the fraud detection field. In this paper we present a SCAlable Real-time Fraud Finder (SCARFF) which integrates Big Data tools (Kafka, Spark and Cassandra) with a machine learning approach which deals with imbalance, nonstationarity and feedback latency. Experimental results on a massive dataset of real credit card transactions show that this framework is scalable, efficient and accurate over a big stream of transactions.
