Fraud Detection by Unsupervised Learning

This workflow reads the creditcard.csv file, then trains and evaluates an Isolation Forest model that detects fraudulent transactions as outliers. The H2O Isolation Forest Predictor node produces two columns that can be used to identify outliers: the outlier score and the mean length. Here we identify outliers by the mean length, which is the average number of random splits, taken over all trees in the forest, required to isolate a data point from the other data points. Outliers are easier to isolate, so a shorter mean length indicates a more likely fraud. The threshold on the mean length is optimized using a parameter optimization loop.
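To make the idea concrete, here is a minimal Python sketch of the same technique outside KNIME. It is not the workflow's implementation or the H2O algorithm: the data is synthetic rather than creditcard.csv, the tree builder is a simplified isolation tree without the usual leaf-size correction, and the choice of F1 as the optimization objective is an assumption, since the description does not say which metric the optimization loop uses. All function and variable names are illustrative.

```python
import random

rng = random.Random(0)  # fixed seed so the sketch is reproducible

# Synthetic stand-in for creditcard.csv (assumption): 200 "normal" points
# clustered around the origin and 5 far-away points labelled as fraud.
normal = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(200)]
fraud = [(8.0, 8.0), (-8.0, 7.0), (7.0, -9.0), (-9.0, -8.0), (0.0, 10.0)]
data = normal + fraud
labels = [0] * len(normal) + [1] * len(fraud)


def build_tree(points, depth, max_depth):
    """Split on a random feature at a random value until points are isolated."""
    if len(points) <= 1 or depth >= max_depth:
        return {"size": len(points)}  # leaf
    dim = rng.randrange(len(points[0]))
    lo = min(p[dim] for p in points)
    hi = max(p[dim] for p in points)
    if lo == hi:
        return {"size": len(points)}  # cannot split a constant feature
    split = rng.uniform(lo, hi)
    return {
        "dim": dim,
        "split": split,
        "left": build_tree([p for p in points if p[dim] < split], depth + 1, max_depth),
        "right": build_tree([p for p in points if p[dim] >= split], depth + 1, max_depth),
    }


def path_length(tree, x):
    """Number of splits traversed before x lands in a leaf."""
    depth = 0
    while "split" in tree:
        tree = tree["left"] if x[tree["dim"]] < tree["split"] else tree["right"]
        depth += 1
    return depth


def mean_length(forest, x):
    """Average path length of x over all trees; short means easy to isolate."""
    return sum(path_length(t, x) for t in forest) / len(forest)


forest = [build_tree(data, 0, 10) for _ in range(100)]
lengths = [mean_length(forest, x) for x in data]


def f1_at(threshold):
    """F1 score when every point with mean length <= threshold is flagged."""
    tp = sum(1 for l, y in zip(lengths, labels) if l <= threshold and y == 1)
    fp = sum(1 for l, y in zip(lengths, labels) if l <= threshold and y == 0)
    fn = sum(1 for l, y in zip(lengths, labels) if l > threshold and y == 1)
    return 2 * tp / (2 * tp + fp + fn) if tp else 0.0


# Brute-force stand-in for the parameter optimization loop:
# try every observed mean length as the threshold and keep the best one.
best_threshold = max(sorted(set(lengths)), key=f1_at)
best_f1 = f1_at(best_threshold)
print(f"best threshold: {best_threshold:.2f}, F1: {best_f1:.2f}")
```

On real, heavily imbalanced data the threshold should be tuned on a validation split rather than on the data used to build the forest, and a subsample per tree is normally used instead of the full data set.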

External resources

Dataset on Kaggle
H2O Machine Learning Example Workflows
Four Techniques for Outlier Detection
Optimization Loop
Fraud Detection Using Random Forest, Neural Autoencoder, and Isolation Forest Techniques

Legal

By using or downloading the workflow, you agree to our terms and conditions.