This workflow reads in the creditcard.csv file and trains and evaluates an Isolation Forest model that detects fraudulent transactions as outliers. The H2O Isolation Forest Predictor node produces two columns that can be used to identify outliers: outlier score and mean length. Here we identify outliers based on the mean length, which is the average number of random splits required to isolate a data point from the other data points. The threshold for the mean length is optimized using a parameter optimization loop.
Fraud Detection by Unsupervised Learning
Used extensions & nodes
All required extensions are part of the default installation of KNIME Analytics Platform version 4.5.2
No known nodes available
Loading ad hoc executions
By using or downloading the workflow, you agree to our terms and conditions.
Discussions are currently not available, please try again later.