This workflow reads the creditcard.csv file, then trains and evaluates an Isolation Forest model that detects fraudulent transactions as outliers. The H2O Isolation Forest Predictor node produces two columns that can be used to identify outliers: the outlier score and the mean length. Here, outliers are identified by the mean length, which is the average number of random splits required to isolate a data point from the other data points; the fewer splits a point needs, the more likely it is an outlier. The threshold for the mean length is optimized using a parameter optimization loop.
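The threshold search described above can be sketched outside KNIME in a few lines of Python. This is a minimal illustration, not the workflow's actual implementation: the function names `f1_at_threshold` and `best_threshold`, the choice of F1 as the objective, and the toy values are all assumptions made for the example. The workflow itself may optimize a different metric over a different search grid.

```python
from typing import List, Sequence, Tuple


def f1_at_threshold(mean_lengths: Sequence[float],
                    is_fraud: Sequence[bool],
                    threshold: float) -> float:
    """F1 score when rows with mean length <= threshold are flagged as fraud."""
    # Short mean length = isolated in few splits = flagged as an outlier.
    predicted: List[bool] = [m <= threshold for m in mean_lengths]
    tp = sum(1 for p, y in zip(predicted, is_fraud) if p and y)
    fp = sum(1 for p, y in zip(predicted, is_fraud) if p and not y)
    fn = sum(1 for p, y in zip(predicted, is_fraud) if not p and y)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def best_threshold(mean_lengths: Sequence[float],
                   is_fraud: Sequence[bool]) -> Tuple[float, float]:
    """Brute-force search over the observed mean lengths (assumed search grid)."""
    candidates = sorted(set(mean_lengths))
    best = max(candidates,
               key=lambda t: f1_at_threshold(mean_lengths, is_fraud, t))
    return best, f1_at_threshold(mean_lengths, is_fraud, best)


# Hypothetical toy data: mean lengths as a predictor might report them,
# with known fraud labels used only for evaluation.
toy_mean_lengths = [2.1, 2.8, 6.5, 7.0, 6.9, 3.0, 7.2, 6.8]
toy_is_fraud = [True, True, False, False, False, False, False, False]

threshold, score = best_threshold(toy_mean_lengths, toy_is_fraud)
print(threshold, score)
```

Although the model is trained without labels, a search like this needs labeled transactions to score each candidate threshold, so the labels enter only at the evaluation step.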
Workflow
Fraud Detection by Unsupervised Learning
Used extensions & nodes
All required extensions are part of the default installation of KNIME Analytics Platform version 4.5.2