This node applies an Isolation Forest model to an input dataset in order to predict anomalies or outliers. The output of the node will consist of the input and, depending on the settings, one or two appended columns. One is the prediction which contains normalized anomaly score. The higher the score, the more likely it is an anomaly. The other (optionally) appended column contains the mean length of the predicted decision tree paths of each observation. The shorter, the more likely it is an anomaly.
Important note: All columns which have been used for training the model must be present in the incoming H2O frame as well.
- Type: H2O ModelH2O Isolation Forest model.H2O Isolation Forest model, e.g. an Isolation Forest model.
- Type: H2O FrameH2O input data frame.H2O frame with data that is predicted