The workflow demonstrates use of autofeat generator in classification tasks. A number of non-linear features are generated by the autofeat library. Upper panel, Random Forest, model is built using only the generated (and not the existing) features. The performance of model is comparable to the performance of model with existing features in the lower panel.
This opens the way for building stacked models with two groups of features--existing and generated--to improve the overall predictive performance.
Dataset used is Health Insuarnce Cross-sell data from Kaggle
Workflow
Classification using 'autofeat' Engineered Features
External resources
Used extensions & nodes
Created with KNIME Analytics Platform version 4.4.1
Legal
By using or downloading the workflow, you agree to our terms and conditions.