H2O.ai AutoML (wrapped with Python) in KNIME for regression problems - a powerful auto-machine-learning framework (https://hub.knime.com/mlauber71/spaces/Public/latest/automl/)
v 1.25 - https://forum.knime.com/t/h2o-ai-automl-in-knime-for-regression-problems/20924
It features various models like Random Forest or XGBoost along with Deep Learning. It has warppers for R and Python but also could be used from KNIME. The results will be written to a folder and the models will be stored in MOJO format to be used in KNIME (as well as on a Big Data cluster via Sparkling Water). One major parameter to set is the running time the model has to test various models and do some hyper parameter optimization as well. The best model of each round is stored and some graphics are produced to see the results.
To run this workflow you have to install Python and H2O.ai as well as R and several packages. Please refer to the green box on the right.
The results may be used also on Big Data clusters with the help of H2O.ai Sparkling Water (https://hub.knime.com/mlauber71/spaces/Public/latest/kn_example_h2o_sparkling_water)
External resources
- Meta Collection about KNIME and Python
- H2O.ai AutoML in KNIME for regression problems
- KNIME Interactive R Statistics Integration Installation Guide
- Combine Big Data, Spark and H2O.ai Sparkling Water
- A Deep dive into H2O’s AutoML
- Profile mlauber71
- KNIME Python Integration Installation Guide
- KNIME, Python and Anaconda - the short story
- Downloading & Installing H2O
- Kaggle - House Prices: Advanced Regression Techniques
Used extensions & nodes
Created with KNIME Analytics Platform version 4.3.1
- Go to item
KNIME H2O Machine Learning Integration - MOJO Extension
KNIME AG, Zurich, Switzerland
Version 4.3.1
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
Legal
By using or downloading the workflow, you agree to our terms and conditions.