H2O.ai AutoML (wrapped with Python) in KNIME for regression problems - a powerful auto-machine-learning framework (https://hub.knime.com/mlauber71/spaces/Public/latest/automl/)
v 1.25 - https://forum.knime.com/t/h2o-ai-automl-in-knime-for-regression-problems/20924
It features various models like Random Forest or XGBoost along with Deep Learning. It has warppers for R and Python but also could be used from KNIME. The results will be written to a folder and the models will be stored in MOJO format to be used in KNIME (as well as on a Big Data cluster via Sparkling Water). One major parameter to set is the running time the model has to test various models and do some hyper parameter optimization as well. The best model of each round is stored and some graphics are produced to see the results.
To run this workflow you have to install Python and H2O.ai as well as R and several packages. Please refer to the green box on the right.
The results may be used also on Big Data clusters with the help of H2O.ai Sparkling Water (https://hub.knime.com/mlauber71/spaces/Public/latest/kn_example_h2o_sparkling_water)
External resources
- Kaggle - House Prices: Advanced Regression Techniques
- Downloading & Installing H2O
- KNIME, Python and Anaconda - the short story
- KNIME Python Integration Installation Guide
- Profile mlauber71
- A Deep dive into H2O’s AutoML
- Combine Big Data, Spark and H2O.ai Sparkling Water
- KNIME Interactive R Statistics Integration Installation Guide
- H2O.ai AutoML in KNIME for regression problems
- Meta Collection about KNIME and Python
Used extensions & nodes
Created with KNIME Analytics Platform version 4.3.1
- Go to item
KNIME H2O Machine Learning Integration - MOJO Extension
KNIME AG, Zurich, Switzerland
Version 4.3.1
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
Loading deployments
Loading ad hoc executions
Legal
By using or downloading the workflow, you agree to our terms and conditions.
Discussion
Discussions are currently not available, please try again later.