Sparkling predictions and encoded labels - "the poor man's ML Ops" (on a Big Data System)
Use Big Data Technologies like Spark to ge a robust and scalable data preparation. Use the latest Auo ML technology like H2O.ai AutoML to cretae a robust model and deploy it in a Big Data environment (like Cloudera)
s_601 - prepare label encoding with spark
prepare the preparation of data in a big data environment
- label encode string variables
- transform numbers into Double format (Spark ML likes that)
- remove highly correlated data
- remove NaN variables
- remove continous variables
External resources
Used extensions & nodes
Created with KNIME Analytics Platform version 4.6.0
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
- Go to item
Loading deployments
Loading ad hoc executions
Legal
By using or downloading the workflow, you agree to our terms and conditions.
Discussion
Discussions are currently not available, please try again later.