This workflow mixes standard KNIME nodes with the Spark nodes to find the optimal parameters for a k-means clustering using the hillclimbing approach. Other optimization strategies are available - check the Parameter Optimization Loop Start Node description for more. The workflow makes use of the Create Local Big Data Environment node to create a Spark context. You can swap this node out for a Create Spark Context (Livy) node to connect to a remote cluster.
Used extensions & nodes
Created with KNIME Analytics Platform version 4.5.0
By using or downloading the workflow, you agree to our terms and conditions.
Discussions are currently not available, please try again later.