This node applies association rules created by a Spark rule learner*. The rules are applied to a given column with item sets (tra…
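As a rough illustration of what applying association rules to an item set can look like, here is a minimal plain-Python sketch. It is not the node's or Spark's implementation. It assumes a rule fires when all of its antecedent items occur in the item set, and that only consequent items not already present are reported. The function name `apply_rules` and the sample rules are hypothetical.

```python
def apply_rules(rules, item_set):
    """Return consequent items of every rule whose antecedent is contained in item_set."""
    items = set(item_set)
    predicted = []
    for antecedent, consequent in rules:
        # A rule is applicable only if all antecedent items are present.
        if set(antecedent) <= items:
            for item in consequent:
                # Skip items already in the item set and avoid duplicates.
                if item not in items and item not in predicted:
                    predicted.append(item)
    return predicted


# Hypothetical rules as (antecedent, consequent) pairs.
rules = [
    (["bread"], ["butter"]),
    (["bread", "butter"], ["jam"]),
    (["milk"], ["bread"]),
]
```

Calling `apply_rules(rules, ["bread", "butter"])` checks each rule in turn and collects the consequents of the applicable ones.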
This rule learner* uses Spark MLlib to compute frequent item sets and then extract association rules from the given input data. A…
This node assigns new data to an existing set of prototypes, which are obtained e.g. by a k-means clustering. Each data point is …
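The assignment step can be sketched in plain Python as follows. This is an illustration only, under the assumption that "nearest" means smallest Euclidean distance. The function name `assign_to_prototypes` and the sample data are hypothetical and not part of the node.

```python
import math


def assign_to_prototypes(points, prototypes):
    """Return, for each point, the index of its closest prototype."""
    assignments = []
    for point in points:
        # Euclidean distance from this point to every prototype (assumed metric).
        distances = [math.dist(point, proto) for proto in prototypes]
        assignments.append(distances.index(min(distances)))
    return assignments


# Two hypothetical prototypes, e.g. cluster centers from a k-means run.
prototypes = [(0.0, 0.0), (10.0, 10.0)]
points = [(1.0, 1.0), (9.0, 8.0), (5.0, 4.0)]
assignments = assign_to_prototypes(points, prototypes)
```

Ties are resolved in favor of the prototype with the lower index, which is a choice made for this sketch.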
This node utilizes the Apache Spark collaborative filtering implementation. Notice: The matrix factorization model contains refer…
This node runs the Java code generated by the PMML Compiler on Apache Spark.
This node uses the spark.ml Decision Tree implementation to train a Decision Tree classification model in Spark. The underlying a…
This node applies the Apache Spark Decision / Regression Tree algorithm. Please note that all data must be numeric, including the…
This node uses the spark.ml implementation to train a regression model in Spark. The underlying algorithm performs a recursive bi…
Scorer for clustering results given a reference clustering. Connect the Spark DataFrame/RDD containing a column with the referenc…
This node uses Spark MLlib to compute frequent item sets. See the Spark Association Rule Learner node to generate frequent item s…
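To make the notion of a frequent item set concrete, the following plain-Python sketch counts, by brute force, how many transactions contain each candidate item set and keeps those that reach a minimum support. It is an illustration only and does not reproduce the algorithm Spark MLlib uses. The names `frequent_item_sets`, `min_support`, and `max_size` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def frequent_item_sets(transactions, min_support, max_size=2):
    """Map each item set (as a sorted tuple) to its count if it meets min_support."""
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))
        # Enumerate every subset of this transaction up to max_size items.
        for size in range(1, max_size + 1):
            for combo in combinations(items, size):
                counts[combo] += 1
    # min_support is a fraction of the total number of transactions.
    threshold = min_support * len(transactions)
    return {combo: n for combo, n in counts.items() if n >= threshold}


transactions = [["a", "b"], ["a", "c"], ["a", "b", "c"], ["b"]]
frequent = frequent_item_sets(transactions, min_support=0.5)
```

Brute-force enumeration grows quickly with transaction length, which is why the sketch caps the item set size.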
Gradient Boosted Trees are ensembles of Decision Trees. Learning a Gradient Boosted Trees model means training a sequence of Deci…
Spark Gradient Boosted Trees Learner (Regression)

Gradient Boosted Trees are ensembles of Decision Trees. They iteratively train Decision Trees in order to minimize a loss functio…
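The iterative idea can be sketched in plain Python for a single numeric feature. This is a simplified illustration, not the spark.ml implementation: it assumes squared-error loss, uses one-split trees (stumps), and fits each new stump to the residuals of the current ensemble. The names `fit_stump`, `fit_gbt`, `n_trees`, and `learning_rate` are hypothetical.

```python
def fit_stump(xs, residuals):
    """Fit a one-split regression tree to the residuals (needs at least two distinct x values)."""
    best = None
    for threshold in sorted(set(xs)):
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        if not left or not right:
            continue
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        # Squared error of predicting the mean on each side of the split.
        error = sum((r - left_mean) ** 2 for r in left)
        error += sum((r - right_mean) ** 2 for r in right)
        if best is None or error < best[0]:
            best = (error, threshold, left_mean, right_mean)
    _, threshold, left_mean, right_mean = best
    return lambda x: left_mean if x <= threshold else right_mean


def fit_gbt(xs, ys, n_trees=20, learning_rate=0.5):
    """Train stumps in sequence, each on the residuals of the ensemble so far."""
    base = sum(ys) / len(ys)
    trees = []
    predictions = [base] * len(xs)
    for _ in range(n_trees):
        residuals = [y - p for y, p in zip(ys, predictions)]
        tree = fit_stump(xs, residuals)
        trees.append(tree)
        predictions = [p + learning_rate * tree(x) for p, x in zip(predictions, xs)]
    return lambda x: base + learning_rate * sum(tree(x) for tree in trees)


model = fit_gbt([1, 2, 3, 4], [1.0, 1.0, 5.0, 5.0])
```

With squared-error loss the residuals equal the negative gradient of the loss, so fitting each stump to them is one step toward minimizing that loss.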
This node applies the Apache Spark Gradient-Boosted Trees (GBTs) algorithm. Note: GBTs do not yet support multiclass classificati…
This node uses the spark.ml linear regression implementation to train a linear regression model in Spark, supporting different re…
This node applies the Apache Spark Linear Regression algorithm. It outputs the learned model for later application. Please note t…
This node applies the Apache Spark Linear SVM algorithm. It outputs the learned model for later application. Please note that…
This node uses the spark.ml logistic regression implementation to train a logistic regression model in Spark, supporting differen…
This node applies the Apache Spark Logistic Regression algorithm. It outputs the learned model for later application. Please …
Converts supported Spark MLlib models to PMML models.
This node applies the Apache Spark Naive Bayes algorithm. It outputs the original data and the Naive Bayes predictions for the re…

