Hub
  • Software
  • Blog
  • Forum
  • Events
  • Documentation
  • About KNIME
  • KNIME Hub
  • Nodes
  • Spark Gradient Boosted Trees Learner (Regression)
NodeNode / Learner

Spark Gradient Boosted Trees Learner (Regression)

Tools & Services Apache Spark Mining Prediction
Drag & drop
Like
Copy short link

Gradient Boosted Trees are ensembles of Decision Trees. They iteratively train Decision Trees in order to minimize a loss function. This node uses the spark.ml Gradient Boosted Trees implementation to train a regression model in Spark. The target column must be numerical, whereas the feature columns can be either nominal or numerical.

Use the Spark Predictor (Regression) node to apply the learned model to unseen data.

Please refer to the Spark documentation for a full description of the underlying algorithm.

This node requires at least Apache Spark 2.0.

Node details

Input ports
  1. Type: Spark Data
    Input data
    Input Spark DataFrame with training data.
Output ports
  1. Type: Table
    Feature importance measures
    Table with estimates of the importance of each feature. The features are listed in order of decreasing importance and are normalized to sum up to 1.
  2. Type: Spark ML Model
    Spark ML Gradient Boosted Trees model (regression)
    Spark ML Gradient Boosted Trees model (regression)

Extension

The Spark Gradient Boosted Trees Learner (Regression) node is part of this extension:

  1. Go to item

KNIME
Open for Innovation

KNIME AG
Hardturmstrasse 66
8005 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • E-Learning course
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • KNIME Open Source Story
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more on KNIME Server
© 2022 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Credits