Hub
Pricing About
  • Software
  • Blog
  • Forum
  • Events
  • Documentation
  • About KNIME
  • KNIME Community Hub
  • Nodes
  • Gradient Boosted Trees Learner
NodeNode / Learner

Gradient Boosted Trees Learner

Analytics Mining Decision Tree Ensemble Gradient Boosting Classification
Drag & drop
Like
Copy short link

Learns Gradient Boosted Trees with the objective of classification. The algorithm uses very shallow regression trees and a special form of boosting to build an ensemble of trees. The implementation follows the algorithm in section 4.6 of the paper "Greedy Function Approximation: A Gradient Boosting Machine" by Jerome H. Friedman (1999). For more information you can also take a look at this .

The used base learner for this ensemble method is a simple regression tree as it is used in the Tree Ensemble , Random Forest and Simple Regression Tree nodes. Per default a tree is build using binary splits for numeric and nominal attributes (the later can be changed to multiway splits). The built-in missing value handling tries to find the best direction for missing values to go to by testing each possible direction and selecting the one yielding the best result (i.e. largest gain).

In a regression tree the predicted value for a leaf node is the mean target value of the records within the leaf. Hence the predictions are best (with respect to the training data) if the variance of target values within a leaf is minimal. This is achieved by splits that minimize the sum of squared errors in their respective children.

Sampling

This node allows to perform row sampling (bagging) and attribute sampling (attribute bagging) similar to the random forest* and tree ensemble nodes. If sampling is used this is usually referred to as Stochastic Gradient Boosted Trees . The respective settings can be found in the Advanced Options tab.


(*) RANDOM FORESTS is a registered trademark of Minitab, LLC and is used with Minitab’s permission.

Node details

Input ports
  1. Type: Table
    Input Data
    The data to learn from. It must contain at least one nominal target column and either a fingerprint (bit/byte/double vector) column or another numeric or nominal column.
Output ports
  1. Type: Gradient Boosting Model
    Gradient Boosted Trees Model
    The trained model.

Extension

The Gradient Boosted Trees Learner node is part of this extension:

  1. Go to item

Related workflows & nodes

  1. Go to item
    Just Knime it- 024
    Just Knime it-24 Just Knime it
    Just Knime it-24
    anilks > Just Knime It Challenges > Just Knime It Challenge > Just Knime it-024
    anilks
  2. Go to item
    Challange 24
    Transfer-Solutions Justknimeit-24
    Here is a solution for the "Just KNIME it" challange 24.
    rubendg > Just KNIME IT > Challange 24
    rubendg
  3. Go to item
    Random Forest, Gradient Boosted Trees, and TreeEnsemble
    Classification Machine learning Prediction
    +11
    This workflow solves a binary classification problem on the adult dataset using more adva…
    knime > Academic Alliance > Guide to Intelligent Data Science > Example Workflows > Chapter9 > 04_TreeEnsemble
    knime
  4. Go to item
    KNIME_project3
    huzaifa_17738 > Public > KNIME_project3
    huzaifa_17738
  5. Go to item
    hospital_death (4)
    zainab99 > Public > hospital_death (4)
    zainab99
  6. Go to item
    ensemble models demo
    ADILT22 > Course Material - Codefree Introduction to Data Science > VL10 - Ensemble > Workflows > ensemble models demo
    ADILT22
  7. Go to item
    Ensemble methods
    Classification Random forest Gradient boosted trees
    +7
    Ensembles: binary classification of house ranking (high/low rank). - Random forest - Grad…
    knime > Academic Alliance > Guide to Intelligent Data Science > Exercises > Chapter9_Ensemble_Methods > Ensemble_Solution
    knime
  8. Go to item
    Boosted trees on Titanic dataset
    Classification Machine learning Prediction
    +7
    Training boosted trees
    sebastian_sauer > Public > titanic-boostedtrees
    sebastian_sauer
  9. Go to item
    20220707 Pikairos Machine learning based on signals or sets of descriptors
    Some Machine Learning nodes in KNIME directly accept either fingerprints, Bit Vectors and…
    pikairos > Public > 20220707 Pikairos Machine learning based on signals or sets of descriptors
    pikairos
  10. Go to item
    Challange 25
    Transfer-Solutions Justknimeit-25
    Here is a solution for the "Just KNIME it" challange 25.
    rubendg > Just KNIME IT > Challange 25
    rubendg

No known nodes available

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • E-Learning course
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • KNIME Open Source Story
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more on KNIME Business Hub
© 2023 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Credits