Hub
Pricing About
NodeNode / Learner

Spark Linear Regression Learner

Tools & ServicesApache SparkMiningPrediction
Drag & drop
Like

This node uses the spark.ml linear regression implementation to train a linear regression model in Spark, supporting different regularization options. The target column must be numeric, whereas the feature columns can be either nominal or numerical.

Use the Spark Predictor (Regression) node to apply the learned model to unseen data.

Please refer to the Spark documentation for a full description of the underlying algorithm.

This node requires at least Apache Spark 2.4.

Node details

Input ports
  1. Type: Spark Data
    Input data
    Input Spark DataFrame with training data.
Output ports
  1. Type: Spark ML Model
    Spark ML linear learner model (regression)
    Spark ML linear learner model (regression)
  2. Type: Table
    Coefficients and Intercept
    Coefficients and statistics of the linear regression model.
  3. Type: Table
    Model Statistics
    Statistical measures of the learned regression model, when applied to the training dataset (R², explained variance, ...)

Extension

The Spark Linear Regression Learner node is part of this extension:

  1. Go to item

Related workflows & nodes

  1. Go to item
  2. Go to item
  3. Go to item

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits