Spark Linear Correlation


This node computes the correlation coefficient for two selected input columns using the MLlib Statistics package.

Input Ports

  1. Type: Spark Data Spark DataFrame/RDD to compute correlation coefficient for.

Output Ports

  1. Type: Data KNIME data table with the correlation coefficient of the two columns.

Find here

Tools & Services > Apache Spark > Statistics

Make sure to have this extension installed:

KNIME Extension for Apache Spark

Update site for KNIME Analytics Platform 3.7:
KNIME Analytics Platform 3.7 Update Site

How to install extensions