Hub
  • Software
  • Blog
  • Forum
  • Events
  • Documentation
  • About KNIME
  • KNIME Hub
  • Nodes
  • Linear Correlation
NodeNode / Other

Linear Correlation

Analytics Statistics
Drag & drop
Like
Copy short link

Calculates for each pair of selected columns a correlation coefficient, i.e. a measure of the correlation of the two variables.

Which correlation measure is applied depends on the types of the underlying variables:
numeric <-> numeric : Pearson's product-moment coefficient . Missing values in a column are ignored in such a way that for the computation of the correlation between two columns only complete records are taken into account. For instance, if there are three columns A, B and C and a row contains a missing value in column A but not in B and C, then the row will be ignored for computing the correlation between (A, B) and (A, C). It will not be ignored for the correlation between (B, C). This corresponds to the function cor(<data.frame>, use="pairwise.complete.obs") in the R statistics package.
The value of this measure ranges from -1 (strong negative correlation) to 1 (strong positive correlation). A value of 0 represents no linear correlation (the columns might still be highly dependent on each other, though).
The p-value for these columns indicates the probability of an uncorrelated system producing a correlation at least as extreme, if the mean of the correlation is zero and it follows a t-distribution with df degrees of freedom.
nominal <-> nominal : Pearson's chi square test on the contingency table . This value is then normalized to a range [0,1] using Cramer's V , whereby 0 represents no correlation and 1 a strong correlation. Missing values in nominal columns are treated such as they were a self-contained possible value. If one of the two columns contains more possible values than specified in the dialog (default 50), the correlation will not be computed.
The p-value for these columns indicates the probability of independent variables showing as extreme level of dependence. The value is the same as for a chi-square test of independence of variables in a contingency table.
Correlation measures for other pairs of columns are not available, they are represented by missing values in the output table and crosses in the accompanying view.

Node details

Input ports
  1. Type: Table
    Numeric input data
    Numeric input data to evaluate
Output ports
  1. Type: Table
    Correlation measure
    Correlation variables, p-values and degrees of freedom.
  2. Type: Table
    Correlation matrix
    Correlation variables in a matrix representation.
  3. Type: Correlation
    Correlation model
    A model containing the correlation measures. This model is appropriate to be read by the Correlation Filter node.

Extension

The Linear Correlation node is part of this extension:

  1. Go to item

Related workflows & nodes

  1. Go to item
    Linear correlation
    There has been no description set for this workflow's metadata.
    mlauber71 > Public > forum > kn_forum_correlation
  2. Go to item
    Categorical Autocorrelation
    Time series Categorical Correlation
    +1
    Uses the Linear Correlation node on a lagged categorical column to create a plot of Crame…
    corey > Public > Categorical Autocorrelation
  3. Go to item
    Linear Regression Weighting
    livinc > Public > Linear Regression Weighting
  4. Go to item
    [Community Nodes] CN-133 Bundle Generator Node
    The Bundle Generator node aggregates together one or more new Bundled Products from a sel…
    scientificstrategy > Public > Community Nodes > CN-133 Bundle Generator Node v05
  5. Go to item
    [Case Studies] CS-133 SUV Market 2013 Product Tuning
    SUV Product Tuning improves the Feature Tuning and allows the real-world Market to be con…
    scientificstrategy > Public > Case Studies > CS-133 SUV Market 2013 Product Tuning v05
  6. Go to item
    [Case Studies] CS-124 Cola Market 2015 Product Tuning
    Product-level Tuning of Cola Products improves the Feature-level Tuning and allows real-w…
    scientificstrategy > Public > Case Studies > CS-124 Cola Market 2015 Product Tuning v05
  7. Go to item
    Gruppenprojekt
    niklas_u > Public > Gruppenprojekt
  8. Go to item
    Test2
    niklas_u > Public > Test2
  9. Go to item
    customer_data_blended_standard
    lennart_h > Public > customer_data_blended_standard
  10. Go to item
    knime_challenge-21
    Justknimeit-21
    justknimeit-21
    jefleisc > Public > jefleisc-knime_challenge-21

No known nodes available

KNIME
Open for Innovation

KNIME AG
Hardturmstrasse 66
8005 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • E-Learning course
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • KNIME Open Source Story
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more on KNIME Server
© 2022 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Credits