Hub
  • Software
  • Blog
  • Forum
  • Events
  • Documentation
  • About KNIME
  • KNIME Hub
  • Nodes
  • Correlation Filter
NodeNode / Manipulator

Correlation Filter

Analytics Mining Feature Selection
Drag & drop
Like
Copy short link

This node uses the model as generated by a Correlation node to determine which columns are redundant (i.e. correlated) and filters them out. The output table will contain the reduced set of columns.

The filtering step works roughly as follows: For each column in the correlation model the count of correlated columns is determined given a threshold value for the correlation coefficient (specified in the dialog). The column with the most correlated columns is chosen to "survive" and all correlated columns are filtered out. This procedure is repeated until no more columns can be identified. The problem of finding a minimum set of columns to satisfy the constraints is difficult to solve analytically. This method applied here is known to be good approximation, however.

Node details

Input ports
  1. Type: Correlation
    Correlation Model
    The model from the correlation node.
  2. Type: Table
    Input data
    Numeric input data to filter. It must contain the set of columns that were used to create the correlation model. (Typically you connect the input data from the correlation node here.)
Output ports
  1. Type: Table
    Filtered data from input
    Filtered data from input.

Extension

The Correlation Filter node is part of this extension:

  1. Go to item

Related workflows & nodes

  1. Go to item
    Week 5 DIM Reduct
    atad_ecneics_85 > Public > Week 5 DIM Reduct
  2. Go to item
    Gruppenprojekt
    niklas_u > Public > Gruppenprojekt
  3. Go to item
    Test2
    niklas_u > Public > Test2
  4. Go to item
    customer_data_blended_standard
    lennart_h > Public > customer_data_blended_standard
  5. Go to item
    Linear Regression to calculate Price of a Lego set
    Linear regression Machine learning Data science
    +2
    The workflow trains a Linear Regression model to predict the price of a Lego set based on…
    swantikag > Public > LinearRegression-LegoSet > LinearRegressionModel > LinearRegression-LegoSet
  6. Go to item
    random_forest_test(3)
    ferryas > Public > random_forest_test(3)
  7. Go to item
    Dimensionality Reduction
    Dimensionality reduction Data manipulation Preprocessing
    +3
    Introduction to Machine Learning Algorithms course - Session 4 Solution to exercise 4 App…
    hayasaka > L4-ML-2Hrs-2021-07 > Solutions > 07_Dimensionality_Reduction_solution
  8. Go to item
    DDS_Tutorial_01
    eltings26 > Public > DDS_Tutorial_01
  9. Go to item
    Dimensionality Reduction
    Dimensionality reduction Data manipulation Preprocessing
    +3
    Introduction to Machine Learning Algorithms course - Session 4 Solution to exercise 4 App…
    knime > Education > Courses > L4-ML Introduction to Machine Learning Algorithms > Session_4 > 02_Solutions > 04_Dimensionality_Reduction_solution
  10. Go to item
    HCC_Survival.knwf
    This workflow reads in HCC.csv file and trains random forest, K-nearest and decision tree…
    tpankaj92 > Public > HCC_Survival.knwf
  1. Go to item
  2. Go to item
  3. Go to item
  4. Go to item
  5. Go to item
  6. Go to item

KNIME
Open for Innovation

KNIME AG
Hardturmstrasse 66
8005 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • E-Learning course
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • KNIME Open Source Story
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more on KNIME Server
© 2022 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Credits