The component calculates the correlation between two or multiple variables (continuous or categorical features) and displays the correlation measures and matrices accordingly.
For the continuous variables a scatter plot can be displayed for selected variables.
A correlation filter with threshold 0.7 removes correlated columns, joins the remaining columns with the nominal columns of the data set and provides the result at the output. The threshold can be changed in Correlation Filter node.
- Type: Flow VariablePort 1Flow variables named Default_Var_nominal and Default_Var_cont. They can be created by the component: Split features & create default flow variables.
- Type: TablePort 2Data table with categorical nominal value columns.
- Type: TablePort 3Data table with continuous value columns.