Kullback–Leibler divergence

This component computes the Kullback-Leibler divergence to output a measure of dissimilarity between two distribution of the same variable, which come from two different datasets. It could be very useful to identify shift inside data which can potentially lead to a model drift, downgrading the predictive power. Column names and types must be identical between the two tables. Continous predictors are binned into classes to make computation of the metric easier. A value close to 0 means that the two variables have the same distribution. As the value increases, the dissimilarity is higher. Columns must have the same name and the same data type.

Component details

Ports Options Views

Input ports

Type: Table
Table 1
Input table 1
Type: Table
Table 2
Input table 2

Output ports

Type: Table
Kullback-Leibler measure
Kullback-Leibler measure for each input column

Legal

By using or downloading the component, you agree to our terms and conditions.

Component details

Input ports

Output ports

KNIME Base nodes

KNIME JavaScript Views

KNIME Javasnippet

KNIME Math Expression (JEP)

Legal