This node implements the CAIM binning (discretization) algorithm according to Kurgan and Cios (2004) URL:http://citeseer.ist.psu.edu/kurgan04caim.html. The binning (discretization) is performed with respect to a selected class column. CAIM creates all possible binning boundaries and chooses those that minimize the class interdependancy measure. To reduce the runtime, this implementation creates only those boundaries where the value and the class changes. The algorithm finds a minimum number of bins (guided by the number of possible class values) and labels them "Interval_X". Only columns compatible with double values are binned and the column's type of the output table is changed to "String".
- Type: Data The data table to bin (discretize).
- Type: Data The binned data table.
- Type: CAIM The model representing the binning. Contains the intervals for each bin of each column.
Manipulation > Column > Binning
Make sure to have this extension installed:
Update site for KNIME Analytics Platform 3.7:
KNIME Analytics Platform 3.7 Update Site