This component finds, in one pass, the numeric columns of a dataset that are likely to be categorical. Anonymized data often contains a large number of numeric columns, some of which are in fact nominal. In this component, you specify the maximum number of distinct values allowed for a numeric column. If a column's number of distinct values is less than or equal to that maximum, the column is converted to a string column; otherwise it is left unchanged. The component has two outputs: the candidate categorical columns and the rest of the dataframe.
The component uses a 'Python Script' node to perform this function and requires the 'pandas' library.
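The rule described above can be sketched in pandas roughly as follows. This is only an illustration, not the script shipped inside the component: the function name `split_categorical`, the parameter `max_distinct`, and the sample data are all hypothetical, and the sketch assumes that missing values are not counted as a distinct value.

```python
import pandas as pd


def split_categorical(df: pd.DataFrame, max_distinct: int):
    """Split df into (candidate categorical columns, remaining columns).

    Hypothetical helper: a numeric column is treated as categorical when its
    number of distinct non-missing values is <= max_distinct.
    """
    # Only numeric columns are examined; string columns pass through untouched.
    numeric_cols = df.select_dtypes(include="number").columns

    # nunique() ignores NaN by default (an assumption about the desired rule).
    candidates = [c for c in numeric_cols if df[c].nunique() <= max_distinct]

    # Convert the candidates to strings; note that NaN would become "nan".
    categorical = df[candidates].astype(str)
    rest = df.drop(columns=candidates)
    return categorical, rest


# Small made-up table: "a" has 2 distinct values, "b" has 4, "c" is text.
sample = pd.DataFrame(
    {
        "a": [1, 2, 1, 2],
        "b": [0.1, 0.2, 0.3, 0.4],
        "c": ["x", "y", "z", "w"],
    }
)
categorical, rest = split_categorical(sample, max_distinct=2)
print(list(categorical.columns))
print(list(rest.columns))
```

With a threshold of 2, only column `a` qualifies, so it lands in the first output as strings while `b` and `c` stay in the second. Choosing the threshold is a judgment call: a low value misses nominal columns with many levels, while a high value may catch genuinely continuous columns that happen to have few rows.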
- Port 1 (Type: Table): KNIME dataset, for example from a CSV Reader node.