This node analyzes a list of relations between nodes for unconnected network components. It expects an input table with two String columns containing (named) nodes of a network. A row represents a connection between the two nodes. The output is a two-column table stating each node and its cluster ID. All nodes with the same cluster ID are transitively connected to each other but to no other node of any other cluster.
Cluster numbering starts at 1 and gaps are avoided, meaning that the maximum cluster ID represents the total number of unconnected network components. The output table is sorted according to ascending cluster IDs. The order of the clusters is unspecified. Note that cluster 1 needs not to be the biggest cluster. While this functionality can also be implemented via the Network To Row node and its 'Split-up unconnected components' option, our implementation is tuned for performance and large networks. Thus, it does not operate on KNIME's network data type but on an edge definition table with String-typed node columns directly. Examples for this node's applicability are:● In production, new products can be assigned to facilities at minimal footprint complexity by keeping distinct material clusters in distinct entities.
● In logistics, hazardous goods can be analyzed for the ability to ship in one delivery.
● In human relations, an organizational chart analysis can reveal data quality issues with employees whose reporting lines do not end at the CEO.