Recurssively applies Hierarchical Clustering to a column of string data to correct typos. Recurssion is employed to bypass compute time of clustering very large tables by clustering small chunks, correcting names based on the most common spelling in the chunk, shuffling and repeating.
Used extensions & nodes
Created with KNIME Analytics Platform version 4.0.0 Note: Not all extensions may be displayed.
Loading ad hoc executions
By using or downloading the workflow, you agree to our terms and conditions.
Discussions are currently not available, please try again later.