Hub
Pricing About
WorkflowWorkflow

Typo Correction for large dataset

ClusteringHierarchical ClusteringRecurssive LoopSpellingTypo
corey profile image
Draft Latest edits on 
Apr 23, 2019 4:35 PM
Drag & drop
Like
Download workflow
Workflow preview
Recurssively applies Hierarchical Clustering to a column of string data to correct typos. Recurssion is employed to bypass compute time of clustering very large tables by clustering small chunks, correcting names based on the most common spelling in the chunk, shuffling and repeating.
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 4.0.0 Note: Not all extensions may be displayed.
  • Go to item
    KNIME CoreTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.0.0

    knime

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits