Hub
Pricing About
ComponentComponent

Text Preprocessing

sjporter profile image
Version1.0.0Latest, created on 
Dec 20, 2023 7:10 PM
Drag & drop
Like
Use or download
The Text Preprocessing component uses extremely fast regex-based text processing to remove specific types of characters from a String column and normalize the data as much as possible without over-processing. This component eliminates the need to convert text to a Document type in order to preprocess it. It also executes extremely quickly compared to various other approaches, promoting scalability. Each option in the configuration is processed independently of the others. There is an order of operations (which can be audited or edited by drilling into the component).

Component details

Input ports
  1. Type: Table
    Input Table
    A table of input data with at least one String column.
Output ports
  1. Type: Table
    Output Table
    A table in which the input table's selected column has been processed and replaced.

Used extensions & nodes

Created with KNIME Analytics Platform version 4.2.1
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.2.2

    knime
  • Go to item
    KNIME JavasnippetTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.2.0

    knime
  • Go to item
    KNIME Quick FormsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.2.1

    knime

This component does not have nodes, extensions, nested components and related workflows

Legal

By using or downloading the component, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits