Hub
Pricing About
ComponentComponent

Keyword Search

knime profile image
Versionv1.0Latest, created on 
Oct 20, 2023 1:30 PM
Drag & drop
Like
Use or download
This component extracts the most relevant English keywords in a corpus (a collection of documents) using three specific techniques: - Topic Extraction using LDA: this technique collects a set of keywords for each different topic which clusters documents in different groups. - Term Co-Occurrence: this other technique finds pair of keywords which appear together often in different documents. - Max(TF-IDF) measure: a ranking which measures the importance of terms throughout the corpus. This component takes as input a column of Document type (from String to Document node) and it then identifies keywords in the corpus according to the hyper-parameters defined in configuration dialogue. The collected keywords are then provided in three tables at the output, one of each of the three techniques above. The component by default is applying basic text pre-processing (e.g. stopwords and symbols removal) based on the English language. This pre-processing can be deactivated via the dialogue and performed outside of the component when working with other or multiple languages.

Component details

Input ports
  1. Type: Table
    String input of Columns
    This component requires input of text columns in String format.
Output ports
  1. Type: Table
    LDA Terms
    Output of nouns, adjectives and verbs along with weights defined by LDA in a olumn.
  2. Type: Table
    Term Co-Occurrence count
    Output of nouns, adjectives and verbs along with counts of terms occurring in corpus.
  3. Type: Table
    TF-IDF
    Table output of terms with highest TF-IDF between all documents.

Used extensions & nodes

Created with KNIME Analytics Platform version 4.4.0
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.4.0

    knime profile image
    knime
  • Go to item
    KNIME JavasnippetTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.4.0

    knime profile image
    knime
  • Go to item
    KNIME Math Expression (JEP)Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.4.0

    knime profile image
    knime
  • Go to item
    KNIME Quick FormsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.4.0

    knime profile image
    knime
  • Go to item
    KNIME TextprocessingTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.4.0

    knime profile image
    knime

This component does not have nodes, extensions, nested components and related workflows

Legal

By using or downloading the component, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits