Node / Manipulator

Unique Term Extractor

Other Data Types Text Processing Transformation

This node creates a global set of unique terms across all input documents. Optionally, the set can be restricted to the top k terms ranked by frequency. Three frequency measures are available for this filter: the term frequency, the document frequency, and the inverse document frequency.

  • Term Frequency (TF): Overall count of a term in all documents.
  • Document Frequency (DF): Number of documents in which a term occurs.
  • Inverse Document Frequency (IDF): The logarithm of the total number of documents divided by the DF.
More information about term frequencies can be found here.
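
The three measures defined above can be illustrated with a minimal Python sketch. It is not the node's implementation. It assumes that each document is already a list of tokens and that the logarithm is the natural logarithm (the description does not state the base). The function name `term_frequencies` and the sample documents are hypothetical.

```python
import math
from collections import Counter


def term_frequencies(documents):
    """Return {term: (tf, df, idf)} for a list of tokenized documents.

    Assumptions (not confirmed by the node description):
    - documents are pre-tokenized lists of strings
    - IDF uses the natural logarithm
    """
    tf = Counter()  # overall count of each term across all documents
    df = Counter()  # number of documents containing each term
    for tokens in documents:
        tf.update(tokens)       # every occurrence counts toward TF
        df.update(set(tokens))  # each document counts at most once toward DF
    n_docs = len(documents)
    return {
        term: (tf[term], df[term], math.log(n_docs / df[term]))
        for term in tf
    }


# Hypothetical sample corpus of three tokenized documents.
docs = [
    ["data", "mining", "data"],
    ["text", "mining"],
    ["data", "science"],
]
stats = term_frequencies(docs)
```

Here "data" occurs three times in total but in only two documents, so its TF is 3 and its DF is 2. A term that appears in every document would get an IDF of 0 under this definition.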

Node details

Input ports
  1. Type: Table
    Documents input table
    The input table containing the documents.
Output ports
  1. Type: Table
    Terms output table
    An output table containing a unique term column, frequency columns and an index column.
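
As a rough illustration of the optional top-k filter and the shape of the output table, the following Python sketch ranks terms by one chosen frequency and attaches an index. It is only a sketch under assumptions: the zero-based index, the alphabetical tie-breaking, the `descending` flag, and the function name `top_k_terms` are all hypothetical, and the node's actual column names and ordering may differ.

```python
def top_k_terms(freqs, k, descending=True):
    """Return rows (index, term, frequency) for the k highest-ranked terms.

    freqs: dict mapping each unique term to one frequency value
           (for example TF, DF, or IDF).
    Assumptions: index starts at 0; ties are broken alphabetically.
    """
    ranked = sorted(
        freqs.items(),
        key=lambda item: (-item[1] if descending else item[1], item[0]),
    )
    return [(i, term, value) for i, (term, value) in enumerate(ranked[:k])]


# Hypothetical term frequencies for four unique terms.
tf = {"data": 3, "mining": 2, "text": 1, "science": 1}
rows = top_k_terms(tf, k=2)
rare = top_k_terms(tf, k=2, descending=False)
```

With these sample values, `rows` keeps the two most frequent terms ("data" and "mining"), while `rare` keeps the two least frequent ones.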

© 2023 KNIME AG. All rights reserved.