Document Vector

Manipulator

This node creates a document vector for each document representing it in the terms space. The values of the feature vectors can be specified as boolean values or as values of a specified column i.e. an tf*idf column. The dimension of the vectors will be the number of distinct terms in the BoW.

Input Ports

  1. Type: Data
    The input table containing the bag of words.

Output Ports

  1. Type: Data
    An output table containing the documents with the related document vectors.
  2. Type: DocumentVectorPortObject
    A model containing node settings as well as column names of the term feature space.

Extension

This node is part of the extension

KNIME Textprocessing

v4.0.0

Short Link

Drag node into KNIME Analytics Platform