NodeDocument Vector Hashing

Manipulator

This node creates a document vector for each document representing it in the terms space. The values of the feature vectors can be specified as boolean values or as values of either the relative frequency or the absolute frequency of the terms. The advantages of using this node instead of the normal document vector node is that the dimension of the vectors is always fixed and therefore this node is streamable.

Input Ports

  1. Port Type: Data
    The input table containing the documents.

Output Ports

  1. Port Type: Data
    An output table containing the input documents with the corresponding document vectors.
  2. Port Type: VectorHashingPortObject
    The model output containing the specifications that have been used for document vector creation.