Document vector hashing (deprecated)


This node creates a document vector for each document representing it in the terms space. The values of the feature vectors can be specified as boolean values or as values of either the relative frequency or the absolute frequency of the terms. The advantages of using this node instead of the normal document vector node is that the dimension of the vectors is always fixed and therefore this node is streamable.

Input Ports

  1. Type: Data The input table containing the documents.

Output Ports

  1. Type: Data An output table containing the input documents with the corresponding document vectors.

Find here

Other Data Types > Text Processing > Transformation

Make sure to have this extension installed: