Document Vector

Manipulator

This node creates a document vector for each document representing it in the terms space. The values of the feature vectors can be specified as boolean values or as values of a specified column i.e. an tf*idf column. The dimension of the vectors will be the number of distinct terms in the BoW.

Input Ports

  1. Type: Data The input table containing the bag of words.

Output Ports

  1. Type: Data An output table containing the documents with the related document vectors.
  2. Type: DocumentVectorPortObject A model containing node settings as well as column names of the term feature space.

Find here

Other Data Types > Text Processing > Transformation

Make sure to have this extension installed: