CorpusCreator

Manipulator

This node creates a document corpus that records, for each given token, the number of documents in the corpus in which that token occurs. The node is intended to be connected to an “NGramExtractor” node, so a “token” can be a word, word-n-gram, or token-n-gram.

Input Ports

  1. Type: Data. Input table with a collection column that contains each document’s tokens.

Output Ports

  1. Type: Data. A table with a row for each token and its corresponding document count (i.e. the number of input documents that contain the given token).
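
The document count described above can be illustrated outside of the workflow with a short Python sketch. It is not the node's actual implementation; the function name `document_counts` and the sample data are assumptions made for illustration only. Each inner list stands for one row's collection cell, i.e. the tokens of one document.

```python
from collections import Counter


def document_counts(documents):
    """Return, for each token, the number of documents that contain it.

    `documents` is a list of token lists, one list per document
    (hypothetical stand-in for the input table's collection column).
    """
    counts = Counter()
    for tokens in documents:
        # A token repeated within one document is counted only once,
        # because we count documents, not occurrences.
        counts.update(set(tokens))
    return dict(counts)


# Made-up sample input: three documents with words and word-2-grams.
docs = [
    ["the", "cat", "the cat"],
    ["the", "dog", "the", "the dog"],
    ["a", "cat"],
]
counts = document_counts(docs)
```

Note that “the” appears twice in the second document but contributes only once to its count, which matches the output port's definition of a document count rather than a total occurrence count.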

Find here

Community Nodes > Palladian

Make sure to have this extension installed: