This nodes creates a document corpus which contains the occurrence count of each given token in regard to every document in the corpus. This node is intended to be connected to an “NGramExtractor” node (this means, a “token” can be a word, word-n-gram, or token-n-gram).
- Type: Data Input table with a collection column which contains each document’s terms.
- Type: Data A table with a row for each token and its corresponding document count (i.e. the number of input documents which contain the given term)
Community Nodes > Palladian
Make sure to have this extension installed: