After creating the bag of words (BoW), we can compute frequencies and other important measures based on terms, characters, co-occurrences, etc.
The nodes applied are TF, IDF, ICF, Term Document Entropy, DF, Term Co-Occurrence Counter, and NGram Creator. Moreover, the workflow shows how to compute the tf*idf measure and demonstrates an application of the computed frequencies and measures.
All nodes preceding the computation of the frequencies and other measures have been encapsulated in components to make the workflow more readable.
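
For readers who want to see what the tf*idf measure mentioned above amounts to outside of KNIME, here is a minimal Python sketch. It assumes relative term frequency (term count divided by document length) and the plain logarithmic IDF, log10(N / df); the KNIME TF and IDF nodes offer their own configuration options, so the values they produce may differ. The function name `tf_idf` and the sample documents are made up for illustration.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: tf*idf} dict per document.

    docs is a list of token lists. Assumed definitions (not taken from KNIME):
    tf = term count / document length, idf = log10(N / document frequency).
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    scores = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        scores.append({
            term: (count / total) * math.log10(n_docs / df[term])
            for term, count in counts.items()
        })
    return scores


# Hypothetical, already tokenized documents.
docs = [
    ["text", "mining", "with", "knime"],
    ["text", "processing", "nodes"],
    ["frequency", "measures", "for", "text"],
]
scores = tf_idf(docs)
```

Under these definitions, a term that occurs in every document (here "text") gets an IDF of zero, so its tf*idf score is zero regardless of how often it appears.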
Workflow
01_Frequencies and other Measures Computation
Created with KNIME Analytics Platform version 4.5.0