This workflow applies the Topic Extractor (Parallel LDA) node to detect 10 topics and describe each of them with 5 keywords. Latent Dirichlet Allocation (LDA) is an unsupervised, generative probabilistic model that finds the top n topics in a collection of documents and describes each topic by its m most relevant keywords. It represents documents as random mixtures over latent topics, where each topic is characterized by a distribution over words (Blei, Ng and Jordan, 2003). In KNIME Analytics Platform, LDA is available through the Topic Extractor (Parallel LDA) node of the Text Processing extension.
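To make the idea of "n topics, each described by m keywords" concrete outside of KNIME, here is a minimal sketch of LDA in plain Python using collapsed Gibbs sampling. It is not the implementation behind the Topic Extractor (Parallel LDA) node. The function names `lda_gibbs` and `top_keywords`, the toy documents, and the values chosen for `alpha`, `beta`, and `n_iter` are all assumptions made for illustration. The toy corpus is tiny, so the sketch asks for 2 topics with 3 keywords each rather than the workflow's 10 topics with 5 keywords.

```python
import random


def lda_gibbs(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA to tokenized documents with collapsed Gibbs sampling (illustrative)."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)

    # Count tables: topics per document, words per topic, tokens per topic.
    doc_topic = [[0] * n_topics for _ in docs]
    topic_word = [[0] * n_words for _ in range(n_topics)]
    topic_total = [0] * n_topics

    # Start from a random topic assignment for every token.
    assignments = []
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            z = rng.randrange(n_topics)
            z_doc.append(z)
            doc_topic[d][z] += 1
            topic_word[z][word_id[w]] += 1
            topic_total[z] += 1
        assignments.append(z_doc)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                # Remove the token's current assignment from the counts.
                z = assignments[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][wid] -= 1
                topic_total[z] -= 1
                # Resample: (topic share in this document) x (word share in topic).
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][wid] + beta)
                    / (topic_total[k] + n_words * beta)
                    for k in range(n_topics)
                ]
                z = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][wid] += 1
                topic_total[z] += 1

    return vocab, doc_topic, topic_word


def top_keywords(vocab, topic_word, m):
    """Return the m highest-count words for each topic."""
    result = []
    for row in topic_word:
        best = sorted(range(len(vocab)), key=lambda i: -row[i])[:m]
        result.append([vocab[i] for i in best])
    return result


# Hypothetical toy corpus, already tokenized and lowercased.
docs = [
    "cat dog pet food cat vet".split(),
    "dog pet vet cat food dog".split(),
    "stock market price trade stock bank".split(),
    "bank price market trade stock price".split(),
]

vocab, doc_topic, topic_word = lda_gibbs(docs, n_topics=2)
keywords = top_keywords(vocab, topic_word, m=3)
for k, words in enumerate(keywords):
    print(f"topic {k}: {', '.join(words)}")
```

Each row of `doc_topic` counts how many tokens of a document were assigned to each topic, which corresponds to the "mixture over latent topics" view of a document, while each row of `topic_word` holds the word counts that approximate a topic's distribution over words. Setting `n_topics=10` and `m=5` mirrors the workflow's configuration, given a corpus large enough to support that many topics.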
Workflow
Topic Detection Analysis Training
Used extensions & nodes
Created with KNIME Analytics Platform version 4.7.0