This example shows how to adopt the verified components Topic Extractor (STM) and the Topic Assigner (STM).
The main difference with using the Topic Extractor (Parallel LDA) node is that also the document metadata can be provided during training.
The component adopts the R library 'stm' and requires you to install conda to automatically install the R and the required libraries.
Find more info about the R library, the KNIME R Integration and the Verified Components documentation in the links below.
Workflow
Structural Topic Modelling (STM) via Verified Components
External resources
- Verified Components project - knime.com
- Verified Component project - knime.com
- Topic Extractor (STM) - KNIME Community Hub
- Topic Assigner (STM) - KNIME Community Hub
- Miniconda Download and Installation - Conda Docs
- R Installation Guide - KNIME Docs
- stm: R Package for Structural Topic Models - Roberts, Stewart and Tingley, Journal of Statistical Software (2019)
- An Introduction to the Structural Topic Model (STM)
- “PoliBlogs08” data set by Eisenstein and Xing 2010
Used extensions & nodes
Created with KNIME Analytics Platform version 4.7.7
Legal
By using or downloading the workflow, you agree to our terms and conditions.