Level: Hard
Description: You work for a travel agency and want to better understand how hotels are reviewed online. Which topics are common across the reviews as a whole, and which terms are most relevant to each topic? How do the topics change when you separate the reviews by rating? A colleague has already crawled and preprocessed the reviews, so your job is to identify the relevant topics and explore their key terms. What do the reviews uncover?
Hint: Topic Extraction can be very helpful in tackling this challenge.
Hint 2: Coherence and perplexity are metrics that can help you pick a meaningful number of topics.
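To make the coherence hint more concrete, here is a minimal Python sketch of the UMass coherence score described by Mimno et al. (2011), which rewards topics whose top terms tend to appear in the same documents. It is an illustration only and is not how the KNIME workflow computes its scores. The function name `umass_coherence`, the toy `reviews` corpus, and the two example term lists are all hypothetical, and skipping terms that never occur in the corpus is an assumption made here to avoid dividing by zero.

```python
import math


def umass_coherence(top_terms, documents):
    """UMass coherence for one topic (after Mimno et al., 2011).

    top_terms: the topic's terms, ordered from most to least probable.
    documents: list of token lists (already preprocessed).
    Scores are <= 0 on typical data; values closer to zero suggest
    that the top terms co-occur more often.
    """
    doc_sets = [set(doc) for doc in documents]

    def doc_freq(*terms):
        # Number of documents that contain every given term.
        return sum(1 for d in doc_sets if all(t in d for t in terms))

    score = 0.0
    for m in range(1, len(top_terms)):
        for l in range(m):
            d_l = doc_freq(top_terms[l])
            if d_l == 0:
                continue  # assumption: ignore terms absent from the corpus
            co = doc_freq(top_terms[m], top_terms[l])
            # The +1 smoothing keeps the logarithm defined when co == 0.
            score += math.log((co + 1) / d_l)
    return score


# Hypothetical, tiny stand-in for the preprocessed hotel reviews.
reviews = [
    ["room", "clean", "bed", "comfortable"],
    ["room", "bed", "small"],
    ["staff", "friendly", "helpful"],
    ["staff", "helpful", "room"],
    ["breakfast", "coffee", "staff"],
]

coherent_topic = ["room", "bed", "clean"]
mixed_topic = ["room", "coffee", "friendly"]

print(umass_coherence(coherent_topic, reviews))
print(umass_coherence(mixed_topic, reviews))
```

In practice you would average this score over all topics of a model, repeat for several candidate numbers of topics, and prefer a setting where the average coherence is high while the topics remain interpretable.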
Workflow
Challenge 20 - Topics in Hotel Reviews
External resources
- Verified Components project - knime.com
- Topic Scorer (Labs) - KNIME Community Hub
- Summarizing topical content with word frequency and exclusivity - Bischof and Airoldi (2012), Proceedings of the 29th International Conference on Machine Learning
- Optimizing semantic coherence in topic models - Mimno et al. (2011), Proceedings of the Conference on Empirical Methods in Natural Language Processing 2011
Used extensions & nodes
Created with KNIME Analytics Platform version 5.2.1