Hub
Pricing About
WorkflowWorkflow

05 Bag of Words and Frequencies - Solution

Text miningText processingNLPNatural language processingTF
+3
D
Draft Latest edits on 
Jul 31, 2020 8:03 AM
Drag & drop
Like
Download workflow
Workflow preview
Solution to an L4-TP SELF-PACED COURSE exercise. Create a bag of words of a document. Calculate document frequencies (DF), term frequencies (TF), inverse document frequencies (IDF), and TF-IDF scores. CHECK YOUR ANSWERS: - The word "text" occurs in the agenda of the L4-TP course 4 times and therefore most often - 28 words occur in both agendas - The words with the highest TF-IDF scores are time (0.038), series (0.038), and text (0.025)

External resources

  • Bag of Words and Frequencies
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 4.7.0
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.7.0

    knime
  • Go to item
    KNIME Math Expression (JEP)Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.7.0

    knime
  • Go to item
    KNIME TextprocessingTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.7.0

    knime

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits