
02_Document Vector Creation

vincenzo
Draft. Latest edits on Oct 20, 2014 2:00 PM
This workflow transforms a collection of documents into numerical vectors. The dataset used in this example is the KNIME Forum Dataset. After the pre-processing phase, the relative term frequency of each term is computed inside the Transformation component. The input dataset is partitioned into a training set and a test set. A Document Vector node uses the term frequencies from the training set to build a vector representation over the distinct terms identified by the bag of words (BoW). The same Document Vector transformation is then applied to the documents in the test set.
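Outside of KNIME, the idea described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the implementation of the Document Vector node: the function names (`relative_tf`, `fit_vocabulary`, `to_vector`) and the toy documents are hypothetical, and relative term frequency is assumed here to mean a term's count divided by the number of terms in the document. The point it shows is that the vector layout is fixed by the training set and then reused unchanged for the test set.

```python
from collections import Counter


def relative_tf(tokens):
    """Relative term frequency: a term's count divided by the document length."""
    counts = Counter(tokens)
    total = len(tokens)
    return {term: n / total for term, n in counts.items()} if total else {}


def fit_vocabulary(train_docs):
    """Collect the distinct terms of the training set in a fixed, sorted order."""
    return sorted({term for doc in train_docs for term in doc})


def to_vector(tokens, vocabulary):
    """Map one document onto the training vocabulary.

    Terms that never occurred in the training set have no column and are dropped.
    """
    tf = relative_tf(tokens)
    return [tf.get(term, 0.0) for term in vocabulary]


# Toy, already pre-processed documents (hypothetical, not the KNIME Forum Dataset).
train_docs = [["node", "error", "node"], ["workflow", "error"]]
test_docs = [["node", "loop", "workflow", "workflow"]]

# The vocabulary (vector columns) comes from the training set only.
vocabulary = fit_vocabulary(train_docs)
train_vectors = [to_vector(doc, vocabulary) for doc in train_docs]

# The same transformation is applied to the test set, with the same columns.
test_vectors = [to_vector(doc, vocabulary) for doc in test_docs]

print(vocabulary)
print(train_vectors)
print(test_vectors)
```

In this sketch the test-only term `loop` does not get a column, so training and test vectors keep the same length and column order, which is what allows a model trained on one to be applied to the other.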

External resources

  • www.knime.com/knimepress/from-words-to-wisdom

Used extensions & nodes

Created with KNIME Analytics Platform version 4.5.0
  • KNIME Base nodes (trusted extension), KNIME AG, Zurich, Switzerland, version 4.5.0
  • KNIME Textprocessing (trusted extension), KNIME AG, Zurich, Switzerland, version 4.5.0
