Hub
Pricing About
WorkflowWorkflow

03_Streaming Document Vector Hashing Creation

BooksFrom Words To WisdomText MiningStreaming
vincenzo profile image
Draft Latest edits on 
Aug 31, 2016 5:25 PM
Drag & drop
Like
Download workflow
Workflow preview
Here we execute the workflow in a streming fashion. The aim of this workflow is to create a vector space with the collection of documents being analzsed, bz using the Document Vector Hashing node. The node creates document vectors with a fixed number of dimensions using various hashing methods. This workflow starts reading the data and converts the strings into documents, which are then preprocessed, i.e. filtered and stemmed; all in a streaming fashion. All the preprocessing steps take place in the Streaming Pre-processing component. Then a bag of word is created and finally the documents are transformed into numerical/binary document vectors with the Document vector hashin node. The all workflow is executed in a streaming fashion.

External resources

  • www.knime.com/knimepress/from-words-to-wisdom
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 4.5.0
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.5.0

    knime
  • Go to item
    KNIME TextprocessingTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.5.0

    knime

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits