Word Parser

Other Data Types > Text Processing > IO

This node reads Word documents (.doc, .docx, .docm) and creates one document per file. The text is extracted from the Word file using the Apache POI library (see http://poi.apache.org/ for details). Paragraphs are taken into account. Meta information is not read. The first sentence is used as the document title.
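
To illustrate the behaviour described above (paragraph-wise text extraction, with the first sentence used as the title), here is a minimal sketch in Python. It is not the node's implementation: the node uses Apache POI, whereas this sketch reads the XML inside a .docx archive directly using only the standard library, so it does not cover the binary .doc format. The function names `extract_paragraphs`, `parse_document` and `make_minimal_docx` are hypothetical, and the sentence split is a naive assumption; the node's actual sentence detection may differ.

```python
import io
import re
import zipfile
import xml.etree.ElementTree as ET
from xml.sax.saxutils import escape

# WordprocessingML namespace used by the main document part of a .docx file.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def extract_paragraphs(docx_bytes):
    """Return the non-empty paragraph texts of a .docx file given as bytes."""
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # A paragraph's text is spread over one or more <w:t> elements.
        text = "".join(t.text or "" for t in para.iter(f"{{{W_NS}}}t")).strip()
        if text:
            paragraphs.append(text)
    return paragraphs


def parse_document(docx_bytes):
    """Build a title/text record; the title is the first sentence (naive split)."""
    paragraphs = extract_paragraphs(docx_bytes)
    body = "\n".join(paragraphs)
    # Assumption: a sentence ends at the first '.', '!' or '?' followed by
    # whitespace or the end of the text. If the first line has no such mark,
    # fall back to the whole first paragraph.
    match = re.match(r".+?[.!?](?=\s|$)", body)
    if match:
        title = match.group(0)
    else:
        title = paragraphs[0] if paragraphs else ""
    return {"title": title, "text": body}


def make_minimal_docx(paragraphs):
    """Hypothetical fixture: an archive holding only word/document.xml.

    Enough for the functions above, but not a complete Word file.
    """
    body = "".join(
        f"<w:p><w:r><w:t>{escape(p)}</w:t></w:r></w:p>" for p in paragraphs
    )
    xml = (
        '<?xml version="1.0" encoding="UTF-8"?>'
        f'<w:document xmlns:w="{W_NS}"><w:body>{body}</w:body></w:document>'
    )
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("word/document.xml", xml)
    return buffer.getvalue()
```

Calling `parse_document(make_minimal_docx(["Hello world. Second sentence.", "Another paragraph."]))` yields a record whose title is the first sentence and whose text keeps one line per paragraph. Document metadata (for example author or creation date) is deliberately ignored, mirroring the note that meta information is not read.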

Node details

Output ports
  1. Type: Table
    Documents output table
    An output table containing the parsed document data.
