Hub
Pricing About
NodeNode / Manipulator

Tika Parser URL Input

Other Data TypesText ProcessingMiscStreamable
Drag & drop
Like

This node has the same function as the Tika Parser node, which is to parse any documents that are supported by Tika. The difference is that this node takes file paths from a string column as input. The type of the files can be selected in the configuration dialog. Users have the choice between selecting the file extensions, or the MIME-types. What kind of information that are to be extracted from the file (metadata and content) can also be selected in the dialog. If possible, user can also extract files that are embedded in the input files, such as attachments in E-mails, etc, and store them in a specified directory. Authentication setting is also provided to parse any encrypted files.

Node details

Input ports
  1. Type: Table
    Table containing the filepaths
    The input table containing the URLs or paths to files that are to be parsed. The input table has to contain at least one String column.
Output ports
  1. Type: Table
    Metadata output table
    An output table containing the parsed document data. The columns are the same as what was selected in the Metadata list in the configure dialog.
  2. Type: Table
    Attachment output table
    An output table containing the names of input files that contain any embedded files and also the paths to the extracted files in the output directory.

Extension

The Tika Parser URL Input node is part of this extension:

  1. Go to item

Related workflows & nodes

  1. Go to item
  2. Go to item
  3. Go to item

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits