Reading Text Based PDFs with Tika Parser

Workflow

Reading Text Based PDFs with Tika Parser

Draft Latest edits on

This workflow was a solution for Just Knime It Challenge 37 - Text Deduplication.

It is an example of how to use the Tika Parser node with a PDF that has text characters that can be recognized with just the Tika Parser alone (versus needing to use the Tess4J node).

Loading deploymentsLoading ad hoc jobs

Legal

By using or downloading the workflow, you agree to our terms and conditions.

Reading Text Based PDFs with Tika Parser

KNIME Base nodes

KNIME Textprocessing

Legal

Reading Text Based PDFs with Tika Parser

Used extensions & nodes

KNIME Base nodesTrusted extension

KNIME TextprocessingTrusted extension

Legal

KNIME Base nodes

KNIME Textprocessing