This workflow was a solution for Just Knime It Challenge 37 - Text Deduplication.
It is an example of how to use the Tika Parser node with a PDF that has text characters that can be recognized with just the Tika Parser alone (versus needing to use the Tess4J node).
Workflow
Reading Text Based PDFs with Tika Parser
Used extensions & nodes
Created with KNIME Analytics Platform version 5.3.1
Legal
By using or downloading the workflow, you agree to our terms and conditions.