Hub
Pricing About
WorkflowWorkflow

Reading Text Based PDFs with Tika Parser

Text processingTika parserOcr
abembenek profile image
Draft Latest edits on 
Sep 19, 2024 7:22 PM
Drag & drop
Like
Download workflow
Workflow preview

This workflow was a solution for Just Knime It Challenge 37 - Text Deduplication.

It is an example of how to use the Tika Parser node with a PDF that has text characters that can be recognized with just the Tika Parser alone (versus needing to use the Tess4J node).

Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 5.3.1
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.3.1

    knime
  • Go to item
    KNIME TextprocessingTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.3.1

    knime

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits