This workflow shows you how to OCR a Foreign Language (Japanese, but this can be changed in the Python script) from PDFs which are text-based or image-based using Python and KNIME.
This workflow requires several installations via the terminal and the location of those installation locations must be entered into the component to run this workflow.
If you have any questions please post to the KNIME Forum and tag me using @victor_palacios
This was primarily created for Mac users who want to OCR, but Windows users will find instructions in the Python node.
Workflow
OCR Foreign Language PDFs with Python and KNIME
Used extensions & nodes
Created with KNIME Analytics Platform version 4.5.2
Legal
By using or downloading the workflow, you agree to our terms and conditions.