Apache Tika integration

This workflow shows how to parse files of various formats, together with any attachments they contain, using the Tika parser nodes, and how to detect the language of the extracted content using the Tika language detector. Based on the detected language, a filter keeps only the English texts, which are then POS-tagged.
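Outside the workflow editor, the same sequence of steps (parse, flatten attachments, detect language, keep English) can be illustrated in a few lines of Python. This is only a minimal sketch under stated assumptions: `ParsedDoc`, `flatten`, `keep_english`, and `toy_detect` are hypothetical names invented for this example, and the stopword-counting detector is a stand-in, not Tika's language detector. Parsing and POS tagging are left out, so the documents are built by hand.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterable, Iterator, List


@dataclass
class ParsedDoc:
    """Hypothetical container for one parsed file and its attachments."""
    path: str
    content: str
    attachments: List["ParsedDoc"] = field(default_factory=list)


def flatten(doc: ParsedDoc) -> Iterator[ParsedDoc]:
    """Yield a document, then all of its nested attachments, depth first."""
    yield doc
    for attachment in doc.attachments:
        yield from flatten(attachment)


def keep_english(docs: Iterable[ParsedDoc],
                 detect: Callable[[str], str]) -> List[ParsedDoc]:
    """Keep every document or attachment whose detected language is 'en'."""
    return [d for doc in docs for d in flatten(doc) if detect(d.content) == "en"]


# Toy stand-in detector (an assumption, NOT Tika): counts a few stopwords.
EN_WORDS = {"the", "and", "is", "of", "to", "in"}
DE_WORDS = {"der", "die", "und", "ist", "das", "nicht"}


def toy_detect(text: str) -> str:
    words = text.lower().split()
    en = sum(w in EN_WORDS for w in words)
    de = sum(w in DE_WORDS for w in words)
    if en == de:
        return "unknown"
    return "en" if en > de else "de"


# Hand-built example: an English mail with one German and one English attachment.
mail = ParsedDoc(
    path="mail.eml",
    content="The report is attached to the mail",
    attachments=[
        ParsedDoc("bericht.txt", "Der Bericht ist nicht fertig und das ist schade"),
        ParsedDoc("notes.txt", "Summary of the meeting in the office"),
    ],
)

english_docs = keep_english([mail], toy_detect)
print([d.path for d in english_docs])
```

Passing the detector in as a function keeps the filtering logic independent of the detection backend, so a real detector could be swapped in for `toy_detect` without changing `keep_english`.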

License CC-BY-4.0