This node allows you to read PDF documents and create a document for each file. The documents title and authors will be extracted form the PDFs meta data. The full text of the PDF is extracted, the structure of the PDF is not taken into account. For text extraction the PDFBox library is used. (see http://pdfbox.apache.org/ for details).
- Type: TableDocuments output tableAn output table containing the parsed document data.