Document Similarity Learner

Component

Versionv1.0Latest, created on

The Document Similarity Learner develops a model for identifying a new documents most similar matches from an existing corpus of documents. It consumes already processed documents (refer to Document Preprocessing Component) as input and provides as output both the corpus of documents and a model for use with the Document Similarity Predictor Component.

Component details

Ports Options Views

Input ports

Type: Table
Preprocessed Documents
Documents which have already been preprocessed (via Document Preprocessing).

Output ports

Type: Table
Corpus of Documents
The reference corpus of documents for future comparison with new documents.
Type: DocumentVectorPortObject
Document Vector Model
Model for creating document vectors on new documents in the appropriate, compatible format.

Legal

By using or downloading the component, you agree to our terms and conditions.

Document Similarity Learner

Component details

Input ports

Output ports

KNIME Base nodes

KNIME Quick Forms

KNIME Textprocessing

Legal

Document Similarity Learner

Component details

Input ports

Output ports

Used extensions & nodes

KNIME Base nodesTrusted extension

KNIME Quick FormsTrusted extension

KNIME TextprocessingTrusted extension

Legal

KNIME Base nodes

KNIME Quick Forms

KNIME Textprocessing