NodeStanfordNLP NE Learner


The StanfordNLP NE Learner creates a conditional random field model based on documents and a dictionary with entities that occur in the documents. The chosen tag and the used dictionary will be saved internally, so they can be used by the StanfordNLP NE tagger to tag new documents and validate the model. If you want to use the model externally, the model file can be found at your workflow directory:

/%KNIMEWORKSPACE%/%WORKFLOW%/StanfordNLP NE Learner(##)/port_1/object/

You can select the document column and the dictionary column to train your model with. It is possible to use multi-term entities within the dictionary. There is also a tab in the dialog to specify the learner properties. Currently, there are only a few options, since the number of parameters is pretty huge. So please contact us, if there are important/highly used parameters, we should integrate!

NOTE: If you are interested in the StanfordNLP toolkit, please visit Some of the following property descriptions are taken from the NERFeatureFactory class from StanfordNLP. Please look into it for further information.

Input Ports

  1. Port Type: Data
    The input table containing the documents to train the model with.
  2. Port Type: Data
    The input dictionary containing known single- and/or multi-term entities to train the model.

Output Ports

  1. Port Type: StanfordNERModelPortObject
    The StanfordNLP NE model.