51 results
- Go to itemThis node assigns to each term of a document a part of speech (POS) tag. It is applicable for French, English, German, Spanish an…1
- Go to itemApache Tika is a library that is mainly used to detect document types and extract textual contents and metadata from various file…1
- Go to itemThis node recognizes biomedical named entities, such as genes, proteins or cells and assigns tags to the corresponding terms like…0
- Go to itemThis node recognizes biomedical named entities, such as genes, proteins or cells and assigns tags to the corresponding terms like…0
- Go to itemThis node removes all diacritical marks in the given documents. Diacritical marks are signs that are attached to a character, usu…0
- Go to itemFilters all terms of the input documents, which are contained in the dictionary provided by the second input port. As dictionary …0
- Go to itemReplaces complete terms contained in the input documents that match with specified dictionary terms with a corresponding specifie…0
- Go to itemReplaces terms contained in the input documents that match with specified dictionary terms by the corresponding specified value. …0
- Go to itemThis node recognizes named entities specified in a dictionary column and assigns a specified tag value and type. Optionally the r…0
- Go to itemThis node recognizes named entities specified in one or more dictionary columns and assigns a specified tag value and type. Optio…0
- Go to itemThis node recognizes named entities specified in a dictionary column and assigns a specified tag value and type. Optionally the r…0
- Go to itemThe Document Data Assigner adds meta information like authors, source, category, type and publication date to input documents. Wh…0
- Go to itemThis node creates a document vector for each document representing it in the terms space. The values of the feature vectors can b…0
- Go to itemThis node creates a document vector for each document representing it in the terms space. The values of the feature vectors will …0
- Go to itemThis node creates a document vector for each document representing it in the terms space. The values of the feature vectors can b…0
- Go to itemAll terms of the input documents will be hyphenated according to the algorithm of Liang (Liang's Algorithm), see (http://www.tug.…0
- Go to itemStems all terms conaitned in the input document, using the Kuhlen stemming algorithm. The Kuhle stemmer can be applied on English…0
- Go to itemRemoves all Markup Language Tags contained in the input columns. For string inputs the complete string will be filtered. For docu…0
- Go to itemExtracts the meta information key, value pairs of documents. For each key a column is created. The column contains the specific v…0