Hub
Pricing About
NodeNode / Manipulator

Dfi

Community NodesSeqAnData Mining
Drag & drop
Like

The Deferred Frequency Index (DFI) is a tool for string mining under frequency constraints, i.e., predicates that evaluate solely the frequency of a pattern occurrence in the data. The frequency of a pattern is defined as the number of distinct sequences in a database that contain the pattern at least once. Currently the implementation contains 3 different predicates and can easily be extended by user-defined frequency predicates. The frequencies are calculated during the construction of a suffix tree over all databases, which enables to limit the index construction to a problem-specific minimum referred to as the optimal monotonic hull.

(c) Copyright 2010 by David Weese and Marcel H. Schulz

Web Documentation for Dfi

Node details

Input ports
  1. Type: URI Object
    argument-0 [fq,fastq,fa,fasta,faa,ffn,fna,frn,embl,gbk,raw,sam]
    Database files in Fasta/Fastq or text format (lines are strings). [fq,fastq,fa,fasta,faa,ffn,fna,frn,embl,gbk,raw,sam]
Output ports
  1. Type: URI Object
    output [txt]
    Change output filename. Default: <stdout>. [txt]

Extension

The Dfi node is part of this extension:

  1. Go to item

Related workflows & nodes

  1. Go to item
  2. Go to item
  3. Go to item

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits