Hub
Pricing About
NodeNode / Learner

Spark Frequent Item Sets

Tools & ServicesApache SparkMiningItem Sets / Association Rules
Drag & drop
Like

This node uses Spark MLlib to compute frequent item sets. See the Spark Association Rule Learner node to generate frequent item sets and association rules in one step.

Frequent item sets are computed using the FP-growth implementation provided by Spark MLlib, using input data with a collection column, where each cell holds the items of a transaction. Rows with missing values in the selected item column are ignored . FP-growth uses a suffix tree (FP-tree) structure to encode transactions without generating candidate sets explicitly and then extracts the frequent item sets from this FP-tree. This approach avoids the usually expensive generation of explicit candidates sets used in Apriori-like algorithms designed for the same purpose. More information about the FP-Growth algorithm can be found in Han et al., Mining frequent patterns without candidate generation . Spark implements Parallel FP-growth (PFP) described in Li et al., PFP: Parallel FP-Growth for Query Recommendation .

Transactions/item sets are represented as collection columns. The Spark GroupBy or Spark SQL nodes are recommended to create collection columns in Spark.

See Association rule learning (Wikipedia) for general information.

This node requires at least Apache Spark 2.0.

Node details

Input ports
  1. Type: Spark Data
    Spark data
    Spark DataFrame with a collection column, where each cell holds the items of a transaction
Output ports
  1. Type: Spark Data
    Frequent item sets
    Spark DataFrame with frequent item sets

Extension

The Spark Frequent Item Sets node is part of this extension:

  1. Go to item

Related workflows & nodes

  1. Go to item
  2. Go to item
  3. Go to item

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits