Hub
Pricing About
NodeNode / Other

Persist Spark DataFrame/RDD

Tools & ServicesApache SparkIO
Drag & drop
Like

This node persists (caches) the incoming SparkDataFrame/RDD using the specified persistence level. The different storage levels are described in detail in the Spark documentation .

Caching Spark DataFrames/RDDs might speed up operations that need to access the same DataFrame/RDD several times e.g. when working with the same DataFrame/RDD within a loop body in a KNIME workflow.

Node details

Input ports
  1. Type: Spark Data
    Spark DataFrame/RDD
    Spark DataFrame/RDD to persist.
Output ports
  1. Type: Spark Data
    Persisted Spark DataFrame/RDD
    The persisted Spark DataFrame/RDD.

Extension

The Persist Spark DataFrame/RDD node is part of this extension:

  1. Go to item

Related workflows & nodes

  1. Go to item
  2. Go to item
  3. Go to item

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits