PySpark Script (2 to 1)

Manipulator

This node allows you to execute Python code on Spark (See PySpark documentation). The code has to put the desired output in the data frame with the name resultDataFrame1

Input Ports

  1. Type: Spark Data First input Spark DataFrame.
  2. Type: Spark Data First input Spark DataFrame.

Output Ports

  1. Type: Spark Data Result Spark DataFrame.

Find here

Tools & Services > Apache Spark > Misc > PySpark

Make sure to have this extension installed:

KNIME Extension for Apache Spark

Update site for KNIME Analytics Platform 3.7:
KNIME Analytics Platform 3.7 Update Site

How to install extensions