Create Big Data Test Environment (legacy) (deprecated)

Node / Source

This node has been deprecated and its use is not recommended. Please search for updated nodes instead.

Creates a fully functional big data environment for testing purposes, including Apache Hive, Apache Spark and a remote file system. This node has no configuration of its own; instead, it reads its configuration from a file called flowvariables.csv in the root of the KNIME workspace. This file is expected to provide keys and values, which control what this node does.
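As a rough illustration of the key/value idea, the sketch below parses two-column CSV text into a dictionary. It assumes one comma-separated key,value pair per line, which the description above does not confirm, and the keys shown (example.key, another.key) are hypothetical placeholders rather than settings the node actually reads.

```python
import csv
import io


def parse_flow_variables(text):
    """Parse two-column CSV text (key,value per line) into a dict.

    The two-column layout is an assumption made for this sketch.
    """
    variables = {}
    for row in csv.reader(io.StringIO(text)):
        # Skip blank or incomplete lines instead of failing on them.
        if len(row) < 2:
            continue
        key, value = row[0].strip(), row[1].strip()
        variables[key] = value
    return variables


# Hypothetical content; real key names are not given in this description.
sample = "example.key,example-value\nanother.key,42\n"
settings = parse_flow_variables(sample)
print(settings)
```

All values are kept as strings here; any type conversion would depend on how a given key is meant to be interpreted.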

Note: This node only creates a new Spark context upon its first execution after KNIME has started, or after the context has been destroyed. The Spark context created during its first execution is meant to be shared between KNIME testflows.
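The create-once-then-share behaviour in the note above resembles a common get-or-create pattern. The sketch below is a generic illustration of that lifecycle, not the node's actual implementation; the function names and the dictionary standing in for a context are hypothetical.

```python
import itertools

_counter = itertools.count(1)
_shared_context = None  # hypothetical holder for the shared context


def get_or_create_context():
    """Return the shared context, creating it only if none exists."""
    global _shared_context
    if _shared_context is None:
        # A plain dict stands in for a real Spark context in this sketch.
        _shared_context = {"id": next(_counter)}
    return _shared_context


def destroy_context():
    """Discard the shared context so the next call creates a new one."""
    global _shared_context
    _shared_context = None


first = get_or_create_context()   # creates the context
second = get_or_create_context()  # reuses the same context
destroy_context()
third = get_or_create_context()   # creates a fresh context
```

Here first and second refer to the same object, while third is a new one created after the earlier context was destroyed.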

Note: This node uses the old database-connection-based Hive output port.

Node details

Output ports

Type: Database Connection
Hive Connection
JDBC connection to a Hive instance. This port can be connected to the KNIME database nodes.
Type: Remote Connection
Remote file system Connection
Remote file system connection that can be used with the Spark nodes that read/write files.
Type: Spark Context
Spark Context
Spark context that can be connected to all Spark nodes.
