Create Databricks Environment

Tools & Services > Apache Spark

Creates a Databricks Environment connected to an existing Databricks cluster. See the AWS or Azure Databricks documentation for more information.
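
For orientation, the DB Connection output this node produces corresponds to an ordinary JDBC/Thrift connection to the cluster. A minimal sketch of an equivalent connection outside of KNIME, assuming the databricks-sql-connector Python package; the hostname, HTTP path, and token below are placeholders, not values from this page:

```python
# Hedged sketch, not the node's implementation: open the same kind of
# connection with databricks-sql-connector (pip install databricks-sql-connector).
# All connection values are placeholders.
from databricks import sql

conn = sql.connect(
    server_hostname="adb-1234567890123456.7.azuredatabricks.net",  # workspace host (placeholder)
    http_path="sql/protocolv1/o/0/0123-456789-abcdefgh",           # cluster HTTP path (placeholder)
    access_token="dapiXXXXXXXXXXXXXXXX",                           # personal access token (placeholder)
)

cursor = conn.cursor()
cursor.execute("SELECT 1")   # trivial round trip to verify the session
print(cursor.fetchall())     # expect [(1,)]
cursor.close()
conn.close()
```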

Note: To avoid an accidental cluster startup, this node creates a dummy DB and Spark port if it is loaded in an executed state from a stored workflow. Reset and execute the node to start the cluster and create a Spark execution context.

Cluster access control: KNIME uploads additional libraries to the cluster, which requires cluster-level manage permission if your cluster is secured with access control. See the Databricks documentation on how to set up this permission.
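
If the permission has to be granted programmatically rather than through the workspace UI, a minimal sketch against the Databricks Permissions REST API may help; the host, token, cluster ID, and user name below are placeholders:

```python
# Hedged sketch: grant cluster-level manage permission via the Databricks
# Permissions REST API (PATCH /api/2.0/permissions/clusters/{cluster_id}).
# Host, token, cluster ID, and user are placeholders.
import requests

HOST = "https://adb-1234567890123456.7.azuredatabricks.net"
TOKEN = "dapiXXXXXXXXXXXXXXXX"
CLUSTER_ID = "0123-456789-abcdefgh"

resp = requests.patch(
    f"{HOST}/api/2.0/permissions/clusters/{CLUSTER_ID}",
    headers={"Authorization": f"Bearer {TOKEN}"},
    json={
        "access_control_list": [
            {
                "user_name": "knime-user@example.com",  # placeholder user
                "permission_level": "CAN_MANAGE",
            }
        ]
    },
)
resp.raise_for_status()
print(resp.json())
```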

Node details

Output ports
  1. Type: DB Session
    DB Connection
    A JDBC connection that can be used with the KNIME database nodes.
  2. Type: File System
    DBFS Connection
    A DBFS connection that can be used with the Spark nodes to read and write files (see the sketch after this list).
  3. Type: Spark Context
    Spark Context
    A Spark context that can be used with all Spark nodes.
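
To illustrate what the DBFS Connection port provides to downstream file handling nodes, here is a minimal sketch that lists a DBFS directory through the Databricks REST API (GET /api/2.0/dbfs/list); the host and token are placeholders and the requests package is assumed:

```python
# Hedged sketch: list files under a DBFS path via the REST API, roughly the
# kind of access the DBFS Connection port exposes. Host and token are placeholders.
import requests

HOST = "https://adb-1234567890123456.7.azuredatabricks.net"
TOKEN = "dapiXXXXXXXXXXXXXXXX"

resp = requests.get(
    f"{HOST}/api/2.0/dbfs/list",
    headers={"Authorization": f"Bearer {TOKEN}"},
    params={"path": "/FileStore"},  # any DBFS path
)
resp.raise_for_status()
for entry in resp.json().get("files", []):
    print(entry["path"], entry["file_size"])
```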

Extension

The Create Databricks Environment node is part of the KNIME Databricks Integration extension.

Related workflows & nodes

  1. Working with Databricks
    Big data, Databricks, Spark
    This workflow demonstrates the usage of the Create Databricks Environment node which allo…
    knime > Examples > 10_Big_Data > 01_Big_Data_Connectors > 03_DatabricksExample
  2. Working with Utility Nodes
    File handling, Zip, Unzip
    Download a compressed file, extract it, read the extracted file, and finally delete the extracted fil…
    knime > Examples > 01_Data_Access > 01_Common_Type_Files > 11_Working_with_Utility_Nodes
  3. Incremental Data Processing with Parquet
    Parquet, Incremental loading, NYC taxi dataset
    In this workflow, we will use the NYC taxi dataset to showcase a continuous preprocessing…
    knime > Examples > 01_Data_Access > 01_Common_Type_Files > 12_Incremental_processing_Parquet_file
  4. Google BigQuery meets Databricks
    Google BigQuery, DB query, Cloud
    This workflow connects to the Austin Bikeshare dataset, hosted among the Google BigQuery …
    knime > Examples > 10_Big_Data > 01_Big_Data_Connectors > 07_Will_They_Blend_BigQuery_Databricks
  5. Data Transfer between Clouds
    File handling, Google, Sharepoint
    This workflow demonstrates the utilization of the new file system connection nodes within…
    knime > Examples > 01_Data_Access > 06_ZIP_and_Remote_Files > 09_Data_Transfer_between_Clouds
