NodeSpark Statistics


This node computes summary statistics for the selected input columns using the MLlib Statistics package.

Computed statistics:

  • Minimum value
  • Maximum value
  • Sample mean
  • Sample variance
  • L1 norm
  • L2 norm
  • Number of nonzero elements
  • Number of zero elements
  • Row count

Input Ports

  1. Port Type: Spark Data
    Spark DataFrame/RDD to compute statistics for.

Output Ports

  1. Port Type: Data
    Table with numeric values.