The input table is split into two partitions (i.e. row-wise), e.g. train and test data. The two partitions are available as database queries at the two output ports.
To perform the partitioning the node appends a new partitioning column to the entered table with a random number. This is necessary since the random number function is not deterministic even with a given seed.
Once the partition queries are consumed use the Snowflake Partitioning Cleanup component to remove the added partitioning column.
- Type: DB SessionDB SessionDB Session