To apply optimal binning on a dataset, one need to bind the 4th output port of Optimal Binning Component to 1st input port of Apply Component, and the 3rd ouput port of Optimal Binning Component to 2nd input port of Apply Component.
Step by Step Guide:
1- Initially, to run this component one should install Python Integration extensions.
2- For obtain a better Python node performance, pyarrow library should be installed.
3- Having installed pyarrow library, select serialization library as Apache Arrow under preferences. This option makes a huge difference as performance compared to Flatbuffers Column Serialization
- Type: TableData to ApplyData to Apply (from 4th output port of Optmal Binning Component)
- Type: TableIV's within ThresholdInformation Values over threshold with variable list.