Use both SMOTE (Synthetic Minority Over-sampling Technique) and ROSE (Random Over-Sampling Examples) algorithms to balance data. SMOTE is implemented within KNIME. ROSE can be accessed via R.
It is advisable to balace only your training data and leave the test/validation data as they are or you run the risk of greatly inflated values on your precision statistics.
Workflow
Use SMOTE and ROSE algorithms to balance data
External resources
Used extensions & nodes
Created with KNIME Analytics Platform version 4.1.0
Legal
By using or downloading the workflow, you agree to our terms and conditions.