Hub
Pricing About
WorkflowWorkflow

Use SMOTE and ROSE algorithms to balance data

RRoseBalanceImbalancedKnime
mlauber71 profile image
Draft Latest edits on 
May 29, 2018 6:19 PM
Drag & drop
Like
Download workflow
Workflow preview
Use both SMOTE (Synthetic Minority Over-sampling Technique) and ROSE (Random Over-Sampling Examples) algorithms to balance data. SMOTE is implemented within KNIME. ROSE can be accessed via R. It is advisable to balace only your training data and leave the test/validation data as they are or you run the risk of greatly inflated values on your precision statistics.

External resources

  • more options about unbalanced data
  • SMOTE Hints from KNIME Team members 3 (Classification Threshold Analysis)
  • SMOTE Hints from KNIME Team members 2
  • SMOTE Hints from KNIME Team members 1
  • Imbalanced Data : How to handle Imbalanced Classification Problems
  • forum entry
  • R ROSE
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 4.1.0
  • Go to item
    KNIME CoreTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.0.1

    knime
  • Go to item
    KNIME Interactive R Statistics IntegrationTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.0.1

    knime

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits