Hub
Pricing About
WorkflowWorkflow

8.1 Practical Machine Learning with R Churn Analysis (unbalanced)

Logistic RegressionUnbalanced DataEducationSMOTE
carstenlange profile image
Draft Latest edits on 
Mar 4, 2024 7:13 AM
Drag & drop
Like
Download workflow
Workflow preview
This workflow repeats the Churn Analysis from the textbook Practical Machine Learning with R (https:\\ai.lange-analytics.com). We are using unbalanced data for a churn analysis. "Unbalanced" means that one class (customers who did not churned) contains significantly more observations than the other class (customers who churned). Check the Value Count Node to see how imbalanced the data are. Consequently, the model focuses too much on the majority class. Check the Sensitivity and Specificity in the Scorer to see the problem.

External resources

  • Contact the author
  • Another workflow in this space uses SMOTE to balance the data
  • Open the related R analysis in RStudio
  • Practical Machine Learning with R (Chapter 8)
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 5.2.1
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.1

    knime

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits