
Synthetic Data Augmentation with Copulas

Tags: Data augmentation, Copulas, Synthetic data, Data generation, Copula
Uploaded by carlosenrique84
Version 1.0 (latest), created on May 15, 2025 2:08 PM

The University of Saskatchewan
Ph.D. in Interdisciplinary Studies

Created by: Carlos Enrique Diaz, MBM, P.Eng.
Email: carlos.diaz@usask.ca

Supervisor: Lori Bradford, Ph.D.
Email: lori.bradford@usask.ca

Description:

This workflow demonstrates how to assess the quality of synthetic data generated using the Synthetic Data (Copulas) component in KNIME. It uses the well-known Iris dataset as a reference.

Section 1: Original Data Analysis with 150 Observations

  • Loads and preprocesses the Iris dataset (150 rows).

  • Uses Linear Correlation and Statistics nodes to explore the original data’s structure and relationships.
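
As a rough illustration of what Section 1 computes, the sketch below reproduces per-column summary statistics and pairwise Pearson correlations in plain Python. It is not the implementation of the KNIME nodes; the function names (`describe`, `correlation_matrix`) and the two-column toy table are hypothetical, and the toy values are not the actual Iris measurements.

```python
import math
from statistics import mean, stdev


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def describe(columns):
    """Per-column summary statistics, keyed by column name."""
    return {
        name: {"min": min(v), "max": max(v), "mean": mean(v), "stdev": stdev(v)}
        for name, v in columns.items()
    }


def correlation_matrix(columns):
    """Pairwise Pearson correlations as a nested dict."""
    return {
        a: {b: pearson(columns[a], columns[b]) for b in columns}
        for a in columns
    }


# Toy stand-in values (assumption: NOT the actual Iris measurements).
toy = {
    "length": [1.0, 2.0, 3.0, 4.0, 5.0],
    "width": [2.1, 3.9, 6.2, 8.1, 9.8],
}
stats = describe(toy)
corr = correlation_matrix(toy)
```

Running the same two functions on each dataset (original, mixed, synthetic) gives tables that can be compared side by side, which mirrors how the workflow reuses the same analysis nodes in every section.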

Section 2: Mixed Data with 650 Observations

  • Generates 500 synthetic rows using the Synthetic Data (Copulas) component.

  • Merges the synthetic data with the original data (total: 650 rows).

  • Applies the same analysis nodes to compare the combined dataset with the original.
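
The page does not describe how the Synthetic Data (Copulas) component works internally. The sketch below therefore assumes a Gaussian copula with empirical marginals, purely to illustrate the generate-then-merge idea of Section 2. All names (`sample_gaussian_copula`, the `origin` flag) and the toy values are hypothetical, and the row counts are scaled down from 150 + 500 to 6 + 20.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()


def normal_scores(values):
    """Map a column to standard-normal scores through its ranks."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    scores = [0.0] * n
    for rank, i in enumerate(order, start=1):
        scores[i] = STD_NORMAL.inv_cdf(rank / (n + 1))
    return scores


def correlation(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def cholesky(matrix):
    """Lower-triangular L with L * L^T == matrix (must be positive definite)."""
    n = len(matrix)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(max(matrix[i][i] - s, 0.0))
            else:
                low[i][j] = (matrix[i][j] - s) / low[j][j]
    return low


def sample_gaussian_copula(columns, n_rows, seed=0):
    """Draw n_rows synthetic rows that mimic the rank dependence of columns."""
    rng = random.Random(seed)
    names = list(columns)
    scores = [normal_scores(columns[name]) for name in names]
    low = cholesky([[correlation(a, b) for b in scores] for a in scores])
    sorted_cols = [sorted(columns[name]) for name in names]
    rows = []
    for _ in range(n_rows):
        indep = [rng.gauss(0.0, 1.0) for _ in names]
        # Correlate the independent normals with the Cholesky factor.
        z = [sum(low[i][k] * indep[k] for k in range(i + 1))
             for i in range(len(names))]
        row = {}
        for name, zi, col in zip(names, z, sorted_cols):
            u = STD_NORMAL.cdf(zi)  # back to a uniform in [0, 1]
            row[name] = col[min(int(u * len(col)), len(col) - 1)]
        rows.append(row)
    return rows


# Toy stand-in values (assumption: NOT the actual Iris measurements).
toy = {
    "length": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "width": [2.0, 1.5, 3.5, 3.0, 5.0, 4.5],
}
original_rows = [dict(zip(toy, vals), origin="original")
                 for vals in zip(*toy.values())]
synthetic_rows = [dict(row, origin="synthetic")
                  for row in sample_gaussian_copula(toy, 20)]
mixed = original_rows + synthetic_rows  # 6 original + 20 synthetic rows
```

Because this sketch inverts the empirical distribution without interpolation, every synthetic value is one that already occurs in the original column; a production generator would typically fit or interpolate the marginals instead. The `origin` flag is what makes it possible to separate the two sources again later.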

Section 3: Pure Synthetic Data with 500 Observations

  • Filters to keep only the 500 synthetic rows.

  • Runs correlation and statistical analysis again to evaluate the synthetic data on its own.
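
One possible way to turn the Section 3 comparison into a single number is to filter the synthetic rows and measure how far their pairwise correlations drift from those of the original rows. The sketch below is an assumption-laden illustration, not part of the workflow: the `origin` column, the helper `max_correlation_gap`, and the toy rows are all hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def max_correlation_gap(reference, candidate, names):
    """Largest absolute difference between pairwise correlations of two tables."""
    gap = 0.0
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            r_ref = pearson([r[a] for r in reference], [r[b] for r in reference])
            r_new = pearson([r[a] for r in candidate], [r[b] for r in candidate])
            gap = max(gap, abs(r_ref - r_new))
    return gap


# Toy mixed table with a hypothetical "origin" flag (not real Iris data).
mixed = [
    {"length": 1.0, "width": 2.0, "origin": "original"},
    {"length": 2.0, "width": 1.5, "origin": "original"},
    {"length": 3.0, "width": 3.5, "origin": "original"},
    {"length": 4.0, "width": 3.0, "origin": "original"},
    {"length": 2.0, "width": 2.0, "origin": "synthetic"},
    {"length": 3.0, "width": 3.0, "origin": "synthetic"},
    {"length": 4.0, "width": 3.5, "origin": "synthetic"},
]
names = ["length", "width"]

# Keep only the synthetic rows, as the filter step in this section does.
synthetic = [r for r in mixed if r["origin"] == "synthetic"]
original = [r for r in mixed if r["origin"] == "original"]

gap = max_correlation_gap(original, synthetic, names)
```

A gap near 0 would suggest the synthetic rows preserve the linear dependence structure of the original data; what counts as acceptable depends on the use case and is not specified by the workflow.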

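Comparing the correlation matrices and summary statistics across the three sections gives a simple check of how closely the synthetic data reproduces the statistical structure of the original, using only built-in KNIME nodes for the analysis.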

External resources

  • Synthetic Data (Copulas) Component

Used extensions & nodes

Created with KNIME Analytics Platform version 5.4.2
  • KNIME Base nodes (trusted extension), KNIME AG, Zurich, Switzerland, version 5.4.1

  • KNIME Javasnippet (trusted extension), KNIME AG, Zurich, Switzerland, version 5.4.0

  • KNIME Python Integration (trusted extension), KNIME AG, Zurich, Switzerland, version 5.4.1

  • KNIME Quick Forms (trusted extension), KNIME AG, Zurich, Switzerland, version 5.4.1

  • KNIME Statistics Nodes (trusted extension), KNIME AG, Zurich, Switzerland, version 5.4.0
