Hub
Pricing About
ComponentComponent

auto synthetic data generator

ashokharnal profile image
Draft Latest edits on 
Jan 17, 2022 5:12 AM
Drag & drop
Like
Use or download
This node uses CTGAN to generate synthetic data. CTGAN is a collection of Deep Learning based Synthetic Data Generators for single table data, which are able to learn from real data and generate synthetic clones with high fidelity. With ML tools (like the CTGAN), one inputs real data into the software. The software then learns patterns from the data and outputs data that matches those patterns. For more about this technology, you can see the paper 'Modeling Tabular data using Conditional GAN' at https://arxiv.org/abs/1907.00503 and the 'sdv' site: https://sdv.dev/SDV/user_guides/index.html . Synthetic data is generated for all the columns of table whether numeric or categorical. Set of python libraries comprising 'sdv' are required to be installed. If your KNIME is configured to access packages in 'base' Anaconda environment, then on first execution of the component, all necessary packages will be automatically installed. The principal package among these is pytorch. One of the outputs includes evaluation metrics as to how close the synthetic data is to real data.

Component details

Input ports
  1. Type: Table
    InData
    Input KNIME table with or without missing values
Output ports
  1. Type: Table
    outData
    Synthetic data
  2. Type: Python
    model
    This is the model created by the generator and can be used again to generate synthetic data.
  3. Type: Table
    metrics
    This node outputs evaluation metrics

Used extensions & nodes

Created with KNIME Analytics Platform version 4.5.0
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.5.0

    knime
  • Go to item
    KNIME Python Integration

    KNIME AG, Zurich, Switzerland

    Version 4.5.0

    knime
  • Go to item
    KNIME Quick FormsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.5.0

    knime

This component does not have nodes, extensions, nested components and related workflows

Legal

By using or downloading the component, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits