1. Data Preparation

Workflow

Draft Latest edits on

Data Preparation

This workflow prepares the data for the next workflow ("My first Data Model") and uses some of the most common nodes for data preparation:

Applying different strategies for missing values (Missing Value node)
Creating subsets of the data (Row Sampler and Table Partitioner nodes)
Shuffling (Shuffle node)
Concatenation of data sets (Concatenate node)
Normalizing data (Normalizer and Normalizer (Apply) nodes)

After preprocessing, the workflow writes the two subsets back to .csv files, one for the training set (top partitioning), one for test set (bottom partitioning).

External resources

KNIME Beginner's Luck (Book Homepage)

Loading deploymentsLoading ad hoc jobs

Legal

By using or downloading the workflow, you agree to our terms and conditions.

1. Data Preparation

External resources

Used extensions & nodes

KNIME Base nodesTrusted extension

Legal

KNIME Base nodes