This workflow demonstrates how to apply best practices to a simple ETL (Extract, Transform, Load) process on customer data.
The company extracts new customer data from Amazon S3. Each email in the system gets a unique customer key. Extracted data are validated, transformed, and loaded to the database. In the case of failures, responsible people are notified via an automated email.
The data files are available in the workflow data area. The dataset is generated randomly. Any reference to living persons or real events is purely coincidental.
Workflow
Best Practices for ETL on Customer Data
Used extensions & nodes
Created with KNIME Analytics Platform version 4.5.1
Legal
By using or downloading the workflow, you agree to our terms and conditions.