Hub
Pricing About
WorkflowWorkflow

04.2_ELT_Usage_-_Exercise

EducationData engineeringData engineerBest practicesSpark
+1
R
Draft Latest edits on 
Feb 5, 2025 6:18 PM
Drag & drop
Like
Download workflow
Workflow preview
The company tracks the usage of the website and stores the information about each session.

- Various data are collected, e.g., session start, duration, # clicks, etc., as well as the session satisfaction score (optional)
- The company calculates averaged statistics for each customer, e.g., total # visits, average satisfaction, etc., and updates the "statistics" table in different locations
- Session satisfaction score column has missing values which need to be imputed, e.g., with machine learning predictions.

We access the usage data from Hive and personal data (anonymized & updated in sessions 1 & 2) and contracts data from a database. We perform in-database processing, read the data into Spark, enrich the usage data with the personal and contract data to predict missing values better, aggregate usage data on Spark, and save the aggregated data.

Here, we additionally export the final status of the workflow. In the case some processes fail, we notify responsible people via an automated email.
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 5.3.0
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Versions 5.2.1, 5.3.0

    knime
  • Go to item
    KNIME DatabaseTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.0

    knime
  • Go to item
    KNIME Extension for Apache SparkTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.0

    knime
  • Go to item
    KNIME Extension for Local Big Data EnvironmentsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.2.0

    knime
  • Go to item
    KNIME Personal Productivity ToolsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.3.0

    knime

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits