Hub
Pricing About
WorkflowWorkflow

02.2 ELT on website usage data - transform big data on cloud and Spark

EducationData engineeringData engineerBest practicesSpark
+4
knime profile image
Draft Latest edits on 
Mar 3, 2025 7:28 PM
Drag & drop
Like
Download workflow
Workflow preview

The company tracks the usage of the website and stores the information about different actions during each session: login and logout times, opened pages, clicked buttons, as well as the session satisfaction score (optional) and wants to calculate statistics for each customer, e.g., total number of visits, average satisfaction, etc.

Note. Session satisfaction score column has missing values which can be imputed using machine learning predictions.

We access the website usage data from the local big data environment (set up in the exercise workflow 02.1) and personal data (anonymized & updated in exercise workflow 01) and contracts data from a database. We then perform in-database processing, import data into Spark, enrich the website usage data with the personal and contract data to predict missing session satisfaction scores, and save the aggregated data.

Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 5.4.0
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.4.0

    knime profile image
    knime
  • Go to item
    KNIME DatabaseTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.4.0

    knime profile image
    knime
  • Go to item
    KNIME Extension for Apache SparkTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.4.0

    knime profile image
    knime
  • Go to item
    KNIME Extension for Local Big Data EnvironmentsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.4.0

    knime profile image
    knime
  • Go to item
    KNIME Optimization extensionTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 5.4.0

    knime profile image
    knime

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits