Hub
Pricing About
  • Software
  • Blog
  • Forum
  • Events
  • Documentation
  • About KNIME
  • KNIME Community Hub
  • mlauber71
  • Spaces
  • Public
  • kn_example_bigdata_h2o_automl_spark_46
  • s_601_spark_label_encoder
WorkflowWorkflow

s_601 - Sparkling predictions and encoded labels - "the poor man's ML Ops" (on a Big Data System)

Knime Spark Hive Impala Label
+5
mlauber71 profile image

Last edited: 

Drag & drop
Like
Download workflow
Copy short link
Workflow preview
Sparkling predictions and encoded labels - "the poor man's ML Ops" (on a Big Data System) Use Big Data Technologies like Spark to ge a robust and scalable data preparation. Use the latest Auo ML technology like H2O.ai AutoML to cretae a robust model and deploy it in a Big Data environment (like Cloudera) s_601 - prepare label encoding with spark prepare the preparation of data in a big data environment - label encode string variables - transform numbers into Double format (Spark ML likes that) - remove highly correlated data - remove NaN variables - remove continous variables

External resources

  • the data used is a cleaned and updated version of Census Income dataset
  • Being Lazy is Useful — Lazy Evaluation in Spark

Used extensions & nodes

Created with KNIME Analytics Platform version 4.6.0
  • Go to item
    KNIME Base nodes Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime
  • Go to item
    KNIME Database Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime
  • Go to item
    KNIME Extension for Apache Spark Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime
  • Go to item
    KNIME Extension for Big Data File Formats Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime
  • Go to item
    KNIME Extension for Local Big Data Environments Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime
  • Go to item
    KNIME Javasnippet Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime
  • Go to item
    KNIME Math Expression (JEP) Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime
  1. Go to item
  2. Go to item
  3. Go to item
  4. Go to item
  5. Go to item
  6. Go to item
Loading deployments
Loading ad hoc executions

Legal

By using or downloading the workflow, you agree to our terms and conditions.

Discussion
Discussions are currently not available, please try again later.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • E-Learning course
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • KNIME Open Source Story
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more on KNIME Business Hub
© 2023 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Credits