Hub
Pricing About
WorkflowWorkflow

Spark Label Encoding - prepare the data in local Big Data environment

KnimeSparkHiveImpalaLabel
+3
mlauber71 profile image
Draft Latest edits on 
Mar 30, 2019 4:48 PM
Drag & drop
Like
Download workflow
Workflow preview
s_401 - prepare label encoding with spark prepare the preparation of data in a big data environment - label encode string variables - transform numbers into Double format (Spark ML likes that) - remove highly correlated data - remove NaN variables - remove continous variables - optional: normalize the data

External resources

  • the data used is a cleaned and updated version of Census Income dataset
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 4.2.0 Note: Not all extensions may be displayed.
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Versions 4.1.2, 4.2.0

    knime
  • Go to item
    KNIME DatabaseTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.1.1

    knime
  • Go to item
    KNIME Extension for Apache SparkTrusted extension

    KNIME AG, Zurich, Switzerland

    Versions 4.1.1, 4.2.0

    knime
  • Go to item
    KNIME Extension for Big Data File FormatsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.1.0

    knime
  • Go to item
    KNIME Extension for Local Big Data EnvironmentsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.1.0

    knime
  • Go to item
    KNIME JavasnippetTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.2.0

    knime
  • Go to item
    KNIME Math Expression (JEP)Trusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.1.0

    knime
  • Go to item
    Vernalis KNIME NodesTrusted extension

    Vernalis Research Ltd, Cambridge, UK

    Version 1.26.0

    vernalis

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits