Hub
Pricing About
WorkflowWorkflow

01_Fetch_BioAssays

PubChemChemistryAWSApache SparkLife sciences
+5
knime profile image
Draft Latest edits on 
Apr 14, 2025 2:12 PM
Drag & drop
Like
Download workflow
Workflow preview
This is the first workflow in the PubChem Big Data story. In the top part of the workflow we download the assay data from the PubChem database using its API and upload it to a specified S3 bucket on AWS. One file per assay/experiment (AID). In the bottom part we clean up the assay data using KNIME Extension for Apache Spark and store cleaned up files on AWS. AWS Autentication component, Paths to Livy and S3 component, and Create Spark Contex (Livy) node require configuration.
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 4.6.0
  • Go to item
    KNIME Amazon Cloud ConnectorsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime profile image
    knime
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime profile image
    knime
  • Go to item
    KNIME ExpressionsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime profile image
    knime
  • Go to item
    KNIME Extension for Apache SparkTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime profile image
    knime
  • Go to item
    KNIME JavasnippetTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime profile image
    knime
  • Go to item
    KNIME JSON-ProcessingTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime profile image
    knime
  • Go to item
    KNIME Quick FormsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime profile image
    knime
  • Go to item
    KNIME REST Client ExtensionTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.6.0

    knime profile image
    knime
  • Go to item
    Vernalis KNIME NodesTrusted extension

    Vernalis Research Ltd, Cambridge, UK

    Version 1.34.2

    vernalis

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits