
AP-22457_ParallelChunk_LoopPerformance

wiswedel
Draft. Latest edits on May 12, 2024 9:11 AM
AP-22457: Parallel Chunk Loop (End) is unnecessarily slow due to synchronous data write (columnar backend)

The issue was an unnecessary synchronization when writing the output in the Parallel Chunk End node when the "Columnar Backend" was set on the workflow.

Performance comparison for 50M rows (data generator), with a parallel chunk loop containing a row filter that removes about 2/3 of the rows.

Runtime comparison (on my system):
- Parallel Chunk, 5.2.3: 202s
- Parallel Chunk, 5.3 Nightly: 65s
- Plain Row Filter: 35s (no parallel chunk loop, just for reference)
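To put the three runtimes in relation to each other, the ratios can be worked out with a few lines of Python. This is only arithmetic on the numbers reported above; the dictionary keys and the `ratio` helper are illustrative names, not part of KNIME.

```python
# Runtimes in seconds as reported in the description (50M rows, one system).
runtimes_s = {
    "Parallel Chunk, 5.2.3": 202,
    "Parallel Chunk, 5.3 Nightly": 65,
    "Plain Row Filter": 35,
}


def ratio(slower: str, faster: str) -> float:
    """Return how many times longer `slower` ran than `faster`."""
    return runtimes_s[slower] / runtimes_s[faster]


# Speedup of the parallel chunk loop between 5.2.3 and the 5.3 nightly.
speedup_fix = ratio("Parallel Chunk, 5.2.3", "Parallel Chunk, 5.3 Nightly")

# Remaining cost of the parallel chunk loop relative to a plain Row Filter.
overhead_before = ratio("Parallel Chunk, 5.2.3", "Plain Row Filter")
overhead_after = ratio("Parallel Chunk, 5.3 Nightly", "Plain Row Filter")

print(f"Speedup from the fix:       {speedup_fix:.2f}x")
print(f"Overhead vs. plain, 5.2.3:  {overhead_before:.2f}x")
print(f"Overhead vs. plain, 5.3:    {overhead_after:.2f}x")
```

On these figures the fix makes the parallel chunk loop roughly 3.1 times faster, while it still takes a bit under twice as long as the plain Row Filter on the same data. The numbers come from a single machine, so the ratios are indicative rather than general.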

External resources

  • Related Forum Post

Used extensions & nodes

Created with KNIME Analytics Platform version 5.3.0.
