KNIME Community Hub

101 results

Filter by tag: Spark
  1. Workflow
    A meta collection about KNIME and performance and performance tuning and some problems
    Knime Performance Slow (+18 more)
    mlauber71 > Public > _knime_performance_and_tuning
    3
    mlauber71
  2. Workflow
    Movie Recommendation Engine with Spark Collaborative Filtering
    Retail Association rules Recommandation engine (+10 more)
    Creates a recommendation engine using Spark and Big Data nodes on movie rating data.
    knime > Examples > 10_Big_Data > 02_Spark_Executor > 10_Recommendation_Engine_w_Spark_Collaborative_Filtering
    2
    knime
  3. Workflow
    Spark Compiled Model Predictor
    Spark Hadoop Big Data
    This workflow demonstrates the usage of the Spark Compiled Model Predictor node which converts a given PMML model into machine co…
    knime > Examples > 10_Big_Data > 02_Spark_Executor > 03_PMML_to_Spark_Comprehensive_Mode_Learning_Mass_Prediction
    1
    knime
  4. Workflow
    Tool Migration: From Excel to Value with KNIME
    Automation Excel Machine Learning (+4 more)
    This workflow shows how using a no-code/low-code tool like KNIME Analytics Platform can substitute, expand and improve considerab…
    roberto_cadili > Public > Tool Migration - From Excel to Value with KNIME
    1
    roberto_cadili
  5. Workflow
    Big Data Analytics - Model Selection to Predict Flight Departure Delays on Hive & Spark
    Data blending Data science Machine learning (+7 more)
    This workflow trains a number of data analytics models on Hadoop and Spark and automatically selects the best model to predict de…
    knime > Examples > 50_Applications > 28_Predicting_Departure_Delays > 02_Scaling_Analytics_w_BigData
    1
    knime
  6. Workflow
    Google BigQuery meets Databricks
    Google BigQuery DB query Cloud (+7 more)
    This workflow connects to the Austin Bikeshare dataset, hosted among the Google BigQuery public datasets and a Databricks instanc…
    knime > Examples > 10_Big_Data > 01_Big_Data_Connectors > 07_Will_They_Blend_BigQuery_Databricks
    1
    knime
  7. Workflow
    Working with Google cloud services
    Google Bigquery Cloud storage (+2 more)
    This workflow demonstrates how to connect to various Google Cloud Services such as Google BigQuery, Google Dataproc, and Google C…
    knime > Examples > 10_Big_Data > 01_Big_Data_Connectors > 04_GoogleCloudExample
    1
    knime
  8. Workflow
    Working with Azure services
    Azure Microsoft Hdinsight (+6 more)
    This workflow demonstrates how to connect to various Azure services such as HDInsight clusters, Azure Blob Storage, and AzureSQL …
    andisa.dewi > Public > 09_AzureExample
    1
    andisa.dewi
  9. Workflow
    Working with Databricks
    Big data Databricks Spark (+2 more)
    This workflow demonstrates the usage of the Create Databricks Environment node which allows you to connect to a Databricks Cluste…
    knime > Examples > 10_Big_Data > 01_Big_Data_Connectors > 03_DatabricksExample
    1
    knime
  10. Workflow
    03.4_Writing_from_Spark_solution
    Education Data engineering Data engineer (+3 more)
    The company tracks the usage of the website and stores the information about each session. - Various data are collected, e.g., se…
    chemgirl36 > Public Space > L4-DE Best Practices for Data Engineering > solutions > Session_3_ELT_on_Big_Data > 03.4_Writing_from_Spark
    0
    chemgirl36
  11. Workflow
    03.3_Aggregation_on_Spark_exercise
    Education Data engineering Data engineer (+3 more)
    The company tracks the usage of the website and stores the information about each session. - Various data are collected, e.g., se…
    chemgirl36 > Public Space > L4-DE Best Practices for Data Engineering > exercises > Session_3_ELT_on_Big_Data > 03.3_Aggregation_on_Spark
    0
    chemgirl36
  12. Workflow
    04.2_ELT_Usage
    Education Data engineering Data engineer (+3 more)
    The company tracks the usage of the website and stores the information about each session. - Various data are collected, e.g., se…
    chemgirl36 > Public Space > L4-DE Best Practices for Data Engineering > exercises > Session_4_Orchestration > 04.2_ELT_Usage
    0
    chemgirl36
  13. Workflow
    04 Model Building on Big Data
    Big data Spark
    L4-BD self-paced course exercise: train an ML model in Spark, then read the prediction results into KNIME.
    mferdous2012 > Public > 04 Model Building on Big Data
    0
    mferdous2012
  14. Workflow
    Combine Big Data, Spark and H2O.ai Sparkling Water
    Knime H2o H2o.ai (+8 more)
    - load data into (local) Big Data environment - load data into Spark context - load data into H2O.ai Sparkling Water context - bu…
    mlauber71 > Public > kn_example_h2o_sparkling_water
    0
    mlauber71
  15. Workflow
    s_601 - Sparkling predictions and encoded labels - "the poor man's ML Ops" (on a Big Data System)
    Knime Spark Hive (+7 more)
    Sparkling predictions and encoded labels - "the poor man's ML Ops" (on a Big Data System) Use Big Data Technologies like Spark to…
    mlauber71 > Public > kn_example_bigdata_h2o_automl_spark_46 > s_601_spark_label_encoder
    0
    mlauber71
  16. Workflow
    s_605 - use the stored rules and lists to actually prepare the data
    Spark H2o Data (+4 more)
    s_605 - apply the label encoding and other transformations stored in SQL code and the selected final column as RegEx string Get t…
    mlauber71 > Public > kn_example_bigdata_h2o_automl_spark_46 > s_605_spark_prepare_data
    0
    mlauber71
  17. Workflow
    s_600 - Sparkling predictions and encoded labels - "the poor man's ML Ops"
    Knime H2o H2o.ai (+8 more)
    s_600 - Sparkling predictions and encoded labels - "the poor man's ML Ops" Use Big Data Technologies like Spark to get a robust a…
    mlauber71 > Public > kn_example_bigdata_h2o_automl_spark_46 > s_600_spark_h2o_automl_about_this_collection
    0
    mlauber71
  18. Workflow
    Using hive over hdfs
    Big data Spark Hive
    This workflow demonstrates a) how to save a CSV file in HDFS and access it, b) how to connect to HiveServer2, c) how to create a …
    ashokharnal > Collection of Components and Workflows > bigdata > Hive over hdfs
    0
    ashokharnal
  19. Workflow
    Big Data Analytics - Model Selection to Predict Flight Departure Delays on Hive & Spark
    Data blending Data science Machine learning (+7 more)
    This workflow trains a number of data analytics models on Hadoop and Spark and automatically selects the best model to predict de…
    aditi104 > Public > 02_Scaling_Analytics_w_BigData
    0
    aditi104
  20. Workflow
    03.1_In-database&Spark_processing_exercise
    Education Data engineering Data engineer (+3 more)
    The company tracks the usage of the website and stores the information about each session. - Various data are collected, e.g., se…
    chemgirl36 > Public Space > L4-DE Best Practices for Data Engineering > exercises > Session_3_ELT_on_Big_Data > 03.1_In-database&Spark_processing
    0
    chemgirl36

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
© 2023 KNIME AG. All rights reserved.