KNIME Community Hub

142 results

Filtered by tag: BigData
  1. Workflow
    Tool Migration: From Excel to Value with KNIME
    Automation Excel Machine Learning
    +4
    This workflow shows how using a no-code/low-code tool like KNIME Analytics Platform can substitute, expand and improve considerab…
    roberto_cadili > Public > Tool Migration - From Excel to Value with KNIME
    2
    roberto_cadili
  2. Workflow
    Incremental Data Processing with Parquet
    Parquet Incremental loading NYC taxi dataset
    +3
    In this workflow, we use the NYC taxi dataset to showcase continuous preprocessing and publishing of event data. Instead o…
    knime > Examples > 01_Data_Access > 01_Common_Type_Files > 12_Incremental_processing_Parquet_file
    2
    knime
  3. Workflow
    HDFS file handling
    HDFS Hadoop Big Data
    This workflow demonstrates the HDFS file handling capabilities using the file handling nodes in conjunction with an HDFS connectio…
    knime > Education > Courses > L4-BD Introduction to Big Data with KNIME Analytics Platform > 2_Hadoop > 4_Examples > 02_HDFS_and_File_Handling_Example
    1
    knime
  4. Workflow
    04 DB WritingToDB Exercise
    Big Data Education
    Big Data Course DB Exercise #4
    knime > Education > Courses > L4-BD Introduction to Big Data with KNIME Analytics Platform > 1_DB > 2_Exercises > 04_DB_WritingToDB
    1
    knime
  5. Workflow
    Working with Databricks
    Big data Databricks Spark
    +2
    This workflow demonstrates the usage of the Create Databricks Environment node which allows you to connect to a Databricks Cluste…
    knime > Examples > 10_Big_Data > 01_Big_Data_Connectors > 03_DatabricksExample
    1
    knime
  6. Workflow
    Working with Google cloud services
    Google Bigquery Cloud storage
    +2
    This workflow demonstrates how to connect to various Google Cloud Services such as Google BigQuery, Google Dataproc, and Google C…
    knime > Examples > 10_Big_Data > 01_Big_Data_Connectors > 04_GoogleCloudExample
    1
    knime
  7. Workflow
    Techniques for Dimensionality Reduction
    ETL Big data Data preprocessing
    +11
    This workflow performs classification on data sets that were reduced using the following dimensionality reduction techniques: - L…
    knime > Examples > 04_Analytics > 01_Preprocessing > 02_Techniques_for_Dimensionality_Reduction > 02_Techniques_for_Dimensionality_Reduction
    1
    knime
  8. Workflow
    Spark Compiled Model Predictor
    Spark Hadoop Big Data
    This workflow demonstrates the usage of the Spark Compiled Model Predictor node which converts a given PMML model into machine co…
    knime > Examples > 10_Big_Data > 02_Spark_Executor > 03_PMML_to_Spark_Comprehensive_Mode_Learning_Mass_Prediction
    1
    knime
  9. Workflow
    Working with Azure services
    Azure Microsoft Hdinsight
    +6
    This workflow demonstrates how to connect to various Azure services such as HDInsight clusters, Azure Blob Storage, and AzureSQL …
    andisa.dewi > Public > 09_AzureExample
    1
    andisa.dewi
  10. Workflow
    Cleaning the NYC taxi dataset on Spark
    Big data Exploration Visualization
    +4
    This workflow handles the preprocessing of the NYC taxi dataset (loading, cleaning, filtering, etc). The NYC taxi dataset contain…
    knime > Examples > 50_Applications > 49_NYC_Taxi_Visualization > Data_Preparation
    1
    knime
  11. Workflow
    HDFS file handling
    HDFS Hadoop Big Data
    This workflow demonstrates the HDFS file handling capabilities using the file handling nodes in conjunction with an HDFS connectio…
    donyriyanto > Public > 02_HDFS_and_File_Handling_Example
    0
    donyriyanto
  12. Workflow
    03.0_Setup_Local_Big_Data_Environment
    Education Data engineering Data engineer
    +3
    This workflow sets up a local big data environment for the next exercise. It creates a local big data environment and loads the u…
    hayasaka > KNIME Fall Summit Training 2022 > L4-DE Best Practices for Data Engineering > exercises > Session_3_ELT_on_Big_Data > 03.0_Setup_Local_Big_Data_Environment
    0
    hayasaka
  13. Workflow
    04.0_Reset_DB&Big_Data_Environment
    Education Data engineering Data engineer
    +3
    This workflow resets the database by overwriting the customers and statistics tables and sets up a local big data environment and…
    hayasaka > KNIME Fall Summit Training 2022 > L4-DE Best Practices for Data Engineering > solutions > Session_4_Orchestration > 04.0_Reset_DB&Big_Data_Environment
    0
    hayasaka
  14. Workflow
    04 Model Building on Big Data
    Big data Spark
    L4-BD self-paced course exercise: train an ML model in Spark; read the prediction results into KNIME.
    manuel1972 > Public > Self-Paced Courses > L4-BD Introduction to Big Data with KNIME Analytics Platform > Exercises > 04 Model Building on Big Data
    0
    manuel1972
  15. Workflow
    Hive - how to get from DB-Connectors to Hive (or Impala) tables - Legacy nodes up to vers. 3.7.x
    Big data Hive Impala
    +2
    Hive - how to get from DB-Connectors to Hive (or Impala) tables
    mlauber71 > Public > kn_example_hive_db_loader_37
    0
    mlauber71
  16. Workflow
    04.0_Reset_DB&Big_Data_Environment
    Education Data engineering Data engineer
    +3
    This workflow resets the database by overwriting the customers and statistics tables and sets up a local big data environment and…
    hayasaka > KNIME Spring Summit Training 2023 > L4-DE Best Practices for Data Engineering > solutions > Session_4_Orchestration > 04.0_Reset_DB&Big_Data_Environment
    0
    hayasaka
  17. Workflow
    Spark MLlib decision tree
    Spark Hadoop Big Data
    This workflow demonstrates the usage of the Spark MLlib Decision Tree Learner and Spark Predictor. It also demonstrates the conve…
    knime > Examples > 10_Big_Data > 02_Spark_Executor > 01_Spark_MLlib_Decision_Tree
    0
    knime
  18. Go to item
    Workflow
    03.0_Setup_Local_Big_Data_Environment
    Education Data engineering Data engineer
    +3
    This workflow sets up a local big data environment for the next exercise. It creates a local big data environment and loads the u…
    hayasaka > KNIME Spring Summit Training 2023 > L4-DE Best Practices for Data Engineering > solutions > Session_3_ELT_on_Big_Data > 03.0_Setup_Local_Big_Data_Environment
    0
    hayasaka
  19. Workflow
    03 Manipulating Big Data
    Big data Spark Hive
    L4-BD self-paced course exercise: manipulate data on Hive with the DB nodes; perform ETL operations in Spark with the Spark no…
    cchioson > Public > Self-Paced Courses > L4-BD Introduction to Big Data with KNIME Analytics Platform > Exercises > 03 Manipulating Big Data
    0
    cchioson
  20. Workflow
    04 Model Building on Big Data
    Big data Spark
    L4-BD self-paced course exercise: train an ML model in Spark; read the prediction results into KNIME.
    cchioson > Public > Self-Paced Courses > L4-BD Introduction to Big Data with KNIME Analytics Platform > Exercises > 04 Model Building on Big Data
    0
    cchioson

© 2023 KNIME AG. All rights reserved.