This workflow trains a number of data analytics models on Hadoop and Spark and automatically selects the best model to predict departure delays from a selected airport. Data is the airline dataset downloadable from: http://stat-computing.org/dataexpo/2009/the-data.html. Departure delay is a delay > 15min. Default selected airport is ORD. This workflow implements data reading, data blending, ETL, guided analytics, dimensionality reduction, advanced data mining models, model selection using: Hadoop, Spark, in-memory, parallelization, grid computing, multithreading and/or in-database to speed up computationally intensive operations. Data available in knime://knime.workflow/data/1_Input
Used extensions & nodes
Created with KNIME Analytics Platform version 3.4.0
By using or downloading the workflow, you agree to our terms and conditions.
Discussions are currently not available, please try again later.