Taxi Demand Prediction on Spark Deployment

Workflow

Taxi Demand Prediction on Spark Deployment

Draft Latest edits on

This workflow applies a time series prediction model (Random Forest) to the NYC taxi dataset to predict taxi demand in the next hour based on data from past hours. Given the large size of the dataset, we train and deploy the machine learning model on a Spark cluster. The KNIME Big Data Extension allows you to run a KNIME workflow on the big data platform you prefer, via in-database processing or via Spark.

Loading deploymentsLoading ad hoc jobs

Legal

By using or downloading the workflow, you agree to our terms and conditions.