Hub
Pricing About
WorkflowWorkflow

The Impact of Movie Posters and Data on Boxoffice Revenues

IMDbText MiningBoxofficeImage Feature ExtractionFace Detection
+10
simonedigreg profile image
Draft Latest edits on 
Nov 20, 2021 3:14 PM
Drag & drop
Like
Download workflow
Workflow preview
This workflow aims at performing inference on boxoffice revenues (adjusted for dollar inflation with CPI) by combining information from structured data about movies from IMDb and scraped posters retrieved through TMDB and its API, complying with the related Terms of Service. Text mining on movie titles is performed with the lexicon approach, while image feature analysis is performed in two steps, face detection through a pre trained convolutional neural network (MTCNN) and image feature extraction (labels and colors) through Google Cloud Vision API. Portability is ensured through Knime URL protocol and Conda Environment Propagation. To make the workflow run, all the data and all the workflows present in its Knime Hub directory need to be present in the working directory (please check their description in order to check their requirements and in order to configure them in the right way). The posters folder is just a placeholder, every poster needs to be scraped again, and it will be saved there. Additionally, the main dataset needs to be downloaded (and put in the working directory) from the given link in the external resources section of this page. Please remove the space in the name of the downloaded .csv file before execution.

External resources

  • Main dataset download (GitHub Repository)
  • CPI index to adjust for dollar inflation
Loading deploymentsLoading ad hoc jobs

Used extensions & nodes

Created with KNIME Analytics Platform version 4.5.1
  • Go to item
    KNIME Base nodesTrusted extension

    KNIME AG, Zurich, Switzerland

    Versions 4.4.2, 4.5.1

    knime
  • Go to item
    KNIME ExpressionsTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.5.1

    knime
  • Go to item
    KNIME HCS ToolsTrusted extension

    Max Planck Institute of Molecular Cell Biology and Genetics (MPI-CBG), Dresden, Germany

    Version 4.0.0

    mpicbg-tds
  • Go to item
    KNIME Interactive R Statistics IntegrationTrusted extension

    KNIME AG, Zurich, Switzerland

    Versions 4.4.2, 4.5.0

    knime
  • Go to item
    KNIME JavaScript ViewsTrusted extension

    KNIME AG, Zurich, Switzerland

    Versions 4.4.2, 4.5.1

    knime
  • Go to item
    KNIME JavasnippetTrusted extension

    KNIME AG, Zurich, Switzerland

    Versions 4.4.2, 4.5.0

    knime
  • Go to item
    KNIME Math Expression (JEP)Trusted extension

    KNIME AG, Zurich, Switzerland

    Versions 4.4.0, 4.5.0

    knime
  • Go to item
    KNIME Python Integration

    KNIME AG, Zurich, Switzerland

    Version 4.5.1

    knime
  • Go to item
    KNIME Quick FormsTrusted extension

    KNIME AG, Zurich, Switzerland

    Versions 4.4.2, 4.5.0

    knime
  • Go to item
    KNIME ServerSpaceTrusted extension

    KNIME AG, Zurich, Switzerland

    Version 4.14.1

    knime

Legal

By using or downloading the workflow, you agree to our terms and conditions.

KNIME
Open for Innovation

KNIME AG
Talacker 50
8001 Zurich, Switzerland
  • Software
  • Getting started
  • Documentation
  • Courses + Certification
  • Solutions
  • KNIME Hub
  • KNIME Forum
  • Blog
  • Events
  • Partner
  • Developers
  • KNIME Home
  • Careers
  • Contact us
Download KNIME Analytics Platform Read more about KNIME Business Hub
© 2025 KNIME AG. All rights reserved.
  • Trademarks
  • Imprint
  • Privacy
  • Terms & Conditions
  • Data Processing Agreement
  • Credits