From Web Scraping to an Interactive Dashboard
Challenge description:
Your organization is entering the US fashion industry as a new player in the market. H&M US is your biggest competitor, so you’d like to understand how it is pricing different clothing products for women, men and kids. In particular, you’re interested in the products belonging to the categories listed under “Shop by Product” for each of the three target buyer groups. Problem: you don’t have access to that information in a well-structured, tabular format.
Hence, you are tasked with building a web scraper that collects product items and prices for each product category (Shirts, Jeans, Shoes, etc.). Repeat the process for all three target buyer groups. Finally, create an interactive dashboard to visually inspect H&M US’s pricing strategies.
Key requirement: your web scraper must rely on the use of the Webpage Retriever node. You are not allowed to use the nodes of the KNIME Web Interaction (Labs) extension.
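To make the target table shape concrete, here is a minimal Python sketch of the extraction step, written outside KNIME purely for illustration. The actual solution must be built with KNIME nodes around the Webpage Retriever node. The HTML fragment, the class names "product", "name" and "price", and the helper names ProductParser and scrape are all hypothetical assumptions; the real H&M US markup is not described in this challenge and needs to be inspected first.

```python
from html.parser import HTMLParser
from statistics import median

# Hypothetical markup: the real page structure is an assumption and will
# differ; adapt tag and class names after inspecting the retrieved page.
SAMPLE_HTML = """
<ul>
  <li class="product"><span class="name">Linen Shirt</span><span class="price">$24.99</span></li>
  <li class="product"><span class="name">Oxford Shirt</span><span class="price">$34.99</span></li>
  <li class="product"><span class="name">Cotton Shirt</span><span class="price">$17.99</span></li>
</ul>
"""


class ProductParser(HTMLParser):
    """Collects (name, price) pairs from the hypothetical markup above."""

    def __init__(self):
        super().__init__()
        self.products = []
        self._field = None   # which span we are currently inside, if any
        self._current = {}   # partially collected product

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "span" and css_class in ("name", "price"):
            self._field = css_class

    def handle_data(self, data):
        if self._field is None:
            return  # text outside the spans we care about
        self._current[self._field] = data.strip()
        self._field = None
        if "name" in self._current and "price" in self._current:
            # Strip the currency symbol and thousands separators.
            price = float(self._current["price"].lstrip("$").replace(",", ""))
            self.products.append((self._current["name"], price))
            self._current = {}


def scrape(html, group, category):
    """Returns one row per product, tagged with buyer group and category."""
    parser = ProductParser()
    parser.feed(html)
    return [
        {"group": group, "category": category, "item": name, "price_usd": price}
        for name, price in parser.products
    ]


rows = scrape(SAMPLE_HTML, "Men", "Shirts")
median_price = median(row["price_usd"] for row in rows)
```

Running the same extraction for every category of every buyer group and concatenating the rows would give one tidy table (group, category, item, price) that a dashboard can aggregate, for example by median price per category.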
Outcome:
A stable and exhaustive web scraper and an interactive dashboard displaying key insights of H&M US's pricing strategy.
Deliver your solution as a separate workflow and name it: Solution_Round_16_<your_team_name>. Place your solution workflow in the same folder as this challenge workflow.
Teams are strongly encouraged to submit high-quality work in order to improve their chances of getting maximum points. Don't be afraid to go the extra mile! :)
Dataset:
Data shall be scraped from the website of H&M US: https://www2.hm.com/en_us/index.html
Deadline:
March 10, 2024 (submission by 11:59 PM CET)**. Check the tournament calendar: https://info.knime.com/game-of-nodes
** We will verify the date and time of the latest edits.
KNIME Game of Nodes:
Rules, Assessment Criteria & FAQs: https://info.knime.com/game-of-nodes
External resources
Used extensions & nodes
All required extensions are part of the default installation of KNIME Analytics Platform version 5.2.1
Legal
By using or downloading the workflow, you agree to our terms and conditions.