Customer Segmentation
This workflow implements basic customer segmentation through a clustering algorithm, k-Means. Customer segmentation can help Sales and Marketing departments identify hidden patterns in customer behavior/preferences and define better expansion and retention strategies:
The sample data provided represents customer transaction and spending behaviors, as well as customer demographics. Raw data is joined, partitioned (existing vs. new customers) and preprocessed (missing value handling, outlier detection and normalization).
Using the Elbow method, it's possible to visually estimate the best number (k) of clusters for the algorithm. Next, the k-Means algorithm is used to segment customers and the obtained clusters are assigned to new customer.
Obtained clusters can be visualized and inspected in an interactive view and their quality assessed with a scoring metric. Clustered new data can be further exported for further processing or reporting.