Why would you want to use cluster analysis on your retail sales data? Well, cluster analysis helps you identify non-independence in your data. Here is an example to help illustrate the point. Lets say we want to ask loads of teachers from many different schools what they think of their principal. If you ask two different teachers from two different schools, you will get two completely different answers that will be independent. But, if you ask two teachers from the same school, the answers will not be completely independent and could be very similar – but not EXACTLY the same. Now if your job was to take the raw data and try to predict which school each teacher came from based on their answer – then you have an application of clustering.
The same thing can be applied to Walmart store performance for a supplier. You have some data points for a store like how long that store has been open, how many competitors it has located in its vicinity, what was your products sales performance for that store, some demographics for that area like unemployment and population, possibly even some historic weather data. Now you use a clustering algorithm to group your stores that are most closely related. This could be the first step in identifying under performing stores and why. It could give you a viable store list for a product test based on more than sales performance. It might help you further identify your product identity and who your actual customers are using enough demographic data. You might not find anything you didn’t already know. The important thing is that you are diving into your data to truly understand it on a level you never have before, and uncovering one of these nuggets could be millions of dollars difference to your company.
Once you’ve built your base analysis, and in our case we built our report that you see above, turned it into an in-memory cube, and then built a MicroStrategy dashboard on top of it – we can then explore slicing and dicing our data along the different data points to help identify if any of the metrics in our analysis are a key contributor to a cluster alignment. This way we can determine what factor affects sales the most. Could it be store age? or store square footage? or unemployment? Ethnic breakdown? What of these are driving markdowns?
The great thing about using this analysis as a MicroStrategy dashboard is that it is pretty easy to tweak to look for your top performing stores, and refreshing the data source is very easy. In fact, this report could be automated each week and emailed to you. There might even be an application to look for cluster changes and have something like that generate an alert so you only need to be bothered if anything changes.
Contact us today to discover how Vortisieze analytics can help you explore your own data science.