Association Rules Customer data
Hello everybody!
I am student who recently started to work with RapidMiner for a school project and I have to apply Association Analysis on some data sets. Up until now I have performed it on Market Basket Analysis to determine co-occurrence and relationships between the items. But our professor gave us a data set to apply or assigned models on it. Basically I have customer data with 10.000 examples, which determines if they are likely to buy a bike or not. Out of 10.000 examples 90% are non-bike buyers and 10% bike buyers. I am not sure if this data set is appropriate for association analysis but nevertheless I have done some analysis. I made association between the non-buyers and created some simple rules, but I don't know if it makes sense or not and all of my conclusions are non-buyers. What do I have to do to get only the Buyers?
I appreciate your help and your time.
Answers
@tolga_zm95 If this was strictly a classification problem, (i.e. buyers vs non-buyers) and you want to predict potential buyers, then I'd say you have a very unbalanced data set and would need to do some balancing but since you only want to know the rules, can you not just filter on the buyers using a Filter Example and filter for Yes on Bike Buyers?
@
Thank you so much for the help Thomas!
I have another questions and it is about discretizing. I want to create 3 groups for Age. For example Young (0-35), Middle (35-50) and Old (50-100), but when I use the discretize (user specifications) operator I can only define the upper limit. Do you know how to solve this problem?
Thank you so much in advance!
Regards
Tolga
(friendly reminder from moderator – don't forget to mark replies as solutions)
Scott