The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Getting startet with DMC 2005 dataset

mariechen666mariechen666 Member Posts: 1 Learner III
edited November 2018 in Help
Hey,

I am pretty new to data mining and I would like to work on this dataset from the data mining cup 2005 with rapidminer:
http://www.data-mining-cup.de/en/review/dmc-2005/

I would like to classify the customers as written in the task. But first I have to preprocess the data as it has many many attributes and rows... I am not sure how to start. Can anyone give me a little help on this? I started by using only a sample of the data training set, so that anything I try does not take such a long time...
As it is a dataset on fraud detection in e-commerce it would be great for me to know how this works. I'm reading a lot on data mining, but I am not sure how to handle this data set in rapidminer. Maybe anyone of you have some insights, tips or ressources where I can learn how to use rapidminer on this particular example?

You would really help me a lot! I'm looking forward to learn how to use rapidminer!

Best regards,
Marie
Sign In or Register to comment.