Design a model to do data cleaning
Best Answers
lionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
Hi @JoeJoe,
Do you have access to Turbo Prep in RapidMiner?
If so, you can go to CLEANSE --> AUTO CLEANSING.
Hope this helps,
Regards,
Lionel
IngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
Hi,
Probably neither of those two settings would be best. However, association rules need binary input data, so you should first clean the data (without those two settings) and then discretize all numerical attributes into binary bins. Finally, you may need to perform one-hot encoding for nominal attributes with more than two values. The cut-off points for discretization, and which value counts as positive vs. negative, will depend on the business problem you want to solve.
Best,
Ingo
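For anyone who wants to see the binarization steps outside RapidMiner, here is a rough sketch in Python with pandas. It is not the RapidMiner process itself, only an illustration of the same idea. The column names, the sample rows, the cut-off values, and the helper `to_binary_items` are all made up for the example; real cut-offs depend on the business problem. For simplicity it one-hot encodes every nominal column, including two-valued ones, and it assumes the data has already been cleaned.

```python
import pandas as pd


def to_binary_items(df, cutoffs):
    """Turn a cleaned table into all-boolean columns (hypothetical helper).

    Numeric columns listed in `cutoffs` become one True/False column
    ("value is above the cut-off"); every other column is one-hot encoded.
    """
    out = pd.DataFrame(index=df.index)
    for col in df.columns:
        if col in cutoffs:
            # Discretize a numeric column into a single binary bin.
            out[f"{col}>{cutoffs[col]}"] = df[col] > cutoffs[col]
        else:
            # One-hot encode a nominal column: one boolean column per value.
            out = out.join(pd.get_dummies(df[col], prefix=col, dtype=bool))
    return out


# Made-up sample data, assumed to be already cleaned (no missing values).
data = pd.DataFrame({
    "age": [23, 45, 31, 62],
    "income": [28000, 72000, 51000, 39000],
    "region": ["north", "south", "east", "south"],
})

# Illustrative cut-off points only; choose them from the business problem.
cutoffs = {"age": 40, "income": 50000}

items = to_binary_items(data, cutoffs)
print(items)
```

The resulting table contains only True/False columns, which is the kind of input an association-rule learner expects.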
Answers