The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
How can I reduce the amount of values in a attribute?
I have a data set that has 4 values. They are polynomials. One of the values in the attribute have too many and I want all the values to have around the same amount of it. Is my label. I am trying to do a decision tree. I have 400 of one of the values but I want it to lower it to 40. randomly. Is there an option?
0
Best Answer
-
David_A Administrator, Moderator, Employee-RapidMiner, RMResearcher, Member Posts: 297 RM Researchhi @BrunoC,you can use the normal Sample Operator and check the parameter balance data. Then you can specify the exact amount of examples per class that you want in your test set. you can either specify an exact number (with absolute sampling) or a ratio (with relative sampling).An alternative approach could be to group the other values, so you don't loose too many examples (going from 400 to only 40 examples can strongly reduce the efficiency of your model, especially when you want to do a Cross-Validation for testing your model). Take a look at the Replace Rare Values operator from the Operator Tool Box extension.Best,
David6