The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
missing values
Legacy User
Member Posts: 0 Newbie
Hello,
I'm new to RapidMiner and perhaps I only didn't find the switch...
Is RapidMiner able to handle missing values?
i.e. can I do a linear regression or neural net learning with missing values in the input data?
To get it right: I don't want to replace the missing values.
Regards
Udo
I'm new to RapidMiner and perhaps I only didn't find the switch...
Is RapidMiner able to handle missing values?
i.e. can I do a linear regression or neural net learning with missing values in the input data?
To get it right: I don't want to replace the missing values.
Regards
Udo
0
Answers
there are some learners which directly supports missing values (but there is no switch for that). However, there is also a more generic approach (without the need for replacing the values by something like the average): you could just define a new "category" (let's call it "missing") and use the AttributeValueMapper to map all "?" to this new (artificial) category (this of course only works on nominal data). Then you can apply all learners and they will take the "missing" information into account.
In the following example, I replaced the nominal missing values with the new category "missing" and the numerical missings simply with the average. As you can see, the decision tree learner actually uses the new category (and it is also really important to use this information as you can easily see): Hope that helps,
Ingo