The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
How do Random Forests in RapidMiner support missing values?
Does a random forest predict a missing value or does it exclude it from the final prediction
Tagged:
0
Best Answers
-
MartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientisti think we go both ways and take the average of the prediction, but i would need to check
- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany5 -
varunm1 Member Posts: 1,207 UnicornGenerally, random forest algorithms impute missing values by an average of proximity values or mode. But if you are selecting Criterion as gain_ratio it uses C 4.5 algorithm developed by Quinlan, in this it doesn't impute values but it will calculate an impurity score based on missing values and uses it if it encounters missing values in the test set. So looks like it's not removing samples with missing values or it depends on criterion we are selecting.
correct me this if there any misconception.
ThanksRegards,
Varun
https://www.varunmandalapu.com/Be Safe. Follow precautions and Maintain Social Distancing
6