The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Issue found in feature weight of RandomForest for regression
marcin_blachnik
Member Posts: 61 Guru
It seems that there is an issue or a bug in the feature_weights returned by RandomForest operator, but only for regression. I found that problem on one dataset but I reconstructed it on IRIS dataset for which features a3 and a4 are the most important but according to the regression RandomForest these two features are the least important.
I evaluated other implementations of RandomForest for regression which returns correct weights (weights which are expected).
Best regards
Marcin
I evaluated other implementations of RandomForest for regression which returns correct weights (weights which are expected).
Best regards
Marcin
Tagged:
0
Answers
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Below I attach another process where it can be seen that the attribute with pure noise is the second most important variable according to RapidMiner implementation of RandomForest (the most important also seems to be attribute selected by chance). Because the trees are simple (5 trees of depth 5) one can count how many times each attribute appeared as a decision node. The noise variable is the least important.
~Martin
Dortmund, Germany