The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Increase of minimum leaf size in Decision Tree"
Hey folks,
I have increase the minimum leaf size in my decision tree, this has result in a smaller more readable tree, but a small decrease in accuracy. I'm being asked what this says about my dataset I'm presuming I'm overfitting the data but I'm not sure. Would anyone have any idea?
Thanks in advance,
Neil.
I have increase the minimum leaf size in my decision tree, this has result in a smaller more readable tree, but a small decrease in accuracy. I'm being asked what this says about my dataset I'm presuming I'm overfitting the data but I'm not sure. Would anyone have any idea?
Thanks in advance,
Neil.
Tagged:
0
Best Answer
-
IngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM FounderHi,Not necessarily. Increasing the leaf size is just a different way of pruning the tree. The goal is to find a good balance between generalizing from your training data without missing the underlying patterns.I am assuming that you refer to a properly validated test accuracy on an independent data set (e.g. by using cross validation) here. If this is the case, then this reduction in accuracy is actually not a sign that you have been overfitting before you made the change, but that you now start to miss some of the valid patterns in your data.Please also note that changes in accuracy may not be significant at all. And that there are other criteria for good models (like understandability), so you may even want to go with a less accurate but more understandable model.Hope those thoughts helps a bit,
Ingo11
Answers