Why would an attribute in a data set not be included in a generated decision tree?
truetaurus
Member Posts: 4 Contributor I
Say I have a data set of customers with information such as bank account, age, telephone, credit history, employment, etc.
When I use RapidMiner, why are some attributes, such as telephone or age, missing from the generated decision tree? What could be the various reasons for this?
Answers
You can change the decision tree settings to grow a tree with more branches, which may then include other attributes.
The decision tree uses a so-called criterion to choose the next attribute for a split. By default, this criterion is gain_ratio. You can find more information by searching for, e.g., Information Gain; there are good articles that explain this measure in detail.
If the gain (with respect to the chosen criterion) that results from splitting on an attribute is less than the corresponding threshold parameter of the decision tree operator (minimal gain in RapidMiner), the algorithm will not split on that attribute, so it never appears in the tree.
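To make the threshold idea concrete, here is a minimal Python sketch of a gain ratio check at a single node. It is not RapidMiner's implementation: the customer rows, the attribute names, and the MIN_GAIN value are all made up for illustration, and a real tree repeats this check at every node and may also prune branches afterwards.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a list of discrete values."""
    total = len(values)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(values).values())


def gain_ratio(rows, attribute, target):
    """Information gain of splitting on `attribute`, divided by split info."""
    # Group the target labels by the value of the candidate attribute.
    groups = {}
    for row in rows:
        groups.setdefault(row[attribute], []).append(row[target])

    total = len(rows)
    # Weighted entropy of the target after the split.
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy([row[target] for row in rows]) - remainder

    # Split info is the entropy of the attribute's own value distribution.
    split_info = entropy([row[attribute] for row in rows])
    return info_gain / split_info if split_info > 0 else 0.0


# Hypothetical customer data: credit_history is related to risk,
# telephone is statistically independent of it.
rows = [
    {"credit_history": "good", "telephone": "yes", "risk": "low"},
    {"credit_history": "good", "telephone": "no",  "risk": "low"},
    {"credit_history": "good", "telephone": "yes", "risk": "low"},
    {"credit_history": "good", "telephone": "no",  "risk": "high"},
    {"credit_history": "bad",  "telephone": "yes", "risk": "high"},
    {"credit_history": "bad",  "telephone": "no",  "risk": "high"},
    {"credit_history": "bad",  "telephone": "yes", "risk": "high"},
    {"credit_history": "bad",  "telephone": "no",  "risk": "low"},
]

MIN_GAIN = 0.1  # assumed threshold, chosen only for this example

# Keep only attributes whose gain ratio reaches the threshold.
selected = [
    attr for attr in ("credit_history", "telephone")
    if gain_ratio(rows, attr, "risk") >= MIN_GAIN
]
print(selected)
```

In this toy data, telephone splits the customers into two groups that each contain the same mix of low and high risk as the whole set, so its gain is zero and it is skipped. Lowering the threshold would not help here; an attribute that carries no information about the label has nothing to contribute to the tree.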
Best, Marius
So what would you say is the reason for an attribute not being included in a decision tree?