The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Determining value for parameter
Please Help Me. I am stuck.
I have a general decision tree and also CHAID and ID-3.
The parameters are
- minimal size for split
- minimal leaf size
- minimal gain
- maximal depth
- confidence
My training data is 400.
Ny features are 6707
My amount of total text is 27910
How can I determine a good value for the parameter without testruns. Testruns would take too much time due to the high enourmous amount of data.
Who has an idea for me?
Thank you!!!
0
Answers
If you are working with text and a lot of attributes and short on time you could give Naive Bayes a try.
Also you can try pruning some of your text vectors and removing correlated attributes.
the data is not as big as you might think. It sounds pretty reasonable to use a parameter optimization on that. You can do this either by grid or with an evolutionary approach.
If this is text mining, i would recommend a SVM. Usually they score better and you only have one parameter to optimize for in the linear case (C).
Cheers,
Martin
Dortmund, Germany