The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Feature Selection
Hi,
I want to make a Backward and/or Forward Selection in RM5. Unfortunately the Workflow from the Samples (04.../09) does not work when opened. So I would like to ask for an example workflow of a forward and/or backward feature selection. If I understand it right I have to Define a performance evaluation within the Feature Selection, correct?
Would it be possible to do something like a F Value calulcation for each added descriptor?
Best regards,
Markus
I want to make a Backward and/or Forward Selection in RM5. Unfortunately the Workflow from the Samples (04.../09) does not work when opened. So I would like to ask for an example workflow of a forward and/or backward feature selection. If I understand it right I have to Define a performance evaluation within the Feature Selection, correct?
Would it be possible to do something like a F Value calulcation for each added descriptor?
Best regards,
Markus
0
Answers
Sadly 04/09 has problems as you point out, but if you want to do Backward and/or Forward Selection is not 04/10 of any use?
1) What are "good" leaners to use in the Training or which one should I better not use? IS Linear Regression bad for the feature selection? Why is KMeans used in the Online TUtorial?
2) Can I make the Feature Selection in a way, that each generation needs to improve by some performance (lets say 0.05 R²) otherwise it stops?
3) Is there a way to write up each taken descriptor right after the run is over (so to say an on-the-fly log file)?
Best regards,
Markus
beside the samples (which are partly outdated) I would recommend using the explicit Forward Attribute Selection operator. It's much more efficient than the old Attribute Selection operator and offers exactly what you are longing for: Detailed Stopping Criteria definition.
You cannot answer the question what a good learner is in general. This depends on your task, on your data and in last on your patience. The linear regression is a relatively fast learner and since the learner will be applied for each attribute in each round, it should be fast. But if it suits the data, one cannot say. Just try it and exchange with another lateron to compare. RapidMiner is designed for this kind of experimenting...
What do you mean by descriptor?
Greetings,
Sebastian