The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Answers
yes, both compute the same measure.
InfoGainRatio is the Rapid-I/RapidMiner implementation and W-GainRatioAttributeEval is the Weka implementation of this metric.
Best regards,
Ralf
But the result I am getting is different for the two operators.
I am using it alongwith AttributeWeightSelection for doing attribute selection.
The attributes selected after the AttributeWeightSelection is different in both the cases.
I am using the same value for all the parameters in AttributeWeightSelection in both the cases.
[weight - 0.0, weight_relation - greater and all other parameters set to their default value.]
Root
TextInput
StringTokenizer
TokenlengthFilter
W-GainRatio Attribute Eval / InfoGainRatioWeighing
AttributeWeightSelection
X-Validation
LibSVMLearner
OperatorChain
ModelApplier
Performance
Using TFIDF as feature Vector.
W-GainRatioAttributeEval is giving me much better result. Also it is much faster.
If someone knows it. Plez reply.
Cheers,
Ingo
Is there any way I can see the GainRatio values calculated for various attributes?
I have put a breakpoint in the W-GainRatioAttributeEval / InfoGainRatioWeighing operator. It is showing 'range' coloumn and 'statisitcs'. But I guess they are not the Gain Ratio value calculated for an attribute. Because I am using AttributeWeightSelection with attribute weight=0.0 & weight_relation=greater & the attributes that are getting pruned is not according to the value in these colomns.
the weighting operators return two results: The input example set and a weights object denoting the calculated weight. Both objects will be shown in the result view as tabs. If you switch to the AttributeWeights tab, you will see a table with the (normalized) weights, calculated by the InfoGainRatioWeighting. If you want to see the original info gain ration, you should turn of the normalization by deselecting the "normalization" parameter of the operator.
Greetings,
Sebastian