The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Keeping only the last data generated.
Hi,
I am applying an operation by groups by using "ValueSubgroupIterator". I have clicked for the option, "apply_on_complete_set" too. So, the number of output examplesets generated are (number of groups in the grouping attribute + 1). Now, I want to keep only that ExampleSet, which is generated by clicking the "apply_on_complete_set". How do i do this automatically for any number of groups in the grouping variable using IOSelector?
Thanks for your help,
Shubha
I am applying an operation by groups by using "ValueSubgroupIterator". I have clicked for the option, "apply_on_complete_set" too. So, the number of output examplesets generated are (number of groups in the grouping attribute + 1). Now, I want to keep only that ExampleSet, which is generated by clicking the "apply_on_complete_set". How do i do this automatically for any number of groups in the grouping variable using IOSelector?
Thanks for your help,
Shubha
0
Answers
Cheers,
Ingo
Being a bear of little brain I can't see what the difference would ever be between Example Set 1 and Example Set 4. So I've obviously missed the point of the question, perhaps you could elaborate?
I'm a bit puzzled by the question, wouldn't that example set be exactly the same as would be produced by just applying the operator to the original example set?
If i am correct, while applying the operator "Normalization", this may not work. Please check with the below code(attached is the data i used):
<operator name="Root" class="Process" expanded="yes">
<operator name="ExampleSource" class="ExampleSource">
<parameter key="attributes" value="C:\Documents and Settings\shubhak\Desktop\try.aml"/>
</operator>
<operator name="Complete_Norm" class="Normalization">
<parameter key="method" value="Range-Transformation"/>
<parameter key="min" value="-1.0"/>
</operator>
<operator name="ValueSubgroupIterator" class="ValueSubgroupIterator" expanded="yes">
<parameter key="apply_on_complete_set" value="true"/>
<list key="attributes">
<parameter key="Group" value="all"/>
</list>
<parameter key="filter_attribute" value="false"/>
<operator name="AttributeSubsetPreprocessing" class="AttributeSubsetPreprocessing" expanded="yes">
<parameter key="attribute_name_regex" value="x.*"/>
<parameter key="condition_class" value="is_numerical"/>
<operator name="Normalization" class="Normalization">
<parameter key="method" value="Range-Transformation"/>
<parameter key="min" value="-1.0"/>
</operator>
</operator>
</operator>
</operator>
I expected the results of "Complete_Norm" and the "Normalization"(by groupings) for complete_subset would be the same. But, it is not so.... Am i missing somethings?
Thanks, Shubha
[attachment deleted by admin]
I think the reason that you are not getting what you expect is that you normalize values that you have already normalized! :-\
To do what you intend you need to make copies of the data and compare the results. I've laid out an alternative version below which acts predictably. I think you will benefit from working rhrough the examples, as the flow of control in Rapidminer can play mirage tricks!