The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
FP--Growth <-> Apriori
Hi,
I am currently using the FPGrowth and the WEKA-Apriori Operator on the Iris Dataset.
The Process looks like this:
<operator name="Root" class="Process" expanded="yes">
<operator name="ArffExampleSource" class="ArffExampleSource">
<parameter key="data_file" value="C:\Dokumente und Einstellungen\b\Eigene Dateien\rm_workspace\sample\data\iris.arff"/>
</operator>
<operator name="Numerical2Polynominal" class="Numerical2Polynominal">
</operator>
<operator name="W-Apriori" class="W-Apriori">
<parameter key="M" value="0.0010"/>
<parameter key="I" value="true"/>
</operator>
</operator>
Both the FPGrwoth and the Apriori Op have a min_support of 0.001 . The other options are Standard.
My question is: Why is the Weka Op able to find Itemsets and the FPGrowth Op not ? Even when i lower the min_support FPGrowth doesn't
find any Itemsets at all.
I am currently using the FPGrowth and the WEKA-Apriori Operator on the Iris Dataset.
The Process looks like this:
<operator name="Root" class="Process" expanded="yes">
<operator name="ArffExampleSource" class="ArffExampleSource">
<parameter key="data_file" value="C:\Dokumente und Einstellungen\b\Eigene Dateien\rm_workspace\sample\data\iris.arff"/>
</operator>
<operator name="Numerical2Polynominal" class="Numerical2Polynominal">
</operator>
<operator name="W-Apriori" class="W-Apriori">
<parameter key="M" value="0.0010"/>
<parameter key="I" value="true"/>
</operator>
</operator>
Both the FPGrwoth and the Apriori Op have a min_support of 0.001 . The other options are Standard.
My question is: Why is the Weka Op able to find Itemsets and the FPGrowth Op not ? Even when i lower the min_support FPGrowth doesn't
find any Itemsets at all.
0
Answers
Theres still something that I don't get:
The last 4 Rules from the Apriori Result:
7. petallength = 1.300=true 7 ==> class = Iris-setosa=true 7 conf:(1)
8. petallength = 1.600=true 7 ==> class = Iris-setosa=true 7 conf:(1)
9. petalwidth = 0.400=true 7 ==> class = Iris-setosa=true 7 conf:(1)
10. petalwidth = 0.300=true 7 ==> class = Iris-setosa=true 7 conf:(1)
Are not generated from FPGrowth. Even the itemsets are not generated (The Apriori OP generates more sets than FPGrowth) . Do you have any idea why ?
Thanks in advance,
Birger
Don't want to sound like the Thought Police, but you need to check out the algorithms, which take different inputs and produce different outputs, as we have seen. http://en.wikipedia.org/wiki/Association_rule_learning is as good a place to start as any.
That being said it would be as useful as a fart in a space-suit if different algorithms were to produce wildly different associations. But fear not! If you set like against like with the minimum support the results on the Iris set are consistent, as this shows...
although FP-Growth and Apriori should return exactly the same results in theory, the implementations are quite different. This does not change the result, if the input is equal, but both operators make different assumptions. For example does the FP-Growth operator ignore special attributes, it seems to me, that the W-Apriori doesn't. So if you label is a special attribute, for example of role label, FP-Growth would ignore it, and hence no FrequentItemSet would be generated containing it.
Greetings,
Sebastian