The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Association rule creating. problem with FP-growth operator"
Hi! I have a problem with handling Excel file via association rule creating. Please let me know what do I do wrong? The error message I got is "The exampleset contains non-nominal attribute "prod.date" which is not allowed to fp-growth" .". I try to handle data from Excel file but I have transformed it to .txt in order to attach it.
<?xml version="1.0" encoding="UTF-8"?><process version="8.0.001">
<operator activated="true" class="retrieve" compatibility="8.0.001" expanded="true" height="68" name="Retrieve рабочий файл для проверки (2)" width="90" x="45" y="34">
<parameter key="repository_entry" value="//рабочий файл/первая попытка/рабочий файл для проверки"/>
</operator>
</process>
<?xml version="1.0" encoding="UTF-8"?><process version="8.0.001">
<operator activated="true" class="numerical_to_binominal" compatibility="8.0.001" expanded="true" height="82" name="Numerical to Binominal" width="90" x="246" y="34">
<parameter key="attribute_filter_type" value="all"/>
<parameter key="attribute" value=""/>
<parameter key="attributes" value=""/>
<parameter key="use_except_expression" value="false"/>
<parameter key="value_type" value="numeric"/>
<parameter key="use_value_type_exception" value="false"/>
<parameter key="except_value_type" value="real"/>
<parameter key="block_type" value="value_series"/>
<parameter key="use_block_type_exception" value="false"/>
<parameter key="except_block_type" value="value_series_end"/>
<parameter key="invert_selection" value="false"/>
<parameter key="include_special_attributes" value="false"/>
<parameter key="min" value="0.0"/>
<parameter key="max" value="0.0"/>
</operator>
</process>
<?xml version="1.0" encoding="UTF-8"?><process version="8.0.001">
<operator activated="true" class="fp_growth" compatibility="8.0.001" expanded="true" height="82" name="FP-Growth" width="90" x="380" y="34">
<parameter key="find_min_number_of_itemsets" value="true"/>
<parameter key="min_number_of_itemsets" value="100"/>
<parameter key="max_number_of_retries" value="15"/>
<parameter key="min_support" value="0.95"/>
<parameter key="max_items" value="-1"/>
<parameter key="keep_example_set" value="false"/>
</operator>
</process>
<?xml version="1.0" encoding="UTF-8"?><process version="8.0.001">
<operator activated="true" class="create_association_rules" compatibility="8.0.001" expanded="true" height="82" name="Create Association Rules" width="90" x="514" y="34">
<parameter key="criterion" value="confidence"/>
<parameter key="min_confidence" value="0.8"/>
<parameter key="min_criterion_value" value="0.8"/>
<parameter key="gain_theta" value="2.0"/>
<parameter key="laplace_k" value="1.0"/>
</operator>
</process>
0
Answers
@mzharov can you just use a Select Attributes and remove 'prod.date?' Then see if it works.
Hi Thomas
I have deleted any dates from my file at all. Ran the same process Retrive-Nominal to Binominal-FP-Growth-Create assoc. rule and got a similar error "The exampleset contains non-nominal attribute "ID" which is not allowed to fp-growth" . ID is just unique number of lines in my file. Could you tell where is my mistake ? Thanks in advance
screen shot of error and sample of loading file are in attachment
I dont open DOCX files as a rule, so you'd have to post a screenshot. Also, your XML is corrupt. You have to post the XML correctly like from this KB article https://community.rapidminer.com/t5/RapidMiner-Studio-Knowledge-Base/How-can-I-share-processes-without-RapidMiner-Server/ta-p/37047
Hi Thomas,
ok, once again . I created new xml and attached correct files. pleace, check . thanks in advance.
@mzharov the XML is still corrupted. You have to open the XML view and copy it from there.
Also as a rule, I don't open PPTX files either. Pretty much nothing from MSFT.
Hi Thomas,
actually I do not have an xml option in View-Show panel. I have turned xml at bottom of screen on. So xml from that screen attached. Please, check it. Thanks. Konstantin
@mzharov ok, the XML works now but your data file appears to be rather strange. Is it supposed to look this way, meaning are different data-types mixed into the same attribute column? It could also be my encoding as well.
Hi Thomas,
I have fixed my file. But anyway I get the similar error . Is there something wrong with data format ?? More info about my file below in the bottom of my post
the sample of my loading data below.
Row ID code material name stock am.per serie stock date
@mzharov ok, I see what's going on here. You have two attribute columns with real numbers in them. In order to use the FP-Growth operator, you must convert all the data into binmomals (true/false). The conversion is not happening for the numerical values as the Nominal to Binomal operator can't figure out how to transform 70 to true or false.
You would have to figure out how to manipulate the numericals into true/false OR remove them from the data set.