ROC-curve
domi_wiese
Member Posts: 28 Contributor II
Hello,
I want to compare ROC curves using the Golf data set. I've transformed its attributes to binominal. When I open the comparison chart, no curve is displayed. Could someone please help?
<?xml version="1.0" encoding="UTF-8"?><process version="8.1.000">
<operator activated="true" class="retrieve" compatibility="8.1.000" expanded="true" height="68" name="Retrieve Golf" width="90" x="112" y="136">
<parameter key="repository_entry" value="//Samples/data/Golf"/>
</operator>
</process>
<?xml version="1.0" encoding="UTF-8"?><process version="8.1.000">
<operator activated="true" class="nominal_to_binominal" compatibility="8.1.000" expanded="true" height="103" name="Nominal to Binominal" width="90" x="246" y="136">
<parameter key="return_preprocessing_model" value="false"/>
<parameter key="create_view" value="false"/>
<parameter key="attribute_filter_type" value="all"/>
<parameter key="attribute" value=""/>
<parameter key="attributes" value=""/>
<parameter key="use_except_expression" value="false"/>
<parameter key="value_type" value="nominal"/>
<parameter key="use_value_type_exception" value="false"/>
<parameter key="except_value_type" value="file_path"/>
<parameter key="block_type" value="single_value"/>
<parameter key="use_block_type_exception" value="false"/>
<parameter key="except_block_type" value="single_value"/>
<parameter key="invert_selection" value="false"/>
<parameter key="include_special_attributes" value="false"/>
<parameter key="transform_binominal" value="false"/>
<parameter key="use_underscore_in_name" value="false"/>
</operator>
</process>
<?xml version="1.0" encoding="UTF-8"?><process version="8.1.000">
<operator activated="true" class="numerical_to_binominal" compatibility="8.1.000" expanded="true" height="82" name="Numerical to Binominal" width="90" x="380" y="136">
<parameter key="attribute_filter_type" value="all"/>
<parameter key="attribute" value=""/>
<parameter key="attributes" value=""/>
<parameter key="use_except_expression" value="false"/>
<parameter key="value_type" value="numeric"/>
<parameter key="use_value_type_exception" value="false"/>
<parameter key="except_value_type" value="real"/>
<parameter key="block_type" value="value_series"/>
<parameter key="use_block_type_exception" value="false"/>
<parameter key="except_block_type" value="value_series_end"/>
<parameter key="invert_selection" value="false"/>
<parameter key="include_special_attributes" value="false"/>
<parameter key="min" value="0.0"/>
<parameter key="max" value="0.0"/>
</operator>
</process>
<?xml version="1.0" encoding="UTF-8"?><process version="8.1.000">
<operator activated="true" class="compare_rocs" compatibility="8.1.000" expanded="true" height="82" name="Compare ROCs" width="90" x="514" y="136">
<parameter key="number_of_folds" value="10"/>
<parameter key="split_ratio" value="0.7"/>
<parameter key="sampling_type" value="stratified sampling"/>
<parameter key="use_local_random_seed" value="false"/>
<parameter key="local_random_seed" value="1992"/>
<parameter key="use_example_weights" value="true"/>
<parameter key="roc_bias" value="optimistic"/>
<process expanded="true">
<operator activated="true" class="naive_bayes" compatibility="8.1.000" expanded="true" height="82" name="Naive Bayes" width="90" x="112" y="30">
<parameter key="laplace_correction" value="true"/>
</operator>
<operator activated="true" class="concurrency:parallel_decision_tree" compatibility="8.1.000" expanded="true" height="103" name="Decision Tree" width="90" x="112" y="136">
<parameter key="criterion" value="gain_ratio"/>
<parameter key="maximal_depth" value="20"/>
<parameter key="apply_pruning" value="true"/>
<parameter key="confidence" value="0.25"/>
<parameter key="apply_prepruning" value="true"/>
<parameter key="minimal_gain" value="0.1"/>
<parameter key="minimal_leaf_size" value="2"/>
<parameter key="minimal_size_for_split" value="4"/>
<parameter key="number_of_prepruning_alternatives" value="3"/>
</operator>
<connect from_port="train 1" to_op="Naive Bayes" to_port="training set"/>
<connect from_port="train 2" to_op="Decision Tree" to_port="training set"/>
<connect from_op="Naive Bayes" from_port="model" to_port="model 1"/>
<connect from_op="Decision Tree" from_port="model" to_port="model 2"/>
<portSpacing port="source_train 1" spacing="0"/>
<portSpacing port="source_train 2" spacing="0"/>
<portSpacing port="source_train 3" spacing="0"/>
<portSpacing port="sink_model 1" spacing="0"/>
<portSpacing port="sink_model 2" spacing="0"/>
<portSpacing port="sink_model 3" spacing="0"/>
</process>
</operator>
</process>
Best Answer
Thomas_Ott RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,761 Unicorn
To see the Performance results, just connect the PER port from the Cross Validation to the RES output.
Answers
Your XML is not working for me. Please open the XML view and copy and paste from there.
@domi_wiese Did you run the tutorial for the Compare ROCs operator like I suggested? That should show you how you need to configure the operator to get what you want. If so, what are you trying to do differently with the Golf dataset that isn't working?
Hello,
Now I have my process.
@Telcontar120
@Thomas_Ott
@domi_wiese it doesn't work because you are using 10 folds and you only have 14 examples. The dataset is too small; try the Sonar dataset.
Hi all,
For me, this process with the Golf dataset runs without raising an error, but, as for @domi_wiese, no ROC curve is displayed.
But you're right, @Thomas_Ott. In this special case, where the number of folds is greater than the number of members in each class,
why doesn't RapidMiner raise an error (or a warning)? And how is the cross-validation performed?
@domi_wiese, you can decrease the number of folds parameter of the Compare ROCs operator
to satisfy the condition above (for example, you can set number of folds = 5).
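To make the fold arithmetic concrete, here is a small plain-Python sketch. It is not RapidMiner's actual sampling code. It assumes the Golf data has 9 "yes" and 5 "no" examples, and it uses a simple round-robin deal as a stand-in for stratified sampling. The function names are made up for this illustration.

```python
def stratified_folds(labels, k):
    """Deal each class's examples round-robin into k folds.

    This is only a stand-in for stratified sampling, not RapidMiner's algorithm.
    """
    folds = [[] for _ in range(k)]
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)
    for members in by_class.values():
        for pos, idx in enumerate(members):
            folds[pos % k].append(idx)
    return folds


def folds_with_both_classes(labels, k):
    """Count the test folds that contain at least one example of each class."""
    folds = stratified_folds(labels, k)
    return sum(1 for fold in folds if len({labels[i] for i in fold}) == 2)


# Assumption: the Golf label column has 9 "yes" and 5 "no" examples.
golf_labels = ["yes"] * 9 + ["no"] * 5

for k in (10, 5):
    usable = folds_with_both_classes(golf_labels, k)
    print(f"{k} folds: {usable} of {k} test folds contain both classes")
```

Under these assumptions, only half of the 10 test folds contain both classes (one fold is even empty), while with 5 folds every test fold has both. A ROC curve needs positives and negatives in the test data, which may be why the chart comes out empty with 10 folds.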
Regards,
Lionel
@lionelderkrikor it might be working for you because of your seed; mine is probably different. Why didn't RM throw an error? That's a more interesting question, and I don't know the answer to that.
Hello @Thomas_Ott @lionelderkrikor @Telcontar120,
thank you for bringing my attention to this matter. Now I get what the problem was.
I've got just one more question. It's about creating a lift chart with a binominal performance operator.
I'm not sure whether my process is even right, or why the performance operator isn't working at all.
I would be very thankful if someone could send me the correct process.
@Thomas_Ott @lionelderkrikor @Telcontar120
Sorry, I forgot to post the process.
Hi @domi_wiese,
I cannot load your last XML code in RapidMiner.
To build a lift chart, you can for example use the Create Lift Chart operator.
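If it helps to see what a lift chart summarizes, here is a rough plain-Python sketch of the usual definition of lift. It does not show what the Create Lift Chart operator does internally. The function name and the toy confidences and labels are invented for illustration only.

```python
def lift_by_bin(confidences, actuals, positive, n_bins):
    """Rank examples by confidence, split them into equal bins, and return
    each bin's positive rate divided by the overall positive rate."""
    ranked = sorted(zip(confidences, actuals), key=lambda p: p[0], reverse=True)
    n = len(ranked)
    overall_rate = sum(1 for _, a in ranked if a == positive) / n
    lifts = []
    for b in range(n_bins):
        # Integer bin boundaries so every example lands in exactly one bin.
        chunk = ranked[b * n // n_bins:(b + 1) * n // n_bins]
        rate = sum(1 for _, a in chunk if a == positive) / len(chunk)
        lifts.append(rate / overall_rate)
    return lifts


# Hypothetical toy data: model confidence for "yes" and the true label.
toy_confidences = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
toy_actuals = ["yes", "yes", "yes", "no", "yes", "no", "yes", "no", "no", "no"]

print(lift_by_bin(toy_confidences, toy_actuals, positive="yes", n_bins=5))
```

A lift above 1 in the first bins means the model ranks the positive class ahead of what random ordering would give.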
Best regards,
Lionel
Hi,
@lionelderkrikor @Thomas_Ott @Telcontar120
I'm really sorry for my mistakes. I hope it works now.
Hi @domi_wiese,
You can use the Cross Validation operator instead of the Split Validation operator; it is considered a best practice.
Best regards,
Lionel
Hi,
Thank you very much! I got it!