The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Compare the perfomance of various models (accuracy)"
Hi,
I'm new to data mining, and i'm stuck in a project that my teacher gave me to do.
I have a data set and 6 models and for each model i want to generate a report that compare the accuracys between them.
The illustration of my work is here:
ProcessCross Validation
I'm sorry if this is not the correct place for post this doubt, but like i said before, i'm new here.
Thanks for the attention!
Tagged:
0
Best Answer
-
MartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
Brian,
Micro is the weighted one. See: http://rushdishams.blogspot.de/2011/08/micro-and-macro-average-of-precision.html
~martin
- Sr. Director Data Solutions, Altair RapidMiner -
Dortmund, Germany0
Answers
I am not totally certain what output you are trying to achieve, but you might try the operator "Compare ROCs." You simply place your individual models into that subprocess, which is similar to the Cross Validation process you are using. It will produce an exhibit that shows the performance of all of the models together.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Hi,
i usually use Performance to Data. This can then be joined/filtered/appended how ever you like it.
~Martin
Dortmund, Germany
@mschmitz I have a related question: when running cross-validation, if performance vectors are output, there are 3 summary values returned, such as the following example:
"accuracy: 77.896% +/- 3.824% (mikro: 77.912%)"
Can you clarify the calculation of these 3 metrics? My assumption is as follows:
Thanks for the help!
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Brian,
you got it right. Mirko is german for micro . And also the rest. The one value is a simple average. The other value is a average where you use the number of examples on the testing side of the fold as a weight. I always mix it up which one is which.
With enough examples micro=macro because all weights are equal.
~Martin
Dortmund, Germany
Thanks @mschmitz!
@IngoRM do you remember which average is which (i.e., what the mikro version is, weighted or unweighted)?
Happy Thanksgiving!
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Thanks @mschmitz!
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Man, people are too fast here. I never get a chance to answer myself :smileytongue:
Thanks,
Ingo