The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Calculation

mbukombuko Member Posts: 10 Contributor II
edited July 2019 in Help
Hi,

I am a student and try to understand the RapidMiner output after applying a trained model (e.g. multiclass with SVM, KNN, NB). I am getting the prediction and for each possible class/label the confidence value. But I don't know how to interpret these confidence values and how their calculation is defined.

Do you know, if there are definitions anywhere of the calculation and interpretation of the confidence values for each classification algorithm (e.g. SVM, KNN, NB)?

Best regards
mbuko

Tagged:

Answers

  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    Hi,

    since the mentioned algorithms are standard algorithms you can search easily on the web. As a good quoteable, mathematical source i can recommend Hastie, Tibsheirani, Friedmann - The elements of statistical learning. But be aware - math dragons are around the corner.

    For k-NN: It is simply the (weighted) fraction of examples belonging to the class.
    NB: The result of the naive bayes formula. See: https://en.wikipedia.org/wiki/Naive_Bayes_classifier the one in RM is actually the gaussian one.
    For SVM: i recommend the book above.. Quite some fun with lagrange multipliers. In the end classification is only a scalar product. i think this is simply mapped to [0,1]

    ~Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • mbukombuko Member Posts: 10 Contributor II
    Thank you for your recommendations, I will  examine it. In my case there are also negative values for the confidence values of the SVM classification, that is why I was a bit surprised. The relationship and description of the confidence values of these algorithms I  have not seen in any literature or website yet.

    Best regards
    mBuko
Sign In or Register to comment.