The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

"root_mean_squared_error using SVM"

ksullivanksullivan Member Posts: 3 Contributor I
edited June 2019 in Help

Hi all,

 

I'm trying to generate some basic models/predictions using the set-up shown:

2016-08-09_1231.png

The THz data file contains just three columns: Time (integer), Density of people (integer), and Water vapour (integer).

 

The process works, in that I get the following:

2016-08-09_1238.png

 

However, in

Tagged:

Best Answers

  • IngoRMIngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder
    Solution Accepted

    Ah, ok, a different question than I guessed (I need to work on my predictive model :smileywink:)

     

    Accuracy and precision would only be used for a classification problem but it looks like you try to predict numbers (called "regression").  You could turn this into a classification problem by applying some preprocessing step before (for example "Numerical to Polynominal").  Then you would get accuracy or precision instead.

     

    If you want to see other performance measurements in general, you should try out different "Performance (...)" operators and change their parameters.  They allow you to define exactly what type of performance measurement you want to calculate.  The operator "Performance" just guesses what you might be interested in based on the type of the label column.

     

    Cheers,

    Ingo

  • sfraunhoffersfraunhoffer Member Posts: 4 Contributor I
    Solution Accepted

    Hi,

    it's because you have a regression and not a classification task.

    The accuracy is the relative number of correctly classified examples or in other words the percentage of correct predictions.

    The precision is the relative number of correctly as positive classified examples among all examples classified as positive.

     

    Greetings,

    Sigrun

Answers

  • IngoRMIngoRM Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751 RM Founder

    Hi,

     

    Parts of your post got cut off - please post the rest :smileyhappy:

     

    And just in case that your question is why all the predictions are so close together: Try a higher value for the parameter C for the SVM.  Just a shot in the dark :smileywink:

     

    Cheers,

    Ingo

  • ksullivanksullivan Member Posts: 3 Contributor I

    Thanks for the rapid reply.

     

    The rest of the message should read:

     

    However, instead of getting an accuracy or precision reading for Performance, all I see is:

     

    root_mean_squared_error: 5.796 +/- 0.000

     

  • ksullivanksullivan Member Posts: 3 Contributor I

    Brilliant - thanks very much.

     

    It's the little things that trip you up - like the difference between regression and classification!

     

    What both of you said makes perfect sense. I was asking the right question of the data and getting the appropriate performance metric for the answer, but I just didn't recognise it.

     

    Thanks again.

Sign In or Register to comment.