The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Cost Analyses of inaccurate regression model

HaukeVHaukeV Member Posts: 5 Learner I
Hi,

I have built a model to estimate house prices, however I would like to classify the predictions as correct or incorrect based on a % difference with the correct price. Any idea how I can do that? I would then like to apply a cost and a benefit for every correct/incorrect prediction.

Kind regards,
Hauke
Tagged:

Answers

  • lionelderkrikorlionelderkrikor RapidMiner Certified Analyst, Member Posts: 1,195 Unicorn
    Hi @HaukeV,

    You can use the Generates Attributes operator to calculate the difference between the predicted price and the correct price : 
    for example diff = (pred(price) - true(price)) / true (price) and then apply a threshold to apply a cost or a benefit.

    Hope this helps,

    Regards,

    Lionel
  • HaukeVHaukeV Member Posts: 5 Learner I
    Hi Lionel, thanks for the answer. We now have 2 columns, one with true/false if the prediction is 15% or more lower, and a second column true/false if the prediction is 15%  or more higher. 

    Now trying when we try to run the cost analyses for classification, so that a true in either category is given the appropriate cost, but we keep running into issues. Tried manually redefining labels, predictions,.. Any help would be much appreciated.
  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    Hi,
    you may use Performance (Cost) as an operator here. Is this a commercial project? That way we can help you with presales ressources to make your point.

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • HaukeVHaukeV Member Posts: 5 Learner I
    Hi Martin,

    thanks for your information. This is not a commercial project, just practice for a paper. I changed my setup and now I have only one attribute which classifies as either 0, too low, or too high. I then tried to use the Performance (cost) operator on this to add a cost matrix. To make that work I first had to add a default model which just copies the too low, too high or 0 class to a predicted class. I set all values to 0 in the matrix, and then only if the predicted class = the actual class I add the required cost. However when I do the math manually (80 cases too low, 100 cases too high, multiplied by their respective costs)this does not at all match the math that the performance (costs) module does.
    Thanks for your help!!
Sign In or Register to comment.