The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Absolute error?

masquerade23masquerade23 Member Posts: 2 Learner I
I really need some help regarding the absolute error. After I have done the Performance(Regression) I get these values back as absolute error 1.247 +/- 0.369. Does this means the error is between +1.247 and -0.369? I really have no idea how to interpret these value.

Answers

  • varunm1varunm1 Member Posts: 1,207 Unicorn
    edited June 2019
    Hello @Patrick_Haas

    The error is between (1.247+0.369) or (1.247-0.369). 

    @mschmitz any suggestion on this. This calculation with +/- is based on Confidence interval in statistics right?

    Regards,
    Varun
    Regards,
    Varun
    https://www.varunmandalapu.com/

    Be Safe. Follow precautions and Maintain Social Distancing

  • masquerade23masquerade23 Member Posts: 2 Learner I
    Hello @varunm1

    Thank you so much for your help!

    Regards,
    Patrick
  • MartinLiebigMartinLiebig Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, RapidMiner Certified Expert, University Professor Posts: 3,533 RM Data Scientist
    @masquerade23,
    to add to this, this gives you the amount of variance in the different folds of a validation. So in the different folds you havesometimes values like 1.2 but sometimes also an error of 0.9 or 3.0

    Best,
    Martin
    - Sr. Director Data Solutions, Altair RapidMiner -
    Dortmund, Germany
  • Telcontar120Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    The plus/minus component is just the standard deviation around the reported average of the performance across the different cross validation folds.  So it is related to confidence intervals but is not directly computed or dependent on them.
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts
  • jacobcybulskijacobcybulski Member, University Professor Posts: 391 Unicorn
    Remember that in cross-validation, each fold creates a different sample with its own regression line, and each regression model produces a slightly different set of estimates and errors. The MAE is calculated for each and their mean is what RM returns, indicating a range of those errors. Why is it significant? This is because a regression models for the entire population (and the data yet to be collected) falls in between the regression models you produce in cross-validation (which define a confidence interval). In this way, RapidMiner gives you an indication of the estimated MAE for the population and the future data.
Sign In or Register to comment.