The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

Learning Curves: Cost Functions

HyramHyram Member Posts: 39 Contributor II
Hi

I am trying to assess bias and variance for my classification problem. As such, I am trying to plot the cost functions J(Theta) for testing and training vs. number of examples to determine if I have high variance or high bias. How can I do this in RapidMiner? Alternatively, are there operators that let me visualise bias vs. variance?

Thanks,
Hyram

Best Answer

  • Telcontar120Telcontar120 RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 1,635 Unicorn
    Solution Accepted
    There isn't anything automatic to produce this.  You can do it somewhat manually by using the Sample operator and using different values/percentages of your total dataset and then outputting the model performance results of each of these different scenarios.   You could automate this further by using a macro for sample value and putting it into a Loop.  But there isn't any built-in operator that would easily output this all for you. 
    Brian T.
    Lindon Ventures 
    Data Science Consulting from Certified RapidMiner Experts

Answers

Sign In or Register to comment.