The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Same process different results
What could be the reason for different users obtaining completely different results (e.g. classification matrix) even though they use the exact same process, same algorithms and the parameters are exactly the same?
Also, I have noticed, I ran an estimation model with linear regression, got rmse; then I added a new model to the same process (used the Multiply operator; didn't change any parameters), and once I ran the process with two models, the rmse of the linear regression has changed. What might be the reason for that?
Tagged:
0
Answers
Hi,
have a look at your GLM operator. it should have a boolean for reproducibility. If you run a learner in parallel you might have the case that some computation is faster or slower. This might change the overall outcome a bit. The same is true for our X-Validation. Just disable all parallelism.
Best,
Martin
Dortmund, Germany
You should also make sure that the local random seed is set (any arbitrary number will do) if you want complete reproducibility, either between users or even the same user running the process in different sessions.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
Thank you. How do I check that?
You will need to "show advanced parameters" in your parameters window using the link at the bottom. Then you will see an option to check a box to use a local random seed, and when you check that, you will be given a box to enter the seed number. It will look like this:
You will need to do this for any operator that is using any pseudo-random processes (like sampling). So at a minimum it will be in your cross-validation operator, and you might have other operators (like Sample) in your process that would need it as well.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts