Avoid the execution of all the processes and nodes everytime
Hi, I'm new with Rapid Miner and I can't understand one thing. I built a process for Data Prep and now I'm working on another process for Classification. But everytime I want to run some nodes of the classification process also the initial Data Prep process need to run again from the beginning. Is it possible to, in some way, store the result of the previous running and execute only the classification process? Moreover, is it possible to do the same with nodes (for example the one reading thed dataset), and execute only the last nodes I added?
thank you!
Best Answer
-
sgenzer Administrator, Moderator, Employee-RapidMiner, RapidMiner Certified Analyst, Community Manager, Member, University Professor, PM Moderator Posts: 2,959 Community Manager
hello @f_laperna and welcome to the RapidMiner User Community. We are very happy you are here.
So yes, I would recommend using the "Store" operator to store your data prep example set so it does not need to run every time. Once you do this, you can use the "Retrieve" operator to grab that example set and keep using it for your classification.
If you need more help, please copy and paste your process (in XML) in this thread using the </> tool. It is often easier for us to help this way.
Good luck!
Scott
0
Answers
Thank you for your answer! I tried your solution but now when I run it I get an error "Input is missing". Following you can find the XML and a screenshot of the error.
I solved by creating new nodes (simply copy-pastying the old ones) and connecting everything to the new nodes. Now it works fine
You can also use breakpoints to run only part of a process and view the output up to that point---that can be helpful when building long processes. Right click on any operator and you will see the options to add breakpoints.
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts