The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
[SOLVED] RM 6.1: "Optimize Parameters (Evolutionary)" -> "no data was delivered"
System Information
Application Version: RapidMiner Studio 6.1.000 Professional Plus Edition
Machine: MacBook Pro, Retina, 15-inch, Early 2013
Operating System: OS X 10.9.5 (13F34)
Problem
I made two identical processes, with the only difference that one uses the "Optimize Parameters (Grid)" operator, while the other uses the "Optimize Parameters (Evolutionary)" operator instead.
The first one runs correctly, but the second one fails in the "Write Model" operator with the error message "no data was delivered at port".
To be sure, I grepped the two .rmp files for <connect .../> and confirmed that the connections are identical between the two processes.
By the way, I've seen the hint to disable "Parallelize main process" in another forum post that was apparently about the same problem.
But I couldn't find that option anywhere in RM or the documentation. Maybe it's suppressed in my installation.
Here's a zip file containing the processes and data for reproducing the problem:
http://gert-rieger-edv.de/RM_Problem_01/RM_Problem_01_data.zip
And here's a detailed description with screenshots:
http://gert-rieger-edv.de/RM_Problem_01
Application Version: RapidMiner Studio 6.1.000 Professional Plus Edition
Machine: MacBook Pro, Retina, 15-inch, Early 2013
Operating System: OS X 10.9.5 (13F34)
Problem
I made two identical processes, with the only difference that one uses the "Optimize Parameters (Grid)" operator, while the other uses the "Optimize Parameters (Evolutionary)" operator instead.
The first one runs correctly, but the second one fails in the "Write Model" operator with the error message "no data was delivered at port".
To be sure, I grepped the two .rmp files for <connect .../> and confirmed that the connections are identical between the two processes.
By the way, I've seen the hint to disable "Parallelize main process" in another forum post that was apparently about the same problem.
But I couldn't find that option anywhere in RM or the documentation. Maybe it's suppressed in my installation.
Here's a zip file containing the processes and data for reproducing the problem:
http://gert-rieger-edv.de/RM_Problem_01/RM_Problem_01_data.zip
And here's a detailed description with screenshots:
http://gert-rieger-edv.de/RM_Problem_01
0
Answers
The model output in the Grid search case is almost certainly not the best because the grid search will simply stop at the last combination of parameters and the output will be the last model built, not the best. The same is very likely to be true in the Evolutionary case especially if early stopping is not used.
The way to solve this is to use the Set Parameters operator combined with the parameter output from the Optimize process. This allows the model to be run again using those parameters.
regards
Andrew
as you're having a Professional Plus Edition, you can ask our Enterprise Support
Please click here to open a ticket: https://support.rapidminer.com/
Regards,
Marco
that solves the mystery, thanks.