RapidMiner 9.3 Python Notebooks
jacobcybulski
Member, University Professor Posts: 391 Unicorn
According to the release notes, RM 9.3 has better integration with Python, including integration with Jupyter notebooks (apparently there is a new operator for this) to "seamlessly execute notebooks". As far as I can tell, this still uses the old "Execute Python" operator, which still wants the rm_main function. Is there any documentation or are there examples of the new API features, e.g. which Python package needs to be installed to call RM functions?
Jacob
Best Answers
phellinger Employee-RapidMiner, Member Posts: 103 RM Engineering
Hi Jacob,
The new Python library is available on GitHub: https://github.com/rapidminer/python-rapidminer
Execute Python can now use an .ipynb file besides a .py file.
Yes, it still expects the rm_main function, which makes input and output data handling possible. We considered at least 6 alternative approaches, e.g. letting the user choose a name for each variable that contains the input data. All of them had their own drawbacks (e.g. the need to modify the script), so we kept the rm_main approach. It actually works pretty well in a notebook once you get used to it: having this extra function does not affect how you use the same file in Jupyter. Note that you can use cell tagging to ignore or include cells when running Execute Python. And with the rapidminer Python library, you can get data from the repository directly while you are developing your code in Jupyter or a Python IDE.
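As a rough illustration of the rm_main convention, here is a minimal sketch. It assumes the operator passes each connected input as a pandas DataFrame and turns the returned DataFrame into an output; the column names and the `row_mean` feature are made up for the example.

```python
import pandas as pd


def rm_main(data):
    """Entry point that Execute Python looks for.

    Assumption: `data` arrives as a pandas DataFrame, and the returned
    DataFrame is delivered to the operator's output port.
    """
    result = data.copy()  # leave the input untouched
    numeric = result.select_dtypes(include="number")
    # Hypothetical feature: the mean of all numeric columns per row
    result["row_mean"] = numeric.mean(axis=1)
    return result


if __name__ == "__main__":
    # Local try-out, e.g. in a Jupyter cell or a Python IDE
    sample = pd.DataFrame(
        {"a": [1.0, 2.0], "b": [3.0, 4.0], "label": ["x", "y"]}
    )
    print(rm_main(sample))
```

Since rm_main is an ordinary function, the same file can be run and debugged outside RapidMiner by calling it on a hand-built DataFrame, as the guarded block at the bottom does.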
We'll publish some guidance, best practices and also update docs.rapidminer.com.
Stay tuned!
Feel free to let us know how you would use these features.
Best,
Peter
phellinger Employee-RapidMiner, Member Posts: 103 RM Engineering
@jacobcybulski
Great!
Calling Studio from Python has this overhead because it always starts a session. That suits batch-like execution better than interactive use. Performing multiple operations in the same session is a feature we are considering.
Using the Server class with a Server repository directly from Python, on the other hand, is super fast. For now, we have focused on supporting collaboration there.
I am glad that you generally work in RM and call Python from there when needed. I would say you most probably only need the rapidminer Python package if you are developing more complicated code in a coding tool and then want to use that code in an RM process.
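For the development workflow described above, a small helper along these lines might be a starting point. It is only a sketch: it assumes the connector object (a `rapidminer.Studio` or `rapidminer.Server` instance) exposes a `read_resource(path)` method returning a pandas DataFrame, as the GitHub README describes; the helper name `load_and_describe` is hypothetical, so check the README before relying on any of this.

```python
import pandas as pd


def load_and_describe(connector, repo_path):
    """Read one repository entry and return summary statistics.

    Assumption: `connector` has a read_resource(path) method that
    returns a pandas DataFrame (per the python-rapidminer README).
    """
    df = connector.read_resource(repo_path)
    if not isinstance(df, pd.DataFrame):
        raise TypeError("expected a pandas DataFrame from read_resource")
    return df.describe()


# Possible usage (not executed here; the install path is a placeholder):
# import rapidminer
# connector = rapidminer.Studio("/path/to/rapidminer-studio")
# print(load_and_describe(connector, "//Samples/data/Iris"))
```

Keeping the connector as a parameter means the same helper can be pointed at Studio or at Server, and it can be exercised with a stand-in object when neither is available.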
Best,
Peter
Answers