The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
which operator allows me to use all four logical processors for parallel computing?
jsramirezgo
Member Posts: 6 Learner I
in Help
I Got a Rapid Miner Enterprise Medium License (I am able to use four logical processors), however, I dont know how to use it. I mean, I dont know which operator in Rapid Miner allows me to deploy parallel computing by using more than one logical processor.
I really appreciate the help because this is part of an academical research and I need urgent help in order to continue my experiment.
Thanks!
I really appreciate the help because this is part of an academical research and I need urgent help in order to continue my experiment.
Thanks!
0
Answers
Scott
I got two questions regarding your reply:
1. how can I see whether the processors all kick in?
2. the operator "execute process" that runs multiple process can be named as a parallel computing operator?
Thanks!
1. to see your CPU usage really depends on your computer. On my Mac I look at Activity Monitor. I think there are more techie ways to do this but I will let others chime in who know more. Maybe @Telcontar120? @rfuentealba? @lionelderkrikor?
2. The "Execute Process" operator will simply execute a process somewhere else in RapidMiner. If that process is optimized for parallelization, it will run parallelized.
Hope that helps?
Scott
To answer to Scott's question, I'm using Windows as OS, but
I'm using a Rainmeter skin which is displayed on the desktop of my computer and which displays the "CPU usage" (with % Core 1 , % Core 2, % Core 3 etc.) and the "RAM usage" in %.
Regards,
Lionel
Lindon Ventures
Data Science Consulting from Certified RapidMiner Experts
🤦♂️ #occamsrazor thx @Telcontar120 @lionelderkrikor
thanks a lot for your replies. About checking the processors I realised how to do it, thank you!
regarding scott’s answer, I think I have a new question that would help me to resolve this finally:
How can I know If a process is optimized for parallelization?
thanks!
Thank you very much!
I think the idea of parallel computing is to reduce time, but both escenarios had the same execution time. what was wrong? Do I need an specific operator?
I really appreciate your help. Sorry for bein persistence. Just want to clarify my doubts with RM.
thanks a lot! Quite clear. Questions solved.
regards.
I am late to this reply, hence I'll add a few more things, not closely related with parallel execution of operators but it does with processes.
I have a MacBook Pro for RapidMiner Studio, and I really don't care about parallel execution on it. However, for my world domination projects, I use 4 MacBook Pro's with 12-core i9-9900, each one with an agent configured to run up to 11 parallel tasks. If you have such a setup, use Nagios. It uses the SNMP protocol to monitor the status of the machines, and due to the nature of that protocol, it doesn't affect much of the network throughput.
Now be sure that I'm not running one task in parallel in all these computers (is it feasible? I need it badly) but many different enqueued processes. More often than not when I need this kind of power, I divide my processes and use the Schedule Process operator to cascade, or an API with data through RapidMiner Server.
A real case for this: let's say I have 32000 pages from a website that you need to apply NLP. I do convert these to examples and perform a Loop Examples, pass the entire data on a POST to the API and finish the process. This creates 32000 requests to the RapidMiner Server, and the results are solved with 44 processes. In my last development project, 42 tasks served by all 4 computers could solve nearly 1200 pages per minute, taking only 30 minutes. I did that with my old good MacBook Air and it took 7 hours to complete the same task.
If anyone has a better suggestion for me, I'm all ears.
Too bad the MacBooks aren't mine
Just my two cents.
All the best,
Rodrigo.
That way you can keep working while your experiments run, pretty neat!