Radoop proxy connection settings
Hi there,
I am setting up a Radoop Proxy on a Cloudera edge node. What should I enter for the master address and port number in the Radoop connection settings? Do I need to configure anything on the server before I set up the Radoop connection in RapidMiner Studio?
Thanks,
Answers
Hi,
If you have access to Cloudera Manager, the simplest way to create the connection is to use New Connection -> Import from Cluster Manager. After setting the required (highlighted) fields, you can enable Radoop Proxy and create it via the Edit button. The second-best option is to use the (compressed) XML files exported from Cloudera Manager via New Connection -> Import Hadoop Configuration Files.
Basically, the connection should look the same as it would without Radoop Proxy; the Proxy then takes care of the networking part. Hostname resolution may still need to work on the client that runs Studio. This can be solved by adding the master nodes to the OS hosts file and setting dfs.client.use.datanode.hostname to false in the Advanced Hadoop Parameters list.
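To check whether hostname resolution works on the Studio client before and after editing the hosts file, a small script along these lines may help. This is only a rough sketch: the function name check_hosts and the host names in the usage comment are made up for illustration, so substitute the master node names from your own cluster.

```python
import socket


def check_hosts(hostnames, resolver=socket.gethostbyname):
    """Try to resolve each hostname; map it to its IP, or to None on failure.

    `resolver` defaults to the OS lookup (which honours the hosts file)
    and can be swapped out for testing.
    """
    results = {}
    for name in hostnames:
        try:
            results[name] = resolver(name)
        except OSError:  # socket.gaierror is a subclass of OSError
            results[name] = None
    return results


# Example usage (hypothetical host names, replace with your master nodes):
# for host, ip in check_hosts(["master1.example.com", "master2.example.com"]).items():
#     print(host, "->", ip if ip else "NOT RESOLVABLE")
```

Any host reported as not resolvable is a candidate for an entry in the OS hosts file on the machine that runs Studio.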
RapidMiner Server needs to be running (Radoop Proxy authenticates to it), but no specific setting is required. The Radoop extension and the connection are required on Server only if processes will be submitted to the Server.
Best,
Peter