Google big query can't connect
Hi,
I'm trying out RapidMiner 9.5 and need to connect to Google BigQuery, but I'm having problems, possibly because I don't know how. I'm using the Google Cloud Services connection and have the in-database extension installed. I create the connection and test it, and everything works fine, but I don't see any BigQuery tables. I guessed they would appear below the connection in the repository. Or do I need to do something else with the connection?
Any ideas?
Best Answers
phellinger Employee-RapidMiner, Member Posts: 103 RM Engineering
Hi @Bautasten,
You will need an In Database Nest meta-operator with that Google Cloud Services connection, and then a Retrieve (In Database) inside it where you should be able to see and select your dataset and the tables in the dropdown parameters.
Let us know if you have any problem.
Peter
P.S.: Yes, unlike the JDBC database connections, this connection cannot display its tables in the Repository panel.
phellinger Employee-RapidMiner, Member Posts: 103 RM Engineering
@btibert,
The Access Scope(s) is a setting in the connection itself.
Now that you have installed the In-Database Processing extension, you can choose the “Google BigQuery Select ...” scope as well for this connection. A single connection can provide access to both Google Storage and BigQuery.
I hope this helps.
Peter
Answers
You need to use the connection. Please see the documentation here: https://docs.rapidminer.com/latest/studio/connect/database/jdbc/
Regards,
Marco
My apologies, I misread the question. I have pinged someone with experience with those connections.
Regards,
Marco
Just a follow-up question: how do I supply my own SQL query to get BigQuery data into RapidMiner?
Best,
Peter
There is an extension that can be installed from the Marketplace, called In-Database Processing. (Not to be confused with InDatabase Extension, a different product.)
If you install this extension, you should see many new operators after a Studio restart.
The In Database Nest meta-operator allows you to use a Google Cloud Services connection (either connect it to the input port or select it using the parameters). You can put a Retrieve (In Database) operator inside the nest. This operator's schema name and table name parameters should list your datasets and tables, respectively (after the validation thread completes, see the progress bar in the bottom right corner).
Is there a point in this setup that fails in your case? E.g. if your datasets or tables don't show up, does the process (after connecting the outputs) still run if you manually enter valid, existing dataset / table combinations?
Please let me know.
Best,
Peter
When I click refresh to get the datasets and tables, the progress indicator in the lower right blips briefly, as if it completed very quickly, but the drop-down options stay empty.
The error that I am getting is:
Process failed: Insufficient Access Scope(s). Operator expects 'Google BigQuery Select and Create Tables', but the actual is 'Google Storage Read and Write'.
But I have the operator Retrieve (In Database) within the subprocess of the Nest.
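The error above reads like a scope mismatch between the connection and the operator, not a problem with where the operator sits in the process. As a rough illustration only, here is a minimal Python sketch of that kind of check. This is not RapidMiner's actual code: `check_scope`, `try_retrieve`, and `InsufficientScopeError` are hypothetical names, and only the two scope strings and the message wording are taken from the error text above.

```python
# Hypothetical model of an access-scope check; not RapidMiner's implementation.
# The two scope names are copied from the error message in this thread.
STORAGE_RW = "Google Storage Read and Write"
BIGQUERY_SELECT_CREATE = "Google BigQuery Select and Create Tables"


class InsufficientScopeError(Exception):
    """Raised when a connection lacks the scope an operator expects."""


def check_scope(connection_scopes, expected_scope):
    """Raise if the connection was not configured with the expected scope."""
    if expected_scope not in connection_scopes:
        actual = ", ".join(f"'{s}'" for s in sorted(connection_scopes))
        raise InsufficientScopeError(
            f"Insufficient Access Scope(s). Operator expects '{expected_scope}', "
            f"but the actual is {actual}."
        )


def try_retrieve(connection_scopes):
    """Simulate an operator that needs the BigQuery scope (assumed behaviour)."""
    try:
        check_scope(connection_scopes, BIGQUERY_SELECT_CREATE)
        return "ok"
    except InsufficientScopeError as err:
        return str(err)


# A connection with only the Storage scope reproduces the reported message.
print(try_retrieve({STORAGE_RW}))
# A connection carrying both scopes passes the check.
print(try_retrieve({STORAGE_RW, BIGQUERY_SELECT_CREATE}))
```

In this model the fix is to add the BigQuery scope to the connection itself; nothing in the process layout changes.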
Will log a new ticket/case/query if there is something about the other machine. Thanks for the guidance.
I am new to RapidMiner, am facing a similar issue, and am looking for help connecting to BigQuery tables.
I created a new connection in RapidMiner following the steps in the 'Using Google Cloud Service connector' article, and the connection test was successful. However, I cannot find any datasets, tables, or views in RapidMiner. Any suggestions would be appreciated.
Update: This was resolved in a different thread. Thank you for your support.