web
Hello all,
I am new to data/web/text mining and to RapidMiner, and I have a question that touches on both. My problem is the following:
I have text in an Excel sheet with many rows and several columns. For one or two specific columns, I want to run a query on a website for each row and retrieve the results (links) for each query. I then want to follow the links and retrieve the text from those websites. Finally, I want to compare the text retrieved from each website with a text file. The comparison should ignore words like "is", "and", "for", etc. I need to know which words are the same or similar, e.g. house, domicile, indoor.
Is this possible with RapidMiner, or at all?
Thanks in advance
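For readers who want to see the comparison step in isolation, here is a minimal Python sketch outside RapidMiner. It only finds words that are literally the same in both texts after dropping stop words. The stop-word list, the function names, and the two sample sentences are all made up for illustration.

```python
import re

# Tiny illustrative stop-word list (an assumption); a real one would be much longer.
STOP_WORDS = {"is", "and", "for", "the", "a", "an", "of", "in", "to", "it"}

def content_words(text):
    """Lowercase the text, split it into words, and drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return {t for t in tokens if t not in STOP_WORDS}

def shared_words(text_a, text_b):
    """Return the non-stop words that occur in both texts."""
    return content_words(text_a) & content_words(text_b)

# Hypothetical sample texts standing in for a retrieved page and the text file.
page_text = "The house is large and suitable for a family."
reference_text = "A family is looking for a house in the city."
print(sorted(shared_words(page_text, reference_text)))
```

Matching similar words such as "house" and "domicile" is not covered here; that would need a synonym resource such as a thesaurus or WordNet on top of this.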
Answers
I don't understand how "Enrich Data By Webservice" works.
What am I supposed to enter in "query type" and in the two "Edit List" dialogs?
I cannot find any examples.
I attached a simple example below that shows how to query Wikipedia for well-known novels and retrieve the first paragraph as a description. Hope that helps.
Best,
Tobias
Thanks for your quick response, but I'm more confused than before.
How can I use this XML code in RapidMiner?
I want to do something like this, for example:
1. Go to http://www.yellowpages.com.au/
2. Enter something in the search form, e.g. Jonathan
3. Click on Search
4. Click on one of the results
5. Get e.g. the address and the phone number from that page
6. Put address and phone number in a file
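As a rough illustration of step 5 outside RapidMiner, the Python sketch below pulls two fields out of a result page using only the standard library. The HTML snippet and the class names "address" and "phone" are invented for this example and do not reflect the real markup of any site. The extractor is deliberately simple and assumes the wanted elements contain no nested tags.

```python
from html.parser import HTMLParser

class FieldExtractor(HTMLParser):
    """Collect the text of elements whose class attribute is in `wanted`."""

    def __init__(self, wanted):
        super().__init__()
        self.wanted = set(wanted)
        self.current = None  # class name of the element being read, if any
        self.fields = {}

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if css_class in self.wanted:
            self.current = css_class

    def handle_data(self, data):
        if self.current is not None:
            previous = self.fields.get(self.current, "")
            self.fields[self.current] = previous + data.strip()

    def handle_endtag(self, tag):
        # Simplification: any closing tag ends the current field.
        self.current = None

# Hypothetical result page; real pages will use different markup.
SAMPLE_PAGE = """
<div class="listing">
  <span class="address">1 Example Street, Sydney</span>
  <span class="phone">(02) 5550 0000</span>
</div>
"""

parser = FieldExtractor(["address", "phone"])
parser.feed(SAMPLE_PAGE)
print(parser.fields)
```

The resulting dictionary could then be written to a file, for instance one row per listing with the `csv` module.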
Can RapidMiner do this and how?
I suppose I have to use the "Enrich Data by Webservice" operator, but what kind of query is it in my case?
I don't want to enter the queries manually; I want to read them from a list.
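To make the idea of reading the queries from a list concrete, here is a small Python sketch that turns a list of search terms into one request URL per term. The base URL and the parameter name `q` are placeholders, not the real address or parameters of any search site, and the list of names stands in for a spreadsheet column.

```python
from urllib.parse import urlencode

BASE_URL = "https://example.com/search"  # hypothetical search endpoint

def build_query_urls(terms, param="q"):
    """Build one URL per search term, URL-encoding each term."""
    return [f"{BASE_URL}?{urlencode({param: term})}" for term in terms]

# Stand-in for the values of one spreadsheet column.
names = ["Jonathan", "Mary Smith"]
for url in build_query_urls(names):
    print(url)
```

Each generated URL can then be fetched and parsed in turn, which is roughly what a per-row web query does for every example in the input data.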
What's the meaning of the input in this operator?
Regards