The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
Web crawling of https pages - not working by using
Hey everbody of the community :-)
I have just started to use RapidMiner and now I would like to crawl the www by using the web crawling process in RapidMiner 9.2
Unfortunately I do not get any results.
I have tried it by crawling the URL https:xxxx ( I am not allowed yet, to include links yet, got an error message even posting here in the community) the URL can be found in the attachment.
Did I do any input in a wrong way or are there missing input value's?
In some user communities I found out, that the web crawler in RapidMiner is not working for https URL's, is that correct?
Is there any work around available?
Thanks a lot for your kind support in advance. I am really eager to learn the usage of RapidMiner and I am curious to find results.
Tanja @Move_on2
Tagged:
0
Answers
There is a similar question asked recently in this community. Here are the threads for a workaround provided by @Telcontar120 . Please take a look at the below links.
https://community.rapidminer.com/discussion/54662/how-can-i-crawl-more-than-one-web-page
https://community.rapidminer.com/discussion/54656/crawl-web-operator-does-not-return-any-results
Varun
https://www.varunmandalapu.com/
Be Safe. Follow precautions and Maintain Social Distancing
Scott