The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
How to stop the Get Pages module stopping the process when it cannot read a URL
![davidellis](https://us.v-cdn.net/6030995/uploads/defaultavatar/nCCNNSPK1YM69.jpg)
![](https://s3.amazonaws.com/rapidminer.community/vanilla-rank-images/contributor-16x16.png )
in Help
I have process that reads an excel file, gets pages and then processes the results. I have a dataset of 98 records and it runs perfectly. If I add another 500 records I get random read URL errors.
I have checked all the URLs and they work perfectly and my internet connection is solid. I found a solution on the forum based on a handle exception module but it doesn't seem to make any difference and I am not sure how it works.
Any ideas how to fix the errors or if not how to skip those URLs
I have checked all the URLs and they work perfectly and my internet connection is solid. I found a solution on the forum based on a handle exception module but it doesn't seem to make any difference and I am not sure how it works.
Any ideas how to fix the errors or if not how to skip those URLs
0
Answers
Hi David,
a long time after your post I have come to the same problem. It can be remediated with looping and using Get Page inside Handle Exception:
Strangely enough the Loop Examples operator seems to be broken, therefore I emulated it with the normal Loop operator.
It would be nice if the Get Pages operator could ignore not found responses!
Regards,
Sebastian