The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here

"Crawler Proxy"

QuazionQuazion Member Posts: 1 Learner III
edited May 2019 in Help
We are trying to datamine some websites. The internal Crawler doesnt seem to have any proxy settings also the auto update doesnt function ofcourse. Now i was wondering if there is a Proxy setting hidden somewhere which i can set.

I tried HTTrack for Crawling, but it seems the latest version is broken, you cant use exlusions, they just dont work. (Confirmed on the HTTrack forums)

Now i fetched some data manualy to test the RapidMiner software, but maybe you guys know a solution or maybe another Craweler i could try?

Thanks in advance
Tagged:
Sign In or Register to comment.