The Altair Community is migrating to a new platform to provide a better experience for you. In preparation for the migration, the Altair Community is on read-only mode from October 28 - November 6, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here
"Using Regex in the web crawler"
guitarslinger
Member Posts: 12 Contributor II
Hi there,
I am struggling with the setup of the crawlers in the web mining extension:
I can't figure out how to set the crawling rules so that the crawler produces any results.
Leaving the rules empty does not work either.
Can I find an example for crawling rules somewhere?
Thx in advance
GS
I am struggling with the setup of the crawlers in the web mining extension:
I can't figure out how to set the crawling rules so that the crawler produces any results.
Leaving the rules empty does not work either.
Can I find an example for crawling rules somewhere?
Thx in advance
GS
Tagged:
0
Answers
Here ist the XML, just having the crawler connected to the main process and having two rules:
1. follow every link ".*"
2. store every page ".*"
I thought this parameter is optional, leaving it blank will just not limit the number of pages, but actually without any value it does not crawl at all.
Works now, I am happy!
Regards GS
;D
it should be optional. ****. I will make sure, it's optional in future
Good thing you got it to work, though.
Greetings,
Sebastian