"Using Regex in the web crawler"
Contributor II
         
           Hi there,
           
I am struggling with the setup of the crawlers in the web mining extension:
           
I can't figure out how to set the crawling rules so that the crawler produces any results.
Leaving the rules empty does not work either.
           
Can I find an example for crawling rules somewhere?
           
Thx in advance
           
GS
          
          I am struggling with the setup of the crawlers in the web mining extension:
I can't figure out how to set the crawling rules so that the crawler produces any results.
Leaving the rules empty does not work either.
Can I find an example for crawling rules somewhere?
Thx in advance
GS
           Tagged:
          
          
           0
           
          
         

Answers
Here ist the XML, just having the crawler connected to the main process and having two rules:
1. follow every link ".*"
2. store every page ".*"
I thought this parameter is optional, leaving it blank will just not limit the number of pages, but actually without any value it does not crawl at all.
Works now, I am happy!
问候GS
;D
it should be optional. ****. I will make sure, it's optional in future
Good thing you got it to work, though.
Greetings,
Sebastian