HTTrack Website Copier
Free software offline browser - FORUM
Subject: follow robots.txt rules does not work
Author: Christian Coseru
Date: 03/09/2005 23:48
 
HTTrack does not appear to obey robots.txt exclusion rules.
We've tried both WinHTTrack and the command-line httrack
with the -s2 option (-sN with N=2, "always follow robots.txt"),
and in neither case does HTTrack obey the robots.txt rules.

I found one posting on the forum that mentions a hack which
lets HTTrack behave like a browser and ignore overly
restrictive robots exclusion rules.

Is there a way to make HTTrack obey robots exclusion
rules?

Versions used: 3.32.3 (Unix) and 3.32.2 (Windows)
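
To confirm independently of HTTrack which mirrored URLs the rules should
have excluded, something like the following could be used. It is only a
minimal sketch built on Python's standard urllib.robotparser module; the
robots.txt contents, the "HTTrack" user-agent token, the example.com URLs
and the helper name allowed_by_robots are all assumptions made for
illustration, not taken from the site in question.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; substitute the rules the real site serves.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /cgi-bin/
"""


def allowed_by_robots(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines and records that rules were read.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical URLs, e.g. taken from the list of files the mirror downloaded.
    for url in (
        "http://www.example.com/index.html",
        "http://www.example.com/private/report.html",
    ):
        verdict = "allowed" if allowed_by_robots(ROBOTS_TXT, "HTTrack", url) else "excluded"
        print(url, verdict)
```

Any URL reported as excluded that still shows up in the mirror would
be a concrete case where the rules were not followed.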
 
 