HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: external pages being downloaded
Author: John Wilson
Date: 02/01/2006 05:07
 
Thanks, I followed your suggestion.

I did another scan of a different site, which went fine.

One difference between the two scans is that for the problematic one, I had
instructed the crawl to ignore robots.txt, because myfriend.com blocks
certain content from robots (they do not want it on Google).
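
To make concrete what ignoring robots.txt changes, here is a minimal sketch of
the check a crawler normally applies to each URL. It uses Python's standard
urllib.robotparser and is not HTTrack's own code. The rules, the /private/
path, and the allowed() helper are made up for illustration, since the real
robots.txt on myfriend.com is not shown in this thread.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical rules; the actual robots.txt on myfriend.com is unknown.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def allowed(url: str, obey_robots: bool = True, agent: str = "*") -> bool:
    """Return True if a crawler would fetch `url` under the rules above."""
    if not obey_robots:
        # Ignoring robots.txt: the exclusion rules are never consulted.
        return True
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)
```

With obey_robots left on, URLs under /private/ are rejected. With it off, the
only thing still limiting the crawl is whatever filters are set.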

I am curious why the filter code you suggested might be necessary for the
proper operation of HTTrack.
 





