HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: httrack 'escaping' from expected crawling area,XML
Author: Xavier Roche
Date: 12/31/2002 10:54
 
>completed successfully (AFAIK) except for one
>endless/infinite loop on an image (actually a 404 error)

Two reasons:
- the server stpidely replies with a 200 ("OK, page found") 
message instead of a 404 message, as requested by the RFC

- httrack still sees in the fake 404 page the code:

// preload images to be placed in tooltip
// place your images in this array
var imgAr = new Array(
	"images/sm-duck.gif"
);

.. and attempt to fetch again and again the file.

I will try to find a way to avoid this, but the problem is 
not trivial..
 
Reply Create subthread


All articles

Subject Author Date
httrack 'escaping' from expected crawling area,XML

12/29/2002 00:24
Re: httrack 'escaping' from expected crawling area,XML

12/29/2002 15:41
Re: httrack 'escaping' from expected crawling area,XML

12/29/2002 17:36
Re: httrack 'escaping' from expected crawling area,XML

12/29/2002 19:27
Re: httrack 'escaping' from expected crawling area,XML

12/31/2002 01:33
Re: httrack 'escaping' from expected crawling area,XML

12/31/2002 01:34
Re: httrack 'escaping' from expected crawling area,XML

12/31/2002 10:54
Re: httrack 'escaping' from expected crawling area,XML

12/31/2002 11:51
Re: httrack 'escaping' from expected crawling area,XML

01/05/2003 08:45




8

Created with FORUM 2.0.11