HTTrack Website Copier
Free software offline browser - FORUM
Subject: httrack wandering away/escaping from base URL
Author: Haudy Kazemi
Date: 04/24/2002 13:33
 
Hello,

I've noticed this wandering/escaping before, but 
unfortunately haven't been able to put a finger on the 
exact cause yet.  What happens is I feed WinHTTrack a 
base URL:
<http://home.att.net/~willowbrookemill/pricelessware.html>
and instruct it to grab all items on this site, and 
files near to it.

After about an hour, I noticed HTTrack grabbing a 
bunch of files (mostly html) from this domain:
www.the-internet-eye.com
While I wouldn't be surprised if a few files/pages 
were considered 'near' to the base URL I specified, 
hundreds of files were grabbed from this second domain, 
and thousands more had queued up before I noticed.

Has anyone else noticed HTTrack wandering like this?  
I am using v3.16-2 on Win2k Pro.  I looked at the log 
file, but couldn't find the referer information 
that would tell me how HTTrack jumped from the base URL 
to the html pages of the second domain.  (I have the 
detailed debug logging mode enabled.)
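
For anyone who wants to check their own mirror for the same 
symptom, here is a rough Python sketch (not part of HTTrack) 
that tallies the hostnames of all URLs found in a text log.  
The names count_hosts and off_site and the sample lines are 
made up for illustration, and the only assumption about the 
log is that full http:// or https:// URLs appear somewhere 
in its lines.

```python
import re
from collections import Counter
from urllib.parse import urlsplit

# Matches anything that looks like an absolute http(s) URL in free text.
URL_RE = re.compile(r"https?://[^\s\"'<>()]+")


def count_hosts(lines):
    """Tally the hostname of every URL found in the given lines of text."""
    hosts = Counter()
    for line in lines:
        for url in URL_RE.findall(line):
            try:
                host = urlsplit(url).hostname
            except ValueError:
                continue  # skip malformed URLs instead of aborting
            if host:
                hosts[host] += 1
    return hosts


def off_site(hosts, base_host):
    """Return only the hosts that differ from the base host."""
    return {h: n for h, n in hosts.items() if h != base_host}


if __name__ == "__main__":
    # Hypothetical sample lines; a real log would be read from a file.
    sample = [
        "fetched http://home.att.net/~example/a.html",
        "fetched http://www.the-internet-eye.com/b.html",
        "fetched http://www.the-internet-eye.com/c.html",
    ]
    for host, n in off_site(count_hosts(sample), "home.att.net").items():
        print(host, n)
```

Run against a real log (for example by passing an open file 
object to count_hosts), this would at least show how many 
requests went to each off-site host, though not which page 
linked to them.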
 