HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Site with multiple host names or aliases
Author: Xavier Roche
Date: 02/17/2003 23:38
 
> I am trying to mirror a site that uses several different
> host names for itself on different pages. Is there any way
> to make HTTrack treat a list of aliases as one site, and
> stop it from making several copies of the same site?
Hmm, this is on the TODO list, but I have not yet found
the time to implement it.

> On a related subject: in trying to prevent the multiple
> copies above, but get some of the missing pages that are
> only referenced by an alias, I added '+hostx/path/a*.htm' to
> the scan rules and set external mirror depth=0.
> HTTrack copied the whole hostx site, including files not
> matching 'a*.htm'. Is this the intended behaviour or am I
> misusing the scan rules?
Remember that scan rules are applied after other
heuristics, such as same-level tests. If you want to control
the scope precisely, start with -*, as in:
-* +www.foo.com/* +*.gif

Note that the depth option takes priority over scan rules,
so you shouldn't set it when using such filters.
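
To see why starting with -* narrows the scope, here is a rough Python sketch of how a rule list like the one above could be evaluated. It is only a simplified model, not HTTrack's actual matcher: it assumes that a rule declared later overrides an earlier one (which is how the HTTrack filter documentation describes priority), it uses Python's fnmatch as a stand-in for HTTrack's own wildcard syntax, and is_allowed is a hypothetical helper name.

```python
from fnmatch import fnmatchcase


def is_allowed(url, rules, default=False):
    """Return True if url is accepted; the last matching rule decides.

    Simplified model only: each rule is '+pattern' or '-pattern', and
    '*' is matched with fnmatch rather than HTTrack's real engine.
    """
    verdict = default
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(url, pattern):
            # A later matching rule overrides any earlier verdict.
            verdict = (sign == "+")
    return verdict


# The rule list from the reply: refuse everything, then re-allow a scope.
rules = ["-*", "+www.foo.com/*", "+*.gif"]

for url in ("www.foo.com/index.html", "hostx/path/b.htm", "hostx/img/logo.gif"):
    print(url, is_allowed(url, rules))
```

In this model, the rules from the question with -* placed first, i.e. -* +hostx/path/a*.htm, would accept hostx/path/about.htm and refuse hostx/path/b.htm, because nothing on hostx is allowed unless a later + rule matches it.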
 