HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Undoing part of a mirror
Author: Xavier Roche
Date: 06/24/2002 20:13
 
> In a recent mirror HTTrack got 'loose' and mirrored a whole
> load of sites based around hit counters and links to local
> weather / news sites. My basic mirror is good and I want to
> run it regularly - my question is how can I delete those
> parts of the mirror that I don't want without mirroring
> them all over again next time?
> Should I be putting a whole bunch of 'Exclude links' in the
> scan options to not mirror these sites, or is there an
> easier way?

Well, this is the easiest way; use something like:
-www.foo.com/*weather*
to exclude all links at www.foo.com that contain the
word "weather" in the URL.
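
If you want to preview what such an exclusion pattern would catch before running the mirror again, a rough check can be sketched in Python. This is only an approximation: it uses Python's fnmatch wildcards, which I am assuming behave closely enough to HTTrack's "*" for simple patterns like this one, and the URL list is made up for illustration.

```python
from fnmatch import fnmatchcase

# Hypothetical URLs, written without the scheme, as in the filter above.
urls = [
    "www.foo.com/index.html",
    "www.foo.com/local/weather/today.html",
    "www.foo.com/weather.gif",
    "www.bar.com/weather/index.html",
]

# The filter pattern without its leading "-" sign.
pattern = "www.foo.com/*weather*"

# fnmatch's "*" matches any run of characters, including "/".
excluded = [u for u in urls if fnmatchcase(u, pattern)]

for u in excluded:
    print("would be excluded:", u)
```

Note that the www.bar.com link is not touched by this filter; it would need its own exclusion rule.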

You can also wipe all default filters (+*.gif +*.jpg and so
on), which tend to pull in many banners and other useless ads.

And you can even use "strict filters", that is, excluding
everything and then accepting, link by link (or family by
family), only what you want to download:
-* +www.foo.com/*.html +www.foo.com/*.gif +www.foo.com/bar/*.jpg
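
To see how a strict rule list like this one plays out, here is a small Python sketch that evaluates the rules in order. It assumes, as I understand the HTTrack filter documentation, that a later matching rule overrides an earlier one. The function is_accepted, its default argument, and the test URLs are hypothetical, and fnmatch is only a stand-in for HTTrack's own wildcard matching.

```python
from fnmatch import fnmatchcase

def is_accepted(url, rules, default=True):
    """Apply '+'/'-' rules in order; the last rule that matches decides.

    'default' is an assumed verdict for URLs that no rule matches.
    """
    verdict = default
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(url, pattern):
            verdict = (sign == "+")
    return verdict

# The strict filter set: exclude everything, then accept selected families.
rules = [
    "-*",
    "+www.foo.com/*.html",
    "+www.foo.com/*.gif",
    "+www.foo.com/bar/*.jpg",
]

# Hypothetical URLs, without the scheme.
for url in [
    "www.foo.com/index.html",
    "www.foo.com/bar/pic.jpg",
    "www.foo.com/pic.jpg",
    "www.counter.example/hit.gif",
]:
    print(url, "->", "accept" if is_accepted(url, rules) else "reject")
```

With "-*" first, anything not explicitly accepted afterwards stays rejected, which is what keeps hit counters and external weather or news sites out of the mirror.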
 