| > In a recent mirror HTTrack got 'loose' and mirrored a
whole
> load of sites based around hit counters and links to
local
> weather / news sites. My basic mirror is good and I want
to
> run it regularly - my question is how can I delete those
> parts of the mirror that I don't want without mirroring
> them all over again next time?
>Sould I be putting a whole
>bunch of 'Exclude linkss' in the scan options to not
>mirror these sites oris there an easier way?
Well, this is the easiest way ; using something like:
-www.foo.com/*weather*
to exclude all links at www.foo.com which contains the
work "weather" in the URL
You can also wipe all default filters (+*.gif +*.jpg and so
on) which tend to get many banner and other useless ads
And you can even use "strict filters", that is, exclusing
everything, and then accept links per links (or family per
family) what you want to download:
-* +www.foo.com/*.html +www.foo.com/*.gif
+www.foo.com/bar/*.jpg
| |