| I also would love to see a way to set aliases and
equivalencies for cases such as (example cases...I'm not
trying to download my own site):
kazemizadeh.net=kazemizadeh.com=www.kazemizadeh.net=www.kazemizadeh.com
Basically, the issue arises if a page on kazemizadeh.net has an
<A HREF> URL like this:
<http://www.kazemizadeh.com/tips/tip0012.txt>
instead of the equally valid working URL:
<http://kazemizadeh.net/tips/tip0012.txt>
Then when I'm trying to download the whole site I'll miss
that page. If I add kazemizadeh.com to the original list
of sites to download, I'll likely get a duplicate of
everything, wasting my hard drive space and everyone's
bandwidth. There is a complicating case where one URL is
an imperfect 'mirror' of another. In this case I'd suggest
that the aliased version be tried first and, if it gives a 404
(or other such error), that the unaliased URL be tried.
I think we need something like a mod_alias or mod_rewrite
for HTTrack, as there is in Apache. | |
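
To make the request concrete, here is a rough sketch in Python of the behaviour I have in mind. It is not an existing HTTrack feature or option. The ALIASES table and the canonicalize() and candidates() functions are names made up for illustration, and the host names are the example ones from above.

```python
from urllib.parse import urlsplit, urlunsplit

# Hypothetical alias table: each equivalent host maps to one canonical host.
ALIASES = {
    "kazemizadeh.com": "kazemizadeh.net",
    "www.kazemizadeh.net": "kazemizadeh.net",
    "www.kazemizadeh.com": "kazemizadeh.net",
}


def canonicalize(url, aliases=ALIASES):
    """Rewrite the host of url to its canonical alias, if one is configured."""
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()
    target = aliases.get(host)
    if target is None:
        return url  # no alias configured: leave the URL untouched
    # Keep an explicit port if the link had one (user:password@ is dropped).
    netloc = target if parts.port is None else f"{target}:{parts.port}"
    return urlunsplit(
        (parts.scheme, netloc, parts.path, parts.query, parts.fragment)
    )


def candidates(url, aliases=ALIASES):
    """URLs to try in order: the aliased form first, then the original."""
    first = canonicalize(url, aliases)
    return [first] if first == url else [first, url]


if __name__ == "__main__":
    for u in candidates("http://www.kazemizadeh.com/tips/tip0012.txt"):
        print(u)
```

A crawler built this way would key its "already downloaded" list on the canonicalize() result, so the same page reached through two host names is only stored once, and it would move on to the second entry from candidates() only when the first gives a 404 or similar error.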