| Hi,
I often notice that HTTrack downloads multiple copies of
the same file from the server. This is probably because the
same file can be accessed through different URI's.
I think this can be easily prevented by creating a hash
(MD5) code of all downloaded files. If HTTrack downloads a
file whose hash code matches a previous downloaded file it
can just point the link to that other file. This will
prevent multiple copies of the same file in the mirror
directory.
Am i right? | |