Hello. httrack is a very useful tool, but I have run into a problem when
mirroring a web site.
httrack usually completes the mirror and there are no errors in the log. But
some files occasionally turn out to be corrupted. I think the downloads were
aborted partway and, for some reason, httrack did not treat that as an error.
I can fix those files by re-downloading them with httrack or another client
such as wget. However, I have to find them among thousands of files without
any clues, which is not an easy task. I have automated corruption detection
for archives, HTML files, and similar formats. Unfortunately, I cannot find a
reliable way to check pictures, audio, video, and many other formats,
including Flash.
The best solution I have found is a "mirroring twice" plan: mirror the site
two times and compare the resulting files. It works well except for the extra
server load.
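To make the comparison step concrete, here is a rough Python sketch of how the
two mirrors could be compared. It is not part of httrack; the function names
are made up for illustration, and it assumes both mirrors were saved to
separate local directories with the same layout.

```python
import hashlib
import os


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def list_files(root):
    """Return the set of file paths under root, relative to root."""
    found = set()
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            full = os.path.join(dirpath, name)
            found.add(os.path.relpath(full, root))
    return found


def compare_mirrors(mirror_a, mirror_b):
    """Compare two mirror directories (hypothetical helper, not an httrack API).

    Returns a dict with files whose contents differ and files that exist
    in only one of the two mirrors.
    """
    files_a = list_files(mirror_a)
    files_b = list_files(mirror_b)
    differ = []
    for rel in sorted(files_a & files_b):
        pa = os.path.join(mirror_a, rel)
        pb = os.path.join(mirror_b, rel)
        # A size mismatch is enough; hash only when the sizes agree.
        if os.path.getsize(pa) != os.path.getsize(pb):
            differ.append(rel)
        elif file_digest(pa) != file_digest(pb):
            differ.append(rel)
    return {
        "differ": differ,
        "only_a": sorted(files_a - files_b),
        "only_b": sorted(files_b - files_a),
    }
```

Calling compare_mirrors("mirror1", "mirror2") (both directory names are
placeholders) gives a list of suspects rather than a list of corrupted files:
anything that legitimately changes between runs, such as logs or dynamically
generated pages, is flagged too, and a file that was truncated in the same way
in both runs would go unnoticed.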
Would anyone be able to suggest any solution for this issue?
Thanks,
Akira.
httrack: 3.43.9C-0.pm.1.1
GUI frontend: gttrack 3.43.9C-0.pm.1.1
OS: openSUSE 11.2 x86_64