| Apart from this issue, I should add that my biggest problem with HTTrack is the
ever existed one which there is no good way to stop HTTrack from processing
html files from external addresses when in fact they were not included as web
pages but when the file is not found or something else HTTrack saves the page
and processes it. If I don't watch the operation I often get tens of thousands
of unwanted files from other websites. By default external html files should
not be processed unless explicitly configured. Probably the most common
example is wikipedia file links. Once HTTrack downloads that webpage thinking
it is an image it is game over | |