|   | I'm having an issue where the site I'm trying to download, for whatever reason,
doesn't accept paths ended with '.html', despite all of them being html
documents.
So, for instance, say one path is ' site.com/about ', if you try to fetch '
site.com/about.html ' it will give load an error page, not a bare 404
response, but proper error page.
So what httrack ends up doing is that it's downloading *hundreds* of these
error message pages because it tries to fetch all of them adding ".html" in
the end of the paths, while it should omit that when trying to fetch all html
pages on that site, although it shouldn't omit the extension of other file
types.
Any idea of how I could deal with that?  |   |