HTTrack Website Copier
Free software offline browser - FORUM
Subject: httrack adding '.html' to paths it shouldn't
Author: Alex
Date: 10/14/2025 06:24
 
I'm having an issue where the site I'm trying to download, for whatever reason,
doesn't accept paths ended with '.html', despite all of them being html
documents.

So, for instance, say one path is ' site.com/about ', if you try to fetch '
site.com/about.html ' it will give load an error page, not a bare 404
response, but proper error page.

So what httrack ends up doing is that it's downloading *hundreds* of these
error message pages because it tries to fetch all of them adding ".html" in
the end of the paths, while it should omit that when trying to fetch all html
pages on that site, although it shouldn't omit the extension of other file
types.

Any idea of how I could deal with that?
 
Reply


All articles

Subject Author Date
httrack adding '.html' to paths it shouldn't

10/14/2025 06:24




c

Created with FORUM 2.0.11