> A question though (sorry if it's been asked before), I am
> getting a lot of extra files with names such as
> indexcaa7.html in the downloaded directories,
These files come from URLs with a query string, such as
<http://www.example.com/index.php?id=1234>. HTTrack always
handles collisions between URLs that differ only in their
query string values, and generates as many files as there
are different URLs.
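
To illustrate the idea (this is only a sketch, not HTTrack's actual naming code): one way to keep such URLs from overwriting each other is to append a short tag derived from the query string to the file name. The function name local_name and the choice of MD5 are assumptions made for the example.

```python
import hashlib
import posixpath
from urllib.parse import urlsplit


def local_name(url: str) -> str:
    """Hypothetical helper: build a local file name that differs per query string."""
    parts = urlsplit(url)
    # "index.php" -> "index"; fall back to "index" for a bare directory URL.
    stem, _ext = posixpath.splitext(posixpath.basename(parts.path) or "index")
    if not parts.query:
        return stem + ".html"
    # Four hex characters derived from the query string. MD5 is an arbitrary
    # choice for this sketch and is not claimed to be what HTTrack uses.
    tag = hashlib.md5(parts.query.encode("utf-8")).hexdigest()[:4]
    return f"{stem}{tag}.html"


if __name__ == "__main__":
    print(local_name("http://www.example.com/index.php?id=1234"))
    print(local_name("http://www.example.com/index.php?id=1235"))
```

Two URLs with different query strings then get two different local names, which is why a page reached through many query values shows up as many files.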
If multiple URLs generate the same content (such as
<http://www.example.com/index.php?foo=<random-number>>), you
will end up with multiple copies of these files (there is
no way to detect such cases before the files have actually
been downloaded).
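
Once the mirror is on disk, duplicates can at least be spotted after the fact. Below is a minimal sketch, assuming a byte-for-byte comparison is good enough; the function name find_duplicates and the directory name are made up for the example, and nothing here is part of HTTrack itself.

```python
import hashlib
from collections import defaultdict
from pathlib import Path


def find_duplicates(root: str) -> list[list[Path]]:
    """Group files under `root` whose contents are byte-for-byte identical."""
    groups: dict[str, list[Path]] = defaultdict(list)
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            groups[digest].append(path)
    # Keep only the groups that contain more than one file.
    return [paths for paths in groups.values() if len(paths) > 1]


if __name__ == "__main__":
    # "mirror" is a placeholder for the directory the site was downloaded to.
    for group in find_duplicates("mirror"):
        print(", ".join(str(p) for p in group))
```

Note that pages which embed the random value, a timestamp, or a session id in their body will not be byte-identical, so this kind of check would miss them.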