> The URLs are presented via server-side .exe - see
> www.pcrecruiter.net/pcrbin/wisdombase.exe - and I
> think the fact that the URLs contain .exe rather
> than .htm is throwing the software off track.
No, it should not. You may even have links ending with ".gif" which are
actually HTML files, and httrack will rename the files as ".html" :)
> tried fiddling with the settings, but I can't get it
> to follow the URLs and download the rendered HTML
> pages.
The robots.txt rules may be blocking the download:
Note: due to www.pcrecruiter.net remote robots.txt rules, links beginning with
these path will be forbidden: /cgi-bin/, /images/, /mri/, /badv/, /adv/, /oadv/,
/gadv/, /iadv/, /img/, /presentation/, /sos/, /phone/, /clients/, /RCM/,
/overview/, /mailers/, /Templates/, s.htm (see in the options to disable this)
Check in the options to disable that, but with care (if you are the site
admin, I suppose this is okay :p)
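
If you want to check which links those rules would block, here is a rough Python sketch using the standard urllib.robotparser module. It assumes the prefixes in the note above are plain Disallow rules for all user agents, it leaves out the "s.htm" entry because that one looks cut off in the log, and the /clients/ URL is a made-up example. It only approximates the matching and is not how httrack itself evaluates robots.txt.

```python
from urllib.robotparser import RobotFileParser

# Prefixes copied from the httrack note; "s.htm" is omitted on purpose
# because it appears truncated in the log.
DISALLOWED = [
    "/cgi-bin/", "/images/", "/mri/", "/badv/", "/adv/", "/oadv/", "/gadv/",
    "/iadv/", "/img/", "/presentation/", "/sos/", "/phone/", "/clients/",
    "/RCM/", "/overview/", "/mailers/", "/Templates/",
]


def build_parser(prefixes):
    """Build a parser from a reconstructed robots.txt (assumed: User-agent *)."""
    lines = ["User-agent: *"] + [f"Disallow: {p}" for p in prefixes]
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, url, agent="*"):
    """Return True if the reconstructed rules permit fetching the URL."""
    return parser.can_fetch(agent, url)


parser = build_parser(DISALLOWED)
for url in (
    "http://www.pcrecruiter.net/pcrbin/wisdombase.exe",
    "http://www.pcrecruiter.net/clients/index.htm",  # hypothetical URL
):
    print(url, "allowed" if is_allowed(parser, url) else "blocked")
```

A link reported as blocked here is a likely candidate for one that gets skipped while robots.txt handling is left enabled in the options.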