| As for css (when <META NAME='robots'
CONTENT='noindex,nofollow'> is found) and javascript linked
files, I meant http:// links, thus the pages cannot be
properly browsed offline.
For javascript, php extensions are not modified even if the
html file has been downloaded and thus there is no linking
with the javascript routines and the local mirror.
> if robots.txt are enabled, "nofollow" in meta will force
httrack to skip ALL links in the page.
I understand that the links to other pages mustn't be
followed and you may be right in skipping all the links in
the page,
but if you have
<link rel="SHORTCUT ICON"
href=http://www.site.com/favicon.ico
<link rel="stylesheet"
href=http://www.site.com/css/style.css type="text/css">
<style type="text/css">
@import url(http://www.site.com/css/style2.css);
</style>
<script src=http://www.site.com/js/functions.js
type="text/javascript"></script>
(same problem with java applets, Flash etc...)
THe page (not to be followed) won't be correctly displayed
offline.
| |