> It's hard for me to pinpoint exactly what page had this
> problem in particular because it only has happened to
> me on crawls of large websites with hundreds or
> thousands of pages.
By the way, adding the filter
-*//*
may be a temporary workaround to avoid these problems.
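
To see which links a pattern like this would catch, here is a rough Python sketch. It is not HTTrack's own matcher: it uses Python's fnmatch as a stand-in for the wildcard, and it assumes the pattern is compared against the URL without its leading "http://" (otherwise every URL would match). The helper names and example URLs are made up for illustration.

```python
from fnmatch import fnmatchcase


def strip_scheme(url: str) -> str:
    """Drop a leading 'scheme://' so that its own '//' is not matched.

    Assumption: the filter is applied to the URL without its scheme.
    """
    return url.split("://", 1)[1] if "://" in url else url


def is_excluded(url: str, pattern: str = "*//*") -> bool:
    """Return True if the scheme-less URL contains a double slash."""
    return fnmatchcase(strip_scheme(url), pattern)


if __name__ == "__main__":
    # Hypothetical URLs, for illustration only.
    for url in (
        "http://www.example.com/docs/index.html",
        "http://www.example.com/docs//index.html",
    ):
        print(url, "->", "excluded" if is_excluded(url) else "kept")
```

Under these assumptions only links with a doubled slash somewhere after the host name are dropped; normal links are left alone.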
> Lastly, as an FYI I've found that httrack can't
> successfully handle the javascript used by archive.org
Yes: this is because all the links in the pages are wrong, and
the embedded JavaScript patches them on the fly after the
document loads. Try disabling JavaScript in IE or Netscape
and browsing the archive: it will be totally broken. This
is one of the 'impossible' sites for an offline browser,
unless it uses a JavaScript engine (yuk!)
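
A small Python sketch of the effect, for illustration only. The archive prefix, the original site address and the HTML snippet are all made up, and the rewriting rule (prefix plus absolute original URL) is an assumption about what such a script does, not a description of archive.org's actual code.

```python
import re
from urllib.parse import urljoin

# Hypothetical values, not taken from the real archive.
ARCHIVE_PREFIX = "http://archive.example/web/2003/"
ORIGINAL_BASE = "http://www.example.com/"

STATIC_HTML = (
    '<a href="/about.html">About</a> '
    '<a href="http://www.example.com/news/">News</a>'
)

HREF_RE = re.compile(r'href="([^"]*)"')


def static_links(html: str) -> list[str]:
    """Links as a crawler without JavaScript sees them in the raw HTML."""
    return HREF_RE.findall(html)


def patched_links(html: str) -> list[str]:
    """Links as they might look after an on-load script rewrites them."""
    return [ARCHIVE_PREFIX + urljoin(ORIGINAL_BASE, h) for h in static_links(html)]


if __name__ == "__main__":
    print("static :", static_links(STATIC_HTML))
    print("patched:", patched_links(STATIC_HTML))
```

A crawler that only reads the raw HTML follows the "static" links, which point away from the archived copy; only a client that runs the script ever sees the "patched" ones.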