| Hi Xavier, thanks for the reply. I take your point about
+*.htm* being dangerous, my mistake thanks for spotting
it!. The first page of the site I'm trying to selectively
archive seems to be www.moh.govt.nz/moh.nsf Its a java
driven nav page with target URLs like
<http://www.moh.govt.nz/moh.nsf/wpg_Index/Publications-Index>
Note no end .html
The page I archive - the first page only - contains 'live'
links that point out to the live website when clicked. The
PDF and MS Word .DOCs that I'm trying to archive do contain
file suffixes but I can't seem to 'reach' them with HTTRack.
In the mime list I have set nsf to "text/html.
This is the most unusual site I've encoutered to date...
any thoughts much appreciated, regards, dnt | |