| All,
I can't pull down GNU's documentation at <http://www.gnu.org/manual/manual.html>
. I know the site doesn't like bots, so I believe I've set everything up to be
a good archiver, including allowing only one link per second, limiting the
data rate, and so forth. Here's the output:
HTTrack3.43-2+htsswf+htsjava launched on Fri, 19 Dec 2008 06:35:34 at
<http://www.gnu.org/manual/manual.html> +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar
(winhttrack -qiC2t%P0ns2b0u1j0%u0N0%I0p7DdK0c1H0%k0f2A25000%c1%f#f -F
"Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)" -%F "<!-- Mirrored from
%s%s by HTTrack Website Copier/3.x [XR&CO'2008], %s -->" -%l "en, en, *"
<http://www.gnu.org/manual/manual.html> -O1 "C:\web-archive\Gnu documentation"
+*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
such as username/password authentication for websites mirrored in this
project
do not share these files/folders if you want these information to remain
private
06:35:34 Warning: Cache: error while moving previous cache: Permission
denied
06:35:34 Warning: Cache: error while moving previous cache: Permission
denied
06:35:36 Info: Note: due to www.gnu.org remote robots.txt rules, links
begining with these path will be forbidden: /private/ (see in the options to
disable this)
06:35:36 Warning: File not parsed, looks like binary:
www.gnu.org/manual/manual.html
06:35:36 Error: "Open error when decompressing" (-1) at link
www.gnu.org/manual/manual.html (from primary/primary)
06:35:36 Info: No data seems to have been transfered during this session! :
restoring previous one!
Anybody got ideas?
Thanks.
Chris | |