Every time I try to mirror a site, I get the messages below and no pages at all.
These same messages appear to have been discussed on this board for version
3.33 back in 2003 and 2005, but no clear solution was provided. Is this a problem
again, and what does a novice need to do to fix it? The person who
recommended this program to me was able to mirror the same site with
the default settings. I've tried both the default settings and changing some of
the parameters as I understand them, but I get the same errors in either case.
Do I need to turn something off on my system, like ZoneAlarm or McAfee?
HTTrack3.41-rc1+htsswf+htsjava launched on Fri, 09 Mar 2007 14:30:17 at
<http://www.irs.gov> +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar -*/businesses/* -*/charities/* -*/govt/* -*/taxpros/*
-*/retirement/* -*/taxexemptbond/* -*/opportunities/* -*/taxstats/*
-*/advocate/* -*/accessibility/* -*/foia/* -*/privacy/* -*/espanol/*
-*/compliance/* -*/pub/* -*/app/officeLocator
-*/newsroom/article/0,,id=130650,00.html +*/individuals/*
(winhttrack -qiC2%Ps2u1%s%uN0%I0p3DaK0G1000000000c4H0%kf2A25000%c5%f0#f -F
"Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from
%s%s by HTTrack Website Copier/3.x [XR&CO'2007], %s -->" -%l "en, en, *"
<http://www.irs.gov> -O1 "C:\My Web Sites\IRSgov" +*.png +*.gif +*.jpg +*.css
+*.js -ad.doubleclick.net/* -mime:application/foobar -*/businesses/*
-*/charities/* -*/govt/* -*/taxpros/* -*/retirement/* -*/taxexemptbond/*
-*/opportunities/* -*/taxstats/* -*/advocate/* -*/accessibility/* -*/foia/*
-*/privacy/* -*/espanol/* -*/compliance/* -*/pub/* -*/app/officeLocator
-*/newsroom/article/0,,id=130650,00.html +*/individuals/* )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive information,
such as username/password authentication for websites mirrored in this project
do not share these files/folders if you want these information to remain private
14:30:17 Info: Note: due to www.irs.gov remote robots.txt rules, links
begining with these path will be forbidden:
/newsroom/article/0,,id=130650,00.html, /app/officeLocator (see in the options
to disable this)
14:30:17 Warning: File not parsed, looks like binary: www.irs.gov/
14:30:17 Error: "Open error when decompressing" (-1) at link www.irs.gov/
(from primary/primary)
14:30:17 Info: No data seems to have been transfered during this session! :
restoring previous one!