Okay, I set the Browser ID in WinHTTrack to match the one I used on the wget
command line and got a little farther...
"Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"
HTTrack3.41-2+htsswf+htsjava launched on Thu, 25 Oct 2007 10:21:47 at
<http://www.photo.net> +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar
(winhttrack -qwr1C2%Ps2u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F "Mozilla/4.0
(compatible; MSIE 6.0; Windows NT 5.1)" -%F "<!-- Mirrored from %s%s by
HTTrack Website Copier/3.x [XR&CO'2007], %s -->" -%l "en, en, *"
<http://www.photo.net> -O1 "J:\Temp Folders\wbtTemp\PhotoDotNet" +*.png +*.gif
+*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
such as username/password authentication for websites mirrored in this
project
do not share these files/folders if you want these information to remain
private
10:21:47 Warning: Redirected link is identical because of 'URL Hack' option:
www.photo.net/robots.txt and photo.net/robots.txt
10:21:47 Warning: Warning moved treated for www.photo.net/robots.txt (real
one is photo.net/robots.txt)
10:21:50 Warning: Redirected link is identical because of 'URL Hack' option:
www.photo.net/ and photo.net/
10:21:50 Warning: File has moved from www.photo.net/ to <http://photo.net/>
10:21:50 Info: Note: due to photo.net remote robots.txt rules, links begining
with these path will be forbidden: /ct/, /cta/, /spidertrap.html,
/pvt/email-article, /bboard/image?bboard_upload_id, /pay.adp, /register,
/counter, /bboard/q-and-a-thread-alert, /comments/add, /photodb/slideshow,
/admin/ (see in the options to disable this)
10:21:50 Info: No data seems to have been transfered during this session! :
restoring previous one!
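For anyone trying to work out which links the robots.txt note in the log would block, here is a rough sketch using Python's standard `urllib.robotparser`. The disallowed prefixes are copied from the log; the `User-agent: *` line, the helper names `build_parser` and `is_allowed`, and the `/gallery/` test path are my own assumptions for illustration, since the log does not show the real robots.txt. HTTrack's own matching may differ in detail.

```python
from urllib.robotparser import RobotFileParser

# Disallowed prefixes copied from the HTTrack log above.
DISALLOWED = [
    "/ct/", "/cta/", "/spidertrap.html", "/pvt/email-article",
    "/bboard/image?bboard_upload_id", "/pay.adp", "/register",
    "/counter", "/bboard/q-and-a-thread-alert", "/comments/add",
    "/photodb/slideshow", "/admin/",
]

# Same Browser ID string as in the post.
USER_AGENT = "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"


def build_parser(disallowed):
    """Build a parser from a reconstructed robots.txt.

    Assumption: the rules apply to every agent ("User-agent: *");
    the real file on photo.net may be structured differently.
    """
    lines = ["User-agent: *"] + [f"Disallow: {path}" for path in disallowed]
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, url, user_agent=USER_AGENT):
    """Return True if the reconstructed rules permit fetching url."""
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    parser = build_parser(DISALLOWED)
    # "/gallery/" is a hypothetical path used only for illustration.
    for url in ("http://photo.net/",
                "http://photo.net/register",
                "http://photo.net/admin/users",
                "http://photo.net/gallery/"):
        status = "allowed" if is_allowed(parser, url) else "forbidden"
        print(url, status)
```

Note that the site root is not in the forbidden list, so these rules on their own would not seem to explain the "No data seems to have been transfered" line.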