I am trying to use WinHTTrack. Great program. I have successfully got the URL
capture mode capturing. When I then try to download the links, I get the
following error:
* * MIRROR ERROR! * *
HTTrack has detected that the current mirror is empty. If it was an update,
the previous mirror has been restored.
...
The page in question can be found at:
<https://ccnet.stanford.edu/ee261>
I can use WinHTTrack on it flawlessly if I do not log in.
I have tried many things, including changing the URL to https instead of
http, pointing the URL to where I "think it should go", etc. I have also
filtered (I think) the logout links.
Of note: I am NOT changing the Windows proxy setting during capture, just the
Firefox proxy setting. (The one-click proxy on/off plugin is a lifesaver.)
Note that the login I am posting used an INCORRECT password, but I get
the same results with a known good password. I also do not see anything
anywhere that would suggest any relevant session info, except maybe the
gibberish string attached to the -W flag?
-----------------------------------------------------------
-----------------------------------------------------------
-----------------------------------------------------------
Here is the post file:
-----------------------------------------------------------
-----------------------------------------------------------
-----------------------------------------------------------
CONNECT / HTTP/1.1
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:17.0) Gecko/20100101 Firefox/17.0
Proxy-Connection: keep-alive
Host: ccnet.stanford.edu
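
As a sanity check on the capture above, here is a minimal Python sketch that reads the request line of a captured post file and reports which HTTP method it holds. The function name `captured_method` and the inline sample are illustrative only and are not part of HTTrack. CONNECT is the method a browser sends to a proxy to open an HTTPS tunnel, so a capture that starts with CONNECT would not contain the login form fields themselves.

```python
def captured_method(request_text: str) -> str:
    """Return the HTTP method from the first non-empty line of a captured request."""
    for line in request_text.splitlines():
        line = line.strip()
        if line:
            # The request line has the form: METHOD TARGET HTTP/VERSION
            return line.split()[0].upper()
    return ""


# Sample text copied from the capture file shown above.
sample = """CONNECT / HTTP/1.1
Proxy-Connection: keep-alive
Host: ccnet.stanford.edu
"""

method = captured_method(sample)
if method == "CONNECT":
    print("Capture holds a proxy tunnel request, not the login form POST")
else:
    print("Captured method:", method)
```

To run this against the real file, the text of hts-post0 could be read with `open(path).read()` and passed to the same function.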
-----------------------------------------------------------
-----------------------------------------------------------
-----------------------------------------------------------
Here is the log, in debug mode:
-----------------------------------------------------------
-----------------------------------------------------------
-----------------------------------------------------------
HTTrack3.46+htsswf+htsjava launched on Tue, 11 Dec 2012 15:32:06 at
<http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0> +*.png
+*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar
-/home -*logout*
(winhttrack -WC2%Pns0u1Z%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F "Mozilla/5.0
(Windows NT 6.1; WOW64; rv:17.0) Gecko/20100101 Firefox/17.0" -%F "<!--
Mirrored from %s%s by HTTrack Website Copier/3.x [XR&CO'2010], %s -->" -%l
"en, en, *"
<http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0> -O1
D:\wget\httrack\ee261 +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar -/home -*logout* -calendar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
such as username/password authentication for websites mirrored in this
project
do not share these files/folders if you want these information to remain
private
15:32:06 Info: engine: init
15:32:06 Debug: Cache: enabled=2, base=D:\wget\httrack\ee261\hts-cache\,
ro=0
15:32:06 Debug: Cache: rename D:\wget\httrack\ee261\hts-cache\new.zip ->
D:\wget\httrack\ee261\hts-cache\old.zip (00000000024F5424 00000000024F3424)
15:32:06 Debug: Cache: successfully renamed
15:32:06 Debug: Cache: size 60
15:32:06 Warning: Cache: damaged cache, trying to repair
15:32:06 Warning: Cache: 0 bytes successfully recovered in 0 entries
15:32:06 Warning: Cache: error trying to open the cache
15:32:06 Info: engine: start
15:32:06 Debug: Wait get: primary/primary
15:32:06 Info: engine: check-html: primary/primary
15:32:06 Info: engine: preprocess-html: primary/primary
15:32:06 Debug: scanning file primary/primary
(D:/wget/httrack/ee261/index.html)..
15:32:06 Debug: link detected in html (tag):
<http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0>
15:32:06 Debug: position link check
<http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0>
15:32:06 Debug: build relative link
<http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0> with
primary/primary
15:32:06 Debug: built relative link
<http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0> with
primary/primary ->
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0
15:32:06 Debug: wizard link test at
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0..
15:32:06 Debug: wizard test begins:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0
15:32:06 Debug: Compare addresses: ccnet.stanford.edu:443!=primary
15:32:06 Debug: result for wizard link test: 0
15:32:06 Info: engine: save-name: local name:
ccnet.stanford.edu:443/index.html -> ccnet.stanford.edu_443/index7b1f.html
15:32:06 Debug: Record:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 ->
D:/wget/httrack/ee261/ccnet.stanford.edu_443/index7b1f.html
15:32:06 Debug: relative link at ccnet.stanford.edu:443 build with
D:/wget/httrack/ee261/ccnet.stanford.edu_443/index7b1f.html and
D:/wget/httrack/ee261/index.html: ccnet.stanford.edu_443/index7b1f.html
15:32:06 Debug: OK, NOTE:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 ->
D:/wget/httrack/ee261/ccnet.stanford.edu_443/index7b1f.html
15:32:06 Debug: Wait get:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0
15:32:06 Warning: Retry after error -4 (No data (connection closed)) at link
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 (from
primary/primary)
15:32:06 Debug: Wait get:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0
15:32:06 Debug: (htsback): 1 slots ready moved to background
15:32:06 Warning: Retry after error -4 (No data (connection closed)) at link
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 (from
primary/primary)
15:32:06 Debug: Wait get:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0
15:32:06 Debug: (htsback): 1 slots ready moved to background
15:32:06 Error: "No data (connection closed)" (-4) after 2 retries at link
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 (from
primary/primary)
15:32:06 Info: No data seems to have been transfered during this session! :
restoring previous one!
15:32:06 Info: engine: end
15:32:06 Info: engine: free