HTTrack Website Copier
Free software offline browser - FORUM
Subject: Not capturing sufficient session info?
Author: TheShagg
Date: 12/12/2012 00:33
 
I am trying to use WinHTTrack. Great program. I have successfully gotten the URL
capture mode working. When I then try to download the links, I get the
following error:

* * MIRROR ERROR! * *
HTTrack has detected that the current mirror is empty. If it was an update,
the previous mirror has been restored.
...

The page in question can be found at: 
<https://ccnet.stanford.edu/ee261>

I can use WinHTTrack on it flawlessly if I do not log in.

I have tried many things, including changing the URL to https instead
of http, pointing the URL to where I "think it should go", etc. I have also
filtered (I think) the logout links.

Of note: I am NOT changing the Windows proxy setting during capture, just the
Firefox proxy setting. (The one-click proxy on/off plugin is a life saver.)

Note that the login I am posting here used an INCORRECT password, but I get
the same results with a known good password. I also do not see anything
anywhere that looks like relevant session info, except maybe the
gibberish string attached to the -W flag?

-----------------------------------------------------------
-----------------------------------------------------------
-----------------------------------------------------------
Here is the post file:
-----------------------------------------------------------
-----------------------------------------------------------
-----------------------------------------------------------

CONNECT / HTTP/1.1
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:17.0) Gecko/20100101
Firefox/17.0
Proxy-Connection: keep-alive
Host: ccnet.stanford.edu
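
One thing visible in the capture above: the request line is CONNECT (the method a browser sends to a proxy to open an HTTPS tunnel), and there is no Cookie header and no form body, which may be why no session info shows up. Below is a minimal sketch for checking what a capture file actually holds. `summarize_capture` is a hypothetical helper, not part of HTTrack, and it assumes the post file is a plain-text HTTP request like the one shown.

```python
def summarize_capture(text):
    """Split a captured HTTP request into request line, headers, and body."""
    # Normalize line endings and trim surrounding blank lines.
    text = text.replace("\r\n", "\n").strip()
    head, _, body = text.partition("\n\n")
    lines = head.split("\n")
    method, target, version = lines[0].split(" ", 2)

    headers = {}
    last = None
    for line in lines[1:]:
        if ":" in line:
            name, _, value = line.partition(":")
            last = name.strip().lower()
            headers[last] = value.strip()
        elif last is not None:
            # No colon: treat as a wrapped continuation of the previous header.
            headers[last] += " " + line.strip()

    return {
        "method": method,
        "target": target,
        "version": version,
        "headers": headers,
        "has_body": bool(body.strip()),
        "has_cookie": "cookie" in headers,
    }


# The capture from this post, including the wrapped User-Agent line.
sample = (
    "CONNECT / HTTP/1.1\n"
    "User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:17.0) Gecko/20100101\n"
    "Firefox/17.0\n"
    "Proxy-Connection: keep-alive\n"
    "Host: ccnet.stanford.edu\n"
    "\n"
    "\n"
)

info = summarize_capture(sample)
print(info["method"], info["has_body"], info["has_cookie"])
```

A capture of a real login would be expected to show POST as the method and a non-empty body holding the form fields.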


-----------------------------------------------------------
-----------------------------------------------------------
-----------------------------------------------------------
Here is the log, in debug mode:
-----------------------------------------------------------
-----------------------------------------------------------
-----------------------------------------------------------


HTTrack3.46+htsswf+htsjava launched on Tue, 11 Dec 2012 15:32:06 at
http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0 +*.png
+*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar
-/home -*logout*

(winhttrack -WC2%Pns0u1Z%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F "Mozilla/5.0
(Windows NT 6.1; WOW64; rv:17.0) Gecko/20100101 Firefox/17.0" -%F "<!--
Mirrored from %s%s by HTTrack Website Copier/3.x [XR&CO'2010], %s -->" -%l
"en, en, *"
http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0 -O1
D:\wget\httrack\ee261 +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar -/home -*logout* -calendar )



Information, Warnings and Errors reported for this mirror:

note:	the hts-log.txt file, and hts-cache folder, may contain sensitive
information,

	such as username/password authentication for websites mirrored in this
project

	do not share these files/folders if you want these information to remain
private



15:32:06	Info: 	engine: init

15:32:06	Debug: 	Cache: enabled=2, base=D:\wget\httrack\ee261\hts-cache\,
ro=0

15:32:06	Debug: 	Cache: rename D:\wget\httrack\ee261\hts-cache\new.zip ->
D:\wget\httrack\ee261\hts-cache\old.zip (00000000024F5424 00000000024F3424)

15:32:06	Debug: 	Cache: successfully renamed

15:32:06	Debug: 	Cache: size 60

15:32:06	Warning: 	Cache: damaged cache, trying to repair

15:32:06	Warning: 	Cache: 0 bytes successfully recovered in 0 entries

15:32:06	Warning: 	Cache: error trying to open the cache

15:32:06	Info: 	engine: start

15:32:06	Debug: 	Wait get: primary/primary

15:32:06	Info: 	engine: check-html: primary/primary

15:32:06	Info: 	engine: preprocess-html: primary/primary

15:32:06	Debug: 	scanning file primary/primary
(D:/wget/httrack/ee261/index.html)..

15:32:06	Debug: 	link detected in html (tag):
http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0

15:32:06	Debug: 	position link check
http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0

15:32:06	Debug: 	build relative link
http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0 with
primary/primary

15:32:06	Debug: 	built relative link
http://ccnet.stanford.edu:443?>postfile:D:\wget\httrack\ee261\hts-post0 with
primary/primary ->
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0

15:32:06	Debug: 	wizard link test at
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0..

15:32:06	Debug: 	wizard test begins:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0

15:32:06	Debug: 	Compare addresses: ccnet.stanford.edu:443!=primary

15:32:06	Debug: 	result for wizard link test: 0

15:32:06	Info: 	engine: save-name: local name:
ccnet.stanford.edu:443/index.html -> ccnet.stanford.edu_443/index7b1f.html

15:32:06	Debug: 	Record:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 ->
D:/wget/httrack/ee261/ccnet.stanford.edu_443/index7b1f.html

15:32:06	Debug: 	relative link at ccnet.stanford.edu:443 build with
D:/wget/httrack/ee261/ccnet.stanford.edu_443/index7b1f.html and
D:/wget/httrack/ee261/index.html: ccnet.stanford.edu_443/index7b1f.html

15:32:06	Debug: 	OK, NOTE:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 ->
D:/wget/httrack/ee261/ccnet.stanford.edu_443/index7b1f.html

15:32:06	Debug: 	Wait get:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0

15:32:06	Warning: 	Retry after error -4 (No data (connection closed)) at link
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 (from
primary/primary)

15:32:06	Debug: 	Wait get:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0

15:32:06	Debug: 	(htsback): 1 slots ready moved to background

15:32:06	Warning: 	Retry after error -4 (No data (connection closed)) at link
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 (from
primary/primary)

15:32:06	Debug: 	Wait get:
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0

15:32:06	Debug: 	(htsback): 1 slots ready moved to background

15:32:06	Error: 	"No data (connection closed)" (-4) after 2 retries at link
ccnet.stanford.edu:443/?>postfile:D:\wget\httrack\ee261\hts-post0 (from
primary/primary)

15:32:06	Info: 	No data seems to have been transfered during this session! :
restoring previous one!

15:32:06	Info: 	engine: end

15:32:06	Info: 	engine: free
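
With a debug log this long, it can help to pull out just the warnings and errors. Below is a rough sketch that assumes each entry starts with a time stamp and a level name, as in the log above, and that lines without that prefix are wrapped continuations of the previous entry. `parse_log` is a hypothetical helper, not an HTTrack tool.

```python
import re
from collections import Counter

# Time stamp, whitespace, level name with a trailing colon, then the message.
ENTRY = re.compile(r"^(\d{2}:\d{2}:\d{2})\s+(Info|Debug|Warning|Error):\s*(.*)$")


def parse_log(text):
    """Return a list of {time, level, message} dicts from hts-log style text."""
    entries = []
    for line in text.splitlines():
        if not line.strip():
            continue  # skip the blank lines between entries
        match = ENTRY.match(line)
        if match:
            entries.append({
                "time": match.group(1),
                "level": match.group(2),
                "message": match.group(3),
            })
        elif entries:
            # Wrapped line: append it to the entry it belongs to.
            entries[-1]["message"] += " " + line.strip()
    return entries


# Three entries copied from the log in this post.
sample_log = (
    "15:32:06\tInfo: \tengine: start\n"
    "\n"
    "15:32:06\tWarning: \tRetry after error -4 (No data (connection closed)) at link\n"
    "ccnet.stanford.edu:443/?>postfile:D:\\wget\\httrack\\ee261\\hts-post0 (from\n"
    "primary/primary)\n"
    "\n"
    "15:32:06\tError: \t\"No data (connection closed)\" (-4) after 2 retries at link\n"
    "ccnet.stanford.edu:443/?>postfile:D:\\wget\\httrack\\ee261\\hts-post0 (from\n"
    "primary/primary)\n"
)

entries = parse_log(sample_log)
counts = Counter(entry["level"] for entry in entries)

for entry in entries:
    if entry["level"] in ("Warning", "Error"):
        print(entry["time"], entry["level"], entry["message"])
```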










 