HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: WinHTTrack 3.47 (403 error)
Author: Ivan Fletes
Date: 04/23/2013 00:36
 
> 1) Always post the ACTUAL command line used (or log
> file line two) so we know what the site is, what ALL
> your settings are, etc.
> 2) Always post the URLs you're not getting and from
> what URL it is referenced.
> 3) Always post anything USEFUL from the log file.
> 4) If you want everything use the near flag (get
> non-html files related) not filters.
> 5) I always run with A) No External Pages so I know
> where the mirror ends. With B) browser ID=msie 6
> pulldown as some sites don't like a HTT one. With C)
> Attempt to detect all links (for JS/CSS.) With D)
> Timeout=60, retry=9 to avoid temporary network
> interruptions from deleting files.
> 
> > 14:17:19 Error:  "Forbidden" (403) at link
> > www.chesscafe.com/images/spec041013.gif (from
> > www.chesscafe.com/)
> 
> I didn't have any problem seeing the images. Perhaps
> you have to open the main page first (some type of
> linking/bandwidth protection.)
> 
> In HTT it's usually caused by #5B

By "seeing" you mean downloading them? My HTTrack doesn't attempt to get them
until 4 hrs. or so into the download, if I remember correctly. My HTT
downloads huge loads of PDFs before it gets to the images. So are you sure
your HTT downloaded them? Please explain to me clearly what you mean by
"seeing" the images.

a) HTTrack3.47+htsswf+htsjava launched on Mon, 22 Apr 2013 11:12:07 at
<http://www.chesscafe.com> +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar

(winhttrack -qiC1%Ps2u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F "Mozilla/5.0
(Windows; U; Windows NT 5.0; en-US; rv:1.1) Gecko/20020826" -%F "<!-- Mirrored
from %s%s by HTTrack Website Copier/3.x [XR&CO'2013], %s -->" -%l "en, en, *"
<http://www.chesscafe.com> -O1 "C:\Users\tactictoe\Documents\My Web
Sites\ChessCafe" +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar )

b) I'm not getting the images.  Here are some examples:
Error: 	"Forbidden" (403) at link www.chesscafe.com/images/cafehead20.gif
(from www.chesscafe.com/)

12:55:15	Error: 	"Forbidden" (403) at link www.chesscafe.com/images/died2.gif
(from www.chesscafe.com/)

12:55:15	Error: 	"Forbidden" (403) at link www.chesscafe.com/images/born4.gif
(from www.chesscafe.com/)

12:55:15	Error: 	"Forbidden" (403) at link www.chesscafe.com/images/donate.gif
(from www.chesscafe.com/)

12:55:15	Error: 	"Forbidden" (403) at link
www.chesscafe.com/images/nicbanner107.jpg (from www.chesscafe.com/)

3) Useful lines from the log file:
Error: 	"mirror stopped by user" (-1) at link
www.chesscafe.com/zip/lane2006.zip (from
www.chesscafe.com/archives/archives.htm)

This "mirror stopped by user" error gets trigger after I stop the download
intending to continue it later.  I only click Cancel once! Only once!

4) I'm not trying to get everything.

5) A) I'm running this ChessCafe projet with "No External Pages".  B) I've
already tried with different browser IDs. The results are no different.  C)
"Attempt to detect all links" option is checked.

Please help me out with this.  I know we can figure it out.  It shouldn't be
that difficult.

Have any of you HTTrack Forum members tried to download a big website (5GB+)? 
If so, what problems did you face?  How did you solve those problems?  Any
tips and tricks you think I should try to download ChessCafe in its entiret?
 
Reply Create subthread


All articles

Subject Author Date
WinHTTrack 3.47 (403 error)

04/20/2013 02:16
Re: WinHTTrack 3.47 (403 error)

04/20/2013 10:12
Re: WinHTTrack 3.47 (403 error)

04/20/2013 16:37
Re: WinHTTrack 3.47 (403 error)

04/23/2013 00:36




1

Created with FORUM 2.0.11