HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Can't download every pages of a thread
Author: WHRoeder
Date: 02/19/2013 14:18
 
1) Always post the ACTUAL command line used (or log file line two) so we know
what the site is, what ALL your settings are, etc.
2) Always post the URLs you're not getting and from what URL it is
referenced.
3) Always post anything USEFUL from the log file.
4) If you want everything use the near flag (get non-html files related) not
filters.
5) I always run with A) No External Pages so I know where the mirror ends.
With B) browser ID=msie6 as some sites don't like a HTT one. With C) Attempt
to detect all links (for JS/CSS.) With D) Timeout=60, retry=9 to avoid
temporary network interruptions from deleting files.

> (winhttrack
> -qiC2%Ps2u1%s%uN0%I0p3DaK0H0%kf2A999999%f#f -F
> "Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)"
> -%F "<!-- Mirrored from %s%s by HTTrack Website
> Copier/3.x [XR&CO'2010], %s -->" -%l "en, en, *"
> <http://stage48.net/forum/> -O1 D:\Sites\Stage48
> +*.png +*.gif +*.jpg +*.css +*.js
> -ad.doubleclick.net/* -mime:application/foobar
> +*.htm +*.html +http://stage48.net/forum/viewtopic*
> )
> 
You didn't use near flag so some images/formatting maybe off.

> Sorry, I wasn't on this computer when I posted so I
> couldn't post the log.
I said anything useful, not the entire log.

> I can access the first page and the 3-4 last pages.
> Still can't understand why it doestn't work on the
The page list is 1 2 3 .. n. So the first few pages and the last are
accessable at second level, 4 and n-2, n-3 are accessed on third level. Sounds
like you didn't get the WHOLE site.

I ran:
(winhttrack -qiC1%Pnxs2u1%s%uN5%I0p3DaK0c3T60R9H0%kf2o0%c2%f#f -F "Mozilla/4.0
(compatible; MSIE 6.0; Windows NT 5.0)" -%F "<!-- Mirrored from %s%s by
HTTrack Website Copier/3.x [XR&CO'2010], %s -->" -%l "en, en, *"
<http://stage48.net/forum/> -O1 C:\Users\Bill\x_HTTrack\test )
canceled it after 15 hours and 3GB and the last lines in the Log file reads:
Too many URLs, giving up..(>100000)
To avoid that: use #L option for more links (example: -#L1000000)
options -> limits -> Maximum number of links.

 
Reply Create subthread


All articles

Subject Author Date
Can't download every pages of a thread

02/18/2013 12:30
Re: Can't download every pages of a thread

02/18/2013 19:00
Re: Can't download every pages of a thread

02/18/2013 19:56
Re: Can't download every pages of a thread

02/19/2013 14:18
Re: Can't download every pages of a thread

02/19/2013 19:42
Re: Can't download every pages of a thread

02/19/2013 22:06




9

Created with FORUM 2.0.11