HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Unable to download only levels below URL in domain
Author: WHRoeder
Date: 03/07/2013 16:12
 
1) Always post the ACTUAL command line used (or log file line two) so we know
what the site is, what ALL your settings are, etc.
2) Always post the URLs you're not getting and the URL that references
them.
3) Always post anything USEFUL from the log file.
4) If you want everything, use the near flag (get non-HTML files related to a
page), not filters.
5) I always run with:
A) No External Pages, so I know where the mirror ends.
B) The browser ID=msie 6 pulldown, as some sites don't like an HTTrack one.
C) Attempt to detect all links (for JS/CSS).
D) Timeout=60, retry=9, so temporary network interruptions don't delete files.
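
For command-line users, the settings in #4 and #5 roughly translate to the
sketch below. The URL and output directory are placeholders, the user-agent
string is just one plausible MSIE 6 string (not necessarily the exact text of
the pulldown), and the flag mapping is my reading of the httrack help output,
so check it against `httrack --help` for your version.

```shell
# Placeholder start URL and output directory; substitute your own.
URL="http://www.example.com/some/path/"
OUT="./mirror"
# Assumed MSIE 6 user-agent string, for illustration only.
UA="Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"

# -n    near flag: also get non-HTML files related to a page (#4)
# -x    replace external HTML links by error pages (No External Pages, 5A)
# -%P   extended parsing: attempt to detect all links (5C)
# -F    user-agent field (5B)
# -T60  timeout of 60 seconds, -R9 up to 9 retries (5D)
set -- "$URL" -O "$OUT" -n -x -%P -F "$UA" -T60 -R9

# Print the command for review; remove the 'echo' to actually run it.
echo httrack "$@"
```

Printing the command first also gives you the exact line to paste into a post,
as asked for in #1.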

> I need to download a URL in a domain and all the
> levels below it. Here is the URL
> <http://www.ecfr.gov/cgi-bin/text-idx?SID>...

> For some reason, httrack is unable to download below
Because the page contains <meta name="robots" content="nofollow" /> and the
site has a <http://www.ecfr.gov/robots.txt>. See #3.

> the first level -- when I try to view the result, I
> get an ActiveX error in IE and it also says that
The default is to go down only, so any CSS/JS that is not below that URL will
not be downloaded. See #4.
 