HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: SharePoint Difficulties
Author: WHRoeder
Date: 02/19/2013 15:21
 
1) Always post the ACTUAL command line used (or log file line two) so we know
what the site is, what ALL your settings are, etc.
2) Always post the URLs you're not getting and the URL each one is
referenced from.
3) Always post anything USEFUL from the log file.
4) If you want everything use the near flag (get non-html files related) not
filters.
5) I always run with: A) No External Pages, so I know where the mirror ends;
B) browser ID=msie6, as some sites don't like an HTT one; C) Attempt to detect
all links (for JS/CSS); D) Timeout=60, retry=9, so that temporary network
interruptions don't delete files.
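
As a rough sketch, the settings in items 4 and 5 could be put on a command line
like the one built below. The mapping from the WinHTTrack options to flags
(-x, -F, -%P, -T, -R, -n) is my reading of the httrack command-line help, so
check it against `httrack --help` for your version. The helper name
build_httrack_args and the exact MSIE 6 user-agent string are assumptions made
for illustration only.

```python
import shlex

# Assumed stand-in for the "msie6" browser ID preset; the exact string may differ.
USER_AGENT = "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)"

def build_httrack_args(start_url, output_dir):
    """Build an httrack argument list (hypothetical helper, not part of HTTrack)."""
    return [
        "httrack", start_url,
        "-O", output_dir,    # where the mirror is written
        "-n",                # near: also get non-html files related to a page
        "-x",                # no external pages (replace external links)
        "-F", USER_AGENT,    # browser ID sent to the site
        "-%P",               # attempt to detect all links (extended parsing)
        "-T60",              # timeout of 60 seconds
        "-R9",               # retry up to 9 times
    ]

if __name__ == "__main__":
    args = build_httrack_args(
        "http://www.gn-npjointarchive.org/GNRHSNewby/Forms/AllItems.aspx",
        r"C:\My Web Sites\Newby",
    )
    # Print a quoted command line you can paste when asking for help (item 1).
    print(" ".join(shlex.quote(a) for a in args))
```

Printing the full command this way also gives you the ACTUAL command line to
post, as asked for in item 1.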

> (winhttrack
> -qw%e0C2%Pns2u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F
> "Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)"
> -%F "<!-- Mirrored from %s%s by HTTrack Website
> Copier/3.x [XR&CO'2010], %s -->" -%l "en, en, *"
> <http://www.gn-npjointarchive.org/GNRHSNewby/> -O1
> "C:\My Web Sites\Newby" +*.png +*.gif +*.jpg +*.css
> +*.js -ad.doubleclick.net/* -mime:application/foobar
> +*/http://www.gn-npjointarchive.org/GNRHSNewby/*.jpg

That URL forwards to
<http://www.gn-npjointarchive.org/GNRHSNewby/Forms/AllItems.aspx>. Start
there, and drop the +*.png, +*.gif, etc. filters.

The first-level links are not images but pages:
<http://www.gn-npjointarchive.org/_layouts/listform.aspx?PageType=...>
and /_layouts is not below /GNRHSNewby.
Drop the filter and allow travel in both directions (up and down), or simply
allow everything on the site:
+www.gn-npjointarchive.org/*
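
To see why a mirror rooted at /GNRHSNewby/ skips those pages, here is a small
Python sketch of a "stay at or below the start directory" check. It is a
simplification for illustration, not HTTrack's actual scope logic. The
function name is_below is made up, and the elided query string is left off
the /_layouts URL.

```python
from urllib.parse import urlsplit

def is_below(start_url, link_url):
    """True if link_url is on the same host and at or below start_url's directory."""
    start, link = urlsplit(start_url), urlsplit(link_url)
    # Directory of the start URL: everything up to and including the last "/".
    start_dir = start.path.rsplit("/", 1)[0] + "/"
    return start.netloc == link.netloc and link.path.startswith(start_dir)

START = "http://www.gn-npjointarchive.org/GNRHSNewby/"
FORWARDED = "http://www.gn-npjointarchive.org/GNRHSNewby/Forms/AllItems.aspx"
LAYOUTS = "http://www.gn-npjointarchive.org/_layouts/listform.aspx"

if __name__ == "__main__":
    print(is_below(START, FORWARDED))  # the page the start URL forwards to
    print(is_below(START, LAYOUTS))    # the first-level list pages
```

The forwarded page is inside the start directory, but the /_layouts pages sit
beside it at the site root, so a downward-only mirror never follows them.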

The next page relies on JavaScript that HTT can't handle (even if you enable
extended parsing).
 