HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: How to keep HTTrack from copying external websites
Author: William Roeder
Date: 12/12/2011 23:44
 
> (winhttrack
> -qwC2%Ps0u1%s%uN0%I0p3DaK0H0%kf2#L5000000%f#f -F
> "Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)"
> -%F "<!-- Mirrored from %s%s by HTTrack Website
> Copier/3.x [XR&CO'2010], %s -->" -%l "en, en, *"
> <http://mail.ptg.org/pipermail/pianotech/> -O1 "C:\My
> Web Sites\PTG Pipermail PianoTech" +*.css +*.js
> -ad.doubleclick.net/* -mime:application/foobar
> +*.gif +*.jpg +*.png +*.tif +*.bmp +*.zip +*.tar
> +*.tgz +*.gz +*.rar +*.z +*.exe +*.mov +*.mpg
> +*.mpeg +*.avi +*.asf +*.mp3 +*.mp2 +*.rm +*.wav
> +*.vob +*.qt +*.vid +*.ac3 +*.wma +*.wmv )

1) Don't use filters like that. If you want everything, use the near flag
(-n, "get non-HTML files related") instead. Your extension filters will miss
things like getImage.php?ID=000
2) Some sites don't like an HTTrack browser ID. I only use msie6.
3) By default HTTrack stays on the starting site, and your filters, as posted,
contain no override, so your problem should not occur. I ran it for 8 hours
(500MB) with no problems.
4) With 20 years of archives, you're going to need a lot of disk space.
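
To illustrate point 1, here is a rough Python sketch of why extension-only filters skip script-generated images. It uses Python's fnmatch as a stand-in for wildcard matching, which is an assumption: HTTrack's own filter matcher is not identical. The example.com URLs and the function name are made up for the illustration.

```python
from fnmatch import fnmatch

# A subset of the file-type patterns from the quoted command line.
PATTERNS = ["*.css", "*.js", "*.gif", "*.jpg", "*.png"]


def matched_by_extension_filters(url, patterns=PATTERNS):
    """Return True if any wildcard pattern matches the whole URL.

    fnmatch is only an approximation of how a "+*.ext" filter behaves;
    it is not HTTrack's actual matching code.
    """
    return any(fnmatch(url.lower(), pattern) for pattern in patterns)


# Hypothetical URLs, for illustration only.
static_image = "http://example.com/images/photo.jpg"
scripted_image = "http://example.com/getImage.php?ID=000"

print(matched_by_extension_filters(static_image))
print(matched_by_extension_filters(scripted_image))
```

The second URL serves an image but does not end in an image extension, so none of the patterns catch it. That is the kind of file the near flag is meant to pick up.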
 