HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: .class files
Author: WHRoeder
Date: 09/04/2012 15:54
 
> -qi%e0C1%Pnxs2u1%s%uN0%I0p7DaK0c3T300R3H0%kf2A0%f#f
> -F "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT
> 5.0)" -%l "en, en, *" http://ocw.mit.edu -O1
> "E:\OC" -%! -#L10000000 +*.css +*.js
> -ad.doubleclick.net/* -mime:application/foobar
> +*.mov +*.mpg +*.mpeg +*.avi +*.asf +*.mp3 +*.mp2
> +*.rm +*.wav +*.vob +*.qt +*.vid +*.ac3 +*.wma
> +*.wmv +*.tar +*.tgz +*.gz +*.rar +*.z +*.exe +*.gif
> +*.jpg +*.png +*.tif +*.bmp +*.pdf +*.pdf +*.jnlp
> +*.jpeg +*.flv -*wikipedia.org* -*wikimedia.org*
> -*commons.wikimedia.org* -*openwetware.org*
> -*http://itunes.apple.com*
> -*http://deimos3.apple.com*
> -*http://www.archive.org* -*ZipForEndUsers*
> -*http://www.onsemi.com* -*muse.jhu.edu*
> -*www.darpa.mil* -*massgeneral.org*
If you want everything, just use the near flag (get related non-HTML files) and
drop all the +*.rar +*.z ... filters.
Since the default is to stay on site, and you haven't overridden that with
either an external depth limit or + filters allowing HTML, you don't need the
-*itunes* etc. filters.
The only valid reason for those filters would be if you want everything
except the files stored on the listed sites.
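
As a rough sketch of the simplified command line suggested above: this assumes the command-line httrack, where -n (--near) is the near flag and -O sets the output path. OUT_DIR is a placeholder for the E:\OC path in the quoted command, and the user-agent, limit and other options from the quote are left out for brevity. The snippet only prints the command so it can be reviewed first.

```shell
# Sketch only: prints the command for review instead of running it.
URL="http://ocw.mit.edu"
OUT_DIR="./OC"   # placeholder for the output path; adjust to your setup

build_cmd() {
    # -n (--near): also fetch non-HTML files linked from the mirrored pages.
    # The long list of +*.ext filters and -*host* filters is dropped.
    printf '%s\n' "httrack \"$URL\" -O \"$OUT_DIR\" -n"
}

build_cmd
```

To actually start the mirror, copy the printed line into the shell, adding back any options you still need.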

You might try turning off extended parsing (maybe it's trying to parse the
.class files).
Alternatively, try adding an assume rule: .class = text/plain
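
The two suggestions above might look like this on the command line. This is a sketch under assumptions: it uses the documented -%P0 form to switch extended parsing off (the quoted option string contains %P, which enables it) and -%A (--assume) to map the class extension to text/plain. OUT_DIR is a placeholder, and whether either option helps with the .class files is untested here. As before, the commands are only printed.

```shell
# Sketch only: prints two alternative commands for review.
URL="http://ocw.mit.edu"
OUT_DIR="./OC"   # placeholder for the output path; adjust to your setup

# Alternative 1: -%P0 disables extended parsing.
cmd_no_extended_parsing() {
    printf '%s\n' "httrack \"$URL\" -O \"$OUT_DIR\" -n -%P0"
}

# Alternative 2: -%A tells httrack to treat .class files as text/plain.
cmd_assume_class() {
    printf '%s\n' "httrack \"$URL\" -O \"$OUT_DIR\" -n -%A class=text/plain"
}

cmd_no_extended_parsing
cmd_assume_class
```

Try one alternative at a time, so it is clear which change (if any) makes the difference.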
 