Re: .class files - HTTrack Website Copier Forum

Subject: Re: .class files

Author: WHRoeder

Date: 09/04/2012 15:54

> -qi%e0C1%Pnxs2u1%s%uN0%I0p7DaK0c3T300R3H0%kf2A0%f#f
> -F "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT
> 5.0)" -%l "en, en, *" <http://ocw.mit.edu> -O1
> "E:\\OC" -%! -#L10000000 +*.css +*.js
> -ad.doubleclick.net/* -mime:application/foobar
> +*.mov +*.mpg +*.mpeg +*.avi +*.asf +*.mp3 +*.mp2
> +*.rm +*.wav +*.vob +*.qt +*.vid +*.ac3 +*.wma
> +*.wmv +*.tar +*.tgz +*.gz +*.rar +*.z +*.exe +*.gif
> +*.jpg +*.png +*.tif +*.bmp +*.pdf +*.pdf +*.jnlp
> +*.jpeg +*.flv -*wikipedia.org* -*wikimedia.org*
> -*commons.wikimedia.org* -*openwetware.org*
> -*http://itunes.apple.com*
> -*http://deimos3.apple.com*
> -*http://www.archive.org* -*ZipForEndUsers*
> -*http://www.onsemi.com* -*muse.jhu.edu*
> -*www.darpa.mil* -*massgeneral.org*
If you want everything, just use the near flag (get related non-HTML files)
and drop all the +*.rar +*.z.. filters.
Since the default is to stay on site, and you haven't overridden that with
either an external depth limit or + filters allowing HTML, you don't need the
-itunes* etc. exclusions.
The only valid reason for the filters above would be if you want everything
except files stored on the listed sites.
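
A rough sketch of what that simplified command line might look like. It only prints the command instead of running it, so you can inspect it first. I'm assuming --near is the long form of the "n" option and that -O takes the mirror path as in your original command; check `httrack --help` for your version. The URL and output path are copied from the quoted command.

```shell
#!/bin/sh
# Dry-run sketch: build the simplified HTTrack command and print it.
# Assumption: --near is the long name of the "n" (near) option, which
# fetches non-HTML files linked from the mirrored pages.
URL='http://ocw.mit.edu'
OUT='E:\OC'

# No +*.ext list and no per-site exclusions: staying on site is the default,
# so external hosts are not crawled unless a + filter lets them in.
CMD="httrack $URL -O \"$OUT\" --near"

# printf instead of echo so the backslash in the path is printed untouched.
printf '%s\n' "$CMD"
```

If you do want to keep files from a few specific sites out, add only those exclusions back.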

You might try turning off extended parsing (maybe it's trying to parse the
.class files).
Alternatively, try adding an assume rule: .class = text/plain
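
Roughly, the two options might be passed like this. This is a sketch from memory of the command-line help, not something I've tested against your site: I'm assuming %P0 switches extended parsing off and that the assume option takes extension=mimetype pairs. The script only prints the two candidate commands.

```shell
#!/bin/sh
# Dry-run sketch: print two alternative HTTrack invocations to try.
URL='http://ocw.mit.edu'

# Option A (assumption: %P0 disables extended parsing, so HTTrack stops
# trying to pull links out of unknown content such as .class files).
OPT_A='-%P0'

# Option B (assumption: --assume maps an extension to a MIME type, here
# telling HTTrack to treat .class files as text/plain).
OPT_B='--assume class=text/plain'

CMD_A="httrack $URL $OPT_A"
CMD_B="httrack $URL $OPT_B"

printf '%s\n' "$CMD_A" "$CMD_B"
```

Try one at a time so you can tell which one actually changes the behaviour.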

