| > -qi%e0C1%Pnxs2u1%s%uN0%I0p7DaK0c3T300R3H0%kf2A0%f#f
> -F "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT
> 5.0)" -%l "en, en, *" <http://ocw.mit.edu> -O1
> "E:\\OC" -%! -#L10000000 +*.css +*.js
> -ad.doubleclick.net/* -mime:application/foobar
> +*.mov +*.mpg +*.mpeg +*.avi +*.asf +*.mp3 +*.mp2
> +*.rm +*.wav +*.vob +*.qt +*.vid +*.ac3 +*.wma
> +*.wmv +*.tar +*.tgz +*.gz +*.rar +*.z +*.exe +*.gif
> +*.jpg +*.png +*.tif +*.bmp +*.pdf +*.pdf +*.jnlp
> +*.jpeg +*.flv -*wikipedia.org* -*wikimedia.org*
> -*commons.wikimedia.org* -*openwetware.org*
> -*http://itunes.apple.com*
> -*http://deimos3.apple.com*
> -*http://www.archive.org* -*ZipForEndUsers*
> -*http://www.onsemi.com* -*muse.jhu.edu*
> -*www.darpa.mil* -*massgeneral.org*
If you want everything just use the near flag (get non-html files related) and
drop all the +*.rar +*.z.. filters.
Since the default is to stay on site and you haven't override that with either
external depth limit or +filters allowing html, you don't need the -itunes*
etc.
The only valid reason for those filters above would be if you want everything
but files stored on the listed sites.
You might try turning off the extended parsing (maybe it's trying to parse the
.class files)
Alternatively, try adding an assume .class = text/plain
| |