> Is there a way to extract ZIP linked in downloaded HTML
> files? The site just keeps popping out HTML files for me to
> D/L (I mean a few hundred thousand), but I already have all
> the HTML files I need. I only want the zip files, so if I
> can somehow crawl through the downloaded pages to get the
> linked ZIP files, then mission accomplished.
Hmm, if something "pops out", I suppose there is some
nasty JavaScript behind it. If you want to catch zip files
(and only zip files), remember that you also have to fetch
the html pages (to get the links!). Generally, you can use
Options/Filters, such as:
-* +www.yoursite.com/*.html +www.yoursite.com/*.php +*.zip
If it does not work, you'll have to extract the links manually
(if the JavaScript prevents the engine from going on).
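
For the manual route, here is a rough sketch in Python of what "extracting links manually" could look like. It only reads plain `<a href="...">` links from the pages you already have on disk, so it will not see links that the JavaScript builds at run time. The function names, the directory argument, and the base URL `http://www.yoursite.com/` are placeholders of mine, not anything the mirroring engine provides, and if the saved pages had their links rewritten to local paths the base URL would need adjusting.

```python
from html.parser import HTMLParser
from pathlib import Path
from urllib.parse import urljoin, urlparse


class ZipLinkParser(HTMLParser):
    """Collects <a href> targets whose path ends in .zip."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url  # assumed URL the page was downloaded from
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links against the assumed base URL.
                absolute = urljoin(self.base_url, value)
                # Look only at the path, so query strings do not matter.
                if urlparse(absolute).path.lower().endswith(".zip"):
                    self.links.append(absolute)


def extract_zip_links(html_text, base_url):
    """Return the .zip links found in one HTML document, in page order."""
    parser = ZipLinkParser(base_url)
    parser.feed(html_text)
    return parser.links


def collect_from_directory(directory, base_url):
    """Scan every *.html file under `directory` and return unique .zip links."""
    found = set()
    for page in Path(directory).rglob("*.html"):
        text = page.read_text(encoding="utf-8", errors="replace")
        found.update(extract_zip_links(text, base_url))
    return sorted(found)
```

Calling `collect_from_directory("path/to/saved/pages", "http://www.yoursite.com/")` gives a sorted list of URLs that you could write to a text file, one per line, and hand to whatever download tool you prefer.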