> Is it possible to use httrack to mimic the behavior of
> a browser fetching a page? That is, given a URL,
> fetch the top-level object (typically HTML), also
> fetch included objects such as images or applets, but
> not follow and fetch links? In general this would
> seem to require that the HTML-parsing portions of the
> tool recognize at least the difference between HREF=
> and SRC=.
Well, you can use filters (since a href links generally don't
point to images), such as this one:
-* +*.gif +*.jpg +*.png +*.css +*.js
(that is, exclude everything EXCEPT gif, jpg, png, css and js
files)
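
To make the effect of that filter list concrete, here is a rough Python sketch of how such a list can be evaluated. It is only an approximation for illustration, not httrack's actual matcher: it assumes shell-style wildcards (via Python's fnmatch) and that a later rule overrides an earlier one. The names FILTERS and is_allowed, and the example.com URLs, are made up for this example.

```python
from fnmatch import fnmatchcase

# The filter list from the reply above, one rule per entry.
FILTERS = ["-*", "+*.gif", "+*.jpg", "+*.png", "+*.css", "+*.js"]

def is_allowed(url, filters=FILTERS, default=True):
    """Return True if url passes the filter list.

    Assumption: rules are scanned first to last and the last
    matching rule decides, so "-*" rejects everything and the
    "+" rules that follow re-admit specific file types.
    """
    verdict = default
    for rule in filters:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(url, pattern):
            verdict = (sign == "+")
    return verdict

if __name__ == "__main__":
    for url in ("http://example.com/logo.png",
                "http://example.com/style.css",
                "http://example.com/other.html"):
        print(url, "->", "fetch" if is_allowed(url) else "skip")
```

Under these assumptions the order matters: putting -* last would reject everything again, which is why the exclusion comes first and the exceptions after it.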