> (...)
> Remember that the engine can NOT 'guess' where the pdf
> files are, and therefore you MUST also catch html
> pages.
So I *must* download the html files before they can
be processed, i.e. scanned for links? I somehow
thought WinHTTrack did this "on the fly" ...
>
> Also use something like this, among other filters:
>
> (your filters..) +www.foo.com/*.html +www.foo.com/*/*[]
>
> This will accept all html and / (top index) files
> (...)
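To make the quoted filter line a bit more concrete, here is a rough Python sketch of how such accept/reject patterns could be evaluated against a URL. It is only an illustration, not HTTrack's actual matching code: the function name `is_accepted` is made up, plain shell-style globbing from `fnmatch` stands in for the real filter syntax, the trailing `*[]` is approximated by simply ending the pattern at the slash, and the sketch assumes that when several filters match, the one listed last decides.

```python
from fnmatch import fnmatchcase


def is_accepted(url, filters, default=False):
    """Return True if the last filter matching `url` is a '+' rule.

    Assumption: each filter is '+pattern' or '-pattern', and a later
    matching filter overrides an earlier one.
    """
    verdict = default
    for rule in filters:
        sign, pattern = rule[0], rule[1:]
        # fnmatchcase: '*' matches any run of characters, including '/'
        if fnmatchcase(url, pattern):
            verdict = (sign == "+")
    return verdict


# Approximation of: +www.foo.com/*.html +www.foo.com/*/*[]
filters = ["+www.foo.com/*.html", "+www.foo.com/*/"]

for url in ("www.foo.com/a/b/page.html",
            "www.foo.com/docs/",
            "www.foo.com/docs/manual.pdf"):
    print(url, is_accepted(url, filters))
```

With these two patterns alone the html page and the directory index are accepted while the pdf is not, so a separate pdf filter is still needed among "your filters". Under the last-match-wins assumption, swapping the order of two contradictory rules flips the result.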
Thanks for the hint. Which reminds me: Is there some
kind of flowchart indicating which filters are being
applied first and how the software reacts if it comes
across two "opposite" filters (e.g. "maximum external
depth = 2" and then "do not download external files")?
Best wishes,
Andreas.