| Thanks for the fast & helpful response. That seems to
work as expected, more or less.
I find that I get the expected behavior -- capture of
the HTML page plus all referenced images -- so long as
I set Limits - Maximum Mirroring Depth to at least 2.
However, when the Max Mirroring Depth is set to only
1, I get only the HTML file, not the images it refers
to.
My actual application calls for capturing a long list
of pages (>10000, from as many distinct hosts!).
Accordingly, I'm hesitant to set Mirroring Depth to 2 -
- as that dramatically increases the number of pages
to be retreived and the duration of this task.
So, I think my questions are two:
1.) Am I right that Maximum Mirroring Depth must be
set to 2 (or higher) in order for the specified
procedure (custom Include/Exclude rule) to capture the
expected pages?
2.) Is there any other way to do this, that also
achieves my goal of efficient & fast operation?
Ben Edelman | |