Hello all. I searched for a related question and answer but found none.
I am attempting to archive a website which uses inconsistent formatting in the
HREFs. For example:
www.domain_name.com
domain_name.com
The mirror process appears to treat one or the other as REMOTE, so I have to
set the "Maximum External Depth" to 1, or it misses those links.
Unfortunately, several links are truly external, such as links to Wikipedia,
and following them turns a single-site mirror into something HUGE! If I set the
external depth to 0, it skips those links entirely (doesn't parse them). Is there
any way to configure the process to handle this kind of inconsistency in a web
site's design?
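
In case it helps to show what I mean, here is a rough Python sketch of the behavior I am hoping for. It is purely illustrative: the function names are my own and hypothetical, and this is not how the mirroring tool works internally. The idea is that a link counts as internal when its host matches the site's host after any leading "www." is dropped.

```python
from urllib.parse import urlparse


def normalize_host(url):
    """Return the lowercase host of a URL with any leading 'www.' removed."""
    # Bare hosts such as "domain_name.com" have no scheme, so add one
    # to let urlparse recognise the host part.
    if "://" not in url:
        url = "http://" + url
    host = (urlparse(url).hostname or "").lower()
    return host[4:] if host.startswith("www.") else host


def is_internal(link, site="domain_name.com"):
    """Treat www.domain_name.com and domain_name.com as the same site."""
    return normalize_host(link) == normalize_host(site)


if __name__ == "__main__":
    for link in ("www.domain_name.com",
                 "http://domain_name.com/page.html",
                 "https://en.wikipedia.org/wiki/Web_crawler"):
        print(link, "internal" if is_internal(link) else "external")
```

With a rule like this, both spellings of the domain would be mirrored, while the Wikipedia links would stay external.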
Thanks for any help.