HTTrack Website Copier
Free software offline browser - FORUM
Subject: Links with inconsistent domain formatting
Author: Chris Monro
Date: 07/16/2014 05:20
 
Hello all. I searched for a related question and answer but found none.

I am attempting to archive a website which uses inconsistent formatting in the
HREFs. For example:

www.domain_name.com
domain_name.com

The mirror process appears to see one or the other as REMOTE, so, I have to
specify the "Maximum External Depth" to 1, or it misses those. 

Unfortunately, several links are truly external, such as Wikipedia, which
takes a single site mirror, and makes it HUGE! If I set the External to 0,
then it skips the links (doesn't parse them). Is there any way to configure
the process to handle this kind of inconsistency in a web site's design?
Thanks for any help.
 
Reply


All articles

Subject Author Date
Links with inconsistent domain formatting

07/16/2014 05:20




d

Created with FORUM 2.0.11