HTTrack Website Copier
Free software offline browser - FORUM
Subject: links with anchor and nonstandard protocol
Author: aufgeist
Date: 12/20/2009 02:46
 
Hi, I encountered two problems when archiving some pages:

1. HTTrack does not automatically recognize nonstandard protocols such as
ed2k://. It tries to download such a link as a normal page within the start
directory, which results in a broken link. I mitigated the issue with the
filter -*ed2k://*, which leaves me with an absolute link (somewhat usable,
although not perfect). Is there a better solution for this situation?
2. Some pages have two links that actually point to the same file. The only
difference is that one link is plain and the other includes an anchor (#),
for example #tosubtitle. I want to download only the links that have an
anchor, because most pages don't have links with an anchor and I don't need
those. I only need the links with an anchor.

What should I do? I urgently need help. Thanks a lot.
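
To make the two cases concrete, here is a minimal sketch of how such links can be told apart with Python's standard urllib.parse module. It only illustrates the URL handling and is not HTTrack's own logic; the helper name classify_link, the FETCHABLE_SCHEMES set, and the sample URLs are assumptions made up for this example.

```python
from urllib.parse import urldefrag, urlsplit

# Schemes a web mirror can normally fetch. Anything else (e.g. ed2k) is
# treated as nonstandard here. This set is an assumption for the sketch.
FETCHABLE_SCHEMES = {"http", "https", "ftp"}


def classify_link(href):
    """Return (kind, target, fragment) for a link (hypothetical helper)."""
    scheme = urlsplit(href).scheme.lower()
    if scheme and scheme not in FETCHABLE_SCHEMES:
        # Not something to download; keep the link text untouched.
        return ("nonstandard", href, "")
    # Split off the "#anchor" part; the remainder is what gets requested.
    target, fragment = urldefrag(href)
    kind = "anchored" if fragment else "plain"
    return (kind, target, fragment)


if __name__ == "__main__":
    samples = [
        "http://www.example.com/page.html",
        "http://www.example.com/page.html#tosubtitle",
        "ed2k://|file|example.avi|1024|0123456789ABCDEF0123456789ABCDEF|/",
    ]
    for href in samples:
        print(classify_link(href))
```

Note that the plain and the anchored link end up with the same target: the part after # is not sent to the server, so both links refer to one file on disk and differ only in where the browser scrolls to.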
 





