| > 1, HTTrack cannot automatically identify nonstandard
> protocol such as ed2k://. It will try to download it
> as a normal page within the start directory,
> resulting ineffective link. I mitigated the issue by
> a filter -*ed2k://*, then I will get an absolute
> link (somewhat usable although not perfect). I want
> to know is there any perfect solution for this
> situation?A mirror is NOT a copy. Server side code (cgi, asp, sel) can not
be gotten, forms do not work. streaming media and the like can't be handled
since all you get is a collection of files. Httrack is a web site copier, ed2k
is not supported.
> 2, Some page have two links, which actually point to
> the same files. The only difference is one link is
> normal, the other has an anchor(#) tag included, for
> example, #tosubtitle. I want to just download the
> links with anchor tags, because most pages don't
> have links with anchor tag and I don't need those
> links. I just need links with anchor tag.
Httrack will download only one copy of a url, all links to the file will work,
with or without anchors. | |