HTTrack Website Copier
Free software offline browser - FORUM
Subject: Is there a way to eliminate links in the mirror?
Author: Alan Sill
Date: 09/23/2022 23:58
 
I like the filter options and am trying to use them to pull content from a
complex site that has too many sorting and formatting options on its pages. I
see that httrack obeys the filters and does not transfer the content I want
supposed, but leave the links to that content in the html files that it
creates as it mirrors. The use case here is to eliminate the links that would
normally result in sorting etc., i.e., to eliminate the links that are
suppressed from retrieval by the filter.

I suppose I could sed and awk my way through the mirrored content to try to
get rid of those links, replacing them wiht a link to a page that says
"Sorting not available in this archive", but is there a way to get httrack to
do this by itself? the feature request is this: replace all links that are
rejected by a "-" filter with a link to a page specified when httrack is
invoked.
 
Reply


All articles

Subject Author Date
Is there a way to eliminate links in the mirror?

09/23/2022 23:58




0

Created with FORUM 2.0.11