HTTrack Website Copier
Free software offline browser - FORUM
Subject: Filter on displayed text for hyperlink
Author: Gerald Wise
Date: 03/22/2005 00:19
 
I have a sight that I am trying to mirror, but want to 
select PDF files from the site based on the text in the 
hyperlink caption.  Is this possible?
For instance:

<a href=http://somesite/somedirectory/12345-
678.pdf">Documentation for Something (fr)</a>

The site I'm mirroring contains documentation in several 
languages.  The language is usually indicated in 
parenthesis within the hyperlink caption.  For instance 
(fr) for French, (it) for Italian, (es) for Spanish, etc.  
I would like to create a filter rule to exclude everything 
in a list of undesired languages.  In most instances, 
English is denoted by NO language identifier in 
parenthesis.  In the rare case when it is identified, (en) 
is used.

Thanks,
Gerry
 
Reply


All articles

Subject Author Date
Filter on displayed text for hyperlink

03/22/2005 00:19
Re: Filter on displayed text for hyperlink

04/06/2005 14:49




a

Created with FORUM 2.0.11