HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: New feature suggestion: External Link Scan Rules
Author: Xavier Roche
Date: 07/26/2002 20:44
 
> > While those are no help, imagine there was another one 
like *[external] ... then you might have filters 
like '+foo.com/*[external]*.zip' which would include all 
ZIP files on links external to foo.com
> In this way, users could finally nest the scan rules 
(which I suggested a while back) in the fashion of
> +foo.com/*[external][*.php][*.html][*.asp,*.cgi][*.jpg]
> would finally get the resulting .jpg files after many 
levels of obscuring the actual link!
> Many unix programs are powerful exactly because of their 
powerful scripting languages, slowly evolving more 
programmable scan rules for Httrack could be very useful.
 
Wow, I thought some current httrack options were obscure ; 
but here I don't know how I would explain that in the 
FAQs :))

Anyway this is an interesting idea ; but rather difficult 
to implement ; it's like including PC-regular-expressions 
in httrack; quite interesting, but too tricky and also 
complex for the end-user (MANY users don't understand how 
current scan rules work ; the #1 question is "how can I 
grab missing pictures"). But anyway I'll think a little 
more and see if (maybe a simplfied form) it can be 
possible. But not in the near future, anyway (the priority 
is stabilizing and including "urgent" missing features)

 
Reply Create subthread


All articles

Subject Author Date
New feature suggestion: External Link Scan Rules

07/24/2002 19:52
Re: New feature suggestion: External Link Scan Rules

07/25/2002 01:15
Re: New feature suggestion: External Link Scan Rules

07/25/2002 12:08
Re: New feature suggestion: External Link Scan Rules

07/26/2002 20:44




f

Created with FORUM 2.0.11