HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Query string filtering
Author: William Roeder
Date: 02/04/2009 00:17
 
> 1) .../article?AID=/DT/DAILY/TM
> 2) .../article?AID=/DT/DAILY/TM&template=BLAH
> 3) .../section?category=BLAH
> 4) .../section?category=FOO
> 5) .../section?category=BLAH&Extra=6
> 
> What filters would I use to include 1 and 3 but not
> 2, 4 and 5 on a mass scale?
FYI -*.html means no html files with or with out arguments
FYI -*.html*\[\] means no html files with no arguments
<http://www.httrack.com/html/fcguide.html> also lists *\[name\] and *\[path\]

only 1 and not 2 -* +*article* -*template=BLAH*
only 3 and not 4 or 5 -* +category=BLAH*\[\]
only 1+3 -2 -4 -5: -* +*article* -*template=BLAH* +category=BLAH*\[\]

> I have tried:
> -*FOO*
This should reject all 4 only (FOO anywhere)

> -*/*FOO*
Since every file contains a / (site/file) this rejects FOO anywhere but in the
site name

> -*/*template=BLAH*
This should reject all 2 only

> -*/*Extra*
This should reject all 5 only

Total reject 2, 4, and 5 and allow everything else.
 
> And it seems like these filters are ignored.  Is the
> "/apps/pbcs.dll" URL structure messing it up?  Or is
> it the multiple / in AID?  Should I use the ? in the
> filter?Slashes in AID may be confusing the parser.
 
Reply Create subthread


All articles

Subject Author Date
Query string filtering

02/03/2009 22:37
Re: Query string filtering

02/04/2009 00:17
Re: Query string filtering

05/12/2009 00:24




a

Created with FORUM 2.0.11