| > I'm trying to grab all pages/threads from a
> subforum, including attachments and images. How can
> I prevent HT from grabbing pages outside of that
> specific subforum? As it stands right now, the
> links to parent forums also get crawled as they
> <http://www.sawmillcreek.org/forumdisplay.php?f=4>
Most forums would have urls like threadDisplay.php?forum=4&Topic=2 So to get
all topics in forum 4 you would filter -* +*forum=4*
Sawmillcreek as url's like:
<http://www.sawmillcreek.org/forumdisplay.php>? f=4& order=desc
<http://www.sawmillcreek.org/showthread.php>? t=98737
So only allow forum 4 and any threads:
-* +*f=4* +*/showthread*
This should work, but if a thread specifically references a thread in another
forum, you'll get that thread also. | |