| I've read the tutorial and checked my options several
times, but it's still not working.
I want to start from the URL
<http://www.newsisfree.com/sources/bycat/1>
I then want to spider through the links on that single
page, but only the links for this directory:
<http://www.newsisfree.com/sources/info/>
I don't want to download every page in /info/,
only the pages which are linked-to by "bycat/1".
For the resulting pages in the /info/ directory,
I want to download only linked-to pages in this directory:
<http://www.newsisfree.com/sources/rss/>
When I try to do this in WinHTT, I immediately see that
undesired pages are being processed, for example
<http://www.newsisfree.com/sources/bycat/0>
Thank you. | |