| > I'm trying to download specific part of a message board,
> which contains about 700 posts. However, I ended up
> downloadind thousands of html files and the size went up
> to 2 gigs and counting. I wonder is this right or I
> screwed up somewhere.
Maybe a loop, or duplicate posts (like the same page but
with ?foo=next or ?foo=previous) ; you'll have to check how
message links "look like"
I'd be greatful if someone more
> experienced with this kind of things could help me out.
> The main address is: <http://www.ryanross.net/cc/> and
the
> part I want to download is the 'Editing Board'.
At a first glance, exclude everything but images, and then
select the desired thread ; for example 1234, and do not
get "flat views" (the end must be t=XXX) :
Set Options / Scan rules:
-* +*.gif +*.png +*.jpg +*.css +*.js
+www.ryanross.net/cc/read.php*t=1234*[]
(replace 1234 by the desired thread it, that is, the number
in t=XXX)
| |