HTTrack Website Copier
Free software offline browser - FORUM
Subject: Yet Another forum run through queries (on YABBS)
Author: Hypocee
Date: 07/10/2012 00:26
 
Hi folks, I've been using (mostly Win)Httrack occasionally for a few years,
mostly with great success. I think I have a pretty good understanding of link
limits, scan rule priority and so on. However, I've taken three or four
attempts at mirroring an old private forum over the years. I just failed again
at the same point I always have, and this time I'm going to ask for help.

My command line from the hts-log, domain altered, starting at the domain to
see if I can get past this page's "malformed message" spamblocking measures: 
<http://forum.mydomain.org/index.php?action=login2?>postfile:E:\httrack\Fiji\hts-post1>
-O1 E:\httrack\Oldboard -* +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar +*php/?board=1.*
+*php/?topic=*

That's just my latest line with paranoid escapes on the question marks - I
started with -* +*board=1.* +*topic=* and have tried a number of other simple
variations. The server's robots.txt locks off addresses beginning with / . I
have tried the ignore robots option with both my current and starting filters.


My goal is to grab all thread listing pages in section 1 of the board and the
topics linked from them. The board uses an ancient YABBS installation with
URLs of form .../index.php?board=1.0, 1.25, 1.50 etc. Currently the result of
running this job is that I get one and only one page of HTML - the front page
of the forum immediately after logging in, listing the sections, with my
username and avatar displayed so I'm confident I'm getting logged into a
session OK. The logout link is index.php?action=logout and I don't get a
"you've logged out" page in the job so I assume I'm not tripping that. 

What silly mistake am I making, please? Failing to escape the equal signs is
all I can think of at this point, but I'm having a hard time searching for it
and I haven't seen anything about a need for escapes in the filters section of
the docs.
 
Reply


All articles

Subject Author Date
Yet Another forum run through queries (on YABBS)

07/10/2012 00:26
Re: Yet Another forum run through queries (on YABBS)

07/10/2012 01:22
Re: Yet Another forum run through queries (on YABBS)

07/10/2012 11:24
Re: Yet Another forum run through queries (on YABBS)

07/11/2012 00:52
Re: Yet Another forum run through queries (on YABBS)

07/11/2012 00:59
Re: Yet Another forum run through queries (on YABBS)

07/11/2012 18:42
Re: Yet Another forum run through queries (on YABBS)

07/11/2012 22:19




5

Created with FORUM 2.0.11