| Hi, I am having a similar issue as the one posted here:
<http://forum.httrack.com/readmsg/19682/index.html?q=login.php%3Fdo%3Dlogin+httrack>
I am trying to clone the members section of a VBulletin forum website. When I
use HTTrack to capture the login URL, I get confirmation in my browser that
the information was captured by HTTrack.
In HTTrack the captured URL looks like
<http://www.gpzzone.co.uk/gpzforum/login.php?do=login?>postfile:C:\Documents%20and%20Settings\Fred\My%20Documents\Sites\GPZZone\hts-post1>
In HTTrack I set the following scan rules
-*
-http://www.gpzzone.co.uk/gpzforum/login.php?do=logout*
+http://www.gpzzone.co.uk/gpzforum/
I would hope this would be sufficient to filter the logout, and grab the
member forum pages. I have also set Spider to no robots.txt rules. However, I
seem to have problems.
When I try to mirror the site, the copy finishes. When I click view site, the
welcome login screen loads (Thank you for logging in), but then after a second
or two it takes me back to the non-member area as if it did not login. None of
the member pages are downloaded.
I have verified that I have 'accept cookies' checked.
My log file states:
03:31:19 Warning: file not stored in cache due to bogus state (incomplete
type): www.gpzzone.co.uk/gpzforum/login.php?do=logout*
03:31:24 Error: "Not Found" (404) at link www.gpzzone.co.uk/gpzforum/* (from
primary/primary)
| |