HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Mirroring an Invision Power Board Forum
Author: William Roeder
Date: 12/09/2009 15:03
 
> <http://forum.playredstone.com>.
> 
> The issue is on the main page (and others), HTTrack
> will download it, and show the links properly, and
> they link to other mirrored pages. But, when the
> mirror goes deeper, such as into the topics, trying
> to access pages such as the front Index again via
> the top breadcrumbs link results in a whole
> different link being provided by HTTrack, which
> directs you to a page that doesn't exist.

You did let the mirror complete, didn't you? I ran it for 12 hours and got 5GB,
with only 717 files complete out of 37K.
The link to the home page is in the form
index.xxx?s=2454e3a40b486d8ea134c30bf080fcae&act=idx
which to HTTrack is different than the initial index.xxx.
Therefore the link to the home page will be one of the last files to be
retrieved.
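
To illustrate why the two links count as different pages, here is a minimal Python sketch of a crawler that keys its "already fetched" check on the full URL, query string included. This is not HTTrack's code; crawl_key is a made-up helper, and joining the forum host with index.xxx is only an assumption for the example.

```python
from urllib.parse import urlsplit, parse_qs

def crawl_key(url: str) -> tuple:
    """Hypothetical key a crawler might use to decide whether a URL was already fetched."""
    parts = urlsplit(url)
    # The query string is part of the key, so any extra parameter makes a new page.
    return (parts.netloc, parts.path, parts.query)

# Assumed example URLs built from the forms quoted in this thread.
home = "http://forum.playredstone.com/index.xxx"
breadcrumb = (
    "http://forum.playredstone.com/index.xxx"
    "?s=2454e3a40b486d8ea134c30bf080fcae&act=idx"
)

same_page = crawl_key(home) == crawl_key(breadcrumb)
query = parse_qs(urlsplit(breadcrumb).query)

print(same_page)       # the two keys differ, so this is False
print(sorted(query))   # the parameters that make the breadcrumb link distinct
```

Both URLs show the same forum index to a person, but to a tool comparing URLs they are two separate entries.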

> This issue isn't necessarily HTTrack's fault, but
> because the forums are set-up to have links where
> one is simply "index.php", while another is
> "index.php?act=idx" and another is
> "index.php?s=378433773".

HTTrack handles these just fine by renaming the files to indexAAAA.html,
indexHHHH.html, and so on.
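
As a rough idea of how such renaming can work, the sketch below gives each distinct query string its own local file name by appending a short suffix derived from the query. This is an assumption for illustration only, not HTTrack's actual naming algorithm; local_name and the MD5-based suffix are made up for the example.

```python
import hashlib
import posixpath
from urllib.parse import urlsplit

def local_name(url: str) -> str:
    """Hypothetical mapping from a URL to a local file name (not HTTrack's real scheme)."""
    parts = urlsplit(url)
    # Take the file name without its extension, e.g. "index" from "index.php".
    stem = posixpath.splitext(posixpath.basename(parts.path))[0] or "index"
    if not parts.query:
        return stem + ".html"
    # Four hex characters derived from the query keep the variants apart.
    suffix = hashlib.md5(parts.query.encode("utf-8")).hexdigest()[:4]
    return f"{stem}{suffix}.html"

for link in ("index.php", "index.php?act=idx", "index.php?s=378433773"):
    print(link, "->", local_name(link))
```

With a scheme like this, the three links from the quoted message end up as three separate files, and each rewritten link points at its own copy.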

Also, because of the session ID (?s=...), you may not be able to continue or
update the mirror: if you get a different session ID, it'll redownload
everything and delete all the old files.
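
The sketch below shows why a new session ID makes every link look new: the same page requested in two sessions yields two different URLs, and they only match again once the s parameter is dropped. strip_session is a made-up helper for illustration, not an HTTrack option, and the second URL reuses the session value from the quoted message as an assumed example.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode

def strip_session(url: str, param: str = "s") -> str:
    """Hypothetical helper: return the URL with the session parameter removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != param
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))

# The same forum index as linked in two different sessions (assumed examples).
first_run = "index.php?s=2454e3a40b486d8ea134c30bf080fcae&act=idx"
second_run = "index.php?s=378433773&act=idx"

print(first_run == second_run)                                # False
print(strip_session(first_run) == strip_session(second_run))  # True
```

Compared as plain strings, nothing from the first run matches the second run, which is consistent with an update treating all the old files as gone and all the new links as unseen.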
 