HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Cache bug?
Author: William Roeder
Date: 03/11/2010 00:25
 
> run 1 - site downloaded
> 
> run 2 - site updated. old cache renamed "old.zip".
> "new.zip" contains only updated files
> 
> run 3 - site updated again. cache from run 2 renamed
> "old.zip". files updated in run 2 are checked for
> updates. files not updated in run 2 are no longer in
> the cache, are assumed to be new, and get downloaded
> again.
> 
> is this really what's happening? or am I missing
> something?

1) You don't say which version you're using.
2) If the files were not in the cache from run 2, that means they were no longer
referenced on the site and were deleted.
Possibly the site went down, or your timeout limits are too short (I always
run with timeout=300s retries=9), or you canceled a mirror.
3) HTTrack also checks file timestamps (see update hack), and if they change
HTTrack will redownload. This happens when a site is rebuilt but the owner
carelessly copies the files without keeping the timestamps (copy vs xcopy, for instance).
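
One way to check whether entries really drop out of the cache between runs is to compare the two archives directly. The following is only a rough sketch: it assumes the caches are ordinary zip files (the "old.zip" and "new.zip" named in the question), and the function names cache_entries and compare_caches are made up for illustration, not part of HTTrack.

```python
import zipfile


def cache_entries(path):
    """Return the set of entry names stored in one cache archive."""
    with zipfile.ZipFile(path) as archive:
        return set(archive.namelist())


def compare_caches(old_path, new_path):
    """Group entry names by which archive(s) they appear in.

    Hypothetical helper: it only looks at entry names, not at
    timestamps or contents.
    """
    old = cache_entries(old_path)
    new = cache_entries(new_path)
    return {
        "only_old": old - new,  # present before, missing from the latest run
        "only_new": new - old,  # first seen in the latest run
        "both": old & new,      # carried over between runs
    }
```

Calling compare_caches("old.zip", "new.zip") after run 2 and looking at "only_old" would show whether unchanged files are actually missing from the newer cache, or whether something else (timeouts, a canceled mirror, changed timestamps) is triggering the redownload.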
 