HTTrack Website Copier
Free software offline browser - FORUM
Subject: Writing to disk.
Author: GLF
Date: 03/20/2006 22:29
 
I'm trying to download a website that's comprised of a very large number of
small html files. Only the branch files (like index.html) get written out to
disk. The leaf files (like 001.html or file.zip) get written to cache in the
case of html files. In the case of binaries, they show up in the cache as a
0-byte file but the data disappears, even though I've watched them be
downloaded.

My command line arguments are: 
-qir20%e0C1%PnxX0s0u1%uN0I0%I0p3DaK0c2R10H0%kQo0A2048%c5%f0#f

Also, it seems the cache is just duplicating what's going onto the disk
anyway, so wouldn't it be more efficient to replace the zip with just a text
file listing the path, name, and date modified of all the files downloaded?
Anything that needs the actual files stored in the zip files would use the
files on disk instead. And, it would be easy to check if a file was modified
on the disk after it was downloaded by cross-referencing the modification
times with the entry in the text file.
 
Reply


All articles

Subject Author Date
Writing to disk.

03/20/2006 22:29
Re: Writing to disk.

03/22/2006 21:28




8

Created with FORUM 2.0.11