| I'm running httrack from cygwin on Win2K on this URL:
<http://news.com.com/2003-12_3-0.html>
The actual HTML file contains approximately 51,337 bytes.
The downloaded HTML file truncates to 1,951 bytes. There
is a NULL (0x00) character at byte 1,952 of the file.
The hts-log.txt file says:
1 links scanned, 1 files written (50964 bytes overall)
[51337 bytes received ...
Is there any way to force httrack to not truncate at NULL
values when saving the file to disk? I suspect that
httrack is actually receiving the entire file, and then
decides to truncate only when writing to the filesystem.
As a point of reference, another program, curl
(http://curl.haxx.se) seems to work fine on the URL, and
I'm able to redirect the output to disk file without any
problems of truncation. | |