Re: Writing to ARC format - HTTrack Website Copier Forum

Subject: Re: Writing to ARC format

Author: Lars Clausen

Date: 11/19/2003 09:33

> Ah, yes. Some transformations are being made:
> 200 => 206 for bogus responses with 'content-range'
> 203 => 200 because this is the same and I did not want 
> multiple tests
> 416 => 304 for bogus 'Requested Range Not Satisfiable' 
> responses when the file is actually complete after a 
> request using Content-range: bytes */<size>
> 406 => 200 OK is good: links inside
> 206 => 200 for partial files (200 is good)
> 'brain fcked request' => 200 because brainfcked servers 
> exists :(

I didn't know that was a lgeal response code:)

> 304 => 200 because as we have a cache, we can 
> simulate 'fake' 200 responses (so that we don't have to 
> bother with the update process)

But you don't do 200 => 30* for 200 responses with a
Location: header?
> There are also negative codes (-1,..-10) for timeouts, and 
> other various errors (see the 'msg' member of the htsblk 
> structure, which is the error string message)
> 
> > get_header shasta.cs.uiuc.edu/~lrclause: (-5) 
> > * shasta.cs.uiuc.edu/~lrclause (270 bytes) - 301
> 
> Argh - forgot to say that the receive-header callback is 
> called **BEFORE** headers are being processed - therefore 
> most fields are irrelevant at this state!

Aha!  I thought that might be the case.  Is there anything
in the htsblk that's reliable at that point?  And is there
something I can do to have the headers processed (short of
doing it myself)?  Don't see a way to indicate in htsblk
that the headers are parsed, so if I call something to fill
in htsblk, that'd be done again afterwards, right?
> > Note that it depends on GLib, since I had HTTracks own 
> hash
> > table crash on me.
> 
> Darn! Using htsinthash.* functions ?
Yup.  Don't have the time to go in and understand it, so I
just picked the known GLib implementation.  Also, I'm not
sure what to do to remove an entry from the htsinthash.

-Lars

Create subthread

All articles

Subject	Author	Date
Re: Writing to ARC format		11/18/2003 20:39
Re: Writing to ARC format		11/19/2003 09:33
Re: Writing to ARC format		11/22/2003 16:10