| I think, as Xavier pointed out in another recent post,
that some of this is rather dependent upon the server
itself and it sending the correct HTTP headers.
If a file is not found on a server, the server is set up
to display a 404 page, and the response carrying that
page SHOULD have a "404 Not Found" HTTP status. If it
doesn't (say it goes out as "200 OK"), then it would be
recognised as an ordinary page.
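To make that concrete, here is a rough sketch of how a client could read the status the server sent. It uses Python's standard urllib; the names fetch_status and classify are just mine for illustration, and any URL you pass in is a placeholder.

```python
from urllib.request import urlopen
from urllib.error import HTTPError


def fetch_status(url, timeout=10):
    """Return the HTTP status code the server sent for url."""
    try:
        with urlopen(url, timeout=timeout) as response:
            return response.status
    except HTTPError as err:
        # urlopen raises HTTPError for 4xx/5xx responses;
        # the status code is available on the exception.
        return err.code


def classify(status):
    """Decide how a client would treat a response, going by status alone."""
    if status == 404:
        return "not found"
    if 200 <= status < 300:
        # An error page sent with "200 OK" ends up here, which is the
        # problem described above: it looks like an ordinary page.
        return "ordinary page"
    return "other"
```

Note that classify never looks at the body, so a "not found" page sent with a 200 status cannot be told apart from real content.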
Coupled with that is also the "type" of a file. I
believe this is also up to the server. It can look
at the requested file, see what type it is, and set
the appropriate Content-Type header.
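As a sketch of both halves of that, the snippet below guesses a type from a file name the way a server might, and then checks a Content-Type header value the way a client might. I am assuming a server that maps file extensions to types; real servers differ in how they do this. The function names are my own, only mimetypes and email.message come from the standard library.

```python
import mimetypes
from email.message import Message


def content_type_for(path):
    """Guess the Content-Type a server might send for a file name."""
    guessed, _encoding = mimetypes.guess_type(path)
    # Fall back to a generic binary type when the extension is unknown.
    return guessed or "application/octet-stream"


def is_html(content_type_header):
    """Check whether a Content-Type header value says the body is HTML."""
    msg = Message()
    msg["Content-Type"] = content_type_header
    # get_content_type() drops parameters such as charset
    # and lowercases the type.
    return msg.get_content_type() == "text/html"
```

So the client only has to trust the header, for example is_html("text/html; charset=utf-8"), and never has to open the file to work out what it is.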
I don't think "looking" at an HTML page for <html>
would work very well, as not every HTML page starts
with this, and some don't even have one at all.
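A quick illustration of why sniffing the body is unreliable. The check below is a deliberately naive one I made up for the example, not something any particular tool does.

```python
def looks_like_html_naive(body):
    """Naive sniffing: say 'HTML' only if the body starts with <html."""
    return body.lstrip().lower().startswith("<html")


# A page that opens with a doctype before the <html> tag.
with_doctype = "<!DOCTYPE html>\n<html><body>Hello</body></html>"

# A page with no <html> tag at all (the tag is optional in HTML).
without_html_tag = "<title>Hi</title>\n<p>Hello</p>"

# A page that happens to start with <html>.
plain = "<html><body>Hello</body></html>"
```

Only the last sample passes the naive check, even though all three are HTML pages, which is why the header from the server is the better thing to go by.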
| |