HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Defaults, unquoted links, UTF-to-ISO conversion
Author: Joe Forster/STA
Date: 03/13/2004 15:02
 
Hi Xavier,

I don't remember where I found this unquoted HREF problem. 
When I find it again, I'll drop you a note. :-)

As for the detection of encoding, I have a simple idea, 
leaned from Opera:

1. If the web page has a encoding meta tag
(like <META HTTP-EQUIV="Content-Type" CONTENT="text/html; 
charset=ISO-8859-1">) then the encoding is obvious.

2. If the web page has no such meta tag then there should be 
a _default_ encoding that the user can configure.

This is in the "Language" page of the Opera preferences, 
option "Encoding to assume for pages lacking specification".

And HTTrack has to read through the complete web page anyway 
as it has to find links. During this process, the encoding 
meta tag can also be noticed. Thanks for your time,

Joe
 
Reply Create subthread


All articles

Subject Author Date
Defaults, unquoted links, UTF-to-ISO conversion

03/10/2004 15:34
Re: Defaults, unquoted links, UTF-to-ISO conversion

03/13/2004 09:24
Re: Defaults, unquoted links, UTF-to-ISO conversion

03/13/2004 15:02




c

Created with FORUM 2.0.11