HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Linking Error but can't detect
Author: Xavier Roche
Date: 05/18/2002 14:02
 
> You can do it. It may be due to the file saved in your 
> folder is 6 character, a non-chinese character file name.

Ahhh okay I see the problem:

- UTF-8 characters are saved into their unicode equivalent 
(UCS table) on the windows filesystem, because HTTrack does 
not escape them when opening the file (fopen() call) and 
because the Chinese version of windows automatically detect 
UTF format and translate it
(I strongly suspect windows to check the UTF sequence and 
see if it is valid)

- IE (or Netscape) can not access these files because the 
HTML encoded form (%xx) force them to convert EACH 
characters into the native UCS-2 form, and therefore the 
subsystem won't translate them.

Argh.

The only way would be to detect UTF encoding in HTML pages 
(generally neither in headers nor in meta), argh argh. I'll 
add this problem on the TODO list, but don't yet know how 
to solve it

> Last but not the least, you are very very good helper and 
> willing to provide support. Great !

Thanks :)
 
Reply Create subthread


All articles

Subject Author Date
Linking Error but can't detect

05/14/2002 13:19
Re: Linking Error but can't detect

05/14/2002 22:44
Re: Linking Error but can't detect

05/15/2002 20:16
Re: Linking Error but can't detect

05/15/2002 20:45
Re: Linking Error but can't detect

05/16/2002 20:59
Re: Linking Error but can't detect

05/16/2002 21:44
Re: Linking Error but can't detect

05/17/2002 19:38
Re: Linking Error but can't detect

05/18/2002 14:02




d

Created with FORUM 2.0.11