HTTrack Website Copier
Free software offline browser - FORUM
Subject: non-english file name support
Author: Shankar
Date: 12/11/2010 15:12
 
First I want to congratulate you on such a wonderful piece of software.

I searched before posting, but it was not clear. I repost what I found and ask
for clarification.  Here's one post I found:

	

> Good afternoon, Is it possible to improve UNICODE
> support (names and content of files) in any future
> version of amazing program winhttrack ?> I want to download blog on
> <http://jinepravo.blogspot.com/> for offline reading
> but this blog contains in name of files and their
> content UNICODE of Czech language or Can you help me

You have replied:

"Httrack will rename files to match your windows settings and adjust the
links
to match. All the other text is untouched.

If your browser can't display the unicode correctly, you need to adjust it
and
windows."

What windows settings?
I get the following from the hts-log.txt:

Info: 	engine: warning: serialize error for 

ta.wikipedia.org/wiki/%E0%AE%AE%E0%AE%BE%E0%AE%B0%E0%AF%8D%E0%AE%9A%E0%AF%8D%E0%AE%9A%E0%AF%81_21,_2009_%E0%AE%AA%E0%AF%86%E0%AE%99%E0%AF%8D%E0%AE%95%E0%AE%B3%E0%AF%82%E0%AE%B0%E0%AF%8D_%E0%AE%87%E0%AE%A8%E0%AF%8D%E0%AE%A4%E0%AE%BF%E0%AE%AF_%E0%AE%85%E0%AE%B1%E0%AE%BF%E0%AE%B5%E0%AE%BF%E0%AE%AF%E0%AE%B2%E0%AF%8D_%E0%AE%A8%E0%AE%BF%E0%AE%B1%E0%AF%81%E0%AE%B5%E0%AE%A9%E0%AE%AE%E0%AF%8D_%E0%AE%A4%E0%AE%AE%E0%AE%BF%E0%AE%B4%E0%AF%8D_%E0%AE%B5%E0%AE%BF%E0%AE%95%E0%AF%8D%E0%AE%95%E0%AE%BF%E0%AE%AA%E0%AF%8D%E0%AE%AA%E0%AF%80%E0%AE%9F%E0%AE%BF%E0%AE%AF%E0%AE%BE_%E0%AE%AA%E0%AE%9F%E0%AF%8D%E0%AE%9F%E0%AE%B1%E0%AF%88
to E:/h/websites/012
wiki-ta/ta.wikipedia.org/wiki/மார்ச்சு_21,_2009_பெங்களூர்_இந்திய_அறிவியல்_நிறுவனம்_தமிழ்_விக்கிப்பீடியா_பட்டறை.html.tmp:
open error: No such file or directory (directory exists, file does not exist)
18:06:50

I get many such errors.  that is it is writing the file name in url encoded
ascii or something instead of using unicode file name.  I was trying to
download ta.wikipedia.org.  It creates some files properly but many files
don't get created at all.  It said files:5870 when i stopped it but only 197
files actually got written.  Also bytes saved was 1.1 GiB but only 100 MB
network usage is network traffic different from bytes saved?
Could you please try to rip a few pages off ta.wikipedia.org and see if
httrack needs to be fine tuned for non english file names?  Also the links
between the files do not work.  The main index page loads properly but when I
click on the link the browser is searching for unicode file names but the
files have been saved in url mangled ascii or something.

Thank you for a wonderful product.

Shankar.
 
Reply


All articles

Subject Author Date
non-english file name support

12/11/2010 15:12
Re: non-english file name support

01/29/2013 03:00




3

Created with FORUM 2.0.11