HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Deleting weird characters from links
Author: Xavier Roche
Date: 07/17/2004 15:49
 
> E.g. go to <http://xoro.com/product/XOR000.prod>, click on 
> "Description" and you will get to <http://xoro>.
> com/product/daten_XOR000.prod . However, if you have a 
look 
> at the page source, you will find leading and trailing 
> spaces and, get a grip, "&#xD;" and "&#xA;" codes (CR and 
LF 
> characters_ in the links!

Argh!
href="&#xD;&#xA;                               
daten_XOR000.prod&#xD;&#xA;                            "

Obviously, there is some oddities in this site..

Currently, httrack can't download such links, I'll have to 
see how those weird characters can be filtered safely.
 
Reply Create subthread


All articles

Subject Author Date
Deleting weird characters from links

07/13/2004 17:08
Re: Deleting weird characters from links

07/17/2004 15:49




7

Created with FORUM 2.0.11