HTTrack Website Copier
Free software offline browser - FORUM
Subject: Grabbing external images - possible bug
Author: Phil
Date: 10/24/2006 18:25
 
I have been trying to grab a web site whose images are hosted on an external
server.  Some of the images were grabbed and others weren't.  After a series
of filter, configuration and debugging attempts, I have narrowed it down to a
single change that makes the image grabbing work or fail.

HTML of Failure Example:
<img alt="v Hannover at Muscato Tourney >  IMG_2955_filtered" title="v
Hannover at Muscato Tourney >  IMG_2955_filtered" border="0"
src=http://my.site.com/photos/11486-S.jpg  width="200"  height="300" />

HTML of Success Example:
<img alt="v Hannover at Muscato Tourney -  IMG_2955_filtered" title="v
Hannover at Muscato Tourney -  IMG_2955_filtered" border="0"
src=http://my.site.com/photos/11486-S.jpg  width="200"  height="300" />

The only difference is the '>' character in the alt= and title= attributes
(replaced by '-' in the success example).

It appears that the '>' character inside the quoted attribute values is being
parsed incorrectly, causing the download to fail.
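
To illustrate what I suspect is happening, here is a minimal Python sketch. It
is not HTTrack's actual code, and I have not looked at how HTTrack really scans
tags. The function names (naive_tag_end, quote_aware_tag_end, find_src) are
made up for this example. It only shows how a scanner that treats the first
'>' as the end of the tag would never see the src= attribute, while a scanner
that skips '>' inside quotes would.

```python
import re

# The failing <img> tag from above, joined onto one line.
FAIL = ('<img alt="v Hannover at Muscato Tourney >  IMG_2955_filtered" '
        'title="v Hannover at Muscato Tourney >  IMG_2955_filtered" border="0" '
        'src=http://my.site.com/photos/11486-S.jpg  width="200"  height="300" />')


def naive_tag_end(html, start=0):
    """Hypothetical naive scanner: the tag ends at the first '>'."""
    return html.index('>', start)


def quote_aware_tag_end(html, start=0):
    """Ignore '>' characters that sit inside quoted attribute values."""
    quote = None
    for i in range(start, len(html)):
        ch = html[i]
        if quote:
            if ch == quote:      # closing quote ends the attribute value
                quote = None
        elif ch in '"\'':        # opening quote starts an attribute value
            quote = ch
        elif ch == '>':          # unquoted '>' really ends the tag
            return i
    return -1


def find_src(tag_text):
    """Return the src= value (quoted or unquoted), or None if absent."""
    m = re.search(r'src=["\']?([^\s"\'>]+)', tag_text)
    return m.group(1) if m else None


naive_tag = FAIL[:naive_tag_end(FAIL) + 1]
full_tag = FAIL[:quote_aware_tag_end(FAIL) + 1]

print(find_src(naive_tag))   # the tag was cut off inside alt=, so no src is found
print(find_src(full_tag))    # the whole tag is seen, so the image URL is found
```

With the naive scan the tag is cut off in the middle of the alt= text, which
would match the behaviour I am seeing: the image link is never picked up.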

Has anyone else experienced this?  Any work-arounds?
 