| I have been trying to grab a web site that has the images posted on an external
server. Some of the images were grabbed and others weren't. After a series
of filter, configuration and debugging attempts, I have arrived at a simple
change that works/breaks the image grabbing.
HTML of Failure Example:
<img alt="v Hannover at Muscato Tourney > IMG_2955_filtered" title="v
Hannover at Muscato Tourney > IMG_2955_filtered" border="0"
src=http://my.site.com/photos/11486-S.jpg width="200" height="300" />
HTML of Success Example:
<img alt="v Hannover at Muscato Tourney - IMG_2955_filtered" title="v
Hannover at Muscato Tourney - IMG_2955_filtered" border="0"
src=http://my.site.com/photos/11486-S.jpg width="200" height="300" />
The difference is the '>' character in the alt= and title=
It appears that the '>' character is being parsed incorrectly causing the
failed download.
Has anyone else experienced this? Any work-arounds?
| |