| ok so indeed the culprit must have been this html in my shell pipe.
Replace the dot with a space between a and href in the following line to set
it to work:
cat httrack/manga.animea.net/real*.html | grep "onerror=\"this.src" | sed
's!<a.href="\(.*\).html"><img src="\(.*\)" onerror="this.src=.*!YEAH
httrack/manga.animea.net/\2 \1.jpg!' | grep "YEAH " | grep -v "notfound.png" |
less
This parses the lines of the html files to extract the name of the images I'm
interested in and also prints an approximation to the filename (so that I can
go check it). It can be seen that some images are not local and they're
exactly the images with %2F names; also, if you | grep http: | wc -l, you see
there are many more non-local images than warning messages in the log.
Oups, gotta go. Thanks.
3.44-4 Debian/gnome console
| |