| Hi All,
I'm wondering if this is possible:
I'm wanting to scrape all images from say this base url:
www.someweb.com/images/
Setting the filters I'm only interested in scraping urls linked to from links
like:
www.someweb.com/images/a/alpha
www.someweb.com/images/a/anthrax
www.someweb.com/images/b/beta
www.someweb.com/images/z/zulu
Now within www.someweb.com/images/a/alpha, it links to the full size images I
am interested in, but these are on an external site, e.g.
www.pichost.com/somefolder/abc.jpg
www.pichost.com/somefolder/def.jpg
www.pichost.com/somefolder/xyz.jpg
Now my issue is that images abc, def and xyz are related to "alpha" so I want
them stored as
www.someweb.com/images/a/alpha/abc.jpg
www.someweb.com/images/a/alpha/def.jpg
www.someweb.com/images/a/alpha/xyz.jpg
i.e., ignore the fact they come from someweb.com/...
Is this possible, or any ideas how this might be achieved?
Many thanks,
Rob | |