> Is there an instruction for "Find all
> folders on www.dailymail.co.uk named 2013 and bring
> me the all the .jpg files."?
Unfortunately not. Unless you can view the directory listing of the given
folder(s) (i.e. the server allows directory browsing), you have to crawl the
pages that contain the references to those files.
You may have to define scan rules such as:
-* +*2013* +www.example.com/the-location-of-html-files/*.html
(exclude everything, then re-include anything with 2013 in its URL, plus the
HTML files that have to be crawled to find the references)
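
To see why the order of those rules matters, here is a minimal Python sketch of how such a rule list could be evaluated. It assumes that rules are read left to right and that the last matching rule decides, and it uses Python's `fnmatch` wildcards as a rough stand-in for the crawler's own pattern matching. The names `allowed` and `RULES` are made up for this illustration and are not part of any tool.

```python
from fnmatch import fnmatchcase

# Hypothetical rule list mirroring the scan rules above.
RULES = [
    "-*",
    "+*2013*",
    "+www.example.com/the-location-of-html-files/*.html",
]

def allowed(url, rules, default=False):
    """Return True if the url ends up included.

    Assumption: every rule is checked in order and the last rule
    that matches the url overrides any earlier verdict.
    """
    verdict = default
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(url, pattern):
            verdict = (sign == "+")
    return verdict

if __name__ == "__main__":
    for url in (
        "www.example.com/2013/photo.jpg",
        "www.example.com/2012/photo.jpg",
        "www.example.com/the-location-of-html-files/index.html",
    ):
        print(url, allowed(url, RULES))
```

Under these assumptions, `-*` first rejects every URL, and the two `+` rules then bring back the 2013 paths and the HTML pages that need to be crawled. Putting `-*` last would reject everything again.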