| > httrack <http://www.websitetospider.com/page.cfm>
> -O "./spidereddata" "-*" "+.cfm" "+.htm" "+-html"
> "+*websitetospider.com/listings.cfm/listing/*" -r6
missing asterisks on the cfm/htm..
> httrack will follow the links that don't match my
> patterns to find the other ones. I really want it to
It won't. The best you can do is spider the html and only download images etc
on the pages you want:
-* +*/listing/* +*.cfm +*.htm*
Alternative is to start httrack on a */listing/* page, then
-* +*/listing/* will work.
| |