site crawling and specific files only download - HTTrack Website Copier Forum

Subject: site crawling and specific files only download

Author: Alex

Date: 05/02/2012 18:36

HI,

I have a list of websites which I wish to crawl and download sepcific file
types only.

In the Action I am using "Get Seperated Files"

Then for the websites I have .txt file with seperated site:

www.mysite.com
www.hersite.co.uk
www.hissite.net

etc

I only want to download certain files mainly PDF, TXT, DOC, EXCEL no css, html
files etc

my filters box is currently set up like this

+*.png +*.gif +*.jpg *.pdf *.txt *.doc *.docx
+www.*.com/*.html +*.zip +*.pdf
+www.*.co.uk/*.html +*.zip +*.pdf
+www.*.net/*.html +*.zip +*.pdf
+www.*.org/*.html +*.zip +*.pdf

But I still not seem to be getting what I am after. I know that for example
www.mysite.com does have PDF's on it so why wont it find and get them?
Any help would be greatly appretiated, Thanks

All articles

Subject	Author	Date
site crawling and specific files only download		05/02/2012 18:36
Re: site crawling and specific files only download		05/02/2012 20:02