HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: probs with in-/excluding file types
Author: Xavier Roche
Date: 08/25/2001 18:41
 
> i'm afraid i still have problems with the concept of 
> file filters: there's a site from which i just want 
to 
> copy all the pdf-files which are exactly two links 
> away from the index. i tried a filter like
> -*.*
> +*.pdf
> but it didn't copy anything at all except for the 
> index. do you have any ideas how to improve this 
> search?
Remember that the engine can NOT "guess" where the pdf 
files are, and therefore you MUST also catch html 
pages.

Use also something like among other filters:

(your filters..) +www.foo.com/*.html +www.foo.com/*/*[]

This will accept all html and / (top index) files

 
Reply Create subthread


All articles

Subject Author Date
probs with in-/excluding file types

08/25/2001 10:44
Re: probs with in-/excluding file types

08/25/2001 18:41
Re: probs with in-/excluding file types

08/27/2001 14:48
Re: probs with in-/excluding file types

08/28/2001 17:51
Re: probs with in-/excluding file types

10/18/2001 08:28




3

Created with FORUM 2.0.11