Re: probs with in-/excluding file types - HTTrack Website Copier Forum

Subject: Re: probs with in-/excluding file types

Author: Andreas

Date: 08/27/2001 14:48

> (...)
> Remember that the engine can NOT 'guess' where the 
pdf 
> files are, and therefore you MUST also catch html 
> pages.

So I *must* download html files first before they can 
be processed, i.e. scanned for links? I somehow 
thought, WINHTTRACK did this "on the fly" ...

> 
> Use also something like among other filters:
> 
> (your filters..) +www.foo.com/*.html +www.foo.com/*/*
[]
> 
> This will accept all html and / (top index) files
> (...)

Thanks for the hint. Which reminds me: Is there some 
kind of flowchart indicating which filters are being 
applied first and how the software reacts if it comes 
across two "opposite" filters (i.e.: "maximum external 
depth = 2" and then "do not download external files")?
Best wishes,

Andreas.

Create subthread

All articles

Subject	Author	Date
probs with in-/excluding file types		08/25/2001 10:44
Re: probs with in-/excluding file types		08/25/2001 18:41
Re: probs with in-/excluding file types		08/27/2001 14:48
Re: probs with in-/excluding file types		08/28/2001 17:51
Re: probs with in-/excluding file types		10/18/2001 08:28