HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: probs with in-/excluding file types
Author: Andreas
Date: 08/27/2001 14:48
 
> (...)
> Remember that the engine can NOT 'guess' where the 
pdf 
> files are, and therefore you MUST also catch html 
> pages.

So I *must* download html files first before they can 
be processed, i.e. scanned for links? I somehow 
thought, WINHTTRACK did this "on the fly" ...

> 
> Use also something like among other filters:
> 
> (your filters..) +www.foo.com/*.html +www.foo.com/*/*
[]
> 
> This will accept all html and / (top index) files
> (...)

Thanks for the hint. Which reminds me: Is there some 
kind of flowchart indicating which filters are being 
applied first and how the software reacts if it comes 
across two "opposite" filters (i.e.: "maximum external 
depth = 2" and then "do not download external files")?
Best wishes,

Andreas.
 
Reply Create subthread


All articles

Subject Author Date
probs with in-/excluding file types

08/25/2001 10:44
Re: probs with in-/excluding file types

08/25/2001 18:41
Re: probs with in-/excluding file types

08/27/2001 14:48
Re: probs with in-/excluding file types

08/28/2001 17:51
Re: probs with in-/excluding file types

10/18/2001 08:28




a

Created with FORUM 2.0.11