HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: probs with in-/excluding file types
Author: Xavier Roche
Date: 08/28/2001 17:51
 
> Thanks for the hint. Which reminds me: Is there some 
> kind of flowchart indicating which filters are being 
> applied first and how the software reacts if it 

The test routine consists of 5 different phases:

PHASE 1 :
- expert "html only/non html only" option -> "I don't know" 
or "DO NOT catch" (abandon immediately)
- link is too deep (depth limit reached) -> same as above

PHASE 2 :
- check the link position against the travel options 
(up/down/same address...) -> "may catch" or "I don't know" 
status

PHASE 3 :
- "get non html files near a link" option -> "may catch" 
or "I don't know" status

PHASE 4 :
- filters, external depth and robots.txt (if used) -> 
"DO catch", "DO NOT catch" or "I don't know"

At this point, the engine makes its choice:
* if status == "may catch" ==> "DO catch"
* if status == "DO catch"  ==> "DO catch"
* if status == "DO NOT catch" ==> "DO NOT catch"
* else (status is still "I don't know"):
  - quiet wizard: "DO NOT catch" by default
  - wizard mode (asks questions): asks the user!
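
To make the choice above concrete, here is a rough Python 
sketch. It is only an illustration of the rules as listed, 
not the engine's actual code: the names Status, resolve and 
ask_user are invented for this example, and the quiet wizard 
is modelled (as an assumption) by passing no ask_user 
callback.

```python
from enum import Enum
from typing import Callable, Optional


class Status(Enum):
    # The four statuses named in the phases above.
    UNKNOWN = "I don't know"
    MAY_CATCH = "may catch"
    DO_CATCH = "DO catch"
    DO_NOT_CATCH = "DO NOT catch"


def resolve(status: Status,
            ask_user: Optional[Callable[[], bool]] = None) -> bool:
    """Return True for "DO catch" and False for "DO NOT catch".

    ask_user is a hypothetical callback standing in for the
    interactive wizard; None stands in for the quiet wizard.
    """
    if status in (Status.MAY_CATCH, Status.DO_CATCH):
        return True
    if status is Status.DO_NOT_CATCH:
        return False
    # Still undecided: the quiet wizard refuses by default,
    # the interactive wizard asks the user.
    if ask_user is None:
        return False
    return ask_user()
```

For example, resolve(Status.UNKNOWN) gives False (quiet 
wizard), while resolve(Status.UNKNOWN, ask_user=lambda: True) 
gives True because the "user" said yes.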

PHASE 5 :
- if status is "DO NOT catch", a non html file is 
encountered, and the "test all links" option is set 
==> "DO test" (test mode)
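
Phase 5 can be pictured as one last override on a rejected 
link. A minimal Python sketch follows, assuming the earlier 
decision has already been reduced to a yes/no; the function 
name final_action, its parameters and the returned strings 
are invented for illustration and do not come from the 
HTTrack sources.

```python
def final_action(catch: bool, is_html: bool,
                 test_all_links: bool) -> str:
    """Return "catch", "test" or "skip" for one link.

    catch is the decision reached after phase 4.
    """
    if catch:
        return "catch"
    # Rejected link: a non html file is still tested (not
    # downloaded) when "test all links" is enabled.
    if not is_html and test_all_links:
        return "test"
    return "skip"
```

So a rejected image with "test all links" enabled gives 
final_action(False, False, True) == "test", while a rejected 
html page is simply skipped.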

 