HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Bug in the way httack handles multiple URLS
Author: Haudy Kazemi
Date: 12/14/2002 22:41
 
> > > > Another solution is to use filters: (Options/Scan 
> rules)
> > > > 
> > > > -www.geocities.com +www.geocities.com/alpha/* 
> > > > +www.geocities.com/beta/* 
> +www.geocities.com/charlie/*
> > > 
> > > Sorry, the correct one is:
> > > -www.geocities.com/* +www.geocities.com/alpha/* 
> > > +www.geocities.com/beta/* +www.geocities.com/charlie/*
> > 
> > What difference will that trailing '/' make in the 
> filter 
> > processing?  I presume that the anything that matches 
> the 
> > pattern 'www.geocities.com/' will also match the 
> > pattern 'www.geocities.com'.  Even if you forget 
the '/' 
> in 
> > the exclusion rule, won't things still work?> 
> It's not the trailing '/' that makes the difference, it 
is 
> the '*' that makes the difference.  The first exclusion 
> rule would only match exactly 'www.geocities.com', which 
> probably would match nothing in this case.

Doh!  I knew I was missing something obvious...Thanks :)
 
Reply Create subthread


All articles

Subject Author Date
Bug in the way httack handles multiple URLS

12/13/2002 22:13
Re: Bug in the way httack handles multiple URLS

12/13/2002 22:43
Re: Bug in the way httack handles multiple URLS

12/13/2002 22:44
Re: Bug in the way httack handles multiple URLS

12/13/2002 23:32
Re: Bug in the way httack handles multiple URLS

12/14/2002 08:33
Re: Bug in the way httack handles multiple URLS

12/14/2002 22:41
Re: Bug in the way httack handles multiple URLS

05/07/2005 19:45




1

Created with FORUM 2.0.11