HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Restricting harvests to certain file types
Author: Xavier Roche
Date: 01/07/2003 21:14
 
> If you add +*.htm +*.html to the filters be sure you 
have  
> mirror level depth limits set otherwise you're going to 
> download the whole internet.

Exactly - use instead:

-* +www.foo.com/*.htm +www.foo.com/*.html 
+www.bar.com/*.htm +www.bar.com/*.html
.. and so on (for each site you want to crawl, add 
+<site>/*.htm +<site>/*.html)

 
Reply Create subthread


All articles

Subject Author Date
Restricting harvests to certain file types

01/05/2003 23:54
Re: Restricting harvests to certain file types

01/06/2003 22:20
Re: Restricting harvests to certain file types

01/06/2003 22:42
Re: Restricting harvests to certain file types

01/07/2003 21:14
Re: Restricting harvests to certain file types

01/07/2003 21:48
Re: Restricting harvests to certain file types

01/09/2003 09:54




7

Created with FORUM 2.0.11