HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: How to ignore external url's and spider levels
Author: Xavier Roche
Date: 08/16/2005 18:45
 
> 1. I want to spider based on my given levels.

Internal depth? You can limit the mirror to a defined depth, but generally you
don't need to (scan rules are sufficient).

> 2. I want to ignore external url's.

External URLs are ignored unless you ask for them to be followed. Just erase all
the default filters (+*.jpg and so on), since those match files on any site.

> 3. I don't want to download mentioned images.

Use scan rules for that too.

An example:

+www.example.com/*
-www.example.com/images/*
-*/bar.gif

This will mirror www.example.com, except the /images/ folder and any image named
"bar.gif".

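To see how the three rules above combine, here is a rough Python sketch. It is not HTTrack's own matcher: it assumes that rules are checked in order and the last matching rule decides, that "*" matches any run of characters (approximated here with fnmatch), and that a URL matching no rule is skipped, as an external link would be. The function name is_allowed and the test URLs are made up for illustration.

```python
from fnmatch import fnmatchcase

# Scan rules from the example above, in the order they were given.
RULES = [
    "+www.example.com/*",
    "-www.example.com/images/*",
    "-*/bar.gif",
]


def is_allowed(url, rules=RULES, default=False):
    """Return True if the last rule matching the URL is a '+' rule.

    Assumption: later rules override earlier ones, and a URL that
    matches no rule falls back to `default` (skipped here).
    """
    # Drop the scheme so the URL compares against rules written without one.
    target = url.split("://", 1)[-1]
    verdict = default
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(target, pattern):
            verdict = (sign == "+")
    return verdict


if __name__ == "__main__":
    for url in (
        "http://www.example.com/index.html",
        "http://www.example.com/images/logo.png",
        "http://www.example.com/docs/bar.gif",
        "http://other.example.org/page.html",
    ):
        print(url, is_allowed(url))
```

Under these assumptions only the first URL is kept: the second falls in /images/, the third is named bar.gif, and the fourth is on another host and matches no "+" rule.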
 