| >if i have no filters set up, does that mean httrack
>will download everything it spiders? if that is so,
>then you don't really need to add include links (+),
>no? i guess my question is, what is the default scan
>behavior if no scan rules are added?
Whatever the user-defined filters are, the engine will
always put DEFAULT filters.
For example, if you crawl www.foo.com/bar/index.html,
the engine will automatically add the filter
+www.foo.com/bar/* before all other filters (user-
defined filters will therefore override it)
There are also default engine behaviour, such as
staying on the same domain and same folder level.
All these default settings were designed so that you
can grab a website without having to define any
additional options (that is, generally, you'll only
have to enter the website address and click
twice "NEXT" to start the mirror)
| |