HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Filters and WinHTTrack
Author: Xavier Roche
Date: 06/21/2003 07:31
 
> In the help file I keep seeing references to filters, but
> nowhere in the interface do I see where/how to use them.
> Basically I have a problem, I want to do a cache of a
> news site, say news.google.com or www.cnn.com, but just
> cache the 'Top Stories', so for instance, I don't want
> the 'World' sub page, or 'sports' etc... Just the front
> page and the stories which are immediately linked to that
> page.
> How would I do this in winhttrack?
Filters (set options/scan rules) are generally good for 
that.

For example, if information related to "sports" is 
contained in the /sports folder in www.example.com, use:

-* +www.example.com/sports/*

But this can be also a parameter of the query string (such 
as ?sports)

-* +www.example.com/*?*sports*

But if there is no way to "detect" these links based on 
their name, it will be much more difficult..
 
Reply Create subthread


All articles

Subject Author Date
Filters and WinHTTrack

06/20/2003 23:14
Re: Filters and WinHTTrack

06/21/2003 07:31




1

Created with FORUM 2.0.11