HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Scan Rules in WinTTrack Web site Copier how to use
Author: Xavier Roche
Date: 06/17/2002 21:08
 
> For instance  from the site
> <http://members.tripod.com/bub125/album/albums.html> I
> want to download only the pages shown under Important 
Links 
> ( 1, 2, 3, 4, 5, & 6) and
> that too without any ads and counters in the links.

Hum, ads are indeed VERY annoying on tripod :)

First use 'strict' filters (go to Options/Scan rules), like:
-* +members.tripod.com/bub125/album/*

This will exclude EVERYTHING by default. Then, you may 
accept links or link types categories by categiries

The problem with this site is that there are MANY external 
links (to geocities.com, for example) and therefore 
identifying exactly what to download is quite difficult
For example, the first 'important 'link' is located at 
geocities.com:
<http://www.geocities.com/dkbhat/kundli/ALBUM1.HTML>
and has many images located at:
<http://www.geocities.com/dkbhat/kundli/xxx.jpg>

I would then suggest the additional filters:
+www.geocities.com/dkbhat/kundli/*
+www.geocities.com/*.jpg

The second one is at fortunecity:
<http://www.fortunecity.com/skyscraper/daisywheel/1285/Family>
Tree.html

Therefore, use:
+www.fortunecity.com/skyscraper/daisywheel/1285/*

and so on.. you'll have something like:
-* +members.tripod.com/bub125/album/*
+www.geocities.com/dkbhat/kundli/*
+www.geocities.com/*.jpg
+www.fortunecity.com/skyscraper/daisywheel/1285/*
..

but og course, you'll have to note each links and then add 
proper filter(s) (note: do not put the leading http:// on 
filters)

This may appear quite complicated, but this site IS hard to 
capture..
 
Reply Create subthread


All articles

Subject Author Date
Scan Rules in WinTTrack Web site Copier how to use

06/14/2002 20:50
Re: Scan Rules in WinTTrack Web site Copier how to use

06/17/2002 21:08
Re: Scan Rules in WinTTrack Web site Copier how to use

01/19/2004 12:44




d

Created with FORUM 2.0.11