HTTrack Website Copier
Free software offline browser - FORUM
Subject: httrack always downloads external pages... ?
Author: Richard
Date: 03/05/2001 01:47
 
Hi ! :-)

Let's take an example:

I want to mirror my homepage, which is
<http://www.complang.tuwien.ac.at/fachmann>
This is a very simple one, just plain HTML and some images. No frames, no
Java, nothing else...

One link on my page goes to the file
<http://www.isy-timing.de/result99/index.html> ...

I have not managed to tell WinHTTrack (2.02b and 3.00RC10) to download only
the files from my page and nothing else... the huge site www.easytiming.de (a
company which measures e.g. inline-skating race times) always gets
downloaded too...

(please feel free to try it with my page and to view its source code :-) )...

(I don't want to do this with filters, because normally I have to download
about 20 pages, listed in an automatically generated url_list.txt,
periodically, and I would have to set 20 filters each time...? It should be
possible with the options, but I think I have tried all combinations... :-)
).
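Since url_list.txt is generated automatically anyway, the per-page filters could perhaps be generated the same way instead of being typed in each time. Below is a minimal sketch in Python, not a tested HTTrack recipe. The function name scope_filters and the "+host/directory/*" pattern shape are my own assumptions for illustration; only the leading "-*" and the "+pattern" style come from the filters I tried. How HTTrack actually treats the resulting patterns would need to be checked against its filter documentation.

```python
from urllib.parse import urlsplit


def scope_filters(urls):
    """Build a filter list: a leading '-*', then one '+host/dir/*' per start URL.

    The pattern shape is an assumption for illustration, not verified
    against HTTrack's behaviour.
    """
    filters = ["-*"]
    for url in urls:
        parts = urlsplit(url.strip())
        if not parts.netloc:
            continue  # skip blank or malformed lines
        path = parts.path
        last = path.rsplit("/", 1)[-1]
        if "." in last:
            # Last segment looks like a file name: keep only its directory.
            directory = path[: len(path) - len(last)]
        else:
            # Last segment looks like a directory: make sure it ends in '/'.
            directory = path.rstrip("/") + "/"
        pattern = "+" + parts.netloc + directory + "*"
        if pattern not in filters:
            filters.append(pattern)
    return filters


# Example with the two URLs from this post (normally read from url_list.txt).
example_urls = [
    "http://www.complang.tuwien.ac.at/fachmann",
    "http://www.isy-timing.de/result99/index.html",
]
print(" ".join(scope_filters(example_urls)))
```

The printed line could then be pasted into the filter box or passed along with the URL list, so the 20 filters would not have to be set by hand on every run.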


And how can I download e.g. all JPEG images larger than 10 KB, and
nothing else (OK, if necessary, all *.htm and *.html files too) from 20 given
pages (and from no others), starting in each page's directory and including
all subdirectories, without setting filters for each page individually (I want
to use url_list.txt again)?
(I tried e.g. " -* +*.htm +*.html +*.jpg[>10] " or
" +*.htm +*.html +*.jpg[>10] -* " ... it never did what I wanted
:-)) )...

Thanks a lot in advance, bye, Richard :-)
 