HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Not deep enough
Author: Xavier Roche
Date: 04/25/2002 07:49
 
> I am trying to mirror my web site which is not 
public 
> yet.  I receive the opening page and the first page 
of 
> each tab (about us etc.), but cannot go deeper.  
When 
> I view the result and try to access the next level 
> down it wants to go on-line to access those pages.  

The "deeper" pages may be in another domain OR another 
higher/different structures. In this case, use filters 
(Options/Scan rules).

Example:
+www.yoursite.com/*
to accept everything on www.yoursite.com
or
+www.yoursite.com/foobar/*
to accept everything in www.yoursite.com/foobar/

Why is this needed?By default, HTTrack will always stay on the same 
domain (except for images), and stay on the same 
DIRECTORY structure (OR in deeper directory 
structures). For example, it may go from 
www.foo.com/bar/ to www.foo.com/bar/babar/ but NOT 
from www.foo.com/bar/ to www.foo.com/files/, 
because "files" is seen as a "same level" directory.
It won't go to www.anotherfoo.com, too. 

These default rules are setup to avoid too large 
mirrors ; for example mirroring 
www.geocities.com/custommer52145/ should NOT cause to 
mirror all other custommer websites. And the "same 
domaine" limit is also an obvious protection: you may 
not want to mirror the WHOLE WWW :)

 
Reply Create subthread


All articles

Subject Author Date
Not deep enough

04/25/2002 06:07
Re: Not deep enough

04/25/2002 07:49




4

Created with FORUM 2.0.11