| > I am trying to mirror my web site which is not
public
> yet. I receive the opening page and the first page
of
> each tab (about us etc.), but cannot go deeper.
When
> I view the result and try to access the next level
> down it wants to go on-line to access those pages.
The "deeper" pages may be in another domain OR another
higher/different structures. In this case, use filters
(Options/Scan rules).
Example:
+www.yoursite.com/*
to accept everything on www.yoursite.com
or
+www.yoursite.com/foobar/*
to accept everything in www.yoursite.com/foobar/
Why is this needed?By default, HTTrack will always stay on the same
domain (except for images), and stay on the same
DIRECTORY structure (OR in deeper directory
structures). For example, it may go from
www.foo.com/bar/ to www.foo.com/bar/babar/ but NOT
from www.foo.com/bar/ to www.foo.com/files/,
because "files" is seen as a "same level" directory.
It won't go to www.anotherfoo.com, too.
These default rules are setup to avoid too large
mirrors ; for example mirroring
www.geocities.com/custommer52145/ should NOT cause to
mirror all other custommer websites. And the "same
domaine" limit is also an obvious protection: you may
not want to mirror the WHOLE WWW :)
| |