> How about the trick filter
> -www.foo.com/bar/homepage.htm*
> I found that / and \ make a big difference. On a PC they are
> the same, but HTTrack only takes /.
> If I include '*' at the end, only one page is downloaded.
> If I don't include '*', it still goes in loops.
>
> Is this expected behavior?
"/" is the correct character for URLs. Windows/IE tries to be smart: when
you type "\", it automatically converts it to "/".
The "*" is vital to making the filter work. The above filter says, exclude
the page 'homepage.htm' and ANYTHING which comes after it in the URL -- the
querystring.
If the filter were "-www.server.com/homepage.htm" without the "*", a URL
pointing to that page with a querystring would still be downloaded.
Of course, this may be a problem if all pages on the site are served through
the same page, e.g.
www.server.com/page.nsf?page=home
www.server.com/page.nsf?page=page&id=12345
In this case you will be excluding all linked pages :(
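
To illustrate the difference the trailing "*" makes, here is a rough Python sketch. It is only a simplified model of the behaviour described above ("*" matches any run of characters, everything else is literal), not HTTrack's actual matching code. The is_excluded helper and the "?session=abc" querystring are made up for the example.

```python
import re


def is_excluded(url: str, exclude_pattern: str) -> bool:
    """Return True if url matches exclude_pattern in full.

    Simplified model (an assumption, not HTTrack's real engine):
    '*' matches any run of characters, everything else is literal.
    """
    # Escape regex metacharacters, then turn the escaped '*' back into '.*'.
    regex = re.escape(exclude_pattern).replace(r"\*", ".*")
    return re.fullmatch(regex, url) is not None


# Hypothetical URLs: the same page with and without a querystring.
urls = [
    "www.server.com/homepage.htm",
    "www.server.com/homepage.htm?session=abc",
]

for pattern in ("www.server.com/homepage.htm", "www.server.com/homepage.htm*"):
    for url in urls:
        print(f"-{pattern}  {url}  excluded={is_excluded(url, pattern)}")

# The catch-all case: one page serving the whole site.
for url in (
    "www.server.com/page.nsf?page=home",
    "www.server.com/page.nsf?page=page&id=12345",
):
    print(url, is_excluded(url, "www.server.com/page.nsf*"))
```

Under this model the pattern without "*" only catches the bare page, while the pattern with "*" also catches every querystring variant, which is why it wipes out everything on a site that routes all pages through page.nsf.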