| > (winhttrack
> -WC2%Ps2u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F ...
> <http://olympics.thestar.com/2010/article/767972--cox>
> -after-shootout-win-let-soul-searching-begin -O1 ...
> +*.png +*.gif +*.jpg
> +*.css +*.js -ad.doubleclick.net/*
> -mime:application/foobar )
1) don't use filters like that. Those allow files with those extentions to
come from anywhere, but other extentions, external to the site (EG
www.thestar.com) will not be mirrored. Just use the near option (get non-html
files related=checked)
2) Had you looked at the log file you would have seen the message about
robots.txt:
User-agent: *
Disallow: /*.axd$
That you will have to override to get those.
3) The site contains code like:
document.write('<scr'+'ipt language="javascript1.1"
So much of the site may not be renderable
4) Always run with no external pages=checked so you know what has not been
mirrored.
| |