HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: cant view copied page as it appeas online!
Author: William Roeder
Date: 04/11/2010 17:34
 
> (winhttrack
> -WC2%Ps2u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F ...
> <http://olympics.thestar.com/2010/article/767972--cox>
> -after-shootout-win-let-soul-searching-begin -O1 ...
> +*.png +*.gif +*.jpg
> +*.css +*.js -ad.doubleclick.net/*
> -mime:application/foobar )
1) don't use filters like that. Those allow files with those extentions to
come from anywhere, but other extentions, external to the site (EG
www.thestar.com) will not be mirrored. Just use the near option (get non-html
files related=checked)

2) Had you looked at the log file you would have seen the message about
robots.txt:
User-agent: *
Disallow: /*.axd$
That you will have to override to get those.

3) The site contains code like:
document.write('<scr'+'ipt language="javascript1.1"
So much of the site may not be renderable

4) Always run with no external pages=checked so you know what has not been
mirrored.
 
Reply Create subthread


All articles

Subject Author Date
cant view copied page as it appeas online!

04/10/2010 02:12
Re: cant view copied page as it appeas online!

04/10/2010 16:00
Re: cant view copied page as it appeas online!

04/11/2010 00:33
Re: cant view copied page as it appeas online!

04/11/2010 17:34




6

Created with FORUM 2.0.11