|
> <http://www.dianeetphilippe.canalblog.com/robots.txt>
> User-agent: OmniExplorer_Bot
> Disallow: /
>
> options -> spider -> Spider=no robots
Thanks William for your idea.
I did what you said in the options : "options -> spider -> Spider=no robots".
Httrack scanned 9 links instead of 4 last time. The mirrored Website it always
the first page, no more !
The log is :
HTTrack3.42-3+htsswf+htsjava launched on Fri, 26 Sep 2008 15:37:08 at
<http://www.dianeetphilippe.canalblog.com/> +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar +*.gif +*.jpg +*.png +*.tif +*.bmp +*.zip +*.tar
+*.tgz +*.gz +*.rar +*.z +*.exe +*.mov +*.mpg +*.mpeg +*.avi +*.asf +*.mp3
+*.mp2 +*.rm +*.wav +*.vob +*.qt +*.vid +*.ac3 +*.wma +*.wmv
(winhttrack -qwC2%Ps0u1%s%uN0%I0p3DaK0c1H0%kf2A25000%f#f -F "Mozilla/4.5
(compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by
HTTrack Website Copier/3.x [XR&CO'2007], %s -->" -%l "en, en, *"
<http://www.dianeetphilippe.canalblog.com/> -O1 "E:\Mariage Diane-Philippe\Site
au complet\Diane 2" +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar +*.gif +*.jpg +*.png +*.tif +*.bmp +*.zip +*.tar
+*.tgz +*.gz +*.rar +*.z +*.exe +*.mov +*.mpg +*.mpeg +*.avi +*.asf +*.mp3
+*.mp2 +*.rm +*.wav +*.vob +*.qt +*.vid +*.ac3 +*.wma +*.wmv )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
such as username/password authentication for websites mirrored in this
project
do not share these files/folders if you want these information to remain
private
15:37:12 Warning: link is probably looping, type unknown, aborting:
storage.canalblog.com/73/10/510364/30111975.jpg
HTTrack Website Copier/3.42-3 mirror complete in 21 seconds : 9 links scanned,
8 files written (418068 bytes overall) [422494 bytes received at 20118
bytes/sec]
(No errors, 1 warnings, 0 messages).
What do you think about it ?
bedeka
| |