Hi all,
I've used HTTrack to copy other sites with great success.
I recently tried to mirror part of www.books24x7.com.
I am registered there and can view the books online;
however, when I try to copy the site, HTTrack does not retrieve the pages. I
know that they dynamically generate link ID values in
their HTML.
Has anyone tried copying from this site before? Thanks for any comments.
Cheers,
Eric
Here is a snippet of the log...
09:40:37 Info: Note: www.books24x7.com robots.txt rules
are too restrictive, ignoring /
09:40:37 Info: Note: due to www.books24x7.com remote
robots.txt rules, links begining with these path will be
forbidden: /images/, /bookimages/, /coverimages/,
/viewer.asp, /viewer_l.asp, /viewer_r.asp (see in the options to
disable this)
09:40:38 Warning: File has moved from
www.books24x7.com/book/id_5450/viewer.asp?bookid=5450&chunkid=0186060801 to
offline.asp
No files purged
HTTrack mirror complete in 3 seconds : 4 links scanned, 2
files written (6311 bytes overall) [927 bytes received at
309 bytes/sec]
(No errors, 1 warnings, 2 messages)
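
As a rough check of the robots.txt note in the log, here is a minimal Python sketch using the standard library's urllib.robotparser. The robots.txt text is an assumption, rebuilt only from the paths HTTrack listed above (the site's real file is not shown and may differ), and the helper name is_allowed is hypothetical. Python's prefix matching may also not be identical to HTTrack's own.

```python
from urllib.robotparser import RobotFileParser

# Assumed robots.txt, rebuilt only from the forbidden paths in the HTTrack log.
# The site's real file is not shown in this post and may contain other rules.
ASSUMED_ROBOTS_TXT = """\
User-agent: *
Disallow: /images/
Disallow: /bookimages/
Disallow: /coverimages/
Disallow: /viewer.asp
Disallow: /viewer_l.asp
Disallow: /viewer_r.asp
"""


def is_allowed(url, robots_txt=ASSUMED_ROBOTS_TXT, agent="*"):
    """Return True if the given robots.txt text permits fetching url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    urls = [
        # Top-level viewer script named in the forbidden list.
        "http://www.books24x7.com/viewer.asp",
        # The URL from the "File has moved" warning in the log.
        "http://www.books24x7.com/book/id_5450/viewer.asp"
        "?bookid=5450&chunkid=0186060801",
    ]
    for url in urls:
        print(is_allowed(url), url)
```

Under these assumptions, the /book/id_5450/viewer.asp URL does not start with any of the listed prefixes, so prefix matching would not forbid it. That suggests the redirect to offline.asp may matter more than the robots.txt rules here, though that is only a guess from this log.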