HTTrack Website Copier
Free software offline browser - FORUM
Subject: bug- some links are not changed in the local copy
Author: rich painter
Date: 04/16/2004 08:13
 
the log file:
HTTrack3.32+swf launched on Thu, 15 Apr 2004 23:33:19 at
<http://www.cisco.com/univercd/cc/td/doc/product/dsl_prod/6160/>
+*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
(winhttrack -qiC2%Ps2u1%sN0%I0p3DaK0H0%kf2A25000%f#f -F
"Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)" -%F
"<!-- Mirrored from %s%s by HTTrack Website Copier/3.x
[XR&CO'2004], %s -->" -%l "en, en, *"
<http://www.cisco.com/univercd/cc/td/doc/product/dsl_prod/6160/>
-O "D:\tmp\Cisco 6160\Cisco 6160,D:\tmp\Cisco 6160\Cisco
6160" +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -%A
php3,php,php2,asp,jsp,pl,cfm,nsf=text/html )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may
contain sensitive information,
 such as username/password authentication for websites
mirrored in this project
 do not share these files/folders if you want these
information to remain private
23:33:21 Info:  Note: due to www.cisco.com remote robots.txt
rules, links begining with these path will be forbidden:
/bug-navigator, /cgi-bin, /pcgi-bin, /univ-src/ccden,
/cpropub/univercd, /jobs (see in the options to disable this)
23:35:52 Error:  "Not Found" (404) at link
www.cisco.com/univercd/cc/td/doc/product/dsl_prod/6160/user/index.htm
(from
www.cisco.com/univercd/cc/td/doc/product/dsl_prod/6160/upgrde/index.htm)
23:35:59 Error:  "Not Found" (404) at link
www.cisco.com/univercd/cc/td/doc/product/dsl_prod/6160/hwguide/messa
(from
www.cisco.com/univercd/cc/td/doc/product/dsl_prod/6160/hwguide/03inpref.htm)
23:35:59 Warning:  Warning, link #126 empty
No files purged
HTTrack Website Copier/3.32 mirror complete in 2 minutes 40
seconds : 125 links scanned, 122 files written (23279148
bytes overall), no files updated [59918 bytes received at
374 bytes/sec]
(2 errors, 1 warnings, 1 messages)


disregard the "not found" links as these are NOT the problem
i an reporting.

1. there is no mention in the log (and no error log file
does not exist) that the page in question "should" be a problem.

2. one of the pages that has the links that are not
rewritten is
www.cisco.com/univercd/cc/td/doc/product/dsl_prod/6160/software/index.htm

the link named "Cisco DSL Manager" on this page does NOT get
rewritten but instead continues to point to the cisco web
site:
<http://www.cisco.com/univercd/cc/td/doc/product/rtrmgmt/cdm/index.htm>

3. there are many of these similar pages and links that do
not get rewritten.

4. yet, other similarly located files do get correctly
rewritten links.

5. NONE of the pages that have these problems are referenced
in the "robots.txt".

is this posting sufficient to report this bug?
thanks,
rich
 
Reply


All articles

Subject Author Date
bug- some links are not changed in the local copy

04/16/2004 08:13
Re: bug- some links are not changed in the local copy

04/18/2004 19:08
Re: bug- some links are not changed in the local c

04/19/2004 09:07
Re: bug- some links are not changed in the local c

04/22/2004 15:24
Re: bug- some links are not changed in the local copy

02/16/2009 21:29
Re: bug- some links are not changed in the local copy

02/16/2009 21:29




e

Created with FORUM 2.0.11