Good evening - I have searched the forum and there seems to be a recurring
issue that I am also experiencing: the mirror does not seem to capture
the entire site, and the pages that are captured have many (not all) links
pointing to the online "real" or "live" website. I did a little digging into
the index.html page that HTTrack downloaded and noticed that URLs following the
"src=" attribute are in fact converted to the local file locations on my hard
drive (the mirrored locations). URLs that follow "OPTION VALUE=",
"background=", "onmouseover..." and, oddly enough, "href" are not converted to
the local file location.
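For anyone who wants to reproduce this check without reading the HTML by eye, here is a minimal sketch in Python. It counts, per attribute, how many values in a mirrored page still mention the live host. The names `LinkAudit`, `audit` and `WATCHED` are made up for illustration, and the substring test is only a rough heuristic, not how HTTrack itself decides what to rewrite.

```python
from collections import Counter
from html.parser import HTMLParser

LIVE_HOST = "www.vinyflex.com"  # host taken from the log in this post

# Attributes named in the post; "value" is only checked on <option> tags.
WATCHED = {"src", "href", "background"}


class LinkAudit(HTMLParser):
    """Tally, per attribute, values that still mention the live host."""

    def __init__(self, live_host):
        super().__init__()
        self.live_host = live_host
        self.remote = Counter()  # attribute -> values still pointing online
        self.local = Counter()   # attribute -> values without the live host

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        for name, value in attrs:
            if not value:
                continue
            is_event = name.startswith("on")  # onmouseover, onclick, ...
            is_option_value = tag == "option" and name == "value"
            if name not in WATCHED and not is_event and not is_option_value:
                continue
            if self.live_host in value:
                self.remote[name] += 1
            else:
                self.local[name] += 1


def audit(html_text, live_host=LIVE_HOST):
    """Return (remote, local) Counters for one page of HTML text."""
    parser = LinkAudit(live_host)
    parser.feed(html_text)
    parser.close()
    return parser.remote, parser.local
```

To use it, read the mirrored index.html into a string (on my machine I assume it sits under H:\Test_Data\Test\www.vinyflex.com\) and pass it to `audit`. A non-empty `remote` counter for "href" or "onmouseover" would match what I am seeing.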
I am ignoring robots.txt, have changed the browser ID to a non-HTTrack one,
etc.
I would greatly appreciate some insight - I've been adjusting parameters for
several days now. I hope there's room for the log file...
HTTrack3.43-3+htsswf+htsjava launched on Fri, 13 Aug 2010 18:04:25 at
<http://www.vinyflex.com> +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar
(winhttrack
-qwr10%e1C2t%Pns0u1%B%s%uN0%I0p7DlK0c3T30J1200R2H0%kf2A25000%c1#L100000%f#f -F
"Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)" -%F "<!-- Mirrored from
%s%s by HTTrack Website Copier/3.x [XR&CO'2008], %s -->" -%l "en, en, *"
<http://www.vinyflex.com> -O1 H:\Test_Data\Test +*.png +*.gif +*.jpg +*.css
+*.js -ad.doubleclick.net/* -mime:application/foobar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
such as username/password authentication for websites mirrored in this
project
do not share these files/folders if you want these information to remain
private
18:05:13 Info: engine: warning: entry cleaned up, but no trace on heap:
www.vinyflex.com/page/eng/images/space1.gif
(H:/Test_Data/Test/www.vinyflex.com/page/eng/images/space1.gif)
18:05:14 Info: engine: warning: entry cleaned up, but no trace on heap:
www.vinyflex.com/page/eng/images/title_left_corner.jpg
(H:/Test_Data/Test/www.vinyflex.com/page/eng/images/title_left_corner.jpg)
18:05:15 Info: engine: warning: entry cleaned up, but no trace on heap:
www.vinyflex.com/page/eng/images/sample01.jpg
(H:/Test_Data/Test/www.vinyflex.com/page/eng/images/sample01.jpg)
18:05:16 Info: engine: warning: entry cleaned up, but no trace on heap:
www.vinyflex.com/page/eng/images/sample02.jpg
(H:/Test_Data/Test/www.vinyflex.com/page/eng/images/sample02.jpg)
... there's a lot of those messages ...
... and then some "404" errors ...
18:06:29 Error: "Not Found" (404) at link
www.vinyflex.com/link/hospital_on.gif (from www.vinyflex.com/)
18:06:29 Error: "Not Found" (404) at link www.vinyflex.com/link/homeB_on.gif
(from www.vinyflex.com/)
18:06:29 Error: "Not Found" (404) at link
www.vinyflex.com/link/transport_on.gif (from www.vinyflex.com/)
HTTrack Website Copier/3.43-3 mirror complete in 3 minutes 28 seconds : 103
links scanned, 143 files written (817017 bytes overall) [690610 bytes received
at 3320 bytes/sec], 2.2 requests per connection
(22 errors, 1 warnings, 63 messages)
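Since I trimmed the log above, here is a minimal sketch of how the full list of failed links could be pulled out of hts-log.txt with Python. The function name `failed_links` and the `FailedLink` tuple are hypothetical, and the pattern is only inferred from the three error lines quoted in this post, so other message formats in the log would be skipped.

```python
import re
from typing import List, NamedTuple


class FailedLink(NamedTuple):
    time: str
    reason: str
    status: int
    url: str
    referrer: str


# Modelled on lines such as:
#   18:06:29 Error: "Not Found" (404) at link <url> (from <referrer>)
# \s+ is used between fields so entries wrapped across lines still match.
ERROR_RE = re.compile(
    r'(\d{2}:\d{2}:\d{2})\s+Error:\s+"([^"]*)"\s+\((\d{3})\)'
    r'\s+at\s+link\s+(\S+)\s+\(from\s+([^)\s]+)\)'
)


def failed_links(log_text: str) -> List[FailedLink]:
    """Extract every 'Error: ... at link ...' entry from the log text."""
    return [
        FailedLink(time, reason, int(status), url, referrer)
        for time, reason, status, url, referrer in ERROR_RE.findall(log_text)
    ]
```

Grouping the result by `referrer` would show which pages the broken links come from. In the excerpt above, all three are referenced from www.vinyflex.com/.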