HTTrack Website Copier
Free software offline browser - FORUM
Subject: Using HTTrack w/ WinXP, inc. parameters - a query
Author: Padraic I.M. MacUidhir
Date: 01/20/2004 17:43
 
Heill, Mr. Roche. I have a question that may or may not
be relevant to the usage of your program. I am curious
as to whether you or any other forum reader may be
aware of what I am doing wrong. I had little trouble
previously with HTTrack other than the problem you
answered here in the forum some time ago for me. 

The sites I have attempted to recently mirror are as
follows:

Bibliomania Free Online Literature and Study Guides
<http://www.bibliomania.com/1/frameset.html>

Project Runeberg
<http://www.lysator.liu.se/runeberg/>

Folklore and Mythology Electronic Texts
<http://www.pitt.edu/~dash/folktexts.html>

Mark Nodine's Online Welsh Course
<http://oldweb.cs.cf.ac.uk/fun/welsh/>

I have included the appropriate password/login data 
whenever it was needed. 

My current one is Mark Nodine's free Welsh course online.
I am getting the same type of error message from all
WWWsites that I try to mirror, though, so I figure this
problem is human error. ;) I include here the options
that I have been using for all recent attempted mirrors,
and also the error message that I receive for all of
them. In this, I request the assistance of anyone who
knows the application of HTTrack better than I do.

I am running WindowsXP, all useful updates installed,
MSOfficeXP installed, a few other things as well. My
Component Services are modified according to the 
advice found at 
<http://www.theeldergeek.com/services_guide.htm>
On this computer, the one on which I try to mirror,
I have a broadband cable connection via Time Warner,
and it gets plenty of I/O bandwidth.

I have no screen saver loaded. My hardware is a 1.3 GHz
Pentium 4 (the motherboard is an 850 series, if I remember
correctly), the video card is a 128 MB GeforceXT, and the
HDDs are plenty big for mirroring the sites. I cannot think
of anything else that may be relevant to the problem at
this point.

Here is the data I am using for HTTrack right now:

Web Address:
<http://oldweb.cs.cf.ac.uk/fun/welsh/>
<http://oldweb.cs.cf.ac.uk/fun/welsh/>*.*

Action: 
Download Web Site(s)

Set Options-Proxy:
none

Set Options-Scan Rules:
+*.css +*.js -ad.doubleclick.net/*
+*.zip +*.tar +*.tgz +*.gz +*.rar +*.z +*.exe
+*.gif +*.jpg +*.png +*.tif +*.bmp
+http://oldweb.cs.cf.ac.uk/fun/welsh/*
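
For anyone reading along, here is a rough Python sketch of how
scan rules like the ones above can be read. It assumes that the
rules are checked in order and that the last matching rule
decides, and it uses Python's fnmatch wildcards as a loose
stand-in for HTTrack's own pattern matching. The function name
`verdict` and the test URLs are made up for illustration; this
is not HTTrack code.

```python
from fnmatch import fnmatchcase

# The scan rules exactly as entered in the options above.
RULES = """
+*.css +*.js -ad.doubleclick.net/*
+*.zip +*.tar +*.tgz +*.gz +*.rar +*.z +*.exe
+*.gif +*.jpg +*.png +*.tif +*.bmp
+http://oldweb.cs.cf.ac.uk/fun/welsh/*
""".split()


def verdict(url, rules=RULES):
    """Return '+', '-', or None when no rule matches the URL.

    Assumption: rules are scanned first to last and a later
    match overrides an earlier one.
    """
    result = None
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(url, pattern):
            result = sign  # later rules take priority
    return result


if __name__ == "__main__":
    # Hypothetical URLs, used only to exercise the rules.
    for url in (
        "http://oldweb.cs.cf.ac.uk/fun/welsh/lesson01.html",
        "ad.doubleclick.net/click",
        "http://example.com/page.html",
    ):
        print(url, "->", verdict(url))
```

Under that last-match assumption, the order of the rules
matters: a broad rule placed late in the list can override a
narrower one placed earlier.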

Set Options-Limits:
Max Transfer Rate (B/s) - 5000
Max Connections / seconds - 2
Maximum number of links - 100000
[every other field is blank]

Set Options-Flow Control:
Number of Connections - 1
TimeOut(s) - 600 sec.
Retries - 2

Set Options-Links:
[all four options are checked]

Set Options-Build:
Local Structure Type - Site-structure (default)
[the only thing that is checked here is "Do not purge old
files"]

Set Options-Spider:
Accept Cookies (if unknown), Parse Java Files, and
Update hack (limit re-transfers) are all checked; the 
others are not. Spider is set to 'no robots.txt rules'.

Set Options-Experts Only:
Use a cache for updates is checked
[everything else is set to defaults, and Active Debugging
Mode (winhttrack.log) is *not* checked.]

Set Options-Log,Index,Cache:
[everything is checked *except* Make A Word Database]

Set Options-Browser ID:
Browser Identity: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)
HTML Footer: (none)

Set Options-MIME Associations
[still at default settings, i.e., php and others are set
to 'text/html']

****************
Mr. Roche, I would have sent this via e-mail, but I did not
know whether it would be read, and I figure that other folk
may have a similar problem, so 'tis best to post it publicly
for others to guide themselves by, should they come here
wanting to know the answer as well.

So, I apologise for the length of the post, but I figure
that if I am going to ask you or others for advice, I had
best provide the relevant data to make that advice possible.
I have worked in tech support before; I know what it is like
to not actually *be* where the problem is occurring.    ;)

Here is the error message I get, which is essentially the
same as the ones from the other WWWsites I listed, plus a
few others:

HTTrack3.30+swf launched on Tue, 20 Jan 2004 11:36:32 
at <http://oldweb.cs.cf.ac.uk/fun/welsh/> 
<http://oldweb.cs.cf.ac.uk/fun/welsh/>*.* 
+*.css +*.js -ad.doubleclick.net/* 
+*.zip +*.tar +*.tgz +*.gz +*.rar +*.z 
+*.exe +*.gif +*.jpg +*.png +*.tif +*.bmp 
+http://oldweb.cs.cf.ac.uk/fun/welsh/*
(winhttrack -qiC0%nt%PnX0s0u2k%sN0%I0p7DaK0c1T6
00R2H0%kf2A5000%c2#L100000%f0#f -F "Mozilla/4.0 
(compatible; MSIE 6.0; Windows NT 5.0)" -%F  -P 
WINTERMUTE:8123 -%l "en, *" 
<http://oldweb.cs.cf.ac.uk/fun/welsh/> 
<http://oldweb.cs.cf.ac.uk/fun/welsh/>*.* -O 
"E:\My Web Sites\OnlineWelshCourseNodine,
E:\My Web Sites\OnlineWelshCourseNodine" 
+*.css +*.js -ad.doubleclick.net/* 
+*.zip +*.tar +*.tgz +*.gz +*.rar +*.z +*.exe 
+*.gif +*.jpg +*.png +*.tif +*.bmp 
+http://oldweb.cs.cf.ac.uk/fun/welsh/* 
-%A php3,php,php2,asp,jsp,pl,cfm,nsf=text/html )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may 
contain 
sensitive information, such as username/password 
authentication 
for websites mirrored in this project do not share these 
files/folders if you want these information to remain 
private
11:36:36 Error:  "Interrupted transfer" (-4) after 2 retries at link oldweb.cs.cf.ac.uk/fun/welsh/ (from primary/primary)
11:36:36 Error:  "Interrupted transfer" (-4) after 2 retries at link oldweb.cs.cf.ac.uk/fun/welsh/*.* (from primary/primary)
11:36:36 Info:  No data seems to have been transfered during this session! : restoring previous one!
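
One thing in the log can be checked mechanically. The options
listed earlier give the proxy as "none", yet the logged command
line contains `-P WINTERMUTE:8123`, and `-P` is HTTrack's
command-line option for a proxy. Below is a minimal Python
sketch that pulls the `-P` value out of a logged command line so
it can be compared with what the options dialog shows. The
function name `logged_proxy` and the regular expression are
illustrative only, and whether this proxy entry has anything to
do with the "Interrupted transfer" errors is not established
here.

```python
import re

# Excerpt of the command line from the log above, including the
# line break that falls between "-P" and its value.
LOGGED = (
    '-F "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)" '
    '-%F  -P \nWINTERMUTE:8123 -%l "en, *"'
)


def logged_proxy(command_line):
    """Return the value following a standalone -P option, or None."""
    # Require whitespace (or start of string) before -P so that
    # option clusters such as "-qiC0%nt%Pn..." are not matched.
    match = re.search(r"(?:^|\s)-P\s+(\S+)", command_line)
    return match.group(1) if match else None


if __name__ == "__main__":
    print(logged_proxy(LOGGED))
```

If the value printed differs from what the Proxy tab shows, the
project may still be carrying an older proxy setting.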
 