> <http://water4gas.com/online-books/>
>
> Despite setting +water4gas.com/+*.gif +*.jpg +*.png
> +*.tif +*.bmp - and "get non-html links" checked
> (links tab).
I assume that's a typo and you meant: +water4gas.com/* +*.gif
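To show why the trailing * matters, here is a rough Python sketch of scan-rule matching. It is only an approximation: it uses fnmatch wildcards and assumes the last matching +/- rule wins. The `allowed` helper and the sample URLs are made up for illustration, and HTTrack's own default scope rules are not modeled.

```python
from fnmatch import fnmatchcase

def allowed(url, filters, default=False):
    """Return True if the last matching +/- rule accepts the URL.

    Approximation only: fnmatch-style wildcards, last match wins.
    """
    verdict = default
    for rule in filters:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(url, pattern):
            verdict = (sign == "+")
    return verdict

# Rule without a wildcard versus the corrected rule with a trailing *
original = ["+water4gas.com/", "+*.gif", "+*.jpg", "+*.png"]
corrected = ["+water4gas.com/*", "+*.gif", "+*.jpg", "+*.png"]

page = "water4gas.com/online-books/index.html"  # hypothetical page URL
print(allowed(page, original))   # False: no rule matches the page
print(allowed(page, corrected))  # True: "water4gas.com/*" matches it
```

Under these assumptions, "+water4gas.com/" only matches that exact string, so pages in subfolders are never picked up by the rule.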
> All I'm getting is .html. And it's not following
> into the folders below the main folder despite
> having the mirroring depth set to 5 and external
> depth at 0.
The log says the site is restricted by robots.txt / robots meta tag rules,
so you have to tell HTTrack not to follow robots.txt rules
(FollowRobotsTxt=0 in the profile below).
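For anyone wondering how a robots.txt rule can stop a crawler from descending into subfolders, here is a small sketch using Python's standard urllib.robotparser. The robots.txt content is hypothetical (the site's real file is not shown in this thread), and `can_fetch` with its `obey_robots` switch is an illustrative helper, not anything from HTTrack.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt; the site's real file is not shown in this thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /online-books/
"""

def can_fetch(url, agent="HTTrack", obey_robots=True):
    """Return True if a crawler may fetch url under the sample rules."""
    if not obey_robots:
        # Rough equivalent of turning robots.txt handling off
        return True
    parser = RobotFileParser()
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)

book = "http://water4gas.com/online-books/book.html"  # made-up URL
print(can_fetch(book))                     # False: blocked by Disallow
print(can_fetch(book, obey_robots=False))  # True: rules ignored
```

With a rule like this in place, a crawler that honors robots.txt skips everything below the folder no matter what depth or filters are set.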
> I've been at this for two hours. Can anyone please
> provide some suggestions or an example of the proper
> settings for this job?
I got HTTrack Website Copier/3.43 mirror complete in 4 minutes 21 seconds :
417 links scanned, 395 files written (14551481 bytes overall), 393 files
updated [14646649 bytes received at 56117 bytes/sec] (20 errors, 0 warnings,
393 messages)
The 20 errors are "Not Found" (404) on some images/js files.
My winprofile.ini:
Near=1
Test=0
ParseAll=1
HTMLFirst=1
Cache=1
NoRecatch=0
Dos=0
Index=1
WordIndex=0
Log=1
RemoveTimeout=0
RemoveRateout=1
KeepAlive=1
FollowRobotsTxt=0
NoErrorPages=1
NoExternalPages=1
NoPwdInPages=0
NoQueryStrings=0
NoPurgeOldFiles=1
Cookies=1
CheckType=1
ParseJava=1
HTTP10=0
TolerantRequests=0
UpdateHack=1
URLHack=1
StoreAllInCache=0
LogType=0
UseHTTPProxyForFTP=1
Build=5
PrimaryScan=3
Travel=1
GlobalTravel=0
RewriteLinks=0
BuildString=%%h%%p/%%n%%q.%%t
Category=
MaxHtml=
MaxOther=
MaxAll=
MaxWait=
Sockets=2
Retry=9
MaxTime=
TimeOut=300
RateOut=5
UserID=Mozilla/4.0 (compatible; MSIE 5.0; Win32)
Footer=<!-- Mirrored from %%s%%s by HTTrack Website Copier/3.x [XR&CO'2004], %%s -->
MaxRate=97100
WildCardFilters=
Proxy=
Port=
Depth=5
ExtDepth=
MaxConn=5
MaxLinks=
MIMEDefsExt1=asp,php3,php,php2,asp,jsp,pl,cfm,nsf
MIMEDefsExt2=wmv
MIMEDefsExt3=rmj
MIMEDefsExt4=
MIMEDefsExt5=
MIMEDefsExt6=
MIMEDefsExt7=
MIMEDefsExt8=
MIMEDefsMime1=text/html
MIMEDefsMime2=video/x-ms-wmv
MIMEDefsMime3=application/vnd.rn-realsystem-rmj
MIMEDefsMime4=
MIMEDefsMime5=
MIMEDefsMime6=
MIMEDefsMime7=
MIMEDefsMime8=
CurrentUrl=http://water4gas.com/online-books/
CurrentAction=6
CurrentURLList=
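If you want to compare your own profile against the one above, a quick Python sketch like this pulls out the keys that look most relevant here. The `parse_profile` helper is made up for illustration, the sample is a four-line excerpt of the listing above, and the meaning attached to each key in the comments is my reading of the names rather than documented behavior.

```python
def parse_profile(text):
    """Parse KEY=VALUE lines into a dict, splitting on the first '='."""
    settings = {}
    for line in text.splitlines():
        key, sep, value = line.partition("=")
        if sep:  # skip lines that have no '=' at all
            settings[key.strip()] = value.strip()
    return settings

# Excerpt of the profile listed above
SAMPLE = """\
Near=1
FollowRobotsTxt=0
Depth=5
ExtDepth=
"""

s = parse_profile(SAMPLE)
# Assumed meanings: robots.txt ignored, depth 5, non-HTML "near" files on
for key in ("FollowRobotsTxt", "Depth", "Near", "ExtDepth"):
    print(key, "=", repr(s.get(key)))
```

Running the same function over two profiles and comparing the dicts is an easy way to spot which options differ.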