Subject: Re: link is probably looping, type unknown, aborti
Author: Hosh
Date: 02/25/2007 16:19
Hi,
thanks for this great Software! It's more transparent and customizable than
Teleport and OS!
I agree the problem "link is probably looping, type unknown, aborting" is a
nasty one (WinHTTrack 3.41-2).
Unsure I am bout wether it maybe is an intentional server side "feature",
maybe invented to hinder webspidering (server in this case seemed to be
Apache)? Because also other webspiders in a project often don't get all files
clean in one run (from my experience: seems to be almost inevitable to do
iterative re-runs!)
My goal was to download pictures with a linklist of links like
<http://hoshhosh.net/image.php?id=925>.
These refer to a php page which holds an embedded picture.
It seems that roughly 50 % (which select randomly) of the files from my
linklist will download.
Funny thing is, when I copied the "looped" links from the protocol and reran
them as a new project, again 50 % would be successful - And so on.
A simple hint / workaround for me is to just update the old project a few
times, then all of the files will load.
BTW Does anyone have an idea if some HTTrack preferences maybe already solve
such issues?
Here is data from my winprofile.ini:
Near=1 Test=0 ParseAll=1 HTMLFirst=0 Cache=1 NoRecatch=0 Dos=0 Index=0
WordIndex=0 Log=1 RemoveTimeout=0 RemoveRateout=0 KeepAlive=1
FollowRobotsTxt=-1 NoErrorPages=0 NoExternalPages=0 NoPwdInPages=0
NoQueryStrings=0 NoPurgeOldFiles=0 Cookies=1 CheckType=-1 ParseJava=1 HTTP10=0
TolerantRequests=1 UpdateHack=1 URLHack=0 StoreAllInCache=0 LogType=-1
UseHTTPProxyForFTP=0 Build=-1 PrimaryScan=-1 Travel=-1 GlobalTravel=-1
RewriteLinks=-1 BuildString=%%h%%p/%%n%%q.%%t Category= MaxHtml= MaxOther=
MaxAll= MaxWait= Sockets=3 Retry= MaxTime= TimeOut= RateOut=
UserID=Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US; rv:1.1) Gecko/20020826
Footer= MaxRate=999999 WildCardFilters=+*.jpg -ad.doubleclick.net/* Proxy=
Port= Depth=1 ExtDepth= MaxConn= MaxLinks= MIMEDefsExt1= MIMEDefsExt2=
MIMEDefsExt3= MIMEDefsExt4= MIMEDefsExt5= MIMEDefsExt6= MIMEDefsExt7=
MIMEDefsExt8= MIMEDefsMime1= MIMEDefsMime2= MIMEDefsMime3= MIMEDefsMime4=
MIMEDefsMime5= MIMEDefsMime6= MIMEDefsMime7= MIMEDefsMime8= CurrentUrl=
CurrentAction=6 CurrentURLList= <bla>