| I am trying to mirror this site with WinHTTrack 3.15:
<http://www.freiburg.linux.de/~uae/>
When HTTrack gets to this link (which should be
downloaded and saved according to the filters) it
appears to download it, but no file or directory is
created for it. One of the files in question is:
<http://www.freiburg.linux.de/~uae/bin/sources/develop/u>
ae-0.8.21.tar.gz
All other files in the
<http://www.freiburg.linux.de/~uae/bin/sources/>
path are being skipped too.
I have HTTrack setup this way:
-ignore robots.txt
-download all *.zip *.gz etc on the scan filters page
-get near non-html files
-internal and external levels are 'blank' (default)
Unsuccessfully tried:
-2 different computers on different Internet
connections, one with Win95, one with Win98SE with no
luck.
No matter what I those files are not downloaded and
saved. (Or seemed to be downloaded, but then thrown
away/lost/deleted).
The whole website + linked files is about 6mb, and it
mostly works using another free website copier called
WebReaper (but that program lacks Javascript support).
You can see a copy of the winprofile.ini and hts-
log.txt files here if it helps for diagnosis.
<http://kazemizadeh.net:8080/httrackproblem>-
uae/winprofile.ini.txt
<http://kazemizadeh.net:8080/httrackproblem-uae/hts>-
log.txt
----------------------------------------
Here is the contents of winprofile.ini
WinHTTrack creates in the htt-cache folder:
Near=1
Test=0
ParseAll=1
HTMLFirst=0
Cache=1
NoRecatch=0
Dos=0
Index=1
WordIndex=0
Log=1
RemoveTimeout=0
RemoveRateout=0
FollowRobotsTxt=0
NoErrorPages=0
NoExternalPages=0
NoPwdInPages=1
NoQueryStrings=0
NoPurgeOldFiles=1
Cookies=1
CheckType=1
ParseJava=1
HTTP10=0
TolerantRequests=0
UpdateHack=1
StoreAllInCache=0
LogType=1
UseHTTPProxyForFTP=0
Build=0
PrimaryScan=3
Travel=1
GlobalTravel=0
RewriteLinks=0
BuildString=%%h%%p/%%n%%q.%%t
MaxHtml=
MaxOther=
MaxAll=
MaxWait=
Sockets=4
Retry=
MaxTime=
TimeOut=
RateOut=
UserID=Mozilla/4.5 (compatible; HTTrack 3.0x; Windows
98)
Footer=<!-- Mirrored from %%s by HTTrack Website
Copier/3.15 [XR&CO'2001] -->
MaxRate=
WildCardFilters=+*.css +*.js -ad.doubleclick.net/*%0d%
0a-ad.linksynergy.com/* -www.respond.com/*%0d%0a+*.gif
+*.jpg +*.png +*.tif +*.bmp%0d%0a+*.zip +*.tar +*.tgz
+*.gz +*.rar +*.z +*.exe%0d%0a+*.mov +*.mpg +*.mpeg
+*.avi +*.asf +*.mp3 +*.mp2 +*.rm +*.wav +*.vob +*.qt
+*.vid +*.ac3
Proxy=
Port=
Depth=
ExtDepth=
MaxConn=
MaxLinks=
MIMEDefsExt1=php,php3,asp
MIMEDefsExt2=
MIMEDefsExt3=
MIMEDefsExt4=
MIMEDefsExt5=
MIMEDefsExt6=
MIMEDefsExt7=
MIMEDefsExt8=
MIMEDefsMime1=text/html
MIMEDefsMime2=
MIMEDefsMime3=
MIMEDefsMime4=
MIMEDefsMime5=
MIMEDefsMime6=
MIMEDefsMime7=
MIMEDefsMime8=
CurrentUrl=http://www.freiburg.linux.de/~uae/
CurrentAction=0
CurrentURLList=
| |