HTTrack Website Copier
Free software offline browser - FORUM
Subject: httrackrc Configuration
Author: John Bowyer
Date: 06/01/2005 12:11
 
This program is awesome but I could use some help.  

Can some one help me port my windows configurations to the commandline?
After much studying of the online help I seem to have entered all the
parameters I need in WinHttrack to mirror web site.  I am having a great deal
of difficulty porting these options to the dos command line through either the
command line options or a configuration file.

This configuration gets all files including aspx, js and gif files from the my
site.  I map the aspx to text files as a hack to prevent WinHttrack from
renaming the extension while still allowing it to parse for links, others may
be interested in this.  I am using basic auth so I want to hide passwords.  I
added 20+ files in the webaddresses because I wanted to make sure the program
downloaded all the js files, some of which are dynamically loaded based on the
users operating system.

I have included a text description of my user interface settings and a copy of
my options file below.

Web Addresses
<http://user:password@dev.mysite.info/default.aspx>
.. 20+ more of these

Scan Rules
+*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/* +*.aspx
-dev.i-site.info/ResourceConnect/_layouts/1033/*.aspx
-dev.i-site.info/default.aspx

Links
Attempt to detect all links
Get non-HTML files related to link, eg external .zip

MimeTypes	
aspx <->	text/text

Build	
Hide Passwords	
%h%p/%n%q.%t 	

My option file is below:

Near=1
Test=0
ParseAll=1
HTMLFirst=0
Cache=1
NoRecatch=0
Dos=0
Index=0
WordIndex=0
Log=1
RemoveTimeout=0
RemoveRateout=0
KeepAlive=1
FollowRobotsTxt=2
NoErrorPages=0
NoExternalPages=0
NoPwdInPages=1
NoQueryStrings=0
NoPurgeOldFiles=0
Cookies=1
CheckType=1
ParseJava=1
HTTP10=0
TolerantRequests=0
UpdateHack=1
StoreAllInCache=0
LogType=0
UseHTTPProxyForFTP=1
Build=14
PrimaryScan=3
Travel=1
GlobalTravel=0
RewriteLinks=0
BuildString=%%h%%p/%%n%%q.%%t 
MaxHtml=
MaxOther=
MaxAll=
MaxWait=
Sockets=
Retry=
MaxTime=
TimeOut=
RateOut=
UserID=Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)
Footer=(none)
MaxRate=25000
WildCardFilters=+*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
+*.aspx%0d%0a-website/ResourceConnect/_layouts/1033/*.aspx%0d%0a-website/default.aspx
%0d%0a
Proxy=
Port=
Depth=
ExtDepth=
MaxConn=
MaxLinks=
MIMEDefsExt1=php3,php,php2,asp,jsp,pl,cfm,nsf
MIMEDefsExt2=aspx
MIMEDefsExt3=
MIMEDefsExt4=
MIMEDefsExt5=
MIMEDefsExt6=
MIMEDefsExt7=
MIMEDefsExt8=
MIMEDefsMime1=text/html
MIMEDefsMime2=text/text
MIMEDefsMime3=
MIMEDefsMime4=
MIMEDefsMime5=
MIMEDefsMime6=
MIMEDefsMime7=
MIMEDefsMime8=
CurrentUrl=http://user:password@website/ResourceConnect/default.aspx%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/browser_ee.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/browser_ice.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/browser_ie.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/browser_konq.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/browser_ns.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/browser_ns6.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/browser_opera.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/browser_opera7.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/dqm_loader.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/global.css%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/global.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/ie.css%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/mac.css%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/ns4.css%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/ns6.css%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/sample_settings.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/tbrowser_ee.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/tbrowser_ice.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/tbrowser_ie.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/tbrowser_konq.js%0d%0ahttp://user:password@website/_layouts/resourceconnect/resources/tbrowser_ns.js%0d%0a
CurrentAction=5
CurrentURLList=
 
Reply


All articles

Subject Author Date
httrackrc Configuration

06/01/2005 12:11




3

Created with FORUM 2.0.11