HTTrack Website Copier
Free software offline browser - FORUM
Subject: Mirror without footer from Command Prompt?
Author: Iain Elder
Date: 11/24/2012 21:58
 
I created a project using WinHTTrack. It mirrors pages without a footer. It can
continue a canceled or timed-out mirroring session with no apparent problems.

When I used the command `httrack --continue` to start a mirroring session from
the command line, all I got was output like this:

> Example: -%F "<!-- Mirrored from %s by HTTrack Website Copier/3.x
[XR&CO'2010], %s -->"
> * Option %F needs to be followed by a blank space, and a footer string

I worked around the problem by removing the %F parameter from parameter from
doit.log.

Now the command produces output like this:

> Mirror launched on Sat, 24 Nov 2012 19:22:10 by HTTrack Website
Copier/3.46+htsswf+htsjava [XR&CO'2010]
> mirroring
<http://saa.gov.uk/search.php?SEARCHED=1&SEARCH_TABLE=council_tax&SEARCH_TERM=City+of+Edinburgh&DISPLAY_COUNT=100>
-* +*search.php?SEARCHED=1* -*DISPLAY_MODE=FULL* with the wizard help..
> Done.
> Thanks for using HTTrack!

That's better, but httrack added a footer to all new pages like this:

> <!-- Mirrored from
saa.gov.uk/search.php?SEARCHED=1&SEARCH_TABLE=council_tax&SEARCH_TERM=City+of+Edinburgh%2C+EDINBURGH&DISPLAY_COUNT=100&PAGE=0&ASSESSOR_ID=&TYPE_FLAG=C&ORDER_BY=SET+DESC&ORIGINAL_SEARCH_TERM=City+of+Edinburgh&DRILL_SEARCH_TERM=CLAREMONT+GARDENS%2C+EDINBURGH&DD_TOWN=EDINBURGH&DD_STREET=CLAREMONT+GARDENS
by HTTrack Website Copier/3.x [XR&CO'2010], Sat, 24 Nov 2012 19:52:28 GMT -->

Before I removed the %F parameter, the first line of my doit.log looked like
this:

> -qiC1%P0s0b0u1j0%s%u0N0%I0p1DaK0c1T30H0%kf2E1800A25000%c0.1%f#f 
> -F "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)" 
> -%F "" 
> -%l "en, en, *" 
>
<http://saa.gov.uk/search.php?SEARCHED=1&SEARCH_TABLE=council_tax&SEARCH_TERM=City+of+Edinburgh&DISPLAY_COUNT=100>

> -O1 
> "C:\\Users\\Iain\\Projects\\Council Tax Analysis\\Code\\HTTrack\\Council Tax
Valuation List" 
> -* \
> +*search.php?SEARCHED=1* 
> -*DISPLAY_MODE=FULL*

For clarity, I have put each parameter on a new line.

My winprofile.ini looks like this:

> Near=0
> Test=0
> ParseAll=0
> HTMLFirst=0
> Cache=1
> NoRecatch=0
> Dos=0
> Index=1
> WordIndex=0
> Log=1
> RemoveTimeout=0
> RemoveRateout=0
> KeepAlive=1
> FollowRobotsTxt=0
> NoErrorPages=0
> NoExternalPages=0
> NoPwdInPages=0
> NoQueryStrings=0
> NoPurgeOldFiles=0
> Cookies=0
> CheckType=1
> ParseJava=0
> HTTP10=0
> TolerantRequests=0
> UpdateHack=1
> URLHack=0
> StoreAllInCache=0
> LogType=0
> UseHTTPProxyForFTP=1
> Build=0
> PrimaryScan=1
> Travel=1
> GlobalTravel=0
> RewriteLinks=0
> BuildString=%%h%%p/%%n%%q.%%t
> Category=
> MaxHtml=
> MaxOther=
> MaxAll=
> MaxWait=
> Sockets=1
> Retry=
> MaxTime=1800
> TimeOut=30
> RateOut=
> UserID=Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)
> Footer=(none)
> MaxRate=25000
>
WildCardFilters=-*%0d%0a+*search.php?SEARCHED%3d1*%0d%0a-*DISPLAY_MODE%3dFULL*
> Proxy=
> Port=
> Depth=
> ExtDepth=
> MaxConn=0.1
> MaxLinks=
> MIMEDefsExt1=
> MIMEDefsExt2=
> MIMEDefsExt3=
> MIMEDefsExt4=
> MIMEDefsExt5=
> MIMEDefsExt6=
> MIMEDefsExt7=
> MIMEDefsExt8=
> MIMEDefsMime1=
> MIMEDefsMime2=
> MIMEDefsMime3=
> MIMEDefsMime4=
> MIMEDefsMime5=
> MIMEDefsMime6=
> MIMEDefsMime7=
> MIMEDefsMime8=
>
CurrentUrl=http://saa.gov.uk/search.php?SEARCHED%3d1&SEARCH_TABLE%3dcouncil_tax&SEARCH_TERM%3dCity+of+Edinburgh&DISPLAY_COUNT%3d100%0d%0a
> CurrentAction=5
> CurrentURLList=

How do I tell httrack to add no footer to pages?
 
Reply


All articles

Subject Author Date
Mirror without footer from Command Prompt?

11/24/2012 21:58
Re: Mirror without footer from Command Prompt?

11/25/2012 00:11




0

Created with FORUM 2.0.11