I'm trying to copy a site that requires authentication and whose URL starts
with https, using WinHTTrack 3.47-27 on Windows 8 with a Mozilla browser. I
followed the procedure described at
<http://forum.httrack.com/readmsg/29365/index.html?q=https>, which is to use
the CatchURL process described in
<http://httrack.kauler.com/help/CatchURL_tutorial>, with the target URL
starting with http instead of https (hardwired into the Start URL screen), and
with HTTrack's cookies.txt containing only the cookies obtained from the target
site.
WinHTTrack runs, but the resulting mirror contains many copies of the same
HTML page, and the embedded links still point to the live web locations rather
than to the mirror, so they ask for authentication before access.
I am using the default settings for all scan rules.
The command line from the log file is:
(winhttrack -qiC2%Ps2u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F "Mozilla/4.5
(compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by
HTTrack Website Copier/3.x [XR&CO'2013], %s -->" -%l "en, *"
<http://wattlecourses.xxx.yyy.zz/my/?>postfile:E:\Zenbook\MyWebSites\Wattle2\hts-post0>
-O1 E:\Zenbook\MyWebSites\Wattle2 +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar )
The post file is:
GET /my/ HTTP/1.1
Host: wattlecourses.xxx.yyy.zz
User-Agent: Mozilla/5.0 (Windows NT 6.2; WOW64; rv:27.0) Gecko/20100101
Firefox/27.0
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-US,en;q=0.5
Accept-Encoding: gzip, deflate
Cookie: __utma=46075508.1472004848.1392009163.1393359493.1393480370.3;
__utmz=46075508.1393480370.3.3.utmcsr=linkedin.com|utmccn=(referral)|utmcmd=referral|utmcct=/lite/external-redirect;
MoodleSession=51fgjq7t97pvqg31rhcfarntb7; NSSID=shr-mdlweb-prod-akw1e;
MOODLEID1_=%25021%25F9%259F%25D6%25E9S%2586
Connection: keep-alive
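
For anyone comparing the captured request with cookies.txt, here is a minimal Python sketch of how a `Cookie:` header like the one above could be split into name/value pairs and written out as Netscape-style cookies.txt lines. It assumes HTTrack reads cookies.txt in the tab-separated Netscape layout (domain, subdomain flag, path, secure flag, expiry, name, value). The function names, the default path `/` and the expiry value `0` are placeholders chosen for illustration, not values taken from HTTrack or from my setup.

```python
from typing import List, Tuple


def parse_cookie_header(header_value: str) -> List[Tuple[str, str]]:
    """Split the value of a Cookie: header into (name, value) pairs."""
    pairs = []
    for part in header_value.split(";"):
        part = part.strip()
        if not part or "=" not in part:
            continue
        # Split on the first "=" only, since values such as __utmz
        # contain further "=" characters.
        name, value = part.split("=", 1)
        pairs.append((name.strip(), value.strip()))
    return pairs


def to_netscape_lines(pairs, domain, path="/", secure=False, expires=0):
    """Build tab-separated Netscape-style cookie lines (assumed layout)."""
    lines = []
    for name, value in pairs:
        fields = [
            domain,
            "TRUE" if domain.startswith(".") else "FALSE",  # subdomain flag
            path,
            "TRUE" if secure else "FALSE",
            str(expires),  # placeholder expiry, an assumption
            name,
            value,
        ]
        lines.append("\t".join(fields))
    return lines


if __name__ == "__main__":
    header = ("MoodleSession=51fgjq7t97pvqg31rhcfarntb7; "
              "NSSID=shr-mdlweb-prod-akw1e")
    for line in to_netscape_lines(parse_cookie_header(header),
                                  "wattlecourses.xxx.yyy.zz"):
        print(line)
```

Comparing the generated lines with the actual cookies.txt would show whether the session cookies (MoodleSession, NSSID, MOODLEID1_) are present and attached to the right host.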