HTTrack Website Copier
Free software offline browser - FORUM
Subject: copy from HTTPS site not behaving as expected
Author: Simon Kravis
Date: 03/06/2014 11:09
 
I'm trying to copy a site requiring authentication with a URL starting with
https using WinHTTrack 3.47-27 on Windows 8 with a Mozilla browser. To do this
I followed the procedure described at
<http://forum.httrack.com/readmsg/29365/index.html?q=https>, which was to use
the CatchURL process as described in
<http://httrack.kauler.com/help/CatchURL_tutorial>, with the start of the target
URL using http instead of https (hardwired into the Start URL screen), and
with HTTrack cookies.txt containing only the cookies obtained from the target
site.

WinHTTrack runs, but the resulting web site copy contains many copies of the
same HTML page, with embedded links which still point to Web locations rather
than mirror location and ask for authentication before access.

I am using all default settings for scan rules.

Command line from the log file is
(winhttrack -qiC2%Ps2u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F "Mozilla/4.5
(compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by
HTTrack Website Copier/3.x [XR&CO'2013], %s -->" -%l "en, *"
<http://wattlecourses.xxx.yyy.zz/my/?>postfile:E:\Zenbook\MyWebSites\Wattle2\hts-post0>
-O1 E:\Zenbook\MyWebSites\Wattle2 +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar )

post file is

GET /my/ HTTP/1.1
Host: wattlecourses.xxx.yyy.zz
User-Agent: Mozilla/5.0 (Windows NT 6.2; WOW64; rv:27.0) Gecko/20100101
Firefox/27.0
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-US,en;q=0.5
Accept-Encoding: gzip, deflate
Cookie: __utma=46075508.1472004848.1392009163.1393359493.1393480370.3;
__utmz=46075508.1393480370.3.3.utmcsr=linkedin.com|utmccn=(referral)|utmcmd=referral|utmcct=/lite/external-redirect;
MoodleSession=51fgjq7t97pvqg31rhcfarntb7; NSSID=shr-mdlweb-prod-akw1e;
MOODLEID1_=%25021%25F9%259F%25D6%25E9S%2586
Connection: keep-alive

 
Reply


All articles

Subject Author Date
copy from HTTPS site not behaving as expected

03/06/2014 11:09




1

Created with FORUM 2.0.11