Trouble downloading single pages with https assets

Subject: Trouble downloading single pages with https assets

Author: Jens Kohl

Date: 06/25/2012 13:17

I want to download a single HTML file with all of its assets (CSS, JS,
images, etc.). For that I use the following parameters:

httrack http://www.jobstairs.de/stellenangebot/accenture/14221.html --http-10 -v -r2 -%e1 -s0 --urlhack --updatehack --language "de, en" -I0 -N1 -O /tmp/mirror

This HTML page is rather special since it loads its images from another
web server via HTTPS (yeah, it doesn't make any sense, but I can't change it).

Links that point to images via HTTPS are neither downloaded nor rewritten
to local paths. So the output of:

grep -c https /tmp/mirror/web/14221.html

is 8, but I expected it to be 0.
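
To see which references are left over rather than only how many, something like the following rough sketch could be used. The function name list_https_refs and the URL pattern are my own illustration, not part of HTTrack, and the pattern is only an approximation of what counts as a URL.

```shell
#!/bin/sh
# Print each distinct https:// URL still referenced in a mirrored file.
# list_https_refs is a hypothetical helper name; the regex is a rough
# approximation that stops at quotes, spaces, angle brackets and ")".
list_https_refs() {
    grep -oE "https://[^\"' <>)]+" "$1" | sort -u
}

# Path taken from the mirror directory used above; skipped if absent.
page=/tmp/mirror/web/14221.html
if [ -f "$page" ]; then
    list_https_refs "$page"
fi
```

Each line of output is one remote HTTPS URL that was left in the page instead of being replaced with a local copy.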
