I want to download one single html file with all of its assets (css, js,
images, etc.). For that I use the following params:
httrack <http://www.jobstairs.de/stellenangebot/accenture/14221.html> --http-10
-v -r2 -%e1 -s0 --urlhack --updatehack --language "de, en" -I0 -N1 -O
This html site is rather special since it loads its images from another
webserver via HTTPS (yeah, it doesn't make any sense, but I can't change it).
All links point to images via HTTPS will not be downloaded and replaced. So
the output of:
cat /tmp/mirror/web/14221.html | grep https | wc -l
is 8, but I expected it to be 0.