HTTrack Website Copier
Free software offline browser - FORUM
Subject: Comics website prevents download of images
Author: Mike
Date: 04/12/2017 11:39
 
Hi Guys,

I am new to this software and tried to use it on two different websites with
the same result:

Website contains comics and each page is on a new HTML. Each image is stored
outside the original domain of the website and somehow they prevent it from
being downloaded:

Example:
<https://www.website.eu/comics/picture/Comic/1> shows the page including the
picture but the picture itself is stored in
<https://cdn.ampproject.org/1/www.website.eu/data/picture1.jpg>.

When I play around with the settings at some point I am able to download
everything but the page that shows the picture. It creates a html of that page
that is not accessible and spotted by my virusscanner as dangerous (The error
in the log is Error:  "Not Found" (404) at link for each of them). The folder
that contains the pictures are not even downloaded at all though no error is
shown in the log nor any reference to the domain. 

My settings:

Action: Download website(s)
Set Options: Kept everything defaulted apart from the following:
Scan Rules:
+*.css +*.js -ad.doubleclick.net/* -mime:application/foobar
+*.gif +*.jpg +*.jpeg +*.png +*.tif +*.bmp
+https://www.website.eu/comics/picture/*
+https://cdn.ampproject.org/1/www.website.com/data/*

Limits->
Adjusted maximum number of links to max. 99999999999
Adjusted maximum transfer rate to 100000

Flow control->
Adjusted number of connections to 2

Spider-> no robot.txt rules (Also tried with but no difference in result)

Browser ID-> adjusted browser identity to "IE"

All other settings are default.

Is there anyone that could help me get these settings right to avoid these
aparant securities the website has build in to prevent it from being
downloaded offline? It is very much appreciated! (By the way the website is
not user / password protected and is freely accessible when browsing)

Kr,
Mike


 
Reply


All articles

Subject Author Date
Comics website prevents download of images 04/12/2017 11:39
Re: Comics website prevents download of images 04/12/2017 12:02
Re: Comics website prevents download of images 04/12/2017 13:36
Re: Comics website prevents download of images 04/12/2017 21:55
Re: Comics website prevents download of images 04/13/2017 20:25




e

Created with FORUM 2.0.11